xf86-video-intel

Commit Graph

Author	SHA1	Message	Date
Chris Wilson	1ba983034b	uxa: Emit the damage after the render for the workaround in uxa_solid_rects Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-12-07 12:27:29 +00:00
Chris Wilson	81d355a8dc	uxa: Fix crash after allocation failure Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=31487 Reported-by: Thomas Fjellstrom <tfjellstrom@shaw.ca> Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-11-09 09:30:12 +00:00
Chris Wilson	23ee926bcd	uxa: Skip a pixmap lookup if there is no driver finish access function Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-10-06 12:17:14 +01:00
Chris Wilson	7c7294ec00	shadow+dri2: Allow dri2 to be independently enabled with shadow To enable DRI we create GEM buffers for the client to render into with hardware acceleration. In order to maintain coherency between any 2D render operations with the independent 3D clients (this includes the reading of 2D rasterisation by the direct rendering client, e.g. compiz using texture_from_pixmap) we need to replace the shadow pixmap with the GTT mapping. Therefore 2D rendering to a DRI buffer will be to uncached memory and thus penalised -- but the direct rendering clients will have full hardware acceleration. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-10-04 20:24:36 +01:00
Matthias Hopf	b84925b9c0	Make driver compile for 1.6 Xserver series again. Signed-off-by: Matthias Hopf <mhopf@suse.de>	2010-09-22 17:45:06 +02:00
Chris Wilson	2b96c18165	Enable a shadow buffer and disable GPU acceleration. An attempt to workaround the incoherency in gen2 chipsets, we avoid using dynamic reallocation as much as possible. The first step is to disable allocation of pixmaps using GEM and simply create them in system memory without a backing buffer object. This forces all rendering to use S/W fallbacks. The second step is to allocate a shadow front buffer and assign that to the Screen pixmap. This ensure that the front buffer remains in the GTT and pinned for scanout. The shadow buffer will be rendered to in the normal fashion via the Screen pixmap, and be marked dirty. In the block handler, the dirty shadow buffer is then blitted (using the GPU) over the front buffer. This should completely avoid having to move pages around in the GTT and avoid incurring the wrath of those early chipsets. Secondly, performance should be reasonable as we avoid the ping-pong caused by the small aperture and weak GPU forcing software fallbacks. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-09-08 13:33:37 +01:00
Chris Wilson	68a5ad497b	uxa: Fallback if faced with large A1 glyphs. Rather than assert, we should fixup the use of large A1 glyphs. However, the simplest approach is to simply fallback to s/w. Fixes: Bug 29430 - [UXA] Crash due assert (uxa_pixmap_is_offscreen(src_pixmap)); https://bugs.freedesktop.org/show_bug.cgi?id=29430 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-08-25 15:01:50 +01:00
Chris Wilson	c57840b272	uxa: Check for failed pixmap allocation Fixes: Bug 29187 - crash in intel_drv https://bugs.freedesktop.org/show_bug.cgi?id=29187 Backtrace: 0: /usr/bin/X (xorg_backtrace+0x28) [0x466808] 1: /usr/bin/X (0x400000+0x67c79) [0x467c79] 2: /lib/libpthread.so.0 (0x7ff19b297000+0xef60) [0x7ff19b2a5f60] 3: /usr/lib/xorg/modules/drivers/intel_drv.so (0x7ff197986000+0x34684) => uxa/uxa-render.c:841 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-07-21 09:08:28 +01:00
Keith Packard	e30f0338fb	Destroy screen pixmap on screen close. This avoids a memory leak on server reset. Signed-off-by: Keith Packard <keithp@keithp.com> [ickle: Added comments from Keith that explain the necessity of destroying the pixmap ourselves and why chaining up in this instance is not the correct approach.] Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-07-12 17:40:55 +01:00
Fernando Carrijo	6e08b0f48f	Purge macro NEED_EVENTS Signed-off-by: Fernando Carrijo <fcarrijo@yahoo.com.br> Acked-by: Tiago Vignatti <tiago.vignatti@nokia.com> Reviewed-by: Alan Coopersmith <alan.coopersmith@oracle.com>	2010-07-09 20:49:13 -07:00
Dave Airlie	a2aa4c23f6	uxa: oops typo in previous commit	2010-07-05 14:02:42 +10:00
Dave Airlie	feff2ec80e	uxa: don't compare planemask with FB_ALLONES. planemask is an unsigned long initialised to ~0, on 64-bit this is not equal to an (unsigned int)-1. Use the macro provided to do this. Signed-off-by: Dave Airlie <airlied@redhat.com>	2010-07-05 09:07:08 +10:00
Chris Wilson	b58a6a39c1	uxa: Fallback to pixman if source is out-of-bounds If the source is outside the drawable, then CopyArea will fail to initialise the source correctly. The simplest fix in this case is to fallback to pixman to generate the source texture. Fixes: Bug 28497 - Graphics corruption after opening a specific website https://bugs.freedesktop.org/show_bug.cgi?id=28497 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-23 23:56:44 +01:00
Chris Wilson	e8783869ad	uxa: Apply the source offsets to the pixmap source, not target. A slight confusion in computing the correction image location resulted in the application of the source offsets to the pixel location in the target and not in the source as intended. Fixes the visual corruption of the scrollbar in Chromium, and hopefully the crash reported by Robert Hooker when starting gdm after plymouth. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-20 00:10:43 +01:00
Chris Wilson	4b7142baa0	uxa: Enable SHM pixmaps Now with streaming uploads and downloads for composite operations in place, shared memory pixmaps are no longer that dire performance wise. With careful use these can in fact be the most efficient means of transfer between a wholly software renderer in the client and a backing store. For instance, Chromium renders internally to an ARGB32 image buffer and uses a shared pixmap to composite dirty regions into the backing store. Thereby using the GPU to either perform the blit or the format conversion. Enabling shared pixmaps, reduces our CPU overhead whilst scrolling by a factor of 5 or so. And this is achieved simply by deleting obsolete code! Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-19 13:39:48 +01:00
Chris Wilson	d748f8e6fc	uxa: Use accelerated get_image for copying to !offscreen Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-19 13:39:48 +01:00
Chris Wilson	78ee25f005	uxa: Match depth 30 to format. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-19 13:39:48 +01:00
Chris Wilson	af5c4fc96d	uxa: Check for allocation failure. Check for the NULL Picture prior to passing it to the backends for inspection. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-19 13:39:48 +01:00
Chris Wilson	94217ed5f5	uxa: Always clip glyphs to destination. Even if there is only a single clip rect, since the clip may be smaller than the drawing rectangle on the destination we need to actually compute the clipped glyph rectangle. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-12 18:07:17 +01:00
Chris Wilson	35a12f0290	Fallback implementation for trapezoids for hung GPUs. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-10 23:02:31 +01:00
Chris Wilson	8c1a8d2297	Revert "xp:trapezoids" This reverts commit `f429fb9d87`. An experimental patch I forgot was on my main branch as I was bugfixing. ARGH!	2010-06-09 10:03:29 +01:00
Chris Wilson	994aa1ef57	uxa: Handle all-clipped out case with destination glyphs. Fixes the crash reported in: Bug 28446 - Garbled Font with Mathematica 7 https://bugs.freedesktop.org/show_bug.cgi?id=28446 pDst=0x3d663c0, src_x=0, src_y=0, xDst=142, yDst=112, nlist=0, list=0x7fffea026580, glyphs=0x7fffea025d88, extents=0x0) at uxa-glyphs.c:809 dx = 0 y1 = 101 x2 = 150 x1 = 142 dy = 0 y2 = 112 rects = 0x5491000 this_atlas = 0x2456d00 mask_y = 128 glyph = 0x35933a0 mask_x = 736 priv = 0x39309e0 screen = 0x8d2cc0 uxa_screen = 0x2443eb0 src_pixmap = 0x37c29e0 dst_pixmap = 0x45ddbf0 localSrc = 0x361a450 glyph_atlas = 0x2456d00 x = 142 y = 112 n = 18 nrect = -9975128 box = {x1 = 23152, y1 = -5630, x2 = 32767, y2 = 0} __PRETTY_FUNCTION__ = "uxa_glyphs_to_dst" Though the meat of that bug regarding the incorrect remains unsolved. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-09 09:59:36 +01:00
Chris Wilson	f429fb9d87	xp:trapezoids	2010-06-08 19:52:46 +01:00
Chris Wilson	e6acbc7632	uxa: Setup acceleration functions prior to the damage layer We need to install the acceleration functions so that they are wrapped by the Damage layer. This fixes the corruption under a compositing WM introduced in commit `8700673157`. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk> Reported-and-tested-by: Arkadiusz Miśkiewicz <arekm@maven.pl>	2010-06-07 18:23:17 +01:00
Chris Wilson	d56ea7a852	Use the direct dixGevPrivate() API when available This is quicker and smaller than the old indirect function call to dixLookupPrivate(). Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-07 00:20:35 +01:00
Chris Wilson	8700673157	Adapt glyphs for changes in devPrivates API Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-07 00:17:32 +01:00
Keith Packard	42ddc39430	Adapt to DevPrivate API changes This allows the driver to be built against either the old or new DevPrivate API. Signed-off-by: Keith Packard <keithp@keithp.com>	2010-06-06 16:00:12 -07:00
Eric Anholt	2c1fda08e8	Use libc instead of deprecated libc wrappers for malloc/calloc/free.	2010-06-06 15:56:35 -07:00
Chris Wilson	b586624d4f	uxa: Force fallback for copies. All but uxa_copy_window() perform the preliminary checks for whether acceleration is available. The simplest method for adding the fallback for uxa_copy_window() seems to be to add it in the core copy function, so be it. This allows X to survive a little longer once we encounter a GPU hang. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-06 17:14:10 +01:00
Chris Wilson	a386a003e7	uxa: Spans, try again to get the early break correct. Trigger happy bug fixing. The sign was right, the endpoint was wrong. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-31 22:19:49 +01:00
Chris Wilson	1672ee0421	uxa: Sign reversal on early break from spans passing the YXband Introduced with `e5c971e763`. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-31 22:08:43 +01:00
Chris Wilson	cd38b705be	Disable acceleration if we detect a hardware error. This is wildly optimistic, but it should work in a surprising number of error situations and some output in those cases will be hopefully be better than none... If we submit a batchbuffer and the kernel reports the GPU is hung (which will be caused by an earlier execbuffer, and so the kernel should have had enough time to determine whether or not it could reset the GPU) then disable any further attempt to accelerate gfx and force fallbacks to map the buffers and use the CPU. We cannot normally map any more buffers if the GPU is hung, so only those already mapped prior to the hang can be written to, or those allocated in system memory. However, we can expect that the framebuffer is already mapped, and so have a reasonable expectation to continue to see the display update. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-31 18:00:11 +01:00
Chris Wilson	5fff430046	uxa: Mega-Glyphs! Rewrite glyph rendering to avoid the intermediate buffer, accumulating the glyph rectangles directly in the backend composite routines. And modify the glyph cache routines to fully utilise the allocated size of the tiled buffer on older hardware. To do this we alias all glyph sizes into the same texture using a technique suggested by Keith Packard. PineView: 885/856-> 1150/1110 kglyph/s (aa/rgb) Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-31 14:03:42 +01:00
Eric Anholt	a94ae175d6	uxa: Fix prepare_solid being called without check_solid first. Fixes GPU hang on gen6.	2010-05-28 12:40:46 -07:00
Chris Wilson	66c90158e4	uxa: Skip the redundant miComputeCompositeRects() when adding to the mask As we are in full control of the destination (the temporary glyph mask) and the source (the glyph cache) we know that there are no clip regions on either and so can skip computing the composite rectangles. (We trust the device clipping to prevent compositing outside the target.) x11perf on PineView: 701/686 -> 881/856 kglyphs/s [aa/rgb] Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-28 17:13:30 +01:00
Chris Wilson	5b2254838e	uxa: Make the glyph caches' fixed size explicit. Until we actual resize the glyph cache dynamically, make it obvious to the reader and the compiler that the size is fixed. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-28 12:47:26 +01:00
Chris Wilson	11581dda99	uxa: Use a glyph private rather than a hash table. Store the cache position directly on the glyph using a devPrivate rather than an through auxiliary hash table. x11perf on PineView: 650/638 kglyphs/s -> 701/686 kglyphs/s [aa/rgb] Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-28 12:44:34 +01:00
Chris Wilson	03bbb4c896	uxa: Perform manual damage for CompositeRects [xserver-1.8] The damage layer doesn't wrap CompositeRects, so we need to manually append the damaged region ourselves. This works for miCompsiteRects since that translates the call into multiple invocations of either PolyFillRectangle or Composite, which themselves cause damage. Fixes: Bug 28120 - Tint2's tooltip borders end up at 0,0 and do not disappear https://bugs.freedesktop.org/show_bug.cgi?id=28120 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-26 10:21:03 +01:00
Chris Wilson	b9ada52a30	uxa: Force the alpha value to 0xffff when treating Over as Src Since we have at most 8 bits of alpha, we treat >= 0xff00 as opaque. However, being paranoid we should set the alpha value to 0xfff in case something unexpected happens when converting from the xRenderColor to the pixel value. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-26 10:21:03 +01:00
Chris Wilson	3055d40164	uxa: Use Composite rather than solid blitter for PolyRect Due to the relocation overhead, using a single composite with many rectangles outperforms many solid blits. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-26 10:21:03 +01:00
Chris Wilson	ec2437f958	uxa: Add PICT format mapping for depth 4 pixmaps. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-26 10:21:03 +01:00
Chris Wilson	b645ec83e0	uxa: Apply the drawable offset to the solid rects Fixes: Bug 28120 - Tint2's tooltip borders end up at 0,0 and do not disappear https://bugs.freedesktop.org/show_bug.cgi?id=28120 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-25 09:49:20 +01:00
Chris Wilson	80a9e64f50	uxa: Use temporary dest when target is too large for compositor If the destination cannot fit into the 3D pipeline when we need to composite, we fallback to doing the operation on the CPU. This is very slow, and quite easy to trigger on i915 by plugging in an external display. An alternative is to extract the extents of the operation from the destination using the blitter which can usually handle much larger operations. This gives us a temporary target that can fit into the 3D pipeline and thus be accelerated, before copying back into the larger real destination. For x11perf this boosts glyph rendering on PineView, from 38kglyphs/s to 480kglyphs/s. Just a little shy of the native performance of 601kglyphs/s Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-24 18:31:16 +01:00
Chris Wilson	91f560034f	uxa: Composite glyphs directly onto dst when possible. Without using a mask and compositing directly onto the destination, takes us from 580 kglyphs/s to 850 kglyphs/s on i945 [x11perf -aa10text]. However, the extra intersection check almost entirely cancels out the speed up and we discover that the glyphs in x11perf are always overlapping. Nothing is ever easy. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-24 18:31:15 +01:00
Chris Wilson	c2abf8d659	uxa: translate the region in line for composites When compositing, we need to convert the box into a rect and so the advantages of using REGION_TRANSLATE are lost. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-24 09:40:28 +01:00
Chris Wilson	e5c971e763	uxa: Spans! OMG! Use composite rather than solid blits in order to bring performance on a par with the CPU when using GEM and relocations. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-23 18:43:29 +01:00
Chris Wilson	2c00297bc3	uxa: Replace solid planemask [0xffffffff] with FB_ALLONES Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-16 20:19:22 +01:00
Chris Wilson	21b5fd427f	uxa: Tidy uxa_solid_rects() Move the operator reduction after a few fallbacks, closer to its use. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-16 13:52:35 +01:00
Chris Wilson	61835701fd	uxa: Patterns are acquired at 0,0 Set the correct offset for the gradients patterns after rendering to a local Picture. Fixes cairo/test/huge-radial and friends Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-16 13:51:35 +01:00
Chris Wilson	89f43f69a9	uxa: Force an alpha channel when rendering source fallbacks As the source may not cover the extents, we need to represent those areas as transparent in the fallback picture, ergo we need an alpha channel. We could be smarter and force a format conversion when necessary, and we could let the backend choose the most appropriate format. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-15 18:34:54 +01:00

1 2 3

142 Commits