xf86-video-intel

Commit Graph

Author	SHA1	Message	Date
Chris Wilson	4c66b28870	uxa: Fallback if the temporary is too large If the render operation requires a temporary source Picture and the operation is large, larger than the maximum permitted bo, then we will fail to allocate the bo. In this case, we need to fallback and perform the operation on the CPU rather than dereference a NULL bo. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=34399 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2011-02-17 16:52:58 +00:00
Bryce Harrington	6e721e098b	Check return value of uxa_acquire_solid() since it can return NULL uxa_acquire_solid returns NULL under OOM. Thus the value of solid must be checked before dereferencing it in the uxa_get_offscreen() call. Signed-off-by: Bryce Harrington <bryce@canonical.com> Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2011-02-04 08:57:11 +00:00
Chris Wilson	da990536ec	uxa: Undo damage translation before appending The region is used to paint onto the backing pixmap (and thus translated) prior to being passed to the damage layer (wrt the drawable). So the local translation needs to be undone first. Identified by Christopher James Halse Rogers. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=33650 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2011-02-03 09:41:48 +00:00
Chris Wilson	1ba983034b	uxa: Emit the damage after the render for the workaround in uxa_solid_rects Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-12-07 12:27:29 +00:00
Chris Wilson	81d355a8dc	uxa: Fix crash after allocation failure Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=31487 Reported-by: Thomas Fjellstrom <tfjellstrom@shaw.ca> Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-11-09 09:30:12 +00:00
Matthias Hopf	b84925b9c0	Make driver compile for 1.6 Xserver series again. Signed-off-by: Matthias Hopf <mhopf@suse.de>	2010-09-22 17:45:06 +02:00
Chris Wilson	c57840b272	uxa: Check for failed pixmap allocation Fixes: Bug 29187 - crash in intel_drv https://bugs.freedesktop.org/show_bug.cgi?id=29187 Backtrace: 0: /usr/bin/X (xorg_backtrace+0x28) [0x466808] 1: /usr/bin/X (0x400000+0x67c79) [0x467c79] 2: /lib/libpthread.so.0 (0x7ff19b297000+0xef60) [0x7ff19b2a5f60] 3: /usr/lib/xorg/modules/drivers/intel_drv.so (0x7ff197986000+0x34684) => uxa/uxa-render.c:841 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-07-21 09:08:28 +01:00
Chris Wilson	b58a6a39c1	uxa: Fallback to pixman if source is out-of-bounds If the source is outside the drawable, then CopyArea will fail to initialise the source correctly. The simplest fix in this case is to fallback to pixman to generate the source texture. Fixes: Bug 28497 - Graphics corruption after opening a specific website https://bugs.freedesktop.org/show_bug.cgi?id=28497 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-23 23:56:44 +01:00
Chris Wilson	af5c4fc96d	uxa: Check for allocation failure. Check for the NULL Picture prior to passing it to the backends for inspection. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-19 13:39:48 +01:00
Chris Wilson	35a12f0290	Fallback implementation for trapezoids for hung GPUs. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-10 23:02:31 +01:00
Chris Wilson	8c1a8d2297	Revert "xp:trapezoids" This reverts commit `f429fb9d87`. An experimental patch I forgot was on my main branch as I was bugfixing. ARGH!	2010-06-09 10:03:29 +01:00
Chris Wilson	f429fb9d87	xp:trapezoids	2010-06-08 19:52:46 +01:00
Eric Anholt	2c1fda08e8	Use libc instead of deprecated libc wrappers for malloc/calloc/free.	2010-06-06 15:56:35 -07:00
Chris Wilson	cd38b705be	Disable acceleration if we detect a hardware error. This is wildly optimistic, but it should work in a surprising number of error situations and some output in those cases will hopefully be better than none... If we submit a batchbuffer and the kernel reports the GPU is hung (which will be caused by an earlier execbuffer, and so the kernel should have had enough time to determine whether or not it could reset the GPU) then disable any further attempt to accelerate gfx and force fallbacks to map the buffers and use the CPU. We cannot normally map any more buffers if the GPU is hung, so only those already mapped prior to the hang can be written to, or those allocated in system memory. However, we can expect that the framebuffer is already mapped, and so have a reasonable expectation to continue to see the display update. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-31 18:00:11 +01:00
Chris Wilson	03bbb4c896	uxa: Perform manual damage for CompositeRects [xserver-1.8] The damage layer doesn't wrap CompositeRects, so we need to manually append the damaged region ourselves. This works for miCompositeRects since that translates the call into multiple invocations of either PolyFillRectangle or Composite, which themselves cause damage. Fixes: Bug 28120 - Tint2's tooltip borders end up at 0,0 and do not disappear https://bugs.freedesktop.org/show_bug.cgi?id=28120 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-26 10:21:03 +01:00
Chris Wilson	b9ada52a30	uxa: Force the alpha value to 0xffff when treating Over as Src Since we have at most 8 bits of alpha, we treat >= 0xff00 as opaque. However, being paranoid we should set the alpha value to 0xffff in case something unexpected happens when converting from the xRenderColor to the pixel value. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-26 10:21:03 +01:00
Chris Wilson	b645ec83e0	uxa: Apply the drawable offset to the solid rects Fixes: Bug 28120 - Tint2's tooltip borders end up at 0,0 and do not disappear https://bugs.freedesktop.org/show_bug.cgi?id=28120 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-25 09:49:20 +01:00
Chris Wilson	80a9e64f50	uxa: Use temporary dest when target is too large for compositor If the destination cannot fit into the 3D pipeline when we need to composite, we fallback to doing the operation on the CPU. This is very slow, and quite easy to trigger on i915 by plugging in an external display. An alternative is to extract the extents of the operation from the destination using the blitter which can usually handle much larger operations. This gives us a temporary target that can fit into the 3D pipeline and thus be accelerated, before copying back into the larger real destination. For x11perf this boosts glyph rendering on PineView, from 38kglyphs/s to 480kglyphs/s. Just a little shy of the native performance of 601kglyphs/s Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-24 18:31:16 +01:00
Chris Wilson	91f560034f	uxa: Composite glyphs directly onto dst when possible. Without using a mask and compositing directly onto the destination, takes us from 580 kglyphs/s to 850 kglyphs/s on i945 [x11perf -aa10text]. However, the extra intersection check almost entirely cancels out the speed up and we discover that the glyphs in x11perf are always overlapping. Nothing is ever easy. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-24 18:31:15 +01:00
Chris Wilson	c2abf8d659	uxa: translate the region in line for composites When compositing, we need to convert the box into a rect and so the advantages of using REGION_TRANSLATE are lost. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-24 09:40:28 +01:00
Chris Wilson	e5c971e763	uxa: Spans! OMG! Use composite rather than solid blits in order to bring performance on a par with the CPU when using GEM and relocations. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-23 18:43:29 +01:00
Chris Wilson	2c00297bc3	uxa: Replace solid planemask [0xffffffff] with FB_ALLONES Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-16 20:19:22 +01:00
Chris Wilson	21b5fd427f	uxa: Tidy uxa_solid_rects() Move the operator reduction after a few fallbacks, closer to its use. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-16 13:52:35 +01:00
Chris Wilson	61835701fd	uxa: Patterns are acquired at 0,0 Set the correct offset for the gradients patterns after rendering to a local Picture. Fixes cairo/test/huge-radial and friends Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-16 13:51:35 +01:00
Chris Wilson	89f43f69a9	uxa: Force an alpha channel when rendering source fallbacks As the source may not cover the extents, we need to represent those areas as transparent in the fallback picture, ergo we need an alpha channel. We could be smarter and force a format conversion when necessary, and we could let the backend choose the most appropriate format. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-15 18:34:54 +01:00
Chris Wilson	524fd2dd0d	uxa: Apply clip for solid rectangles. References: Bug 28120 - Tint2's tooltip borders end up at 0,0 and do not disappear https://bugs.freedesktop.org/show_bug.cgi?id=28120 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-15 18:28:05 +01:00
Chris Wilson	58b089febc	uxa: Avoid using blits when with PictFilterConvolution References: Bug 28098 Compiz renders shadows wrong, garbage line of pixels along left and top edge of windows https://bugs.freedesktop.org/show_bug.cgi?id=28098 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-15 09:11:46 +01:00
Chris Wilson	ef95899f5b	uxa: Check the w-scaling component is 1 for an translation matrix Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-15 09:02:07 +01:00
Chris Wilson	95654cffa8	uxa: Fix order of conditionals to only run fill_region for SRC or opaque Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-15 00:50:42 +01:00
Chris Wilson	f67b45965b	uxa: Expand the range of compatible formats to cover all bpp. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-15 00:50:20 +01:00
Chris Wilson	82d07fdf10	uxa: Only use 1x1R as a solid with an opaque format or SRC Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-15 00:49:39 +01:00
Chris Wilson	737de9a779	uxa: Disable compatible src xrgb and dst argb I'm seeing garbage alpha for rendercheck blend: x8r8g8b8a 10x10 SRC ar8g8b8a so disable blitting until I work out if we can fast-path it. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-14 23:56:26 +01:00
Chris Wilson	a7c318d21c	uxa: Parse BGRA pixel formats. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-14 23:32:44 +01:00
Chris Wilson	f7bbcc492a	Split the prepare blitter functions into check + prepare. Allow us to check whether we can handle the operation using the blitter prior to doing any work. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-14 23:31:57 +01:00
Chris Wilson	b9a5e36f95	uxa: enable solid rects for backends that require pixmaps Convert the color into a (cached) pixmap if the backend cannot handle the SolidFill natively. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-14 21:16:50 +01:00
Chris Wilson	8de09a0707	uxa: Convert 1x1R back to solid_fill In the change to prevent blitting between incompatible sources, we also prevented 1x1R pixmaps from being used for solid fills. Reorder the sequence of conditions to enable this fast path again.	2010-05-13 17:17:54 +01:00
Chris Wilson	92e9cf8af7	uxa: Only use solid_fill for SRC.	2010-05-13 17:17:54 +01:00
Chris Wilson	d1bd14e8b6	uxa: Replace source for CLEAR with a transparent solid This means that we will hit the faster try_solid_fill path instead.	2010-05-13 17:17:54 +01:00
Chris Wilson	cdab72c405	uxa: Fallback early if compositing with alphaMaps	2010-05-13 17:17:54 +01:00
Chris Wilson	d5383c2073	uxa: Avoid ping-pong with !offscreen destination and traps If we are destined to target an !offscreen drawable, then uploading the trapezoid mask to a bo is the last thing we actually want to do... Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-12 12:50:31 +01:00
Chris Wilson	00664b8f9d	uxa: Fallback when compositing to a !offscreen destination Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-12 12:50:31 +01:00
Chris Wilson	244b7cbfff	uxa: Use accelerated PutImage for uploading pixman images. Short-circuits the current use of PutImage from CopyArea, bypassing all the temporary allocations.	2010-05-12 12:50:31 +01:00
Chris Wilson	cb887cfc67	uxa: solid rects The cost of performing relocations outweigh the advantages of using the blitter for solids with lots of rectangles. References: Bug 22127 - [UXA] 50% performance regression for XRenderFillRectangles https://bugs.freedesktop.org/show_bug.cgi?id=22127 By using the 3D pipeline we improve our performance by around 4x on i945, measured by the jxbench microbenchmark, and a factor of 10x by short-cutting to the 3D pipeline for blended rectangles. Before, on a i945GME: 19982.412060 Ops/s; rects (!); 15x15 9599.131693 Ops/s; rects (!); 75x75 3803.654743 Ops/s; rects (!); 250x250 6836.743772 Ops/s; rects blended; 15x15 1443.750000 Ops/s; rects blended; 75x75 495.335821 Ops/s; rects blended; 250x250 23247.933884 Ops/s; rects composition (!); 15x15 10993.073048 Ops/s; rects composition (!); 75x75 3595.905172 Ops/s; rects composition (!); 250x250 After: 87271.145975 Ops/s; rects (!); 15x15 32347.744361 Ops/s; rects (!); 75x75 5884.177215 Ops/s; rects (!); 250x250 73500.000000 Ops/s; rects blended; 15x15 33580.882353 Ops/s; rects blended; 75x75 5858.811749 Ops/s; rects blended; 250x250 25582.317073 Ops/s; rects composition (!); 15x15 6664.728682 Ops/s; rects composition (!); 75x75 14965.909091 Ops/s; rects composition (!); 250x250 [suspicious] This has no impact on Cairo, but I have a suspicion from watching xtrace that Qt likes to blit thousands of 1x1 rectangles with the same colour. However, we are still around 2-3x slower than the reported figures for EXA! Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-12 12:50:31 +01:00
Chris Wilson	c8e10f7791	debug: Add names for operators Most useful for confirming my worst fears: unwarranted use of OutReverse + Add. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-12 12:48:21 +01:00
Chris Wilson	a35afd4a2d	uxa: Recheck texture after acquiring pattern. As the first step to handling unsupported texture formats, double check that the converted pattern can be used as a texture by the card. Fixes: rendercheck -t repeat Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-11 13:07:03 +01:00
Chris Wilson	1ecd89be03	uxa: Protect against valid SourcePict in uxa_acquire_mask() Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-10 23:33:52 +01:00
Chris Wilson	0d4dd00aea	uxa,i915: Handle SourcePict through uxa_composite() Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-10 12:29:26 +01:00
Chris Wilson	f52b6e8322	uxa: Rearrange checking and preparing of composite textures. x11perf regression caused by 2D driver https://bugs.freedesktop.org/show_bug.cgi?id=28047 caused by commit `a7b800513f` uxa: Extract sub-region from in-memory buffers. The issue is that as we extract the region prior to checking whether the composite can in fact be accelerated, we perform expensive surplus operations. This is particularly noticeable for ComponentAlpha text, such as rgb10text. The solution here is to rearrange the check_composite() prior to acquiring the sources, and only extracting the subregion if the render path can not actually handle the texture. Performance (on PineView): a7b800513^: aa=68600 glyphs/s, rgb=29900 glyphs/s a7b800513: aa=65700 glyphs/s, rgb=13200 glyphs/s now: aa=66800 glyphs/s, rgb=28800 glyphs/s The residual lossage seems to be from the extra function call and dixPrivate lookups. Hmm. More worrying is the extremely low performance, however the results are consistent so the improvement looks real... Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-10 10:36:14 +01:00
Chris Wilson	848ab66384	uxa: Transform composites with a simple translation into a blit We can also convert a composite with an integer translation into a blit, so long as the sample extents remains within the source. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-08 19:35:28 +01:00
Chris Wilson	a7b800513f	uxa: Extract sub-region from in-memory buffers. If the buffer is too large or not suitable for a GPU operation, we currently fallback and perform the composite on the CPU. An alternative is to extract the small region out of the source (as usually the sample extents are much smaller than the actual surface size) and try the composite with the new surface. The effect is particularly noticeable on pathological websites that use very large background images. For example, http://www.woodtv.com/ uses a 1299x15000 pattern that is obscured by another opaque pattern. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-08 19:35:07 +01:00


85 Commits