xf86-video-intel

Commit Graph

Author	SHA1	Message	Date
Chris Wilson	2c00297bc3	uxa: Replace solid planemask [0xffffffff] with FB_ALLONES Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-16 20:19:22 +01:00
Chris Wilson	21b5fd427f	uxa: Tidy uxa_solid_rects() Move the operator reduction after a few fallbacks, closer to its use. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-16 13:52:35 +01:00
Chris Wilson	61835701fd	uxa: Patterns are acquired at 0,0 Set the correct offset for the gradients patterns after rendering to a local Picture. Fixes cairo/test/huge-radial and friends Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-16 13:51:35 +01:00
Chris Wilson	89f43f69a9	uxa: Force an alpha channel when rendering source fallbacks As the source may not cover the extents, we need to represent those areas as transparent in the fallback picture, ergo we need an alpha channel. We could be smarter and force a format conversion when necessary, and we could let the backend choose the most appropriate format. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-15 18:34:54 +01:00
Chris Wilson	524fd2dd0d	uxa: Apply clip for solid rectangles. References: Bug 28120 - Tint2's tooltip borders end up at 0,0 and do not disappear https://bugs.freedesktop.org/show_bug.cgi?id=28120 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-15 18:28:05 +01:00
Chris Wilson	58b089febc	uxa: Avoid using blits when with PictFilterConvolution References: Bug 28098 Compiz renders shadows wrong, garbage line of pixels along left and top edge of windows https://bugs.freedesktop.org/show_bug.cgi?id=28098 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-15 09:11:46 +01:00
Chris Wilson	ef95899f5b	uxa: Check the w-scaling component is 1 for an translation matrix Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-15 09:02:07 +01:00
Chris Wilson	95654cffa8	uxa: Fix order of conditionals to only run fill_region for SRC or opaque Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-15 00:50:42 +01:00
Chris Wilson	f67b45965b	uxa: Expand the range of compatible formats to cover all bpp. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-15 00:50:20 +01:00
Chris Wilson	82d07fdf10	uxa: Only use 1x1R as a solid with an opaque format or SRC Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-15 00:49:39 +01:00
Chris Wilson	3bca186a7e	uxa: Call check_solid before running the solid blitter. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-15 00:48:31 +01:00
Chris Wilson	737de9a779	uxa: Disable compatible src xrgb and dst argb I'm seeing garbage alpha for rendercheck blend: x8r8g8b8a 10x10 SRC ar8g8b8a so disable blitting until I work out if we can fast-path it. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-14 23:56:26 +01:00
Chris Wilson	a7c318d21c	uxa: Parse BGRA pixel formats. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-14 23:32:44 +01:00
Chris Wilson	f7bbcc492a	Split the prepare blitter functions into check + prepare. Allow us to check whether we can handle the operation using the blitter prior to doing any work. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-14 23:31:57 +01:00
Chris Wilson	b9a5e36f95	uxa: enable solid rects for backends that require pixmaps Convert the color into a (cached) pixmap if the backend cannot handle the SolidFill natively. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-14 21:16:50 +01:00
Chris Wilson	8de09a0707	uxa: Convert 1x1R back to solid_fill In the change to prevent blitting between incompatible sources, we also prevented 1x1R pixmaps from being used for solid fills. Reorder the sequence of conditions to enable this fast path again.	2010-05-13 17:17:54 +01:00
Chris Wilson	92e9cf8af7	uxa: Only use solid_fill for SRC.	2010-05-13 17:17:54 +01:00
Chris Wilson	d1bd14e8b6	uxa: Replace source for CLEAR with a transparent solid This means that we will hit the faster try_solid_fill path instead.	2010-05-13 17:17:54 +01:00
Chris Wilson	cdab72c405	uxa: Fallback early if compositing with alphaMaps	2010-05-13 17:17:54 +01:00
Chris Wilson	6c27f6e4f7	uxa: Avoid glyph ping-pong with !offscreen destination Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-12 12:50:31 +01:00
Chris Wilson	d5383c2073	uxa: Avoid ping-pong with !offscreen destination and traps If we are destined to target an !offscreen drawable, then uploading the trapezoid mask to a bo is the last thing we actually want to do... Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-12 12:50:31 +01:00
Chris Wilson	00664b8f9d	uxa: Fallback when compositing to a !offscreen destination Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-12 12:50:31 +01:00
Chris Wilson	244b7cbfff	uxa: Use accelerated PutImage for uploading pixman images. Short-circuits the current use of PutImage from CopyArea, bypassing all the temporary allocations.	2010-05-12 12:50:31 +01:00
Chris Wilson	cb887cfc67	uxa: solid rects The cost of performing relocations outweigh the advantages of using the blitter for solids with lots of rectangles. References: Bug 22127 - [UXA] 50% performance regression for XRenderFillRectangles https://bugs.freedesktop.org/show_bug.cgi?id=22127 By using the 3D pipeline we improve our performance by around 4x on i945, measured by the jxbench microbenchmark, and a factor of 10x by short-cutting to the 3D pipeline for blended rectangles. Before, on a i945GME: 19982.412060 Ops/s; rects (!); 15x15 9599.131693 Ops/s; rects (!); 75x75 3803.654743 Ops/s; rects (!); 250x250 6836.743772 Ops/s; rects blended; 15x15 1443.750000 Ops/s; rects blended; 75x75 495.335821 Ops/s; rects blended; 250x250 23247.933884 Ops/s; rects composition (!); 15x15 10993.073048 Ops/s; rects composition (!); 75x75 3595.905172 Ops/s; rects composition (!); 250x250 After: 87271.145975 Ops/s; rects (!); 15x15 32347.744361 Ops/s; rects (!); 75x75 5884.177215 Ops/s; rects (!); 250x250 73500.000000 Ops/s; rects blended; 15x15 33580.882353 Ops/s; rects blended; 75x75 5858.811749 Ops/s; rects blended; 250x250 25582.317073 Ops/s; rects composition (!); 15x15 6664.728682 Ops/s; rects composition (!); 75x75 14965.909091 Ops/s; rects composition (!); 250x250 [suspicious] This has no impact on Cairo, but I have a suspicion from watching xtrace that Qt likes to blit thousands of 1x1 rectangles with the same colour. However, we are still around 2-3x slower than the reported figures for EXA! Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-12 12:50:31 +01:00
Chris Wilson	c8e10f7791	debug: Add names for operators Most useful for confirming my worst fears: unwarranted use of OutReverse + Add. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-12 12:48:21 +01:00
Chris Wilson	a35afd4a2d	uxa: Recheck texture after acquiring pattern. As the first step to handling unsupported texture formats, double check that the converted pattern can be used as a texture by the card. Fixes: rendercheck -t repeat Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-11 13:07:03 +01:00
Chris Wilson	1ecd89be03	uxa: Protect against valid SourcePict in uxa_acquire_mask() Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-10 23:33:52 +01:00
Chris Wilson	0d4dd00aea	uxa,i915: Handle SourcePict through uxa_composite() Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-10 12:29:26 +01:00
Chris Wilson	f52b6e8322	uxa: Rearrange checking and preparing of composite textures. x11perf regression caused by 2D driver https://bugs.freedesktop.org/show_bug.cgi?id=28047 caused by commit `a7b800513f` uxa: Extract sub-region from in-memory buffers. The issue is that as we extract the region prior to checking whether the composite can in fact be accelerated, we perform expensive surplus operations. This is particularly noticeable for ComponentAlpha text, such as rgb10text. The solution here is to rearrange the check_composite() prior to acquiring the sources, and only extracting the subregion if the render path can not actually handle the texture. Performance (on PineView): a7b800513^: aa=68600 glyphs/s, rgb=29900 glyphs/s a7b800513: aa=65700 glyphs/s, rgb=13200 glyphs/s now: aa=66800 glyph/s, rgb=28800 glyphs/s The residual lossage seems to be from the extra function call and dixPrivate lookups. Hmm. More warning is the extremely low performance, however the results are consistent so the improvement looks real... Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-10 10:36:14 +01:00
Chris Wilson	848ab66384	uxa: Transform composites with a simple translation into a blit We can also convert a composite with an integer translation into a blit, so long as the sample extents remains within the source. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-08 19:35:28 +01:00
Chris Wilson	a7b800513f	uxa: Extract sub-region from in-memory buffers. If the buffer is too large or not suitable for a GPU operation, we currently fallback and perform the composite on the CPU. An alternative is too extract the small region out of the source (as usually the sample extents are much smaller than the actual surface size) and try the composite with the new surface. The effect is particularly noticeable on pathological websites that use very large background images. For example, http://www.woodtv.com/ uses a 1299x15000 pattern that is obscured by another opaque pattern. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-08 19:35:07 +01:00
Chris Wilson	2d17bd50af	Revert "Revert "uxa: Try using put_image when copying from a memory buffer."" This reverts commit `6d50553e8f`. Now we have taught the fallback path not to infinitely recurse, re-enable the accelerated path for ShmPutImage and friends.	2010-04-14 17:10:09 +01:00
Eric Anholt	6d50553e8f	Revert "uxa: Try using put_image when copying from a memory buffer." This reverts commit `27195d7dba`. put_image often calls copy_area. Which calls put_image. Exhausting of the stack follows.	2010-04-12 13:46:24 -07:00
Chris Wilson	28024f6c5f	Revert "uxa: Add fallback warnings for PutImage." This reverts commit `299b0338d0`. A debugging patch, it was never intended to go into master	2010-04-12 13:44:01 +01:00
Chris Wilson	27195d7dba	uxa: Try using put_image when copying from a memory buffer. Often, for example in the fallback for ShmPutImage, we will attempt to use uxa_copy_area() copying to a normal pixmap from a memory buffer. This triggers a fallback, and maps the destination pixmap back into the GTT. The accelerated put_image path will attempt to stream a blit to the destination pixmap if it is currently active, avoiding the stall.	2010-04-10 18:50:26 +01:00
Chris Wilson	299b0338d0	uxa: Add fallback warnings for PutImage.	2010-04-10 18:08:07 +01:00
Gaetan Nadon	362a49e71f	uxa make: remove unused XORG_INCS and DIX_CFLAGS variables Most likely copied from xserver makefile. Acked-by: Dan Nicholson <dbn.lists@gmail.com> Signed-off-by: Gaetan Nadon <memsize@videotron.ca>	2010-03-25 15:49:14 -04:00
Chris Wilson	5537079c29	uxa: After filling the alpha channel xrgb src is compatible with argb dst. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-03-24 16:42:05 +00:00
Chris Wilson	90a971c607	uxa: Only reduce a composite to a BLT if it is wholly contained Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-03-24 16:42:05 +00:00
Chris Wilson	d6b7f96fde	Fill alpha on xrgb images. Do not try to fixup the alpha in the ff/shaders as this has the side-effect of overriding the alpha value of the border color, causing images to be padded with black rather than transparent. This can generate large and obnoxious visual artefacts. Fixes: Bug 17933 - x8r8g8b8 doesn't sample alpha=0 outside surface bounds http://bugs.freedesktop.org/show_bug.cgi?id=17933 and many related cairo test suite failures. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-03-16 10:53:29 +00:00
Carl Worth	1cc35a8ba4	uxa: Fix type mismatch to avoid compiler warning. The code was using uint32_t where an XID (currently an unsigned long) was specified in the prototype. Use XID to avoid both the warning and any potential problem.	2010-03-04 09:46:33 -08:00
Eric Anholt	6bdab84176	uxa: Skip adjusting mask coordinates when no mask is present. Quiets clang warnings about garbage variable usage.	2010-02-20 12:55:13 -05:00
Eric Anholt	ec5deb2bcb	Remove dead assignments noticed by clang.	2010-02-20 12:55:13 -05:00
Chris Wilson	918151a795	uxa: Fix compatible_formats() for OVER In separating the boolean logic out into a separate function, `dc6522dd`, I reversed the sense of one particular test: src->format == dst->format The OVER optimisation is only valid if the src and dst formats match, but not always. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-01-27 20:35:45 +00:00
Chris Wilson	197cb08a2d	Extract pixel value for all formats to avoid hitting fallbacks. On failing to extract the pixel value for an alpha-only solid we actually triggered a fallback. Since this path is commonly hitting whilst fading in images, for example cairo_paint_with_alpha(), the fallback was detected during the Moblin boot sequence where it was adding a second to the overall boot time. See fallback intel: Moblin startup is hitting a composite fallback, costing a ton of performance https://bugs.freedesktop.org/show_bug.cgi?id=26189 Based on the initial patch by Arjan van de Van. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-01-25 15:49:38 +00:00
Chris Wilson	5f93d019dc	uxa: Adjust uxa_get_color_for_pixmap to match prototype The prototype says this function returns a Bool and not just an int, so be pedantic and return TRUE/FALSE. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-01-25 15:48:48 +00:00
Chris Wilson	dc6522dd49	uxa: Protect against a potential NULL src->Drawable reference One of the convoluted if branches dereferenced Drawable when it is potentially NULL. Avoid this by explicitly handling the NULL Drawable cases earlier, and enabling solid fills for solid sources. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-01-24 09:37:23 +00:00
Chris Wilson	31bbd7f919	uxa/uxa-render: Always remove useless repeats during composite. I added a jump if there was no src or mask Drawable, but we do actually need to check for useless src repeats even if we have a source-only mask. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-01-24 09:04:16 +00:00
Chris Wilson	326fe00df4	uxa: Increase amount of composite fallback verbage The fallback log for http://bugs.freedesktop.org/show_bug.cgi?id=26189 does not actually state the reason why we actually fallback. This is possibly because we need to fallback for reasons other than the operation cannot be performed in hardware -- such as using an alpha map or the screen is swapped out, so add this information to the fallback log. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-01-24 09:02:05 +00:00
Chris Wilson	83626aba35	uxa-glyphs: Enable TILING_X on glyph caches. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-01-08 19:21:31 +00:00
Chris Wilson	37f631d669	Revert "uxa-glyphs: Enable TILING_X on glyph caches." This reverts commit `3f11bbec42`. For unknown reasons, enabling tiling for the glyph cache is causing glyph corruption both across suspend and resume and VT switching, on a wide range of chipsets (reports include both i8xx and gm45) This strongly suggests that we are handling tiling, or updates to tiled buffers, incorrectly across i915_gem_idle(). However, until we can find the root cause, we want to fix this regression before the next stable release, so simply revert this patch. :( Fixes: [Bug 25406] fonts garbled after resuming from suspend since `6729b508` http://bugs.freedesktop.org/show_bug.cgi?id=25406 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2009-12-10 08:54:46 +00:00
Chris Wilson	c1afc831c8	uxa: Cache solid fills. Maintain a small cache of pixmaps to hold SolidFill pictures. Currently we create a pixmap the size of the damaged region and fill that using pixman before downloading it to the GPU and compositing. Needless to say this is extremely expensive compared to simply emitting the solid colour. To mitigate this cost, we maintain a small cache of 1x1R pictures which is recognised by the driver as being a solid, but at the very least is maintained as a GPU ready pixmap. This gives a good boost to cairo-xcb (which uses solid fills) on a gm45: Before: gnome-terminal-vim: 41.9s After: gnome-terminal-vim: 31.7s Compare with using a cache of 1x1R pixmaps in cairo-xcb: gnome-terminal-vim: 31.6s Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2009-12-07 21:37:31 +00:00
Chris Wilson	ad68881b67	uxa_check_composite: Minor whitespace. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2009-12-02 14:14:39 +00:00
Chris Wilson	0ff4d42a42	uxa: Review uxa_prepare_access() to remove potential nesting Around a call to uxa_put_image() it is possible to mix both accelerated and fallback paths, with the fallback code making the presumed optimisation of only trying to call uxa_prepare_access() once. This fails if the accelerated path also uses prepare/finish access on the same drawable and then later fallback to the fallback path. This can happen currently if an error is reported whilst attempting to accelerate PutImage. #0 memcpy () at ../sysdeps/x86_64/memcpy.S:162 #1 0x00007ffff43ce4bd in fbBlt (srcLine=<value optimized out>, srcStride=40, srcX=<value optimized out>, dstLine=0xffffffffffffffff, dstStride=64, dstX=0, width=<value optimized out>, height=8, alu=3, pm=4294967295, bpp=8, reverse=0, upsidedown=0) at fbblt.c:93 #2 0x00007ffff43ce740 in fbBltStip (src=0xffffffffffffffff, srcStride=156555204, srcX=34, dst=0xfffffffc, dstStride=64, dstX=40, width=304, height=8, alu=3, pm=4294967295, bpp=8) at fbblt.c:944 #3 0x00007ffff4c32c53 in uxa_do_put_image (pDrawable=0x246aa410, pGC=0x2c0a4f0, depth=8, x=0, y=0, w=38, h=8, leftPad=0, format=2, bits=0x954d7c4 "") at uxa-accel.c:196 #4 uxa_do_shm_put_image (pDrawable=0x246aa410, pGC=0x2c0a4f0, depth=8, x=0, y=0, w=38, h=8, leftPad=0, format=2, bits=0x954d7c4 "") at uxa-accel.c:223 #5 uxa_put_image (pDrawable=0x246aa410, pGC=0x2c0a4f0, depth=8, x=0, y=0, w=38, h=8, leftPad=0, format=2, bits=0x954d7c4 "") at uxa-accel.c:289 #6 0x00000000004d574f in damagePutImage (pDrawable=0x246aa410, pGC=0x2c0a4f0, depth=8, x=0, y=0, w=38, h=8, leftPad=0, format=2, pImage=0x954d7c4 "") at damage.c:905 #7 0x00000000004287db in ProcPutImage (client=0x47ca72d0) at dispatch.c:2073 #8 0x000000000042bd94 in Dispatch () at dispatch.c:445 #9 0x000000000042513a in main (argc=4, argv=0x7fffffffe2a8, envp=<value optimized out>) at main.c:285 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2009-12-02 12:23:58 +00:00
Chris Wilson	3f11bbec42	uxa-glyphs: Enable TILING_X on glyph caches. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2009-11-30 00:58:05 +00:00
Chris Wilson	2c3aee2b57	uxa-glyphs: Stream uploads via temporary bo Avoid mapping the glyph cache back to the cpu by allocating temporary buffer objects to store the glyph pixmap and blit to the cache. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2009-11-29 20:55:33 +00:00
Chris Wilson	c180baf43b	i915: Derive the correct target color from the pixmap by checking its format Particularly noting to route alpha to the green channel when blending with a8 destinations. Fixes: rendercheck/repeat/triangles regressed http://bugs.freedesktop.org/show_bug.cgi?id=25047 introduced with commit 14109a. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2009-11-13 20:20:52 +00:00
Chris Wilson	e9064eacb0	uxa: Do not remove repeat from solids for 1x1 composites. Or else we hit the buggy 1x1 source path and trigger: rendercheck/mcoords regressed http://bugs.freedesktop.org/show_bug.cgi?id=25046 caused by the recent commit `e581ceb`.	2009-11-13 19:05:22 +00:00
Chris Wilson	998d6b3d8c	uxa: Force alpha bits to fill remaining bits In the case of x8r8g8b8 and similar where the alpha channel is ignored, but should be interpreted as being 1, then it is convenient if those bits are set appropriately in the colour. In order to do so for these formats, where PIXMAN_FORMAT_A() returns 0 we need to compute the alpha channel width as the remaining bits instead. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2009-11-10 15:18:56 +00:00
Rémi Cardona	3c0a43b24c	configure: use CWARNFLAGS from xorg-macros.m4 Signed-off-by: Rémi Cardona <remi@gentoo.org> Acked-by: Chris Wilson <chris@chris-wilson.co.uk>	2009-11-05 16:58:35 +01:00
Albert Damen	10946118dd	Fix crash in uxa_acquire_pattern when pDst is NULL This avoids a crash when an XRenderComposite call is made with a -1 value for width/height, (which apparently compiz's gtk-window- decorator likes to do). Fixes bug: X crashes in uxa_acquire_pattern when logging in (gdm) http://bugs.freedesktop.org/show_bug.cgi?id=24724 Signed-off-by: Albert Damen <albrt@gmx.net> Reviewed-by: Carl Worth <cworth@cworth.org>	2009-10-26 11:36:52 -07:00
Chris Wilson	b37ac9d317	uxa: Refactor create Picture for pixman format Pull the common methods for creating a Picture given a pixman format into its own method, and tidy the surrounding code. The benefit is that we can now composite directly to the Picture and so save an intermediate copy when creating patterns for gradients. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2009-10-12 14:36:26 +01:00
Chris Wilson	7e8f32d0a7	uxa: Free the ScratchPixmapHeader after its associated Picture Fixes: http://bugs.freedesktop.org/show_bug.cgi?id=24459 Intel Driver > 2.8: Cairo rendering bug, triggered in QtCurve GTK engine Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2009-10-12 14:36:20 +01:00
Eric Anholt	8ae0e44e42	Move to kernel coding style. We've talked about doing this since the start of the project, putting it off until "some convenient time". Just after removing a third of the driver seems like a convenient time, when backporting's probably not happening much anyway.	2009-10-06 17:10:31 -07:00
Chris Wilson	00e8de212b	Check the correct Picture for error during creation.	2009-09-22 01:27:25 +01:00
Chris Wilson	57fc09cef2	Avoid fallbacks for a1 src/mask Carl Worth did the hard work in identifying that the regression in cairo between X.org 1.6 and 1.7 was caused by cairo sending an a1 mask to the server in 1.7 whereas in 1.6 cairo used local fallbacks (as the source was using RepeatPad, which triggers cairo's 'buggy_pad_reflect' fallback for X.org 1.6). This was causing the driver to do a fallback to handle the a1 mask instead, which due to the GPU pipeline stall is much more expensive than the equivalent fallback in cairo. Reference: cairo's performance downgrades 4X with server master than server-1.6. https://bugs.freedesktop.org/show_bug.cgi?id=23184 The fix is a relatively simple extension of the current uxa_picture_from_pixman_image() to use CompositePicture() instead of CopyArea() when we need to convert to a new format. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2009-09-20 01:15:41 +01:00
Chris Wilson	c2abfa8e54	Avoid fallbacks for compositing gradient patterns Currently when asked to composite using a gradient source or mask, we fallback to using fbComposite(). This has the side-effect of causing a readback on the destination surface, stalling the GPU pipeline. Instead, like uxa_trapezoids(), we can use pixman to fill a scratch pixmap and then copy that to an offscreen pixmap for use with uxa_composite(). Speedups on i915: firefox-talos-svg: 710378.14 -> 549262.96: 1.29x speedup No slowdowns. Thanks to Søeren Sandmann Pedersen for spotting the missing ValidatePicture(). Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2009-09-14 16:26:57 +01:00
Keith Packard	6361c3b9af	Fix SHM functions to work with server after 1.6.0 Signed-off-by: Keith Packard <keithp@keithp.com>	2009-08-25 19:33:25 -07:00
Eric Anholt	22f7cbc32b	uxa: Tell the driver when we're just going to immediately map the pixmap. This lets the driver allocate a nice idle buffer object instead of a busy one, reducing runtime of firefox-20090601 on my G45 from 50.7 (+/- .41%) to 48.4 (+/- 1.1%).	2009-07-22 09:16:00 -07:00
Eric Anholt	5ef3db45e0	uxa: Skip fill of temporary alpha picture that just gets copied over. This was needed when we were doing the mask computations in this pixmap, but now they're done in a temporary and then uploaded later. This reduces runtime of firefox-20090601 from 52.6 (+/- .96%) to 50.7 (+/- .41%) seconds on my G45.	2009-07-22 09:15:59 -07:00
Peter Hutterer	0a4c4c5fe8	Update to xextproto 7.1 support. DPMS header was split into dpms.h (client) and dpmsconst.h (server). Drivers need to include dpmsconst.h if xextproto 7.1 is available. SHM is now shm.h instead of shmstr. Requires definition of ShmFuncs that's not exported by the server. Signed-off-by: Peter Hutterer <peter.hutterer@who-t.net>	2009-07-18 12:10:18 +10:00
Owain Ainsworth	57c7cbade9	accessing a pixmap if prepare_access fails is verboten. Don't do it, treat this the same as every other prepare access call in uxa. Reviewed-by: Keith Packard <keithp@keithp.com> Signed-off-by: Owain Ainsworth <zerooa@googlemail.com>	2009-07-17 11:28:29 -07:00
Eric Anholt	1e4784bf26	uxa: Fix segfault on source-only picture usage with FallbackDebug. Bug #22107.	2009-06-30 19:54:23 -07:00
Carl Worth	accdbd2367	UXA: Rasterize trapezoids to system memory, not a pixmap Since we're only doing software rasterization right now, anyway, it makes more sense to just rasterize to system memory and then upload to a pixmap once complete. This avoids expensive read-modify-write cycles. This results in a 2.4x speedup for a real-world test case that's heavy on trapezoids, which is swfdec running on the following file: http://michalevy.com/wp-content/uploads/Giant%20Steps%202007.swf Many thanks to Chris Wilson for his cairo-traces repository and cairo-perf-trace tool which makes it so easy to measure things like this.	2009-06-09 19:06:12 -07:00
Eric Anholt	47591334a1	Remove pre-server-1.5 support.	2009-04-27 16:50:34 -07:00
Alan Coopersmith	b8ca146b06	Fix UXA to build with Sun compilers (use __func__ instead of __FUNCTION__) Signed-off-by: Alan Coopersmith <alan.coopersmith@sun.com>	2009-04-24 16:04:13 -07:00
Keith Packard	fe08b81d0f	Use CopyArea to load glyphs from per-glyph pixmap to cache pixmap With glyphs sitting in per-glyph pixmaps, there's no reason to use the CPU to move them to the cache pixmap, and lots of reasons to use the accelerator. Signed-off-by: Keith Packard <keithp@keithp.com>	2009-03-13 15:03:38 -07:00
Eric Anholt	22dc9a5580	Fix UXA for server 1.4.	2009-02-26 14:20:42 -08:00
Eric Anholt	cb1f7ec087	uxa: Fix composite fallback debug printing of main memory versus bo info. It was just printing whether it was a pixmap (it is), instead of whether the pixmap was offscreen.	2009-02-26 14:20:42 -08:00
Eric Anholt	3012d85cc5	uxa: Fix breakage from UXA_FALLBACK conversion from "do {} while (0)" construct. Thanks to keithp for post-commit review.	2009-02-10 18:47:28 -08:00
Eric Anholt	5009127de7	uxa: Fix driver against fbDoCopy -> miDoCopy change in the server.	2009-02-10 18:23:35 -08:00
Eric Anholt	b53977f4c5	uxa: Fix failure to --amend in further changes in previous commit.	2009-02-10 18:23:16 -08:00
Eric Anholt	5212ec6515	uxa: hook up the fallback debug to the driver's fallback debug option.	2009-02-10 15:35:20 -08:00
Bernhard Rosenkraenzer	c80f1a9c51	UXA: Declare glyph cache picture as component-alpha when necessary. Without this, rendering component-alpha glyphs may break without a mask. Bug #19534. Ported from fix by Michel Dänzer <daenzer@vmware.com> in xserver commit 639f289dcdbe00a516820f573c01a8339e120ed4	2009-01-13 10:37:41 -08:00
Keith Packard	632f816c72	uxa: handle uxa_prepare_access failure uxa_prepare_access may fail to map the pixmap into user space. Recover from this without crashing. Signed-off-by: Keith Packard <keithp@keithp.com>	2009-01-06 09:31:39 -08:00
Eric Anholt	ecdd706873	uxa: Correctly prepare/finishaccess of stipple in ValidateGC (and only it) This avoids prepare/finish_access_gc overhead when we're not changing things (since GCTile is already handled) and get us the RW flag for the prepare on of the stipple pixmap so thing will be synced correctly.	2008-12-16 10:21:59 -08:00
Eric Anholt	261c20a479	uxa: Add in EnableDisableFBAccess handling like examodule.c did. This fixes assertion failures when rendering text while VT switched.	2008-12-05 12:13:26 -08:00
Dave Airlie	293f6232c6	uxa: don't call composite routines with no buffer. We can get a case with gnome-terminal + links, where we get two arrays of glyphs all with 0 width and 0 heights in them. If this happens we manage to get to this case without any buffer setup and segfault. (cherry picked from commit 717c7492a0f6ba3fb3eabda33515881eef314155)	2008-12-03 16:55:31 -08:00
Jesse Barnes	f082e877d5	Work around gcc uninitialized variable warnings GCC isn't smart enough to analyze the control flow and figure out that these are false positives, but initializing them shouldn't hurt, so work around it.	2008-09-30 12:06:46 -07:00
Eamon Walsh	808b72f814	Change uxa private keys to integer variables. Prepares for a devPrivates system that will store an index.	2008-08-26 22:34:05 -04:00
Keith Packard	fc3e287e6b	[uxa] Remove unused pixmap size limits. All size-related rendering limits should be managed by the driver in the pixmap_is_offscreen call. There's no need for uxa to even know these values.	2008-08-05 22:50:01 -07:00
Keith Packard	68f0872db6	[uxa] Check xalloc returns and deal with failure Failing xalloc in a rendering function means just dropping the drawing on the floor (that's what we've always done).	2008-08-05 22:36:03 -07:00
Keith Packard	b2d058d80c	Rename uxa using _ instead of caps	2008-08-05 15:41:52 -07:00
Keith Packard	fc4d9c55a7	Change PrepareAccess to take access mode rather than index	2008-08-05 15:41:51 -07:00
Keith Packard	4cc20b7f6e	Don't call sync on prepare_access -- just let the driver deal with it. Let the driver do whatever sync is necessary from the prepare_access hook rather than forcing a full sync.	2008-08-05 15:29:50 -07:00
Keith Packard	59774e9aca	Add UXA - the unified memory acceleration architecture. This eliminates the cost of EXA migration management while providing full pixmap allocation control to the driver. The goal is to make something useful for UMA drivers.	2008-08-05 15:29:50 -07:00

1 2 3

146 Commits