xf86-video-intel/src/sna
Chris Wilson a3d37fb29f sna: Force remapping for IO transfer
Should fix regression from fcccc5528 (sna: Improve handling of inplace
IO for large transfers) whereby it was aborting the transfer it we need
to remap the buffer for the upload.

Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=49546
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2012-05-08 15:47:14 +01:00
..
Makefile.am sna: Add video sprite support for ILK+ 2012-03-28 22:53:17 +01:00
README
blt.c sna: Allow ridiculously large bo, up to half the total GATT 2012-01-29 14:47:12 +00:00
compiler.h sna: Encourage large operations to be migrated to the GPU 2011-12-24 21:31:06 +00:00
gen2_render.c sna: Fix off-by-one in checking available execbuffer slots 2012-05-08 14:30:46 +01:00
gen2_render.h sna/gen2: Use specular component for solid spans 2011-07-01 21:41:23 +01:00
gen3_render.c sna: Fix off-by-one in checking available execbuffer slots 2012-05-08 14:30:46 +01:00
gen3_render.h
gen4_render.c sna: Fix off-by-one in checking available execbuffer slots 2012-05-08 14:30:46 +01:00
gen4_render.h
gen5_render.c sna: Fix off-by-one in checking available execbuffer slots 2012-05-08 14:30:46 +01:00
gen5_render.h sna/gen5: Avoid bitfields for simple assignments 2011-09-12 19:25:08 +01:00
gen6_render.c sna: Fix off-by-one in checking available execbuffer slots 2012-05-08 14:30:46 +01:00
gen6_render.h sna: Move the flush to the backends 2012-01-20 00:02:05 +00:00
gen7_render.c sna: Fix off-by-one in checking available execbuffer slots 2012-05-08 14:30:46 +01:00
gen7_render.h sna: Move the flush to the backends 2012-01-20 00:02:05 +00:00
kgem.c sna: Only submit a batch to the scanout if it is not already busy 2012-05-08 14:30:46 +01:00
kgem.h sna: Force remapping for IO transfer 2012-05-08 15:47:14 +01:00
kgem_debug.c sna: Begin debugging gen7 2011-11-11 00:15:44 +00:00
kgem_debug.h sna: Begin debugging gen7 2011-11-11 00:15:44 +00:00
kgem_debug_gen2.c sna/gen2: Improve batch decoder. 2011-09-04 12:46:32 +01:00
kgem_debug_gen3.c sna: Explicitly retire the bo following a serialisation point 2012-01-11 12:10:18 +00:00
kgem_debug_gen4.c sna: Explicitly retire the bo following a serialisation point 2012-01-11 12:10:18 +00:00
kgem_debug_gen5.c sna: Allow ridiculously large bo, up to half the total GATT 2012-01-29 14:47:12 +00:00
kgem_debug_gen6.c sna/gen6: Reduce PictOpClear to PictOpSrc (with blending disabled) 2012-02-04 20:07:45 +00:00
kgem_debug_gen7.c sna/gen7: Fix debug printing of primitives 2012-05-03 14:48:26 +01:00
rop.h sna: Reduce and clarify dependencies 2011-11-16 22:15:39 +00:00
sna.h sna: Throttle independently of scanout updates 2012-05-08 14:30:46 +01:00
sna_accel.c sna: Force remapping for IO transfer 2012-05-08 15:47:14 +01:00
sna_blt.c sna: Micro-optimise common case of checking a single fenced bo 2012-04-28 14:19:47 +01:00
sna_composite.c sna: Adjust the damage region for the composite offset 2012-03-22 11:23:02 +00:00
sna_damage.c sna: Avoid reducing damage for synchronisation 2012-05-03 14:48:26 +01:00
sna_damage.h sna: Fix offset for combining damage 2012-05-03 22:37:09 +01:00
sna_display.c sna: Replace timerfd with OsTimer 2012-05-06 21:55:04 +01:00
sna_dri.c sna/dri: Only track a single pending flip across all pipes 2012-05-06 12:51:23 +01:00
sna_driver.c sna: Update select timeout when installing a timer in the block handler 2012-05-07 21:22:28 +01:00
sna_glyphs.c sna: Minor glyph fallback fixes 2012-05-03 14:48:27 +01:00
sna_gradient.c sna/gradient: Compute the absolute delta between color stops 2012-04-07 10:02:23 +01:00
sna_io.c sna: Force remapping for IO transfer 2012-05-08 15:47:14 +01:00
sna_module.h sna: Add zaphod support 2011-06-07 16:54:57 +01:00
sna_reg.h sna: Support performing alpha-fixup on the source 2012-01-06 17:50:01 +00:00
sna_render.c sna: Add a pair of asserts to track down a NULL pointer dereference 2012-05-06 12:51:23 +01:00
sna_render.h sna/gen3: Convert the clear-color from picture->format to a8r8g8b8 2012-04-03 12:36:22 +01:00
sna_render_inline.h sna: Don't consider upload proxies as being on the GPU for render targets 2012-04-19 09:12:18 +01:00
sna_stream.c sna: Pass usage hint for creating linear buffers 2012-03-02 14:34:23 +00:00
sna_tiling.c sna: Maintain a reference to the chain of proxies 2012-05-03 22:40:52 +01:00
sna_transform.c
sna_trapezoids.c sna/traps: Use a temporary variable for the write pointer 2012-04-09 10:48:08 +01:00
sna_video.c sna: Add video sprite support for ILK+ 2012-03-28 22:53:17 +01:00
sna_video.h sna: Add video sprite support for ILK+ 2012-03-28 22:53:17 +01:00
sna_video_hwmc.c sna: Disable use of xvmc for SNB+ 2012-02-06 09:19:56 +00:00
sna_video_hwmc.h
sna_video_overlay.c sna/video: Constify a couple of attribute arrays 2011-11-13 13:13:03 +00:00
sna_video_sprite.c sna: Avoid leaking the plane resources when determining sprite planes 2012-04-14 20:45:26 +01:00
sna_video_textured.c sna/video: Only wait upon the scanout pixmap 2012-04-06 21:10:50 +01:00

README

SandyBridge's New Acceleration
------------------------------

The guiding principle behind the design is to avoid GPU context switches.
On SandyBridge (and beyond), these are especially pernicious because the
RENDER and BLT engine are now on different rings and require
synchronisation of the various execution units when switching contexts.
They were not cheap on early generation, but with the increasing
complexity of the GPU, avoiding such serialisations is important.

Furthermore, we try very hard to avoid migrating between the CPU and GPU.
Every pixmap (apart from temporary "scratch" surfaces which we intend to
use on the GPU) is created in system memory. All operations are then done
upon this shadow copy until we are forced to move it onto the GPU. Such
migration can only be first triggered by: setting the pixmap as the
scanout (we obviously need a GPU buffer here), using the pixmap as a DRI
buffer (the client expects to perform hardware acceleration and we do not
want to disappoint) and lastly using the pixmap as a RENDER target. This
last is chosen because when we know we are going to perform hardware
acceleration and will continue to do so without fallbacks, using the GPU
is much, much faster than the CPU. The heuristic I chose therefore was
that if the application uses RENDER, i.e. cairo, then it will only be
using those paths and not intermixing core drawing operations and so
unlikely to trigger a fallback.

The complicating case is front-buffer rendering. So in order to accommodate
using RENDER on an application whilst running xterm without a composite
manager redirecting all the pixmaps to backing surfaces, we have to
perform damage tracking to avoid excess migration of portions of the
buffer.