xf86-video-intel/src/sna
Chris Wilson bf4be323e3 sna: Fix declaration of write intent for can_map__cpu in PutImage
We always are going to write to the image, despite the flag set in

commit fa961ec99a
Author: Chris Wilson <chris@chris-wilson.co.uk>
Date:   Sun Jul 21 18:00:22 2013 +0100

    sna: Allow linear inplace uploads along the tiled X PutImage blt paths

which was accidentally conditional on the image not being too large.

Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
2013-09-12 13:58:32 +01:00
..
brw intel: Suppress some extremely noisy warnings 2013-07-29 23:49:08 +01:00
fb Micro-optimise box intersections 2013-08-08 09:28:20 +01:00
Makefile.am Revert "sna: Add XMir support" 2013-09-07 08:42:26 +01:00
README
atomic.h sna: Begin sketching out a threaded rasteriser for spans 2013-01-27 13:06:46 +00:00
blt.c sna: Don't force inline string-ops for the general memcpy_blt routine 2013-08-01 11:18:18 +01:00
compiler.h sna: Rename the attribute macro __packed__ to avoid clang barfing 2013-08-05 16:20:35 +01:00
gen2_render.c sna: Skip copying to the intermediate target if we will completely overwrite it 2013-07-13 00:01:08 +01:00
gen2_render.h
gen3_render.c sna: Skip copying to the intermediate target if we will completely overwrite it 2013-07-13 00:01:08 +01:00
gen3_render.h
gen4_render.c sna/gen4: Always try the BLT composite routines first 2013-09-11 10:20:28 +01:00
gen4_render.h
gen4_source.c sna/gen4+: Handle solids passed to the general texcoord emitter 2013-01-06 15:20:14 +00:00
gen4_source.h sna/gen4+: Specialise linear vertex emission 2013-01-02 11:52:58 +00:00
gen4_vertex.c sna/gen4+: Rebalance creation of vbo during batch flushing 2013-09-11 10:20:30 +01:00
gen4_vertex.h sna/gen4+: Begin specialising vertex programs for ISA 2013-02-26 00:03:28 +00:00
gen5_render.c intel: Remove some unused macros 2013-07-28 14:49:07 +01:00
gen5_render.h Revert "sna/gen4+: Backport tight vertex packing for simple renderblits" 2013-01-06 13:30:37 +00:00
gen6_render.c sna/gen6: Prefer the RCS on large GT systems 2013-09-08 22:34:09 +01:00
gen6_render.h
gen7_render.c sna/gen7: Prefer the BLT for gt1 systems 2013-09-08 22:34:03 +01:00
gen7_render.h sna/gen7: Fix MOCS for Haswell 2013-03-27 16:58:41 +00:00
kgem.c sna: Disable use of Y-tiling for gen4 2013-09-10 12:29:24 +01:00
kgem.h sna: Disable use of Y-tiling for gen4 2013-09-10 12:29:24 +01:00
kgem_debug.c
kgem_debug.h
kgem_debug_gen2.c
kgem_debug_gen3.c
kgem_debug_gen4.c
kgem_debug_gen5.c
kgem_debug_gen6.c sna: Begin sketching out a threaded rasteriser for spans 2013-01-27 13:06:46 +00:00
kgem_debug_gen7.c
rop.h
sna.h sna: Disable use of Y-tiling for gen4 2013-09-10 12:29:24 +01:00
sna_accel.c sna: Fix declaration of write intent for can_map__cpu in PutImage 2013-09-12 13:58:32 +01:00
sna_acpi.c sna: Search /sys/class/power_state for the AC adapter 2013-09-06 23:55:56 +01:00
sna_blt.c sna/gen6+: Only apply the BLT copy w/a for self-copies 2013-09-09 15:42:06 +01:00
sna_composite.c sna: Fix the assertions for the composite fb path 2013-08-13 11:02:05 +01:00
sna_cpu.c sna: Wrap cpuid.h 2013-07-17 21:02:02 +01:00
sna_cpuid.h sna: Wrap cpuid.h 2013-07-17 21:02:02 +01:00
sna_damage.c sna: Add some more DBG 2013-09-03 17:02:12 +01:00
sna_damage.h sna: Remember to offset the box before asserting damage 2013-09-09 11:19:19 +01:00
sna_display.c sna/hsw: Scanline waits require both DERRMR and forcewake 2013-09-09 16:23:04 +01:00
sna_display_fake.c sna: Include some DBG for the virtual output detection 2013-09-06 11:11:47 +01:00
sna_dri.c Revert "sna: Add XMir support" 2013-09-07 08:42:26 +01:00
sna_driver.c Revert "sna: Add XMir support" 2013-09-07 08:42:26 +01:00
sna_glyphs.c sna: Disable use of Y-tiling for gen4 2013-09-10 12:29:24 +01:00
sna_gradient.c sna: Markup when a gradient is opaque 2013-06-28 10:14:16 +01:00
sna_io.c sna: Try the indirect upload if we elect to use a mmap and it fails 2013-07-26 15:53:01 +01:00
sna_module.h
sna_reg.h
sna_render.c sna: Rename sna_pixmap_get_bo() 2013-08-14 18:12:15 +01:00
sna_render.h sna: Rename the attribute macro __packed__ to avoid clang barfing 2013-08-05 16:20:35 +01:00
sna_render_inline.h sna: Treat a source with a CPU bo as being attached. 2013-07-19 14:04:38 +01:00
sna_stream.c
sna_threads.c sna: Tweak threading thresholds 2013-08-04 13:10:26 +01:00
sna_tiling.c sna: Avoid integer overflow when computing the tiled extents 2013-07-12 23:29:10 +01:00
sna_transform.c
sna_trapezoids.c sna: Rename sna_pixmap_get_bo() 2013-08-14 18:12:15 +01:00
sna_vertex.c sna: Begin sketching out a threaded rasteriser for spans 2013-01-27 13:06:46 +00:00
sna_video.c sna: Add a few more checks for a hosted Xserver before walking CRTC lists 2013-08-25 11:00:32 +01:00
sna_video.h sna: Add a few more checks for a hosted Xserver before walking CRTC lists 2013-08-25 11:00:32 +01:00
sna_video_hwmc.c Suppress a bunch of compiler warnings for using constant strings 2013-07-30 13:26:14 +01:00
sna_video_hwmc.h sna/video: Convert to a pure Xv backend 2013-05-21 11:14:52 +01:00
sna_video_overlay.c sna: Add a few more checks for a hosted Xserver before walking CRTC lists 2013-08-25 11:00:32 +01:00
sna_video_sprite.c sna/video: Copy the ClipRegion for the sprite 2013-08-27 19:06:52 +01:00
sna_video_textured.c sna: Add a few more checks for a hosted Xserver before walking CRTC lists 2013-08-25 11:00:32 +01:00

README

SandyBridge's New Acceleration
------------------------------

The guiding principle behind the design is to avoid GPU context switches.
On SandyBridge (and beyond), these are especially pernicious because the
RENDER and BLT engine are now on different rings and require
synchronisation of the various execution units when switching contexts.
They were not cheap on early generation, but with the increasing
complexity of the GPU, avoiding such serialisations is important.

Furthermore, we try very hard to avoid migrating between the CPU and GPU.
Every pixmap (apart from temporary "scratch" surfaces which we intend to
use on the GPU) is created in system memory. All operations are then done
upon this shadow copy until we are forced to move it onto the GPU. Such
migration can only be first triggered by: setting the pixmap as the
scanout (we obviously need a GPU buffer here), using the pixmap as a DRI
buffer (the client expects to perform hardware acceleration and we do not
want to disappoint) and lastly using the pixmap as a RENDER target. This
last is chosen because when we know we are going to perform hardware
acceleration and will continue to do so without fallbacks, using the GPU
is much, much faster than the CPU. The heuristic I chose therefore was
that if the application uses RENDER, i.e. cairo, then it will only be
using those paths and not intermixing core drawing operations and so
unlikely to trigger a fallback.

The complicating case is front-buffer rendering. So in order to accommodate
using RENDER on an application whilst running xterm without a composite
manager redirecting all the pixmaps to backing surfaces, we have to
perform damage tracking to avoid excess migration of portions of the
buffer.