fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-22 13:30:12 +01:00

Author	SHA1	Message	Date
Jordan Justen	a11dfc11cf	iris: Use mi_builder for load/store reg/mem/imm functions Ref: `06cf838cbd` ("intel/mi_builder: Support gen11 command-streamer based register offsets") Ref: `6ffdcc335e` ("iris: Use mi_builder in iris_load_indirect_location()") Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14340>	2022-01-18 23:11:38 +00:00
Jordan Justen	e29ed39d63	iris: Use mi_builder to set 3DPRIM registers for draws Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14340>	2022-01-18 23:11:38 +00:00
Lionel Landwerlin	2e3490dd0f	iris: utrace/perfetto support v2: Fixup gpu_id computation, use minor of /dev/dri/* % 128 since we don't know whether we get card0 or renderD128 for instance. (Lionel) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> (v1) Acked-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13996>	2022-01-14 20:17:44 +00:00
Nanley Chery	f3c629733f	anv,iris: PSS Stall Sync around color fast clears Needed for XeHP (see Bspec 47704). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14024>	2022-01-12 01:30:34 +00:00
Francisco Jerez	074bde9989	intel/xehp: Switch to coarser cross-slice pixel hashing with table permutation. The coarser 32x32 cross-slice hashing mode seems to lead to better L1 and L2 utilization due to the improved execution locality, however it can also lead to a bottleneck in a single slice, especially in workloads that concentrate heavy rendering in small areas of the screen (e.g. SynMark2 OglGeomPoint, OglTerrain*) -- This effect is mitigated here by performing a permutation of the pixel pipe hashing tables that ensures that adjacent rows map to pixel pipes as far away as possible in the caching hierarchy. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13569>	2022-01-10 18:28:35 -08:00
Francisco Jerez	d149c5e6e0	iris: Program pixel hashing tables on XeHP. Unlike the Gen11 code, this requires us to allocate a pipe_resource for the pixel pipe hashing tables and hold a reference to it from the context, since we need to add it to the validation list of every batch, the tables may be accessed by the hardware at any time after they're specified via 3DSTATE_SLICE_TABLE_STATE_POINTERS. Note that this has an effect even for unfused native die platforms, since the pixel pipe hashing tables we intend to program aren't equivalent to the hardware's defaults on such configs. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13569>	2022-01-10 18:28:35 -08:00
Francisco Jerez	283d5bff4e	intel: Rename intel_compute_pixel_hash_table() to intel_compute_pixel_hash_table_3way(). For consistency with intel_compute_pixel_hash_table_nway(). Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13569>	2022-01-10 18:28:35 -08:00
Francisco Jerez	68cb551b1d	intel: Move pixel hashing table computation into common header file. In order to avoid some duplication between the GL and Vulkan driver, which will get worse as we introduce additional code in order to handle more recent generations. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13569>	2022-01-10 18:28:35 -08:00
Francisco Jerez	3d3c571db3	iris: Merge gfx11_ and gfx12_upload_pixel_hashing_tables() into the same function. Will save some boilerplate as we introduce another variant of this function. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13569>	2022-01-10 18:28:12 -08:00
Francisco Jerez	ffa2ca8a77	intel/xehp: Update 3DSTATE_PS maximum number of threads per PSD. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13569>	2022-01-10 18:27:41 -08:00
Francisco Jerez	8e21cad39b	intel/xehp: Implement XeHP workaround Wa_14014148106. Actually, no, there's no need to do anything, just update some comments for the record. An earlier revision of this change that implemented the workaround text to the letter required no less than 8 new PIPE_CONTROLs throughout the tree. However Felix Degrood noticed that the cost of some of the PIPE_CONTROLs was showing up in workloads like Shadow of the Tomb Raider. The Windows driver wasn't emitting many of those pipe controls, contrary to the W/A instructions, so we engaged in a back and forth with the hardware team, who concluded that the original suggested workaround was unnecessarily strict, and the Windows driver's behavior acceptable. It turns out that Wa_1408224581 we had already implemented for TGL is roughly equivalent to the Windows behavior, so no need to do anything new after all. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14278>	2022-01-11 00:17:32 +00:00
Francisco Jerez	eeb3f4594d	intel/xehp: Implement XeHP workaround Wa_14013910100. XeHP platforms require the invalidation of the instruction cache after a STATE_BASE_ADDRESS change due to a hardware bug potentially leading to instruction cache pollution. Note that the workaround text says it's applicable "DG2 128/256/512-A/B", however it's also marked as permanent and not confirmed to be fixed in any specific stepping, so we apply it to all Gfx12HP platforms. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14278>	2022-01-11 00:17:32 +00:00
Francisco Jerez	e48c29acca	intel/dev: Add support for pixel pipe subslice accounting on multi-slice GPUs. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14436>	2022-01-07 07:58:27 +00:00
Rafael Antognolli	e9b509755b	intel: Emit 3DSTATE_BINDING_TABLE_POOL_ALLOC for XeHP On XeHP+, Binding Table Pointers are no longer an offset relative to the Surface State Base Address. Instead, they are relative to the State Binding Table Pool Address, which is set by the command above. We emit that command (pointing to the same address as the Surface State Base Address), and everything should stay working as before. Reworks: * Jordan: Add iris * Jordan: Drop i965 * Ken: Set MOCS to avoid a major perf impact. (Found by Felix DeGrood.) * Jordan: Shrink size from 2MiB to actual iris, anv usage * Lionel: Add BINDING_TABLE_POOL_BLOCK_SIZE Ref: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4995 Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> [jordan.l.justen@intel.com: Add Iris, adjust sizes] Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13992>	2021-12-20 17:58:13 +00:00
Sagar Ghuge	cd38b6e2e8	anv, iris: Implement Wa_14014890652 for DG2 Workaround is to set: 3DSTATE_VFG::GranularityThresholdDisable = 1 3DSTATE_VFG::DistributionGranularity = BATCH 3DSTATE_VF::GeometryDistributionEnable = 1 Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14212>	2021-12-16 00:00:23 +00:00
Anuj Phogat	40b66a4499	anv, iris: Add Wa_22011440098 for DG2 Rework: * Jordan: Set MOCS after `7b78b2fcac` ("intel/genxml: Assert that all MOCS fields are non-zero on Gfx7+") Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14212>	2021-12-16 00:00:22 +00:00
Anuj Phogat	17a1df79ba	anv, iris: Add Wa_16011773973 for DG2 Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14212>	2021-12-16 00:00:22 +00:00
Jason Ekstrand	b8d04863e2	intel/fs: Drop high_quality_derivatives We've never bothered to hook it up in crocus or iris. If we do in the future, it should probably be a NIR pass anyway. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14056>	2021-12-10 21:20:47 +00:00
Jordan Justen	7eb13fc2f2	anv,blorp,iris: Set MOCS for COMPUTE_WALKER post-sync operation We don't currently enable post sync operations, but it is probably better to set them to "internal" MOCS than to remove the non-zero checking for this genxml field. Reworks: * Fix COMPUTE_WALKER in cmd_buffer_trace_rays (s-b Jason) Fixes: `7b78b2fcac` ("intel/genxml: Assert that all MOCS fields are non-zero on Gfx7+") Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13624>	2021-11-08 23:29:51 +00:00
Lionel Landwerlin	361b3fee3c	intel: move away from booleans to identify platforms v2: Drop changes around GFX_VERx10 == 75 (Luis) v3: Replace (GFX_VERx10 < 75 && devinfo->platform != INTEL_PLATFORM_BYT) by (devinfo->platform == INTEL_PLATFORM_IVB) Replace (devinfo->ver >= 5 \|\| devinfo->platform == INTEL_PLATFORM_G4X) by (devinfo->verx10 >= 45) Replace (devinfo->platform != INTEL_PLATFORM_G4X) by (devinfo->verx10 != 45) v4: Fix crocus typo v5: Rebase v6: Add GFX3, ILK & I965 platforms (Jordan) Move ifdef to code expressions (Jordan) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12981>	2021-11-08 16:48:06 +00:00
Jordan Justen	6ffdcc335e	iris: Use mi_builder in iris_load_indirect_location() For example, this allows us to take advantage of command-streamer based register offsets in mi_builder. Ref: `06cf838cbd` ("intel/mi_builder: Support gen11 command-streamer based register offsets") Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13652>	2021-11-04 21:23:21 -07:00
Kenneth Graunke	256d48eb8c	iris: Set MOCS on NULL stream output buffers We'd like to add safeguards against accidental use of MOCS 0 (uncached), which can have large performance implications. One case where we use MOCS of 0 is disabled stream output targets. MOCS shouldn't matter there, as there's no actual buffer to be cached. That said, it should be harmless to set MOCS for these null stream output buffers; we can just assume a MOCS for generic internal buffers. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	d8e1d0fecc	iris: Set MOCS on NULL vertex buffers We'd like to add safeguards against accidental use of MOCS 0 (uncached), which can have large performance implications. One case where we use MOCS of 0 is 3DSTATE_VERTEX_BUFFERS where we set NullVertexBuffer. It shouldn't matter here, as there's no actual buffer to be cached. That said, it should be harmless to set MOCS for null vertex buffers. We can assume an internal buffer and request isl's vertex buffer MOCS. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	369cd9ae28	iris: Set MOCS on 3DSTATE_CONSTANT_ALL packets that disable all buffers We'd like to add safeguards against accidental use of MOCS 0 (uncached), which can have large performance implications. One case where we missed setting a non-zero MOCS was in 3DSTATE_CONSTANT_ALL packets which fully disable all constant buffers. (If any constant buffer was present, we would set an actual MOCS value.) MOCS really shouldn't matter here, as there are no actual constant buffers to be cached. That said, it should be harmless to do so, and we can just assume a generic MOCS for internal buffers. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	0544afd2df	iris: Set MOCS on 3DSTATE_CONSTANT_XS on Gfx9+ We were leaving this blank due to a Broadwell restriction, causing our constant buffers to be uncached. We later fixed this for Gfx12+, but left Gfx9-11 without a fix. We should specify one. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	8336054024	iris: Set default MOCS for NULL depth/stencil/HiZ buffers isl now uses info->mocs regardless of whether there's any actual depth/stencil/HiZ buffers involved, so pass it a legitimate one, rather than zero. When we have entirely NULL surfaces, we just default to isl's MOCS value for an internal depth buffer. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	0a5e225779	iris: Set Bindless Sampler State MOCS We don't use bindless sampler states today, but when we do, we'll want them to have proper MOCS values. This also avoids asserts in upcoming patches which enforce that MOCS isn't zero. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	a6690dc1ee	iris: Drop unnecessary parenthesis Trivial. Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Sagar Ghuge	29762ea897	iris: Drop hint if primitive id is required or not Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13474>	2021-10-26 18:22:15 +00:00
Kenneth Graunke	e79e1ca304	intel: Drop Tigerlake revision 0 workarounds Tigerlake revision 0 is an early stepping that should not be used in production anywhere, so this code was only used for hardware bringup. We can drop the unnecessary workarounds. This also keeps them from triggering on early steppings of other Gfx12 parts. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13266>	2021-10-21 16:53:43 -07:00
Marcin Ślusarz	d05f7b4a2c	intel: fix INTEL_DEBUG environment variable on 32-bit systems INTEL_DEBUG is defined (since `4015e1876a`) as: #define INTEL_DEBUG __builtin_expect(intel_debug, 0) which unfortunately chops off upper 32 bits from intel_debug on platforms where sizeof(long) != sizeof(uint64_t) because __builtin_expect is defined only for the long type. Fix this by changing the definition of INTEL_DEBUG to be function-like macro with "flags" argument. New definition returns 0 or 1 when any of the flags match. Most of the changes in this commit were generated using: for c in `git grep INTEL_DEBUG \| grep "&" \| grep -v i915 \| awk -F: '{print $1}' \| sort \| uniq`; do perl -pi -e "s/INTEL_DEBUG & ([A-Z0-9a-z_]+)/INTEL_DBG(\1)/" $c perl -pi -e "s/INTEL_DEBUG & ($[A-Z0-9_ \|]+$)/INTEL_DBG\1/" $c done but it didn't handle all cases and required minor cleanups (like removal of round brackets which were not needed anymore). Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13334>	2021-10-15 19:55:14 +00:00
Marcin Ślusarz	5387522bd0	iris: fix scratch address patching for TESS_EVAL stage Scratch patching code in iris_upload_dirty_render_state (see MERGE_SCRATCH_ADDR calls) assumes that in all shader stages derived_data field stores 3DSTATE_XS packet first. This is not true for TESS_EVAL (DS), so we end up patching 3DSTATE_TE instead of 3DSTATE_DS leading to DWordLength becoming 11 instead of 9 (9 == 3DSTATE_DS.DWordLength, 2 == 3DSTATE_TE.DWordLength, and 9\|2 == 11), and hardware hanging on the next instruction. Fix this by reversing the order of packets for TESS_EVAL stage. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5499 Fixes: `4256f7ed58` ("iris: Fill out scratch base address dynamically") Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13358>	2021-10-15 07:07:51 +00:00
Anuj Phogat	20c0ca75f5	iris: Enable tessellation redistribution This patch adds Tessellation Distribution on top of Geometry Distribution. Using recommended values based on performance studies across a range of workloads. Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12091>	2021-10-13 22:36:54 +00:00
Anuj Phogat	efa27572a1	iris: Enable geometry distribution Using recommended values based on performance studies across a range of workloads. Rework: * Always enable geometry distribution * Set ListCutIndexEnable if primitive restart is enabled * Set distribution mode based on TEEnable v2: - Flag missing IRIS_DIRTY_VFG bit (Ken) Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12091>	2021-10-13 22:36:54 +00:00
Jason Ekstrand	3e13c4ccf2	anv,iris,genxml: Use NumberOfBarriers on XeHP Ref: bspec 55400 Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11564>	2021-09-30 17:41:33 +00:00
Jason Ekstrand	5f8e043fb6	iris: Handle states=NULL in iris_bind_sampler_states Clover likes to do this to clear out a bunch of samplers without actually passing an array of NULL pointers. It's easy enough to handle in iris. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13072>	2021-09-28 20:54:29 +00:00
Caio Marcelo de Oliveira Filho	f1a7cc54f3	iris: Document push constants allocation Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13014>	2021-09-27 20:51:29 +00:00
Nanley Chery	69242f188c	iris: Finish aux import in iris_resource_from_handle This allows us to delete iris_resource_unfinished_aux_import, which incorrectly assumed that a CCS-enabled resource needs an aux BO. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12795>	2021-09-16 18:07:23 +00:00
Nanley Chery	d32a4cdab9	iris: Simplify an iris_use_pinned_bo call Avoid using a helper function to get the resource BO. This fits in better with the previous iris_use_pinned_bo calls. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12795>	2021-09-16 18:07:23 +00:00
Nanley Chery	89319a0dfd	iris: Split clear color and aux BO checks CCS_E-enabled resources on XeHP have a clear color without an aux BO. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12795>	2021-09-16 18:07:23 +00:00
Nanley Chery	d25515fbf1	iris: Support NULL aux BOs in fill_surface_state XeHP can use CCS_E without an aux BO. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12795>	2021-09-16 18:07:23 +00:00
Nanley Chery	08edf0f7fc	iris: Delete iris_resource_get_clear_color This helper simply is a wrapper to the clear color fields in the iris_resource struct. We choose to delete it for two reasons: 1) It incorrectly asserts that the resource argument has an aux BO. This doesn't hold for CCS_E on XeHP. 2) The majority of functions ignore the helper anyway and access these fields directly. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12795>	2021-09-16 18:07:23 +00:00
Jordan Justen	32e848aeaa	intel: Move subslice_total into devinfo Reworks: * Move asserts for subslice_total into intel_device_info.c (s-b Ken) * Drop now unused intel_device_info_subslice_total (s-b Ken) * Add comment for subslice_total (Ken) Suggested-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12799>	2021-09-13 13:26:23 -07:00
Francisco Jerez	9f1053a1f3	iris: Track dirty UBOs per-stage for more targeted flushing. This allows us to skip over individual constant buffer bindings which haven't been changed since the last flush, or which are set to a user buffer, which means they don't require flushing. Omitting this commit would lead to the following statistically significant Piglit Draw Overhead regressions: 107/DrawArrays (16 VBO\| 8 UBO\| 8 Tex) w/ 1 UBO change: XXX ±2.31% x22 -> XXX ±2.55% x21 d=-3.49% ±2.38% p=0.00% 79/DrawArrays ( 1 VBO\| 8 UBO\| 8 Tex) w/ 8 UBOs change: XXX ±1.90% x22 -> XXX ±2.25% x21 d=-3.20% ±2.04% p=0.00% 78/DrawArrays ( 1 VBO\| 8 UBO\| 8 Tex) w/ 1 UBO change: XXX ±2.64% x22 -> XXX ±2.58% x21 d=-2.74% ±2.58% p=0.12% 45/DrawElements (16 VBO\| 8 UBO\| 8 Tex) w/ 1 UBO change: XXX ±2.53% x22 -> XXX ±2.29% x21 d=-2.41% ±2.39% p=0.20% 108/DrawArrays (16 VBO\| 8 UBO\| 8 Tex) w/ 8 UBOs change: XXX ±2.10% x22 -> XXX ±1.41% x21 d=-2.36% ±1.78% p=0.01% 16/DrawElements ( 1 VBO\| 8 UBO\| 8 Tex) w/ 1 UBO change: XXX ±2.44% x22 -> XXX ±1.19% x21 d=-2.12% ±1.93% p=0.09% 46/DrawElements (16 VBO\| 8 UBO\| 8 Tex) w/ 8 UBOs change: XXX ±2.93% x22 -> XXX ±2.44% x21 d=-1.99% ±2.68% p=1.93% Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12691>	2021-09-02 03:14:37 +00:00
Francisco Jerez	8be320117b	iris: Use separate dirty bits for UBO and SSBO flushes. This moves UBO+SSBO flushing into a dirty bit separate from the one used for image and sampler views, which saves some CPU overhead in the frequent case where buffers from only one or the other set are updated. Omitting this commit would lead to the following statistically significant Piglit Draw Overhead regressions: 107/DrawArrays (16 VBO\| 8 UBO\| 8 Tex) w/ 1 UBO change: XXX ±2.31% x22 -> XXX ±1.80% x21 d=-24.31% ±1.91% p=0.00% 78/DrawArrays ( 1 VBO\| 8 UBO\| 8 Tex) w/ 1 UBO change: XXX ±2.64% x22 -> XXX ±2.21% x21 d=-24.13% ±2.22% p=0.00% 45/DrawElements (16 VBO\| 8 UBO\| 8 Tex) w/ 1 UBO change: XXX ±2.53% x22 -> XXX ±1.90% x21 d=-23.63% ±2.07% p=0.00% 16/DrawElements ( 1 VBO\| 8 UBO\| 8 Tex) w/ 1 UBO change: XXX ±2.44% x22 -> XXX ±1.97% x21 d=-23.23% ±2.04% p=0.00% 108/DrawArrays (16 VBO\| 8 UBO\| 8 Tex) w/ 8 UBOs change: XXX ±2.10% x22 -> XXX ±1.50% x21 d=-22.15% ±1.71% p=0.00% 79/DrawArrays ( 1 VBO\| 8 UBO\| 8 Tex) w/ 8 UBOs change: XXX ±1.90% x22 -> XXX ±1.70% x21 d=-22.12% ±1.64% p=0.00% 17/DrawElements ( 1 VBO\| 8 UBO\| 8 Tex) w/ 8 UBOs change: XXX ±2.85% x22 -> XXX ±1.59% x21 d=-21.03% ±2.22% p=0.00% 46/DrawElements (16 VBO\| 8 UBO\| 8 Tex) w/ 8 UBOs change: XXX ±2.93% x22 -> XXX ±1.09% x21 d=-20.62% ±2.18% p=0.00% 7/DrawElements ( 1 VBO\| 8 UBO\| 8 Tex) w/ vertex attrib change: XXX ±9.30% x22 -> XXX ±7.02% x21 d=-6.49% ±8.08% p=1.19% 68/DrawArrays ( 1 VBO\| 8 UBO\| 8 Tex) w/ shader program change: XXX ±1.60% x22 -> XXX ±1.93% x21 d=-2.23% ±1.75% p=0.01% 6/DrawElements ( 1 VBO\| 8 UBO\| 8 Tex) w/ shader program change: XXX ±2.90% x22 -> XXX ±2.71% x21 d=-2.04% ±2.78% p=2.08% Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12691>	2021-09-02 03:14:37 +00:00
Francisco Jerez	5c44df011f	iris: Insert buffer-local memory barriers for UBO reads. Similar to what was previously done for other kinds of buffers -- Insert memory barriers at resolves-and-flushes time instead of relying on the history flush mechanism. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12691>	2021-09-02 03:14:37 +00:00
Francisco Jerez	077af5c928	iris: Insert buffer-local memory barriers for SSBO reads and writes. Similar to what was previously done for vertex buffers, render buffers, etc -- Insert memory barriers at resolves-and-flushes time instead of relying on the history flush mechanism. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12691>	2021-09-02 03:14:37 +00:00
Francisco Jerez	cb9f02f863	iris: Add read-write domain for data cache. This will allow us to remove the history flushes performed for SSBOs and instead take advantage of the same mechanism used for tracking other memory accesses. v2: Use C99 designated initializers (Ken). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12691>	2021-09-02 03:14:37 +00:00
Francisco Jerez	c677e76483	iris: Insert buffer-local memory barriers for indirect draw parameters. This adds buffer-local barriers so any required synchronization commands are emitted before a buffer object is used as source for indirect draw parameters. An unconditional PIPE_CONTROL meant to flush the contents of the draw count buffer can now be removed, since it's redundant with the more accurate buffer-local barrier introduced here, which should avoid flushing in cases where the buffer wasn't written by any incoherent cache since the last flush. (Rebased by Kenneth Graunke.) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12691>	2021-09-02 03:14:37 +00:00
Francisco Jerez	51f022cc03	iris: Add separate dirty bit for VBO flushes. Instead of emitting barriers every time IRIS_DIRTY_VERTEX_BUFFERS is flagged, use a separate dirty bit and optimize out the barriers in cases where the same buffer object is re-bound as vertex buffer. Omitting this commit would lead to the following statistically significant Piglit Draw Overhead regressions: 36/DrawElements (16 VBO\| 8 UBO\| 8 Tex) w/ vertex attrib change: XXX ±7.22% x22 -> XXX±11.09% x21 d=-20.10% ±8.06% p=0.00% 98/DrawArrays (16 VBO\| 8 UBO\| 8 Tex) w/ vertex attrib change: XXX ±7.27% x22 -> XXX ±7.70% x21 d=-17.76% ±6.83% p=0.00% 69/DrawArrays ( 1 VBO\| 8 UBO\| 8 Tex) w/ vertex attrib change: XXX ±9.94% x22 -> XXX ±8.72% x21 d=-7.46% ±9.08% p=1.02% 53/DrawElements (16 VBO\| 8 UBO\| 8 Tex) w/ depth enable change: XXX ±8.34% x22 -> XXX ±6.88% x21 d=-7.30% ±7.45% p=0.26% 61/DrawElements (16 VBO\| 8 UBO\| 8 Tex) w/ cull face enable change: XXX±10.22% x22 -> XXX ±8.63% x21 d=-6.75% ±9.23% p=2.11% 55/DrawElements (16 VBO\| 8 UBO\| 8 Tex) w/ stencil enable change: XXX ±9.30% x22 -> XXX ±7.25% x21 d=-6.60% ±8.16% p=1.14% 50/DrawElements (16 VBO\| 8 UBO\| 8 Tex) w/ viewport change: XXX ±6.48% x22 -> XXX ±5.93% x21 d=-6.58% ±6.04% p=0.09% 54/DrawElements (16 VBO\| 8 UBO\| 8 Tex) w/ depth clamp enable change: XXX ±9.95% x22 -> XXX ±7.95% x21 d=-6.50% ±8.81% p=2.02% 35/DrawElements (16 VBO\| 8 UBO\| 8 Tex) w/ shader program change: XXX ±7.27% x22 -> XXX ±7.25% x21 d=-5.77% ±7.06% p=1.06% Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12691>	2021-09-02 03:14:37 +00:00
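
Note on `d05f7b4a2c` (INTEL_DEBUG on 32-bit systems): the truncation described in that commit message can be illustrated with a small sketch. This is not the Mesa code. The `demo_*` names and flag values are hypothetical, a `uint32_t` stands in for a 32-bit `long`, and `__builtin_expect` is assumed to be available (GCC or Clang). The first helper narrows the 64-bit mask before testing the flag, as happens when the whole mask is passed through `__builtin_expect` on such a target; the second tests at full width and passes only the 0/1 result through `__builtin_expect`, which is the shape the commit message describes for the new function-like macro.

```c
#include <stdint.h>

/* Hypothetical debug flags: one in the upper 32 bits of a 64-bit mask,
 * one in the lower 32 bits. */
#define DEMO_DEBUG_HIGH_FLAG (1ull << 40)
#define DEMO_DEBUG_LOW_FLAG  (1ull << 3)

/* Models the old pattern on a target where long is 32 bits wide: the mask
 * is narrowed first, then tested. uint32_t stands in for that long. */
static inline int
demo_flag_set_narrowed(uint64_t mask, uint64_t flags)
{
   uint32_t narrowed = (uint32_t)mask;   /* upper 32 bits are lost here */
   return (narrowed & flags) != 0;
}

/* Models the fix: test the flags at full 64-bit width and hand only the
 * resulting 0 or 1 to __builtin_expect as a branch hint. */
static inline int
demo_flag_set_full(uint64_t mask, uint64_t flags)
{
   return __builtin_expect((mask & flags) != 0, 0);
}
```

With only `DEMO_DEBUG_HIGH_FLAG` set in the mask, `demo_flag_set_narrowed()` reports the flag as clear while `demo_flag_set_full()` reports it as set; for `DEMO_DEBUG_LOW_FLAG` both agree.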

766 commits