fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-24 15:20:10 +01:00

Author	SHA1	Message	Date
Iván Briano	db48dcb4f3	intel/compiler: remove what looks like a bad rebase This bit in the compiler looks like it was added by accident on one of the latest versions of the original commit, but it clearly doesn't belong there. Fixes: `03e1e19246` ("anv: Refactor descriptor copy") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15016>	2022-02-15 01:04:47 +00:00
Marcin Ślusarz	b6557b80a5	intel/compiler: fix array & struct IO lowering in mesh shaders We really need offsets to be in dwords, not in vec4s. The bug manifests as random failure of func.mesh.clipdistance.5 crucible test, where stores to gl_MeshVerticesNV[x].gl_ClipDistance[4+n] actually write to gl_MeshVerticesNV[x].gl_ClipDistance[1+n]. Fixes: `1f438eb033` ("intel/compiler: Implement Mesh Output") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14997>	2022-02-14 19:18:23 +00:00
Timur Kristóf	0445802ab2	compiler: Extract num_mesh_vertices_per_primitive function. Prevent code duplication. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15005>	2022-02-14 11:13:42 +01:00
Daniel Schürmann	2a92452a0e	nir/opt_shrink_vectors: Remove shrinking of store intrinsics data source This is done via nir_opt_shrink_stores. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14480>	2022-02-11 11:50:47 +01:00
Ian Romanick	38a94c82e6	intel/fs: Don't optimize out 1.0x and -1.0x This (sort of) matches the behavior of nir_opt_algebraic. This ensures that subnormal values are properly flushed to zero. With the aid of "nir/search: Float sources of texture instructions are float users" and "nir/search: Transitively apply is_only_used_as_float", there would have been no shader-db regressions on Intel platforms. However, those caused a significant increase in compile time. Since the instruction regressions were so small, I just dropped those commits rather than improve them. All Haswell and newer platforms had similar results. (Ice Lake shown) total instructions in shared programs: 20125042 -> 20125094 (<.01%) instructions in affected programs: 7184 -> 7236 (0.72%) helped: 0 HURT: 32 HURT stats (abs) min: 1 max: 4 x̄: 1.62 x̃: 2 HURT stats (rel) min: 0.11% max: 1.49% x̄: 0.85% x̃: 0.78% 95% mean confidence interval for instructions value: 1.39 1.86 95% mean confidence interval for instructions %-change: 0.74% 0.96% Instructions are HURT. total cycles in shared programs: 862745586 -> 862746551 (<.01%) cycles in affected programs: 109872 -> 110837 (0.88%) helped: 12 HURT: 23 helped stats (abs) min: 2 max: 774 x̄: 90.83 x̃: 19 helped stats (rel) min: 0.07% max: 25.23% x̄: 3.06% x̃: 0.40% HURT stats (abs) min: 2 max: 1106 x̄: 89.35 x̃: 12 HURT stats (rel) min: 0.08% max: 45.40% x̄: 3.01% x̃: 0.47% 95% mean confidence interval for cycles value: -60.09 115.23 95% mean confidence interval for cycles %-change: -2.21% 4.07% Inconclusive result (value mean confidence interval includes 0). All of the shaders hurt are in either UE4 shooter-game or shooter_demo. Tiger Lake Instructions in all programs: 159893213 -> 159893290 (+0.0%) SENDs in all programs: 6936431 -> 6936431 (+0.0%) Loops in all programs: 38385 -> 38385 (+0.0%) Cycles in all programs: 7019259514 -> 7019260087 (+0.0%) Spills in all programs: 101389 -> 101389 (+0.0%) Fills in all programs: 131532 -> 131532 (+0.0%) Ice Lake Instructions in all programs: 143624164 -> 143624235 (+0.0%) SENDs in all programs: 6980289 -> 6980289 (+0.0%) Loops in all programs: 38383 -> 38383 (+0.0%) Cycles in all programs: 8440082767 -> 8440083238 (+0.0%) Spills in all programs: 102246 -> 102246 (+0.0%) Fills in all programs: 131908 -> 131908 (+0.0%) Skylake Instructions in all programs: 134185424 -> 134185495 (+0.0%) SENDs in all programs: 6938790 -> 6938790 (+0.0%) Loops in all programs: 38356 -> 38356 (+0.0%) Cycles in all programs: 8222366529 -> 8222366923 (+0.0%) Spills in all programs: 98821 -> 98821 (+0.0%) Fills in all programs: 125218 -> 125218 (+0.0%) Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Fixes: `f5dd6dfe01` ("anv: enable VK_KHR_shader_float_controls and SPV_KHR_float_controls") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13999>	2022-02-10 18:15:39 +00:00
Jordan Justen	e2cd0c3a3c	intel/fs: Assert that old pull-const code is not used if devinfo->has_lsc Jason changed this to use LSC in: `f5876dfdb9` ("intel/fs: Lower uniform pull constant load message to LSC dataport") Cc: 22.0 <mesa-stable> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14384>	2022-02-09 10:39:17 +00:00
Rohan Garg	03e1e19246	anv: Refactor descriptor copy Refactor descriptor copies to use the existing helper functions instead of rolling our own. In order to facilitate this, we need to store the appropriate buffer views for the relevant descriptors internally and reuse them in the helpers. Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14909>	2022-02-09 09:24:37 +00:00
Lionel Landwerlin	c78be5da30	intel/fs: lower ray query intrinsics v2: Add helper for acceleration->root_node computation (Caio) v3: Update comment on "done" bit (Caio) Remove progress bool value for impl function (Caio) Don't use nir_shader_instructions_pass to search the shader (Caio) v4: Rename variable for if/else block (Caio) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	712d8fb043	intel/nir: document RT builder Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	0465714790	intel/nir/rt: add more helpers for ray queries v2: Split stack_id helper in sync/async version (Caio) Fixup a few bit field mistake (Caio) Simplify some bitfield manipulations (Caio) v3: Remove duplicated helper (Caio) Simplify brw_nir_rt_set_dword_bit_at (Caio) Comment brw_nir_rt_query_mark_init (Lionel) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	d5b994ec8a	intel/nir/rt: make RT manipulation helpers helper invocations ready Since we need to be able to perform ray queries in helper invocations, we need to have all the helpers properly tag their load/store operations so that they operate in helper lanes. v2: Switch from macros to inline functions (Caio) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	fb69fed65b	intel/nir: document committed argument Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	b0624e414f	intel/fs: make trivial shader complete tracing operations with missing shaders v2: Apply workaround only on < DG2-512-C0 & < DG2-128-B0 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	35bd19f53d	intel/nir/rt: load bvh_level value off mem_hit structure Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	6d9ae6ec1e	intel: add a new intrinsic to get the shader stage from bindless shaders We'll use this to apply ray tracing operations in our trivial return shader based on the stage we're in. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	bb40e999d1	intel/nir: use a single intel intrinsic to deal with ray traversal In the future we'll want to reuse this intrinsic to deal with ray queries. Ray queries will use a different global pointer and programmatically change the control/level arguments of the trace send instruction. v2: Comment on barrier after sync trace instruction (Caio) Generalize lsc helper (Caio) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	39f6cd5d79	intel/nir: fix shader call lowering We're replacing a generic instruction by an intel specific one, we need to remove the previous instruction. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `c5a42e4010` ("intel/fs: fix shader call lowering pass") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	2665595244	intel/fs: limit FS dispatch to SIMD16 when using ray queries Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	57eed6698b	intel/compiler: tracker number of ray queries in prog_data Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	9b366243ed	intel/fs: load more fields from BVH instance leafs v2: Fixup mask (Caio) Drop old comment (Caio) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:24 +00:00
Lionel Landwerlin	c89024e446	intel/fs: don't set allow_sample_mask for CS intrinsics Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `77486db867` ("intel/fs: Disable sample mask predication for scratch stores") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:24 +00:00
Lionel Landwerlin	9d22f8ed23	intel/fs: add support for ACCESS_ENABLE_HELPER v2: Factor out fragment shader masking on send messages (Caio) Update comments (Caio) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:24 +00:00
Lionel Landwerlin	c199f44d17	intel/fs: name sources for A64 opcodes Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:24 +00:00
Lionel Landwerlin	23ce94ff7e	intel/nir/rt: add a new number of SIMD lanes per DSS helper v2: Add prefix brw_nir_rt (Caio) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:24 +00:00
Lionel Landwerlin	61c9b7a82e	intel/fs: add support for Eu/Thread/Lane id This index will be used for accessing ray query data in memory. v2: Drop a MOV (Caio) v3: Rework back code emission (Caio) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:24 +00:00
Lionel Landwerlin	3dabe93257	intel/fs: rework dss_id opcode into generic opcode We'll want different types of IDs based on topology. Let's make this more flexible and also move the bit shifting code a layer above where it's easier to do bitshifting operations, especially if you need to stash things into temporary registers. v2: Keep previous comment. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:24 +00:00
Lionel Landwerlin	4deb8e86df	nir: change intel dss_id intrinsic to topology_id This will allow to reuse the same intrinsic for various topology based ID. v2: fix intrinsic comment (Caio) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:24 +00:00
Marcin Ślusarz	18e628135d	anv: Add support for UBOs, SSBOs and push constants in Mesh pipeline Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13662>	2022-02-02 18:17:57 +00:00
Lionel Landwerlin	0cd93c59ef	intel/compiler: add primitive rate output support Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13739>	2022-02-02 17:09:46 +00:00
Lionel Landwerlin	cebf284ac1	intel/compiler: add a new pass to lower shading rate into HW format Rework: * Jason: Modernize brw_nir_lower_shading_rate_output: 1. Use nir_shader_instructions_pass() 2. Use *_imm builder helpers. 3. Use nir_intrinsic_base() instead of ->const_index[0] v2: Also lower loads (Caio) v3: Update stage check to trigger lowering (Caio) v4: Assert on != MESH (Caio) v5: Fixup instruction insertion (Caio) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13739>	2022-02-02 17:09:46 +00:00
Caio Oliveira	8bab8f6422	compiler, intel: Add gl_shader_stage_is_mesh() And replace the previous Intel-specific function. Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14823>	2022-02-01 17:41:25 +00:00
Connor Abbott	913bec10c4	nir/lower_subgroups: Rename lower_shuffle to lower_relative_shuffle This option only applies to relative shuffles (up/down/xor), and in a moment we're going to add an option to lower normal shuffles, so rename it. While we're here, rename lower_shuffle() to lower_to_shuffle() for similar reasons. Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14412>	2022-02-01 16:27:45 +00:00
Marcin Ślusarz	24fef8f33d	intel/compiler: Use Task/Mesh InlineData for the first few push constants Replace load_mesh_global_arg_addr_intel with a more general intrinsic load_mesh_inline_data_intel, since inline data now hold both a pointer descriptor information and the first few push constants. Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14788>	2022-01-29 06:32:19 +00:00
Marcin Ślusarz	1d9f47325b	intel/compiler: handle gl_[Clip\|Cull]Distance from mesh in fragment shaders Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14788>	2022-01-29 06:32:19 +00:00
Marcin Ślusarz	baa17865de	intel/compiler: handle gl_[Clip\|Cull]Distance in mesh shaders Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14788>	2022-01-29 06:32:19 +00:00
Caio Oliveira	856a0cacb1	intel/compiler: Merge Per-Primitive attribute handling in Mesh case Just a refactor, no behavior change. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14788>	2022-01-29 06:32:19 +00:00
Caio Oliveira	2b8b884bcd	intel/compiler: Have specific mesh handling in calculate_urb_setup() Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14788>	2022-01-29 06:32:19 +00:00
Caio Oliveira	448a840b39	intel/fs/xehp: Add unit test for handling of RaR deps across multiple pipelines. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Paulo Zanoni	d107a0bff8	intel/fs: Assert the GPU supports 64bit ops if present at lower_scoreboard time. On platforms where we don't support 64 bit instructions we shouldn't pass such instructions for the code generator to lower into supported instructions, because this makes their execution pipeline unpredictable to the scoreboard lowering pass on XeHP+ platforms. We really should be reducing all these 64 bit instructions before code generation, so here we add an assert to help us catch and fix these cases more easily. Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> [ Francisco Jerez: Also allow has_integer_dword_mul. ] Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Francisco Jerez	79fb7f9de8	intel/fs: Perform 64-bit CLUSTER_BROADCAST lowering in the lower_regioning pass. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Francisco Jerez	bdf8ac2466	intel/fs: Honor strided source regions specified by the IR for CLUSTER_BROADCAST. This fixes a bug in the CLUSTER_BROADCAST code generation that causes the original IR region to be ignored, this will be a problem when we start lowering 64-bit CLUSTER_BROADCAST instructions at the IR level, since it will lead to instructions with non-trivial regioning. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Francisco Jerez	6c8782c135	intel/fs: Perform 64-bit SEL_EXEC lowering in the lower_regioning pass. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Francisco Jerez	9449b71bdd	intel/fs: Perform 64-bit SHUFFLE lowering in the lower_regioning pass. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Francisco Jerez	d2d72fccf1	intel/fs: Fix destination suboffset calculations for non-trivial strides in SHUFFLE codegen. One of the two SHUFFLE implementations wasn't taking into account the destination stride at all, and the other (more commonly used) one was taking it into account incorrectly since brw_reg::hstride represents the stride logarithmically, so we need to use a left-shift operator instead of product. Found by inspection. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Francisco Jerez	d1038197f3	intel/fs: Take into account region strides during SIMD lowering decision of SHUFFLE. This fixes a bug in the handcrafted SIMD lowering done by the SHUFFLE code generation, which wasn't taking into account the source and destination region strides while deciding whether it needs to split an instruction. v2: Use new element_sz() helper instead of left shift. (Lionel) Fixes: `90c9f29518` ("i965/fs: Add support for nir_intrinsic_shuffle") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Francisco Jerez	44e48751d2	intel/fs: Teach the lower_regioning pass how to split instructions of unsuported exec type. This adds some generic infrastructure that allows splitting any instruction into a number of instructions of a smaller legal execution type. This is meant to replace several instances of handcrafted 64bit type lowering done manually in the code generator, which is rather error-prone, prevents scheduling of the lowered instructions, and makes them invisible to the SWSB pass on Gfx12+ platforms, which will become especially problematic on Gfx12.5+ since the EUs introduce multiple asynchronous execution pipelines which the SWSB pass needs to be able to synchronize to one another, so it's critical for the real execution type of the instruction to be visible to the SWSB pass. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Francisco Jerez	539c879a6b	intel/fs: Move legal exec type calculation into helper function in lower_regioning pass. Right now the execution type lowering functionality of this pass assumes that an integer type of the original bit size is always acceptable, however we'll want more complex behavior than that in order to leverage this pass to automate the lowering of unsupported 64-bit operations into multiple 32-bit operations. In order to do that calculate the closest legal execution type from a new helper function, and take advantage of that function from the has_invalid_exec_type() helper, along the lines of other lower_regioning() helpers structured as a pair of has_invalid_foo() + required_foo() functions. This shouldn't have any functional changes. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Francisco Jerez	3886e63033	intel/fs/xehp: Merge repeated in-order read dependencies instead of replacement. Previously the software scoreboard structure would drop previous dependencies for a given register and replace them with the most recent one for the same register when a new instruction (or set of instructions) is processed. This worked correctly on the Gfx12LP platforms this code was originally designed for, because a repeated dependency on the same register would either require the second instruction to synchronize against the first (so the first dependency could be disregarded from that point on) or require the dependency to be RaR and in-order, which allows the synchronization to be optimized out (the first dependency could still be disregarded as well, since the pipeline is in-order). However the latter assumption will break on upcoming Gfx12HP platforms, because they have multiple asynchronous FPU pipelines, so whenever we hit a RaR dependency we need to propagate forward both dependencies, since the order in which both reads will complete is not guaranteed by the hardware in cases where they occur from different asynchronous pipelines. Note that this dependency propagation change requires us to change the definition of dependency::done as well, since that constant is defined to discard any previous dependency information when used as argument for shadow(). This has been reported to fix the following conformance failures on DG2: KHR-GL46.shaders.uniform_block.random.all_per_block_buffers.19 dEQP-GLES3.functional.shaders.derivate.fwidth.* Reported-by: Tapani Pälli <tapani.palli@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5670 Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Ian Romanick	945fb51fb5	intel/fs: Fix gl_FrontFacing optimization on Gfx12+ It's not obvious why the (gl_FrontFacing ? -1.0 : 1.0) case was handled different for Gfx12+ than for previous generations, and it's not correct. It tries to negate the result as an integer, and it does this before the mask operation that clears the other bits in the value. When we eventually support dual-SIMD8 dispatch, the other front-facing bit is in g1.6 at bit 15, so similar code should be possible there. Reviewed-by: Matt Turner <mattst88@gmail.com> Fixes: `c92fb60007` ("intel/fs/gen12: Implement gl_FrontFacing on gen12+.") Closes: #5876 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14625>	2022-01-20 22:37:18 +00:00
Dave Airlie	f83f72be8e	intel/brw: drop gl header from the brw backend. This shouldn't be used anywhere now once we drop the GLbitfield64 types. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14605>	2022-01-19 21:54:58 +00:00
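
Note on d2d72fccf1 (SHUFFLE destination suboffsets): the message says brw_reg::hstride stores the stride logarithmically, so a left shift is needed instead of a product. The Python sketch below is only an illustration of that arithmetic, not the Mesa code. It assumes the usual hardware encoding of the horizontal stride field (0 -> 0, 1 -> 1, 2 -> 2, 3 -> 4 elements), which the commit message does not spell out. The function names are hypothetical.

```python
def decode_hstride(encoded: int) -> int:
    """Decode an encoded horizontal stride into a stride in elements.

    Assumed encoding (not stated in the commit message):
    0 -> 0, 1 -> 1, 2 -> 2, 3 -> 4.
    """
    if not 0 <= encoded <= 3:
        raise ValueError("encoded hstride must be in 0..3")
    return 0 if encoded == 0 else 1 << (encoded - 1)


def dst_offset_shift(group: int, encoded: int) -> int:
    """Element offset of channel `group`, using a left shift (hypothetical helper)."""
    if not 1 <= encoded <= 3:
        raise ValueError("a non-zero destination stride is expected")
    return group << (encoded - 1)


def dst_offset_product(group: int, encoded: int) -> int:
    """Element offset computed by multiplying by the raw encoded field (the mistake)."""
    return group * encoded


if __name__ == "__main__":
    group = 8
    for encoded in (1, 2, 3):
        print(
            f"encoded={encoded} stride={decode_hstride(encoded)} "
            f"shift={dst_offset_shift(group, encoded)} "
            f"product={dst_offset_product(group, encoded)}"
        )
```

Under this assumed encoding the two computations agree for strides 1 and 2 and diverge only at stride 4, which may be why such a mistake could go unnoticed until found by inspection.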


2002 commits