fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-21 18:00:13 +01:00

Author	SHA1	Message	Date
Ian Romanick	6b14da33ad	intel/fs: nir: Add nir_intrinsic_dpas_intel v2: Fix parameter order in nir_intrinsic_dpas_intel to DPAS conversion. v3: Fix float16 destination DPAS on DG2. v4: Use nir_component_mask(...) instead of 0xffff. Suggested by Caio. v5: Rebase on !26323. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25994>	2023-12-29 20:28:43 -08:00
Francisco Jerez	702eabaaae	intel/fs/xe2+: Update for new layout of vertex setup data in PS payload. The interpolation deltas of PS inputs now show up as a 12B vec3 (A0, A1-A0, A2-A0) in the ATTR file, instead of the previously used 16B format with an unused component. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26606>	2023-12-28 11:07:03 -08:00
Francisco Jerez	f295494cee	intel/fs/xe2+: Update poly info PS payload for new multi-polygon dispatch format. This includes the render target array index, viewport index, and front/back facing fields, which are now replicated per pair of subspans in order to support fixed-layout multi-polygon PS dispatch. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26606>	2023-12-28 11:07:03 -08:00
Francisco Jerez	4cc9c37bba	intel/fs/xe2+: Update location of sample ID fields in PS payload. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26606>	2023-12-28 11:07:03 -08:00
Francisco Jerez	a0ae3c0dba	intel/fs/xe2+: Update uses of pixel/sample mask from PS thread payload. Note from Caio: proper handling of brw_sample_mask_reg will appear in later patches. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26606>	2023-12-28 11:07:03 -08:00
Francisco Jerez	28aec45eed	intel/fs/gfx12: Implement multi-polygon format of render target array index in PS payload. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:31 +00:00
Francisco Jerez	5b1ab77423	intel/fs/gfx12: Implement multi-polygon format of back/front-facing flag in PS payload. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:31 +00:00
Francisco Jerez	83a0252e8d	intel/fs: Pass builder to per_primitive_reg(). Matches prototype of interp_reg(), will be useful in a subsequent commit. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:30 +00:00
Francisco Jerez	8e9f09dbe5	intel/fs: Provide component index explicitly to interp_reg(). Main motivation is that for multipolygon PS shaders the i-th plane parameter for the j-th input attribute will no longer necessarily be a scalar, since different channels may be processing different polygons with different input plane parameters, so simply taking a component() of the result of interp_reg() will no longer work. Instead of duplicating the multipolygon handling logic in every caller of interp_reg(), fold the component() call into interp_reg() so we can replace it with multipolygon-correct code more easily. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:30 +00:00
Francisco Jerez	b26cf8b189	intel/fs: Map all TES input attributes to ATTR register number 0. Instead of treating fs_reg::nr as an offset for ATTR registers simply consider different indices as denoting disjoint spaces that can never be accessed simultaneously by a single region. From now on geometry stages will just use ATTR #0 for everything and select specific attributes via offset() with the native dispatch width of the program, which should work on current platforms as well as on Xe2+. See "intel/fs: Map all GS input attributes to ATTR register number 0." for the rationale. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:30 +00:00
Francisco Jerez	ef12565a37	intel/fs: Map all VS input attributes to ATTR register number 0. Instead of treating fs_reg::nr as an offset for ATTR registers simply consider different indices as denoting disjoint spaces that can never be accessed simultaneously by a single region. From now on geometry stages will just use ATTR #0 for everything and select specific attributes via offset() with the native dispatch width of the program, which should work on current platforms as well as on Xe2+. See "intel/fs: Map all GS input attributes to ATTR register number 0." for the rationale. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:30 +00:00
Francisco Jerez	1d22721b5a	intel/fs: Map all GS input attributes to ATTR register number 0. The fs_reg::nr field currently has a somewhat inconsistent meaning for ATTR registers depending on the shader stage. In geometry stages it has a similar effect as fs_reg::offset except it's expressed in 32B units instead of B units. In the PS however it's expressed in units of logical scalar attributes (16B on present platforms), which isn't currently handled correctly throughout the back-end since some places assume 32B units in all cases. The different format of the PS setup data in multi-polygon dispatch modes would make its behavior even more irregular, which would be worsened further (for both geometry and pixel stages) by the register size changes coming up on Xe2, particularly in brw_ir_fs.h helpers where neither the devinfo struct nor the shader stage are available. Instead of treating it as an offset simply consider different fs_reg::nr indices as denoting disjoint spaces that can never be accessed simultaneously by a single region. From now on geometry stages will just use ATTR #0 for everything and select specific attributes via offset() with the native dispatch width of the program, which should work on current platforms as well as on Xe2+. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:30 +00:00
Caio Oliveira	bfc953add7	intel/compiler: Use C helpers to access builtin types Remove usage of C++ static members as they are going to be removed. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26658>	2023-12-15 03:09:19 +00:00
Sagar Ghuge	a4947f7bd8	intel/fs: Adjust destination size for load ubo on Xe2+ Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26639>	2023-12-13 19:06:21 +00:00
Sagar Ghuge	e0ce94318b	intel/fs: Adjust destination size for global load constant on Xe2+ Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26639>	2023-12-13 19:06:21 +00:00
Sagar Ghuge	11fea46bdc	intel/fs: Adjust destination size for image size intrinsic Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26639>	2023-12-13 19:06:21 +00:00
Caio Oliveira	a8b2426419	intel/compiler: Use reference instead of pointer for fs_visitor Per Ian suggestion. Also clear up a few unnecessary casts around the code and use `s` for fs_visitor ("shader"). Note to include a reference in ntf we need to set it during initialization, so create an explicit mem_ctx for it. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:14 +00:00
Caio Oliveira	77ab74ccc2	intel/compiler: Use reference instead of pointer for nir_to_brw_state Per Ian suggestion. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:14 +00:00
Caio Oliveira	4e5fcccd01	intel/compiler: Create and use nir_to_brw() function Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:14 +00:00
Caio Oliveira	38a42e5aa1	intel/compiler: Add ctor to fs_builder that just takes the shader Uses the dispatch_width from the shader (fs_visitor). This was not possible before because the dispatch_width was not part of backend_shader. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:14 +00:00
Caio Oliveira	cf730adc58	intel/compiler: Make fs_builder include fs_visitor and not the other way This will allow fs_builder to have a reference to an fs_visitor (a "fs_shader" really), instead of a reference to a backend_shader. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:14 +00:00
Caio Oliveira	5b8ec015f2	intel/compiler: Don't use fs_visitor::bld in remaining places The remaining users can simply create a new builder at_end() if needed. In many places a new builder object is already being constructed, so just give more specific instructions. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:14 +00:00
Caio Oliveira	c73c1aa496	intel/compiler: Annotate and use nir_to_brw_state::bld Use the "current bld" in nir_to_brw_state more widely, and also replace it with an annotated version when applicable (to associate it with a NIR instruction being lowered). After filling a block we reset it back to the original value. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:13 +00:00
Caio Oliveira	34c28680b1	intel/compiler: Stop using fs_visitor::bld field in NIR conversion Provide its own builder in nir_to_brw_state. This will eventually allow removing the one in fs_visitor. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:13 +00:00
Caio Oliveira	79735fa783	intel/compiler: Move remaining NIR conversion fields to nir_to_brw_state Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:13 +00:00
Caio Oliveira	5cb189636d	intel/compiler: Move nir_ssa_value into a local structure Create a nir_to_brw_state struct that is valid only during the NIR to backend translation and use it for nir_ssa_values array. This removes some NIR specific handling out of the fs_visitor -- nowadays effectively an fs_shader. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:13 +00:00
Caio Oliveira	afe75d65be	intel/compiler: Make NIR resources helpers static Remove get_nir_src_block() since it is not used anywhere. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:13 +00:00
Caio Oliveira	a7a27ee95e	intel/compiler: Make NIR atomic conversion functions static Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:13 +00:00
Caio Oliveira	5777943381	intel/compiler: Make non-intrinsic NIR conversion functions static Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:13 +00:00
Caio Oliveira	2385d6087a	intel/compiler: Make setup functions of NIR emission static Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:13 +00:00
Caio Oliveira	3899e6b1d8	intel/compiler: Make functions for NIR control flow conversion static Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:13 +00:00
Caio Oliveira	860ec33f9a	intel/compiler: Make more functions in NIR conversion static Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:13 +00:00
Caio Oliveira	acca9dbf6b	intel/compiler: Make a NIR intrinsic emission functions static Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:13 +00:00
Caio Oliveira	c12460b01e	intel/compiler: Move NIR emission code to brw_fs_nir.cpp This is a preparation to reorganize NIR emission code. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:13 +00:00
Caio Oliveira	1ef6415d22	intel/compiler: Remove unused headers Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:13 +00:00
Rohan Garg	db6aaa691d	intel/compiler: infer the number of operands using lsc_op_num_data_values nir_emit_global_atomic should utilize lsc_op_num_data_values to infer the number of operands for global atomic ops, following the same pattern as nir_emit_surface_atomic Fixes: `90a2137` ('intel/compiler: Use LSC opcode enum rather than legacy BRW_AOPs') Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26432>	2023-12-07 14:40:24 +00:00
Rohan Garg	46d98a71ef	intel/compiler: use the proper enum type to store the op Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26432>	2023-12-07 14:40:24 +00:00
Faith Ekstrand	e3ff5a3b0e	intel/vec4: Use MESA_PRIM_* instead of GL_* Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24821>	2023-12-05 23:12:32 +00:00
Lionel Landwerlin	83a1657b6c	intel/fs: fix incorrect register flag interaction with dynamic interpolator mode Once NIR code is lowered and a few optimization passes have run, there might be flag register interactions between instructions quite far away from one another. In the following case : f0 = and r0, r1 ... fs_interpolate r2, r3 ... if f0 ... endif If we lower fs_interpolate while using the f0 register, we completely garble the value meant for the if block. To fix this, emit the predication for fs_interpolate in brw_fs_nir.cpp when doing the NIR translation to the backend IR. This will guarantee that the flag register interactions are visible to the optimization passes, avoiding the problem above. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `68027bd38e` ("intel/fs: implement dynamic interpolation mode for dynamic persample shaders") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9757 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26306>	2023-11-28 13:40:07 +00:00
Caio Oliveira	e8220b9319	intel/compiler: Simplify allocation of NIR related arrays Those are not reused, so this will be the first and only allocation, so no need to use the "realloc" variants. For the fs_reg arrays, there's currently no particular reason to keep them uninitialized, so zero-initialize them too -- not ideal but better than random values. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26302>	2023-11-21 18:31:05 +00:00
Lionel Landwerlin	295734bf88	intel/fs: fix residency handling on Xe2 We're missing a few reg_unit() scaling when dealing with residency data. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26208>	2023-11-15 20:06:12 +00:00
Caio Oliveira	40416850f1	intel/compiler: Re-enable opt_zero_samples() in many cases for Gfx12.5 The workaround applies specifically to Cube and Cube Arrays, so we can still apply the optimization for the others. Ideally we would like to pull opt_zero_samples logic into the lowering sends -- to avoid adding a bit to communicate between passes. However the texture coordinates for the LOGICAL backend instructions, which are a common target for the optimization, are combined into offsets over a single VGRF, so we can't easily identify the constant cases. The copy-prop pass makes this more visible for opt_zero_samples. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25742>	2023-11-09 03:56:28 +00:00
Caio Oliveira	e017bcae59	intel/compiler: Clarify the asserts in nir_load_workgroup_id lowering For Task/Mesh WorkgroupID is now lowered to WorkgroupIndex by the generic NIR pass, so we shouldn't hit this. We can now simplify the asserting code in emit_work_group_id_setup(). Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25977>	2023-11-08 17:18:36 -08:00
Lionel Landwerlin	a25f96c00c	intel/fs: switch from SIMD 1 to 8 instructions surface/sampler rematerialization SIMD1 instructions are problematic because they are considered partial writes. This increases the liveness of the destination register written by those instructions. To work around this we use UNDEF instructions to bound the liveness of the register. But this causes other issues, as in this case : undef(1) vgrf2 mov(1) vgrf2, u4.0 add(1) vgrf3, vgrf2.0, 64UD In this case the copy propagation pass is unable to see that vgrf2 in the add() instruction can be replaced with the uniform u4.0. To fix this problem, we switch to NoMask SIMD8 instructions that cover the entire register. We can drop the UNDEF instructions and now copy propagation can do its job. Good results on 2 apps : Cyberpunk 2077 : Totals from 7258 (68.80% of 10549) affected shaders: Instrs: 6332210 -> 6073833 (-4.08%); split: -4.11%, +0.03% Cycles: 130667501 -> 127351268 (-2.54%); split: -3.12%, +0.58% Subgroup size: 90320 -> 90400 (+0.09%) Spill count: 90 -> 68 (-24.44%) Fill count: 82 -> 64 (-21.95%) Scratch Memory Size: 8192 -> 6144 (-25.00%) Max live registers: 385464 -> 375152 (-2.68%) Max dispatch width: 64336 -> 64424 (+0.14%); split: +0.96%, -0.82% Gaining 60 SIMD16/SIMD32 shaders, losing 33 Strange Brigade : Totals from 2137 (53.12% of 4023) affected shaders: Instrs: 1544031 -> 1457544 (-5.60%); split: -5.60%, +0.00% Cycles: 22292564 -> 21868978 (-1.90%); split: -2.43%, +0.53% Subgroup size: 25328 -> 25344 (+0.06%) Max live registers: 113716 -> 111214 (-2.20%) Max dispatch width: 17232 -> 18608 (+7.99%); split: +8.36%, -0.37% Gaining 138 SIMD16/SIMD32 shaders, losing 4 One app slightly negatively affected : Dota2 : Totals from 232 (14.73% of 1575) affected shaders: Instrs: 30029 -> 28194 (-6.11%) Cycles: 385155 -> 371422 (-3.57%); split: -3.59%, +0.02% Max live registers: 6792 -> 6780 (-0.18%) Max dispatch width: 2256 -> 2160 (-4.26%) Losing 6 SIMD32 shaders Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24554>	2023-09-29 10:46:47 +00:00
Lionel Landwerlin	d28f42f85d	intel/fs: handle add3 in surface/sampler rematerialization Some recent NIR changes started generating those instructions. We need to handle them to be able to rematerialize. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24554>	2023-09-29 10:46:47 +00:00
Lionel Landwerlin	05fd418e8b	intel/fs: handle ishl in surface/sampler rematerialization Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24554>	2023-09-29 10:46:47 +00:00
Francisco Jerez	7f3dc4505d	intel/fs: Delete manual 'inst->mlen' calculations from all uses of logical URB reads. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25195>	2023-09-27 23:57:25 +00:00
Francisco Jerez	53d1d793cb	intel/fs: Delete manual 'inst->mlen' calculations from all uses of logical URB writes. Rework: * Marcin: update emit_urb_indirect_vec4_write Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25195>	2023-09-27 23:57:25 +00:00
Francisco Jerez	34a2c9ce35	intel/fs: Specify number of data components of logical URB writes via control immediate. This is what most logical SEND messages do when they take a variable number of components. 'inst->mlen' is expected to be zero for logical SEND opcodes, which are expected to behave like plain arithmetic operations, so certain automated transformations (like SIMD lowering) can manipulate them without opcode-specific special-casing. Guessing the number of components from 'inst->mlen' has other disadvantages, because it requires duplicating the logic that infers the message payload size in every use of the instruction -- Instead we can just do the computation once during logical send lowering. In addition on LNL platform this causes the 'inst->mlen' field of URB writes to have units inconsistent with every other SEND instruction, which is likely to lead to confusion and bugs down the road. Rework: * Marcin: update emit_urb_indirect_vec4_write Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25195>	2023-09-27 23:57:25 +00:00
Caio Oliveira	c89597085a	intel/compiler/xe2: Update TCS ICP handle code to support SIMD16 Rework: * Use ffs(grf_size_bytes) (s-b Ken) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25195>	2023-09-27 23:57:25 +00:00


574 commits