fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-21 05:00:09 +01:00

Author	SHA1	Message	Date
Jordan Justen	820e04ead4	intel/compiler: Implement nir_intrinsic_load_topology_id_intel for xe2 Rework: * Sagar: Rework BRW_TOPOLOGY_ID_DSS, BRW_TOPOLOGY_ID_EU_THREAD_SIMD calculations Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27529>	2024-02-14 20:07:13 +00:00
Sagar Ghuge	15129c7634	intel/compiler: Use nir_tex_src_backend1 to pack LOD and array index Since this lowering is totally Intel specific, we don't have to introduce the new texture source. We can use the nir_tex_src_backend1 source to pack LOD/LOD Bias and array index into 32 bit single value. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27458>	2024-02-12 21:25:48 +00:00
Ian Romanick	84de7a88d3	intel/compiler/xe2: Emit texture instructions w/ combined LOD and array index The extra assertions are just there to help validate pack_lod_and_array_index (in nir_lower_tex.c). v2: Split got_lod_or_bias into two variables. This simplifies some changes that Sagar is working on. Suggested by Sagar. Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27305>	2024-02-02 02:39:10 +00:00
Kenneth Graunke	2e38024fd8	intel: Use hardware generated compute shader local invocation IDs Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27167>	2024-01-25 08:43:04 +00:00
Ian Romanick	c2a25cf75c	intel/fs: Fix shift counts for 8- and 16-bit types With regards to implicit masking of the shift counts for 8- and 16-bit types, the PRMs are incorrect. They falsely state that on Gen9+ only the low bits of src1 matching the size of src0 (e.g., 4-bits for W or UW src0) are used. The Bspec (backed by data from experimentation) state that 0x3f is used for Q and UQ types, and 0x1f is used for all other types. To match the behavior expected for the NIR opcodes, explicit masks for 8- and 16-bit types must be added. This fixes (the updated version, see crucible!138) of func.shader.shift.int16_t on all Intel platforms. According to Karol, this also fixes "integer_ops integer_rotate" tests in OpenCL CTS. No shader-db or fossil-db changes on any Intel platform. Tested-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23001>	2024-01-24 19:41:35 +00:00
Sagar Ghuge	6fcec87090	intel/fs: Track instance id in gs_thread_payload This change moves the instance id gs_thread_payload constructor and lowering code will simply use that. Also, this change takes the Xe2 register width in consideration that fixes a couple of tests involving geometry shaders with gl_InvocationID on Xe2. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26960>	2024-01-22 22:15:38 +00:00
Francisco Jerez	f974eacab3	intel/compiler/xe2: Fix for the removal of most predication modes. Reworks: * Remove changes to fixup_nomask workaround since it applies only for Gfx12 family. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26860>	2024-01-12 20:18:03 +00:00
Dave Airlie	56a72e014f	intel/compiler: reemit boolean resolve for inverted if on gen5 Gen5 adds some boolean conversion instructions after nir emits, but that nir srcs don't line up with them, so reemit the boolean conversion if we reemit the inot. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `31b5f5a51f` ("nir/opt_if: Simplify if's with general conditions") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26782>	2024-01-04 21:27:23 +00:00
Ian Romanick	7bfbeb79a7	anv: Set COMPUTE_WALKER systolic mode enable flag Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25994>	2023-12-29 20:28:54 -08:00
Ian Romanick	6b14da33ad	intel/fs: nir: Add nir_intrinsic_dpas_intel v2: Fix parameter order in nir_intrinsic_dpas_intel to DPAS conversion. v3: Fix float16 destination DPAS on DG2. v4: Use nir_component_mask(...) instead of 0xffff. Suggested by Caio. v5: Rebase on !26323. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25994>	2023-12-29 20:28:43 -08:00
Francisco Jerez	702eabaaae	intel/fs/xe2+: Update for new layout of vertex setup data in PS payload. The interpolation deltas of PS inputs now show up as a 12B vec3 (A0, A1-A0, A2-A0) in the ATTR file, instead of the previously used 16B format with an unused component. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26606>	2023-12-28 11:07:03 -08:00
Francisco Jerez	f295494cee	intel/fs/xe2+: Update poly info PS payload for new multi-polygon dispatch format. This includes the render target array index, viewport index, and front/back facing fields, which are now replicated per pair of subspans in order to support fixed-layout multi-polygon PS dispatch. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26606>	2023-12-28 11:07:03 -08:00
Francisco Jerez	4cc9c37bba	intel/fs/xe2+: Update location of sample ID fields in PS payload. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26606>	2023-12-28 11:07:03 -08:00
Francisco Jerez	a0ae3c0dba	intel/fs/xe2+: Update uses of pixel/sample mask from PS thread payload. Note from Caio: proper handling of brw_sample_mask_reg will appear in later patches. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26606>	2023-12-28 11:07:03 -08:00
Francisco Jerez	28aec45eed	intel/fs/gfx12: Implement multi-polygon format of render target array index in PS payload. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:31 +00:00
Francisco Jerez	5b1ab77423	intel/fs/gfx12: Implement multi-polygon format of back/front-facing flag in PS payload. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:31 +00:00
Francisco Jerez	83a0252e8d	intel/fs: Pass builder to per_primitive_reg(). Matches prototype of interp_reg(), will be useful in a subsequent commit. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:30 +00:00
Francisco Jerez	8e9f09dbe5	intel/fs: Provide component index explicitly to interp_reg(). Main motivation is that for multipolygon PS shaders the i-th plane parameter for the j-th input attribute will no longer necessarily be a scalar, since different channels may be processing different polygons with different input plane parameters, so simply taking a component() of the result of interp_reg() will no longer work. Instead of duplicating the multipolygon handling logic in every caller of interp_reg(), fold the component() call into interp_reg() so we can replace it with multipolygon-correct code more easily. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:30 +00:00
Francisco Jerez	b26cf8b189	intel/fs: Map all TES input attributes to ATTR register number 0. Instead of treating fs_reg::nr as an offset for ATTR registers simply consider different indices as denoting disjoint spaces that can never be accessed simultaneously by a single region. From now on geometry stages will just use ATTR #0 for everything and select specific attributes via offset() with the native dispatch width of the program, which should work on current platforms as well as on Xe2+. See "intel/fs: Map all GS input attributes to ATTR register number 0." for the rationale. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:30 +00:00
Francisco Jerez	ef12565a37	intel/fs: Map all VS input attributes to ATTR register number 0. Instead of treating fs_reg::nr as an offset for ATTR registers simply consider different indices as denoting disjoint spaces that can never be accessed simultaneously by a single region. From now on geometry stages will just use ATTR #0 for everything and select specific attributes via offset() with the native dispatch width of the program, which should work on current platforms as well as on Xe2+. See "intel/fs: Map all GS input attributes to ATTR register number 0." for the rationale. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:30 +00:00
Francisco Jerez	1d22721b5a	intel/fs: Map all GS input attributes to ATTR register number 0. The fs_reg::nr field currently has a somewhat inconsistent meaning for ATTR registers depending on the shader stage. In geometry stages it has a similar effect as fs_reg::offset except it's expressed in 32B units instead of B units. In the PS however it's expressed in units of logical scalar attributes (16B on present platforms), which isn't currently handled correctly throughout the back-end since some places assume 32B units in all cases. The different format of the PS setup data in multi-polygon dispatch modes would make its behavior even more irregular, which would be worsened further (for both geometry and pixel stages) by the register size changes coming up on Xe2, particularly in brw_ir_fs.h helpers where neither the devinfo struct nor the shader stage are available. Instead of treating it as an offset simply consider different fs_reg::nr indices as denoting disjoint spaces that can never be accessed simultaneously by a single region. From now on geometry stages will just use ATTR #0 for everything and select specific attributes via offset() with the native dispatch width of the program, which should work on current platforms as well as on Xe2+. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:30 +00:00
Caio Oliveira	bfc953add7	intel/compiler: Use C helpers to access builtin types Remove usage of C++ static members as they are going to be removed. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26658>	2023-12-15 03:09:19 +00:00
Sagar Ghuge	a4947f7bd8	intel/fs: Adjust destination size for load ubo on Xe2+ Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26639>	2023-12-13 19:06:21 +00:00
Sagar Ghuge	e0ce94318b	intel/fs: Adjust destination size for global load constant on Xe2+ Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26639>	2023-12-13 19:06:21 +00:00
Sagar Ghuge	11fea46bdc	intel/fs: Adjust destination size for image size intrinsic Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26639>	2023-12-13 19:06:21 +00:00
Caio Oliveira	a8b2426419	intel/compiler: Use reference instead of pointer for fs_visitor Per Ian suggestion. Also clear up a few unnecessary casts around the code and use `s` for fs_visitor ("shader"). Note to include a reference in ntf we need to set it during initialization, so create an explicit mem_ctx for it. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:14 +00:00
Caio Oliveira	77ab74ccc2	intel/compiler: Use reference instead of pointer for nir_to_brw_state Per Ian suggestion. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:14 +00:00
Caio Oliveira	4e5fcccd01	intel/compiler: Create and use nir_to_brw() function Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:14 +00:00
Caio Oliveira	38a42e5aa1	intel/compiler: Add ctor to fs_builder that just takes the shader Uses the dispatch_width from the shader (fs_visitor). This was not possible before because the dispatch_width was not part of backend_shader. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:14 +00:00
Caio Oliveira	cf730adc58	intel/compiler: Make fs_builder include fs_visitor and not the other way This will allow fs_builder have a reference to an fs_visitor (a "fs_shader" really), instead of a reference to a backend_shader. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:14 +00:00
Caio Oliveira	5b8ec015f2	intel/compiler: Don't use fs_visitor::bld in remaining places The remaining users can simply create a new builder at_end() if needed. In many places a new builder object is already being constructed, so just give more specific instructions. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:14 +00:00
Caio Oliveira	c73c1aa496	intel/compiler: Annotate and use nir_to_brw_state::bld Use the "current bld" in nir_to_brw_state more widely, and also replace it with an annotated version when applicable (to associate it with a NIR instruction being lowered). After filling a block we reset it back to the original value. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:13 +00:00
Caio Oliveira	34c28680b1	intel/compiler: Stop using fs_visitor::bld field in NIR conversion Provide its own builder in nir_to_brw_state. Will allow eventually remove the one in fs_visitor. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:13 +00:00
Caio Oliveira	79735fa783	intel/compiler: Move remaining NIR conversion fields to nir_to_brw_state Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:13 +00:00
Caio Oliveira	5cb189636d	intel/compiler: Move nir_ssa_value into a local structure Create a nir_to_brw_state struct that is valid only during the NIR to backend translation and use it for nir_ssa_values array. This removes some NIR specific handling out of the fs_visitor -- nowadays effectively an fs_shader. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:13 +00:00
Caio Oliveira	afe75d65be	intel/compiler: Make NIR resources helpers static Remove get_nir_src_block() since it is not used anywhere. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:13 +00:00
Caio Oliveira	a7a27ee95e	intel/compiler: Make NIR atomic conversion functions static Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:13 +00:00
Caio Oliveira	5777943381	intel/compiler: Make non-intrinsic NIR conversion functions static Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:13 +00:00
Caio Oliveira	2385d6087a	intel/compiler: Make setup functions of NIR emission static Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:13 +00:00
Caio Oliveira	3899e6b1d8	intel/compiler: Make functions for NIR control flow conversion static Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:13 +00:00
Caio Oliveira	860ec33f9a	intel/compiler: Make more functions in NIR conversion static Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:13 +00:00
Caio Oliveira	acca9dbf6b	intel/compiler: Make a NIR intrinsic emission functions static Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:13 +00:00
Caio Oliveira	c12460b01e	intel/compiler: Move NIR emission code to brw_fs_nir.cpp This is a preparation to reorganize NIR emission code. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:13 +00:00
Caio Oliveira	1ef6415d22	intel/compiler: Remove unused headers Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:13 +00:00
Rohan Garg	db6aaa691d	intel/compiler: infer the number of operands using lsc_op_num_data_values nir_emit_global_atomic should utilize lsc_op_num_data_values to infer the number of operands for global atomic ops, following the same pattern as nir_emit_surface_atomic Fixes: `90a2137` ('intel/compiler: Use LSC opcode enum rather than legacy BRW_AOPs') Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26432>	2023-12-07 14:40:24 +00:00
Rohan Garg	46d98a71ef	intel/compiler: use the proper enum type to store the op Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26432>	2023-12-07 14:40:24 +00:00
Faith Ekstrand	e3ff5a3b0e	intel/vec4: Use MESA_PRIM_* instead of GL_* Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24821>	2023-12-05 23:12:32 +00:00
Lionel Landwerlin	83a1657b6c	intel/fs: fix incorrect register flag interaction with dynamic interpolator mode Once NIR code is lowered and a few optimization passes have run, there might be flag register interactions between instructions quite far away from one another. In the following case : f0 = and r0, r1 ... fs_interpolate r2, r3 ... if f0 ... endif If we lower fs_inteporlate while using the f0 register, we completely garble the value meant for the if block. To fix this, emit the predication for fs_interpolate in brw_fs_nir.cpp when doing the NIR translation to the backend IR. This will guarantee that the flag register interactions are visible to the optimization passes, avoiding the problem above. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `68027bd38e` ("intel/fs: implement dynamic interpolation mode for dynamic persample shaders") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9757 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26306>	2023-11-28 13:40:07 +00:00
Caio Oliveira	e8220b9319	intel/compiler: Simplify allocation of NIR related arrays Those are not reused, so this will be the first and only allocation, so no need to use the "realloc" variants. For the fs_reg arrays, there's currently no particular reason to keep them uninitialized, so zero-initialize them too -- not ideal but better than random values. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26302>	2023-11-21 18:31:05 +00:00
Lionel Landwerlin	295734bf88	intel/fs: fix residency handling on Xe2 We're missing a few reg_unit() scaling when dealing with residency data. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26208>	2023-11-15 20:06:12 +00:00


683 commits
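
The shift-count fix described in commit c2a25cf75c above can be illustrated with a small model. This is only a sketch under stated assumptions, not the code from that commit: `hw_shl16` and `nir_shl16` are hypothetical names, and the model takes from the commit message only that the hardware applies a 0x1f mask to the shift count for non-64-bit types, while the NIR opcodes expect the count to be limited to the bits matching the operand size (4 bits for a 16-bit value).

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical model of the hardware behavior described in the commit
 * message: for a 16-bit source, the shift count is implicitly masked
 * with 0x1f, not with 0xf. */
static uint16_t hw_shl16(uint16_t value, uint32_t count)
{
    return (uint16_t)((uint32_t)value << (count & 0x1f));
}

/* Hypothetical model of the NIR-expected behavior: add an explicit mask
 * so that only the low 4 bits of the count are used for a 16-bit value
 * before handing it to the "hardware" shift. */
static uint16_t nir_shl16(uint16_t value, uint32_t count)
{
    return hw_shl16(value, count & 0xf);
}

/* Self-check: a count of 17 shifts the bit out entirely under the 0x1f
 * mask, but behaves like a shift by 1 once the explicit mask is added. */
static void check_shift_masks(void)
{
    assert(hw_shl16(0x0001, 17) == 0x0000);
    assert(nir_shl16(0x0001, 17) == 0x0002);
    /* In-range counts give the same result with or without the mask. */
    assert(hw_shl16(0x0001, 3) == nir_shl16(0x0001, 3));
}
```

Calling `check_shift_masks()` exercises both models. The two only disagree for counts of 16 or more, which is the case the explicit mask is meant to cover.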