fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-17 16:08:06 +02:00

Author	SHA1	Message	Date
Faith Ekstrand	e05cb967e7	nir: Add nir_foreach_block_in_cf_node_safe() iterators Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29591>	2024-06-13 20:43:46 +00:00
Faith Ekstrand	b107240474	nir: Add some new _nv intrinsics The ldc_nv and ldcx_nv intrinsics correspond to the index and bindless forms of NVIDIA's LDC instruction, respectively. ldc_nv is pretty much load_ubo without some of the unnecessary constant bits while ldcx_nv takes a 64-bit bindless handle instead of an index. The other two give us a little control over register allocation at the NIR level to ensure that LDCX handles are placed in uniform registers. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29591>	2024-06-13 20:43:45 +00:00
Faith Ekstrand	290cbf413c	nir/print: Improve divergence information Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29591>	2024-06-13 20:43:44 +00:00
Timothy Arceri	4c3d1a09de	nir: add additional opt_loop_merge() test of deref handling Here we test the rematerialization of the deref produces valid nir when both the deref and array index value are moved to the else branch of the first terminator during the merge. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29686>	2024-06-13 15:00:35 +00:00
Timothy Arceri	abb51f449d	nir: test opt_loop_merge_terminators() skips unhandled loops This test makes sure the merge if pass skips loops with trainling phis as those are not handled by the pass. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29686>	2024-06-13 15:00:35 +00:00
Timothy Arceri	b26ef8f153	nir: correctly track current loop in nir_opt_loop() We were not restoring an outer loop as the current loop after we had finished processing a nested loop. Fixes: `9995f336e6` ("nir: add merge loop terminators optimisation") Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29686>	2024-06-13 15:00:35 +00:00
Timothy Arceri	3d2a821198	nir: add test for opt_loop_merge_terminators Makes sure we correctly rematerialize derefs moved during the merge. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29686>	2024-06-13 15:00:35 +00:00
Rhys Perry	92af96e0b3	nir/opt_loop: fix formatting Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29686>	2024-06-13 15:00:35 +00:00
Rhys Perry	cb51a93c1e	nir/opt_loop: rematerialize derefs instead of creating phis Fixes NIR validation of hogwarts_legacy/00ac08423ad6e422. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Fixes: `9995f336e6` ("nir: add merge loop terminators optimisation") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29686>	2024-06-13 15:00:35 +00:00
Alyssa Rosenzweig	f1144aa56f	nir/builtin_builder: factor out nir_build_texture_query useful for other queries too. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29614>	2024-06-11 13:10:22 +00:00
Timothy Arceri	9995f336e6	nir: add merge loop terminators optimisation Merge two consecutive basic terminators. Acked-by: Pavel Ondračka <pavel.ondracka@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28998>	2024-06-11 01:42:23 +00:00
Timothy Arceri	e25da8d8d7	nir: support more loop unrolling for logical operators Here we support finding loop count when the termination condition is a logical or. Acked-by: Pavel Ondračka <pavel.ondracka@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28998>	2024-06-11 01:42:23 +00:00
Timothy Arceri	987cf4b47d	nir: more aggressively remove in loop during partial unroll Acked-by: Pavel Ondračka <pavel.ondracka@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28998>	2024-06-11 01:42:23 +00:00
Timothy Arceri	9702570994	nir: clarify and update loop conditional instruction This value is intended to be used to remove out of bounds array access when unrolling loops so it should contain the comparison that contains the the induction variable not the overall condition of the loop terminator. So here we update the instruction when dealing with iand/ior loop terminator conditions. Acked-by: Pavel Ondračka <pavel.ondracka@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28998>	2024-06-11 01:42:23 +00:00
Alyssa Rosenzweig	31127d7b02	nir/lower_wpos_center: clean up Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29585>	2024-06-10 16:59:38 +00:00
Emma Anholt	3beae0f98e	nir,panfrost,agx: Fix driver PIXEL_COORD_INTEGER setting and drop workaround. nir_lower_frag_coord_to_pixel_coord was adding .5 to work around that the drivers were mistakenly setting PIXEL_COORD_HALF_INTEGER. With the setting corrected, the GL frontend handles it appropriately (instead of subtracting half in the frontend for ARB_fragment_coord_conventions integer setting and then adding the half back here), and makes the pass reusable from Intel. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29585>	2024-06-10 16:59:38 +00:00
Alyssa Rosenzweig	5f72234745	asahi: split param structs for GS internal kernel this simplifies state management consdierably Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29607>	2024-06-07 16:57:03 +00:00
Georg Lehmann	75b1fa9263	nir/opt_algebraic: alternative 8bit pack_[us]norm_4x8 lowering Foz-DB Navi21: Totals from 42 (0.05% of 79395) affected shaders: Instrs: 2709529 -> 2705848 (-0.14%) CodeSize: 14720732 -> 14711384 (-0.06%); split: -0.06%, +0.00% VGPRs: 4096 -> 4104 (+0.20%) Latency: 17907612 -> 17904468 (-0.02%); split: -0.02%, +0.00% InvThroughput: 4723551 -> 4722649 (-0.02%); split: -0.02%, +0.00% Copies: 223516 -> 219819 (-1.65%) Branches: 109578 -> 109594 (+0.01%); split: -0.00%, +0.02% VALU: 1730848 -> 1727151 (-0.21%) Tested-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28882>	2024-06-04 17:00:29 +00:00
Georg Lehmann	f66883a875	nir: lower pack_uvec4_to_uint to pack_32_4x8 if supported Foz-DB Navi31: Totals from 42 (0.05% of 79395) affected shaders: Instrs: 3326544 -> 3324640 (-0.06%) CodeSize: 16908376 -> 16896212 (-0.07%); split: -0.07%, +0.00% VGPRs: 4284 -> 4296 (+0.28%) Latency: 17862544 -> 17855438 (-0.04%); split: -0.05%, +0.01% InvThroughput: 3535291 -> 3533993 (-0.04%); split: -0.04%, +0.00% VClause: 95270 -> 95275 (+0.01%); split: -0.01%, +0.01% SClause: 65402 -> 65397 (-0.01%) Copies: 229723 -> 234124 (+1.92%) Branches: 109481 -> 109518 (+0.03%); split: -0.00%, +0.04% PreVGPRs: 3879 -> 3909 (+0.77%) VALU: 1789208 -> 1787370 (-0.10%); split: -0.10%, +0.00% SALU: 409136 -> 409129 (-0.00%); split: -0.00%, +0.00% Tested-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28882>	2024-06-04 17:00:29 +00:00
Faith Ekstrand	7e6cd395c7	nir: Handle cmat types in lower_variable_initializers Fixes: `b98f87612b` ("spirv: Implement SPV_KHR_cooperative_matrix") Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29509>	2024-06-04 16:34:48 +00:00
Georg Lehmann	18a0ff137f	nir: sink/move inverse_ballot like moves It's just a copy for the backends that don't lower it. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29502>	2024-06-04 15:40:57 +00:00
Georg Lehmann	690f880d18	nir/opt_uniform_atomics: handle inverse_ballot when detecting single lane ifs Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29502>	2024-06-04 15:40:57 +00:00
Ian Romanick	7b7e5cf5d4	nir/algebraic: intel/fs: Optimize some patterns before lowering 64-bit integers v2: Add some comments explaining some of the nuance of the shift optimizations. Fix a bug in the shift count calculation of the upper 32-bits. Move the @64 from the variable to the opcode. All suggested by Jordan. No shader-db changes on any Intel platform. fossil-db: Meteor Lake and DG2 had similar results. (Meteor Lake shown) Totals: Instrs: 154507026 -> 154506576 (-0.00%) Cycle count: 17436298868 -> 17436295016 (-0.00%) Max live registers: 32635309 -> 32635297 (-0.00%) Totals from 42 (0.01% of 632575) affected shaders: Instrs: 5616 -> 5166 (-8.01%) Cycle count: 133680 -> 129828 (-2.88%) Max live registers: 1158 -> 1146 (-1.04%) No fossil-db changes on any other Intel platform. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29148>	2024-05-31 09:13:23 -07:00
Ian Romanick	4834df82e2	nir/algebraic: More patterns to generate iadd3 I noticed some shaders with patterns similar to these while working on cooperative matrix lowering. Meteor Lake and DG2 are the only platforms that support iadd3, so there were no shader-db or fossil-db changes on any other platforms. shader-db: Meteor Lake and DG2 had similar results. (Meteor Lake shown) total instructions in shared programs: 19869445 -> 19868343 (<.01%) instructions in affected programs: 419426 -> 418324 (-0.26%) helped: 913 / HURT: 2 total cycles in shared programs: 936010029 -> 935909811 (-0.01%) cycles in affected programs: 31746523 -> 31646305 (-0.32%) helped: 495 / HURT: 356 LOST: 10 GAINED: 12 fossil-db: Meteor Lake and DG2 had similar results. (Meteor Lake shown) Totals: Instrs: 154514596 -> 154505466 (-0.01%); split: -0.01%, +0.00% Cycle count: 17540226067 -> 17436266198 (-0.59%); split: -0.63%, +0.04% Spill count: 146887 -> 146886 (-0.00%) Fill count: 272499 -> 272489 (-0.00%); split: -0.01%, +0.00% Max live registers: 32634290 -> 32634739 (+0.00%); split: -0.00%, +0.00% Max dispatch width: 5550128 -> 5550368 (+0.00%) Totals from 4401 (0.70% of 632560) affected shaders: Instrs: `3095239` -> 3086109 (-0.29%); split: -0.30%, +0.00% Cycle count: 7327352564 -> 7223392695 (-1.42%); split: -1.51%, +0.10% Spill count: 28105 -> 28104 (-0.00%) Fill count: 45830 -> 45820 (-0.02%); split: -0.04%, +0.02% Max live registers: 264376 -> 264825 (+0.17%); split: -0.05%, +0.22% Max dispatch width: 43768 -> 44008 (+0.55%) Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29148>	2024-05-31 09:13:23 -07:00
Ian Romanick	f1b941aaec	nir/search: Refactor is_16_bits Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Suggested-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29148>	2024-05-31 09:13:23 -07:00
Ian Romanick	6e53be2a0a	nir/search: Fix is_16_bits for vectors Require that all elements of a vector be representable as either int16_t or uint16_t. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Fixes: `7ef45e661f` ("intel/fs: Add constant propagation for ADD3") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29148>	2024-05-31 09:13:23 -07:00
Ian Romanick	22095c60bc	nir/algebraic: Add nir_lower_int64_options::nir_lower_iadd3_64 This allows us to not generate 64-bit iadd3 on Intel but continue generating it for NVIDIA. No shader-db or fossil-db changes. v2: Add nir_lower_iadd3_64 flag so we can continue to generate 64-bit iadd3 on NVIDIA platforms. v3: s/bit_size == 64/s == 64/. This cut-and-paste bug prevented any of the optimizations from ever occuring. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29148>	2024-05-31 09:13:23 -07:00
Georg Lehmann	dcab408a6c	nir: remove unpack_half_flush_to_zero It doesn't make sense to have two sets of opcodes for this when all backends that support the flush_to_zero variant just rely on the global floating point mode anyway. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29433>	2024-05-31 09:46:35 +00:00
Timur Kristóf	0ea2bad74d	nir/lower_io: Add option to implement mediump as 32-bit. For drivers that don't lower mediump shader inputs / outputs to 16-bit, it's better to ignore the mediump flag completely, letting mediump inputs / outputs work like normal 32-bit IO. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29435>	2024-05-30 12:57:20 +00:00
Konstantin Seurer	a93f95c69c	radv/rt: Remove load_rt_dynamic_callable_stack_base_amd Reviewed-by: Friedrich Vock <friedrich.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28619>	2024-05-28 12:23:45 +00:00
Italo Nicola	62c8e58f39	nir: add {load,store}_global_etna intrinsics Acked-by: David Heidelberg <david@ixit.cz> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Signed-off-by: Italo Nicola <italonicola@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29402>	2024-05-27 17:58:51 +00:00
Natanael Copa	0274518615	nir/opt_varyings: reduce stack usage Avoid put a huge struct on stack to fix a stack overflow on musl libc. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10988 Fixes: `c66967b5cb` (nir: add nir_opt_varyings, new pass optimizing and compacting varyings) Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29375>	2024-05-24 13:15:33 +00:00
Timur Kristóf	c23c5c0a07	nir/opt_varyings: Don't promote flat inputs when moving post-dominator. Promoting flat inputs should only happen while assigning FS input slot groups. Otherwise we risk adding extra input slots, which is undesireable. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29208>	2024-05-23 13:14:46 +00:00
Timur Kristóf	9dad0ced52	nir/opt_varyings: Print FS VEC4 type when debugging relocate_slot. Useful when debugging this pass. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29208>	2024-05-23 13:14:46 +00:00
Timur Kristóf	c1d38b0b37	nir: Add nir_opt_load_store_update_alignments. New pass that shares code with nir_opt_load_store_vectorize but it only updates the alignment of load/store instructions. It is useful before running other passes which may potentially destroy that information (eg. by removing some instructions from which the alignment may be deduced). Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29210>	2024-05-21 16:06:23 +00:00
Alyssa Rosenzweig	0b582449f0	nir/lower_point_size: support lowered i/o Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29248>	2024-05-21 15:30:10 +00:00
Sil Vilerino	d8eb9fc9b4	nir: Mark variable as ASSERTED to fix unused variable warning treated as error Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29290>	2024-05-20 14:45:56 +00:00
Mike Blumenkrantz	ffe54ca293	nir/linking: fix nir_assign_io_var_locations for scalarized dual blend this would previously assign all scalar variables to the highest driver location cc: mesa-stable Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28753>	2024-05-18 13:50:27 +00:00
Marek Olšák	b4bd380704	nir/algebraic: eliminate pack+unpack and unpack+pack pairs A new NIR shader for AMD drivers will need this. Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29233>	2024-05-17 22:04:00 +00:00
Alyssa Rosenzweig	9a8cb81f61	nir/tex_instr_result_size: handle subpass_ms I hit this and don't see any reason it shouldn't work Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29249>	2024-05-16 18:09:39 -04:00
Karol Herbst	564e569072	nir/lower_cl_images: set binding also for samplers Fixes https://github.com/darktable-org/darktable/issues/16717 on radeonsi. Fixes: `31ed24cec7` ("nir/lower_images: extract from clover") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29230>	2024-05-16 16:39:42 +00:00
Francisco Jerez	15a10786e3	nir: Add option to lower 64-bit uadd_sat. C.f. `16be909936`. Intel Xe2 won't support saturation for 64-bit integer addition, regardless of signedness. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28283>	2024-05-15 17:16:51 +00:00
Lionel Landwerlin	ecbec25e84	intel/nir: add reloc delta to load_reloc_const_intel intrinsic We'll use the delta for an upcoming internal printf mechanism, where the PARAM_IDX will be the base printf reloc identifier and the BASE will be the string id. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25814>	2024-05-15 13:13:38 +00:00
Lionel Landwerlin	c16e58eabd	nir: add a low level printf emission helper Uses the same memory layout as the print intrinsic lowering. This one just let's you do the emission without having to deal with variables. This useful for debug traces. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25814>	2024-05-15 13:13:37 +00:00
Lionel Landwerlin	c518a176f5	nir: add ptr_bit_size parameter to nir_lower_printf Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25814>	2024-05-15 13:13:37 +00:00
Lionel Landwerlin	2be28ee58a	nir: add a base offset for printf indexing This will allow a driver to use a single table of printf strings across all shaders. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25814>	2024-05-15 13:13:37 +00:00
Lionel Landwerlin	8d336f069e	nir/divergence: add missing load_printf_buffer_address Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25814>	2024-05-15 13:13:37 +00:00
Alyssa Rosenzweig	eb5f82d221	nir,agx: fix load_active_subgroup_index It can't be reordered globally, since its value is control-flow dependent. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29179>	2024-05-14 04:57:25 +00:00
Alyssa Rosenzweig	7fb60c4c81	nir,agx: add depth=never workaround There seems to be a hardware issue where fragment shaders with side effects get skipped if depth testing with NEVER. Add a workaround for this case where we discard programmatically instead. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29179>	2024-05-14 04:57:25 +00:00
Alyssa Rosenzweig	9d824bd123	nir: add quad_ballot_agx intrinsic to lower quad votes in nir. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29179>	2024-05-14 04:57:24 +00:00

1 2 3 4 5 ...

5375 commits