fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-18 20:18:06 +02:00

Author	SHA1	Message	Date
Jesse Natalie	f56bb3ec4b	nir_lower_mem_access_bit_sizes: Fix write-mask-constrained 3-byte stores as atomics The code here handled stores of actual 3-byte values (8-bit, 3-component), but didn't correctly handle stores of larger 8-bit vectors that were constrained by write mask to just 3 bytes. In that case, the pad-to-vec4 step was unnecessary and problematic. Seen in CL CTS test_basic vector_swizzle test group for char3 with CLOn12. Fixes: `c70d94a8` ("nir_lower_mem_access_bit_sizes: Support unaligned stores via a pair of atomics") Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26034> (cherry picked from commit `cd0cff951a`)	2023-11-15 21:21:24 +00:00
Sviatoslav Peleshko	b419916e7f	nir/loop_analyze: Fix inverted condition handling in iterations calculation In the tagged commit, we stopped actually inverting the condition, and instead relied on the "invert_cond" flag. But we missed a few places where this flag should've been handled too. Also, add a few more tests to make sure this won't regress in the future. Fixes: `99a7a664` ("nir/loop_analyze: Change invert_cond instead of changing the condition") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10012 Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26024> (cherry picked from commit `aa33ca0a52`)	2023-11-07 13:30:20 +00:00
Samuel Pitoiset	cf3bd8bedc	nir: fix inserting the break instruction for partial loop unrolling If the break in the original loop isn't in the first top-level if, this would have re-inserted it in the wrong block. Fixes this by re-inserting the break block to the corresponding break block in the new loop by using the remap hashtable. fossils-db (NAVI21): Totals from 88 (0.11% of 79330) affected shaders: Instrs: 109602 -> 109929 (+0.30%); split: -0.10%, +0.40% CodeSize: 570968 -> 573332 (+0.41%); split: -0.08%, +0.49% Latency: 1682510 -> 1682505 (-0.00%); split: -0.01%, +0.01% Copies: 12832 -> 12746 (-0.67%); split: -1.54%, +0.87% Branches: 2879 -> 2930 (+1.77%) Deathloop and F1 2023 are affected but I'm not aware of any issues for these two games. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10001 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26009> (cherry picked from commit `abfd208cb0`)	2023-11-07 13:28:05 +00:00
antonino	28e36118af	nir: don't take the derivative of the array index in `nir_lower_tex` Previosuly when lowering to txd for sampler array the index would be derived as well, therefore the resulting derivative would have been a vec with one more component than what the txd instruction expects. This patch truncates the coordinate vector in this case to make sure the index is not derived. Fixes: `b154a4154b` ("nir/lower_tex: rewrite tex/txb -> txd/txl before saturating srcs") Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26012> (cherry picked from commit `4a627af0e3`)	2023-11-04 14:16:02 +00:00
Faith Ekstrand	2cfc7776bd	nir: Handle wildcards with casts in copy_prop_vars If we're propagating a copy from a cast where the copy copies an entire array, we end up with something like &((S )ssa_N)->f[] in the source where a wildcard has a cast in its parent chain. If we then try to propagate the read into a non-wildcard array load, we have to specialize the wildcard. This breaks because nir_build_deref_follower() doesn't handle casts. Since we know a priori that, because wildcards are only generated by copy_deref on arrays, we cannot have a cast with a wildcard parent so simply chasing the source deref to the first wildcard will ensure that any casts in the deref are handled properly. Fixes: `ba2bd20f87` ("nir: Rework opt_copy_prop_vars to use deref instructions") Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22580> (cherry picked from commit `15ab4d397f`)	2023-11-04 14:15:56 +00:00
Faith Ekstrand	8081cb909b	nir: add deref follower builder for casts. This fixes intel_clc builds with llvm 17 on gfx125_bvh_build_DFS_DFS where it dies in the lower indirect derefs pass. Co-authored-by: Dave Airlie <airlied@redhat.com> Fixes: `4a4e175738` ("nir: Support deref instructions in lower_var_copies") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25536> (cherry picked from commit `6388896985`)	2023-11-04 14:12:29 +00:00
Faith Ekstrand	8a7498e13f	nir/lower_bit_size: Fix subgroup lowering for floats Using u2u is always correct for integers, including signed integers, because we're doing a down-cast. It's wrong for floats, though. Fixes: `f95665cfeb` ("nir/lower_bit_size: Add support for lowering subgroup ops") Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25894> (cherry picked from commit `5979e74177`)	2023-10-30 15:47:22 +00:00
Ian Romanick	c23ba4e83a	nir/split_vars: Don't split arrays of cooperative matrix types glsl_type_is_vector_or_scalar would more accruately be called "can be an r-value that isn't an array, structure, or matrix. This optimization pass really shouldn't do anything to cooperative matrices. These matrices will eventually be lowered to something else (dependent on the backend), and that thing may (or may not) be handled by this or another pass. Fixes: `2d0f4f2c17` ("compiler/types: Add support for Cooperative Matrix types") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25871> (cherry picked from commit `18d8a96a00`)	2023-10-30 15:47:12 +00:00
Rhys Perry	1afd0878e9	nir/lower_shader_calls: skip zero-sized qsort Fixes UBSan: src/compiler/nir/nir_lower_shader_calls.c:1681:7: runtime error: null pointer passed as argument 1, which is declared to never be null Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25853>	2023-10-25 17:27:47 +00:00
Rhys Perry	f9289dfd02	nir/serialize: fix signed integer overflow Fixes UBSan error: src/compiler/nir/nir_serialize.c:1277:70: runtime error: left shift of 524287 by 13 places cannot be represented in type 'int' Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25853>	2023-10-25 17:27:47 +00:00
Alyssa Rosenzweig	9a6c20e64f	nir/trivialize_registers: Handle obscure load hazard Somebody less tired than me would add a unit test for this. Offending snippet: 32 %58 = @load_reg (%55) (base=0, legacy_fabs=0, legacy_fneg=0) 32 %57 = @load_reg (%55) (base=0, legacy_fabs=0, legacy_fneg=0) 32 %21 = iadd %57, %15 (0x1) @store_reg (%21, %55) (base=0, wrmask=x, legacy_fsat=0) 32 %56 = @load_reg (%55) (base=0, legacy_fabs=0, legacy_fneg=0) 32 %22 = i2f32 %56 32 %23 = load_const (0x41000000 = 8.000000) 32 %24 = fdiv %22, %23 (8.000000) 32 %90 = mov %24 @store_reg_indirect (%90, %78, %58) (base=0, wrmask=x, legacy_fsat=0) Closes: #10031 Fixes: `d313eba94e` ("nir: Add pass for trivializing register access") Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reported-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25865>	2023-10-25 16:34:47 +00:00
Caio Oliveira	67450674c0	compiler/types: Move comments and reorganize declarations Move comments from C++ member functions to the C functions. In some cases just delete comments or consolidate them together. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:12 +00:00
Caio Oliveira	dfcca13800	compiler/types: Remove warnings about potential fallthrough None of those cases are expected to fallthrough, but should be unreachable. Just break them so they get to the unreachable entry at the end. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:12 +00:00
Caio Oliveira	9e32cc3d0b	compiler/types: Rename glsl_types.cpp to glsl_types.c Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:12 +00:00
Caio Oliveira	c45276c665	compiler/types: Annotate extern "C" only once in glsl_types.cpp Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:12 +00:00
Caio Oliveira	cecdc686e4	compiler/types: Remove usages of C++ members in glsl_types.cpp Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:12 +00:00
Caio Oliveira	94bdf351dc	compiler/types: Use C instead of C++ constants for builtin types Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:12 +00:00
Caio Oliveira	07ee4bd69f	compiler/types: Add remaining type extraction functions and use them in C++ Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:12 +00:00
Caio Oliveira	ada6183d60	compiler/types: Add glsl_simple_explicit_type() and simplify glsl_simple_type() Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:12 +00:00
Caio Oliveira	e17adf51db	compiler/types: Implement glsl_type::field_type() in terms of existing functions Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:12 +00:00
Caio Oliveira	9e514b89a0	compiler/types: Add glsl_get_explicit_*() functions and use them in C++ Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:12 +00:00
Caio Oliveira	d2a804a25b	compiler/types: Add glsl_get_std430_array_stride() and use it in C++ Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:12 +00:00
Caio Oliveira	7b42fe62a1	compiler/types: Add glsl_type_uniform_locations() and use it in C++ Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:12 +00:00
Caio Oliveira	e98ba3b53f	compiler/types: Add glsl_type_compare_no_precision() and use it in C++ Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:12 +00:00
Caio Oliveira	3ce4d5e033	compiler/types: Add glsl_get_mul_type() and use it in C++ Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:12 +00:00
Caio Oliveira	795bf4244c	compiler/types: Add more glsl_contains_*() functions and use them in C++ Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:12 +00:00
Caio Oliveira	68f80e6fc1	compiler/types: Move remaining code from nir_types to glsl_types Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:12 +00:00
Caio Oliveira	b2407d7859	compiler/types: Flip wrapping of numeric type conversion functions Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:12 +00:00
Caio Oliveira	8bebd40d5c	compiler/types: Flip wrapping of remaining small data getters Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:12 +00:00
Caio Oliveira	922fe24739	compiler/types: Flip wrapping of remaining non-trivial type getters Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:12 +00:00
Caio Oliveira	a5e6e5b6d3	compiler/types: Flip wrapping of get row/column type helpers Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:12 +00:00
Caio Oliveira	beaac525e8	compiler/types: Flip wrapping of various get instance functions Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:12 +00:00
Caio Oliveira	5c91cf9288	compiler/types: Flip wrapping of texture/sampler/image get instance functions Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:12 +00:00
Caio Oliveira	87b3812f10	compiler/types: Flip wrapping of get_instance() Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:12 +00:00
Caio Oliveira	2117158619	compiler/types: Flip wrapping of record_compare Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:12 +00:00
Caio Oliveira	e486384540	compiler/types: Flip wrapping of layout related functions Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:12 +00:00
Caio Oliveira	a4cfeea850	compiler/types: Flip wrapping of interface related functions Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:12 +00:00
Caio Oliveira	ad092fcab5	compiler/types: Flip wrapping of struct related functions Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:12 +00:00
Caio Oliveira	421a04f5ba	compiler/types: Flip wrapping of size related functions Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:12 +00:00
Caio Oliveira	418e3be14c	compiler/types: Flip wrapping of CL related functions Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:12 +00:00
Caio Oliveira	d78110d356	compiler/types: Flip wrapping of cmat related functions Also add a missing `struct` related to cmat. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:11 +00:00
Caio Oliveira	67210f90ad	compiler/types: Flip wrapping of array related functions Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:11 +00:00
Caio Oliveira	3bf500af7b	compiler/types: Flip wrapping of "type contains?" predicate functions Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25129>	2023-10-25 01:51:11 +00:00
Mary Guillemard	5308378a35	nir: Add NVIDIA-specific geometry shader opcodes Signed-off-by: Mary Guillemard <mary.guillemard@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25000>	2023-10-24 22:21:18 +00:00
Faith Ekstrand	1fa7c37a36	nir: Add NVIDIA-specific I/O intrinsics NVIDIA hardware doesn't take a vertex index for per-vertex I/O. Instead, it takes an offset into the primitive. This has to be fetched using a combination of SR_INVOCATION_INFO and the ISBERD instruction. To keep things simple and allow for maximum CSE, we do the lowering in NIR and patch the load/store_per_vertex_input/output intrinsic. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25000>	2023-10-24 22:21:18 +00:00
Faith Ekstrand	8188842fdc	nir: Add a range to most I/O intrinsics Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25000>	2023-10-24 22:21:18 +00:00
Faith Ekstrand	a2b799c53c	nir: Add an load_barycentric_at_offset_nv intrinsic NVIDIA hardware takes the offset as two 4.12 fixed-point values packed into a single 32-bit value. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25000>	2023-10-24 22:21:18 +00:00
Faith Ekstrand	1a2e8290ab	nir: Add NV-specific texture opcodes These are for implementing various texture queries. Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25000>	2023-10-24 22:21:18 +00:00
Faith Ekstrand	5984265d45	nir: Add a load_sysval_nv intrinsic Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25000>	2023-10-24 22:21:18 +00:00
Faith Ekstrand	abf3175161	nir/lower_tex: Add a lower_txd_clamp option Some of us want to lower all TXD with min_lod regardless of whether or not it's shadow or cube or whatever. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25000>	2023-10-24 22:21:17 +00:00

1 2 3 4 5 ...

8745 commits