fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-19 11:28:15 +02:00

Author	SHA1	Message	Date
Daniel Schürmann	1feb733cd4	Revert "nir: add nir_clear_divergence_info, use it in nir_opt_varyings" This reverts commit `9d043e138d`. It is no longer needed. nir_convert_from_ssa() is now capable to ignore divergence information. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33009>	2025-01-23 01:31:24 +00:00
Daniel Schürmann	f3be7ce01b	nir/from_ssa: only consider divergence if requested This pass used to unconditionally use divergence information which forced the caller to either call divergence_analysis or ensure that the divergence is properly reset. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33009>	2025-01-23 01:31:23 +00:00
Marek Olšák	bdd85c8393	nir: remove handling IO variables from passes used by st/mesa Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33146>	2025-01-22 02:15:04 +00:00
Marek Olšák	02516ff0f9	nir: remove dead code due to IO being always lowered in st/mesa Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33146>	2025-01-22 02:15:04 +00:00
Marek Olšák	f29530533c	glsl: simplify nir_lower_io_to_temporaries logic no change in behavior Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33146>	2025-01-22 02:15:04 +00:00
Marek Olšák	b7c4a1479e	glsl: remove dead code due to IO being always lowered Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33146>	2025-01-22 02:15:04 +00:00
Georg Lehmann	ee5017a0fa	nir/opt_algebaric: convert fadd(a, a) to a * 2.0 On AMD, this is a clear win. 2.0 is a free constant, the multiplication can be fused into fma, or it can be done as a free output modifier. Otherwise, fmul and fadd have the same throughput/latency, with the only possible downside being potentially power consumption. For other hardware this might not be as clear, but we should at least choose one option for NIR because it allows more CSE. Foz-DB Navi21: Totals from 12231 (15.41% of 79395) affected shaders: MaxWaves: 309068 -> 309032 (-0.01%) Instrs: 11826395 -> 11790132 (-0.31%); split: -0.31%, +0.00% CodeSize: 63531496 -> 63512868 (-0.03%); split: -0.03%, +0.00% VGPRs: 551256 -> 551328 (+0.01%); split: -0.00%, +0.02% SpillSGPRs: 984 -> 979 (-0.51%) Latency: 88486492 -> 88394296 (-0.10%); split: -0.11%, +0.01% InvThroughput: 22360595 -> 22300790 (-0.27%); split: -0.27%, +0.00% VClause: 226267 -> 226273 (+0.00%); split: -0.01%, +0.01% SClause: 293820 -> 293783 (-0.01%); split: -0.02%, +0.00% Copies: 727187 -> 727106 (-0.01%); split: -0.03%, +0.02% PreSGPRs: 539623 -> 539625 (+0.00%) PreVGPRs: 440843 -> 440946 (+0.02%); split: -0.00%, +0.03% VALU: 8324962 -> 8288809 (-0.43%); split: -0.43%, +0.00% SALU: 1278550 -> 1278538 (-0.00%); split: -0.00%, +0.00% VMEM: 440600 -> 440599 (-0.00%) Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32989>	2025-01-21 20:28:04 +00:00
Marek Olšák	b65973240c	nir: add a pass that moves output stores to the end of the shader required by vc4 & vc5 to merge the rest of the lowered IO code for st/mesa Acked-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33053>	2025-01-21 19:38:54 +00:00
Mike Blumenkrantz	48d0a0322f	glsl: plumb num_views down to shader_info::view_mask this is needed for drivers to more effectively compile multiview-enabled shaders Reviewed-by: Timothy Arceri <tarceri@itqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33016>	2025-01-20 22:43:23 +00:00
Connor Abbott	2d45836c95	ir3: Plumb through ray_intersection intrinsic Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28447>	2025-01-20 01:22:23 +00:00
Connor Abbott	91f19bcbe0	ir3: Plumb through two-dimensional UAV loads There is native support for D3D-style untyped UAVs, which are an unsized array of "records." This will be needed for acceleration structures, because normal SSBO descriptors aren't large enough to cover all the 128-byte instance descriptors for the maximum number of instances (2**24). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28447>	2025-01-20 01:22:23 +00:00
Timothy Arceri	7d41cfa1a9	glsl: enable layout qualifier if OVR_multiview enabled OVR_multiview requires 1.30 but makes use of layout qualifier Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33074>	2025-01-19 01:10:54 +00:00
Konstantin Seurer	01ec2f59a4	nir/print: Do not print trailing spaces after preds/succs Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32644>	2025-01-18 11:02:25 +00:00
Konstantin Seurer	eb3ab68e5e	nir/tests: Add reference shaders Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32644>	2025-01-18 11:02:25 +00:00
Konstantin Seurer	8838a0c595	nir/tests: Add a helper for comparing a shader against a string This allows unit tests to compare against a reference nir shader instead of implementing checks for interesting instructions/CF nodes. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32644>	2025-01-18 11:02:25 +00:00
Konstantin Seurer	6d1d15183f	nir/tests: Improve shader creation Sets some fields so they are not printed and allows specifying a stage. This decreases the size of reference shaders. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32644>	2025-01-18 11:02:25 +00:00
Konstantin Seurer	305be9cf5e	nir/print: Print less unused shader info Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32644>	2025-01-18 11:02:25 +00:00
Lionel Landwerlin	2603dbd796	nir: make lower-level printf helper respect buffer size Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33067>	2025-01-17 18:09:45 +00:00
Alyssa Rosenzweig	43e79b26de	nir/lower_printf: drop static buffer addr lowering no longer used, replaced by the new pass. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33067>	2025-01-17 18:09:45 +00:00
Alyssa Rosenzweig	07ad850787	nir: add nir_lower_printf_buffer pass this is a helper for lowering the printf buffer intrinsics to constants for backend convenience. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33067>	2025-01-17 18:09:45 +00:00
Alyssa Rosenzweig	7bc9bbcc6e	nir/lower_printf: support dynamic buffer size this is required for vtn_bindgen2 where we don't know the buffer size until the driver-specific code paths, but we need to lower printf (to hash format strings) in common code. so defer the buffer size decision to an intrinsic. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33067>	2025-01-17 18:09:45 +00:00
Alyssa Rosenzweig	6db9218ec3	nir/lower_printf: add option to hash format strings Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33067>	2025-01-17 18:09:45 +00:00
Alyssa Rosenzweig	e1368f0a30	nir,util: move printf serializing into util there's nothing NIR specific here and these routines will be useful otherwise. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33067>	2025-01-17 18:09:45 +00:00
Alyssa Rosenzweig	47e16cab5e	nir/lower_printf: drop default max buffer size no uses and it doesn't make sense. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33067>	2025-01-17 18:09:45 +00:00
Alyssa Rosenzweig	621ff262bc	nir/lower_printf: drop null check we derefernce options above. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33067>	2025-01-17 18:09:45 +00:00
Qiang Yu	e5041ef036	docs,src: replace doc and comments for PIPE_CAP with pipe_caps Use command: find . -type d \( -path "./.git" -o -path "./docs/relnotes" \) -prune -o -type f -exec sed -i 's/PIPE_CAP_\([A-Za-z0-9_]\)/pipe_caps.\L\1/g' {} + find . -type d \( -path "./.git" -o -path "./docs/relnotes" \) -prune -o -type f -exec sed -i 's/PIPE_CAPF_\([A-Za-z0-9_]\)/pipe_caps.\L\1/g' {} + With manual adjustment for docs/gallium/screen.rst to merge pipe_cap and pipe_capf section. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32955>	2025-01-17 04:39:47 +00:00
Marek Olšák	ff6e3e9f76	nir: add next_stage param to nir_slot_is_varying & nir_remove_sysval_output The result of nir_slot_is_varying depends on what the next shader stage is, and nir_remove_sysval_output uses it. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32855>	2025-01-16 16:28:15 +00:00
Marek Olšák	0d961b0723	nir: add barycentric coordinates src to load_point_coord_maybe_flipped Just like other input loads, radeonsi needs to know the barycentric coordinates for it. This adds the src and determines the optimal barycentric coordinates in nir_lower_point_smooth, the only producer of the intrinsic. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33046>	2025-01-16 02:58:03 +00:00
Sil Vilerino	e061792e25	src/compiler: Fix warning C4389: An == or != operation involved signed and unsigned variables. This could result in a loss of data. Reviewed-By: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Jesse Natalie <None> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32979>	2025-01-15 21:40:20 +00:00
Sil Vilerino	8ecb7bc2a2	src/compiler: Fix warning C4244 'argument' : conversion from 'type1' to 'type2', possible loss of data Reviewed-By: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Jesse Natalie <None> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32979>	2025-01-15 21:40:20 +00:00
Timothy Arceri	6bca0cc3d9	glsl: drop opt_dead_code_local This does nothing useful anymore as we now convert to nir at compile time which will handle all this for us. shader-db radeonsi: TOTALS FROM AFFECTED SHADERS (63/168079) SGPRS: 3248.00 -> 3208.00 (-1.23 %) VGPRS: 2224.00 -> 2228.00 (0.18 %) Spilled SGPRs: 0.00 -> 0.00 (0.00 %) Spilled VGPRs: 0.00 -> 0.00 (0.00 %) Private memory VGPRs: 0.00 -> 0.00 (0.00 %) Scratch size: 0.00 -> 0.00 (0.00 %) dwords per thread Code Size: 138484.00 -> 138068.00 (-0.30 %) bytes Max Waves: 877.00 -> 877.00 (0.00 %) Outputs: 0.00 -> 0.00 (0.00 %) Patch Outputs: 0.00 -> 0.00 (0.00 %) shader-db Iris (BDW): total instructions in shared programs: 17805897 -> 17805917 (<.01%) instructions in affected programs: 1240 -> 1260 (1.61%) helped: 0 HURT: 8 HURT stats (abs) min: 1 max: 4 x̄: 2.50 x̃: 2 HURT stats (rel) min: 0.39% max: 7.14% x̄: 4.26% x̃: 4.06% 95% mean confidence interval for instructions value: 1.61 3.39 95% mean confidence interval for instructions %-change: 2.01% 6.51% Instructions are HURT. total cycles in shared programs: 856868505 -> 856876266 (<.01%) cycles in affected programs: 2879959 -> 2887720 (0.27%) helped: 79 HURT: 100 helped stats (abs) min: 1 max: 742 x̄: 61.96 x̃: 12 helped stats (rel) min: <.01% max: 41.84% x̄: 1.17% x̃: 0.20% HURT stats (abs) min: 1 max: 1231 x̄: 126.56 x̃: 14 HURT stats (rel) min: <.01% max: 33.98% x̄: 3.32% x̃: 0.30% 95% mean confidence interval for cycles value: 7.37 79.35 95% mean confidence interval for cycles %-change: 0.29% 2.38% Cycles are HURT. LOST: 1 GAINED: 4 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33008>	2025-01-15 02:01:09 +00:00
Alyssa Rosenzweig	401b400de3	nir,asahi,hk: add barrier argument to MESA_DISPATCH_PRECOMP In the current API, precomp implicitly assumes full barriers both before & after every dispatch. That's not good for performance. However, dropping the barriers and requiring user to explicitly call barrier functions before/after would have bad ergonomics. So, we add a new parameter to the standard MESA_DISPATCH_PRECOMP signature representing the barriers required around the dispatch. As usual, the actual type & semantic is left to drivers to define what makes sense for their hardware. We just reserve the place for it. (I think most drivers will want bitflags here, but I don't think the actual flags are worth. If a driver wanted to use a struct here, that would work too.) Since the asahi stack doesn't do anything clever with barriers yet, we mechnically add an AGX_BARRIER_ALL barrier to all precomp users in-tree. We can optimize that later, this just gets the flag-day change in with no functional change. For JM panfrost, this will provide a convenient place to stash both their "job barrier" bit and their "suppress prefetch" bit (which is really a sort of barrier / cache flush, if you think about it). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32980>	2025-01-14 16:39:57 +00:00
Kenneth Graunke	2f334e8baf	nir: Add a nir_def_first_component_read() helper Similar to nir_def_last_component_read(). Just a little nicer than prodding at the bitmask of components read directly. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32888>	2025-01-10 22:44:09 +00:00
Mike Blumenkrantz	010732b8ef	glsl: enable OVR_multiview if OVR_multiview2 is enabled according to spec Fixes: `328c29d600` ("mesa,glsl,gallium: add GL_OVR_multiview") Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32946>	2025-01-10 19:10:48 +00:00
Mike Blumenkrantz	3c5eae639d	glsl: make gl_ViewID_OVR visible to all shader stages according to spec Fixes: `328c29d600` ("mesa,glsl,gallium: add GL_OVR_multiview") Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32946>	2025-01-10 19:10:48 +00:00
Alyssa Rosenzweig	d9b4867e2a	nir/lower_robust_access: fix robustness with atomic swap this was missed in the original v3d pass, and then the common code port inherited the bug. (so strictly this fix "should" be backported even farther back but it won't apply before the Fixes here, and I don't think we do LTS that far back anyway). in theory this should fix a corner case with robustness on the gl (but not vulkan, at least for apple) drivers on broadcom & apple. Fixes: `f0fb8d05e3` ("nir: Add nir_lower_robust_access pass") Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32907>	2025-01-08 15:59:05 +00:00
Alyssa Rosenzweig	7a4469681e	nir: pass a callback to nir_lower_robust_access rather than try to enumerate everything a driver might want with an unmanageable collection of booleans, just do a filter callback + data. this ends up simpler overall, and will allow Intel to use this pass for just 64-bit images without needing to add even more booleans. while we're churning the pass signature, also do a quick port to nir_shader_intrinsics_pass Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> [NIR and V3D] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32907>	2025-01-08 15:59:05 +00:00
Daniel Schürmann	d2f52e61c2	nir/divergence: change nir_has_divergent_loop() to return true only for divergent breaks The important information is whether a loop has a uniform number of iterations. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28627>	2025-01-08 13:33:54 +01:00
Mary Guillemard	42f6bb0456	libcl: Add VkQueryType and VkQueryResultFlagBits definitions Useful for query pool copy/clear meta shaders. Signed-off-by: Mary Guillemard <mary.guillemard@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32939>	2025-01-08 11:37:27 +00:00
Mary Guillemard	2e38a15070	libcl: Respect NDEBUG for assert In C, NDEBUG allows disabling the assert macro, let's follow this behaviour. Signed-off-by: Mary Guillemard <mary.guillemard@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32939>	2025-01-08 11:37:27 +00:00
Mary Guillemard	ecdccae990	nir,agx: Allow nir_precomp_print_blob to print a static array This makes it stop leaking shader binary blobs definition and is required for panfrost clc. Signed-off-by: Mary Guillemard <mary.guillemard@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32939>	2025-01-08 11:37:27 +00:00
Mary Guillemard	5f8addfd99	util/bitpack_helpers: Make fixed packs CL safe We emulate roundf and llroundf for compatibility. Signed-off-by: Mary Guillemard <mary.guillemard@collabora.com> Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32939>	2025-01-08 11:37:27 +00:00
Georg Lehmann	67d74a04b9	nir/peephole_select: allow load_vector/scalar_arg_amd Foz-DB Navi21: Totals from 1507 (1.90% of 79395) affected shaders: MaxWaves: 31830 -> 31870 (+0.13%); split: +0.20%, -0.08% Instrs: 938704 -> 937232 (-0.16%); split: -0.19%, +0.03% CodeSize: 4970860 -> 4964652 (-0.12%); split: -0.14%, +0.02% VGPRs: 79536 -> 79512 (-0.03%); split: -0.08%, +0.05% Latency: 5194524 -> 5218285 (+0.46%); split: -0.38%, +0.84% InvThroughput: 1200152 -> 1207251 (+0.59%); split: -0.02%, +0.61% VClause: 20728 -> 20741 (+0.06%); split: -0.11%, +0.17% SClause: 33612 -> 32871 (-2.20%); split: -2.78%, +0.57% Copies: 70601 -> 68847 (-2.48%); split: -2.62%, +0.13% Branches: 20032 -> 17521 (-12.53%) PreSGPRs: 47828 -> 47801 (-0.06%) VALU: 637446 -> 638094 (+0.10%); split: -0.02%, +0.13% SALU: 88627 -> 88462 (-0.19%); split: -1.08%, +0.90% VMEM: 36664 -> 36659 (-0.01%) Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32792>	2025-01-08 09:56:39 +00:00
Pierre-Eric Pelloux-Prayer	dd11eec06b	gl/spirv: update subgroup_size if GroupNonUniform is used This is similar to what link_intrastage_shaders is doing and it fixes the following test: KHR-Single-GL46.subgroups.builtin_var.compute.subgroupsize_compute Which was failing with SPIRV but passing with GLSL, the diff being: - SPIRV: "subgroup_size: 1" - GLSL: "subgroup_size: 2" Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32698>	2025-01-07 19:32:43 +00:00
Boris Brezillon	2af6e4beeb	pan: Don't pretend we support load_{vertex_id_zero_base,first_vertex} load_vertex_id_zero_base() is supposed to return the zero-based vertex ID, which is then offset by load_first_vertex() to get an absolute vertex ID. At the same time, when we're in a Vulkan environment, load_first_vertex() also encodes the vertexOffset passed to the indexed draw. Midgard/Bifrost have a sligtly different semantics, where load_first_vertex() returns vertexOffset + minVertexIdInIndexRange, and load_vertex_id_zero_base() returns an ID that needs to be offset by this vertexOffset + minVertexIdInIndexRange to get the absolute vertex ID. Everything works fine as long as all the load_first_vertex() and load_vertex_id_zero_base() calls are coming from the load_vertex_id() lowering. But as mentioned above, that's no longer the case in Vulkan, where gl_BaseVertexARB will be turned into load_first_vertex() and expect a value of vertexOffset in an indexed draw context. We thus need to fix the mismatch by introducing two new panfrost-specific intrinsic so we can stop abusing load_first_vertex() and load_vertex_id_zero_base(). Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32415>	2025-01-07 08:15:19 +00:00
Marek Olšák	3800f0af41	nir/algebraic: optimize pack_split(unpack(a).x, unpack(a).y) -> a This is required to optimize FP64 and Int64 shaders generated by virglrenderer. It generates pack/unpack around every 64-bit op, which NIR currently can't eliminate. This fixes that. There is a new constraint ".y", which means that the use of an instruction should have swizzle.y. This allows us to add patterns that have Y swizzle on results of instructions. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32172>	2025-01-07 05:47:52 +00:00
Marek Olšák	b1bc691b0f	nir/algebraic: add and improve pack/unpack patterns Some duplicated patterns are removed. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32172>	2025-01-07 05:47:52 +00:00
Marek Olšák	ebec182b04	nir/algebraic: use is_used_once for comparison patterns otherwise we are just creating new instructions while not removing any Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32172>	2025-01-07 05:47:52 +00:00
Alyssa Rosenzweig	09b5608607	glsl: fix glsl_get_word_size_align_bytes this was copypasted from the wrong function. fixes on asahi KHR-Single-GL46.arrays_of_arrays_gl.SubroutineArgumentAliasing4_var_type_index_13 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Mary Guillemard <mary@mary.zone> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32871>	2025-01-06 22:09:49 +00:00
Marek Olšák	d09ba36f98	glsl: fix corruption due to blake3 hash not being set for nir_opt_undef NIR is generated sooner, so we need to set it sooner. This fixes Viewperf13/CATIA_car_04. Fixes: `cbfc225e2b` - glsl: switch to a full nir based linker Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32876>	2025-01-06 19:50:51 +00:00

1 2 3 4 5 ...

10145 commits