fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-24 11:00:11 +01:00

Author	SHA1	Message	Date
M Henning	e506955056	nir: Handle texop_*_nv in nir_tex_instr_is_query Fixes: `aa1f00cf` ("nir/gather_info: handle uses_fbfetch_output for texture operations") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11505 Tested-by: Thomas H.P. Andersen <phomes@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30166>	2024-07-13 15:36:29 +00:00
Marek Olšák	1b2cd628b8	nir: rename ordered_xfb_counter_add_gfx12_amd -> ordered_add_loop_gfx12_amd because it can also be used by compute. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30063>	2024-07-13 01:32:48 +00:00
Samuel Pitoiset	aa1f00cf5c	nir/gather_info: handle uses_fbfetch_output for texture operations Like nir_texop_txf_ms. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30109>	2024-07-12 09:33:51 +00:00
Samuel Pitoiset	0d0b949cd7	nir/gather_info: handle uses_fbfetch_output for sparse image loads Looks like this was missing. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30109>	2024-07-12 09:33:51 +00:00
Christian Gmeiner	87786a7a7e	nak: Move imad late optimization to nir It is more or less just a code move, but I touched is_only_used_by_iadd(..) to match the style of the other functions in that file. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30099>	2024-07-12 05:54:46 +00:00
Rhys Perry	c4706c6177	nir/linking_helpers: remove nested IF Just add a && to the condition. This is more readable to me. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25590>	2024-07-10 19:11:38 +00:00
Rhys Perry	525aacd9d7	nir/linking_helpers: remove varying accesses in nir_remove_unused_io_vars interp_deref_at_sample of a nir_var_shader_temp is nonsensical and might be ignored by later passes, instead of removed. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7818 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10588 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25590>	2024-07-10 19:11:38 +00:00
Rhys Perry	bcd98e091a	nir/linking_helpers: remove special case for read mesh outputs Only VK_NV_mesh_shader allows this kind of access, and no driver advertises that extension anymore. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25590>	2024-07-10 19:11:38 +00:00
Rhys Perry	57080749f7	gallium: remove PIPE_CAP_SHADER_CAN_READ_OUTPUTS nir_lower_io_to_temporaries is now done for all stages except TCS, and nir_lower_io_to_temporaries with a TCS is a no-op. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25590>	2024-07-10 19:11:38 +00:00
Rhys Perry	767ea18517	glsl: always lower non-TCS outputs to temporaries It seems only radeonsi and v3d sets CAN_READ_OUTPUTS/SupportsReadingOutputs, and v3d has lower_all_io_to_temps=true. It looks like radeonsi basically lowers the outputs to temporaries in the backend. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25590>	2024-07-10 19:11:38 +00:00
Connor Abbott	45a57fa735	ir3: Plumb through descriptor prefetch intrinsics Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29873>	2024-07-10 11:54:15 +00:00
Connor Abbott	ccf88d940b	nir/instr_set: Don't remove matching instruction We currently assume that the instruction is already inserted and we are optimizing it away, but in the use case I have where we are hoisting instructions into a preamble and deduplicating as we go along, that isn't the case. Move this responsibility onto the caller, which also makes it a bit clearer what's going on and turns this into something more similar to an actual set. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29873>	2024-07-10 11:54:15 +00:00
Connor Abbott	cda7d9c971	nir/instr_set: Return the matching instruction This allows use cases where we copy over expression trees and deduplicate as we go along. We can use the matching instruction to build up the rest of the expression tree. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29873>	2024-07-10 11:54:15 +00:00
Alyssa Rosenzweig	0ce2e6594d	nir/opt_constant_folding: fix array size define In practice these are equal but the old code was semantically wrong: that dimension is "sources" not "components". Use the correct #define. This came up when reviewing https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29994 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30066>	2024-07-08 14:34:29 +00:00
Timothy Arceri	d1767ddd13	glsl/tests: fix test_gl_lower_mediump This fixes test_gl_lower_mediump to properly test linking, which also means we can drop all the custom nir calls as we are now simply passing the tests directly through the real nir linking code. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30034>	2024-07-08 06:38:19 +00:00
Timothy Arceri	2f5b99ec17	glsl/standalone: init EmptyUniformLocations This updates the scaffolding to reflect init_shader_program() and will be required in the following patch to avoid a segfault. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30034>	2024-07-08 06:38:19 +00:00
Timothy Arceri	5ae5229e3d	glsl/mesa: remove UniformHash field Unused since `9617184bc2` Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30034>	2024-07-08 06:38:19 +00:00
Konstantin Seurer	d9e41e8a8c	nir: Stop using "capture : true" for nir_opt_algebraic "capture : true" is suboptimal and prevents the script from writing multiple files in one go. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30041>	2024-07-06 15:51:06 +00:00
Qiang Yu	8e146512d1	glsl: fix indirect tess factor access for compact_arrays=false drivers Drivers with compact_arrays=false (i.e. radeonsi) are broken when a tess factor is accessed indirectly, for example: gl_TessLevelOuter[gl_InvocationID] = xxx; This fix uses nir_vectorize_tess_levels to lower array tess factor access into direct vector access before nir_lower_io(), the same way clip and cull distances are handled. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29799>	2024-07-03 02:06:56 +00:00
Qiang Yu	a071929f8d	nir: consider more deref types when fixup deref Fix ANV and virpipe CI test fail when nir_fixup_deref_types is used in nir_vectorize_tess_levels by later commits. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29799>	2024-07-03 02:06:56 +00:00
Qiang Yu	f9ed3158b4	nir: nir_vectorize_tess_levels support indirect access Replace the implementation with nir_lower_array_deref_of_vec. This will be used by compact_array=false drivers to lower indirect tess levels array access to direct vector access too. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29799>	2024-07-03 02:06:56 +00:00
Qiang Yu	3151f5ec47	nir: add filter parameter to nir_lower_array_deref_of_vec To be used by later commits to limit the lowering to specific variables. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29799>	2024-07-03 02:06:56 +00:00
Timothy Arceri	370ed7b021	glsl: make warning tests pass linking The standalone compiler previously ran these tests through a hacked up partial linker. When this partial linker was recently removed from the standalone compiler the --link option was turned on because some tests are testing linking not just compilation. However in a future patchset we will switch the standalone linker to use the nir linking code and when this is done all of these shaders will need to pass full linking, so here we update them to do so. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29991>	2024-07-03 01:20:02 +00:00
Timothy Arceri	a71ce0a6d6	glsl: drop glsl ir optimisation from the standalone compiler There are no more users of the glsl ir at this point in the standalone compiler anymore for these optimisations. Later patches will also switch the standalone compiler to the nir linker. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29991>	2024-07-03 01:20:02 +00:00
Timothy Arceri	063d62f142	glsl: move call to create explicit ifc layout out of glsl_to_nir We move this later so that we can call glsl_to_nir() on glsl ir that has not set the array size on unsized ifc members. Later patches will move sizing of the arrays out of glsl ir and into the nir linker. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29991>	2024-07-03 01:20:02 +00:00
David Heidelberg	68215332a8	build: pass licensing information in SPDX form Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Dylan Baker <dylan.c.baker@intel.com> Acked-by: Eric Engestrom <eric@igalia.com> Acked-by: Daniel Stone <daniels@collabora.com> Signed-off-by: David Heidelberg <david@ixit.cz> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29972>	2024-06-29 12:42:49 -07:00
Jesse Natalie	c2b53d7bd0	nir: Remove assert-only variable by inlining its single use Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29970>	2024-06-28 20:44:36 +00:00
Alyssa Rosenzweig	30db807f79	nir/algebraic: explicitly suffix constants Make our intentions super duper clear. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Suggested-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29952>	2024-06-28 19:53:36 +00:00
Alyssa Rosenzweig	270446ee21	nir: fix miscompiles with rules with INT32_MIN `812b3415` added rules for upcasts with comparisons with a variety of types. The float & unsigned rules should be ok, but the signed integer rules are unsound as currently implemented. This can cause end-to-end miscompiles. I originally hit this issue while debugging a large real world OpenCL kernel. I found the bug symptoms changed when disabling loop unrolling, which tipped me off to a compiler bug. I've reduced it to a minimal test case. Imagine my surprise when I find out the NIR my backend ingested was already constant folded to be wrong. In the minimal test case, during optimization we have NIR: 32 %6 = .... 64 %9 = i2i64 %6 64 %44 = load_const (0x0000000000000001) 1 %45 = ilt %9, %44 (0x1) This is a simple check (int64_t)%6 < 1. nir_opt_algebraic turns this into: 32 %6 = ... 64 %9 = i2i64 %6 64 %44 = load_const (0x0000000000000001) 64 %55 = load_const (0x0000000080000000 = 2147483648) 1 %56 = ilt %55 (0x80000000), %44 (0x1) 64 %57 = load_const (0x000000007fffffff = 2147483647) 1 %58 = ilt %57 (0x7fffffff), %44 (0x1) 32 %59 = i2i32 %44 (0x1) 1 %60 = ilt %6, %59 1 %61 = ior %58, %60 1 %62 = iand %56, %61 This pile of math constant-folds to an unconditional "false"! The problem is %56. At first glance, INT32_MIN < 1 is true so %56 should be true. Indeed, it should. But here's the kicker: both constants are 64-bit here, so the ilt operation is a 64-bit comparison -- that left-hand side is INT32_MIN zero-extended to 64-bit for the signed comparison at 64-bit. So in fact, it evaluates to false, causing the whole expression to go false. If we're going to do a 64-bit comparison for %56, then we need to sign-extend the bound. So we'll just adjust the Python and be on our way, right? Unfortunately the issue is deeper. 
According to the comment in the generated nir_opt_algebraic.c file, the guilty algebraic rule is: ('ilt', ('i2i64', 'a@32'), '#b') => ('iand', ('ilt', -2147483648, 'b'), ('ior', ('ilt', 2147483647, 'b'), ('ilt', 'a', ('i2i32', 'b')))) From a Python perspective? That rule is correct. -2147483648 < 1 is a true statement. Adjusting the Python rule is not the appropriate solution here, since the issue is more fundamental and might affect other rules. The real problem is the translation of that Python replacement tree into C, incorrectly zero-extending -2147483648 into 0x0000000080000000 instead of sign-extending to 0xffffffff80000000. Crawling down the rabbit hole of the generated algebraic file, we see the constant encoded as: { .constant = { { nir_search_value_constant, 64 }, nir_type_int, { -0x80000000 /* -2147483648 */ }, } }, NIR correctly translates the negative constant to a C level negate operation of its absolute value. This maps to the correct sign-extension... ...for all constants except for INT_MIN. Because that constant lacks a ULL suffix, it is a 32-bit integer. And for this integer (only), negating it hits signed integer overflow (UB!) and then we end up with an effective zero-extension when going to 64-bit. This patch fixes the end-to-end miscompile. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Closes: #11402 Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29952>	2024-06-28 19:53:36 +00:00
Georg Lehmann	3e86d2452f	nir/opt_algebraic: add various unordered/ordered patterns from aco Foz-DB Navi21: Totals from 6747 (8.50% of 79395) affected shaders: MaxWaves: 134646 -> 134642 (-0.00%) Instrs: 7830299 -> 7828851 (-0.02%); split: -0.03%, +0.01% CodeSize: 43045532 -> 43010260 (-0.08%); split: -0.09%, +0.00% VGPRs: 378960 -> 378968 (+0.00%) SpillSGPRs: 1209 -> 1208 (-0.08%) Latency: 74667977 -> 74670405 (+0.00%); split: -0.02%, +0.02% InvThroughput: 20124981 -> 20124768 (-0.00%); split: -0.02%, +0.02% VClause: 162870 -> 162868 (-0.00%); split: -0.00%, +0.00% SClause: 277280 -> 277315 (+0.01%); split: -0.00%, +0.02% Copies: 528627 -> 528667 (+0.01%); split: -0.00%, +0.01% PreSGPRs: 319526 -> 319508 (-0.01%) PreVGPRs: 334264 -> 334265 (+0.00%); split: -0.00%, +0.00% VALU: 5485412 -> 5485408 (-0.00%); split: -0.02%, +0.02% SALU: 743882 -> 742301 (-0.21%); split: -0.21%, +0.00% Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29467>	2024-06-27 08:12:30 +00:00
Georg Lehmann	434dfb51ca	nir/opt_algebraic: optimize cmp(fneg(a), #b) and feq with fabs Foz-DB Navi21: Totals from 2483 (3.13% of 79395) affected shaders: Instrs: 4067533 -> 4067756 (+0.01%); split: -0.00%, +0.01% CodeSize: 22525156 -> 22499904 (-0.11%); split: -0.12%, +0.01% Latency: 51967223 -> 51963654 (-0.01%); split: -0.01%, +0.00% InvThroughput: 16685020 -> 16683045 (-0.01%); split: -0.01%, +0.00% SClause: 131890 -> 131907 (+0.01%) Copies: 402557 -> 402510 (-0.01%); split: -0.01%, +0.00% Branches: 146962 -> 146958 (-0.00%) PreSGPRs: 118404 -> 118401 (-0.00%) PreVGPRs: 123791 -> 123787 (-0.00%) VALU: 2709846 -> 2710174 (+0.01%); split: -0.00%, +0.01% SALU: 565883 -> 565786 (-0.02%) Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29467>	2024-06-27 08:12:30 +00:00
Georg Lehmann	98cc57bccb	nir/optimize cmp(a, -0.0) +0.0 can use an inline constant for AMD hardware, -0.0 needs a literal. Foz-DB Navi21: Totals from 1014 (1.28% of 79395) affected shaders: Instrs: 3037490 -> 3036849 (-0.02%); split: -0.02%, +0.00% CodeSize: 17060228 -> 17051276 (-0.05%); split: -0.05%, +0.00% Latency: 45916788 -> 45916600 (-0.00%); split: -0.00%, +0.00% InvThroughput: 12982201 -> 12982187 (-0.00%); split: -0.00%, +0.00% VClause: 79475 -> 79478 (+0.00%) SClause: 119935 -> 119934 (-0.00%); split: -0.00%, +0.00% Copies: 301641 -> 300964 (-0.22%); split: -0.23%, +0.00% PreSGPRs: 59155 -> 59144 (-0.02%) VALU: 2032016 -> 2032034 (+0.00%) SALU: 386424 -> 385729 (-0.18%) Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29467>	2024-06-27 08:12:30 +00:00
Georg Lehmann	8e6bf596cb	nir/opt_algebraic: look through fabs/fneg when matching fmulz/ffmaz Prevents regressions when removing input modifiers from a == 0.0. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29467>	2024-06-27 08:12:30 +00:00
Georg Lehmann	99372c1ed7	nir: add ford, funord, fneo, fequ, fltu, fgeu Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29467>	2024-06-27 08:12:29 +00:00
Timothy Arceri	6006588ad8	glsl: remove out of date TODO The TODO was complete when the glsl version of this function was removed in `318d8ce6fc` Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29887>	2024-06-27 01:02:25 +00:00
Eli Schwartz	a4e0eb55ce	meson: create libglsl declared dependency to propagate order-only deps https://mesonbuild.com/FAQ.html#how-do-i-tell-meson-that-my-sources-use-generated-headers A few locations had underspecified deps on the header files, and this caused builds to fail given sufficient parallelism. Fix this by creating an interface library that can be linked against, instead. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29115>	2024-06-26 22:54:50 +00:00
Alyssa Rosenzweig	dd85b50d18	treewide: use nir_break_if Via Coccinelle patch and some manual hunk editing: @@ expression b, E; @@ -nir_push_if(b, E); -{ -nir_jump(b, nir_jump_break); -} -nir_pop_if(b, NULL); +nir_break_if(b, E); Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29877>	2024-06-26 19:07:35 +00:00
Alyssa Rosenzweig	d57934fdec	nir: add nir_break_if helper I see people open-coding this all over the tree and it makes nir_builder loops really annoying. Make them slightly less annoying with a helper. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29877>	2024-06-26 19:07:35 +00:00
Karol Herbst	3482ea599b	nir/schedule: add write dep also for shared_atomic Otherwise it might change the order between a load_shared and a shared_atomic on the same location. Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29918>	2024-06-26 18:20:14 +00:00
Connor Abbott	ec37e65a2d	ir3: Introduce elect_any_ir3 For preambles, we don't actually care which invocation we get, so we don't have to enable helper invocations when the preamble uses "getone." Introduce a new intrinsic with the right semantics and plumb it through. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29914>	2024-06-26 17:40:15 +00:00
Karol Herbst	d5da434851	nir/opt_sink: add load_kernel_input Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25362>	2024-06-26 10:04:02 +00:00
Karol Herbst	535e617ccd	nir/lower_alu: support 8 and 16 bit bit_count Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25362>	2024-06-26 10:04:02 +00:00
Qiang Yu	93f790b04a	nir: fix clip cull distance lowering metadata preserve indirect store lowering will use if/else which changes the control flow of the shader. Fixes: `110887de2b` ("nir: Add a new pass to lower array dereferences on vectors") Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29894>	2024-06-26 01:22:12 +00:00
Qiang Yu	09b4ba27a3	nir: fix lower array to vec metadata preserve indirect store lowering will change control flow, so we should not preserve control flow metadata when it's present. Fixes: `35b8f6f40b` ("nir: Add a new pass to lower array dereferences on vectors") Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29894>	2024-06-26 01:22:12 +00:00
Ian Romanick	6b678d32cb	nir: dpas_intel second source can have different number of components The number of components for the second source is -1 to avoid validation of its value. Some supported configurations will have the component count of that matrix different than the others. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28834>	2024-06-25 14:17:47 -07:00
Timothy Arceri	539aaad6a3	glsl: remove unused symbol table functionality Added in `a8f52647b0` and `c17c790387` but not used since `b04ef3c08a` over 10 years ago. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29868>	2024-06-25 00:18:42 +00:00
Caio Oliveira	a0877c132c	glsl: Fix warning related to tg4_offsets in release mode Compiler can't know that array_size() of the offsets parameter in textureGatherOffsets is (at most) 4, so use a MIN2() to make the limit visible. Just adding an assert() gets ignored in Release builds. This fixes the following warning in Release compilation: ``` ../src/compiler/glsl/glsl_to_nir.cpp: In member function ‘virtual void {anonymous}::nir_visitor::visit(ir_texture*)’: ../src/compiler/glsl/glsl_to_nir.cpp:2453:41: warning: writing 1 byte into a region of size 0 [-Wstringop-overflow=] 2453 \| instr->tg4_offsets[i][j] = val; \| ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~ In file included from ../src/compiler/glsl/glsl_to_nir.h:31, from ../src/compiler/glsl/glsl_to_nir.cpp:29: ../src/compiler/nir/nir.h:2470:11: note: at offset 8 into destination object ‘nir_tex_instr::tg4_offsets’ of size 8 2470 \| int8_t tg4_offsets[4][2]; \| ^~~~~~~~~~~ ../src/compiler/glsl/glsl_to_nir.cpp:2453:41: warning: writing 1 byte into a region of size 0 [-Wstringop-overflow=] 2453 \| instr->tg4_offsets[i][j] = val; \| ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~ ../src/compiler/nir/nir.h:2470:11: note: at offset 9 into destination object ‘nir_tex_instr::tg4_offsets’ of size 8 2470 \| int8_t tg4_offsets[4][2]; \| ^~~~~~~~~~~ ``` This is from: `gcc (GCC) 14.1.1 20240522 (Red Hat 14.1.1-4)`. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29508>	2024-06-21 17:37:46 -07:00
Juan A. Suarez Romero	ee1ced9dc5	glsl: fix downcasting addresses to wrong object types This fixes several downcasts of addresses to object types where the original object types were either different or invalid. These were detected through the Undefined Behaviour Sanitizer (UBSan). An example of such an issue: `downcast of address 0x55559c0cbcc0 which does not point to an object of type 'ir_variable' 0x55559c0cbcc0: note: object is of type 'ir_constant'` Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29772>	2024-06-21 21:07:05 +00:00
Juan A. Suarez Romero	60e7cb7654	nir: use unsigned types when performing bitshifting Ensure unsigned integers are used instead of signed ones when performing left bit shifts. This has been detected by the Undefined Behaviour Sanitizer (UBSan). Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29772>	2024-06-21 21:07:05 +00:00
Juan A. Suarez Romero	e43cc49806	nir: fix overflow when negating maxint in constant expressions Undefined Behaviour Sanitizer (UBSan) detected the following when running `dEQP-VK.graphicsfuzz.cov-fold-negate-min-int-value`: `negation of -2147483648 cannot be represented in type 'int'; cast to an unsigned type to negate this value to itself` The SPIR-V spec states that OpSNegate(0x80000000) has to return 0x80000000; in our case, negating -2147483648 should yield -2147483648. While this is not causing any issue because compilers seem to behave that way, it is still undefined behaviour, so it should be handled explicitly, which is the purpose of this commit. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29772>	2024-06-21 21:07:05 +00:00
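
Two of the commits above (e43cc49806 and 270446ee21) hinge on how INT32_MIN behaves under negation and under widening to 64 bits. The following is only a minimal sketch of that arithmetic in plain C, not the code from either patch; the helper names `wrapping_ineg32`, `sign_extend_32_to_64` and `zero_extend_32_to_64` are hypothetical and do not exist in Mesa.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical helper (not a Mesa function): negate a 32-bit signed value
 * with two's-complement wrap-around. The subtraction is done on an unsigned
 * type, where overflow is well defined, so INT32_MIN maps to itself instead
 * of triggering signed-overflow undefined behaviour. */
static int32_t
wrapping_ineg32(int32_t x)
{
   uint32_t bits = 0u - (uint32_t)x;
   int32_t result;
   /* Copy the bit pattern back; this avoids an out-of-range conversion. */
   memcpy(&result, &bits, sizeof(result));
   return result;
}

/* Hypothetical helper: widen by sign extension, preserving the value. */
static uint64_t
sign_extend_32_to_64(int32_t x)
{
   return (uint64_t)(int64_t)x;
}

/* Hypothetical helper: widen by zero extension, preserving only the low
 * 32 bits. For negative inputs this yields a large positive 64-bit value. */
static uint64_t
zero_extend_32_to_64(int32_t x)
{
   return (uint64_t)(uint32_t)x;
}
```

Under these assumptions, `wrapping_ineg32(INT32_MIN)` gives back INT32_MIN, matching the OpSNegate behaviour quoted in e43cc49806. Likewise `sign_extend_32_to_64(INT32_MIN)` is 0xffffffff80000000 while `zero_extend_32_to_64(INT32_MIN)` is 0x0000000080000000, the two bit patterns contrasted in 270446ee21.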

9505 commits