fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-17 20:28:05 +02:00

Author	SHA1	Message	Date
Timothy Arceri	cba2fd51a2	nir: add variable data fields required for NIR glsl varying linking These will be used in the following patches that add a NIR based varying linker. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15731>	2022-05-16 03:33:18 +00:00
Jason Ekstrand	98cc4c3a20	nir: Use nir_shader_instructions_pass in nir_lower_input_attachments This simplifies things a bit and also fixes metadata handling. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16482>	2022-05-13 22:51:38 +00:00
Jason Ekstrand	a170448a18	nir: Put the builder first in lower_input_attachments helpers This is more idiomatic. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16482>	2022-05-13 22:51:38 +00:00
Jason Ekstrand	5410f4ee89	mesa/st: Use lower_indirect_var_derefs in st_nir_lower_builtin Instead of having a special NIR helper for GL stuff, we can now use the more generic helper and do so directly. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16482>	2022-05-13 22:51:38 +00:00
Jason Ekstrand	e16197c46e	nir: Add a var set version of lower_indirect_derefs This version takes a set of variables and totally lowers indirects on any variable in the set. We also rewrite the builtin_uniform version to use the new helper internally. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16482>	2022-05-13 22:51:38 +00:00
Jason Ekstrand	c23b20d43a	nir: Preserve metadata if remove_dead_derefs makes no progress Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16482>	2022-05-13 22:51:38 +00:00
Georg Lehmann	bc5c68fc08	nir/opt_algebraic: Optimize Doom Eternal's word extract by LSB. Foz-db GFX10_3: Totals from 419 (0.31% of 134913) affected shaders: CodeSize: 4126032 -> 4121756 (-0.10%) Instrs: 783608 -> 782541 (-0.14%) Latency: 7889664 -> 7888521 (-0.01%); split: -0.02%, +0.00% InvThroughput: 1315690 -> 1314863 (-0.06%); split: -0.06%, +0.00% VClause: 11826 -> 11830 (+0.03%) SClause: 27736 -> 27734 (-0.01%) Copies: 50493 -> 50428 (-0.13%); split: -0.13%, +0.01% PreSGPRs: 23264 -> 23265 (+0.00%) Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16436>	2022-05-12 17:10:41 +00:00
Konstantin Seurer	938c9d9615	nir: Add a ray launch size addr intrinsic Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15712>	2022-05-12 15:04:31 +00:00
Timothy Arceri	0f98ed4afe	nir: remove unreachable loop terminators Remove the conditional break statements associated with all terminators that are associated with a fixed iteration count, except for the one associated with the limiting terminator. This logic matches similiar functionality that exists in the old GLSL IR unrolling code. This change helps a piglit test pass on the r300 driver once we switch off the old GLSL IR unrolling code. Shader-db results IRIS (BDW): total instructions in shared programs: 17538619 -> 17538595 (<.01%) instructions in affected programs: 216 -> 192 (-11.11%) helped: 3 HURT: 0 helped stats (abs) min: 7 max: 10 x̄: 8.00 x̃: 7 helped stats (rel) min: 10.00% max: 12.07% x̄: 11.38% x̃: 12.07% total cycles in shared programs: 858674910 -> 858672810 (<.01%) cycles in affected programs: 79540 -> 77440 (-2.64%) helped: 3 HURT: 0 helped stats (abs) min: 620 max: 800 x̄: 700.00 x̃: 680 helped stats (rel) min: 2.45% max: 2.83% x̄: 2.63% x̃: 2.62% Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16399>	2022-05-12 02:06:31 +00:00
Timothy Arceri	4c3d138e5d	nir: always set the exact_trip_count_unknown loop terminator property Previously we only cared if this was set for the limiting terminator. However in the following patch we will make use of this information on other terminators to decide if we can eliminate them. Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16399>	2022-05-12 02:06:31 +00:00
Timur Kristóf	7de3034897	ac/nir: Add I/O lowering for task and mesh shaders. Task shaders store their output payload to VRAM where mesh shaders read from. There are two ring buffers: 1. Draw ring: this is where mesh dispatch sizes and the ready bit are stored. 2. Payload ring: this is where the optional payload is stored (up to 16K per task workgroup). Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14929>	2022-05-12 00:29:51 +00:00
Jason Ekstrand	df1876f615	nir: Mark negative re-distribution on fadd as imprecise Otherwise, it would mutate `fneg(fadd(-0, 0))` into `fadd(0, -0)` which isn't correct since -0 + (+0) = +0 + (-0) = +0. This fixes the OpenCL contraction tests on Iris. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16041>	2022-05-12 00:05:10 +00:00
Jason Ekstrand	25249e8be2	nir/lower_blend: Expand or shrink output variables as needed Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16309>	2022-05-11 14:47:33 +00:00
Jason Ekstrand	1d22465362	nir/builder: Add a nir_resize_vector helper We're about to use this a couple of places. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16309>	2022-05-11 14:47:33 +00:00
Jason Ekstrand	352e32e5ba	nir/builder: Add a nir_trim_vector helper This pattern pops up a bunch and the semantics of nir_channels() aren't very convenient much of the time. Let's add a nir_trim_vector() which matches nir_pad_vector(). Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16309>	2022-05-11 14:47:33 +00:00
Jason Ekstrand	244b654de6	nir/lower_blend: Support SNORM and integer formats for logic ops This fixes 158 of the dEQP-VK.pipeline.logic_op.* tests, once we turn the feature on. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16309>	2022-05-11 14:47:33 +00:00
Jason Ekstrand	730d2b7660	nir/lower_blend: Stop passing the whole options object around Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16309>	2022-05-11 14:47:33 +00:00
Jason Ekstrand	dcfffdcad1	nir/lower_blend: Be more explicit about deref assumptions Because we pull the RT from the variable location and use that to look up formats, we need a constant RT index. To deal with arrays (possibly of arrays), we would either need to handle array derefs (we don't today) or we need to require the variables to be split into one variable per RT. Given that we have to lower indirect derefs anyway (to get constant indices), we may as well require the client to split output variables by calling nir_lower_io_arrays_to_elements_no_indirect(). Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16309>	2022-05-11 14:47:33 +00:00
Georg Lehmann	60c9a45562	nir/opt_algebraic: Simple xor/ishr optimizations. The first pattern here removes the xor-swap pattern. Foz-DB GFX10_3: Totals from 305 (0.23% of 134913) affected shaders: CodeSize: 1589040 -> 1585164 (-0.24%) Instrs: 284344 -> 283375 (-0.34%) Latency: 4205148 -> 4198472 (-0.16%); split: -0.16%, +0.00% InvThroughput: 708745 -> 708739 (-0.00%) Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16411>	2022-05-10 19:29:31 +00:00
Georg Lehmann	66e917fff6	nir/opt_algebraic: Fix mask in shift by constant combining. The comment above is correct, but the code to calculate the mask was broken. No Foz-db changes outside of noise. Fixes: `0e6581b87d` ("nir/algebraic: Reassociate shift-by-constant of shift-by-constant") Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15990>	2022-05-10 18:47:21 +00:00
Timur Kristóf	7f189e3467	nir: Add upper bound for AMD shader arg intrinsics. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13155>	2022-05-10 17:16:03 +00:00
Jason Ekstrand	625b352f14	nir: Set image_buffers and msaa_images in lower_samplers_as_deref This is where we set images_used so it's less likely that things will accidentally get out-of-sync. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15988>	2022-05-10 11:21:39 -05:00
Jordan Justen	1c3e584dfa	nir/divergence: handle more *_intel intrinsics v2: fix topo/btd (Lionel) Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16421>	2022-05-10 08:49:58 +00:00
Karol Herbst	9c5fd100cc	nir: add a nir_remove_non_entrypoints helper This code just got duplicated a lot. There is still more, but the remaining instances do a bit more than just removing other functions. Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16348>	2022-05-10 03:37:44 +00:00
Jason Ekstrand	4b67d70d22	nir: Fix constant folding for non-32-bit ifind_msb and clz Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16348>	2022-05-10 03:37:44 +00:00
Lionel Landwerlin	35d82ecf1e	nir/lower_shader_calls: put inserted instructions into a dummy block When moving code into the main block or loop blocks, put the code into its own : if(true) { ... } block so that we avoid break/continue/return issues. v2: Also take care of the main block with return instructions v3: Make deletion more obvious with dummy if blocks (Jason) v4: Fixup assert for loops (Lionel) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `8dfb240b1f` ("nir: Add raytracing shader call lowering pass.") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16036>	2022-05-09 08:43:40 +00:00
Lionel Landwerlin	9cf986dcff	nir/lower_shader_calls: don't insert code after break/continue When moving code from below to the insertion cursor point, if the cursor points to a jump instruction, don't bother inserting the code. It would break the break/continue assumptions of NIR and would not be executed anyway. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `8dfb240b1f` ("nir: Add raytracing shader call lowering pass.") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16036>	2022-05-09 08:43:40 +00:00
Lionel Landwerlin	51dea59eb4	nir/lower_shader_calls: don't use nop instructions as cursors Stop using nop instructions which are causing issues with break/continue, instead use a nir_cursor (which brings its share of pain). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `8dfb240b1f` ("nir: Add raytracing shader call lowering pass.") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16036>	2022-05-09 08:43:40 +00:00
Jason Ekstrand	25661ea028	nir/cf: Return a cursor from nir_cf_extract as well Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16036>	2022-05-09 08:43:40 +00:00
Lionel Landwerlin	d65cf403f3	nir/cf: return cursor after insertion of cf_list This will be useful to cut code from one location and paste it at another place and later keep pasting after the previous insertions. v2: update comment (Jason) deal with stiching 2 empty blocks (Jason) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16036>	2022-05-09 08:43:40 +00:00
Icecream95	ad864a7c15	nir/lower_tex: Copy more fields in lower_tex_to_txd and friends Fixes NIR validation errors for OpenMW on Panfrost. Fixes: `1f97819fbe` ("panfrost: Emulate GL_CLAMP on Bifrost") Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15939>	2022-05-07 10:51:10 +00:00
Mike Blumenkrantz	5c24eb721a	nir/gather_info: flag fbfetch on subpass image loads might not be able to determine which output is being read, but these are definitely fbfetch uses (from lavapipe) Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16346>	2022-05-06 17:04:34 +00:00
Georg Lehmann	5833fab766	nir/lower_mediump: Add a new pass to fold 16bit image load/store. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15179>	2022-05-04 09:58:03 +00:00
Alyssa Rosenzweig	ca280b2283	nir: Don't set writes_memory for reading XFB That's a read, not a write. Fixes optimizations getting disabled for fragment shaders when linked with a shader producing transform feedback varyings. Fixes: `85a723975b` ("nir: add and gather shader_info::writes_memory") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16285>	2022-05-03 19:02:17 +00:00
Emma Anholt	e3607e96bb	nir: Eliminate out-of-bounds read/writes in local lowering. Avoids nir validation assertion failures, and it's not like backend drivers would want to see definitely-out-of-bounds read/writes either. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16066>	2022-05-03 18:32:47 +00:00
Timothy Arceri	180398f785	nir: fix sorting before assigning varying driver locations We need to make sure we also properly sort varyings sharing a single slot otherwise we can end up assigning earlier components to the next slot if we have already processed later components. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6392 Fixes: `1e93b0caa1` ("mesa/st: add support for NIR as possible driver IR") Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16208>	2022-05-03 00:04:30 +00:00
Jason Ekstrand	c31db58f65	nir/deref: Add an alu-of-cast optimization Casts shouldn't change the bit pattern of the deref and you have to cast again after you're done with the ALU anyway so we can ignore casts on ALU sources. This means we can actually start constant folding NULL checks even if there are annoying casts in the way. Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15673>	2022-04-28 23:05:48 +00:00
Emma Anholt	536c8ee96d	nir/lower_tex: Make the adding a 0 LOD to nir_op_tex in the VS optional. This controls the whole lowering of "make tex ops with implicit derivatives on non-implicit-derivative stages be tex ops with an explicit lod of 0 instead", but it's really hard to describe that in a git commit summary. All existing callers get it added except: - nir_to_tgsi which didn't want it. - nouveau, which didn't want it (fixes regressions in shadowcube and shadow2darray with NIR, since the shading languages don't expose txl of those sampler types and thus it's not supported in HW) - optional lowering passes in mesa/st (lower_rect, YUV lowering, etc) Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16156>	2022-04-28 21:26:08 +00:00
Karol Herbst	a2c9e1cb50	nir: add 16 and 64 bit fisnormal lowering Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16206>	2022-04-28 18:36:52 +00:00
Gert Wollny	47d3f7c69f	nir: Don't optimize to 64 bit fsub if the driver doesn't support it Fixes: `a4840e15ab` r600: Use nir-to-tgsi instead of TGSI when the NIR debug opt is disabled. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16130>	2022-04-27 00:01:20 +00:00
Jason Ekstrand	e24d8760e9	nir: Constant fold sampler/texture offsets Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16171>	2022-04-26 22:34:39 +00:00
Jason Ekstrand	9332598b26	nir/constant_folding: Break TXB folding into a helper function Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16171>	2022-04-26 22:34:39 +00:00
Yevhenii Kolesnikov	65caf46b3b	nir: Remove single-source phis before opt_if_loop_last_continue We might have some single-source phis leftover after prior optimizations. We want to get rid of them before merging the blocks. Fixes: `5921a19d4b` ("nir: add if opt opt_if_loop_last_continue()") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6312 Signed-off-by: Yevhenii Kolesnikov <yevhenii.kolesnikov@globallogic.com> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16095>	2022-04-26 17:06:07 +00:00
Alyssa Rosenzweig	94b01ddcdd	nir: Use u_worklist to back nir_block_worklist u_worklist is nir_block_worklist, suitably generalized. All we need to do is define the macros to translate between the APIs. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16046>	2022-04-25 23:50:57 +00:00
Jason Ekstrand	1755730362	nir: Lower all bit sizes of usub_borrow It's not clear why this is restricted to 32-bit besides that being the only bit size where GLSL has an intrinsic for this. All drivers that set this probably want it lowered for all bit sizes as far as I can tell. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6353 Fixes: `8a3e344180` ("nir/opt_algebraic: Fix some expressions with ambiguous bit sizes") Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16146>	2022-04-25 21:27:09 +00:00
Samuel Pitoiset	4ebb5391ac	nir: mark XFB varyings as unmoveable to prevent them to be remapped XFB varyings are considered as always active IO to prevent them to be removed or compacted. Though, if the NIR linker doesn't mark XFB varyings as unmoveable it still possible to remap other varyings to the same location/component. Fixes KHR-Single-GL46.enhanced_layouts.xfb_override_qualifiers_with_api with Zink and a bunch of other dEQP XFB tests. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6301 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16092>	2022-04-25 07:56:27 +00:00
Samuel Pitoiset	26f74f17d9	nir: fix marking XFB varyings as always active IO Components need to be handled, otherwise if a shader has two XFB varyings at the same location, only one will be marked as always active. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16092>	2022-04-25 07:56:27 +00:00
Marek Olšák	f7a77ff900	nir: fix an uninitialized variable valgrind warning in nir_group_loads pass_flags is only initialized for grouped loads, so change the order Fixes: `33b4eb149e` - nir: add new SSA instruction scheduler grouping loads into indirection groups Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16090>	2022-04-22 18:18:09 +00:00
Lionel Landwerlin	9f44a26462	nir/divergence: handle load_global_block_intel Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `dd39e311b3` ("nir: Add nir_intrinsic_{load,store}_deref_block_intel") Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16075>	2022-04-21 17:41:04 +00:00
Gert Wollny	496fd59d71	nir: Add pass to split 64 bit vec3 and vec4 variable access and phis NTT can't convert local 64 variables of type vec3 or vec4, therefore, they need to be split into vec2 + double or vec2 + vec2. At the same time deal splitting the phi nodes. The pass goes into the global namespace because it is also useful for r600. v2: only lower function_temps (Emma) and handle array of arrays (Jason) v3: - remove bool parameter in function to merge vec3 and vec4 - simplify load/store_deref_(array\|var) - use a pointer hash table - simplify PHI splitting (all Emma) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15945>	2022-04-21 16:57:11 +00:00

1 2 3 4 5 ...

3681 commits