fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-30 01:20:17 +01:00

Author	SHA1	Message	Date
Timothy Arceri	2cf8c8cba4	nir/opt_copy_prop_vars: drop reuse of dynamic arrays After the previous commit there are so few to reuse that this is no longer worth doing and actually causes compilation to slow down. The Blender shader compile time in issue #9326 improves as folows: 21.11 seconds -> 9.90 seconds The CTS test dEQP-GLES31.functional.ubo.random.all_per_block_buffers.20 improves as follows: 0.92 seconds -> 0.68 seconds Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9326 Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24227>	2023-07-24 02:29:54 +00:00
Timothy Arceri	d56e739417	nir/opt_copy_prop_vars: skip cloning of copies arrays until needed Most of the variables in the hash table will never actually be looked up for any given block so cloning every possible value just creates a bunch of unrequired memcpy calls. Here we change the code to only clone the copies array once it is actually looked up for the first time. The Blender shader compile time in issue #9326 improves as folows: 151.09 seconds -> 21.11 seconds The CTS test dEQP-GLES31.functional.ubo.random.all_per_block_buffers.20 improves as follows: 1.67 seconds -> 0.92 seconds Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24227>	2023-07-24 02:29:54 +00:00
Timothy Arceri	869b5a562e	nir/opt_copy_prop_vars: remove var hash entry on kill alias If kill alias results in the hash table entry holding an empty copies array then remove the hash entry and return the dynamic array to the unused pool. This helps avoid hash table size getting out of control in very large shaders. 151.09 seconds -> 118.60 seconds Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24227>	2023-07-24 02:29:54 +00:00
Timothy Arceri	9b4c7cc611	nir/opt_copy_prop_vars: speedup cloning of copy tables Here we change things to simply clone the entire hash table. This is much faster than trying to rebuild it and is needed to avoid slow compilation of very large shaders. The Blender shader compile time in issue #9326 improves as folows: 251.29 seconds -> 151.09 seconds The CTS test dEQP-GLES31.functional.ubo.random.all_per_block_buffers.20 improves as follows: 2.38 seconds -> 1.67 seconds Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24227>	2023-07-24 02:29:54 +00:00
Timothy Arceri	e9804bdc4c	nir/opt_copy_prop_vars: don't clone copies if branch empty There is no point doing an expensive clone of the copies if the if-branch is empty. Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24227>	2023-07-24 02:29:54 +00:00
Alyssa Rosenzweig	190b1fdc64	nir: Convert to nir_foreach_function_impl Done by hand at each call site but going very quickly with funny Vim motions and common regexes. This is a very common idiom in NIR. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23807>	2023-06-27 22:44:04 +00:00
Alyssa Rosenzweig	815efcdf7e	nir: Use nir_builder_create perl -p0e 's/nir_builder ([^;]);\snir_builder_init\(&\1, /nir_builder \1 = nir_builder_create(/g' -i $(git grep -l nir_builder_init) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23860>	2023-06-27 18:13:02 +00:00
Alyssa Rosenzweig	df51464cac	nir: Remove handling for non-scoped barriers Nothing generates them so this is all dead. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23191>	2023-06-13 16:36:11 +00:00
Alyssa Rosenzweig	59e73674c3	nir: Drop legacy atomics in simple cases This commit drops legacy atomic support from core passes where we can simply delete switch cases with no other changes. As such it's separated from the more complex pass-specific commits for ease of review. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23036>	2023-05-16 22:36:21 +00:00
Alyssa Rosenzweig	97857731ce	nir: Handle unified atomics in simple cases Lots of passes can be made unified-atomics-aware simply by adding extra cases in their switch statements. This commit fixes a bunch of passes. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22914>	2023-05-12 20:39:46 +00:00
Daniel Schürmann	2bb369dd8d	nir: add assertions that loops don't have a Continue Construct Hoping that I didn't miss any, this should add assertions to all functions and passes which explicitly handle 'nir_loop'. Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13962>	2023-02-21 10:41:11 +00:00
Timothy Arceri	cb58d75224	nir/nir_opt_copy_prop_vars: don't call memset when cloning This makes the pass significantly faster cutting execution time by around 30% in the cts test dEQP-GLES31.functional.ubo.random.all_per_block_buffers.20 This 30% improvement is in addition to all the improvements from the proceeding patches. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20381>	2023-02-16 23:31:59 +00:00
Timothy Arceri	d1a41d9c64	nir/nir_opt_copy_prop_vars: reorder clone calls This helps with the reuse of dynamic arrays. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20381>	2023-02-16 23:31:59 +00:00
Timothy Arceri	2a2d85e254	nir/nir_opt_copy_prop_vars: reuse dynamic arrays As per the previous commit if we don't reuse these dynamic arrays we end up needlessly thrashing the memory handling functions. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20381>	2023-02-16 23:31:59 +00:00
Timothy Arceri	ffe0f3fda1	nir/nir_opt_copy_prop_vars: reuse hash tables Due to how this pass works we can end up thrashing memory if we do not reuse these hash tables rather than reusing them. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20381>	2023-02-16 23:31:59 +00:00
Timothy Arceri	731e9fd535	nir/nir_opt_copy_prop_vars: avoid comparison explosion Previously the pass was comparing every deref to every load/store causing the pass to slow down more the larger the shader is. Here we use a hash table so we can simple store everything needed for comparision of a var separately. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20381>	2023-02-16 23:31:59 +00:00
Timothy Arceri	8f6f5730f6	nir/nir_opt_copy_prop_vars: remove extra loop The fix in `947f7b452a` introduced an extra loop over the copies array to find the correct entry in the case it had been moved. The problem is these loops can be iterated over millions of times so lets simply update the entry pointer in the case we change its location in the array. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20381>	2023-02-16 23:31:59 +00:00
Emma Anholt	6c38797101	nir/nir_opt_copy_prop_vars: Don't leak dynarray memory during the pass. It was swept at the end, but it meant that in shaders with lots of copies available at the start of lots of if statements, you'd blow up memory usage. turnip memory consumption on dEQP-VK.ssbo.layout.random.scalar.75 drops from 1.4GB to 110MB, and runtime from 19s to 17s. Fixes: #7361 Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18891>	2022-10-03 15:33:21 +00:00
Rhys Perry	dab745f3b4	nir/copy_prop_vars: fix non-vector shader call payloads Fixes RADV+Q2RTX. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Fixes: `ff05137c2d` ("nir: introduce and use nir_component_mask") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16029>	2022-04-20 21:03:03 +00:00
Erik Faye-Lund	ff05137c2d	nir: introduce and use nir_component_mask The BITFIELD_MASK() macro is intended for using with actual bitfields, not with nir_component_mask_t. This means we do some extra work to handle values that are invalid for nir_component_mask_t in the first place. This eliminates some warnings on Clang, where the compiler complains about casting UINT32_MAX to UINT16_MAX. Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15547>	2022-04-19 06:54:47 +00:00
Emma Anholt	d506d910e4	nir: Switch to using nir_vec_scalars() for things that used nir_channel(). This should reduce follow-on optimization work to copy-propagate and dead-code away the movs generated in construction of vectors. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14865>	2022-03-02 22:28:58 +00:00
Bas Nieuwenhuizen	02c5dc8035	nir: Add lowered vendor independent raytracing intrinsics. For use in a generic nir_lower_shader_calls. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10339>	2021-06-21 21:23:51 +00:00
Jesse Natalie	09440ce3fb	nir: Fix MSVC warning C4334 (32bit shift cast to 64bit) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-By: Bill Kristiansen <billkris@microsoft.com> Cc: mesa-stable@lists.freedesktop.org Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10331>	2021-04-20 00:28:34 +00:00
Jason Ekstrand	e20e85f01e	nir: Make nir_ssa_def_rewrite_uses_after take an SSA value This replaces the new_src parameter of nir_ssa_def_rewrite_uses_after() with an SSA def, and rewrites all the users as needed. Acked-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9383>	2021-03-08 16:59:55 +00:00
Jason Ekstrand	117668b811	nir: Make nir_ssa_def_rewrite_uses take an SSA value This commit replaces the new_src parameter of nir_ssa_def_rewrite_uses() with an SSA def, removes nir_ssa_def_rewrite_uses_ssa(), and rewrites all the users as needed. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa@collabora.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9383>	2021-03-08 16:59:55 +00:00
Rhys Perry	8c3abcab6d	nir/copy_prop_vars: avoid a duplicate lookup if src == vec_src Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7511>	2020-11-20 13:57:34 +00:00
Rhys Perry	33eeb994be	nir/copy_prop_vars: use nir_deref_and_path Instead of recreating paths, create them once when needed using nir_deref_and_path. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7511>	2020-11-20 13:57:34 +00:00
Rhys Perry	be7b20135e	nir/copy_prop_vars,nir/dead_write_vars: ignore read-only loads Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7511>	2020-11-20 13:57:34 +00:00
Marcin Ślusarz	6e6dab4799	nir: handle float atomics in copy propagation pass Without this patch, copy propagation pass can optimize out buffer loads out of compare & swap loop, which then leads to infinite loop. Triggered by a change to atomicCompSwap float test in piglit. Fixes: `8424cd8fbd` ("nir: Account for atomics in copy propagation.") Suggested-by: Jason Ekstrand <jason@jlekstrand.net> Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7538>	2020-11-12 19:20:50 +00:00
Jason Ekstrand	03683b9b2e	nir: Handle ray-tracing intrinsics and storage classes in copy-prop etc. We need to consider shader calls as potential writes to their payloads. For other ray-tracing intrinsics, we may not have a shader payload pointer and have to treat them more like a barrier. We also need to ensure that global and SSBO reads/writes aren't propagated across shader call intrinsics. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6479>	2020-11-05 23:36:46 +00:00
Caio Marcelo de Oliveira Filho	dd39e311b3	nir: Add nir_intrinsic_{load,store}_deref_block_intel Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7448>	2020-11-04 20:24:48 +00:00
Jason Ekstrand	ced9b6f0d8	nir: Use nir_deref_mode_may_be in deref optimizations All the checks being replaced are fore potential aliasing so we want to flush stores whenever the mode might be something that aliases. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6332>	2020-11-03 22:18:28 +00:00
Jason Ekstrand	e363da3bdd	nir: Handle memcpy in copy_prop_vars and combine_stores Fixes: `b2899f7265` "nir: Add a new memcpy intrinsic" Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6871>	2020-10-02 07:30:49 +00:00
Eric Anholt	a82f664d0a	nir/opt_copy_prop_vars: Quiet valgrind warning about overlapping memcpy. The warning is kind of silly: Test case 'dEQP-GLES2.functional.shaders.indexing.tmp_array.vec3_const_write_static_read_vertex'.. ==1874780== Source and destination overlap in memcpy(0xa261690, 0xa261690, 160) ==1874780== at 0x484D498: __GI_memcpy (vg_replace_strmem.c:1037) ==1874780== by 0x596FC07: copy_entry_remove (nir_opt_copy_prop_vars.c:296) The "memcpy is undefined if they overlap" thing is surely meant to be "memcpy with partial overlap is undefined", but let's keep anyone else from having to debug this. Reviewed-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6178>	2020-08-27 21:39:50 +00:00
Jason Ekstrand	5e1c42d85f	nir: Call nir_metadata_preserve on !progress Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5171>	2020-06-11 05:08:12 +00:00
Boris Brezillon	345b5847b4	nir: Replace the scoped_memory barrier by a scoped_barrier SPIRV OpControlBarrier can have both a memory and a control barrier which some hardware can handle with a single instruction. Let's turn the scoped_memory_barrier into a scoped barrier which can embed both barrier types. Note that control-only or memory-only barriers can be supported through this new intrinsic by passing NIR_SCOPE_NONE to the unused barrier type. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Suggested-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4900>	2020-06-03 07:39:52 +00:00
Jason Ekstrand	f0e075ce6e	nir/copy_prop_vars: Record progress in more places Fixes: `96c32d7776` "nir/copy_prop_vars: handle load/store of vector..." Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5170>	2020-05-22 18:41:15 +00:00
Rhys Perry	a46aa3dc2e	nir: add missing group_memory_barrier handling Totals from 2 (0.00% of 127638) affected shaders: VGPRs: 164 -> 168 (+2.44%) CodeSize: 18420 -> 18756 (+1.82%) Instrs: 3658 -> 3700 (+1.15%) Cycles: 82912 -> 83080 (+0.20%) VMEM: 70 -> 69 (-1.43%) PreVGPRs: 155 -> 168 (+8.39%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> CC: <mesa-stable@lists.freedesktop.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4889>	2020-05-05 18:34:02 +00:00
Jason Ekstrand	ed67717167	nir/copy_prop_vars: Report progress when deleting self-copies Fixes: `62332d139c` "nir: Add a local variable-based copy prop..." Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4767>	2020-04-28 22:55:25 +00:00
Jason Ekstrand	d9af5277b3	nir/copy_prop_vars: Handle volatile better For deref_store, we can still delete invalid stores that write to statically OOB data. For everything, we need to make sure that we kill aliases of destinations even if it's volatile. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4767>	2020-04-28 22:55:25 +00:00
Caio Marcelo de Oliveira Filho	3cc501be69	nir: Add missing nir_var_mem_global to various passes Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3322> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3322>	2020-01-14 14:42:12 -08:00
Jason Ekstrand	d3737002ee	nir/lower_atomics_to_ssbo: Also lower barriers This is more correct for a pass which is supposed to completely lower away atomic counters. It also lets us stop supporting atomic counter barriers in most of the drivers. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3307> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3307>	2020-01-13 17:23:47 +00:00
Jason Ekstrand	e40b11bbcb	nir: Rename nir_intrinsic_barrier to control_barrier This is a more explicit name now that we don't want it to be doing any memory barrier stuff for us. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3307>	2020-01-13 17:23:47 +00:00
Jason Ekstrand	60097cc840	nir: Add a new memory_barrier_tcs_patch intrinsic Right now, it's implemented as a no-op for everyone. For most drivers, it's a switch case in the NIR -> whatever which just breaks. For ir3, they already have code to delete tessellation barriers so we just add a case to also delete memory_barrier_tcs_patch. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3307>	2020-01-13 17:23:47 +00:00
Jason Ekstrand	f09db0bed5	nir: Handle more barriers in dead_write and copy_prop Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3307>	2020-01-13 17:23:46 +00:00
Caio Marcelo de Oliveira Filho	73572abc2a	nir: Add scoped_memory_barrier intrinsic Add a NIR instrinsic that represent a memory barrier in SPIR-V / Vulkan Memory Model, with extra attributes that describe the barrier: - Ordering: whether is an Acquire or Release; - "Cache control": availability ("ensure this gets written in the memory") and visibility ("ensure my cache is up to date when I'm reading"); - Variable modes: which memory types this barrier applies to; - Scope: how far this barrier applies. Note that unlike in SPIR-V, the "Storage Semantics" and the "Memory Semantics" are split into two different attributes so we can use variable modes for the former. NIR passes that took barriers in consideration were also changed - nir_opt_copy_prop_vars: clean up the values for the mode of an ACQUIRE barrier. Copy propagation effect is to "pull up a load" (by not performing it), which is what ACQUIRE restricts. - nir_opt_dead_write_vars and nir_opt_combine_writes: clean up the pending writes for the modes of an RELEASE barrier. Dead writes effect is to "push down a store", which is what RELEASE restricts. - nir_opt_access: treat the ACQUIRE and RELEASE as a full barrier for the modes. This is conservative, but since this is a GL-specific pass, doesn't make a difference for now. v2: Fix the scoped barrier handling in copy propagation. (Jason) Add scoped barrier handling to nir_opt_access and nir_opt_combine_writes. (Rhys) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-10-24 11:39:55 -07:00
Connor Abbott	3a2ea2af9d	nir: Avoid coalescing vars created by lower_io_to_temporaries Right now nir_copy_prop_vars is effectively undoing nir_lower_io_to_temporaries for inputs by propagating the original variable through the copy created in lower_io_to_temporaries. A theoretical variable coalescing pass would have the same issue with output variables, although that doesn't exist yet. To fix this, add a new bit to nir_variable, and disable copy propagation when it's set. This doesn't seem to affect any drivers now, probably since since no one uses lower_io_to_temporaries for inputs as well as copy_prop_vars, but it will fix radv once we flip on lower_io_to_temporaries for fs inputs. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-08 14:14:53 +02:00
Connor Abbott	75063fbac5	nir/copy_prop_vars: Ignore volatile accesses The spec explicitly says that volatile writes can't be removed and volatile reads do not guarantee that the same value will still be around after the read, as if there were a barrier after each read/write. Just ignore them. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2019-06-19 14:08:28 +02:00
Brian Paul	e584fd894e	nir: silence three compiler warnings seen with MinGW Silence two unused var warnings. And init elem_size, elem_align to zero to silence "maybe uninitialized" warnings. Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>	2019-05-29 13:59:24 -06:00
Vinson Lee	77aa11ca32	nir: Fix anonymous union initialization with older GCC. Fix this build error with GCC 4.4.7. CC nir/nir_opt_copy_prop_vars.lo nir/nir_opt_copy_prop_vars.c: In function ‘load_element_from_ssa_entry_value’: nir/nir_opt_copy_prop_vars.c:454: error: unknown field ‘ssa’ specified in initializer nir/nir_opt_copy_prop_vars.c:455: error: unknown field ‘def’ specified in initializer nir/nir_opt_copy_prop_vars.c:456: error: unknown field ‘component’ specified in initializer nir/nir_opt_copy_prop_vars.c:456: error: extra brace group at end of initializer nir/nir_opt_copy_prop_vars.c:456: error: (near initialization for ‘(anonymous).<anonymous>’) nir/nir_opt_copy_prop_vars.c:456: warning: excess elements in union initializer nir/nir_opt_copy_prop_vars.c:456: warning: (near initialization for ‘(anonymous).<anonymous>’) Fixes: `96c32d7776` ("nir/copy_prop_vars: handle load/store of vector elements") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109810 Reviewed-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-03-22 10:43:41 -07:00

1 2

88 commits