fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-22 17:50:12 +01:00

Author	SHA1	Message	Date
Jason Ekstrand	bd428162b6	nir/lower_io: Fix the unknown-array-index case in get_deref_align The current align_mul calculation in the unknown-array-index calculation is align_mul = MIN3(parent_mul, min_pow2_divisor(parent_offset), min_pow2_divisor(stride)) which is certainly correct if parent_offset > 0. However, when parent_offset = 0, min_pow2_divisor(parent_offset) isn't well-defined and our calculation for it is 1 << -1 which isn't well-defined. That said.... it's not actually needed. The offset to the base of the array is array_base = parent_mul * k + parent_offset for some integer k. When we throw in an unknown array index i, we get elem = parent_mul * k + parent_offset + stride * i. If we set new_align = MIN2(parent_mul, min_pow2_divisor(stride)), then both parent_mul and stride are divisible by new_align and elem = (parent_mul / new_alig) * new_align * k + (stride / new_align) * new_align * i + parent_offset = new_align * ((parent_mul / new_alig) * k + (stride / new_align) * i) + parent_offset so elem = new_align * j + parent_offset where j = (parent_mul / new_alig) * k + (stride / new_align) * i. That's a very long-winded way of saying that we can delete one parameter from the align_mul calculation and it's still fine. :-) Fixes: `480329cf8b` "nir: Add a helper for getting the alignment of a deref" Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Tested-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6628>	2020-09-07 17:29:10 +00:00
Jason Ekstrand	9641f483e9	nir: Allow uniform in nir_lower_vars_to_explicit_types Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6472>	2020-09-03 18:02:50 +00:00
Jason Ekstrand	3719b69dfc	nir: Allow var_mem_global in nir_lower_vars_to_explicit_types Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6472>	2020-09-03 18:02:50 +00:00
Jason Ekstrand	beefd37021	nir/lower_io: Apply alignments from derefs when available If the deref has no explicit alignment in the chain, we assume component alignment which is what we currently assume for all derefs today. This should be correct for all APIs in the sense that we can usually assume at least component alignment. However, for some APIs such as OpenCL, we could potentially make larger alignment assumptions. The intention is that those will be handled via alignment-increasing casts. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6472>	2020-09-03 18:02:50 +00:00
Jason Ekstrand	480329cf8b	nir: Add a helper for getting the alignment of a deref Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6472>	2020-09-03 18:02:50 +00:00
Jason Ekstrand	0654a9e823	nir: Handle all array stride cases in nir_deref_instr_array_stride This renames it to drop the ptr_as and makes it handle all of the stride cases. There's a bit of a tricky bit in here around Booleans but we currently use 32-bit for those always. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6472>	2020-09-03 18:02:50 +00:00
Jason Ekstrand	9414cbc13c	nir: Don't bail too early in lower_mem_constant_vars If there were no constant variables, we would bail out entirely. However, we may still have constant input pointers coming in from the client. Fixes: `4360a8a2b3` "nir/lower_io: Add support for nir_var_mem_constant" Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6472>	2020-09-03 18:02:50 +00:00
Marek Olšák	8c43edf9f9	nir: fix a bug in is_dual_slot in nir_io_add_const_offset_to_base Fixes: `01ab308edc` "nir: update IO semantics in nir_io_add_const_offset_to_base" Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6540>	2020-09-02 20:05:05 +00:00
Jason Ekstrand	c93ade93fb	nir/lower_explicit_io: Assert that compute address sizes match derefs Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6379>	2020-09-01 20:50:04 +00:00
Jason Ekstrand	4360a8a2b3	nir/lower_io: Add support for nir_var_mem_constant This commit adds support for nir_var_mem_constant various places. It also adds a pass similar to nir_lower_vars_to_explicit_types except it also scrapes out the constants and stuffs them into constant_data. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6379>	2020-09-01 20:50:03 +00:00
Jason Ekstrand	ef142c68e1	nir/lower_io: Add a build_addr_for_var helper The new version is more verbose but also more extensible. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6379>	2020-09-01 20:50:03 +00:00
Jason Ekstrand	965c268865	nir/lower_io: Use the variable mode for load_scratch_base_ptr checks Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6379>	2020-09-01 20:50:03 +00:00
Jason Ekstrand	ff124e3fe3	nir: Add a load_global_constant intrinsic This has the same semantics as load_global except the memory it reads is known to be constant so load_global_constant intrinsics can be CSEd rather than relying on more complex copy-propagation. Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6379>	2020-09-01 20:50:03 +00:00
Jason Ekstrand	4d18e71fea	nir: Rename num_shared to shared_size This one is always a size in bytes. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6524>	2020-09-01 17:30:51 +00:00
Jesse Natalie	865a2ad086	clover/nir/spirv: Use uniform rather than shader_in for kernel inputs The semantics of inputs for CL are a closer match to the semantics of uniforms for graphics. Rather than cross-stage data, it's data that every thread sees uniformly. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6497>	2020-08-31 19:58:14 +00:00
Italo Nicola	ee288f293b	nir: add shared/global atomics to nir_get_io_offset_src() Signed-off-by: Italo Nicola <italonicola@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6521>	2020-08-31 17:36:12 +00:00
Marek Olšák	01ab308edc	nir: update IO semantics in nir_io_add_const_offset_to_base Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6442>	2020-08-24 19:07:18 +00:00
Marek Olšák	502abfce7f	nir: save IO semantics in lowered IO intrinsics This enables drivers and utils to get all IO information from intrinsics, so that they don't have to walk the complex types of NIR variables to find out other information about IO intrinsics. NIR in/out variables can be removed after nir_lower_io. We could remove the variables in the pass, but for now I just decided to remove the variables in radeonsi before shaders are returned to st/mesa. (st/mesa just needs adjustments to work without NIR in/out variables) Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6442>	2020-08-24 19:07:18 +00:00
Rhys Perry	7530f66c16	nir: add and use nir_intrinsic_has_ helpers Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Rob Clark <robdclark@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6402>	2020-08-21 16:47:00 +00:00
Jesse Natalie	627c8e1640	nir: Add nir_address_format_32bit_index_offset_pack64 This new address mode is supported by nir_lower_explicit_io Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6330>	2020-08-17 14:36:18 +00:00
Jesse Natalie	113458d372	nir: Add nir_address_format_32bit_offset_as_64bit This new address mode is supported by nir_lower_explicit_io Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6330>	2020-08-17 14:36:18 +00:00
Marek Olšák	8a012f429d	nir: handle load_input_vertex in nir_get_io_offset_src Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6328>	2020-08-17 11:06:49 +00:00
Jason Ekstrand	d70fff99c5	nir: Use a single list for all shader variables Instead of having separate lists of variables, roughly sorted by mode, use a single list for all shader-level NIR variables. This makes a few list walks a bit longer here and there but list walks aren't a very common thing in NIR at all. On the other hand, it makes a lot of things like validation, printing, etc. way simpler. Also, there are a number of cases where we move variables from inputs/outputs to globals and this makes it way easier because we no longer have to move them between lists. We only have to deal with that if moving them from the shader to a nir_function_impl. Reviewed-by: Rob Clark <robdclark@chromium.org> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5966>	2020-07-29 17:38:58 +00:00
Jason Ekstrand	964c1c4b87	nir: Take a nir_shader and variable mode in assign_var_locations Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5966>	2020-07-29 17:38:58 +00:00
Karol Herbst	e2e89fb137	nir/lower_io: assert that offsets are used for shader_in Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6059>	2020-07-25 08:51:48 +00:00
Jason Ekstrand	c30824adc0	nir/lower_io: Add support for global scratch addressing This provides an alternate lowering for scratch in which it uses global reads/writes and bases scratch addresses on a base pointer. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5927>	2020-07-22 23:43:35 +00:00
Jason Ekstrand	4815ae51d7	nir/lower_io: Use b2b for shader and function temporaries This way we can avoid some unnecessary conversions because there's no need to sanitize to 0/1 for scratch. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5927>	2020-07-22 23:43:35 +00:00
Jason Ekstrand	3a2975db98	nir/lower_io: Choose to set access based on intrinsic metadata This should be far more reliable than trying to keep opcode lists up-to-date. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5927>	2020-07-22 23:43:35 +00:00
Jesse Natalie	0e90b3d0c4	nir: Support load/store of temps as scratch in nir_lower_explicit_io Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5889>	2020-07-14 18:15:40 +00:00
Jesse Natalie	99aaf0ec18	nir: When nir_lower_vars_to_explicit_types is run on temps, update scratch_size To allow interop with other scratch ops, append any remaining temp vars to the end of any already-allocated scratch space. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5889>	2020-07-14 18:15:40 +00:00
Jesse Natalie	bf138c1fd4	nir_lower_io: Add addr_format_is_offset helper Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5889>	2020-07-14 18:15:40 +00:00
Jason Ekstrand	a6ed1d7fa5	nir: Add docs to nir_lower[_explicit]_io Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5418>	2020-07-06 19:54:30 +00:00
Jason Ekstrand	0bc5a829dd	nir: Remove shared support from lower_io No drivers are using this anymore so we can delete it and not keep maintaining this legacy code-path. If any drivers want this in the future, they should use nir_lower_varst_to_explicit_types followed by nir_lower_explicit_io. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5418>	2020-07-06 19:54:30 +00:00
Jason Ekstrand	be96b069ad	nir: Assert that nir_lower_io is only called with allowed modes Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5418>	2020-07-06 19:54:30 +00:00
Connor Abbott	12e18d9e7a	nir: add vec2_index_32bit_offset address format For turnip, we use the "bindless" model on a6xx. Loads and stores with the bindless model require a bindless base, which is an immediate field in the instruction that selects between 5 different 64-bit "bindless base registers", a 32-bit descriptor index that's added to the base, and the usual 32-bit offset. The bindless base usually, but not always, corresponds to the Vulkan descriptor set. We can handle the case where the base is non-constant by using a bunch of if-statements, to make it a little easier in core NIR, and this seems to be what Qualcomm's driver does too. Therefore, the pointer format we need to use in NIR has a vec2 index, for the bindless base and descriptor index. Plumb this format through core NIR. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5683>	2020-07-06 16:44:15 +00:00
Samuel Pitoiset	86f21e4eba	nir/lower_explicit_io: fix NON_UNIFORM access for UBO loads Make sure to propagate the NON_UNIFORM access for UBO loads, so that non-uniform loads are correctly lowered. Cc: <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5311>	2020-06-08 07:35:43 +00:00
Jason Ekstrand	c217ee8d35	nir: Insert b2b1s around booleans in nir_lower_to By inserting a b2b1 around the load_ubo, load_input, etc. intrinsics generated by nir_lower_io, we can ensure that the intrinsic has the correct destination bit size. Not having the right size can mess up passes which try to optimize access. In particular, it was causing brw_nir_analyze_ubo_ranges to ignore load_ubo of booleans which meant that booleans uniforms weren't getting pushed as push constants. I don't think this is an actual functional bug anywhere hence no CC to stable but it may improve perf somewhere. Shader-db results on ICL with iris: total instructions in shared programs: 16076707 -> 16075246 (<.01%) instructions in affected programs: 129034 -> 127573 (-1.13%) helped: 487 HURT: 0 helped stats (abs) min: 3 max: 3 x̄: 3.00 x̃: 3 helped stats (rel) min: 0.45% max: 3.00% x̄: 1.33% x̃: 1.36% 95% mean confidence interval for instructions value: -3.00 -3.00 95% mean confidence interval for instructions %-change: -1.37% -1.29% Instructions are helped. total cycles in shared programs: 338015639 -> 337983311 (<.01%) cycles in affected programs: 971986 -> 939658 (-3.33%) helped: 362 HURT: 110 helped stats (abs) min: 1 max: 1664 x̄: 97.37 x̃: 43 helped stats (rel) min: 0.03% max: 36.22% x̄: 5.58% x̃: 2.60% HURT stats (abs) min: 1 max: 554 x̄: 26.55 x̃: 18 HURT stats (rel) min: 0.03% max: 10.99% x̄: 1.04% x̃: 0.96% 95% mean confidence interval for cycles value: -79.97 -57.01 95% mean confidence interval for cycles %-change: -4.60% -3.47% Cycles are helped. total sends in shared programs: 815037 -> 814550 (-0.06%) sends in affected programs: 5701 -> 5214 (-8.54%) helped: 487 HURT: 0 LOST: 2 GAINED: 0 The two lost programs were SIMD16 shaders in CS:GO. However, CS:GO was also one of the most helped programs where it shaves sends off of 134 programs. This seems to reduce GPU core clocks by about 4% on the first 1000 frames of the PTS benchmark. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4338>	2020-03-30 15:46:19 +00:00
Jason Ekstrand	d2dfcee7f7	nir: Use b2b opcodes for shared and constant memory No shader-db changes on ICL with iris Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4338>	2020-03-30 15:46:19 +00:00
Karol Herbst	87365e263e	nir/lower_ssbo: handle atomics Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/2753> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/2753>	2020-02-21 13:06:22 +00:00
Samuel Pitoiset	cf6cae832c	nir: lower interp_deref_at_vertex to load_input_vertex This introduces a new NIR intrinsic for loading inputs at a specific vertex index. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3578>	2020-01-29 09:49:50 +00:00
Samuel Pitoiset	d29f10a7ca	nir: add nir_intrinsic_interp_deref_at_vertex From the SPV_AMD_shader_explicit_vertex_parameter extension: "Returns the value of the input <interpolant> without any interpolation, i.e. the raw output value of previous shader stage." Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3578>	2020-01-29 09:49:50 +00:00
Kai Wasserbäch	8aa4d0bff6	nir: fix unused variable warning in nir_lower_vars_to_explicit_types This commit fixes the following warning: ../src/compiler/nir/nir_lower_io.c: In function ‘nir_lower_vars_to_explicit_types’: ../src/compiler/nir/nir_lower_io.c:1435:22: warning: unused variable ‘supported’ [-Wunused-variable] 1435 \| nir_variable_mode supported = nir_var_mem_shared \| nir_var_shader_temp \| nir_var_function_temp; \| ^~~~~~~~~ Signed-off-by: Kai Wasserbäch <kai@dev.carbon-project.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-11-07 11:32:55 +11:00
Timothy Arceri	7f106a2b5d	util: rename list_empty() to list_is_empty() This makes it clear that it's a boolean test and not an action (eg. "empty the list"). Reviewed-by: Eric Engestrom <eric@engestrom.ch>	2019-10-28 11:24:38 +00:00
Rob Clark	6320e37d4b	nir: add amul instruction Used for address/offset calculation (ie. array derefs), where we can potentially use less than 32b for the multiply of array idx by element size. For backends that support `imul24`, this gives a lowering pass an easy way to find multiplies that potentially can be converted to `imul24`. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2019-10-18 15:08:54 -07:00
Caio Marcelo de Oliveira Filho	c0c55bd84f	nir/lower_explicit_io: Handle 1 bit loads and stores Load a 32-bit value then convert to 1-bit. Convert 1-bit to 32-bit value, then Store it. These cases started to appear when we changed Anvil to use derefs for shared memory. v2: Use `bit_size` in a couple of places we were missing. (Jason) Reassign `value` instead of `src[0]`. (Jason) Fixes: `024a46a407` ("anv: use derefs for shared memory access") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-09-05 22:24:09 -07:00
Rhys Perry	fd73ed1bd7	nir: add nir_lower_to_explicit() v2: use glsl_type_size_align_func v2: move get_explicit_type() to glsl_types.cpp/nir_types.cpp v2: use align() instead of util_align_npot() v2: pack arrays a bit tighter v2: rename mem_* to field_* v2: don't attempt to handle when struct offsets are already set v2: use column_type() instead of recreating it v2: use a branch instead of \|= in nir_lower_to_explicit_impl() v2: assign locations to variables and update shared_size and num_shared v2: allow the pass to be used with nir_var_{shader_temp,function_temp} v4: rebase v5: add TODO v5: small formatting changes v5: remove incorrect assert in get_explicit_type() v5: rename to nir_lower_vars_to_explicit_types v5: correctly update progress when only variables are updated v5: rename get_explicit_type() to get_explicit_shared_type() v5: add comment explaining how get_explicit_shared_type() is different v5: update cast strides v6: update progress when lowering nir_var_function_temp variables v6: formatting changes v6: add more detailed documentation comment for get_explicit_shared_type v6: rename get_explicit_shared_type to get_explicit_type_for_size_align v7: fix comment in nir_lower_vars_to_explicit_types_impl() Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> (v5) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-08-08 12:10:39 -05:00
Rhys Perry	8bd2e138f5	nir/lower_explicit_io: add nir_var_mem_shared support v2: require nir_address_format_32bit_offset instead v3: don't call nir_intrinsic_set_access() for shared atomics Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-08-08 12:10:39 -05:00
Jason Ekstrand	078dcb7ccd	nir/lower_io: Add an option to lower 64-bit varyings Reviewed-by: Matt Turner <mattst88@gmail.com>	2019-07-31 18:14:09 -05:00
Jason Ekstrand	9700e45463	nir/lower_io: Return SSA defs from helpers I can't find a single place where nir_lower_io is called after going out of SSA which is the only real reason why you wouldn't do this. Returning SSA defs is more idiomatic and is required for the next commit. Reviewed-by: Matt Turner <mattst88@gmail.com>	2019-07-23 17:48:49 -05:00
Connor Abbott	133273aa22	nir/lower_io: Don't use variable to get deref mode Drivers only use lower_io for modes where pointers don't have a meaningful value, and dereferences can always be traced back to a variable. But there can be other modes, like global mode with VK_EXT_buffer_device_address, where pointers cannot be traced back to a variable, and lower_io would segfault on loads/stores of these since nir_deref_instr_get_variable() would return NULL. Just use the mode on the deref itself to filter out these modes before we try to get the variable. Fixes: `118a66df99` ("radv: Use NIR barycentric coordinates") Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-10 12:31:41 +02:00

1 2 3 4

174 commits