fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-22 13:30:12 +01:00

Author	SHA1	Message	Date
Alyssa Rosenzweig	59e73674c3	nir: Drop legacy atomics in simple cases This commit drops legacy atomic support from core passes where we can simply delete switch cases with no other changes. As such it's separated from the more complex pass-specific commits for ease of review. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23036>	2023-05-16 22:36:21 +00:00
Alyssa Rosenzweig	97857731ce	nir: Handle unified atomics in simple cases Lots of passes can be made unified-atomics-aware simply by adding extra cases in their switch statements. This commit fixes a bunch of passes. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22914>	2023-05-12 20:39:46 +00:00
Mihai Preda	2a9ef6b1f8	nir: update nir->num_inputs, num_outputs in nir_recompute_io_bases() Since nir_recompute_io_bases is going to be used instead of nir_assign_io_var_locations, we need to update num_inputs, num_outputs. Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21861>	2023-04-19 21:42:11 +00:00
Emma Anholt	5873dcb32f	nir/lower_mediump: Fix assertion about copy_deref lowering matching. Copy and paste typo. We shouldn't have copy_derefs during this pass, anyway, but caught a failure with my upcoming unit testing. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21666>	2023-03-21 00:51:24 +00:00
Georg Lehmann	0a3387a190	nir/lower_mediump: don't use fp16 for constants if the result is denormal Image stores are not required to preserve denorms. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21622>	2023-03-02 11:42:10 +00:00
Georg Lehmann	a00b50d820	nir: change 16bit image dest folding option to per type Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21404>	2023-02-27 09:55:34 +00:00
Danylo Piliaiev	1c9ee30838	nir/fold_16bit_tex_image: Add type granularity for dst folding Some HW may be able to fold only some of dst types, e.g. for Adreno folding i32 -> i16 could cause a different result since folded variant clamps the result instead of masking it. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20396>	2022-12-23 15:48:18 +01:00
Jason Ekstrand	8a406fe055	nir: Fix builder usage in lower_mediump_vars() In our handling of load_deref, we were calling builder helpers to create conversions and then adjusting the destination bit size of the load. We should adjust the bit size first because the builder sometimes looks at the bit sizes of SSA values passed in as arguments. Even though it's not strictly necessary, adjust the store_deref case as well to make it fully symmetric with the load_deref case. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20067>	2022-12-01 01:10:12 +00:00
Emma Anholt	7e986e5f04	nir/lower_mediump_vars: Don't lower mediump shared vars with atomic access. I don't know of any GPUs doing 16-bit atomic accesses, nor do I know of anybody wanting that in shaders. But deqp has GLES CTS cases that set mediump on shared variables, so just skip lowering for those vars. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18452>	2022-09-14 14:56:22 -07:00
Emma Anholt	0cee5f3918	nir: Add a pass to lower mediump temps and shared mem. SPIRV and GLSL are reasonable at converting ALU ops to mediump, but variable storage would be wrapped in a 2f32/2mp on store/load, and if nir_vars_to_ssa doesn't make that storage go away then you'd have extra conversions. For compute shader shared mem, you'd waste memory too. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18259>	2022-09-01 22:39:39 +00:00
Emma Anholt	80b35fbefe	nir/lower_mediump: Lower FS outputs to 16-bit when the value was upconverted. Take this real-world (trimmed) shader: precision highp float; in lowp vec4 var_varVertexColor; layout(location = 0) out vec4 out_FragColor0; void main() { vec4 textureColor0 = vec4(1.000000e+00, 0.000000e+00, 0.000000e+00, 1.000000e+00); vec3 color = vec3(1.000000e+00, 1.000000e+00, 1.000000e+00); vec4 outColor = vec4(vec3((color).rgb), 1.000000e+00); (outColor *= vec4(var_varVertexColor)); (out_FragColor0 = outColor); } After opts, it's just a store from input to output. If we decide to lower the input to 16-bit, then as long as the driver can handle 16-bit outputs, it would be a good idea to demote the output and save the conversions. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18003>	2022-08-31 02:43:45 +00:00
Georg Lehmann	c8ad1aeeb2	nir/fold_16bit_tex_image: Add an option to fold image sources. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18106>	2022-08-24 17:04:03 +00:00
Georg Lehmann	8eac45b274	nir: Add nir_ssa_scalar_is_undef. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18183>	2022-08-24 15:22:40 +00:00
Georg Lehmann	df4b5914cd	nir/fold_16bit_tex_image: Default to only_fold_all. No driver doesn't use this option. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17757>	2022-07-27 18:57:12 +00:00
Georg Lehmann	a93786fc26	nir/lower_mediump: Add an option to only fold if all tex sources can be folded. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16978>	2022-07-21 19:15:03 +00:00
Georg Lehmann	87e3277b82	nir: Rewrite and merge 16bit tex folding pass with 16bit image folding pass. Allow folding constants/undef sources by sharing more code with the image_store 16bit folding pass. Allow more than one set of sources because RADV wants two, one for G16 (ddx/ddy) and one for A16 (all other sources). Allow folding cube sampling destination conversions on radeonsi/radv because I think the limitation only applies to sources. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16978>	2022-07-21 19:15:03 +00:00
Emma Anholt	6e087f96c9	nir_lower_mediump: Drop assertion about not containing movs. A 1D texture operation may need to do a mov to turn a reference to a channel of an SSA value into a scalar value to be passed as the texture coordinate (since texture srcs can't do swizzles). Seen in amnesia-the-dark-descent/low/46.shader_test() for example, where a 1D texture is used to remap each of r,g,b from a previous texture result. Besides, the nir_op_is_vec() case will (perhaps surprisingly) look through a mov, anyway. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16616>	2022-06-01 22:19:44 +00:00
Georg Lehmann	5833fab766	nir/lower_mediump: Add a new pass to fold 16bit image load/store. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15179>	2022-05-04 09:58:03 +00:00
Mike Blumenkrantz	27a43b531b	nir/fold_16bit_sampler_conversions: add a mask for supported sampler dims AMD might not support cubes, but that doesn't mean cubes can't be used on other drivers Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15852>	2022-04-20 12:12:36 +00:00
Georg Lehmann	81b2008af9	nir/legalize_16bit_sampler_srcs: Don't guess source type. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14895>	2022-03-23 20:55:39 +00:00
Georg Lehmann	b5fe1187ec	nir/fold_16bit_sampler_conversions: Fix src type mismatches. Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5996 Fixes: `fb29cef8` ("nir: add many passes that lower and optimize 16-bit input/outputs and samplers") Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14895>	2022-03-23 20:55:39 +00:00
Georg Lehmann	88ec73e5e8	nir/fold_16bit_sampler_conversions: Fix dest type mismatches. Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5996 Fixes: `fb29cef8dd` ("nir: add many passes that lower and optimize 16-bit input/outputs and samplers") Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14895>	2022-03-23 20:55:39 +00:00
Georg Lehmann	798e47be51	nir/fold_16bit_sampler_conversions: Don't fold dest upcasts. This is not a valid optimization. Fixes: `fb29cef8dd` ("nir: add many passes that lower and optimize 16-bit input/outputs and samplers") Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14895>	2022-03-23 20:55:39 +00:00
Marek Olšák	1dcd1eac6a	nir: pass nir_shader into nir_recompute_io_bases instead of func_impl Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14388>	2022-03-01 21:59:55 +00:00
Georg Lehmann	c2168f845e	nir/lower_mediump: Treat u2u16 like i2i16. There is a comment in nir_fold_16bit_sampler_conversions saying that these are the same, but the code only checks for i2i16. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14893>	2022-02-10 16:13:54 +00:00
Daniel Schürmann	1e4c6e059e	nir/fold_16bit_sampler_conversions: skip sparse residency tex instructions The residency return value mismatches between NIR and Radeon. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13592>	2021-11-15 18:28:20 +00:00
Alyssa Rosenzweig	9b57a81815	nir/lower_mediump: Fix metadata in all passes Fixes: `fb29cef8dd` ("nir: add many passes that lower and optimize 16-bit input/outputs and samplers") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11732>	2021-08-10 20:55:33 +00:00
Alyssa Rosenzweig	03c18f7efc	nir/lower_mediump_io: Don't remap base unless needed Otherwise drivers that don't use 16-bit slots for varyings will get confused and have their driver_locations scribbled over. This has caused multiple problems for both Panfrost and Asahi this week. Given the only other user of the pass for varyings is radeonsi, which needs both together, I think this is the least controversial fix. Fixes: `fb29cef8dd` ("nir: add many passes that lower and optimize 16-bit input/outputs and samplers") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11732>	2021-08-10 20:55:33 +00:00
Marek Olšák	fb29cef8dd	nir: add many passes that lower and optimize 16-bit input/outputs and samplers Added: * a pass that renumbers bases of IO intrinsics * a pass that converts mediump IO to 16 bits, optionally using the new packed varying slots * a pass that sets (forces) mediump in IO intrinsics (for testing) * a pass that remaps VARYING_SLOT_VAR[0..15]_16BIT to VARYING_SLOT_VAR[0..31] (if some shader stages don't want packed varyings) * a pass that folds type conversions around texture opcodes into those opcodes (e.g. tex(f2f32(coord), ..) is changed into tex accepting f16) * a pass that changes (legalizes) sampler src and dst types based on specified hw constraints (e.g. derivatives must be the same type as coordinates) Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9050>	2021-04-13 05:07:42 +00:00

29 commits