fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-25 02:10:11 +01:00

Author	SHA1	Message	Date
Jason Ekstrand	8a406fe055	nir: Fix builder usage in lower_mediump_vars() In our handling of load_deref, we were calling builder helpers to create conversions and then adjusting the destination bit size of the load. We should adjust the bit size first because the builder sometimes looks at the bit sizes of SSA values passed in as arguments. Even though it's not strictly necessary, adjust the store_deref case as well to make it fully symmetric with the load_deref case. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20067>	2022-12-01 01:10:12 +00:00
Emma Anholt	7e986e5f04	nir/lower_mediump_vars: Don't lower mediump shared vars with atomic access. I don't know of any GPUs doing 16-bit atomic accesses, nor do I know of anybody wanting that in shaders. But deqp has GLES CTS cases that set mediump on shared variables, so just skip lowering for those vars. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18452>	2022-09-14 14:56:22 -07:00
Emma Anholt	0cee5f3918	nir: Add a pass to lower mediump temps and shared mem. SPIRV and GLSL are reasonable at converting ALU ops to mediump, but variable storage would be wrapped in a 2f32/2mp on store/load, and if nir_vars_to_ssa doesn't make that storage go away then you'd have extra conversions. For compute shader shared mem, you'd waste memory too. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18259>	2022-09-01 22:39:39 +00:00
Emma Anholt	80b35fbefe	nir/lower_mediump: Lower FS outputs to 16-bit when the value was upconverted. Take this real-world (trimmed) shader: precision highp float; in lowp vec4 var_varVertexColor; layout(location = 0) out vec4 out_FragColor0; void main() { vec4 textureColor0 = vec4(1.000000e+00, 0.000000e+00, 0.000000e+00, 1.000000e+00); vec3 color = vec3(1.000000e+00, 1.000000e+00, 1.000000e+00); vec4 outColor = vec4(vec3((color).rgb), 1.000000e+00); (outColor *= vec4(var_varVertexColor)); (out_FragColor0 = outColor); } After opts, it's just a store from input to output. If we decide to lower the input to 16-bit, then as long as the driver can handle 16-bit outputs, it would be a good idea to demote the output and save the conversions. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18003>	2022-08-31 02:43:45 +00:00
Georg Lehmann	c8ad1aeeb2	nir/fold_16bit_tex_image: Add an option to fold image sources. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18106>	2022-08-24 17:04:03 +00:00
Georg Lehmann	8eac45b274	nir: Add nir_ssa_scalar_is_undef. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18183>	2022-08-24 15:22:40 +00:00
Georg Lehmann	df4b5914cd	nir/fold_16bit_tex_image: Default to only_fold_all. No driver doesn't use this option. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17757>	2022-07-27 18:57:12 +00:00
Georg Lehmann	a93786fc26	nir/lower_mediump: Add an option to only fold if all tex sources can be folded. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16978>	2022-07-21 19:15:03 +00:00
Georg Lehmann	87e3277b82	nir: Rewrite and merge 16bit tex folding pass with 16bit image folding pass. Allow folding constants/undef sources by sharing more code with the image_store 16bit folding pass. Allow more than one set of sources because RADV wants two, one for G16 (ddx/ddy) and one for A16 (all other sources). Allow folding cube sampling destination conversions on radeonsi/radv because I think the limitation only applies to sources. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16978>	2022-07-21 19:15:03 +00:00
Emma Anholt	6e087f96c9	nir_lower_mediump: Drop assertion about not containing movs. A 1D texture operation may need to do a mov to turn a reference to a channel of an SSA value into a scalar value to be passed as the texture coordinate (since texture srcs can't do swizzles). Seen in amnesia-the-dark-descent/low/46.shader_test() for example, where a 1D texture is used to remap each of r,g,b from a previous texture result. Besides, the nir_op_is_vec() case will (perhaps surprisingly) look through a mov, anyway. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16616>	2022-06-01 22:19:44 +00:00
Georg Lehmann	5833fab766	nir/lower_mediump: Add a new pass to fold 16bit image load/store. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15179>	2022-05-04 09:58:03 +00:00
Mike Blumenkrantz	27a43b531b	nir/fold_16bit_sampler_conversions: add a mask for supported sampler dims AMD might not support cubes, but that doesn't mean cubes can't be used on other drivers Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15852>	2022-04-20 12:12:36 +00:00
Georg Lehmann	81b2008af9	nir/legalize_16bit_sampler_srcs: Don't guess source type. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14895>	2022-03-23 20:55:39 +00:00
Georg Lehmann	b5fe1187ec	nir/fold_16bit_sampler_conversions: Fix src type mismatches. Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5996 Fixes: `fb29cef8` ("nir: add many passes that lower and optimize 16-bit input/outputs and samplers") Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14895>	2022-03-23 20:55:39 +00:00
Georg Lehmann	88ec73e5e8	nir/fold_16bit_sampler_conversions: Fix dest type mismatches. Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5996 Fixes: `fb29cef8dd` ("nir: add many passes that lower and optimize 16-bit input/outputs and samplers") Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14895>	2022-03-23 20:55:39 +00:00
Georg Lehmann	798e47be51	nir/fold_16bit_sampler_conversions: Don't fold dest upcasts. This is not a valid optimization. Fixes: `fb29cef8dd` ("nir: add many passes that lower and optimize 16-bit input/outputs and samplers") Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14895>	2022-03-23 20:55:39 +00:00
Marek Olšák	1dcd1eac6a	nir: pass nir_shader into nir_recompute_io_bases instead of func_impl Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14388>	2022-03-01 21:59:55 +00:00
Georg Lehmann	c2168f845e	nir/lower_mediump: Treat u2u16 like i2i16. There is a comment in nir_fold_16bit_sampler_conversions saying that these are the same, but the code only checks for i2i16. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14893>	2022-02-10 16:13:54 +00:00
Daniel Schürmann	1e4c6e059e	nir/fold_16bit_sampler_conversions: skip sparse residency tex instructions The residency return value mismatches between NIR and Radeon. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13592>	2021-11-15 18:28:20 +00:00
Alyssa Rosenzweig	9b57a81815	nir/lower_mediump: Fix metadata in all passes Fixes: `fb29cef8dd` ("nir: add many passes that lower and optimize 16-bit input/outputs and samplers") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11732>	2021-08-10 20:55:33 +00:00
Alyssa Rosenzweig	03c18f7efc	nir/lower_mediump_io: Don't remap base unless needed Otherwise drivers that don't use 16-bit slots for varyings will get confused and have their driver_locations scribbled over. This has caused multiple problems for both Panfrost and Asahi this week. Given the only other user of the pass for varyings is radeonsi, which needs both together, I think this is the least controversial fix. Fixes: `fb29cef8dd` ("nir: add many passes that lower and optimize 16-bit input/outputs and samplers") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11732>	2021-08-10 20:55:33 +00:00
Marek Olšák	fb29cef8dd	nir: add many passes that lower and optimize 16-bit input/outputs and samplers Added: * a pass that renumbers bases of IO intrinsics * a pass that converts mediump IO to 16 bits, optionally using the new packed varying slots * a pass that sets (forces) mediump in IO intrinsics (for testing) * a pass that remaps VARYING_SLOT_VAR[0..15]_16BIT to VARYING_SLOT_VAR[0..31] (if some shader stages don't want packed varyings) * a pass that folds type conversions around texture opcodes into those opcodes (e.g. tex(f2f32(coord), ..) is changed into tex accepting f16) * a pass that changes (legalizes) sampler src and dst types based on specified hw constraints (e.g. derivatives must be the same type as coordinates) Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9050>	2021-04-13 05:07:42 +00:00

22 commits