fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-17 05:18:12 +02:00

Author	SHA1	Message	Date
Rhys Perry	aa2d6e020b	Revert "nir: Drop the unused instr arg for src/dest copy functions." This reverts commit `c3a0184118`. Acked-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12910>	2022-08-30 18:21:44 +00:00
Rhys Perry	1df320dae7	nir/serialize: remove unused parameter from read_src() Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12910>	2022-08-30 18:21:44 +00:00
Connor Abbott	9d9b891f94	nir: Free instructions more often Soon we'll be allocating instructions out of a per-shader pool, which means that if we don't free too many instructions during the main optimization loop, the final nir_sweep() call will create holes which can't be filled. By freeing instructions more aggressively, we can allocate more instructions from the freelist which will reduce the final memory usage. Modified from Connor Abbott's original patch to rebase on top of refactored DCE and so that the use-after-free in nir_algebraic_impl() is fixed. Co-authored-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12910>	2022-08-30 18:21:44 +00:00
Daniel Schürmann	9b843f8e4a	nir/opt_algebraic: a & ~a -> 0 Also re-ordered some optimizations for better readability. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18250>	2022-08-30 14:10:22 +00:00
Rhys Perry	797150c144	nir/lower_tex: ignore width of cube textures On AMD hardware, height is faster to access and we're already doing so. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17991>	2022-08-30 07:37:08 +00:00
Rhys Perry	fc06f0cbd5	nir/print: support nir_texop_descriptor_amd Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Fixes: `3098000e71` ("nir: add nir_texop_descriptor_amd") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17991>	2022-08-30 07:37:08 +00:00
Marcin Ślusarz	9f3eb63878	Revert "nir/lower_task_shader: don't use base index for shared memory intrinsics" This reverts commit `e5970fe22a`. Intel backend has implemented the missing functionality. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17618>	2022-08-29 12:42:40 +00:00
Marcin Ślusarz	3531c1e315	nir/lower_task_shader: print shader after each step Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17618>	2022-08-29 12:42:40 +00:00
Qiang Yu	a19dcdf9d5	nir,ac/llvm: add nir_intrinsic_load_viewport_xy_scale_and_offset Used by RADV/Radeonsi NGG culling. Pack them into a single vec4 load for radeonsi to reduce const buffer load. Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17651>	2022-08-26 05:50:30 +00:00
Qiang Yu	1aef9c8318	nir,ac/llvm: add nir_intrinsic_load_half_line_width_amd Used by AMD GPU NGG line culling. We could use nir load line width and viewport scale to calculate this in shader, but this way needs expensive divide ops. Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17651>	2022-08-26 05:50:30 +00:00
Georg Lehmann	c8ad1aeeb2	nir/fold_16bit_tex_image: Add an option to fold image sources. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18106>	2022-08-24 17:04:03 +00:00
Gert Wollny	13355232e4	nir_lower_atomics_to_ssbo: Initialize deref struct This fixes the use of an uninitialized value: Conditional jump or move depends on uninitialised value(s) bcmp (vg_replace_strmem.c:1203) _mesa_add_sized_state_reference (prog_parameter.c:434) st_nir_assign_uniform_locations(gl_context, gl_program, nir_shader) (st_glsl_to_nir.cpp:209) st_finalize_nir (st_glsl_to_nir.cpp:1041) by 0x58271B9: st_glsl_to_nir_post_opts(st_context, gl_program, gl_shader_program) (st_glsl_to_nir.cpp:571) ... Uninitialised value was created by a heap allocation malloc (vg_replace_malloc.c:381) ralloc_size (ralloc.c:114) ralloc_array_size (ralloc.c:218) deref_offset_var (nir_lower_atomics_to_ssbo.c:47) lower_instr (nir_lower_atomics_to_ssbo.c:111) nir_lower_atomics_to_ssbo (nir_lower_atomics_to_ssbo.c:204) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18227>	2022-08-24 16:02:03 +00:00
Georg Lehmann	8eac45b274	nir: Add nir_ssa_scalar_is_undef. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18183>	2022-08-24 15:22:40 +00:00
Timothy Arceri	0c8492cd3b	glsl: fix location for array subscript xfb_decl_assign_location() assumes that arrays are going to be packed. But some conditions might prevent packing (e.g: explicit location or smooth interpolation mode). Instead of assuming that packing will happen, this commit adds a check to determine if it'll happen and use the result to compute the proper location. Closes: https://gitlab.freedesktop.org/mesa/mesa/issues/2214 Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18175>	2022-08-24 02:19:34 +00:00
Timothy Arceri	04e7ed8323	glsl: make packed varying helper needs_lowering() external We will use this helper to correctly calculate xfb offsets in the following patch. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18175>	2022-08-24 02:19:34 +00:00
Yonggang Luo	a87195a653	glsl: Fixes [-Wdeprecated-declarations] in list_iterators.cpp Warning messages: ../src/compiler/glsl/tests/list_iterators.cpp:68:1: warning: 'InstantiateTestCase_P_IsDeprecated' is deprecated: INSTANTIATE_TEST_CASE_P is deprecated, please use INSTANTIATE_TEST_SUITE_P [-Wdeprecated-declarations] ../src/compiler/glsl/tests/list_iterators.cpp:187:1: warning: 'InstantiateTestCase_P_IsDeprecated' is deprecated: INSTANTIATE_TEST_CASE_P is deprecated, please use INSTANTIATE_TEST_SUITE_P [-Wdeprecated-declarations] Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18203>	2022-08-23 15:19:16 +00:00
Yonggang Luo	fd516fca15	nir: Fixes [-Wdeprecated-declarations] in serialize_tests.cpp Warning messages: ../src/compiler/nir/tests/serialize_tests.cpp:113:1: warning: 'InstantiateTestCase_P_IsDeprecated' is deprecated: INSTANTIATE_TEST_CASE_P is deprecated, please use INSTANTIATE_TEST_SUITE_P [-Wdeprecated-declarations] ../src/compiler/nir/tests/serialize_tests.cpp:119:1: warning: 'InstantiateTestCase_P_IsDeprecated' is deprecated: INSTANTIATE_TEST_CASE_P is deprecated, please use INSTANTIATE_TEST_SUITE_P [-Wdeprecated-declarations] Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18203>	2022-08-23 15:19:16 +00:00
Erik Faye-Lund	b08f293686	glsl/tests: do not perform eol-conversion on windows These tests fail on Windows, because we open the expected files in text-mode, performing EOL conversion. Instead, let's read them as binary files, and manually UTF-8 decode them to get the expected result. This fixes the tests on Windows for me. Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18179>	2022-08-23 09:16:19 +00:00
Ian Romanick	2b3e1d587d	glsl: Remove lower_offset_arrays pass It is no longer used. Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16547>	2022-08-23 01:10:23 +00:00
Ian Romanick	dbd022f2ab	nir: spirv: Allow 32-bit version of nir_intrinsic_is_sparse_texels_resident This intrinsic returns a Boolean. Both 1-bit and 32-bit versions must be allowed. Otherwise, size mismatches will occur after lowering 1-bit Booleans to 32-bit. Fixes: `4cbdf9ec4d` ("nir,spirv: implement SpvOpImageSparseTexelsResident") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16547>	2022-08-23 01:10:23 +00:00
Yonggang Luo	0f9b662f9a	meson: add enable-glcpp-tests option these are too intermittent to be left enabled on CI for now Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17928>	2022-08-22 14:18:53 +00:00
Timothy Arceri	87940c3193	glsl: dont lower precision for textureGatherOffsets textureGatherOffsets always takes a highp array of constants. As per the discussion in [1] trying to lower the precision results in segfault later on in the compiler as textureGatherOffsets will end up being passed a temp when its expecting a constant as required by the spec. [1] https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16547#note_1393704 Fixes: `b83f4b9fa2` ("glsl: Add an IR lowering pass to convert mediump operations to 16-bit") Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18101>	2022-08-18 23:45:04 +00:00
Mike Blumenkrantz	37aa92a3cd	nir: add uses_bindless flag for shader_info this is cumbersome to detect, so detect it here the flag denotes the use of either bindless texture operations or shader variables such that drivers can infer the use of bindless descriptor management functionality Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18088>	2022-08-17 21:53:02 +00:00
Qiang Yu	84956286a8	nir/lower_gs_intrinsics: fix primitive count for points When primitive is points, EndPrimitive can't be used to count primitive. Need to use vertex count instead. And it's also not needed to do vertex per primitive count and overwrite incomplete primitive work for points. Fixes: `2be99012e9` ("nir: Add ability to count emitted GS primitives.") Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17805>	2022-08-15 01:39:28 +00:00
Tapani Pälli	a3a04ed6f3	glsl: add check for too large atomic counter buffer offset Fixes upcoming CTS test for atomic counter buffer offsets. "It's being clarified that placing an atomic counter into a buffer at such an offset that the buffer is too large results in a compilation error." https://gitlab.khronos.org/Tracker/vk-gl-cts/-/issues/3124 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17905>	2022-08-12 10:45:53 +00:00
Tapani Pälli	a9b64bd7ad	glsl: allow image*Shadow keywords on ES and GLSL >= 420 These were not reserved keywords in GLSL ES and also allowed on desktop GLSL after 420. New CTS compiler tests will test this. https://gitlab.khronos.org/Tracker/vk-gl-cts/-/issues/3007 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17904>	2022-08-12 04:58:12 +00:00
Michael Tang	97902a9ef8	nir: add nir_instr_as_str Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12510>	2022-08-11 16:17:46 +00:00
Mike Blumenkrantz	c37c6ac613	nir/validate: add some (light) validation for sampler type matching this adds minimal validation for tex ops with derefs to check that the dest type integer-ness matches the sampled type's integer-ness the aim is to provide the most basic validation that nir is being modified and created consistently, not to perform exact verification that the types are identical fix #6985 Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17874>	2022-08-10 19:44:59 +00:00
Mike Blumenkrantz	b7eda568a4	nir/validate: clamp unsized tex dests to 32bit this is the "default" size that's expected cc: mesa-stable Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17874>	2022-08-10 19:44:59 +00:00
Pierre-Eric Pelloux-Prayer	70891edd97	nir: add a nir_opt_if_options enum And don't enable nir_opt_if_optimize_phi_true_false on radeonsi with LLVM 14 because it crashes Blender. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6976 Cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17949>	2022-08-10 12:55:39 +00:00
Timothy Arceri	8bffd601ed	Revert "nir: Preserve offsets in lower_io_to_scalar_early" This reverts commit `96fa23bca5`. The correct fix to the problem was `a1bc152340`, making this change obsolete as the pass skips any vars marked with always_active_io. There was no real advantage to allowing these vars to be split because they can't be removed anyway. Also there is no way to split varying arrays gracefully here due to the xfb layout rules, and this change didn't handle arrays at all. Removing this obsolete code also fixes an assert in the new CTS test KHR-Single-GL45.enhanced_layouts.xfb_all_stages. The test was legally adding xfb offsets to all vertex stages but since we only mark the varyings in the final vertex stage with the always_active_io flag the other stages were correctly lowering to scalars but when an array with an offset hit this code it asserted since it couldn't handle it. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Fixes: `a1bc152340` ("spirv: mark variables decorated with XfbBuffer as always active") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6928 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17878>	2022-08-08 01:37:20 +00:00
Iago Toral Quiroga	9d6770d20a	nir/lower_alu: drop unnecessary iand on uadd_carry result uadd_carry returns 1 or 0, so ANDing with 1 is unnecessary. Probably this was implemented thinking that it was returning a boolean value. shader-db results for V3D: total instructions in shared programs: 12463571 -> 12462964 (<.01%) instructions in affected programs: 28994 -> 28387 (-2.09%) helped: 110 HURT: 1 total uniforms in shared programs: 3704591 -> 3704588 (<.01%) uniforms in affected programs: 247 -> 244 (-1.21%) helped: 3 HURT: 0 total max-temps in shared programs: 2148138 -> 2148117 (<.01%) max-temps in affected programs: 729 -> 708 (-2.88%) helped: 23 HURT: 2 total sfu-stalls in shared programs: 21230 -> 21232 (<.01%) sfu-stalls in affected programs: 0 -> 2 helped: 0 HURT: 2 Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17903>	2022-08-06 23:11:40 +00:00
Karol Herbst	caf2794f6f	vtn: silence warning about linkage For OpenCL kernels we simply link together SPIR-V files, so the only case where we are left with linking shaders together is libclc and we handle that just fine. Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17334>	2022-08-04 23:53:50 +00:00
Karol Herbst	6637b1f41e	clc: undefine spirv defs to work around LLVMs headers Clang unconditionally adds those definitions if using a spirv LLVM target. That's not a problem on its own, but clang's internal OpenCL header enable a bunch of OpenCL extensions if those are set. Lucky for us, we can simply undefine them and spare us the trouble of finding an upstream solution to this problem :) This fixes the OpenCL CTS' compiler features_macro test. Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17334>	2022-08-04 23:53:50 +00:00
Jason Ekstrand	de2065496a	nir: Clean up and improve nir_dedup_inline_samplers It now removes dead inline sampler variables and moves everything to the end so we no longer need nir_move_inline_samplers_to_end(). Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17334>	2022-08-04 23:53:50 +00:00
Karol Herbst	2b12985465	nir: extract the clc inline sampler dedup pass from clc Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17334>	2022-08-04 23:53:50 +00:00
Karol Herbst	31ed24cec7	nir/lower_images: extract from clover Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17334>	2022-08-04 23:53:50 +00:00
Karol Herbst	01500198a6	nir: serialize printf metadata for CL kernels Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17334>	2022-08-04 23:53:49 +00:00
Karol Herbst	aa82808645	printf: extract clovers printf impl Also make the code cleaner and simpler. Signed-off-by: Karol Herbst <kherbst@redhat.com> Acked-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17334>	2022-08-04 23:53:49 +00:00
Dave Airlie	0bb03ffc76	gallium: use gl shader types as the basis for the gallium ones This should enable a rename transition. Trace needs to swap over to a non-generated version, but that should be fine. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17747>	2022-08-04 08:17:39 +00:00
Constantine Shablya	fa5559f272	nir: add a pass to remove non-uniform access qualifier when the operands are uniform Signed-off-by: Constantine Shablya <constantine.shablya@collabora.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17558>	2022-08-03 23:57:50 +00:00
Marek Olšák	e075769a53	nir: add shader_info::uses_resource_info_query for txs, levels, samples, etc. AMD will use this to execute a lowering pass conditionally. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17693>	2022-08-03 17:44:15 +00:00
Marek Olšák	3098000e71	nir: add nir_texop_descriptor_amd AMD will use it to emulate resinfo. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17693>	2022-08-03 17:44:15 +00:00
Marek Olšák	6483fd394e	nir: add nir_intrinsic_image_descriptor_amd This returns the AMD shader resource descriptor. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17693>	2022-08-03 17:44:15 +00:00
Marek Olšák	ea6993f9c7	nir: add nir_intrinsic_image_samples_identical radeonsi will use it Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17693>	2022-08-03 17:44:15 +00:00
Alyssa Rosenzweig	a4a15f500c	nir/lower_idiv: Be less creative about signs I'm sorry to whoever wrote this, but (x - (int) (x < 0)) ^ -((int) (x < 0)) is not an acceptable way to write iabs. Shader-db results on Intel Tiger Lake with lower_idiv enabled: total instructions in shared programs: 21122548 -> 21122570 (<.01%) instructions in affected programs: 2369 -> 2391 (0.93%) helped: 2 HURT: 8 total cycles in shared programs: 791609360 -> 791608062 (<.01%) cycles in affected programs: 114106 -> 112808 (-1.14%) helped: 9 HURT: 1 If we make the Intel back-end less stupid, we get to 9/1 helped/HURT for instructions as well but that's for a different MR. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17845>	2022-08-03 14:24:38 +00:00
Jason Ekstrand	25dcb8d201	nir/from_ssa: Ignore undef sources If a phi source is an undef, there's no point in copying it or really caring about it at all. We would just end up inserting a mov from an undef to a register. Instead, treat phi sources which point to an undef as if the phi source doesn't exist. This also prevents them from being included in phi webs which should reduce the overall interference seen in the shader. Currently, if two phis share an undef, their phi webs are considered to interfere. By ignoring undefs we can get rid of this false interference and reduce the size of phi webs. Reducing the number of things being copied by the parallel copy instructions should also free up the parallel copy algorithm and reduce the over-all churn of movs. Shader-db results on Haswell: total instructions in shared programs: 8156608 -> 8155406 (-0.01%) instructions in affected programs: 164838 -> 163636 (-0.73%) Shader-db results on Skylake: total instructions in shared programs: 18227370 -> 18227359 (<.01%) instructions in affected programs: 519 -> 508 (-2.12%) helped: 6 HURT: 0 Shader-db results on Tigerlake: total instructions in shared programs: 21167987 -> 21168025 (<.01%) instructions in affected programs: 23701 -> 23739 (0.16%) helped: 21 HURT: 27 Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16817>	2022-08-01 22:13:24 +00:00
Emma Anholt	31b9b04880	nir: Use nir_foreach_phi_src consistently. I copy-and-pasted one of these and people noted that we had a better tool, so make sure nobody else copy and pastes it. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17664>	2022-08-01 17:39:30 +00:00
Emma Anholt	a4bfe11a49	glsl: Remove opt_conditional_discard(). The nir_opt_conditional_discard pass is called anyway and covers discard/demote/terminate. iris shader-db: total instructions in shared programs: 8933422 -> 8933426 (<.01%) instructions in affected programs: 48 -> 52 (8.33%) helped: 0 HURT: 4 which is a synmark shader going from 12 to 13 instrs. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17664>	2022-08-01 17:39:30 +00:00
Emma Anholt	3714c89d0e	nir: Add an opt pass for phis after if choosing between true/false. This pattern almost always gets peephole-selected out anyway, but I noticed it once I removed glsl opt_conditional_discard. iris shader-db: total instructions in shared programs: 8933934 -> 8933158 (<.01%) instructions in affected programs: 75575 -> 74799 (-1.03%) helped: 179 HURT: 15 Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17664>	2022-08-01 17:39:30 +00:00

7219 commits