fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-18 11:38:06 +02:00

Author	SHA1	Message	Date
Timothy Arceri	8bffd601ed	Revert "nir: Preserve offsets in lower_io_to_scalar_early" This reverts commit `96fa23bca5`. The correct fix to the problem was `a1bc152340`, making this change obsolete as the pass skips any vars marked with always_active_io. There was no real advantage to allowing these vars to be split because they can't be removed anyway. Also there is no way to split varying arrays gracefully here due to the xfb layout rules, and this change didn't handle arrays at all. Removing this obsolete code also fixes an assert in the new CTS test KHR-Single-GL45.enhanced_layouts.xfb_all_stages. The test was legally adding xfb offsets to all vertex stages but since we only mark the varyings in the final vertex stage with the always_active_io flag the other stages were correctly lowering to scalars but when an array with an offset hit this code it asserted since it couldn't handle it. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Fixes: `a1bc152340` ("spirv: mark variables decorated with XfbBuffer as always active") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6928 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17878>	2022-08-08 01:37:20 +00:00
Iago Toral Quiroga	9d6770d20a	nir/lower_alu: drop unnecessary iand on uadd_carry result uadd_carry returns 1 or 0, so ANDing with 1 is unnecessary. Probably this was implemented thinking that it was returning a boolean value. shader-db results for V3D: total instructions in shared programs: 12463571 -> 12462964 (<.01%) instructions in affected programs: 28994 -> 28387 (-2.09%) helped: 110 HURT: 1 total uniforms in shared programs: 3704591 -> 3704588 (<.01%) uniforms in affected programs: 247 -> 244 (-1.21%) helped: 3 HURT: 0 total max-temps in shared programs: 2148138 -> 2148117 (<.01%) max-temps in affected programs: 729 -> 708 (-2.88%) helped: 23 HURT: 2 total sfu-stalls in shared programs: 21230 -> 21232 (<.01%) sfu-stalls in affected programs: 0 -> 2 helped: 0 HURT: 2 Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17903>	2022-08-06 23:11:40 +00:00
Karol Herbst	caf2794f6f	vtn: silence warning about linkage For OpenCL kernels we simply link together SPIR-V files, so the only case where we are left with linking shaders together is libclc and we handle that just fine. Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17334>	2022-08-04 23:53:50 +00:00
Karol Herbst	6637b1f41e	clc: undefine spirv defs to work around LLVMs headers Clang unconditionally adds those definitions if using a spirv LLVM target. That's not a problem on its own, but clang's internal OpenCL header enable a bunch of OpenCL extensions if those are set. Lucky for us, we can simply undefine them and spare us the trouble of finding an upstream solution to this problem :) This fixes the OpenCL CTS' compiler features_macro test. Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17334>	2022-08-04 23:53:50 +00:00
Jason Ekstrand	de2065496a	nir: Clean up and improve nir_dedup_inline_samplers It now removes dead inline sampler variables and moves everything to the end so we no longer need nir_move_inline_samplers_to_end(). Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17334>	2022-08-04 23:53:50 +00:00
Karol Herbst	2b12985465	nir: extract the clc inline sampler dedup pass from clc Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17334>	2022-08-04 23:53:50 +00:00
Karol Herbst	31ed24cec7	nir/lower_images: extract from clover Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17334>	2022-08-04 23:53:50 +00:00
Karol Herbst	01500198a6	nir: serialize printf metadata for CL kernels Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17334>	2022-08-04 23:53:49 +00:00
Karol Herbst	aa82808645	printf: extract clovers printf impl Also make the code cleaner and simpler. Signed-off-by: Karol Herbst <kherbst@redhat.com> Acked-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17334>	2022-08-04 23:53:49 +00:00
Dave Airlie	0bb03ffc76	gallium: use gl shader types as the basis for the gallium ones This should enable a rename transition. Trace needs to swap over to a non-generated version, but that should be fine. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17747>	2022-08-04 08:17:39 +00:00
Constantine Shablya	fa5559f272	nir: add a pass to remove non-uniform access qualifier when the operands are uniform Signed-off-by: Constantine Shablya <constantine.shablya@collabora.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17558>	2022-08-03 23:57:50 +00:00
Marek Olšák	e075769a53	nir: add shader_info::uses_resource_info_query for txs, levels, samples, etc. AMD will use this to execute a lowering pass conditionally. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17693>	2022-08-03 17:44:15 +00:00
Marek Olšák	3098000e71	nir: add nir_texop_descriptor_amd AMD will use it to emulate resinfo. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17693>	2022-08-03 17:44:15 +00:00
Marek Olšák	6483fd394e	nir: add nir_intrinsic_image_descriptor_amd This returns the AMD shader resource descriptor. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17693>	2022-08-03 17:44:15 +00:00
Marek Olšák	ea6993f9c7	nir: add nir_intrinsic_image_samples_identical radeonsi will use it Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17693>	2022-08-03 17:44:15 +00:00
Alyssa Rosenzweig	a4a15f500c	nir/lower_idiv: Be less creative about signs I'm sorry to whoever wrote this, but (x - (int) (x < 0)) ^ -((int) (x < 0)) is not an acceptable way to write iabs. Shader-db results on Intel Tiger Lake with lower_idiv enabled: total instructions in shared programs: 21122548 -> 21122570 (<.01%) instructions in affected programs: 2369 -> 2391 (0.93%) helped: 2 HURT: 8 total cycles in shared programs: 791609360 -> 791608062 (<.01%) cycles in affected programs: 114106 -> 112808 (-1.14%) helped: 9 HURT: 1 If we make the Intel back-end less stupid, we get to 9/1 helped/HURT for instructions as well but that's for a different MR. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17845>	2022-08-03 14:24:38 +00:00
Jason Ekstrand	25dcb8d201	nir/from_ssa: Ignore undef sources If a phi source is an undef, there's no point in copying it or really caring about it at all. We would just end up inserting a mov from an undef to a register. Instead, treat phi sources which point to an undef as if the phi source doesn't exist. This also prevents them from being included in phi webs which should reduce the overall interference seen in the shader. Currently, if two phis share an undef, their phi webs are considered to interfere. By ignoring undefs we can get rid of this false interference and reduce the size of phi webs. Reducing the number of things being copied by the parallel copy instructions should also free up the parallel copy algorithm and reduce the over-all churn of movs. Shader-db results on Haswell: total instructions in shared programs: 8156608 -> 8155406 (-0.01%) instructions in affected programs: 164838 -> 163636 (-0.73%) Shader-db results on Skylake: total instructions in shared programs: 18227370 -> 18227359 (<.01%) instructions in affected programs: 519 -> 508 (-2.12%) helped: 6 HURT: 0 Shader-db results on Tigerlake: total instructions in shared programs: 21167987 -> 21168025 (<.01%) instructions in affected programs: 23701 -> 23739 (0.16%) helped: 21 HURT: 27 Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16817>	2022-08-01 22:13:24 +00:00
Emma Anholt	31b9b04880	nir: Use nir_foreach_phi_src consistently. I copy-and-pasted one of these and people noted that we had a better tool, so make sure nobody else copy and pastes it. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17664>	2022-08-01 17:39:30 +00:00
Emma Anholt	a4bfe11a49	glsl: Remove opt_conditional_discard(). The nir_opt_conditional_discard pass is called anyway and covers discard/demote/terminate. iris shader-db: total instructions in shared programs: 8933422 -> 8933426 (<.01%) instructions in affected programs: 48 -> 52 (8.33%) helped: 0 HURT: 4 which is a synmark shader going from 12 to 13 instrs. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17664>	2022-08-01 17:39:30 +00:00
Emma Anholt	3714c89d0e	nir: Add an opt pass for phis after if choosing between true/false. This pattern almost always gets peephole-selected out anyway, but I noticed it once I removed glsl opt_conditional_discard. iris shader-db: total instructions in shared programs: 8933934 -> 8933158 (<.01%) instructions in affected programs: 75575 -> 74799 (-1.03%) helped: 179 HURT: 15 Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17664>	2022-08-01 17:39:30 +00:00
Eric Engestrom	2c67457e5e	util/list: rename LIST_ENTRY() to list_entry() This follows the Linux kernel convention, and avoids collision with macOS header macro. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6751 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6840 Cc: mesa-stable Signed-off-by: Eric Engestrom <eric@igalia.com> Acked-by: David Heidelberg <david.heidelberg@collabora.com> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17772>	2022-07-28 10:10:44 +00:00
Georg Lehmann	df4b5914cd	nir/fold_16bit_tex_image: Default to only_fold_all. No driver doesn't use this option. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17757>	2022-07-27 18:57:12 +00:00
Jesse Natalie	d216d32756	nir_lower_io_to_scalar: Support arrayed (per-vertex) I/O Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17603>	2022-07-23 14:48:17 +00:00
Ian Romanick	430d1a20eb	spirv: Fix array length of buffers larger than INT32_MAX. Like `90a8fb0355`. fossil-db results: All Skylake and newer Intel platforms had similar results. (Ice Lake shown) Instructions in all programs: 141442369 -> 141442363 (-0.0%) Instructions helped: 1 Cycles in all programs: 9099270231 -> 9099270187 (-0.0%) Cycles helped: 1 Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17637>	2022-07-22 17:54:28 +00:00
Emma Anholt	f6c5b1d6c6	nir: Split usub_sat lowering flag from uadd_sat. Intel vec4 would like to do uadd_sat, but use lowering for usub_sat. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17637>	2022-07-22 17:54:28 +00:00
Timothy Arceri	7c484b0c1c	glsl: correctly track cross slot component packing Otherwise we will mix and match mesa's custom cross slot packing with arb_enhanced_layouts style packing and we won't correctly handle the size of the vars needed for the mesa custom packing. The code was working correctly if the shader interface had both a matching input and output but when we only had one side of the interface we were only marking a single slot location as packed. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Fixes: `e5122a5543` ("glsl: add a NIR based varying linker") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6853 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17550>	2022-07-22 10:43:18 +00:00
Georg Lehmann	a93786fc26	nir/lower_mediump: Add an option to only fold if all tex sources can be folded. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16978>	2022-07-21 19:15:03 +00:00
Georg Lehmann	87e3277b82	nir: Rewrite and merge 16bit tex folding pass with 16bit image folding pass. Allow folding constants/undef sources by sharing more code with the image_store 16bit folding pass. Allow more than one set of sources because RADV wants two, one for G16 (ddx/ddy) and one for A16 (all other sources). Allow folding cube sampling destination conversions on radeonsi/radv because I think the limitation only applies to sources. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16978>	2022-07-21 19:15:03 +00:00
Emma Anholt	74aea0b840	glsl: Remove optimize_swizzles. It will get turned into SSA and copy-propagated in NIR, no need to walk the IR collapsing it here. iris shader-db results appear to be noise: total instructions in shared programs: 8932195 -> 8932147 (<.01%) instructions in affected programs: 537 -> 489 (-8.94%) LOST: 12 GAINED: 11 lost/gained are simd32 switches in unigine, l4d2, portal2, asphalt9. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17613>	2022-07-20 20:45:09 +00:00
Emma Anholt	f075376823	glsl: Use the same NIR path for shared mem lowering as SPIRV does. Now that we have no non-NIR drivers, we can retire the old code. We just need to pass the variable accesses through to it. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17610>	2022-07-20 19:32:48 +00:00
Alejandro Piñeiro	8d3ce4eb06	nir: call nir_metadata_preserve at nir_remove_unused_io_vars Without it we got a metadata assert: deqp-vk: ../src/compiler/nir/nir_metadata.c:108: nir_metadata_check_validation_flag: Assertion `!(function->impl->valid_metadata & nir_metadata_not_properly_reset)' failed if we try to use NIR_PASS(_, instead of NIR_PASS_V (which, among other things, does more validation). Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17609>	2022-07-20 11:35:24 +00:00
Marcin Ślusarz	5e14445430	nir: convert unused mesh outputs to shared memory Otherwise reads from output in one subgroup may not see writes from other subgroups. Temp variables are later converted to scratch, so even within one subgroup we may not see correct values. Test case in https://gitlab.freedesktop.org/mesa/crucible/-/merge_requests/115 Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17517>	2022-07-20 09:22:06 +00:00
Timothy Arceri	d1e36634bd	nir/loop_unroll: clean up after complex_unroll_single_terminator() Previously we would just unroll the loop one extra iteration and let other optimisation passes clean up the mess. This worked to a degree but if the loop happened to be nested inside another loop we would end up with phi chains that would block other passes from being able to do the cleanup. With this commit we explicitly clone the variables created by lcssa and insert them directly in the last continue branch after we are done unrolling. With this, optimisation passes can recognise that both sides of the if output the same values and can progress further. Helps with the issues described in: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6051 Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17611>	2022-07-20 03:47:45 +00:00
Daniel Schürmann	6f25d45877	spirv: handle phis decorated with RelaxedPrecision If the driver can do 16-bit ALU ops, then store RelaxedPrecision phi values into 16-bit NIR variables with downconverts/upconverts on the way in/out. This has no impact on shader-db on freedreno (not that we have a ton of GLES content there), but it does cause an ANGLE-translated CTS shader on vulkan to get consistent conversions between two copies of a value, and avoid a test bug. Reviewed-by: Emma Anholt <emma@anholt.net> Closes: #6585 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14018>	2022-07-19 21:58:01 +00:00
Konstantin Seurer	fab0050223	nir: Add a common gen_rect_vertices implementation Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17535>	2022-07-19 12:47:30 +00:00
Iago Toral Quiroga	b18cecbfb6	nir: add nir_address_format_2x32bit_global This adds support for global 64-bit GPU addresses as a pair of 32-bit values. This is useful for platforms with 32-bit GPUs that want to support VK_KHR_buffer_device_address, which makes GPU addresses explicitly 64-bit. With the new format we also add new global intrinsics with 2x32 suffix that consume the new address format. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17275>	2022-07-19 09:47:34 +02:00
Arvind Yadav	8adbd2a964	ac/llvm: Implement nir_intrinsic_load_point_coord_maybe_flipped opcodes Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15117>	2022-07-16 07:08:10 -04:00
Arvind Yadav	30865756db	nir: Add a lowering pass for point smoothing When point smoothing is enabled, this lowering pass modifies the alpha component of every write to a fragment output. Anti-aliased points get rounded with respect to their radius instead of square. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15117>	2022-07-16 07:08:09 -04:00
Arvind Yadav	cad4908fa0	nir: add load_point_coord_maybe_flipped intrinsics for point smoothing gl_PointCoord can be flipped upside down via a state. To avoid this, add new load_point_coord_maybe_flipped intrinsics. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15117>	2022-07-16 07:07:32 -04:00
Arvind Yadav	2709786bde	nir: Add a lowering pass for polygon and line smoothing When poly_line smoothing is enabled then this lowering pass will modify the alpha component of every write to fragment output using sample coverage mask. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16245>	2022-07-16 10:15:22 +00:00
Lionel Landwerlin	a41e8dc588	spirv: switch to uint64 for rayquery internal type Fixes dEQP-VK.ray_query.advanced.using_wrapper_function.comp.* An empty struct is causing problems because when passing it as argument the spirv parser will just drop the argument, considering it does not hold any data. v2: update radv CI Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `4c703686db` ("spirv: handle ray query intrinsics") Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17420>	2022-07-14 09:15:52 +00:00
Jason Ekstrand	1124bee4ba	glsl/nir: Set sample_shading if a FS output ever shows up as an rvalue If framebuffer fetch is used, we have to enable sample shading because the fetched framebuffer value is per-sample. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14020>	2022-07-13 20:28:42 +00:00
Jason Ekstrand	3cf103f23d	nir/gather_info: Stop gathering uses_sample_shading Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14020>	2022-07-13 20:28:42 +00:00
Jason Ekstrand	b6543470fe	spirv/nir: Set uses_sample_shading from spirv_to_nir We don't really want to base this on a late nir_gather_info for two reasons: 1) The Vulkan spec says that if a sample-qualified input, SampleID, or SamplePosition are in the entry-point's interface, you get per-sample dispatch. This means we really should gather this information before dead-code has a chance to delete anything. 2) We want to be able to add nir_intrinsic_load_sample_pos intrinsics as part of lowering passes without causing per-sample interpolation. This means nir_gather_info needs to stop gathering it. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14020>	2022-07-13 20:28:42 +00:00
Jason Ekstrand	830654b7b0	glsl/nir: Set uses_sample_shading from glsl_to_nir We don't really want to base this on a late nir_gather_info for two reasons: 1) The GL spec says that any static use of a sample-qualified input, gl_SampleID, or gl_SamplePosition causes per-sample dispatch. This means we really should gather this information before dead-code has a chance to delete anything. 2) We want to be able to add nir_intrinsic_load_sample_pos intrinsics as part of lowering passes without causing per-sample interpolation. This means nir_gather_info needs to stop gathering it. For 1, this doesn't actually get us quite there as GLSL IR may have deleted something already. However, it does get us closer. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14020>	2022-07-13 20:28:42 +00:00
Jason Ekstrand	23b2d625dd	nir: Add a pass for lowering shaders to single-sampled On Intel, we have to do this because we can't ask for the per-sample barycentrics without setting the per-sample dispatch bit or the GPU will hang. However, nothing we're doing in this pass is Intel-specific and it may be a useful optimization for someone else so we may as well make it a generic NIR pass. This version actually does a bit more than the current brw_nir_demote_sample_qualifiers() pass as it also handles pre-nir_lower_io interp_dref_at* as well as a couple system values which we can easily constant-fold. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14020>	2022-07-13 20:28:42 +00:00
Georg Lehmann	aac8ddae2f	nir/opt_algebraic: Optimize [ui](add\|sub)_sat with 0. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17468>	2022-07-13 07:34:09 +00:00
Georg Lehmann	90a8fb0355	nir/lower_io: Fix array length of buffers larger than INT32_MAX. Before, if the ssbo is too large this would always return 0. Also, this code is easier to optimize, so the common case of offset 0 and pot stride results in one ushr instead of 5+ instructions. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17468>	2022-07-13 07:34:09 +00:00
Eric Engestrom	9844a2fb64	nir: use updated tokens from vk.xml Signed-off-by: Eric Engestrom <eric@igalia.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17342>	2022-07-12 15:53:11 +00:00
Emma Anholt	0e1fb2d984	nir+ir3: Rename load_size_ir3 to load_center_rhw_ir3. Now that we know what it does, it also explains what it's doing in interpolateAtOffset in ir3. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17322>	2022-07-11 16:56:05 +00:00

8839 commits