fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-17 22:38:06 +02:00

Author	SHA1	Message	Date
Jason Ekstrand	3d9ffdcc72	nir/lower_memcpy: Don't mask the store For constant-size memcpys, we can do as much as a vec4 at a time. We were accidentally masking the store to only the .x component. Fixes: `a3177cca99` "nir: Add a lowering pass to lower memcpy" Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7305>	2020-10-26 14:47:19 +00:00
Jason Ekstrand	3ba786f624	spirv: Fix OpCopyMemorySized I have no idea how we are passing CTS tests with that bug in there. I guess by luck? Fixes: `8323c03bbf` "spirv: Add support for OpCopyMemorySized" Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7294>	2020-10-23 16:46:52 +00:00
Connor Abbott	4ca38a1995	nir/lower_clip_cull: Store array size for FS inputs I think the rationale for not setting the size for inputs is that when passed between geometry stages the clip and cull distances are supposed to be treated like any other varying. However, this isn't 100% the case for the FS, since when it's read by the FS it's also used by the fixed-function stage. In freedreno we setup varying locations when compiling the FS, and then tack on VS-only outputs like gl_Position at the end. Furthermore there's code to compact input locations based on what's actually read. But this compaction can't happen for clip and cull distances, because then we won't have space for components that are only read by the clipper. So, we need to know the original number of components for both arrays. Modify this pass so that we don't have to go digging around for it ourselves. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6959>	2020-10-23 11:09:18 +00:00
Andrii Simiklit	d972a6ac4c	nir: get rid of OOB dereferences in nir_lower_io_arrays_to_elements This patch fixes mesa compiler crash in i965 on shaders like the following one: ``` in VS_OUTPUT { mat4 data; } vs_output; out vec4 fs_output; vec4 convert(in float val) { return vec4(val); } void main() { fs_output = vec4(0.0); for (int a = -1; a < 5; a++) { for (int b = -1; b < 5; b++) { fs_output += convert(vs_output.data[b][a]); } } } ``` Section 5.11 (Out-of-Bounds Accesses) of the GLSL 4.60 spec says: In the subsections described above for array, vector, matrix and structure accesses, any out-of-bounds access produced undefined behavior.... Out-of-bounds reads return undefined values, which include values from other variables of the active program or zero. Out-of-bounds writes may be discarded or overwrite other variables of the active program. GL_KHR_robustness and GL_ARB_robustness encourage us to return zero for reads. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Signed-off-by: Andrii Simiklit <andrii.simiklit@globallogic.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6560>	2020-10-23 09:51:38 +00:00
Vinson Lee	53fc3eb4a2	glsl: Initialize lower_shared_reference_visitor members. Fix defects reported by Coverity Scan. Uninitialized scalar field (UNINIT_CTOR) uninit_member: Non-static class member buffer_access_type is not initialized in this constructor nor in any functions that it calls. uninit_member: Non-static class member progress is not initialized in this constructor nor in any functions that it calls. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7243>	2020-10-23 00:27:03 +00:00
Ian Romanick	67956689bb	nir: Rename replicated-result dot-product instructions All these instructions replicate the result of a N-component dot-product to a vec4. Naming them fdot_replicatedN gives the impression that are some sort of abstract dot-product that replicates the result to a vecN. They also deviate from fdph_replicated... which nobody would reasonably consider naming fdot_replicatedh. Naming these opcodes fdotN_replicated more closely matches what they are, and it matches the pattern of fdph_replicated. I believe that the only reason these opcodes were named this way was because it simplified the implementation of the binop_reduce function in nir_opcodes.py. I made some fairly simple changes to that function, and I think the end result is ok. The bulk of the changes come from the sed rename: sed --in-place -e 's/fdot_replicated\([234]\)/fdot\1_replicated/g' \ $(grep -r 'fdot_replicated[234]' src/) v2: Use a named parameter to binop_reduce instead of using isinstance(name, str). Suggested by Jason. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5725>	2020-10-22 18:00:19 +00:00
Jan Beich	8cee9ce750	spirv: switch to util_bswap32 to improve portability `bswap_32` and `<byteswap.h>` aren't available on BSDs. Instead the same function is spelled slightly different and is provided by different header file. However, Mesa provides `util_bswap32` to avoid complicated conditionals. Fixes: `fb6b243c11` ("spirv: Support big-endian strings") Tested-by: Piotr Kubaj <pkubaj@FreeBSD.org> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7257>	2020-10-22 17:02:49 +00:00
Gert Wollny	b739bb7168	compile/nir: Correct printing dest_type Fixes: `0aa08ae2f6` nir: Split NIR_INTRINSIC_TYPE into separate src/dest indices Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7261>	2020-10-22 11:39:34 +00:00
Rhys Perry	4735c8a522	nir/loop_analyze: adjust force unrolling to only include interesting modes Instead of force-unrolling any loop which reads an entire array, only do it for arrays which might be faster to access with constant indices. Significantly improves compile-time for these CTS tests, which could previously timeout: dEQP-VK.spirv_assembly.instruction.graphics.16bit_storage.struct_mixed_types.uniform_buffer_block_geom dEQP-VK.spirv_assembly.instruction.graphics.16bit_storage.struct_mixed_types.uniform_geom dEQP-VK.spirv_assembly.instruction.graphics.8bit_storage.struct_mixed_types.storage_buffer_geom dEQP-VK.spirv_assembly.instruction.graphics.spirv_ids_abuse.lots_ids_geom fossil-db (Navi): Totals from 19 (0.01% of 137413) affected shaders: SGPRs: 1728 -> 1688 (-2.31%) VGPRs: 1176 -> 1168 (-0.68%) CodeSize: 198496 -> 136580 (-31.19%) MaxWaves: 154 -> 156 (+1.30%) Instrs: 38889 -> 26029 (-33.07%) Cycles: 446108 -> 1059924 (+137.59%); split: -0.91%, +138.51% VMEM: 3245 -> 2926 (-9.83%) SMEM: 850 -> 828 (-2.59%); split: +4.71%, -7.29% VClause: 549 -> 533 (-2.91%) SClause: 1810 -> 1522 (-15.91%) Copies: 2209 -> 1705 (-22.82%); split: -22.95%, +0.14% Branches: 854 -> 603 (-29.39%); split: -29.86%, +0.47% PreSGPRs: 1512 -> 1506 (-0.40%); split: -0.53%, +0.13% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7161>	2020-10-22 12:07:45 +01:00
Vinson Lee	025050bae7	glsl: Initialize ir_if_to_cond_assign_visitor members in constructor. Fix defects reported by Coverity Scan. Uninitialized scalar field (UNINIT_CTOR) uninit_member: Non-static class member found_unsupported_op is not initialized in this constructor nor in any functions that it calls. uninit_member: Non-static class member found_expensive_op is not initialized in this constructor nor in any functions that it calls. uninit_member: Non-static class member found_dynamic_arrayref is not initialized in this constructor nor in any functions that it calls. uninit_member: Non-static class member is_then is not initialized in this constructor nor in any functions that it calls. uninit_member: Non-static class member then_cost is not initialized in this constructor nor in any functions that it calls. uninit_member: Non-static class member else_cost is not initialized in this constructor nor in any functions that it calls. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7228>	2020-10-21 15:31:00 -07:00
Caio Marcelo de Oliveira Filho	8cf0024432	nir: Use a switch in nir_lower_explicit_io_instr Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7255>	2020-10-21 12:00:09 -07:00
Erik Faye-Lund	33ccf0e9bc	nir: drop unused alpha_ref_float Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7251>	2020-10-21 16:33:43 +00:00
Erik Faye-Lund	42ee423e3a	nir: drop support for using load_alpha_ref_float Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7251>	2020-10-21 16:33:43 +00:00
Marek Olšák	233520035a	nir: consider load_color intrinsics as both inputs and sysval in gathering src/mesa expects this somewhere. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6950>	2020-10-21 16:10:08 +00:00
Eric Anholt	fdbc45d1d4	nir: Only validate in passes that might have changed things. If a pass returning boolean progress reports no change, we shouldn't need to re-validate. If a pass breaks the NIR but also fails to report progress correctly, it would be up to the next pass to catch that. This should hopefully help with test timeouts on KHR-GL33.texture_swizzle.functional since switching softpipe to nir-to-tgsi and enabling NIR validation in CI (27s to 20s on my system). Suggested-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Rob Clark <robdclark@chromium.org> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7239>	2020-10-21 05:00:17 +00:00
Timothy Arceri	c54c42321e	glsl: relax rule on varying matching for shaders older than 4.00 Please see new code comment for full justification. Fixes: `18004c338f` ("glsl: fail when a shader's input var has not an equivalent out var in previous") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3648 Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7184>	2020-10-21 15:00:47 +11:00
Jason Ekstrand	eb965719ab	compiler/types: Allow images and samplers in get_explicit_type_for_size_align Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7069>	2020-10-20 23:46:42 +02:00
Jason Ekstrand	0021d3ae87	compiler/types: Assert non-zero alignments in get_explicit_type_for_size_align Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7069>	2020-10-20 23:46:42 +02:00
Jason Ekstrand	ef68f740a6	nir/lower_io: Assert non-zero power-of-two alignments The way the ALIGN_POT macro works, an alignment of 0 may cause ALIGN_POT(x, 0) to return 0 for any x. Throw in an assert to guard against this case. Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7069>	2020-10-20 23:46:42 +02:00
Jason Ekstrand	589d918a4f	spirv: Add 0.5 to integer coordinates for OpImageSampleExplicitLod Just casting to a float is insufficient because that gives us the upper-left corner of the texel rather than the center. Fixes: `701cb9d60c` "nir/vtn: Handle integer sampling coordinates" Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7069>	2020-10-20 23:46:42 +02:00
Eric Anholt	d867e7c974	nir: Add an option to not lower source mods for f64/u64/i64. TGSI can't handle them, but we want to use this pass for nir-to-tgsi. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3395>	2020-10-20 08:54:06 -07:00
Eric Anholt	c730feacc0	nir: Add a call to get a struct describing SSA liveness per instruction. nir-to-tgsi will use this to release temporaries for SSA storage back to ureg's linear register allocation once they're dead. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3395>	2020-10-20 08:54:06 -07:00
Eric Anholt	a206b58157	nir: Add a block start/end ip to live instr index metadata. I wanted it for the per-instruction live intervals metadata, and it's not much to store in general. Make the ip explicitly 32-bit, on suggestion by jekstrand. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3395>	2020-10-20 08:54:06 -07:00
Eric Anholt	2f5d18403a	nir: Replace nir_ssa_def->live_index with nir_instr->index. live_index had two things going on: 0 meant the instr was an undef and always dead, and otherwise ssa defs had increasing numbers by instruction order. We already have a field in the instruction for storing instruction order, and ssa defs don't need that number to be contiguous (if you want a compact per-ssa-def number, use ssa->index after reindexing). We don't use ssa->index for this, because reindexing those would change nir_print, and that would be rude to people trying to track what's happening in optimization passes. This opened up a hole in nir_ssa_def, so we move nir_ssa_def->index toward the end to shrink the struct from 64 bytes to 56. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3395>	2020-10-20 08:54:01 -07:00
Eric Anholt	b6cb184e86	nir: Introduce nir_metadata_instr_index for nir_index_instr() being current. This will be useful to remove the live_index field from nir_ssa_def. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3395>	2020-10-20 08:53:36 -07:00
Timur Kristóf	d9cb9ff414	nir: Emit set_vertex_and_primitive_count for inactive streams. This fixes issues in backends such as ACO which rely on always getting this intrinsic to know the correct vertex and primitive count. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7213>	2020-10-20 07:11:29 +00:00
Vinson Lee	b17e264e66	glsl: Initialize lower_ubo_reference_visitor members in constructor. Fix defects reported by Coverity Scan. Uninitialized pointer field (UNINIT_CTOR) uninit_member: Non-static class member buffer_access_type is not initialized in this constructor nor in any functions that it calls. uninit_member: Non-static class member uniform_block is not initialized in this constructor nor in any functions that it calls. uninit_member: Non-static class member progress is not initialized in this constructor nor in any functions that it calls. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7120>	2020-10-15 23:03:12 +00:00
Caio Marcelo de Oliveira Filho	886d2d1a9a	spirv: Handle SpvOpTerminateInvocation Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7150>	2020-10-15 21:40:09 +00:00
Caio Marcelo de Oliveira Filho	4dfd292307	spirv: Update headers and metadata from latest Khronos commit This corresponds to c43a43c7cc3af55910b9bec2a71e3e8a622443cf (" Register the Xenia emulator as a generator (#171)") in https://github.com/KhronosGroup/SPIRV-Headers. Acked-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7150>	2020-10-15 21:40:09 +00:00
Caio Marcelo de Oliveira Filho	f6d5dd825f	nir: Add nir_intrinsic_terminate and nir_intrinsic_terminate_if Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7150>	2020-10-15 21:40:09 +00:00
Rhys Perry	f91b2fe384	nir/opt_load_store_vectorize: add some tests for discard/demote behaviour Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7163>	2020-10-15 18:21:44 +00:00
Rhys Perry	f8e971f511	nir/opt_load_store_vectorize: don't vectorize stores across demote Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Fixes: `ce9205c03b` ("nir: add a load/store vectorization pass") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7163>	2020-10-15 18:21:44 +00:00
Mike Blumenkrantz	4231cc2e99	glsl: more accurately handle swizzle in 64bit varying split with no left value as implied in the surrounding code, left_components can be 0 here, in which case creating a left swizzle is unnecessary (and triggers an assert) this moves a failing assert farther down the stack to a more useful location when trying to pack e.g., struct[3] { dvec3; float; } ref spec@arb_gpu_shader_fp64@execution@inout@vs-out-fs-in-s1-s2@3-dvec2-float Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7134>	2020-10-15 11:48:12 +00:00
Alejandro Piñeiro	9b01598fe5	nir/lower_io_to_scalar: update io semantics on per-component inst When we replace the original instruction with per-channel operations, the new instruction should inherit the semantics of the original instruction. Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6721>	2020-10-14 22:54:58 +00:00
Eric Anholt	4722491124	glsl/tests: Make the tests skip on Android binary execution failures. We don't have a suitable exe wrapper for running them, and the missing linker is throwing return code 255 instead of an ENOEXEC. Catch it and return skip from the tests. Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6700>	2020-10-14 16:54:59 +00:00
Daniel Schürmann	f503699e10	nir/opt_algebraic: optimize unpack_half_2x16_split_x(ushr, a, 16) Same as extract_u16(a, 1) Totals from 2021 (1.48% of 136546) affected shaders (RAVEN): VGPRs: 129516 -> 129524 (+0.01%); split: -0.00%, +0.01% CodeSize: 12485704 -> 12486600 (+0.01%); split: -0.00%, +0.01% Instrs: 2435041 -> 2434999 (-0.00%); split: -0.00%, +0.00% Cycles: 20952552 -> 20952624 (+0.00%); split: -0.00%, +0.00% VMEM: 374492 -> 374212 (-0.07%); split: +0.01%, -0.08% SMEM: 123309 -> 123291 (-0.01%); split: +0.00%, -0.02% VClause: 64156 -> 64164 (+0.01%) Copies: 191620 -> 191616 (-0.00%); split: -0.03%, +0.03% Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6777>	2020-10-14 15:31:38 +00:00
Rhys Perry	21422b1ff2	nir/opt_uniform_atomics: remove useless returns Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7117>	2020-10-14 09:53:34 +00:00
Iago Toral Quiroga	f4c754bcd1	nir: add a nir_get_ubo_size intrinsic This is the same as nir_get_buffer_size but geared towards UBOs instead of SSBOs. The new intrinsic is useful in Vulkan backends that need to add bound checks on buffer accesses to honor the robust buffer access feature. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	6004ad9df1	nir/lower_io: add an option to lower interpolateAt functions The option use_interpolated_input_intrinsics will lower these as well as regular input loads. This is inconvenient for V3D, where we can produce optimal code for regular input loads based on the input variable layout qualifiers, so this change adds an option to only lower instances of interpolateAt(). Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	50351df828	nir/glsl: add a glsl_ivec4_type() helper Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Alejandro Piñeiro	10b79bf901	nir: include texture query lod as one of the ops that requires a sampler In practice we found that we need this for v3d (specifically for cube map arrays, as they don't support the default value for wrap_i, so a sampler object is needed to override that value). It is worth noting that the main reason behind this auxiliary method was to identify those cases that we didn't have a sampler object available for Vulkan. So far, we found that we have a sampler object coming from nir always for that operation. Fixes cube map array tests like the following: dEQP-VK.glsl.texture_functions.query.texturequerylod.usamplercubearray_fragment Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Rhys Perry	044d213086	scons: fix SPIR-V -> NIR build Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Tested-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Fixes: `18f9fc919e` ('spirv: add and use a generator id enum') Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7096>	2020-10-13 16:53:10 +01:00
Rhys Perry	a7114f3f46	nir/opt_uniform_atomics: don't optimize atomics twice Applications sometimes already do this optimization themselves. fossil-db (Navi): Totals from 51 (0.04% of 135946) affected shaders: CodeSize: 507484 -> 501860 (-1.11%) Instrs: 99635 -> 98471 (-1.17%) Cycles: 2421944 -> 2414780 (-0.30%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6558>	2020-10-13 12:47:21 +00:00
Rhys Perry	bc43650522	nir/opt_uniform_atomics: optimize image atomics fossil-db (Navi): Totals from 65 (0.05% of 135946) affected shaders: SGPRs: 3792 -> 3784 (-0.21%) VGPRs: 2784 -> 2716 (-2.44%) CodeSize: 707492 -> 713080 (+0.79%) MaxWaves: 873 -> 887 (+1.60%) Instrs: 133376 -> 134524 (+0.86%) Cycles: 3004772 -> 3011440 (+0.22%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6558>	2020-10-13 12:47:21 +00:00
Rhys Perry	f83bc5beb8	nir: add pass to optimize uniform atomics This optimizes atomics with a uniform offset so that only one atomic operation is done in the subgroup. For shaders which do a very large amount of atomics, this can significantly improve performance. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6558>	2020-10-13 12:47:21 +00:00
Rhys Perry	37b6b0967c	nir: allow divergence information to be updated when inserting instruction Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6558>	2020-10-13 12:47:21 +00:00
Rhys Perry	e1120f274f	nir: move divergence analysis options to nir_shader_compiler_options Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6558>	2020-10-13 12:47:21 +00:00
Rhys Perry	1a912a550f	nir: add last_invocation intrinsic Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6558>	2020-10-13 12:47:20 +00:00
Rhys Perry	8850a63161	radv/aco,nir/lower_subgroups: don't lower elect ACO can implement this better. fossil-db (Navi): Totals from 33 (0.02% of 135946) affected shaders: SGPRs: 1736 -> 1744 (+0.46%) VGPRs: 1680 -> 1656 (-1.43%) CodeSize: 246160 -> 245916 (-0.10%); split: -0.14%, +0.04% MaxWaves: 449 -> 461 (+2.67%) Instrs: 48301 -> 48266 (-0.07%); split: -0.12%, +0.05% Cycles: 469740 -> 469240 (-0.11%); split: -0.18%, +0.08% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6558>	2020-10-13 12:47:20 +00:00
Mike Blumenkrantz	c31ababae3	nir: update ubo locations in nir_lower_uniforms_to_ubo locations are important for these because they provide info about how many block indices each ubo takes up UBO arrays have nonzero values here. all non-array UBOs have either 0 for the base or nonzero for an io lowered block at an offset, but only arrays need to be changed here because they're the only ones with absolute values, whereas all the others are relative. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6272>	2020-10-13 12:31:40 +00:00
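
Note on `ef68f740a6` (nir/lower_io: Assert non-zero power-of-two alignments): the sketch below illustrates why an alignment of 0 is a problem. The macro body is an illustrative copy of the common round-up-to-power-of-two formulation and is assumed, not confirmed, to match the one in the tree. The helpers `is_nonzero_pot` and `align_checked` are hypothetical names introduced only for this example.

```c
#include <assert.h>
#include <stdint.h>

/* Assumed formulation: round x up to a multiple of pot_align,
 * where pot_align is expected to be a non-zero power of two. */
#define ALIGN_POT(x, pot_align) (((x) + (pot_align) - 1) & ~((pot_align) - 1))

/* Hypothetical helper: true only for 1, 2, 4, 8, ... */
static int is_nonzero_pot(uint32_t v)
{
    return v != 0 && (v & (v - 1)) == 0;
}

/* Hypothetical wrapper: guard the macro with the kind of assert
 * the commit message describes. */
static uint32_t align_checked(uint32_t x, uint32_t align)
{
    assert(is_nonzero_pot(align));
    return ALIGN_POT(x, align);
}
```

With this formulation, `ALIGN_POT(13u, 8u)` rounds up to 16, while `ALIGN_POT(13u, 0u)` evaluates to `(13 - 1) & ~(0u - 1)`, which is `12 & 0`, so the result is 0 for any `x`. Asserting up front turns that silent zero into an immediate failure in debug builds.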


5537 commits