fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-19 13:38:19 +02:00

Author	SHA1	Message	Date
Timothy Arceri	0e0633ca49	glsl: relax rule on varying matching for shaders older than 4.20 This expands on commit `c54c42321e`. See the code comment for full justifications. At the time of the previous commit Ian wanted to limit the relaxing of the rule to GLSL 3.30 as that was the highest version of shaders seen in the wild that were having trouble with the stricter rules. However since then I've found that the long standing issue with tess shaders failing to compile in the game 'Layers Of Fear' is due to this same issue. The game uses 4.10 shaders and also makes use of explicit varying locations, so here we relax the rule to 4.20 and make sure to apply the restriction to shaders using varyings with explicit locations also. Fixes: `c54c42321e` ("glsl: relax rule on varying matching for shaders older than 4.00") Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11873>	2021-07-23 03:06:26 +00:00
Jason Ekstrand	60b5faf572	nir/lower_tex: Add a lower_txs_cube_array option Several bits of hardware require the division by 6 to happen in the shader. May as well have common lowering for it. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12005>	2021-07-22 14:22:35 -05:00
Jason Ekstrand	c6102dda0a	nir/lower_image: Handle index and bindless image_size Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12005>	2021-07-22 14:22:35 -05:00
Caio Marcelo de Oliveira Filho	baefdceeaf	spirv: Implement SPV_EXT_shader_atomic_float16_add Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11981>	2021-07-21 20:15:21 +00:00
Jordan Justen	6898549d56	nir: Add nir_lower_image() to lower cube image sizes Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9466>	2021-07-21 11:02:15 -07:00
Jason Ekstrand	b0fba89cf6	nir/lower_subgroups: Handle down-casts in uint_to_ballot_type This is required for Zink where the API ballot type is a uint64_t and the "hardware" ballot type is uvec4. Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11989>	2021-07-21 16:41:56 +00:00
Timothy Arceri	5cc36887ab	nir/gcm: be less destructive with instruction order This changes the pass to extract pinned instructions and not just unpinned instructions when rescheduling instructions. This stops pinned instructions from being bunched together when instructions are reinserted into the blocks which can result in regressions with regards to cycles and instruction counts on i965 and register use/Max Waves on AMD hardware. In order to do this we also throw away the post-order depth-first search linearization algorithm used to re-insert the instructions, which itself causes possible regressions when instructions are reinserted into a less than ideal new order (of which the bunched together pinned instructions is one example). Instead we simply insert instructions in the reverse order they were extracted. This will simply place instructions that were scheduled earlier onto the end of their new block and instructions that were scheduled later to the start of their new block. With this everything should remain in order without the need to run over uses. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/597>	2021-07-21 14:24:00 +00:00
Ian Romanick	436668874a	nir/gcm: Clear out pass_flags before starting With this pass enabled in Intel drivers, running shader-db on shaders/unity/38.shader_test resulted in Program received signal SIGSEGV, Segmentation fault. gcm_schedule_early_src (src=0x555555d45348, void_state=0x7fffffffba40) at ../../SOURCE/master/src/compiler/nir/nir_opt_gcm.c:297 297 if (info->early_block->index < src_info->early_block->index) (gdb) print src_info->early_block $1 = (nir_block *) 0x0 I tracked this down to an early exit from gcm_schedule_early_instr on the parent instruction because instr->pass_flags was 0x1c. That should be an impossible value for this pass, so I inferred that pass_flags must have dirt left from some previous pass. Fixes: `8dfe6f672f` ("nir/GCM: Use pass_flags instead of bitsets for tracking visited/pinned") Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/597>	2021-07-21 14:24:00 +00:00
Mike Blumenkrantz	3ab74d0ffa	nir: add nir_imm_ivec3 builder the other ones exist, so why not this one too Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11983>	2021-07-21 13:57:14 +00:00
Jason Ekstrand	393ee837fb	nir: Add a format field to _deref image intrinsics The rules here are the same as for texture instructions. The bits on the intrinsic are the ground truth and are allowed to vary from the deref a bit as-needed. If the intrinsic says PIPE_FORMAT_NONE, then we can look at the variable, if visible, to get format information. This means that we need to be careful when we rewrite intrinsics based on the deref to only override the format from the _deref intrinsic from the image variable unless the intrinsic is PIPE_FORMAT_NONE. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11849>	2021-07-20 23:18:22 +00:00
Jason Ekstrand	0b57272af8	nir: Set src_components = -1 for image intrinsic deref sources Semantically, -1 means "Unknown; don't validate" but it's really only used for derefs because they often need to be flexible. We don't really need that flexibility for image intrinsics but this makes it more consistent. More immediately useful is that this gives us the ability to tell _deref forms of these intrinsics apart from the lowered ones. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11849>	2021-07-20 23:18:22 +00:00
Jason Ekstrand	c0afb60258	nir: Set IMAGE_DIM and IMAGE_ARRAY on deref intrinsics The rules here are the same as for texture instructions. The bits on the intrinsic are the ground truth and are allowed to vary from the deref a bit as-needed. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11849>	2021-07-20 23:18:22 +00:00
Jason Ekstrand	ea7fcd5a97	glsl/nir: Use nir_ssa_undef() from nir_builder Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11849>	2021-07-20 23:18:22 +00:00
Mike Blumenkrantz	50f9519ea5	nir/lower_point_size_mov: zero nir_state_slot::swizzle in new variable this is otherwise uninitialized during nir_serialize calls Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11932>	2021-07-20 16:34:51 +00:00
Timothy Arceri	749251391d	glsl: replace some C++ code with C This replaces some new/delete uses with malloc/free. This is more consistent with most of the other glsl IR code but more importantly it allows the game "Battle Block Theater" to start working on some mesa drivers. The game overrides new and ends up throwing an assert and crashing when it sees this function calling new [0]. Note: The game still crashes with radeonsi due to similar conflicts with LLVM. CC: mesa-stable Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11907>	2021-07-17 10:07:37 +00:00
Sagar Ghuge	06ab737686	nir: Add optimizations for iadd3 This patch also adds has_iadd3 bit to give more control if backend supports ternary add instruction or not. v2: - Add patterns in late optimization (Connor Abbott) Suggested-by: Alyssa/Jason Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11596>	2021-07-16 15:59:56 +00:00
Sagar Ghuge	e8dff256c0	nir: Add new opcode for ternary addition v2: - Make it 2src commutative (Connor Abbott) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11596>	2021-07-16 15:59:55 +00:00
Jason Ekstrand	0ee322acdb	nir: Better document the Boissinot algorithm in nir_from_ssa() Reviewed-by: Yevhenii Kolesnikov <yevhenii.kolesnikov@globallogic.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8815>	2021-07-16 06:19:25 +00:00
Emma Anholt	bb35195b73	nir: Validate after deserialization. It's a particularly relevant place for NIR bugs to occur, and if you make a mistake in this code it gets caught in your debug build in something like mesa/st's call to nir_split_var_copies() during finalization, which is rather misleading. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11860>	2021-07-15 18:43:42 +00:00
Timur Kristóf	48e638ab29	nir: Add AMD specific intrinsics for NGG shader based culling. The new intrinsics fall into the following categories: 1. New viewport intrinsics: For missing components that we need. RADV will emit new SGPR arguments which will contain the viewport information for culling shaders. These are used to compute the screen space coordinates for small primitive culling. 2. load_cull_xxx: Load the culling settings in runtime. These will be a new SGPR argument in RADV. 3. overwrite_xxx: These are needed because system values such as vertex and instance ID are not writeable, but we need to change them after repacking shader invocations of VS and TES. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10525>	2021-07-13 23:56:33 +00:00
Jason Ekstrand	3d934ee03f	glsl: Delete lower_texture_projection This is only used by i965 and we've been getting it through nir_lower_tex since forever. Get rid of the GLSL IR pass. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11827>	2021-07-13 14:06:33 +00:00
Jason Ekstrand	2111551485	Convert a few files to UTF-8 Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11788>	2021-07-12 23:45:34 +00:00
Jason Ekstrand	a195ef123e	nir/lower_subgroups: Pad ballot values before bitcasting Otherwise, if we cast from a uint32_t to a uint64_t, the bitcast will fail before we pad. This happens on Intel. Fixes: `e4e79de2a4` "nir/subgroups: Support > 1 ballot components" Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5045 Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11786>	2021-07-09 14:21:26 +00:00
Jason Ekstrand	d4b482d378	android: Drop the Android.mk build system Android.mk files haven't really been supported by Mesa devs for a long time. Most of us have been willing to update Makefile.sources if we remember and sometimes we try to blind code some Android.mk for a new generator. However, the reality is that it breaks regularly and ends up being maintained by the Android community. To address this problem another approach was implemented in !10183 utilizing the maintained meson build system. The old Android.mk files are no longer required. This commit was created with the following commands: git rm /Android.mk git rm /Android..mk git rm */Makefile.sources git rm CleanSpec.mk Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4487 Acked-by: Roman Stratiienko <r.stratiienko@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9728>	2021-07-08 14:44:02 -05:00
Jason Ekstrand	624e799cc3	nir: Drop nir_ssa_def::name and nir_register::name We say that they're for debug only but we don't really have a good policy around when to set them and when not to. In particular, nir_lower_system_values and nir_lower_vars_to_ssa which are the chief producers of SSA values which might reasonably have a name do not bother to set one. We have some names set from things like BLORP and RADV's meta shaders but AFAICT, they're setting a name more because it's there than because they actually care. Also, most things other than nir_clone and nir_serialize don't bother to try and preserve them. You can see in the diffstat of this commit exactly what passes attempt to preserve names. Notably missing from the list is opt_algebraic which is the single largest source of SSA def churn and it happily throws names away. These observations lead me to question whether or not names are actually useful at all or if they're just taking up space (8B per instruction) and wasting CPU cycles (to ralloc_strdup on the off chance we do have one). I don't think I can think of a single time in recent history where I've been debugging a shader issue and a SSA value name has been there and been useful. If anything, the few times they are there, they just throw me off because they mess up the indentation in nir_print. iris shader-db on my system gets runtime -2.07734% +/- 1.26933% (n=5) Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5439>	2021-07-08 17:34:41 +00:00
Connor Abbott	68b8b9e9e1	tu, ir3: Plumb through support for CS subgroup size/id The way that the blob obtains the subgroup id on compute shaders is by just and'ing gl_LocalInvocationIndex with 63, since it advertizes a subgroupSize of 64. In order to support VK_EXT_subgroup_size_control and expose a subgroupSize of 128, we'll have to do something a little more flexible. Sometimes we have to fall back to a subgroup size of 64 due to various constraints, and in that case we have to fake a subgroup size of 128 while actually using 64 under the hood, by just pretending that the upper 64 invocations are all disabled. However when computing the subgroup id we need to use the "real" subgroup size. For this purpose we plumb through a driver param which exposes the real subgroup size. If the user forces a particular subgroup size then we lower load_subgroup_size in nir_lower_subgroups, otherwise we let it through, and we assume when translating to ir3 that load_subgroup_size means "give me the actual subgroup size that you decided in RA" and give you the driver param. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6752>	2021-07-08 16:02:41 +00:00
Connor Abbott	cc514bfa0e	nir: Add read_invocation_cond_ir3 intrinsic On qualcomm, we have shared registers similar to SGPR's on AMD. However, there is no readlane or readfirstlane primitive. shared registers can only be written to when just one lane is active. This means that we have to lower readInvocation(val, id) to something like: if (gl_SubgroupInvocation == id) { scalar_reg = val; } return scalar_reg; However it's a bit difficult to actually get the value of gl_SubgroupInvocation in the backend, because for compute it requires some calculations and we don't have any CSE support in the backend. This intrinsic lets us turn it into "readInvocationCond(val, id == gl_SubgroupInvocation)" in NIR at which point the backend code generation is a lot easier. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6752>	2021-07-08 16:02:41 +00:00
Connor Abbott	e4e79de2a4	nir/subgroups: Support > 1 ballot components Qualcomm has a mode with a subgroup size of 128, so just emitting larger integer operations and then lowering them later isn't an option. This makes the pass able to handle the lowering itself, so that we don't have to go down to 64-thread wavefronts when ballots are used. (The GLSL and legacy SPIR-V extensions only support a maximum of 64 threads, but I guess we'll cross that bridge when we come to it...) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6752>	2021-07-08 16:02:41 +00:00
Connor Abbott	90819b9b0e	nir/subgroups: Replace lower_vote_eq_to_ballot with lower_vote_eq Lower it to a vote instead of a ballot. This was only used for AMD, and in that case they're pretty much the same. However Qualcomm has a vote builtin, which we want to use instead of ballots. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6752>	2021-07-08 16:02:41 +00:00
Mike Blumenkrantz	b67a4ba4ad	nir/format_convert: add ssa version of uint packing Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10619>	2021-07-07 13:41:37 +00:00
Mike Blumenkrantz	c948251d2b	nir/format_convert: nir_shift -> nir_shift_imm Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10619>	2021-07-07 13:41:37 +00:00
Emma Anholt	4118264643	nir: Free the instructions in a DCE instr removal. No significant change in shader-db time (n=11), but should be a little win for memory usage by the compiler. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11628>	2021-07-06 11:24:48 -07:00
Emma Anholt	5618445d45	nir: Use remove_and_dce for nir_shader_lower_instructions(). Reduces the work that other shader passes have to do to look at dead code, and possibly extra rounds around the optimization loop if dce wasn't the last pass in it. shader-db runtime -1.12919% +/- 0.264337% (n=49) on SKL. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11628>	2021-07-06 11:24:45 -07:00
Emma Anholt	5251548572	nir: Add a nir_instr_remove that recursively removes dead code. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11628>	2021-07-06 11:24:43 -07:00
Danylo Piliaiev	c0f623e62f	glsl: Prohibit implicit conversion of mem parameter in atomicOP functions Per OpenGL Shading Language, section 8.11. "Atomic Memory Functions" first argument "mem" of all atomicOP functions is inout. The same is true for ARB_shader_storage_buffer_object and GL_INTEL_shader_atomic_float_minmax For implicit conversion of inout parameters it is required for type to support bi-directional conversion, since there is no such types in glsl - implicit conversion is effectively prohibited. Alternatively we could have marked atomic_var parameter of built-in atomicOP functions as inout, however it opens another can of worms during NIR lowerings. Fixes: `ea0a1f5beb` Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2837 Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4887>	2021-07-05 18:29:51 +03:00
Thomas H.P. Andersen	ffea622604	nir/ifind_msb_rev: fix input check ifind_msb_rev was introduced in `a5747f8ab3`. ifind_msb_rev guards against src0 being both 0 or -1 at the same time. That is always true. This patch changes it to check for those values individually. Spotted from a compile warning. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Fixes: `a5747f8ab3` (\"nir: add opcodes for *find_msb_rev and lowering\") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11630>	2021-07-04 12:17:58 +00:00
Jesse Natalie	f8f2c3d835	nir_lower_readonly_images: Clear variable data when changing the type For images, variable data includes the format. For samplers, variable data is used for OpenCL inline samplers. When converting a variable from one to the other, zero out the data so we don't accidentally interpret a converted image as an inline sampler. Fixes: `fa677c86` ("nir_lower_readonly_images_to_tex: Support non-CL semantics") Acked-by: Enrico Galli <enrico.galli@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11674>	2021-07-02 04:24:22 +00:00
Alyssa Rosenzweig	3da23a9c7e	nir: Fix constant folding for irhadd/urhadd This should be a subtract, not an add. The comment's proof is correct, but the (wrong) expression we actually use isn't what it's in the comment! Correct the discrepancy. The lowering in nir_opt_algebraic was correctly typed. Fixes: `272e927d0e` ("nir/spirv: initial handling of OpenCL.std extension opcodes") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11671>	2021-07-02 00:21:22 +00:00
Rob Clark	c7b935962b	nir: Add pass to lower phi precision In addition to register pressure benefits from getting more fp16/int16, this avoids i2imp's from standing in the way of loop unrolling. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11545>	2021-06-29 23:27:28 +00:00
Thomas H.P. Andersen	b4369de27f	nir/lower_packing: use shader_instructions_pass Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11615>	2021-06-29 22:08:29 +00:00
Thomas H.P. Andersen	ed530ac6c2	nir: return progress from nir_lower_packing Compiling with clang warns about an unused variable in nir_lower_packing. Tracking progress was added to nir_lower_packing in `adb157ddfd` but the function will ignore the progress from impl calls and always return false. This patch changes it to return the progress. It fixes the warning and should enable validation calls in NIR_PASS when progress is made. Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Fixes: `adb157ddfd` "nir: Return progress from nir_lower_64bit_pack()" Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11615>	2021-06-29 22:08:29 +00:00
Timothy Arceri	a73e7305e9	util/driconf: add new ignore_write_to_readonly_var workaround This forces the GLSL compiler to ignore writes to readonly vars rather than throwing an error. Cc: mesa-stable Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11602>	2021-06-29 02:59:28 +00:00
Timothy Arceri	e607205af0	glsl: force_glsl_version to shaders with no defined version If a shader has no defined version force_glsl_version was previous ignored and the shader would default to 110. This updates the code so that those shaders are forced to a new level also. We reused the existing code to make sure a sensible value is set for the version. Cc: mesa-stable Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11602>	2021-06-29 02:59:28 +00:00
Eleni Maria Stea	49e8b77fd9	intel: struct bitset is renamed to brw_bitset Static struct bitset was renamed to brw_bitset as a struct bitset is defined in sys/_bitset.h included by pthread_np.h on FreeBSD that is indirectly included by src/intel/compiler/brw_nir_lower_shader_calls.c Signed-off-by: Eleni Maria Stea <elene.mst@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11203>	2021-06-28 21:12:24 +03:00
Emma Anholt	0afab39af9	nir: Add a helper for chasing movs with nir_ssa_scalar(). Sometimes you might want to find a constant source without going through all the copy prop and constant folding to make your source be a load_const. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11613>	2021-06-28 16:26:24 +00:00
Rhys Perry	502b06c4f5	nir/opt_load_store_vectorize: fix check_for_robustness() with deref access We could do better if we knew the nir_address_format to obtain addition_bits, but the only affected driver (Turnip) probably won't benefit because it doesn't vectorize across vec4. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Fixes: `2e7bceb220` ("nir/load_store_vectorizer: fix check_for_robustness() with indirect loads") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4922 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11382>	2021-06-28 15:15:42 +00:00
Caio Marcelo de Oliveira Filho	6ad88a8f08	spirv: Support SPV_KHR_subgroup_uniform_control_flow There's no SPIR-V Capability associated, so check in the Execution Mode. For now, don't keep track of whether a shader uses uniform control flow in the shader_info, we can add that when/if a driver actually need that information. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11476>	2021-06-25 22:41:32 +00:00
Caio Marcelo de Oliveira Filho	a219073e9b	spirv: Update headers and metadata from latest Khronos commit This corresponds to f95c3b3761ee1b1903f54ae69b526ed6f0edc3b9 ("Merge pull request #219 from cmarcelo/SPV_EXT_shader_atomic_float16_add") in https://github.com/KhronosGroup/SPIRV-Headers. Acked-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11476>	2021-06-25 22:41:32 +00:00
Caio Marcelo de Oliveira Filho	3a9289eaed	nir: Add test to check edge case in Split ALU optimization Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11476>	2021-06-25 22:41:32 +00:00
Caio Marcelo de Oliveira Filho	b951929795	nir/opt_if: Don't split ALU for single block infinite loops Some infinite loop cases were already covered by other restrictions (e.g. if the loop had a body), but the case with a single block in the loop body wasn't yet. This prevents an infinite loop when optimizing the shader in dEQP-VK.reconvergence.subgroup_uniform_control_flow_ballot.compute.nesting2.3.2 and various others reconvergence tests. Fixes: `0881e90c09` ("nir: Split ALU instructions in loops that read phis") Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> [v1] Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11476>	2021-06-25 22:41:32 +00:00

1 2 3 4 5 ...

6251 commits