fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-15 12:08:14 +02:00

Author	SHA1	Message	Date
Karol Herbst	8bdf4b2b41	compiler/types: fix size of padded OpenCL Structs In C the size of a struct { uin32_t a; uint8_t b; } is 8, not 5, so we have to account for the biggest alignment across all struct members. Funny that the OpenCL CTS doesn't catch that. Fixes: `44d32e62fb` ("glsl: add cl_size and cl_alignment") Signed-off-by: Karol Herbst <git@karolherbst.de> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23701> (cherry picked from commit `4431e5a222`)	2023-06-20 13:38:04 +01:00
Jesse Natalie	22ab43f3e8	nir: Fix constant expression for unpack_64_4x16 Cc: Mesa-stable Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23173> (cherry picked from commit `663d957480`)	2023-06-15 22:11:43 +01:00
Jesse Natalie	00ecfc8690	nir_opt_constant_folding: Fix nir_deref_path leak Cc: Mesa-stable Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23173> (cherry picked from commit `009d2de88f`)	2023-06-15 22:11:42 +01:00
Sviatoslav Peleshko	8cf5581e92	nir/lower_shader_calls: Fix cursor if broken after nir_cf_extract() call Fixes: `e2dadda3` ("Revert "nir/lower_shader_calls: put inserted instructions into a dummy block") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8978 Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22884> (cherry picked from commit `08e95f8f8e`)	2023-06-15 22:08:05 +01:00
Karol Herbst	f4f6d2e7cf	clc: relax spec constant validation Multiple values can have multiple spec constants assigned and vtn handles this just fine. So just drop that assert as we need it to run SyCL kernels. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9037 Fixes: `a699844ffb` ("microsoft/clc: Parse SPIR-V specialization consts into metadata") Signed-off-by: Karol Herbst <git@karolherbst.de> Acked-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23512> (cherry picked from commit `90b8666ff2`)	2023-06-15 22:07:22 +01:00
Friedrich Vock	130ea8ffee	nir: Remove unnecessary assert in nir_before_src This condition can occur in the wild (more specifically in RT shader call lowering), and it is handled correctly. Cc: mesa-stable Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22536> (cherry picked from commit `87ac5d7d0a`)	2023-06-07 11:14:20 +02:00
Friedrich Vock	56198c4851	nir: Rematerialize derefs in use blocks before repairing SSA nir_repair_ssa_impl may insert phi nodes for any deref, but if the deref mode is uniform, validation fails. To fix this, rematerialize the derefs in the blocks they are used to avoid generating phi nodes for them. Cc: mesa-stable Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22536> (cherry picked from commit `ee2764d5e8`)	2023-06-07 11:14:20 +02:00
Jesse Natalie	794821b05f	nir_opt_algebraic: Don't shrink 64-bit bitwise ops if pack_split is going to be lowered Otherwise this can cause optimizations to fight resulting in infinite optimization loops with opt_algebraic, constant_folding, and copy_prop. Fixes: `368be872` ("nir/algebraic: shrink 64-bit bitwise operations with 0/-1 constant half") Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23192> (cherry picked from commit `6c62eaf22d`)	2023-06-02 19:34:02 +01:00
Caio Oliveira	9a4db70a19	spirv: Fix gl_spirv_validation when OpLine with strings is present Fix issue by handling the OpString instructions when walking through the preamble for validation. The gl_spirv_validation() creates a vtn_builder() and walks the instructions looking for a subset of the information. However our current way to walk the instructions will also perform tracking of OpLine/OpNoLine, that may make references to OpString instructions that were being previously ignored by gl_spirv_validation(). This would cause the parsing to fail. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9004 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22973> (cherry picked from commit `1b31d528b9`)	2023-06-01 16:42:28 +01:00
Caio Oliveira	c4f39f0074	spirv: Extract vtn_handle_debug_text() helper This will be later used by gl_spirv handling. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22973> (cherry picked from commit `a32f97530a`)	2023-06-01 16:42:13 +01:00
Tatsuyuki Ishi	cb9d82bc2a	nir: Fix serializing pointer initializers. Found by manual inspection. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Fixes: `7acc81056f` ("compiler/nir: Add support for variable initialization from a pointer") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22355> (cherry picked from commit `1546a9de99`)	2023-05-25 14:06:13 +01:00
Kenneth Graunke	7cc32b0e15	nir: Add find_lsb lowering to nir_lower_int64. Some GPUs can only handle 32-bit find_lsb. Cc: mesa-stable Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23123> (cherry picked from commit `9293d8e64b`)	2023-05-25 14:06:12 +01:00
Konstantin Seurer	a8f93f86d5	nir/lower_shader_calls: Remat derefs earlier spill_ssa_defs_and_lower_shader_calls can insert phis as well which can make nir_opt_shrink_stores crash. Fixes: `200e551c` ("nir/lower_shader_calls: Remat derefs before lowering resumes") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9003 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23007> (cherry picked from commit `40653f0783`)	2023-05-25 14:06:11 +01:00
antonino	d146364601	nir: make var arrays large enough in `nir_create_passthrough_gs` Because each location has 4 possible different values for location_frac the arrays need to br 4x the size. Fixes: `d0342e28` ("nir: Add helper to create passthrough GS shader") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22871> (cherry picked from commit `b5818e2e4f`)	2023-05-25 14:06:11 +01:00
antonino	475df98f04	nir: handle interface blocks in `copy_vars` Fixes: `99121c9b77` ("nir/gs: fix array type copying for passthrough gs") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22871> (cherry picked from commit `8f22669f9b`)	2023-05-25 14:06:11 +01:00
antonino	287e69f76e	nir: don't create invalid inputs in `nir_create_passthrough_gs` The helper was creating input locations for some builtin bariables. This caused validation errors in zink because those builtins can't be used as input. Fixes: `d0342e28b3` ("nir: Add helper to create passthrough GS shader") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22871> (cherry picked from commit `83692bfe30`)	2023-05-25 14:06:11 +01:00
antonino	41ba7bb42e	nir: use `nir_variable_clone` in `nir_create_passthrough_gs` Some stream out properties where not being copied causing problems in zink. Use the appropiate helper instead of copying fields by hand. Fixes: `d0342e28b3` ("nir: Add helper to create passthrough GS shader") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22871> (cherry picked from commit `78d59ef4b1`)	2023-05-25 14:06:11 +01:00
Mike Blumenkrantz	3e9b9fdc77	glsl/lower_samplers_as_deref: apply bindings for unused samplers if a sampler is never used (no derefs) then its binding will never be applied here, leaving it with binding=0. this will clobber the real binding=0 sampler in driver backends, leading to errors, so try to iterate using the same criteria as above and apply bindings in the same way fixes #8974 cc: mesa-stable Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22902> (cherry picked from commit `ccbfcf3933`)	2023-05-09 16:56:48 +01:00
Konstantin Seurer	4c4780997e	nir/lower_fp16_casts: Fix SSA dominance Fixes: `01dfd65` ("nir: port fp16 casting code from dxil") Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22804> (cherry picked from commit `5503a08583`)	2023-05-09 16:54:28 +01:00
Erik Faye-Lund	e2ccae5fae	nir: fix constant-folding of 64-bit fpow We need to do full pow if 64-bit, and we can do fpow() otherwise. Not the other way around. Fixes: `9076c4e289` ("nir: update opcode definitions for different bit sizes") Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22774> (cherry picked from commit `955797d015`)	2023-05-03 14:40:41 +01:00
Lone_Wolf	731681db4b	compiler/clc: Fix embedded clang headers (microsoft-clc) for LLVM 16+ Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7742 Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22741> (cherry picked from commit `f53e5efad7`)	2023-05-01 09:02:34 +01:00
Lionel Landwerlin	3a9a3356d3	nir/divergence: add missing load_global_constant_* intrinsics Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22624> (cherry picked from commit `d6e9479d4b`)	2023-05-01 09:01:27 +01:00
Mike Blumenkrantz	68a35bad86	nir/gs: fix array type copying for passthrough gs same mechanics as in zink passes Fixes: `d0342e28b3` ("nir: Add helper to create passthrough GS shader") Reviewed-by: Antonino Maniscalco <antonino.maniscalco@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22669> (cherry picked from commit `99121c9b77`)	2023-04-26 17:37:26 +01:00
Marek Olšák	32c5980e63	nir: fix 2 bugs in nir_create_passthrough_tcs - VAR31 was ignored. - Only a half of the 16-bit slot was passed through, though I'm not sure if nir_lower_io handles vec8. The slots are only for GLES and I don't think a passthrough TCS is possible with GLES. Fixes: `a8e84f50bc` - nir: Add helper to create passthrough TCS shader Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21861> (cherry picked from commit `ace8a7068e`)	2023-04-26 17:37:24 +01:00
Mike Blumenkrantz	7830c29f6e	nir/lower_alpha_test: rzalloc state slots this otherwise leads to uninitialized memory cc: mesa-stable Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22558> (cherry picked from commit `24555f5462`)	2023-04-26 17:37:24 +01:00
Eric Engestrom	f2911b79e5	compiler: fix buggy usage of unreachable() Cc: mesa-stable Signed-off-by: Eric Engestrom <eric@igalia.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22529> (cherry picked from commit `f5ed1c79ae`)	2023-04-19 14:37:56 +01:00
Karol Herbst	8fcfc51dad	clc: add clc_validate_spirv Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22280>	2023-04-13 02:54:21 +00:00
Alyssa Rosenzweig	c66be7521f	nir/lower_blend: Enable per-sample shading Loading output require per-sample blending, so enable per-sample execution of the shader as a whole so the right sample values are blended. Affects: dEQP-GLES31.functional.multisample.default_framebuffer.sample_mask_sum_of_inverses Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22385>	2023-04-11 04:16:32 +00:00
Alyssa Rosenzweig	a74c2ac403	nir/lower_blend: Set uses_fbfetch_output conservatively Only insert a load_output if we're going to use it, don't rely on it getting DCE'd since that will mess up the shader info. This does require a bit of logic to figure out whether we do need it. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22385>	2023-04-11 04:16:32 +00:00
Rhys Perry	bb653b0acb	nir: make nir_fisnan helper exact Floating point ALU assume no NaNs unless FLOAT_CONTROLS_SIGNED_ZERO_INF_NAN_PRESERVE_FPn or (for some opcodes) exact=true. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Fixes: `bf9c1699cd` ("nir: add nir_fisnan helper function") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22206>	2023-04-10 17:42:24 +00:00
Alyssa Rosenzweig	f5471ca965	nir/validate: Only walk uses once Ostensibly faster. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22343>	2023-04-07 23:48:03 +00:00
Alyssa Rosenzweig	9a35079074	nir/repair_ssa: Refactor some use handling We can mostly unify the instr-use and if-use handling, which is a lot more concise. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22343>	2023-04-07 23:48:03 +00:00
Alyssa Rosenzweig	dcb59a7672	nir: Remove nir_if_rewrite_condition_ssa Now unused. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22343>	2023-04-07 23:48:03 +00:00
Alyssa Rosenzweig	e25c182993	nir: Use nir_src_rewrite_ssa Where sensible. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22343>	2023-04-07 23:48:03 +00:00
Alyssa Rosenzweig	e9e0956d62	nir: Factor out nir_src_rewrite_ssa helper Like nir_instr_rewrite_ssa but without the asserted extra argument. Works on ifs too, now that we have a unified use list. We do need to assert that the source has actually been inserted and has valid use/def chains. Previously, asserting on the parent instruction accomplished that indirectly. For the more general helper, we instead directly assert that there exists a non-null parent, whatever it is. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22343>	2023-04-07 23:48:03 +00:00
Alyssa Rosenzweig	2285b5daae	nir: Reduce indirection A source used by an if is necessarily the condition of that if. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22343>	2023-04-07 23:48:03 +00:00
Alyssa Rosenzweig	c4a91c12dc	nir/validate: Don't treat if-uses specially We don't use the tag anywhere, so don't bother with it. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Suggested-by: Faith Ekstrand <faith@gfxstrand.net> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22343>	2023-04-07 23:48:03 +00:00
Alyssa Rosenzweig	f3b420692b	nir: Remove 2nd argument from nir_before_src We can now determine whether a nir_src is for an if without a sideband, so simplify the function signature. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Suggested-by: Faith Ekstrand <faith@gfxstrand.net> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22343>	2023-04-07 23:48:03 +00:00
Alyssa Rosenzweig	8505f0bd84	nir/opt_loop_unroll: Avoid list_length It is O(N) but can often be replaced with something O(1). Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22343>	2023-04-07 23:48:03 +00:00
Alyssa Rosenzweig	7356f3eee7	nir/opt_ray_queries: Don't use list_length Expensive. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22343>	2023-04-07 23:48:03 +00:00
Alyssa Rosenzweig	7f6491b76d	nir: Combine if_uses with instruction uses Every nir_ssa_def is part of a chain of uses, implemented with doubly linked lists. That means each requires 2 * 64-bit = 16 bytes per def, which is memory intensive. Together they require 32 bytes per def. Not cool. To cut that memory use in half, we can combine the two linked lists into a single use list that contains both regular instruction uses and if-uses. To do this, we augment the nir_src with a boolean "is_if", and reimplement the abstract if-uses operations on top of that list. That boolean should fit into the padding already in nir_src so should not actually affect memory use, and in the future we sneak it into the bottom bit of a pointer. However, this creates a new inefficiency: now iterating over regular uses separate from if-uses is (nominally) more expensive. It turns out virtually every caller of nir_foreach_if_use(_safe) also calls nir_foreach_use(_safe) immediately before, so we rewrite most of the callers to instead call a new single `nir_foreach_use_including_if(_safe)` which predicates the logic based on `src->is_if`. This should mitigate the performance difference. There's a bit of churn, but this is largely a mechanical set of changes. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22343>	2023-04-07 23:48:03 +00:00
Alyssa Rosenzweig	d1b569d26f	nir/print: Don't print sampler_index for txf NIR's docs for sampler_index say The following operations do not require a sampler and, as such, this field should be ignored: - nir_texop_txf - nir_texop_txf_ms - nir_texop_txs - nir_texop_query_levels - nir_texop_texture_samples - nir_texop_samples_identical Contrary to this documentation, we were still printing the sampler_index anyway, even though the value is formally undefined. This was helpful for PIPE_CAP_TEXTURE_BUFFER_SAMPLER drivers that (despite the NIR docs) respected the sampler_index anyway. There are no longer any such drivers, so we should stop printing sampler_index for txf to avoid confusion (and noise). Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22223>	2023-04-07 01:15:41 +00:00
Ian Romanick	72a9d12c96	nir/tests: Port almost all loop_analyze tests to new macro-based infastructure The one test that remains would have an automatically generated name that would conflict with another test. This test is also a little special (per the comment in the test), so it's probably best to leave it separate anyway. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3445>	2023-04-06 23:50:27 +00:00
Yevhenii Kolesnikov	9427aaeab7	nir/loop_analyze: Determine iteration counts for more kinds of loops If loop iterator is incremented with something other than regular addition, it would be more error prone to calculate the number of iterations theoretically. What we can do instead, is try to emulate the loop, and determine the number of iterations empirically. These operations are covered: - imul - fmul - ishl - ishr - ushr Also add unit tests for loop unrollment. Improves performance of Aztec Ruins (sixonix gfxbench5.aztec_ruins_vk_high) by -1.28042% +/- 0.498555% (N=5) on Intel Arc A770. v2 (idr): Rebase on 3 years. :( Use nir_phi_instr_add_src in the test cases. v3 (idr): Use try_eval_const_alu in to evaluate loop termination condition in get_iteration_empirical. Also restructure the loop slightly. This fixed off by one iteration errors in "inverted" loop tests (e.g., nir_loop_analyze_test.ushr_ieq_known_count_invert_31). v4 (idr): Use try_eval_const_alu in to evaluate induction variable update in get_iteration_empirical. This fixes non-commutative update operations (e.g., shifts) when the induction varible is not the first source. This fixes the unit test nir_loop_analyze_test.ishl_rev_ieq_infinite_loop_unknown_count. v5 (idr): Fix _type parameter for fadd and fadd_rev loop unroll tests. Hopefully that fixes the failure on s390x. Temporarily disable fmul. This works-around the revealed problem in glsl-fs-loop-unroll-mul-fp64, and there were no shader-db or fossil-db changes. v6 (idr): Plumb max_unroll_iterations into get_iteration_empirical. I was going to do this, but I forgot. Suggested by Tim. v7 (idr): Disable fadd tests on s390x. They fail because S390 is weird. Almost all of the shaders affected (OpenGL or Vulkan) are from gfxbench or geekbench. A couple shaders in Deus Ex (OpenGL), Dirt Rally (OpenGL), Octopath Traveler (Vulkan), and Rise of the Tomb Raider (Vulkan) are helped. The lost / gained shaders in OpenGL are an Aztec Ruins shader that goes from SIMD16 to SIMD8. The spills / fills affected are in a single Aztec Ruins (Vulkan) compute shader. shader-db results: Skylake, Ice Lake, and Tiger Lake had similar results. (Tiger Lake shown) total loops in shared programs: 5514 -> 5470 (-0.80%) loops in affected programs: 62 -> 18 (-70.97%) helped: 37 / HURT: 0 LOST: 2 GAINED: 2 Haswell and Broadwell had similar results. (Broadwell shown) total loops in shared programs: 5346 -> 5298 (-0.90%) loops in affected programs: 66 -> 18 (-72.73%) helped: 39 / HURT: 0 fossil-db results: Skylake, Ice Lake, and Tiger Lake had similar results. (Tiger Lake shown) Instructions in all programs: 157374679 -> 157397421 (+0.0%) Instructions hurt: 28 SENDs in all programs: 7463800 -> 7467639 (+0.1%) SENDs hurt: 28 Loops in all programs: 38980 -> 38950 (-0.1%) Loops helped: 28 Cycles in all programs: 7559486451 -> 7557455384 (-0.0%) Cycles helped: 28 Spills in all programs: 11405 -> 11403 (-0.0%) Spills helped: 1 Fills in all programs: 19578 -> 19588 (+0.1%) Fills hurt: 1 Lost: 1 Signed-off-by: Yevhenii Kolesnikov <yevhenii.kolesnikov@globallogic.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3445>	2023-04-06 23:50:27 +00:00
Yevhenii Kolesnikov	f051967f19	nir/loop_analyze: Track induction variables incremented by more operations These operations are covered: - imul - fmul - ishl - ishr - ushr The only cases that can be currently affected are those where the calculated loop-trip count would be zero. v2 (idr): Split out from original commit. Rebase on lots of other work. v3 (idr): Move operand size assertion. This code only cares that the operands have the same size for the iadd and fadd cases. In other cases, such as shifts, the sizes may not match. Fixes assertion failures in tests/spec/arb_gpu_shader_int64/glsl-fs-loop-unroll-ishl-int64.shader_test. No shader-db or fossil-db changes on any Intel platform. Signed-off-by: Yevhenii Kolesnikov <yevhenii.kolesnikov@globallogic.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3445>	2023-04-06 23:50:27 +00:00
Ian Romanick	bc170e895f	nir/loop_analyze: Use try_eval_const_alu and induction variable basis info This dramatically simplifies will_break_on_first_iteration, and, much more importantly, makes it significantly more flexible. It is now possible to handle loops with more complex exit condition and other kinds of increment operations. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3445>	2023-04-06 23:50:27 +00:00
Ian Romanick	99a7a6648d	nir/loop_analyze: Change invert_cond instead of changing the condition This ensures that scenarios like nir_loop_analyze_test.iadd_inot_ilt_rev_known_count_5 don't regress in the next commit. It also means we don't change float comparisons. These are probably fine... but it still made me a little uneasy. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3445>	2023-04-06 23:50:27 +00:00
Ian Romanick	aeb8af1141	nir/loop_analyze: Track induction variable basis information Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3445>	2023-04-06 23:50:27 +00:00
Ian Romanick	30879a760c	nir/loop_analyze: Add a function to evaluate an ALU as constant ...with a substitution. This function is largely a copy-and-paste of try_fold_alu (nir_opt_constant_folding.c), and an argument could be made that this function belongs in that file. v2: Some changes were mistakenly squashed in to "nir/loop_analyze: Use try_eval_const_alu and induction variable basis info" that should have been here. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3445>	2023-04-06 23:50:27 +00:00
Ian Romanick	2e942909c8	nir/tests: Add many loop analysis tests for induction variables modified by imul Loop analysis doesn't currently treat values updated by multiplication as induction variables. Future patches will change this. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3445>	2023-04-06 23:50:27 +00:00

1 2 3 4 5 ...

7890 commits