fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-25 02:10:11 +01:00

Author	SHA1	Message	Date
Jason Ekstrand	e6de164e03	nir: Use nir_const_value_for_int in nir_lower_subgroups Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7670 Fixes: `e4e79de2a4` ("nir/subgroups: Support > 1 ballot components") Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19689>	2022-12-02 23:12:30 +00:00
Danylo Piliaiev	5d025f4003	nir/nir_opt_offsets: Prevent offsets going above max In try_fold_load_store when trying to extract const addition from non-const offset source, we should take into account that there is already a constant base offset, which should count towards the limit. The issue was found in "Monster Hunter: World" running on Turnip. Fixes: `cac6f633b2` ("nir/opt_offsets: Use nir_ssa_scalar to chase offset additions.") Well, the issue was present before this commit but it made a lot of changes in surrounding code. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20099>	2022-12-02 15:04:52 +00:00
Qiang Yu	bb837bf6ef	nir,ac/llvm: add nir_buffer_atomic_add_amd Used by radeonsi for lower nir_atomic_add_gen/xfb_prim_count_amd. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18010>	2022-12-02 07:34:31 +00:00
Qiang Yu	8030fbcf16	nir,ac/llvm: add nir_load_smem_buffer_amd Used by radeonsi to load const buffer. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18010>	2022-12-02 07:34:31 +00:00
Qiang Yu	73ea7d651a	ac/llvm: nir_load_smem_amd support 32bit base address For radeonsi which use 32bit address in ac_build_load_to_sgpr(). Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18010>	2022-12-02 07:34:31 +00:00
Alyssa Rosenzweig	0af08acca5	nir: Add intrinsics for lowering UBOs/VBOs on AGX We'll use formatted loads and some system values to lower UBOs and VBOs to global memory in NIR, using the AGX-specific format support and addressing arithmetic to optimize the emitted code. Add the intrinsics and teach nir_opt_preamble how to move them so we don't regress UBO pushing. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19996>	2022-12-02 06:25:20 +00:00
Eric Engestrom	8140eca23b	meson: replace deprecated meson.get_cross_property(...) with meson.get_external_property(...) According to the deprecation note: > It's a pure subset of meson.get_external_property, and works strangely > in host == build configurations, since it would be more accurately > described as get_host_property. Signed-off-by: Eric Engestrom <eric@igalia.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19904>	2022-12-01 22:09:55 +00:00
Marcin Ślusarz	f6adfd6278	nir/lower_task_shader: allow offsetting of the start of payload We need this, because on Intel task payload starts with private header, followed by user-accessible data. Fixes: `37e78803d7` ("intel/compiler: use nir_lower_task_shader pass") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19409>	2022-12-01 11:19:47 +00:00
Jason Ekstrand	4fb33124c3	nir/divergence: Handle base_workgroup_id and workgrpu_id_zero_base Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20068>	2022-12-01 04:56:48 +00:00
Jason Ekstrand	0531630658	nir/builder: Also short-circuit for auto-generated nir_t2t<N>() This makes nir_i2i32(b, x) behave exactly like nir_i2iN(b, x, 32) etc. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7787 Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20067>	2022-12-01 01:10:12 +00:00
Jason Ekstrand	e67e2293fa	nir/builder: Rework the boolean conversion helpers Move them up to where the other conversion helpers. For nir_b2<T>(), suffix them with N like all the others and make them use nir_type_convert() as well. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20067>	2022-12-01 01:10:12 +00:00
Jason Ekstrand	d9a24632d3	nir/builder: Drop nir_i2i and nir_u2u in favor of nir_x2xN Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20067>	2022-12-01 01:10:12 +00:00
Jason Ekstrand	ccf19e0956	nir/builder: Move conversions higher in nir_builder.h Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20067>	2022-12-01 01:10:12 +00:00
Jason Ekstrand	9a225415e3	nir/builder: Short-circuit in nir_type_convert if no conversion happens If both types are the same or both are integer types with the same bit size, no actual conversion happens and nir_type_conversion_op() will return nir_op_mov. In this case, there's no point in emitting the move and we can just return src instead. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20067>	2022-12-01 01:10:12 +00:00
Jason Ekstrand	c5fbcab803	nir/builder: Fix indentation of nir_type_convert Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20067>	2022-12-01 01:10:12 +00:00
Jason Ekstrand	8a406fe055	nir: Fix builder usage in lower_mediump_vars() In our handling of load_deref, we were calling builder helpers to create conversions and then adjusting the destination bit size of the load. We should adjust the bit size first because the builder sometimes looks at the bit sizes of SSA values passed in as arguments. Even though it's not strictly necessary, adjust the store_deref case as well to make it fully symmetric with the load_deref case. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20067>	2022-12-01 01:10:12 +00:00
Erik Faye-Lund	d0342e28b3	nir: Add helper to create passthrough GS shader Based on nir_create_passthrough_tcs and d3d12_make_passthrough_gs, this creates a passthrough geometry shader that can be used by drivers that needs to emulate some graphics features in the geometry shader. Reviewed-by: Rob Clark <robclark@freedesktop.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19987>	2022-11-30 08:08:25 +00:00
Lionel Landwerlin	9d0560fe87	nir/lower_shader_calls: enable vectorizer We cannot fully use the vectorizer outside of this pass because once stack load/store operations have been lower to global load/store, the robustness rule applies to those as they would to application load/store. But this is all internal and we know it doesn't require out of bound checking. So doing the vectorizing here is the best solution. We just have to teach the vectorizer about our intrinsics. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20058>	2022-11-30 07:23:30 +00:00
Lionel Landwerlin	9c76cda7f0	nir/lower_shader_calls: add a pass to split load/store into scalars We'll run this pass prior to opt_load_store_vectorize to maximize the effect of the optimization. At the moment opt_load_store_vectorize is unable to pack this : store vec3 store vec3 store vec2 into this : store vec4 store vec3 If your backend can only do vec4 stores max. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20058>	2022-11-30 07:23:30 +00:00
Lionel Landwerlin	e84eab42c4	nir/lower_shader_calls: avoid moving loads into loops This is similar to what opt_gcm is doing. Moving a load inside a loop will increase memory bandwidth. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20058>	2022-11-30 07:23:30 +00:00
Karol Herbst	5398dd04bf	nir/lower_int64: fix shift lowering Starting with !19748 lowered 64 bit shifts were showing wrong results for shifts with insignificant bits set. nir shifts are defined to only look at the least significant bits. The lowering has take this into account. So there are two things going on: 1. the `ieq` and `uge` further down depend on `y` being masked. 2. the calculation of `reverse_count` actually depends on a masked `y` as well, due to the `(iabs (iadd y -32))` giving a different result for shifts > 31; Fixes: `41f3e9e5f5` ("nir: Implement lowering of 64-bit shift operations") Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19995>	2022-11-29 23:08:53 +00:00
Tapani Pälli	dba75d345d	nir: fix a leak of ralloc ctx in nir_opt_ray_query_ranges Fixes following leak: ==7520== 48 bytes in 1 blocks are definitely lost in loss record 1,597 of 2,016 ==7520== at 0x484486F: malloc (vg_replace_malloc.c:381) ==7520== by 0x5314A4E: ralloc_size (ralloc.c:117) ==7520== by 0x5314A1F: ralloc_context (ralloc.c:104) ==7520== by 0x6A95D68: nir_opt_ray_query_ranges (nir_opt_ray_queries.c:235) Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `f5b6576585` ("nir: Add a pass for combining ray queries") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20002>	2022-11-25 22:04:52 +00:00
Lionel Landwerlin	99dcdf4d64	nir/divergence: add missing btd_shader_type_intel Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `6d9ae6ec1e` ("intel: add a new intrinsic to get the shader stage from bindless shaders") Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19948>	2022-11-23 15:04:22 +00:00
Constantine Shablya	c2695dac5a	nir: convert nir_opt_idiv_const to nir_shader_instructions_pass Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19881>	2022-11-22 14:04:13 +00:00
Ian Romanick	f75c83c4aa	nir/loop_analyze: Fix get_iteration for nir_op_fneu Consider the loop: float i = 0.0; while (true) { if (i != 0.0) break; i = i + 1.0; } This loop clearly executes exactly one time. Some trickery is necessary to handle cases where the initial loop value is very large and the increment is, by comparison, very small. From the fenu_once test case, float i = -604462909807314587353088.0; while (true) { if (i != -604462909807314587353088.0) break; i = i + 36028797018963968.0; } This loop should also execute exactly once, but this is much more challenging to calculate due to precision issues. Going towards smaller magnitude (i.e., adding a small positive value to a large negative value) requires a smaller delta to make a difference than going towards a larger magnitude. For this reason, -604462909807314587353088.0 + 36028797018963968.0 != -604462909807314587353088.0, but -604462909807314587353088.0 + -36028797018963968.0 == -604462909807314587353088.0. Math class is tough. No changes in shader-db or fossil-db. v2: Fix major bug in checking result of the eval_const_binop(nir_op_feq, ...) discovered while developing fneu_once_easy unit test. Fix a typo in the comment just above that. Add fneu_once_easy test. v3: Skip the iteration count adjustment tests for nir_op_fenu and nir_op_ine. Since the iteration count is either 1 or unknown, all this function can do is add numerical error. Add fenu_once tests. v4: Change the initial value in the fneu_once test from large positive to large negative. Change check in get_iteration from nir_op_fsub to nir_op_fadd. Both changes from discussion with M Henning. Also add some more explanation in fneu_once. v5: Rename test cases. Fixes: `6772a17acc` ("nir: Add a loop analysis pass") Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19732>	2022-11-22 03:18:54 +00:00
Ian Romanick	d9f014401b	nir/loop_analyze: Fix get_iteration for nir_op_ine I discovered this problem because adding an algebraic transformation to convert some uge and ult to ieq or ine caused a couple loops to stop unrolling. Consider the loop: uint i = 0; while (true) { if (i >= 1) break; i++; } This loop clearly executes exactly one time. Note that uge(x, 1) is equivalent to ine(x, 0). Changing the condition to 'if (i != 0)' will also execute exactly one time. In the added test cases, uge_once correctly get an exact loop trip count of 1. Without the changes to nir_loop_analyze.c, the ine_once case detects a maximum loop trip count of zero and does not get an exact loop trip count. No changes in shader-db or fossil-db. v2: Move nir_op_fneu changes to a separate commit. v3: Rename test cases. Fixes: `6772a17acc` ("nir: Add a loop analysis pass") Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19732>	2022-11-22 03:18:54 +00:00
Ian Romanick	dbad33da16	nir/loop_analyze: Add basic unit test framework This test comes from a comment in the loop analysis code. The ine_zero test checks that zero iteration loops involving ine are correctly identified. v2: Add ine_zero test. Suggested by Tim. v3: Rename test cases. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19732>	2022-11-22 03:18:54 +00:00
Rhys Perry	368be87255	nir/algebraic: shrink 64-bit bitwise operations with 0/-1 constant half fossil-db (navi21): Totals from 457 (0.34% of 135636) affected shaders: Instrs: 259349 -> 250383 (-3.46%) CodeSize: 1411976 -> 1369136 (-3.03%) Latency: 2175961 -> 2148158 (-1.28%) InvThroughput: 502206 -> 490244 (-2.38%) Copies: 15238 -> 15232 (-0.04%); split: -0.07%, +0.03% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19748>	2022-11-21 17:34:46 +00:00
Alyssa Rosenzweig	940b871dba	nir: Define AGX intrinsics for local pixel access Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Lionel Landwerlin	e2dadda35f	Revert "nir/lower_shader_calls: put inserted instructions into a dummy block" This reverts commit `35d82ecf1e`. Cc: mesa-stable Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19820>	2022-11-19 10:53:18 +00:00
Lionel Landwerlin	3686d5a312	nir/lower_shader_calls: wrap only jumps rather than entire code blocks Moving entire chunks of code into a dummy if block is causing issues in some situations. To work around the issue that we tried to fix in `35d82ecf1e` ("nir/lower_shader_calls: put inserted instructions into a dummy block") which is that we cannot cut and past a block of instruction that ends with a jump if there are more instruction behind where we're going to past. We can instead just wraps the jumps into dummy if blocks. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19820>	2022-11-19 10:53:18 +00:00
Lionel Landwerlin	96d84e2a77	nir/lower_shader_calls: update metadata before validation Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19820>	2022-11-19 10:53:18 +00:00
Ian Romanick	2ba55ec504	nir/range_analysis: Set higher default maximum for max_workgroup_count Fixes: `c2a81ebe19` ("nir: Add default unsigned upper bound configuration.") Closes: #7676 Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19835>	2022-11-19 05:40:42 +00:00
Yonggang Luo	94886a2975	util: Move src/gallium/include/pipe/p_format.h to src/util/format/u_formats.h Because p_format.h shared between vulkan drivers and opengl drivers Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19629>	2022-11-19 03:38:19 +00:00
Lionel Landwerlin	723b15fb75	nir/lower_explicit_io: fix metadata preserve This pass can insert if blocks, therefore no dominance/block_index for you. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19818>	2022-11-18 20:46:20 +00:00
Rhys Perry	716aaf3673	nir/lower_bit_size: lower uadd_sat/iadd_sat/isub_sat to unsaturated alu The unsaturated arithmetic won't overflow/borrow, and may be faster. No fossil-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19473>	2022-11-18 18:31:32 +00:00
Rhys Perry	8a4f9a874b	nir/lower_bit_size: optimize usub_sat lowering The result should never be larger than uint_max. This doesn't need a special path. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19473>	2022-11-18 18:31:32 +00:00
Rhys Perry	e19584db2b	nir/algebraic: optimize open-coded uadd_sat/usub_sat fossil-db (navi21): Totals from 19 (0.01% of 135636) affected shaders: Instrs: 40730 -> 40688 (-0.10%) CodeSize: 217708 -> 217568 (-0.06%) Latency: 261466 -> 261373 (-0.04%) InvThroughput: 74944 -> 74896 (-0.06%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19473>	2022-11-18 18:31:32 +00:00
Rhys Perry	da30fb5df7	nir/lower_bit_size: lower uadd_carry 8/16-bit uadd_carry can exist in SPIR-V. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7615 Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19473>	2022-11-18 18:31:32 +00:00
Konstantin Seurer	bdd2abe334	nir/lower_shader_calls: Get rid of any brw occurences Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19749>	2022-11-18 12:28:14 +00:00
Connor Abbott	e402d2dbe9	nir: Fix nir_chase_binding() vecN handling In the comments we claimed to handle vecN instructions, for the case where an offset is trimmed from the descriptor, but we didn't ignore the offset itself and in effect only handled identity vecN's (which copy propagation would normally remove already!), so the handling of vecN was useless and this relied on copy propagation cleaning things up. Fix it to ignore everything except the components in the original source. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18703>	2022-11-18 11:57:05 +00:00
Jesse Natalie	cb32f9515e	nir_scale_fdiv: Respect vector swizzles Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19709>	2022-11-15 03:05:13 +00:00
Timothy Arceri	63c4849e8b	nir: add another common ffract -> ffloor pattern shader-db results (BDW): total instructions in shared programs: 17527053 -> 17526931 (<.01%) instructions in affected programs: 5116 -> 4994 (-2.38%) helped: 25 HURT: 0 helped stats (abs) min: 2 max: 15 x̄: 4.88 x̃: 3 helped stats (rel) min: 0.25% max: 5.34% x̄: 3.39% x̃: 3.90% 95% mean confidence interval for instructions value: -6.19 -3.57 95% mean confidence interval for instructions %-change: -3.98% -2.81% Instructions are helped. total cycles in shared programs: 856680230 -> 856682009 (<.01%) cycles in affected programs: 6583780 -> 6585559 (0.03%) helped: 117 HURT: 77 helped stats (abs) min: 1 max: 854 x̄: 68.56 x̃: 16 helped stats (rel) min: <.01% max: 35.34% x̄: 2.12% x̃: 0.76% HURT stats (abs) min: 1 max: 2188 x̄: 127.27 x̃: 18 HURT stats (rel) min: 0.01% max: 22.66% x̄: 1.86% x̃: 0.67% 95% mean confidence interval for cycles value: -30.07 48.41 95% mean confidence interval for cycles %-change: -1.28% 0.19% Inconclusive result (value mean confidence interval includes 0). LOST: 3 GAINED: 1 Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19666>	2022-11-14 09:50:11 +11:00
Konstantin Seurer	f5b6576585	nir: Add a pass for combining ray queries We can determice scopes/ranges of the use of ray queries and use this information to combine ray queries. Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16593>	2022-11-11 15:17:08 +00:00
Konstantin Seurer	d22037b96c	nir: Add and use nir_intrinsic_is_ray_query helper Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16593>	2022-11-11 15:17:08 +00:00
Konstantin Seurer	04abfbca57	nir: Remove gather_info after removing dead vars Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16593>	2022-11-11 15:17:08 +00:00
Timothy Arceri	34c52d8cb9	nir: fix typo in lower_double options handling Seems the intention was to check that both flags were not enabled instead we were checking that the floor flag was both set and not set so the result would always be false. Fixes: `3749a6ecd2` ("nir: honor lower_double options for ffloor and ffract") Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19642>	2022-11-11 14:36:00 +00:00
Gert Wollny	917d992b32	nir/algeraic_opt: use double options too for lowering ftrunc@64 ftrunc@64 also might need lowering on fp64 only, especially now that it might be introduced by nir_lower_int64. Fixes: `29da985682` nir/lower_int64: Enable lowering of 64-bit float to 64-bit integer conversions. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19657>	2022-11-11 09:29:31 +00:00
Qiang Yu	533b39bfcb	nir,ac/llvm,radeonsi: add nir_load_clamp_vertex_color_amd Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19429>	2022-11-11 04:22:20 +00:00
Lionel Landwerlin	b499a27d74	nir: make ray query load values visible in NIR prints Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19641>	2022-11-10 14:40:08 +02:00

1 2 3 4 5 ...

4022 commits