fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-23 21:38:18 +02:00

Author	SHA1	Message	Date
Ian Romanick	c1fad08d69	glsl_to_nir: Fix NIR bit-size of ir_triop_bitfield_extract and ir_quadop_bitfield_insert Previously these would return result->bit_size of 32 even though the type might have been int16_t or uint16_t. This prevents many assertion failures in "glsl: Use nir_type_convert instead of nir_type_conversion_op" on zink. Fixes: `5e922fbc16` ("glsl_to_nir: fix bitfield_extract with 16-bit operands") Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15121> (cherry picked from commit `43da822312`)	2022-12-14 20:56:54 +00:00
Friedrich Vock	57827e6903	nir: Do not consider phis with incompatible dests equal CSE tries to collapse equal instructions, and collapsing two phis with incompatible dests is illegal. Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Fixes: `6bdce55c` ("nir: Add a basic CSE pass") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19960> (cherry picked from commit `a54c2c8289`)	2022-12-14 20:47:01 +00:00
Marcin Ślusarz	5f387adc02	nir/lower_task_shader: fix task payload corruption when shared memory workaround is enabled We were not taking into account that when all invocations within workgroup are active, we'll copy more data than needed, corrupting task payload of other workgroups. Fixes: `8aff8d3dd4` ("nir: Add common task shader lowering to make the backend's job easier.") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20080> (cherry picked from commit `ffefa386fd`)	2022-12-14 20:47:01 +00:00
Chia-I Wu	3ef6b27bde	nir: fix nir_link_varying_precision link_varyings ignores precisions and can assign the same location to variables with different precisions. nir_link_varying_precision should check location_frac as well. Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20113> (cherry picked from commit `7244d88516`)	2022-12-14 20:47:01 +00:00
Jason Ekstrand	f2327830b2	nir: Use nir_const_value_for_int in nir_lower_subgroups Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7670 Fixes: `e4e79de2a4` ("nir/subgroups: Support > 1 ballot components") Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19689> (cherry picked from commit `e6de164e03`)	2022-12-14 20:47:00 +00:00
Danylo Piliaiev	dcf2960733	nir/nir_opt_offsets: Prevent offsets going above max In try_fold_load_store when trying to extract const addition from non-const offset source, we should take into account that there is already a constant base offset, which should count towards the limit. The issue was found in "Monster Hunter: World" running on Turnip. Fixes: `cac6f633b2` ("nir/opt_offsets: Use nir_ssa_scalar to chase offset additions.") Well, the issue was present before this commit but it made a lot of changes in surrounding code. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20099> (cherry picked from commit `5d025f4003`)	2022-12-14 20:47:00 +00:00
Marcin Ślusarz	985a4ebab3	nir/lower_task_shader: allow offsetting of the start of payload We need this, because on Intel task payload starts with private header, followed by user-accessible data. Fixes: `37e78803d7` ("intel/compiler: use nir_lower_task_shader pass") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19409> (cherry picked from commit `f6adfd6278`)	2022-12-14 20:47:00 +00:00
Karol Herbst	7a28549247	nir/lower_int64: fix shift lowering Starting with !19748 lowered 64 bit shifts were showing wrong results for shifts with insignificant bits set. nir shifts are defined to only look at the least significant bits. The lowering has take this into account. So there are two things going on: 1. the `ieq` and `uge` further down depend on `y` being masked. 2. the calculation of `reverse_count` actually depends on a masked `y` as well, due to the `(iabs (iadd y -32))` giving a different result for shifts > 31; Fixes: `41f3e9e5f5` ("nir: Implement lowering of 64-bit shift operations") Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19995> (cherry picked from commit `5398dd04bf`)	2022-11-30 21:12:44 +00:00
Lionel Landwerlin	4b38684f60	nir/divergence: add missing btd_shader_type_intel Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `6d9ae6ec1e` ("intel: add a new intrinsic to get the shader stage from bindless shaders") Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19948> (cherry picked from commit `99dcdf4d64`)	2022-11-23 19:12:00 +00:00
Ian Romanick	73580de2e8	nir/loop_analyze: Fix get_iteration for nir_op_fneu Consider the loop: float i = 0.0; while (true) { if (i != 0.0) break; i = i + 1.0; } This loop clearly executes exactly one time. Some trickery is necessary to handle cases where the initial loop value is very large and the increment is, by comparison, very small. From the fenu_once test case, float i = -604462909807314587353088.0; while (true) { if (i != -604462909807314587353088.0) break; i = i + 36028797018963968.0; } This loop should also execute exactly once, but this is much more challenging to calculate due to precision issues. Going towards smaller magnitude (i.e., adding a small positive value to a large negative value) requires a smaller delta to make a difference than going towards a larger magnitude. For this reason, -604462909807314587353088.0 + 36028797018963968.0 != -604462909807314587353088.0, but -604462909807314587353088.0 + -36028797018963968.0 == -604462909807314587353088.0. Math class is tough. No changes in shader-db or fossil-db. v2: Fix major bug in checking result of the eval_const_binop(nir_op_feq, ...) discovered while developing fneu_once_easy unit test. Fix a typo in the comment just above that. Add fneu_once_easy test. v3: Skip the iteration count adjustment tests for nir_op_fenu and nir_op_ine. Since the iteration count is either 1 or unknown, all this function can do is add numerical error. Add fenu_once tests. v4: Change the initial value in the fneu_once test from large positive to large negative. Change check in get_iteration from nir_op_fsub to nir_op_fadd. Both changes from discussion with M Henning. Also add some more explanation in fneu_once. v5: Rename test cases. Fixes: `6772a17acc` ("nir: Add a loop analysis pass") Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19732> (cherry picked from commit `f75c83c4aa`)	2022-11-23 19:11:59 +00:00
Ian Romanick	aee1c4ca00	nir/loop_analyze: Fix get_iteration for nir_op_ine I discovered this problem because adding an algebraic transformation to convert some uge and ult to ieq or ine caused a couple loops to stop unrolling. Consider the loop: uint i = 0; while (true) { if (i >= 1) break; i++; } This loop clearly executes exactly one time. Note that uge(x, 1) is equivalent to ine(x, 0). Changing the condition to 'if (i != 0)' will also execute exactly one time. In the added test cases, uge_once correctly get an exact loop trip count of 1. Without the changes to nir_loop_analyze.c, the ine_once case detects a maximum loop trip count of zero and does not get an exact loop trip count. No changes in shader-db or fossil-db. v2: Move nir_op_fneu changes to a separate commit. v3: Rename test cases. Fixes: `6772a17acc` ("nir: Add a loop analysis pass") Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19732> (cherry picked from commit `d9f014401b`)	2022-11-23 19:11:59 +00:00
Lionel Landwerlin	4e0f9c36e0	Revert "nir/lower_shader_calls: put inserted instructions into a dummy block" This reverts commit `35d82ecf1e`. Cc: mesa-stable Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19820> (cherry picked from commit `e2dadda35f`)	2022-11-23 19:11:58 +00:00
Lionel Landwerlin	8f6c7cb351	nir/lower_shader_calls: wrap only jumps rather than entire code blocks Moving entire chunks of code into a dummy if block is causing issues in some situations. To work around the issue that we tried to fix in `35d82ecf1e` ("nir/lower_shader_calls: put inserted instructions into a dummy block") which is that we cannot cut and past a block of instruction that ends with a jump if there are more instruction behind where we're going to past. We can instead just wraps the jumps into dummy if blocks. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19820> (cherry picked from commit `3686d5a312`)	2022-11-23 19:11:58 +00:00
Lionel Landwerlin	b2b0770690	nir/lower_shader_calls: update metadata before validation Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19820> (cherry picked from commit `96d84e2a77`)	2022-11-23 19:11:58 +00:00
Ian Romanick	35c695882d	nir/range_analysis: Set higher default maximum for max_workgroup_count Fixes: `c2a81ebe19` ("nir: Add default unsigned upper bound configuration.") Closes: #7676 Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19835> (cherry picked from commit `2ba55ec504`)	2022-11-23 19:11:58 +00:00
Lionel Landwerlin	d0cc462008	nir/lower_explicit_io: fix metadata preserve This pass can insert if blocks, therefore no dominance/block_index for you. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19818> (cherry picked from commit `723b15fb75`)	2022-11-23 19:11:58 +00:00
Rhys Perry	df7dc583e7	nir/lower_bit_size: lower uadd_carry 8/16-bit uadd_carry can exist in SPIR-V. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7615 Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19473> (cherry picked from commit `da30fb5df7`)	2022-11-23 19:11:58 +00:00
Timothy Arceri	18a8b0a122	nir: fix typo in lower_double options handling Seems the intention was to check that both flags were not enabled instead we were checking that the floor flag was both set and not set so the result would always be false. Fixes: `3749a6ecd2` ("nir: honor lower_double options for ffloor and ffract") Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19642> (cherry picked from commit `34c52d8cb9`)	2022-11-17 14:05:03 +00:00
Gert Wollny	238c58e7d1	nir/algeraic_opt: use double options too for lowering ftrunc@64 ftrunc@64 also might need lowering on fp64 only, especially now that it might be introduced by nir_lower_int64. Fixes: `29da985682` nir/lower_int64: Enable lowering of 64-bit float to 64-bit integer conversions. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19657> (cherry picked from commit `917d992b32`)	2022-11-17 14:05:03 +00:00
Karol Herbst	5f5821232a	glsl: fix buffer texture type Fixes: `3ace6b968b` ("compiler/types: Add a texture type") Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19381> (cherry picked from commit `87526f79db`)	2022-11-17 14:05:02 +00:00
Caio Oliveira	f4a7f28608	nir: Don't reorder volatile intrinsics Fixes issue with "is helper invocation" that in recent SPIR-V is mapped to a volatile Load. The CSE was catching the loads before they were transformed in the new is_helper_invocation intrinsic (that is not reorderable). Fixes: `729df14e45` ("nir: Handle volatile semantics for loading HelperInvocation builtin") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: M Henning <drawoc@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19432> (cherry picked from commit `8ab628ab2e`)	2022-11-09 21:22:06 +00:00
Francisco Jerez	a7482cfa89	nir/lower_int64: Fix float16 to int64 conversions. Currently float16 to int64 conversions don't work correctly, because the "div" variable has an infinite value, since 2^32 isn't representable as a 16-bit float, which causes the result of of rem(x, div) to be NaN for all inputs, leading to an incorrect result. Since no values of magnitude greater than 2^32 are representable as a float16 we don't actually need to do the fdiv/frem operations, the conversion is equivalent to f2u32 with the result padded to 64 bits. Rework: * Jordan: Handle f16 in if/else rather than conditional Fixes: `936c58c8fc` ("nir: Extend nir_lower_int64() to support i2f/f2i lowering") Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19391> (cherry picked from commit `e14f85366e`)	2022-11-09 21:22:06 +00:00
Alex Brachet	7ba025d528	nir: Fix qsort comparator function `pred` is a pointer, for sufficiently large numbers these being cast to int were both > 0 regardless of the order of `data1` and `data2`. Fixes: `523a28d3fe` ("nir: add an instruction set API") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19539> (cherry picked from commit `c987a727a7`)	2022-11-09 21:22:06 +00:00
Illia Abernikhin	aa4ac5ff8b	utils: Merge util/debug.* into util/u_debug.* and remove util/debug.* Rename env_var_as_unsigned() -> debug_get_num_option(), because duplicate Rename env_var_as_bool() -> debug_get_bool_option(), because duplicate Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7177 Signed-off-by: Illia Abernikhin <illia.abernikhin@globallogic.com> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19336>	2022-11-02 07:25:39 +00:00
Alyssa Rosenzweig	2a6338722e	panfrost: Don't use nir_variable in the compilers More future proof, simpler, and works with early I/O lowering. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19456>	2022-11-02 04:22:06 +00:00
Kenneth Graunke	fde99747e9	nir: Drop infer_non_readable option for nir_opt_access() Everybody sets it to true now, and the only reason for the option to exist was to work around a bug that's now been fixed. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19162>	2022-11-02 03:42:04 +00:00
Alyssa Rosenzweig	45a111c21c	nir/opt_algebraic: Fuse c - a * b to FMA Algebraically it is clear that -(a * b) + c = (-a) * b + c = fma(-a, b, c) But this is not clear from the NIR ('fadd', ('fneg', ('fmul', a, b)), c) Add rules to handle this case specially. Note we don't necessarily want to solve this by pushing fneg into fmul, because the rule opt_algebraic (not the late part where FMA fusing happens) specifically pulls fneg out of fmul to push fneg up multiplication chains. Noticed in the big glmark2 "terrain" shader, which has a cycle count reduced by 22% on Mali-G57 thanks to having this pattern a ton and being FMA bound. BEFORE: 1249 inst, 16.015625 cycles, 16.015625 fma, ... 632 quadwords AFTER: 997 inst, 12.437500 cycles, .... 504 quadwords Results on the same shader on AGX are also quite dramatic: BEFORE: 1294 inst, 8600 bytes, 50 halfregs, ... AFTER: 1154 inst, 8040 bytes, 50 halfregs, ... Similar rules apply for fabs. v2: Use a loop over the bit sizes (suggested by Emma). shader-db on Valhall (open + small subset of closed), results on Bifrost are similar: total instructions in shared programs: 167975 -> 164970 (-1.79%) instructions in affected programs: 92642 -> 89637 (-3.24%) helped: 492 HURT: 25 helped stats (abs) min: 1.0 max: 252.0 x̄: 6.25 x̃: 3 helped stats (rel) min: 0.30% max: 20.18% x̄: 3.21% x̃: 2.91% HURT stats (abs) min: 1.0 max: 5.0 x̄: 2.80 x̃: 3 HURT stats (rel) min: 0.46% max: 9.09% x̄: 3.89% x̃: 3.37% 95% mean confidence interval for instructions value: -6.95 -4.68 95% mean confidence interval for instructions %-change: -3.08% -2.65% Instructions are helped. total cycles in shared programs: 10556.89 -> 10538.98 (-0.17%) cycles in affected programs: 265.56 -> 247.66 (-6.74%) helped: 88 HURT: 2 helped stats (abs) min: 0.015625 max: 3.578125 x̄: 0.20 x̃: 0 helped stats (rel) min: 0.65% max: 22.34% x̄: 5.65% x̃: 4.25% HURT stats (abs) min: 0.0625 max: 0.0625 x̄: 0.06 x̃: 0 HURT stats (rel) min: 8.33% max: 12.50% x̄: 10.42% x̃: 10.42% 95% mean confidence interval for cycles value: -0.28 -0.12 95% mean confidence interval for cycles %-change: -6.30% -4.30% Cycles are helped. total fma in shared programs: 1582.42 -> 1535.06 (-2.99%) fma in affected programs: 871.58 -> 824.22 (-5.43%) helped: 502 HURT: 9 helped stats (abs) min: 0.015625 max: 3.578125 x̄: 0.09 x̃: 0 helped stats (rel) min: 0.60% max: 25.00% x̄: 5.46% x̃: 4.82% HURT stats (abs) min: 0.015625 max: 0.0625 x̄: 0.03 x̃: 0 HURT stats (rel) min: 4.35% max: 12.50% x̄: 6.22% x̃: 4.35% 95% mean confidence interval for fma value: -0.11 -0.08 95% mean confidence interval for fma %-change: -5.58% -4.93% Fma are helped. total cvt in shared programs: 665.55 -> 665.95 (0.06%) cvt in affected programs: 61.72 -> 62.12 (0.66%) helped: 33 HURT: 43 helped stats (abs) min: 0.015625 max: 0.359375 x̄: 0.04 x̃: 0 helped stats (rel) min: 1.01% max: 25.00% x̄: 6.68% x̃: 4.35% HURT stats (abs) min: 0.015625 max: 0.109375 x̄: 0.04 x̃: 0 HURT stats (rel) min: 0.78% max: 38.46% x̄: 10.85% x̃: 6.90% 95% mean confidence interval for cvt value: -0.01 0.02 95% mean confidence interval for cvt %-change: 0.23% 6.24% Inconclusive result (value mean confidence interval includes 0). total quadwords in shared programs: 93376 -> 91736 (-1.76%) quadwords in affected programs: 25376 -> 23736 (-6.46%) helped: 169 HURT: 1 helped stats (abs) min: 8.0 max: 128.0 x̄: 9.75 x̃: 8 helped stats (rel) min: 1.52% max: 33.33% x̄: 8.35% x̃: 8.00% HURT stats (abs) min: 8.0 max: 8.0 x̄: 8.00 x̃: 8 HURT stats (rel) min: 25.00% max: 25.00% x̄: 25.00% x̃: 25.00% 95% mean confidence interval for quadwords value: -11.18 -8.11 95% mean confidence interval for quadwords %-change: -8.95% -7.36% Quadwords are helped. total threads in shared programs: 4697 -> 4701 (0.09%) threads in affected programs: 4 -> 8 (100.00%) helped: 4 HURT: 0 helped stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 helped stats (rel) min: 100.00% max: 100.00% x̄: 100.00% x̃: 100.00% 95% mean confidence interval for threads value: 1.00 1.00 95% mean confidence interval for threads %-change: 100.00% 100.00% Threads are helped. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Marek Ol<C5><A1><C3><A1>k <marek.olsak@amd.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> [v1] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19312>	2022-11-01 22:39:45 -04:00
Jason Ekstrand	15796bdd0e	nir/types: Add some asserts to glsl_get_struct_field() Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19430>	2022-11-01 14:48:41 +00:00
Rhys Perry	e6d26cb288	nir,ac/nir,aco,radv: replace has_input_*_amd with more general intrinsics Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19228>	2022-10-31 14:33:43 +00:00
Francisco Jerez	be6da31034	nir/lower_int64: Implement lowering of 64-bit integer to 64-bit float conversions. This involves computing the significand with a 64-bit precision type, and implementing the normalization and packing manually instead of relying on u2f32, since the significand can no longer be represented as a 32-bit integer. This fixes 64-bit integer to 64-bit float conversions on devices that support 64-bit float natively but lack 64-bit integer support, like Intel MTL hardware. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> (v1) Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19128>	2022-10-29 19:45:44 +00:00
Francisco Jerez	29da985682	nir/lower_int64: Enable lowering of 64-bit float to 64-bit integer conversions. The existing code for this appears to work okay for conversions involving 64-bit floats, relax the assert and enable the lowering path. This fixes 64-bit float to 64-bit integer integer conversions on devices that have native support for 64-bit floats but lack 64-bit integer support, like Intel MTL hardware. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19128>	2022-10-29 19:45:44 +00:00
Marek Olšák	0ac37b595a	nir: add nir_intrinsic_optimization_barrier_vgpr_amd for LLVM We need this for the MSAA resolve shader. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Mihai Preda <mhpreda@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19243>	2022-10-29 18:38:33 +00:00
Rob Clark	5d3895d13b	nir: Add way to create passthrough TCS without VS nir In the case of disk-cache hits, radeonsi no longer has the nir shader around. So add a way to create a passthrough TCS with just the VS output locations. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7567 Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19382>	2022-10-29 17:46:23 +00:00
Karol Herbst	e58c004870	nir/algebraic: add vec8/16 cmp lowering Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19150>	2022-10-29 10:31:39 +00:00
Karol Herbst	5efbef833a	nir/algebraic: generalize vector_cmp lowering Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19150>	2022-10-29 10:31:39 +00:00
Karol Herbst	f27e2234e1	nir/algebraic: support CL vector accessors Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19150>	2022-10-29 10:31:39 +00:00
Karol Herbst	1d6014f267	nir/algebraic: add 8 and 64 bit urol and uror lowering Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19150>	2022-10-29 10:31:39 +00:00
Mykhailo Skorokhodov	f8425e661a	glsl/meson: Add variable to export float64.glsl Signed-off-by: Mykhailo Skorokhodov <mykhailo.skorokhodov@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18854>	2022-10-28 10:08:50 +00:00
Mykhailo Skorokhodov	4692c66358	nir: Add assert in nir_lower_doubles Cc: mesa-stable Signed-off-by: Mykhailo Skorokhodov <mykhailo.skorokhodov@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18854>	2022-10-28 10:08:50 +00:00
Mykhailo Skorokhodov	e4b7bf1a6d	nir: Make lower_double_ops recognize SPIR-V mangling Signed-off-by: Mykhailo Skorokhodov <mykhailo.skorokhodov@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18854>	2022-10-28 10:08:50 +00:00
Alyssa Rosenzweig	63320c691a	nir/lower_idiv: Inline convert_instr_precise Now that we only have one convert_instr path, this is simpler. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19303>	2022-10-27 19:37:14 +00:00
Alyssa Rosenzweig	941c37c085	nir/lower_idiv: Remove imprecise_32bit_lowering NIR has two implementations of lower_idiv, keyed on the imprecise_32bit_lowering flag. This flag is misleading: the results when setting this flag "imprecise", they're completely wrong for some values. If a backend has a native implementation of umul_high, the correct path isn't that much more expensive. If it doesn't, it's substantially slower for highp integer divison... but in practice, non-constant highp integer division is pretty rare. After a painful migration of the tree, this code path has no more users. Remove it so nobody else gets the bright idea of using it again. Closes: #6555 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19303>	2022-10-27 19:37:14 +00:00
Daniel Schürmann	22534e0d1a	nir: add AMD RT traversal intrinsics These I/O intrinsics help to create an enclosed traversal shader. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19188>	2022-10-27 09:45:39 +00:00
Qiang Yu	3d6cce2e4c	nir: add two amd ngg lds base load intrinsics These two values are not known when compile for radeonsi. They are relocated when link/upload time. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18832>	2022-10-27 07:35:01 +00:00
Dave Airlie	6a29cb2654	nir/lower_bool_to_int32: add support for lowering functions. Change the function parameters to 32-bit. Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19291>	2022-10-26 21:47:29 +00:00
Lionel Landwerlin	117b32a594	nir/divergence_analysis: add missing desc_set_address_intel Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19320>	2022-10-26 21:09:20 +00:00
Lionel Landwerlin	edda5731c0	nir/divergence_analysis: add some missing RT intrinsics Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19320>	2022-10-26 21:09:20 +00:00
Jason Ekstrand	5e05d98848	nir: Unconditionally call nir_trim_vector in nir_lower_readonly_images_to_tex It will already short-circuit if the number of components matches. Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19301>	2022-10-26 17:11:44 +00:00
Jason Ekstrand	d9cf6de4a8	nir: Misc. style fixes to nir_lower_readonly_images_to_tex Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19301>	2022-10-26 17:11:44 +00:00
Jason Ekstrand	b684a603f1	nir: Use nir_shader_instructions_pass in nir_lower_readonly_images_to_tex nir_shader_lower_instructions is overkill and this makes the pass generally easier to understand. Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19301>	2022-10-26 17:11:44 +00:00

1 2 3 4 5 ...

7413 commits