fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-17 20:28:05 +02:00

Author	SHA1	Message	Date
Rhys Perry	3d0e997e99	nir: split nir_lower_mov64 ACO will want to lower the conversions, but preserve the bcsels. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23926>	2023-07-03 10:38:27 +00:00
Karol Herbst	02aaf58908	nv50/ir/nir: set numBarriers if we emit an OP_BAR Even though the field is called `numBarriers` we set it to 1 just like we do with TGSI. It's unknown on what's the proper behavior here is. But without this set the GPU will complain to us loudly, so this silences at least that. Fixes: `a2d7a4f978` ("nv50/ir: convert to scoped_barrier") Signed-off-by: Karol Herbst <git@karolherbst.de> Reviewed-by: M Henning <drawoc@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23749>	2023-06-24 02:12:14 +00:00
Caio Oliveira	59cc77f0fa	compiler: Move from nir_scope to mesa_scope Just moving the enum and performing renames, no behavior change. Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23328>	2023-06-19 23:29:26 +00:00
Karol Herbst	c9a00d6676	nv50/ir: resolve -Woverloaded-virtual=1 warnings Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23656>	2023-06-15 18:48:10 +00:00
Karol Herbst	6c73c6cec6	nv50/ir: use override Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23656>	2023-06-15 18:48:10 +00:00
Alyssa Rosenzweig	1d4a59448c	treewide: Remove use_scoped_barrier It is now set by all relevant drivers and not checked anywhere. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23191>	2023-06-13 16:36:10 +00:00
Emma Anholt	c3cbe610df	nouveau: Delete the NV50_PROG_USE_TGSI env var. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23114>	2023-06-12 17:37:54 +00:00
Karol Herbst	a2d7a4f978	nv50/ir: convert to scoped_barrier Contrary to how we implemented barriers the MEMBAR instruction actually does not allow us to specify which memory to synchronize. We can only specify the scope. No regressions on TU102. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: M Henning <drawoc@darkrefraction.com> Signed-off-by: Karol Herbst <git@karolherbst.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23208>	2023-06-07 09:57:24 +00:00
Yonggang Luo	12256136e0	compiler: Rename shader_prim to mesa_prim and replace all usage of pipe_prim_type with mesa_prim This is a prepare step to remove depends on p_defines.h in src/util/* This is done by: replace pipe_prim_type with mesa_prim replace shader_prim with mesa_prim replace PIPE_PRIM_MAX with MESA_PRIM_COUNT replace SHADER_PRIM_ with MESA_PRIM_ replace PIPE_PRIM_ with MESA_PRIM_ This patch only replace code only Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23369>	2023-06-03 03:29:03 +00:00
Alyssa Rosenzweig	ecd295bb8b	treewide: Avoid nir_lower_regs_to_ssa calls nir_registers are only supposed to be used temporarily. They may be created by a producer, but then must be immediately lowered prior to optimizing the produced shader. They may be created internally by an optimization pass that doesn't want to deal with phis, but that pass needs to lower them back to phis immediately. Finally they may be created when going out-of-SSA if a backend chooses, but that has to happen late. Regardless, there should be no case where a backend sees a shader that comes in with nir_registers needing to be lowered. The two frontend producers of registers (tgsi_to_nir and mesa/st) both call nir_lower_regs_to_ssa to clean up as they should. Some backend (like intel) already depend on this behaviour. There's no need for other backends to call nir_lower_regs_to_ssa too. Drop the pointless calls as a baby step towards replacing nir_register. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23181>	2023-05-24 17:30:03 +00:00
Alyssa Rosenzweig	c323762f9f	treewide: Stop lowering legacy atomics There are no more producers of legacy atomics so these calls are inert. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23036>	2023-05-16 22:36:21 +00:00
Karol Herbst	6ff97776b7	nv50/ir: Use unified atomics Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: M Henning <drawoc@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22914>	2023-05-12 20:39:46 +00:00
M Henning	cabbbbf0af	nouveau/nir: Set isSigned on all atomic_imax/imin Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22889>	2023-05-08 18:57:14 +00:00
Faith Ekstrand	d1e565a8eb	nouveau/nir: image_samples/size don't have coordinates Without this, it treats the src[1] as a coordinate (it's actually LOD) and may try to read more than one component. I don't think this usually hurts anything as the coordinate should get ignored later but it can result in OOB memory reads while translating NIR. Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22834>	2023-05-03 19:52:09 +00:00
M Henning	d7e37389bc	nv50/codegen: Set lower_uniforms_to_ubo Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22815>	2023-05-03 19:18:08 +00:00
M Henning	d49c7b9582	nouveau/codegen: Check nir_dest_num_components instead of reaching into a union and pulling out garbage when the dest is a reg Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8863 Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22674>	2023-04-25 18:17:41 +00:00
Karol Herbst	7cfb8cb1a5	nv50/ir: ignore CL system values Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19712>	2023-04-14 07:41:54 +00:00
Emma Anholt	7c57061b77	nouveau: Enable frexp lowering in the backend. This would be desired for NVK using this backend, but also for getting lowering out of the GLSL frontend. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22083>	2023-04-06 02:32:01 +00:00
Emma Anholt	3a336a8ffd	nouveau: Add missing nir_opt_algebraic_late. This was needed for nir_lower_frexp, but it's a win anyway. shader-db results: total gpr in shared programs: 1143621 -> 1143502 (-0.01%) gpr in affected programs: 33918 -> 33799 (-0.35%) total instructions in shared programs: 7829415 -> 7820124 (-0.12%) instructions in affected programs: 1204967 -> 1195676 (-0.77%) total bytes in shared programs: 71802760 -> 71717352 (-0.12%) bytes in affected programs: 11031888 -> 10946480 (-0.77%) Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22083>	2023-04-06 02:32:01 +00:00
Michel Dänzer	ff73392774	nouveau: Make getSize return unsigned int This matches the type of the underlying size member, and is consistent with other getSize methods. Avoids compiler warning with LTO enabled: In member function '__ct ', inlined from 'convertToSSA' at ../src/nouveau/codegen/nv50_ir_ssa.cpp:401:26, inlined from 'convertToSSA' at ../src/nouveau/codegen/nv50_ir_ssa.cpp:310:28, inlined from 'nv50_ir_generate_code' at ../src/nouveau/codegen/nv50_ir.cpp:1331:22: ../src/nouveau/codegen/nv50_ir_ssa.cpp:407:48: error: argument 1 value '18446744073709551615' exceeds maximum object size 9223372036854775807 [-Werror=alloc-size-larger-than=] 407 \| stack = new Stack[func->allLValues.getSize()]; \| ^ /usr/include/c++/12/new: In function 'nv50_ir_generate_code': /usr/include/c++/12/new:128:26: note: in a call to allocation function 'operator new []' declared here 128 \| _GLIBCXX_NODISCARD void* operator new[](std::size_t) _GLIBCXX_THROW (std::bad_alloc) \| ^ Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21781>	2023-03-17 16:08:33 +00:00
Daniel Schürmann	2bb369dd8d	nir: add assertions that loops don't have a Continue Construct Hoping that I didn't miss any, this should add assertions to all functions and passes which explicitly handle 'nir_loop'. Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13962>	2023-02-21 10:41:11 +00:00
Ian Romanick	ea413e826b	nir: Eliminate nir_op_f2b Builds on the work of !15121. This gets to delete even more code because many drivers shared a lot of code for i2b and f2b. No shader-db or fossil-db changes on any Intel platform. v2: Rebase on `1a35acd8d9`. v3: Update a comment in nir_opcodes_c.py. Suggested by Konstantin. v4: Another rebase. Remove f2b stuff from Midgard. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20509>	2023-02-03 22:39:57 +00:00
Ian Romanick	eb76cee9f8	nir: Eliminate nir_op_i2b There are a lot of optimizations in opt_algebraic that match ('ine', a, 0), but there are almost none that match i2b. Instead of adding a huge pile of additional patterns (including variations that include both ine and i2b), always lower i2b to a != 0. At this point in the series, it should be impossible for anything to generate i2b, so there /should not/ be any changes. The failing test on d3d12 is a pre-existing bug that is triggered by this change. I talked to Jesse about it, and, after some analysis, he suggested just adding it to the list of known failures. v2: Don't rematerialize i2b instructions in dxil_nir_lower_x2b. v3: Don't rematerialize i2b instructions in zink_nir_algebraic.py. v4: Fix zink-on-TGL CI failures by calling nir_opt_algebraic after nir_lower_doubles makes progress. The latter can generate b2i instructions, but nir_lower_int64 can't handle them (anymore). v5: Add back most of the hunk at line 2125 of nir_opt_algebraic.py. I had accidentally removed the f2b(bf2(x)) optimization. v6: Just eliminate the i2b instruction. v7: Remove missed i2b32 in midgard_compile.c. Remove (now unused) emit_alu_i2orf2_b1 function from sfn_instr_alu.cpp. Previously this function was still used. 🤷 No shader-db changes on any Intel platform. All Intel platforms had similar results. (Ice Lake shown) Instructions in all programs: 141165875 -> 141165873 (-0.0%) Instructions helped: 2 Cycles in all programs: 9098956382 -> 9098956350 (-0.0%) Cycles helped: 2 The two Vulkan shaders are helped because of the "new" (('b2i32', ('ine', ('ubfe', a, b, 1), 0)), ('ubfe', a, b, 1)) algebraic pattern. Acked-by: Jesse Natalie <jenatali@microsoft.com> [earlier version] Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Tested-by: Daniel Schürmann <daniel@schuermann.dev> [earlier version] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15121>	2022-12-14 06:23:21 +00:00
Ben Skeggs	3a94b3b2a7	gv100/ir: noop OP_BAR for now Let's get stuff rolling and deal with figuring this out later. Acked-by: M Henning <drawoc@darkrefraction.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17633>	2022-11-09 21:21:22 +00:00
Yusuf Khan	2c5b1d0e3b	nv50/ir: Support fmulz and ffmaz Signed-off-by: Yusuf Khan <yusisamerican@gmail.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19333>	2022-11-08 21:10:08 +00:00
Yusuf Khan	47251d2852	nv50/ir: add prefer_nir flag for getting compiler options So that we dont expose certain options for nir_to_tgsi Signed-off-by: Yusuf Khan <yusiamerican@gmail.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19333>	2022-11-08 21:10:08 +00:00
Joan Bruguera	6014a642ae	nv50/ir/nir: ignore sampler for TXF/TXQ ops. Recently, a regression was reported where videos in Firefox had shifted/ glitched colors on certain Kepler hardware. This was bisected to `bf02bffe15`, however, the issue already existed but didn't hit users until TGSI was switched to NIR as default. The issue was traced to a YUV-to-RGB fragment shader used by Firefox, which uses three samplers for the Y/U/V components. The Y component was handled correctly, but the U/V components were bogus, causing the issue. After analysis, it appears the TXF/TXQ ops. should only handle the texture (r) but not the sampler (s), see `63b850403c` and `346ce0b988`. Similarly, handleTXQ/handleTXF on nv50_ir_from_tgsi always sets s=0. Only Kepler was affected because other hardware ignores s at codegen. Always set s=0 on NIR for TXF/TXQ, to keep TGSI behavior and fix the regression. Thanks: Karol Herbst and M Henning for help diagnosing the issue. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7416 Cc: mesa-stable Suggested-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: M Henning <drawoc@darkrefraction.com> Signed-off-by: Joan Bruguera <joanbrugueram@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19453>	2022-11-02 12:29:34 +00:00
Jason Ekstrand	d0c9ab529e	nouveau/codegen: Support bindless texture queries Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19431>	2022-11-01 15:29:24 +00:00
Alyssa Rosenzweig	941c37c085	nir/lower_idiv: Remove imprecise_32bit_lowering NIR has two implementations of lower_idiv, keyed on the imprecise_32bit_lowering flag. This flag is misleading: the results when setting this flag "imprecise", they're completely wrong for some values. If a backend has a native implementation of umul_high, the correct path isn't that much more expensive. If it doesn't, it's substantially slower for highp integer divison... but in practice, non-constant highp integer division is pretty rare. After a painful migration of the tree, this code path has no more users. Remove it so nobody else gets the bright idea of using it again. Closes: #6555 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19303>	2022-10-27 19:37:14 +00:00
Yusuf Khan	d9a257b339	nv50/ir: nir_op_b2i8 and nir_op_b2i16 Signed-off-by: Yusuf Khan <yusisamerican@gmail.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19256>	2022-10-27 02:16:24 +00:00
Thomas Debesse	6d5921c623	nv50: call nir_lower_flrp Fix #7432: unknown nir_op flrp assertion This copy-pastes src/gallium/drivers/radeonsi/si_shader_nir.c The lower_flrp16 value differs given chipset >= NVISA_GV100_CHIPSET. Signed-off-by: Thomas Debesse <dev@illwieckz.net> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19003>	2022-10-10 17:22:49 +00:00
Adam Jackson	14c6f716b4	nouveau: const cleanup Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18586>	2022-09-16 14:23:47 +00:00
Emma Anholt	873365caee	nouveau: Fix compiler warnings about silly address checks in ir_print. in/out/sv are arrays, so &array[i] is a non-null pointer. Presumably numSysVals/Inputs/Outputs are only incremented when there's data in the arrays, anyway. Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18536>	2022-09-13 18:39:07 +00:00
Danilo Krummrich	b97590371a	nv50/ir: handle U8/U16 integers converting to U64 We can't directly convert from unsigned integers smaller than 64 bit to unsigned 64 bit integers. Hence, converting from 32 bit to 64 bit is handled by just merging with 0. To support U8/U16 integers handle them just the same way. Reviewed-by: Karol Herbst <kherbst@redhat.com> Signed-off-by: Danilo Krummrich <dakr@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18109>	2022-09-09 17:32:30 +02:00
Danilo Krummrich	caba679e56	nv50/ir: handle S8/S16 integers converting to S64 We can't convert directly from signed integers smaller 64 bit to signed 64 bit integers. For 32 bit integers this is handled with SHR and MERGE. In order to also support 8/16 bit singed integers convert them to 32 bit first. Reviewed-by: Karol Herbst <kherbst@redhat.com> Signed-off-by: Danilo Krummrich <dakr@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18109>	2022-09-09 17:32:27 +02:00
Danilo Krummrich	2aaa315eee	nv50/ir: split and cvt 64bit integers for {i,u}2{i,u}{8,16} We can't convert from a 64 bit integer to any integer smaller than 64 bit directly, hence split the value first and then cvt / mov to the target type. Reviewed-by: Karol Herbst <kherbst@redhat.com> Signed-off-by: Danilo Krummrich <dakr@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18109>	2022-09-09 17:32:24 +02:00
Danilo Krummrich	8ccba4ea5c	nv50/ir: add intermediate conversion for f2{i,u}{8,16} Directly converting from a float to an 8 bit integer and from a 64 bit float to an integer smaller than 32 bit is not supported, therefore add an intermediate conversion to an 32 bit integer. Reviewed-by: Karol Herbst <kherbst@redhat.com> Signed-off-by: Danilo Krummrich <dakr@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18109>	2022-09-09 17:32:22 +02:00
Danilo Krummrich	6a9825bc1b	nv50/ir/nir: always round towards zero for f2i/f2u Conversions to integers must be rounded towards zero, hence, actually do this for all integers including 8/16 bit sources. Reviewed-by: Karol Herbst <kherbst@redhat.com> Signed-off-by: Danilo Krummrich <dakr@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18109>	2022-09-09 17:32:18 +02:00
Danilo Krummrich	109d56f612	nv50/ir/nir: convert 8/16 bit src to 32 bit for {i,u}2f64 Converting signed and unsigned integers from 8/16 bit sources to a 64 bit floating point destination (i2f64 / u2f64) isn't possible, hence convert the source to 32 bit first. Reviewed-by: Karol Herbst <kherbst@redhat.com> Signed-off-by: Danilo Krummrich <dakr@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18109>	2022-09-09 17:32:16 +02:00
Danilo Krummrich	78fc5e3773	nv50/ir: add isUnsignedIntType() and isIntType() helpers Add helper functions to check whether a DataType is an unsigned integer type and whether a DataType is either an unsigned or signed integer type. Reviewed-by: Karol Herbst <kherbst@redhat.com> Signed-off-by: Danilo Krummrich <dakr@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18109>	2022-09-09 17:32:13 +02:00
Danilo Krummrich	ec60dcd870	nv50/ir/nir: avoid 8/16 bit dest regs for OP_MOV Instructions like mov u16 %r78s 0x00ff (0) are dropped, since they're not supported by the HW, hence avoid using 8/16 bit destination registers for OP_MOV and use the full width of the register instead. Reviewed-by: Karol Herbst <kherbst@redhat.com> Signed-off-by: Danilo Krummrich <dakr@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18109>	2022-09-09 17:32:10 +02:00
Danilo Krummrich	6e2fda15f1	nv50/ir/nir: convert to 32 bit for all OP_SET opcodes The 'set' instruction does distinguish between signed and unsigned, but always treats values as 32 bit. For singed values < 0 with a bit width smaller than 32 bit this falsely results in treating it as a positive value. Reviewed-by: Karol Herbst <kherbst@redhat.com> Signed-off-by: Danilo Krummrich <dakr@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18109>	2022-09-09 17:32:05 +02:00
Danilo Krummrich	cd53bcd325	nv50/ir/nir: add conversion ops for bit width < 32 Reviewed-by: Karol Herbst <kherbst@redhat.com> Signed-off-by: Danilo Krummrich <dakr@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18109>	2022-09-09 17:31:57 +02:00
Karol Herbst	b23b94fbc9	nv50/ir: fix OP_UNION resolving when used for vector values When an OP_UNION def takes part in a vector source e.g. for a tex instruction we failed to clean up the OP_UNION instruction as rep() points towards the coalesced value instead. This fixes a regression on nv50 moving to NIR, but also potentially issues with nvc0. The main reason this is common in nv50 is, that we lower OP_SLCT to a set, predicated movs and a union. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6406 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7117 Cc: mesa-stable Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: M Henning <drawoc@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18377>	2022-09-08 11:35:35 +00:00
M Henning	f90f04d501	nv/nir: Set ssbo CacheMode from intrinsic access Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18354>	2022-09-04 20:32:30 +00:00
Pierre Moreau	16b07b342d	nv50/nir: A group barrier is CTA-level not global-level Reviewed-by: Karol Herbst <kherbst@redhat.com> Signed-off-by: Pierre Moreau <dev@pmoreau.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10711>	2022-08-23 18:29:44 +00:00
Pierre Moreau	9236af8b6c	nv50/ir: Avoid generating splits of splits Among others, it would result in the spill offsets being wrong due to being relative to the parent split and not absolute. For example when computing a 64-bit multiply on Tesla (which only supports 16-bit mul in hardware), the sources will first be split into 32-bit values and then a second time down to 16-bit ones. Looking at the first source, the spill offsets ended being computed as follows: { .hihi = +2, .hilo = +0, .lohi = +2, .lolo = +0 } instead of the expected { .hihi = +6, .hilo = +4, .lohi = +2, .lolo = +0 } This is resolved with this patch. Signed-off-by: Pierre Moreau <dev@pmoreau.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10711>	2022-08-23 18:29:44 +00:00
Pierre Moreau	b327f46e45	nv50/ra: Fix the offset computation for compounds compMask is expressed in terms of colours, not bytes, where on Tesla we have 1 colour per 16-bit (whereas it is 1 per 32-bit for later architectures). By multiplying by units we will get back to a result in bytes. Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Pierre Moreau <dev@pmoreau.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10711>	2022-08-23 18:29:44 +00:00
Pierre Moreau	4d892829f3	nv50/peephole: Disallow combining sub 4-byte ld/st for now Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Pierre Moreau <dev@pmoreau.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10711>	2022-08-23 18:29:44 +00:00
Pierre Moreau	81828284b2	nv50/ir: Handle non-32-bit values when cst folding SPLIT Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Pierre Moreau <dev@pmoreau.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10711>	2022-08-23 18:29:44 +00:00

1 2

60 commits