fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-17 09:38:05 +02:00

Author	SHA1	Message	Date
Alyssa Rosenzweig	51db19f7a2	nir: Rename scoped_barrier -> barrier sed + ninja clang-format + fix up spacing for common code. If you are unhappy that I did not manually change the whitespace of your driver, you need to enable clang-format for it so the formatting would happen automatically. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24428>	2023-08-01 23:18:29 +00:00
M Henning	d4086be6bc	nv/codegen: Implement nir_op_fquantize2f16 Passes most of dEQP-VK.spirv_assembly.instruction.graphics.opquantize.* but not the too_small_* tests for some reason. (Tested on kepler.) Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24327>	2023-08-01 18:58:04 +00:00
Faith Ekstrand	6023943b81	nv50/ir: Run nir_divergence_analysis before out-of-SSA We don't actually use or need this information but it gets generated by nir_opt_non_uniform_access() and stale divergence information can cause out-of-SSA to assert in parallel copy lowering. Reviewed-by: M Henning <drawoc@darkrefraction.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24327>	2023-08-01 18:58:04 +00:00
George Ouzounoudis	df5d1ef2b5	nouveau/codegen: Fix compact patch varyings in case of NIR The code path was not implemented and an assert was reached. Reviewed-by: M Henning <drawoc@darkrefraction.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24327>	2023-08-01 18:58:03 +00:00
George Ouzounoudis	f453623255	nouveau/codegen: Handle nir op amul This came from CTS clipping tests with geometry shaders. Maybe can be done as a lowering operation instead. Reviewed-by: M Henning <drawoc@darkrefraction.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24327>	2023-08-01 18:58:03 +00:00
George Ouzounoudis	43b8da3a8b	nouveau/codegen: Support compact clip distances with arrayed_io Reviewed-by: M Henning <drawoc@darkrefraction.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24327>	2023-08-01 18:58:03 +00:00
Rebecca Mckeever	6990439eb8	nouveau/codegen: Set lower_device_index_to_zero This instructs NIR to lower DeviceIndex to zero, which is needed for a no-op implementation of VK_KHR_device_group. Signed-off-by: Rebecca Mckeever <rebecca.mckeever@collabora.com> Reviewed-by: M Henning <drawoc@darkrefraction.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24327>	2023-08-01 18:58:03 +00:00
Rebecca Mckeever	e2221a9cac	nouveau/codegen: Support nir_intrinsic_load_workgroup_id_zero_base Signed-off-by: Rebecca Mckeever <rebecca.mckeever@collabora.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24327>	2023-08-01 18:58:03 +00:00
Faith Ekstrand	1f60923b89	nouveau/nir: Implement support for compact arrays This is needed for clip and cull distances. Reviewed-by: M Henning <drawoc@darkrefraction.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24327>	2023-08-01 18:58:03 +00:00
M Henning	77acf89336	nv/codegen: Call nir_shader_gather_info We need this info to be up-to-date for slot assignment. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24327>	2023-08-01 18:58:03 +00:00
Faith Ekstrand	9a2d016021	nouveau: Allow GLSL_SAMPLER_DIM_SUBPASS* Reviewed-by: M Henning <drawoc@darkrefraction.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24327>	2023-08-01 18:58:03 +00:00
Faith Ekstrand	9cb70c6ee0	nv50/nir: Lower to scratch AFTER optimization Reviewed-by: M Henning <drawoc@darkrefraction.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24327>	2023-08-01 18:58:03 +00:00
Karol Herbst	8d7f682bdb	nv50/ir/nir: Fix zero source handling of tex instructions. For TXQ we know make sure that we at least add one source. If the nir instruction however didn't had any sources, we inserted a fake 0 source ending up with two 0s for TXQ. It's unclear to me if we have other ops where this would be necessary. Fixes: `85a31fa1fc` ("nv50/ir/nir: fix txq emission on MS textures") Signed-off-by: Karol Herbst <git@karolherbst.de> Acked-by: M Henning <drawoc@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24373>	2023-07-29 19:01:40 +00:00
Karol Herbst	85a31fa1fc	nv50/ir/nir: fix txq emission on MS textures In GL and a lot of Vulkan if we end up with either a lod or an ms index. Sadly in Vulkan we can end up with both and have to choose properly. For TXQ we have to emit a zero LOD. For TXF we have to emit the ms index. Fixes: `bb032d8b62` ("nv50/ir/nir: implement nir_instr_type_tex") Signed-off-by: Karol Herbst <git@karolherbst.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24343>	2023-07-27 22:49:05 +00:00
M Henning	c631635f43	nouveau: Drop tgsi support from nv50_ir_prog_info Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24175>	2023-07-21 02:40:35 +00:00
M Henning	1032d5c836	nv50/ir: Drop nir_jump_return handling This is always lowered before this point. Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23006>	2023-07-19 11:47:10 +00:00
Faith Ekstrand	259ba104f7	nv50/ir: Support vector movs nir_opt_mov and nir_op_vecN are only the same if the mov is only a single component. Otherwise the vec loop will try to access src[c] where c > 0 which breaks for nir_op_mov. It's uncommon but scalar back-ends can see vector movs so we need to handle this correctly. Fixes: `6513c675ad` ("nv50/ir/nir: implement nir_alu_instr handling") Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: M Henning <drawoc@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24167>	2023-07-15 15:53:40 +00:00
Faith Ekstrand	c136a22b60	nv50/ir: Convert to new-style NIR registers Shader-db results on Turing: total inst in shared programs : 11121531 -> 11121458 (-0.00%) total gpr in shared programs : 1848287 -> 1848425 (0.01%) total ugpr in shared programs : 0 -> 0 (0.00%) total local in shared programs : 27200 -> 27200 (0.00%) total shared in shared programs : 236476 -> 236476 (0.00%) total bytes in shared programs : 177944496 -> 177943328 (-0.00%) total cached in shared programs : 0 -> 0 (0.00%) inst gpr ugpr local shared bytes cached helped 470 50 0 0 0 470 0 hurt 327 197 0 0 0 327 0 Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24110>	2023-07-12 22:49:27 +00:00
Faith Ekstrand	73e191924c	nir: Add a reg_intrinsics flag to nir_convert_from_ssa It doesn't do anything yet. We leave that to the subsequent patches so we can keep the tree-wide refactor as simple as possible. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23089>	2023-07-12 01:34:27 +00:00
Rhys Perry	3d0e997e99	nir: split nir_lower_mov64 ACO will want to lower the conversions, but preserve the bcsels. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23926>	2023-07-03 10:38:27 +00:00
Karol Herbst	02aaf58908	nv50/ir/nir: set numBarriers if we emit an OP_BAR Even though the field is called `numBarriers` we set it to 1 just like we do with TGSI. It's unknown on what's the proper behavior here is. But without this set the GPU will complain to us loudly, so this silences at least that. Fixes: `a2d7a4f978` ("nv50/ir: convert to scoped_barrier") Signed-off-by: Karol Herbst <git@karolherbst.de> Reviewed-by: M Henning <drawoc@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23749>	2023-06-24 02:12:14 +00:00
Caio Oliveira	59cc77f0fa	compiler: Move from nir_scope to mesa_scope Just moving the enum and performing renames, no behavior change. Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23328>	2023-06-19 23:29:26 +00:00
Alyssa Rosenzweig	1d4a59448c	treewide: Remove use_scoped_barrier It is now set by all relevant drivers and not checked anywhere. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23191>	2023-06-13 16:36:10 +00:00
Emma Anholt	c3cbe610df	nouveau: Delete the NV50_PROG_USE_TGSI env var. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23114>	2023-06-12 17:37:54 +00:00
Karol Herbst	a2d7a4f978	nv50/ir: convert to scoped_barrier Contrary to how we implemented barriers the MEMBAR instruction actually does not allow us to specify which memory to synchronize. We can only specify the scope. No regressions on TU102. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: M Henning <drawoc@darkrefraction.com> Signed-off-by: Karol Herbst <git@karolherbst.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23208>	2023-06-07 09:57:24 +00:00
Yonggang Luo	12256136e0	compiler: Rename shader_prim to mesa_prim and replace all usage of pipe_prim_type with mesa_prim This is a prepare step to remove depends on p_defines.h in src/util/* This is done by: replace pipe_prim_type with mesa_prim replace shader_prim with mesa_prim replace PIPE_PRIM_MAX with MESA_PRIM_COUNT replace SHADER_PRIM_ with MESA_PRIM_ replace PIPE_PRIM_ with MESA_PRIM_ This patch only replace code only Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23369>	2023-06-03 03:29:03 +00:00
Alyssa Rosenzweig	ecd295bb8b	treewide: Avoid nir_lower_regs_to_ssa calls nir_registers are only supposed to be used temporarily. They may be created by a producer, but then must be immediately lowered prior to optimizing the produced shader. They may be created internally by an optimization pass that doesn't want to deal with phis, but that pass needs to lower them back to phis immediately. Finally they may be created when going out-of-SSA if a backend chooses, but that has to happen late. Regardless, there should be no case where a backend sees a shader that comes in with nir_registers needing to be lowered. The two frontend producers of registers (tgsi_to_nir and mesa/st) both call nir_lower_regs_to_ssa to clean up as they should. Some backend (like intel) already depend on this behaviour. There's no need for other backends to call nir_lower_regs_to_ssa too. Drop the pointless calls as a baby step towards replacing nir_register. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23181>	2023-05-24 17:30:03 +00:00
Alyssa Rosenzweig	c323762f9f	treewide: Stop lowering legacy atomics There are no more producers of legacy atomics so these calls are inert. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23036>	2023-05-16 22:36:21 +00:00
Karol Herbst	6ff97776b7	nv50/ir: Use unified atomics Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: M Henning <drawoc@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22914>	2023-05-12 20:39:46 +00:00
M Henning	cabbbbf0af	nouveau/nir: Set isSigned on all atomic_imax/imin Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22889>	2023-05-08 18:57:14 +00:00
Faith Ekstrand	d1e565a8eb	nouveau/nir: image_samples/size don't have coordinates Without this, it treats the src[1] as a coordinate (it's actually LOD) and may try to read more than one component. I don't think this usually hurts anything as the coordinate should get ignored later but it can result in OOB memory reads while translating NIR. Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22834>	2023-05-03 19:52:09 +00:00
M Henning	d7e37389bc	nv50/codegen: Set lower_uniforms_to_ubo Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22815>	2023-05-03 19:18:08 +00:00
M Henning	d49c7b9582	nouveau/codegen: Check nir_dest_num_components instead of reaching into a union and pulling out garbage when the dest is a reg Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8863 Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22674>	2023-04-25 18:17:41 +00:00
Karol Herbst	7cfb8cb1a5	nv50/ir: ignore CL system values Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19712>	2023-04-14 07:41:54 +00:00
Emma Anholt	7c57061b77	nouveau: Enable frexp lowering in the backend. This would be desired for NVK using this backend, but also for getting lowering out of the GLSL frontend. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22083>	2023-04-06 02:32:01 +00:00
Emma Anholt	3a336a8ffd	nouveau: Add missing nir_opt_algebraic_late. This was needed for nir_lower_frexp, but it's a win anyway. shader-db results: total gpr in shared programs: 1143621 -> 1143502 (-0.01%) gpr in affected programs: 33918 -> 33799 (-0.35%) total instructions in shared programs: 7829415 -> 7820124 (-0.12%) instructions in affected programs: 1204967 -> 1195676 (-0.77%) total bytes in shared programs: 71802760 -> 71717352 (-0.12%) bytes in affected programs: 11031888 -> 10946480 (-0.77%) Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22083>	2023-04-06 02:32:01 +00:00
Daniel Schürmann	2bb369dd8d	nir: add assertions that loops don't have a Continue Construct Hoping that I didn't miss any, this should add assertions to all functions and passes which explicitly handle 'nir_loop'. Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13962>	2023-02-21 10:41:11 +00:00
Ian Romanick	ea413e826b	nir: Eliminate nir_op_f2b Builds on the work of !15121. This gets to delete even more code because many drivers shared a lot of code for i2b and f2b. No shader-db or fossil-db changes on any Intel platform. v2: Rebase on `1a35acd8d9`. v3: Update a comment in nir_opcodes_c.py. Suggested by Konstantin. v4: Another rebase. Remove f2b stuff from Midgard. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20509>	2023-02-03 22:39:57 +00:00
Ian Romanick	eb76cee9f8	nir: Eliminate nir_op_i2b There are a lot of optimizations in opt_algebraic that match ('ine', a, 0), but there are almost none that match i2b. Instead of adding a huge pile of additional patterns (including variations that include both ine and i2b), always lower i2b to a != 0. At this point in the series, it should be impossible for anything to generate i2b, so there /should not/ be any changes. The failing test on d3d12 is a pre-existing bug that is triggered by this change. I talked to Jesse about it, and, after some analysis, he suggested just adding it to the list of known failures. v2: Don't rematerialize i2b instructions in dxil_nir_lower_x2b. v3: Don't rematerialize i2b instructions in zink_nir_algebraic.py. v4: Fix zink-on-TGL CI failures by calling nir_opt_algebraic after nir_lower_doubles makes progress. The latter can generate b2i instructions, but nir_lower_int64 can't handle them (anymore). v5: Add back most of the hunk at line 2125 of nir_opt_algebraic.py. I had accidentally removed the f2b(bf2(x)) optimization. v6: Just eliminate the i2b instruction. v7: Remove missed i2b32 in midgard_compile.c. Remove (now unused) emit_alu_i2orf2_b1 function from sfn_instr_alu.cpp. Previously this function was still used. 🤷 No shader-db changes on any Intel platform. All Intel platforms had similar results. (Ice Lake shown) Instructions in all programs: 141165875 -> 141165873 (-0.0%) Instructions helped: 2 Cycles in all programs: 9098956382 -> 9098956350 (-0.0%) Cycles helped: 2 The two Vulkan shaders are helped because of the "new" (('b2i32', ('ine', ('ubfe', a, b, 1), 0)), ('ubfe', a, b, 1)) algebraic pattern. Acked-by: Jesse Natalie <jenatali@microsoft.com> [earlier version] Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Tested-by: Daniel Schürmann <daniel@schuermann.dev> [earlier version] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15121>	2022-12-14 06:23:21 +00:00
Yusuf Khan	2c5b1d0e3b	nv50/ir: Support fmulz and ffmaz Signed-off-by: Yusuf Khan <yusisamerican@gmail.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19333>	2022-11-08 21:10:08 +00:00
Yusuf Khan	47251d2852	nv50/ir: add prefer_nir flag for getting compiler options So that we dont expose certain options for nir_to_tgsi Signed-off-by: Yusuf Khan <yusiamerican@gmail.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19333>	2022-11-08 21:10:08 +00:00
Joan Bruguera	6014a642ae	nv50/ir/nir: ignore sampler for TXF/TXQ ops. Recently, a regression was reported where videos in Firefox had shifted/ glitched colors on certain Kepler hardware. This was bisected to `bf02bffe15`, however, the issue already existed but didn't hit users until TGSI was switched to NIR as default. The issue was traced to a YUV-to-RGB fragment shader used by Firefox, which uses three samplers for the Y/U/V components. The Y component was handled correctly, but the U/V components were bogus, causing the issue. After analysis, it appears the TXF/TXQ ops. should only handle the texture (r) but not the sampler (s), see `63b850403c` and `346ce0b988`. Similarly, handleTXQ/handleTXF on nv50_ir_from_tgsi always sets s=0. Only Kepler was affected because other hardware ignores s at codegen. Always set s=0 on NIR for TXF/TXQ, to keep TGSI behavior and fix the regression. Thanks: Karol Herbst and M Henning for help diagnosing the issue. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7416 Cc: mesa-stable Suggested-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: M Henning <drawoc@darkrefraction.com> Signed-off-by: Joan Bruguera <joanbrugueram@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19453>	2022-11-02 12:29:34 +00:00
Alyssa Rosenzweig	941c37c085	nir/lower_idiv: Remove imprecise_32bit_lowering NIR has two implementations of lower_idiv, keyed on the imprecise_32bit_lowering flag. This flag is misleading: the results when setting this flag "imprecise", they're completely wrong for some values. If a backend has a native implementation of umul_high, the correct path isn't that much more expensive. If it doesn't, it's substantially slower for highp integer divison... but in practice, non-constant highp integer division is pretty rare. After a painful migration of the tree, this code path has no more users. Remove it so nobody else gets the bright idea of using it again. Closes: #6555 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19303>	2022-10-27 19:37:14 +00:00
Yusuf Khan	d9a257b339	nv50/ir: nir_op_b2i8 and nir_op_b2i16 Signed-off-by: Yusuf Khan <yusisamerican@gmail.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19256>	2022-10-27 02:16:24 +00:00
Thomas Debesse	6d5921c623	nv50: call nir_lower_flrp Fix #7432: unknown nir_op flrp assertion This copy-pastes src/gallium/drivers/radeonsi/si_shader_nir.c The lower_flrp16 value differs given chipset >= NVISA_GV100_CHIPSET. Signed-off-by: Thomas Debesse <dev@illwieckz.net> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19003>	2022-10-10 17:22:49 +00:00
Danilo Krummrich	6a9825bc1b	nv50/ir/nir: always round towards zero for f2i/f2u Conversions to integers must be rounded towards zero, hence, actually do this for all integers including 8/16 bit sources. Reviewed-by: Karol Herbst <kherbst@redhat.com> Signed-off-by: Danilo Krummrich <dakr@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18109>	2022-09-09 17:32:18 +02:00
Danilo Krummrich	109d56f612	nv50/ir/nir: convert 8/16 bit src to 32 bit for {i,u}2f64 Converting signed and unsigned integers from 8/16 bit sources to a 64 bit floating point destination (i2f64 / u2f64) isn't possible, hence convert the source to 32 bit first. Reviewed-by: Karol Herbst <kherbst@redhat.com> Signed-off-by: Danilo Krummrich <dakr@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18109>	2022-09-09 17:32:16 +02:00
Danilo Krummrich	ec60dcd870	nv50/ir/nir: avoid 8/16 bit dest regs for OP_MOV Instructions like mov u16 %r78s 0x00ff (0) are dropped, since they're not supported by the HW, hence avoid using 8/16 bit destination registers for OP_MOV and use the full width of the register instead. Reviewed-by: Karol Herbst <kherbst@redhat.com> Signed-off-by: Danilo Krummrich <dakr@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18109>	2022-09-09 17:32:10 +02:00
Danilo Krummrich	6e2fda15f1	nv50/ir/nir: convert to 32 bit for all OP_SET opcodes The 'set' instruction does distinguish between signed and unsigned, but always treats values as 32 bit. For singed values < 0 with a bit width smaller than 32 bit this falsely results in treating it as a positive value. Reviewed-by: Karol Herbst <kherbst@redhat.com> Signed-off-by: Danilo Krummrich <dakr@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18109>	2022-09-09 17:32:05 +02:00
Danilo Krummrich	cd53bcd325	nv50/ir/nir: add conversion ops for bit width < 32 Reviewed-by: Karol Herbst <kherbst@redhat.com> Signed-off-by: Danilo Krummrich <dakr@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18109>	2022-09-09 17:31:57 +02:00

1 2

59 commits