fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-18 13:48:06 +02:00

Author	SHA1	Message	Date
Connor Abbott	0977925c53	nir, spirv: Add support for VK_EXT_fragment_density_map This involves two new system values. Reviewed-by: Faith Ekstrand <faith@gfxstrand.net> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20303>	2023-04-04 13:14:35 +00:00
Timur Kristóf	e42d2bd534	nir: Gather compile time constant task->mesh dispatch size. Some GPUs such as AMD RDNA3 can use this information to optimize mesh shader dispatches. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22222>	2023-04-03 15:36:02 +00:00
antonino	15b3d77b40	nir: only handle flat interpolation when needed in `nir_create_passthrough_gs` When turning primitives into line strips this function needs to move attributes around, but this is not needed in other cases. Fixes: `1a5bdca2dd` ("zink: implement flat shading using inlined uniforms") Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22162>	2023-03-31 11:03:48 +00:00
Ian Romanick	71e5530c07	nir/algebraic: Undistribute fsat from fmax To be helpful, the thing inside the fsat has to be used with and without the fsat. Otherwise it just moves a saturate destination modifier around. To not be harmful, the fsat has to only be used by the bcsel. All Broadwell and newer Intel platforms had similar results. (Ice Lake shown) total instructions in shared programs: 20174475 -> 20174449 (<.01%) instructions in affected programs: 3913 -> 3887 (-0.66%) helped: 13 / HURT: 0 total cycles in shared programs: 866844832 -> 866844719 (<.01%) cycles in affected programs: 46037 -> 45924 (-0.25%) helped: 10 / HURT: 1 All Intel platforms had similar results. (Ice Lake shown) Instructions in all programs: 161491468 -> 161491372 (-0.0%) helped: 31 / HURT: 8 Cycles in all programs: 10933090736 -> 10933024716 (-0.0%) helped: 32 / HURT: 18 Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22169>	2023-03-29 23:48:19 +00:00
antonino	2bd72a4101	nir: keep xfb properties in nir_create_passthrough_gs Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21238>	2023-03-29 19:18:40 +00:00
antonino	0b65514775	nir/zink: handle provoking vertex mode in `nir_create_passthrough_gs` Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21238>	2023-03-29 19:18:40 +00:00
antonino	1a5bdca2dd	zink: implement flat shading using inlined uniforms Zink will now handle flat interpolation correctly when line loops are generated from primitives. The flat shading information is passed to the emulation gs using constant uniforms which get inlined. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21238>	2023-03-29 19:18:40 +00:00
antonino	3b5fb8b060	nir: allow to force line strip out in nir_create_passthrough_gs `nir_create_passthrough_gs` now allows the user to force the generated GS to always output a line strip from the primitive regardless of whether edgeflags are present. Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21238>	2023-03-29 19:18:40 +00:00
antonino	24535ffb3d	nir: handle edge flags in nir_create_passthrough_gs `nir_create_passthrough_gs` will now take a boolean argument to decide whether it needs to handle edgeflags. When true is passed it will output a line strip where edges that shouldn't be visible are not emitted. This is useful because geometry shaders will generally throw away edgeflags so for a passthrough GS to act transparently it needs to emulate them. Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21238>	2023-03-29 19:18:40 +00:00
antonino	a0751e8088	nir: calculate number of vertices in nir_create_passthrough_gs `nir_create_passthrough_gs` has been changed to take the type of primitive as opposed to the number of vertices as an argument. Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21238>	2023-03-29 19:18:40 +00:00
antonino	edecb66b01	nir: avoid generating conflicting output variables Because not all vertex outputs can have corresponding fragment inputs (eg. edgeflags) some logic is needed to correctly generate variables in a passthrough gs. Before this change some output variables ended up with the same location. Fixes: `d0342e28b3` ("nir: Add helper to create passthrough GS shader") Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21238>	2023-03-29 19:18:39 +00:00
antonino	ea14579f3d	nir: handle primitives with adjacency `nir_create_passthrough_gs` can now handle primitives with adjacency where some vertices need to be skipped. Fixes: `d0342e28b3` ("nir: Add helper to create passthrough GS shader") Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21238>	2023-03-29 19:18:39 +00:00
Sil Vilerino	0d0221a574	nir: Fix use of alloca() without #include c99_alloca.h Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22150>	2023-03-29 16:56:42 +00:00
Emma Anholt	ceef2b9982	nir/lower_sysvals: Add support for un-lowered tess_level_inner/outer. GLSL has been responsible for doing this, but we can just extract the array index here. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21940>	2023-03-29 16:06:03 +00:00
Timur Kristóf	b688a6d227	nir: Remove IB address and stride intrinsics. RADV used these to emulate firstTask for NV_mesh_shader. They are no longer needed. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22139>	2023-03-29 15:08:55 +00:00
Qiang Yu	bf9c1699cd	nir: add nir_fisnan helper function Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21552>	2023-03-28 19:57:11 +00:00
Qiang Yu	c9d60547ef	nir,radeonsi: add and implement nir_load_alpha_reference_amd Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21552>	2023-03-28 19:57:11 +00:00
Qiang Yu	6848e05f9c	nir: pack_(s|u)norm_2x16 support float16 as input For AMD GPUs, which have an instruction to normalize and pack two float16 inputs; it is used when the fragment shader exports color output. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21552>	2023-03-28 19:57:11 +00:00
Faith Ekstrand	01275a1a95	nir: Drop a bunch of Authors tags This is what git blame is for. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22120>	2023-03-26 00:16:25 +00:00
Konstantin Seurer	200e551cbb	nir/lower_shader_calls: Remat derefs before lowering resumes Closes: #7923 cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20399>	2023-03-24 14:55:37 +00:00
Alyssa Rosenzweig	47ed0b41be	nir: Add Mali load_output taking conversion Mali's LD_TILE instruction (mapping to NIR's load_output) requires a "conversion descriptor" specifying how to convert from the register format to the tilebuffer format. To implement framebuffer fetch on OpenGL without shader variants, we generate these descriptors in the driver and pass them in a uniform. However, to comply with the Ekstrand Rule, we can't have magically materialized system values -- they should come only from the NIR where the driver can lower as it pleases (e.g. PanVK can lower to a constant because it knows the framebuffer format at pipeline create time). Add intrinsics to model this. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20906>	2023-03-23 23:53:45 +00:00
Alyssa Rosenzweig	60bfc4deb9	nir: Add Panfrost intrinsics to lower sample mask We want to lower this in NIR instead of the backend IR to give the driver a chance to lower the "is multisampled?" system value, which makes more sense to do in NIR. This gets rid of one of the magic compiler materialized sysvals. Plus, this will let us constant fold away the lowering in Vulkan when we know that the pipeline is single-sampled / multi-sampled. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20906>	2023-03-23 23:53:45 +00:00
Amber	8da3494d53	freedreno, nir, ir3: implement GL_EXT_shader_framebuffer_fetch Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21260>	2023-03-23 16:59:56 +00:00
Amber	ca92183845	nir: Add memory coherency information to shaders. Signed-off-by: Amber Amber <amber@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21260>	2023-03-23 16:59:56 +00:00
Amber	1462da2a70	nir: allow nir_lower_fb_read to support multiple render targets Signed-off-by: Amber Amber <amber@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21260>	2023-03-23 16:59:56 +00:00
Rhys Perry	e99ba0b6d3	nir/range_analysis: use perform_analysis() in nir_analyze_range() Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21381>	2023-03-22 09:24:18 +00:00
Rhys Perry	2b03db39b3	nir/range_analysis: use perform_analysis() in nir_unsigned_upper_bound() Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21381>	2023-03-22 09:24:18 +00:00
Rhys Perry	29a38b09cf	nir/range_analysis: add helpers for limiting stack usage Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21381>	2023-03-22 09:24:18 +00:00
Rhys Perry	2145cf3dd1	nir/range_analysis: add missing masking of shift amounts Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Fixes: `72ac3f6026` ("nir: add nir_unsigned_upper_bound and nir_addition_might_overflow") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21381>	2023-03-22 09:24:18 +00:00
Alyssa Rosenzweig	2933af7576	nir/builder: Add nir_umod_imm helper Like nir_udiv_imm, we can do a similar power-of-two trick. It's also really convenient. v2: Assert reasonable bounds on the modulus (Faith). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> [v1] Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> [v1] Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22010>	2023-03-22 06:18:18 +00:00
Georg Lehmann	cec04adcee	nir: optimize i2f(f2i(fsign)) Foz-DB Navi10: Totals from 3013 (2.23% of 134906) affected shaders: VGPRs: 138068 -> 136964 (-0.80%); split: -0.80%, +0.00% CodeSize: 10476416 -> 10391800 (-0.81%) MaxWaves: 79118 -> 80088 (+1.23%) Instrs: 1963227 -> 1945003 (-0.93%) Latency: 24734883 -> 24649279 (-0.35%); split: -0.39%, +0.05% InvThroughput: 6366777 -> 6334735 (-0.50%); split: -0.50%, +0.00% VClause: 36845 -> 36882 (+0.10%); split: -0.26%, +0.36% SClause: 59249 -> 59273 (+0.04%); split: -0.25%, +0.29% Copies: 108570 -> 108501 (-0.06%); split: -0.19%, +0.13% PreSGPRs: 105371 -> 105862 (+0.47%) PreVGPRs: 117675 -> 116625 (-0.89%); split: -0.89%, +0.00% Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22003>	2023-03-22 05:34:55 +00:00
Samuel Pitoiset	bb7e0c4280	spirv,nir: add support for SpvBuiltInFullyCoveredEXT Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21497>	2023-03-21 08:44:09 +00:00
Emma Anholt	5873dcb32f	nir/lower_mediump: Fix assertion about copy_deref lowering matching. Copy and paste typo. We shouldn't have copy_derefs during this pass, anyway, but caught a failure with my upcoming unit testing. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21666>	2023-03-21 00:51:24 +00:00
Jesse Natalie	49885f87c3	nir: Propagate alignment when rematerializing cast derefs Fixes: `878a8daca6` ("nir: Add alignment information to cast derefs") Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21975>	2023-03-17 08:16:03 +00:00
Timur Kristóf	022e55557b	nir: Add load_typed_buffer_amd intrinsic. This new intrinsic maps to the MTBUF instruction format on AMD GPUs and represents a typed buffer load in NIR. Also add an unsigned upper bound for the new intrinsic. Code for that ported from aco_instruction_selection_setup. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16805>	2023-03-15 14:54:27 +00:00
Isabella Basso	59fea8af3a	nir/algebraic: remove duplicate bool conversion lowerings While [1] added some boolean conversion lowering patterns, those were already dealt with on [2]. [1] - `b86305bb` ("nir/algebraic: collapse conversion opcodes (many patterns)") [2] - `d7e0d47b` ("nir/algebraic: nir: Add a bunch of b2[if] optimizations") Fixes: `b86305bb` ("nir/algebraic: collapse conversion opcodes (many patterns)") Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Signed-off-by: Isabella Basso <isabellabdoamaral@usp.br> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20965>	2023-03-11 17:21:38 +00:00
Isabella Basso	a553d3cd29	nir/algebraic: make patterns for float conversion lowerings imprecise As noted on [1], lowering patterns of the form floatS -> floatB -> floatS ==> floatS cannot require precision since this may cause flush denorming. [1] `3f779013` ("nir: Add an algebraic optimization for float->double->float") Fixes: `b86305bb` ("nir/algebraic: collapse conversion opcodes (many patterns)") Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Signed-off-by: Isabella Basso <isabellabdoamaral@usp.br> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20965>	2023-03-11 17:21:37 +00:00
Isabella Basso	79c94ef52e	nir/algebraic: extend lowering patterns for conversions on smaller bit sizes Conversions on smaller bit sizes should also be collapsed when composed. This also adds more patterns on the intS -> intB -> floatB ==> intS -> floatB lowering so as to deal with any int size C > B instead of a fixed intB. Closes: #7776 Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Signed-off-by: Isabella Basso <isabellabdoamaral@usp.br> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20965>	2023-03-11 17:21:37 +00:00
Isabella Basso	a27bcd63d0	nir/algebraic: extend mediump patterns Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Suggested-by: Italo Nicola <italonicola@collabora.com> Signed-off-by: Isabella Basso <isabellabdoamaral@usp.br> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20965>	2023-03-11 17:21:37 +00:00
Isabella Basso	b3685f3ba7	nir/algebraic: insert patterns inside optimizations list Some patterns were outside the list of optimizations. Fixes: `b86305bb` ("nir/algebraic: collapse conversion opcodes (many patterns)") Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Signed-off-by: Isabella Basso <isabellabdoamaral@usp.br> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20965>	2023-03-11 17:21:37 +00:00
Alyssa Rosenzweig	2ba48eea88	nir/lower_point_size: Use shader_instructions_pass Sleepy code deletion mood. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21750>	2023-03-11 16:42:36 +00:00
Ian Romanick	0cadc3830f	nir/lower_int64: Optionally lower ufind_msb using uadd_sat v2: Fix inverted condition for applying the optimization. Noticed by Ken. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19042>	2023-03-10 15:27:17 +00:00
Ian Romanick	831f9d3f61	nir/algebraic: Optimize some ifind_msb to ufind_msb On Intel platforms, the uclz lowering of ufind_msb is either one instruction better (Gfx7 and newer) or two instructions better (all older platforms) than the ifind_msb implementations. On platforms that use lower_find_msb_to_reverse, there should be no difference. All Haswell and newer Intel platforms had similar results. (Ice Lake shown) total instructions in shared programs: 19938662 -> 19938634 (<.01%) instructions in affected programs: 850 -> 822 (-3.29%) helped: 2 / HURT: 0 total cycles in shared programs: 858467067 -> 858465538 (<.01%) cycles in affected programs: 10080 -> 8551 (-15.17%) helped: 2 / HURT: 0 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19042>	2023-03-10 15:27:17 +00:00
Ian Romanick	db6d1edc1b	nir: Restrict ufind_msb and ufind_msb_rev to 32- or 64-bit sources `4d802df3aa` loosened the type restrictions on these opcodes to enable support for 64-bit ballot operations. In doing so, it enabled 8-bit and 16-bit sizes as well. It's impossible to get these sizes through GLSL or SPIR-V. None of the lowering in nir_opt_algebraic can handle non-32-bit sizes. Almost no drivers can handle non-32-bit sizes. It doesn't seem possible to enforce anything other than "one bit size" or "all bit sizes" in nir_opcodes.py. The only way it seems possible to enforce this is in nir_validate. This is not ideal, but it be what it be. v2: Remove restriction on find_lsb. It is actually possible to get this via GLSL by doing findLSB() on a lowp value. findMSB declares its parameter as highp, so that path is still impossible. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19042>	2023-03-10 15:27:17 +00:00
Ian Romanick	2d6f48f6ef	nir/algebraic: Do not generate 8- or 16-bit find_msb The next commit will add validation to restrict this instruction (and others) to only 32-bit or 64-bit sources. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19042>	2023-03-10 15:27:17 +00:00
Ian Romanick	2119ab7319	nir/builder: Do not generate 8- or 16-bit find_msb Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19042>	2023-03-10 15:27:17 +00:00
Ian Romanick	28311f9d02	nir: intel/compiler: Move ufind_msb lowering to NIR Fossil-db results: All Intel platforms had similar results. (Ice Lake shown) Cycles in all programs: 9098346105 -> 9098333765 (-0.0%) Cycles helped: 6 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19042>	2023-03-10 15:27:17 +00:00
Ian Romanick	a4052e70ea	nir/algebraic: Only lower ufind_msb with 32-bit sources The 31-ufind_msb_rev(x) lowering only produces the correct result for 32-bit sources. ufind_msb_rev can also have 64-bit sources, and most platforms are expected to lower this to 32-bit instructions with extra logic operations. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19042>	2023-03-10 15:27:17 +00:00
Ian Romanick	0cc7bf63b7	nir: intel/compiler: Move ifind_msb lowering to NIR Unlike ufind_msb, ifind_msb is only defined in NIR for 32-bit values, so no @32 annotation is required. No shader-db or fossil-db changes on any Intel platform. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19042>	2023-03-10 15:27:17 +00:00
Ian Romanick	66840b98e4	nir: ifind_msb_rev can only have int32 sources Just like ifind_msb. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19042>	2023-03-10 15:27:17 +00:00

4273 commits