fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-24 21:50:12 +01:00

Author	SHA1	Message	Date
Chia-I Wu	4f1c43d38e	ac/surface: print tile_swizzle as well swizzle modes that are _X or _T depend on tile_swizzle. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23143>	2023-05-22 20:14:22 +00:00
Chia-I Wu	4f5edcd0ee	amd/drm-shim: add raven2 It differs from raven in interesting ways (e.g., GB_ADDR_CONFIG). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23143>	2023-05-22 20:14:22 +00:00
Erik Faye-Lund	569d035a08	panfrost: expose PIPE_CAP_POLYGON_OFFSET_CLAMP This gives us ARB_polygon_offset_clamp and EXT_polygon_offset_clamp, and most of the actual state plumbing was already in place. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23169>	2023-05-22 20:00:18 +00:00
Alyssa Rosenzweig	8484fdf501	mesa/st: Set pipe_shader_image::single_layer_view Pass it through from the API. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23142>	2023-05-22 16:41:10 +00:00
Alyssa Rosenzweig	a6a3a7a881	gallium: Add pipe_image_view::single_layer_view OpenGL has a goofy feature that allows creating an image view of a single layer of an array texture... in which case that image is treated as non-arrayed in shader. If you have a 16x16x16 3D texture and bind the third layer, you get a 16x16 2D texture instead of a 16x16x1 3D texture. That distinction matters to the hardware on AGX, since the texture dimension needs to match between the shader and the pipe_image_view. If the shader is going to use image2D, we need to know that the pipe_image_view should be treated as 2D (even though the underlying resource is 3D). "But, Alyssa, we already have first_layer and last_layer. Surely you can just check if first_layer == last_layer?" you ask. The problem is that doesn't distinguish a 16x16x1 3D texture (accessed as image3D in the shader) from a 16x16 slice (accessed as image2D in the shader) of a 16x16x16 3D texture. To solve, we add a boolean flag indicating we want to create a view (with a lower dimension than the underlying resource). This provides an unambiguous way to communicate this case to drivers. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23142>	2023-05-22 16:41:10 +00:00
Martin Roukala (né Peres)	17fd50b817	radv/ci: switch to b2c v0.9.10 This brings a fix for the steam decks which may boot too fast sometimes, and have the network adapter not being enumerated by the time it tries to connect to the gateway... Signed-off-by: Martin Roukala (né Peres) <martin.roukala@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23170>	2023-05-22 16:01:52 +00:00
Caio Oliveira	623bc176fb	mesa/spirv: Provide more specific error message for glSpecializeShader() Distinguish between the "entry point not found" and "parsing error" cases in the error text. For consistency, identify the unhandled specialization index case as part of the verification function. The verification function was renamed to make clearer its scope and what module it belongs. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22976>	2023-05-22 15:26:40 +00:00
Alyssa Rosenzweig	eebb9377c4	pan/mdg: Use nir_lower_image_atomics_to_global We were already lowering image atomics to lea_image + global atomic. It's a lot nicer to make that lowering explicit in the NIR. This is much bigger win than in the Bifrost compiler since here lea_image is used only for atomics, and here it wasn't well abstracted in the compiler. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23120>	2023-05-22 14:33:14 +00:00
Alyssa Rosenzweig	47f5cc6ba7	pan/bi: Use nir_lower_image_atomics_to_global We were already lowering image atomics to lea_attr_tex + global atomic, might as well make that lowering explicit in the NIR. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23120>	2023-05-22 14:33:14 +00:00
Alyssa Rosenzweig	1ff7ec0c9e	pan/bi: Fix atomic exchange on Valhall Copypaste fail when switching to unified atomics, missed becuase I don't have any Valhall hardware and Valhall isn't in CI. (Good news, that means it probably didn't affect anyone in the mean time :-p) Fixes crashes with lots of dEQP-GLES31 tests observed under drm-shim. Fixes: `e258083e07` ("pan/bi: Use unified atomics") Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23120>	2023-05-22 14:33:13 +00:00
Alyssa Rosenzweig	de648020af	nir: Add pass to lower image atomics Hardware that lacks dedicated image atomics can still implement image atomics with regular atomics on global memory, as long as there is a way to get the address of a texel in memory. I've open-coded this lowering in my first 2 compilers, so before I add another crappy vendored version in my 3rd, let's add a common NIR pass to do the lowering. Thanks to unified atomics, the pass itself is fairly concise. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23120>	2023-05-22 14:33:13 +00:00
Alyssa Rosenzweig	66656822e3	nir: Add image_texel_address intrinsics Some hardware has an instruction to load the address of a texel in a writeable image, given the coordinates ("LEA_IMAGE"). This operation is defined only for uncompressed images, but it is well-defined regardless of the underlying twiddling. As such, it is not expected to be produced by APIs but is useful for internal lowering when it is known that images will be uncompressed (e.g. because image_store does not support compression on the hardware). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23120>	2023-05-22 14:33:13 +00:00
Alyssa Rosenzweig	c3ea2f8d20	nir: Document extra image source I was scratching my head about this for a few minutes until I found the answer in spirv_to_nir. Hopefully this saves someone else some head scratching in turn. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23120>	2023-05-22 14:33:13 +00:00
David Heidelberg	32b150344e	docs: use meson instead invoking ninja directly This approach is available since meson 0.47.0 which we depend on. Reviewed-by: Sergi Blanch-Torné <sergi.blanch.torne@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23127>	2023-05-22 15:41:40 +02:00
Mike Blumenkrantz	62961b172f	zink: try update fb resource refs when starting new renderpass in the case where a draw is triggered after a flush, zink_update_descriptor_refs will be called to set batch tracking for descriptors. this function also handles refs for fb attachments, and everything is usually fine there the problem with this approach is that tracking is no longer set on view objects at renderpass begin, which makes them susceptible to early deletion if a rp isn't started from a draw call instead, apply batch tracking to fb attachment resources on renderpass begin if the BATCH_CHANGED flag is set (need to rename this at some point) in order to guarantee that the resource (object) lifetime will match the cmdbuf runtime [since imageviews are now only freed upon batch completion] fixes #9059 Fixes: `f6bbd7875a` ("zink: remove batch tracking/usage from view types" Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23132>	2023-05-22 11:15:22 +00:00
Lionel Landwerlin	cab7ba00e2	anv: fix push descriptor deferred surface state packing Yuzu is running into a segfault because it writes the push descriptor twice with 2 different layouts, but without a draw/dispatch in between. First vkCmdPushDescriptorSetKHR() writes descriptor 0 & 1 with a uniform buffer. We toggle the 2 first bits of anv_descriptor_set::generate_surface_states. Second vkCmdPushDescriptorSetKHR() writes descriptor 0 with uniform buffer and descriptor 1 with an image view. The first bit of anv_descriptor_set::generate_surface_states stays, but the second bit was already set before and it should now be off. When we finally flush the push descriptor, we try to generate a surface state for descriptor 1, but there is no valid buffer view for it, we access an invalid pointer and segfault. This fix resets the anv_descriptor_set::generate_surface_states when the descriptor layout changes. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `b49b18f0b7` ("anv: reduce BT emissions & surface state writes with push descriptors") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23156>	2023-05-22 10:50:26 +00:00
David Heidelberg	cc0cf1762d	r300: workaround GCC 12+ warning, declare NULL value as unreachable Solution recommended in the https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109716#c3 Suggested-by: Eric Engestrom <eric@engestrom.ch> Acked-by: Filip Gawin <filip@gawin.net> Reviewed-by: Pavel Ondračka <pavel.ondracka@gmail.com> Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23148>	2023-05-22 12:32:42 +02:00
Iago Toral Quiroga	e401add741	broadcom/compiler: skip jumps in non-uniform if/then when block cost is small We have an optimization for non-uniform if/else where if all channels meet the jump condition we emit a branch to jump straight to the ELSE block. Similarly, if at the end of the THEN block we don't have any channels that would execute the ELSE block, we emit a branch to jump straight to the AFTER block. This optimization has a cost though: we need to emit the condition for the branch and a branch instruction (which also comes with a 3 delay slot), so for very small blocks (just a couple of ALU for example) emitting the branch instruction is typically worse. Futher, if the condition for the branch is not met, we still pay the cost for no benefit at all. Here is an example: nop ; fmul.ifa rf26, 0x3e800000, rf54 xor.pushz -, rf52, 2 ; nop bu.alla 32, r:unif (0x00000000 / 0.000000) nop ; nop nop ; nop nop ; nop xor.pushz -, rf52, 3 ; nop nop ; mov.ifa rf52, 0 nop ; mov.pushz -, rf52 nop ; mov.ifa rf26, 0x3f800000 The bu instruction here is setup to jump over the following 4 instructions (the last 4 instructions in there). To do this, we pay the price of the xor to generate the condition, the bu instruction, and the 3 delay slots right after it, so we end up paying 6 instructions to skip over 4 which we pay always, even if the branch is not taken and we still have to execute those 4 instructions. With this change, we produce: nop ; fmul.ifa rf56, 0x3e800000, rf28 xor.pushz -, rf9, 3 ; nop nop ; mov.ifa rf9, 0 nop ; mov.pushz -, rf9 nop ; mov.ifa rf56, 0x3f800000 Now we don't try to skip the small block, ever. At worse, if all channels would have met the branch condition, we only pay the cost of the 4 instructions instead of 6, at best, if any channel wouldn't take the branch, we save ourselves 5 cycles for the branch condition, the branch instruction and its 3 delay slots. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23161>	2023-05-22 09:23:41 +00:00
Yiwei Zhang	4c8be22c66	radv: fix radv_emit_userdata_vertex for vertex offset -1 -1 is a legit vertex offset upon vkCmdDrawIndexed and other cmds. This change fixes to track last_vertex_offset with an additional valid bit. Cc: mesa-stable Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23157>	2023-05-22 08:31:28 +00:00
Samuel Pitoiset	7cb4494039	radv: enable smoothLines For Zink. This marks one piglit test as expected failure because polygon smoothing can't be implemented properly in Vulkan. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21587>	2023-05-22 07:58:35 +00:00
Samuel Pitoiset	85cbdba355	radv: add support for smooth lines Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21587>	2023-05-22 07:58:35 +00:00
Samuel Pitoiset	8c5eaf2166	radv: lower nir_intrinsic_load_poly_line_smooth_enabled_amd Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21587>	2023-05-22 07:58:35 +00:00
Samuel Pitoiset	9b2e59abc5	radv: declare a new user SGPR for the dynamic line rasterization mode Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21587>	2023-05-22 07:58:35 +00:00
Samuel Pitoiset	fcfdb1bb6c	radv: determine if smooth lines can be used in the pipeline key Really complicated to reduce the scope because everything can be dynamic and with GPL you can't even know if the pipeline draws lines when compiling the fragment shader. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21587>	2023-05-22 07:58:35 +00:00
Samuel Pitoiset	9612603aac	radv: track if the smoothLines features is enabled in the device Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21587>	2023-05-22 07:58:35 +00:00
Samuel Pitoiset	3626c23e85	nir: lower smooth lines conditionally using the new intrinsic RADV will enable/disable this based on a dynamic state. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21587>	2023-05-22 07:58:34 +00:00
Samuel Pitoiset	759a57d902	radeonsi: lower nir_intrinsic_load_poly_line_smooth_enabled_amd Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21587>	2023-05-22 07:58:34 +00:00
Samuel Pitoiset	f023ab01e9	nir: add nir_intrinsic_load_poly_line_smooth_enabled To lower smooth lines conditionally in fragment shaders for RADV because the line rasterization mode in Vulkan can be dynamic. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21587>	2023-05-22 07:58:34 +00:00
Samuel Pitoiset	15bb9c4b96	radv: remove useless check about USAGE_STORAGE for TC-compat HTILE This should never happen. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23122>	2023-05-22 07:37:15 +00:00
Samuel Pitoiset	dda7400c0b	radv: disable IMAGE_USAGE_STORAGE with depth-only and stencil-only formats This shouldn't have been enabled at all. Depth-stencil formats were accidentally disabled but not depth-only or stencil-only formats. This doesn't seem allowed by DX12 and both AMD/NVIDIA don't enable it. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23122>	2023-05-22 07:37:15 +00:00
Samuel Pitoiset	3adc9b6722	radv: bump the global VRS image size to maximum supported FB dimensions Super sampling on a 4K screen could hit this. 16k seems pretty big but this image is only created on RDNA2 and on-demand if VRS attachments are used without depth-stencil attachments, which should be rare enough to care. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23105>	2023-05-22 06:53:03 +00:00
Timothy Arceri	5be8acc1b5	util: add Pixel Game Maker MV workaround Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8918 Cc: mesa-stable Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23095>	2023-05-22 00:45:45 +00:00
David Heidelberg	8e53b293f8	ci/v3dv: add often timeouting ssbo.layout.3_level_array.std140.column_major_mat4 Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23153>	2023-05-21 01:37:23 +02:00
David Heidelberg	4a49892ba3	ci/radv: add another raven flake dEQP-VK.draw.dynamic_rendering.primary_cmd_buff.linear_interpolation Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23152>	2023-05-21 00:51:03 +02:00
Timur Kristóf	b78cf192f0	radv: Clear query dirty flags when flushing them. This is just to make their code consistent with other similar functions. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20659>	2023-05-20 19:33:20 +00:00
Timur Kristóf	59c2711800	radv: Move empty dynamic states check to caller. Improves the CPU overhead of radv_emit_all_graphics_states. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20659>	2023-05-20 19:33:20 +00:00
Timur Kristóf	0d14f7a304	radv: Move indirect check from index buffer emission to caller. This improves the CPU overhead of radv_emit_all_graphics_states. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20659>	2023-05-20 19:33:20 +00:00
Timur Kristóf	8436fe5af4	radv: Slight refactor to late_scissor_emission. There is no need to set context_roll_without_scissor_emitted when pipeline, rbplus state, or binning state changes, because radv_need_late_scissor_emission already checks their dirty flags. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20659>	2023-05-20 19:33:20 +00:00
Timur Kristóf	2249ab1daa	radv: Set last_index_type in radv_before_draw. This function is always inlined so checking info->indexed can be constant folded by the compiler. So it is better to set this in before_draw. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20659>	2023-05-20 19:33:20 +00:00
Timur Kristóf	e5c3479fae	radv: Move ignore forced VRS code to more optimal place. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20659>	2023-05-20 19:33:20 +00:00
Timur Kristóf	4255bd63a4	radv: Compute tess info when emitting patch control points. Some tess info needs to be calculated in the command buffer when dynamic patch control points are enabled. Move this calculation from radv_emit_all_graphics states to where it actually matters. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20659>	2023-05-20 19:33:20 +00:00
Timur Kristóf	94465f3073	radv: Emit primitive reset index with primitive restart enable. The VGT_MULTI_PRIM_IB_RESET_INDX register has no effect when primitive restart is disabled, so we can move this out of the hot path. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20659>	2023-05-20 19:33:20 +00:00
Alyssa Rosenzweig	04bd1f2cda	asahi: Drop Asahi-as-a-swrast hack Now that we've dropped macOS support in the driver, this is all dead code and gets garbage collected. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23138>	2023-05-20 16:59:16 +00:00
Alyssa Rosenzweig	c284a200b9	gallium: Drop Asahi-as-a-swrast hack Now that we've dropped macOS support, these paths are deadcode. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23138>	2023-05-20 16:59:16 +00:00
David Heidelberg	a0b1aa6f00	docs: update crosvm networking options Reviewed-by: Corentin Noël <corentin.noel@collabora.com> Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22892>	2023-05-20 10:33:48 +02:00
David Heidelberg	27c775d2f7	ci/crosvm: update cmdline options ``` [WARN crosvm::crosvm::cmdline] `--host-ip`, `--netmask`, and `--mac` are deprecated; ``` Reviewed-by: Corentin Noël <corentin.noel@collabora.com> Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22892>	2023-05-20 10:33:43 +02:00
Kenneth Graunke	462ef200d8	nir: Assert that we don't shrink bit-sizes in nir_lower_bit_size() The idea of this pass is to promote small bit-sizes to larger, supported bit-sizes for certain operations. It doesn't handle emulating large bit-size operations on smaller bit-sizes; passes like nir_lower_int64 and nir_lower_doubles handle that. So, assert that we aren't shrinking the bit-size, as this will almost certainly produce incorrect results. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23123>	2023-05-19 22:44:37 +00:00
Kenneth Graunke	a2d384a5c0	intel/compiler: Fix 64-bit ufind_msb, find_lsb, and bit_count We only support 32-bit versions of ufind_msb, find_lsb, and bit_count, so we need to lower them via nir_lower_int64. Previously, we were failing to do so on platforms older than Icelake and let those operations fall through to nir_lower_bit_size, which used a callback to determine it should lower them for bit_size != 32. However, that pass only emulates small bit-size operations by promoting them to supported, larger bit-sizes (i.e. 16-bit using 32-bit). It doesn't support emulating larger operations (i.e. 64-bit using 32-bit). So nir_lower_bit_size would just u2u32 the 64-bit source, causing us to flat ignore half of the bits. Commit `78a195f252` (intel/compiler: Postpone most int64 lowering to brw_postprocess_nir) provoked this bug on Icelake and later as well, by moving the nir_lower_int64 handling for ufind_msb until late in compilation, allowing it to reach nir_lower_bit_size which broke it. To fix this, we always set int64 lowering for these opcodes, and also correct the nir_lower_bit_size callback to ignore 64-bit operations. Cc: mesa-stable Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23123>	2023-05-19 22:44:37 +00:00
Kenneth Graunke	9293d8e64b	nir: Add find_lsb lowering to nir_lower_int64. Some GPUs can only handle 32-bit find_lsb. Cc: mesa-stable Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23123>	2023-05-19 22:44:37 +00:00
Jesse Natalie	25c7181f1b	microsoft/compiler: Better and simpler bitcast reduction Using nir_gather_ssa_types works much better. There's 2 differences compared to what I was doing before: 1. Multiple passes to allow data to propagate forward and backward through the whole shader. 2. Allowing a value to have indeterminate types due to having both int and float usages. So this deletes some code and gets better results. Wish I'd known this existed last week. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23062>	2023-05-19 22:19:38 +00:00

... 3 4 5 6 7 ...

171727 commits