fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-19 02:48:07 +02:00

Author	SHA1	Message	Date
Sergi Blanch Torne	fce5e77604	New DUT for Alder Lake Introduce a new runner tag from a hidden job for ADL (Alder Lake Intel generation), known as brya. Signed-off-by: Sergi Blanch Torne <sergi.blanch.torne@collabora.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26831>	2024-08-27 12:49:16 +02:00
Kenneth Graunke	437bda3013	intel/brw: Get rid of the lsc_msg_desc_wcmask helper The LOAD/STORE opcodes take a vector size, while the LOAD/STORE_CMASK opcodes take a channel mask. The two are mutually exclusive. So we can just have the lsc_msg_desc() helper take one or the other in the same parameter. This more closely matches the actual descriptor. We couldn't do this until the previous commit, since we were previously relying on the lsc_msg_desc() function to calculate a cmask out of the number of vector components. But now we don't need it to do that. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30632>	2024-08-27 09:25:59 +00:00
Kenneth Graunke	55f193a105	intel/brw: Switch from LSC CMASK opcodes to regular LOAD/STORE The LOAD/STORE opcodes take a vector size (number of components), while the LOAD/STORE_CMASK opcodes take a channel mask. For some reason, we were passing a number of channels to lsc_msg_desc(), then using it to construct a channel mask with all channels enabled, and always using the CMASK message variants. Considering we don't actually want to mask off any channels, we should probably just use the regular LOAD/STORE opcodes, as they're more flexible anyway. One exception is that typed messages on Xe2 apparently only support LOAD_CMASK/STORE_CMASK and not regular LOAD/STORE. So we keep using those there. (Thanks to Sagar Ghuge for catching this!) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30632>	2024-08-27 09:25:58 +00:00
Sviatoslav Peleshko	7e52b67801	anv: Add full subgroups WA for the shaders with barriers in Breaking Limit When barriers are used in invalid shaders with non-uniform control flow we might get a hang. Forcing 32-wide group can help by making it more probable that barrier instruction is executed by at least one channel in each thread, and thus hang will be avoided. This shouldn't affect Xe2+, where active-thread-only barriers are used anyway. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11497 Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30581>	2024-08-27 08:26:08 +00:00
Sviatoslav Peleshko	1904fe1186	anv: Release correct BO in anv_cmd_buffer_set_ray_query_buffer If p_atomic_cmpxchg doesn't set the ray_query_shadow_bos[bucket] to new_bo allocated by this thread, it returns the bucket BO allocated by the other thread and we use it. But due to a mistake, we also release that BO, not the candidate just allocated by this thread and never used again. Fixes: `5d3e4193` ("anv: enable ray queries") Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30581>	2024-08-27 08:26:08 +00:00
Sviatoslav Peleshko	09122e2be0	brw,elk: Fix opening flags on dumping shader binaries Truncation is needed for overwriting correctly in cases when old file is bigger than the one we want to dump (e.g. when the old one was edited inplace). Also, creation permissions are way too broad. Fixes: `4f41c44d` ("intel/compiler: Add variable to dump binaries of all compiled shaders") Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30581>	2024-08-27 08:26:08 +00:00
Sviatoslav Peleshko	442cc7996e	anv: Assert ray query BO actually exists The crash will happen if the client tries to use ray queries without enabling the KHR_ray_query extension. Add an assert to be able to catch this sooner. Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30581>	2024-08-27 08:26:08 +00:00
Nanley Chery	4a8f3181ba	intel: Support any depth fast-clear value on Xe2 Remove the restriction that a depth fast-clear must have a clear value which matches an image-dependent heuristic. Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30767>	2024-08-27 06:15:36 +00:00
Nanley Chery	4a9e45061a	anv: Add and use anv_image_hiz_clear_value() The benchmarks we're tracking tend to prefer clearing depth buffers to 0.0f when the depth buffers are part of images with multiple aspects. Otherwise, they tend to prefer clearing depth buffers to 1.0f. Replace the ANV_HZ_FC_VAL constant with a function which implements this heuristic. Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30767>	2024-08-27 06:15:36 +00:00
Nanley Chery	9fd79dc49e	anv: Pass the VkClearDepthStencilValue for clears Xe2 can easily support fast-clearing depth buffers to multiple clear values. Instead of assuming a hard-coded value in various parts of the driver, pass the clear value down the expected paths. For consistency, also adjust the slow depth clear function to have a matching parameter. Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30767>	2024-08-27 06:15:36 +00:00
Paulo Zanoni	f3c7e14f09	isl: don't assert(num_elements > (1ull << 27)) Some games such as Marvel's Spider-Man Remastered and Assassin's Creed: Valhalla don't work in debug mode because they hit this assertion. In Release mode, they appear to work (although in some platforms there may be visual corruption or GPU hangs). There's nothing we can do about this error (see below), so in this patch we replace the assertion with an error message, because it allows us to (i) test the rest of the game in debug mode so we may catch other issues; and (ii) warn users of release mode that the issue is happening. The unsupported num_elements comes from vkGetDescriptorEXT() and appears to be violating VUID-VkDescriptorGetInfoEXT-type-09427. This function cannot return errors, but we can disable VK_EXT_descriptor_buffer. If we do disable the extension, then vkCreateBufferView() will start triggering the assertion, and we can see that VkBufferViewCreateInfo-range-00930 is being violated. If we change Anv to return errors on these vkCreateBufferView() cases, then the games won't work at all. I reported this to vkd3d-proton, but according to the vkd3d-proton developer Philip Rebohle: "There's also the problematic case of games using typed descriptors but passing non-typed buffer descriptors, which is an extremely common app bug that works on all D3D12 drivers that we need to work around by creating typed views. If that's what's happening here then the best we can do is to just not create the typed view and have the game be broken entirely, or create a smaller view and most likely still completely break the game, but at least that way it wouldn't trigger Vulkan validation. Emulating larger views via multiple smaller views is not possible for us." "Confirmed that it's the app itself creating these views." "D3D12 does not have runtime validation for this or any sort of query for the app, so we really can't do much here." Link: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9963 Link: https://github.com/HansKristian-Work/vkd3d-proton/issues/2071 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30775>	2024-08-27 05:47:50 +00:00
Lionel Landwerlin	6336e0fe7f	anv: order data in wa_bo to leave wa_addr last We want to make sure the workaround_address is the last item in the BO so that we don't have to care about the size of the writes going there, we'll be sure they won't overwrite other items in that BO. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `7b9400b7f7` ("intel/blorp: Don't use clear color conversion on gfx12") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11775 Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30844>	2024-08-27 00:51:03 +00:00
Lionel Landwerlin	d8ec8acede	anv: always use workaround_address, not workaround_bo The workaround BO has some debug information at the beginning. The workaround address is placed after that. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30844>	2024-08-27 00:51:03 +00:00
Nanley Chery	9b98cebe9a	intel: Drop BLORP_BATCH_NO_UPDATE_CLEAR_COLOR All drivers update the clear color themselves. So, drop the functionality from BLORP as well as the flag controlling it. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30824>	2024-08-26 23:57:12 +00:00
Nanley Chery	721d0c3e77	anv,hasvk: Always use BLORP_BATCH_NO_UPDATE_CLEAR_COLOR Store the clear color from within the drivers, rather than from BLORP. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30824>	2024-08-26 23:57:11 +00:00
Nanley Chery	5fd42500cf	anv,hasvk: Add and use set_image_clear_color() We're going to be storing clear colors from the drivers rather than BLORP. Add a function for this purpose. For now, the first use replaces init_fast_clear_color(). One change in behavior is that the clear color initialization is now done without write-checking on gfx12. This actually matches what anv does to other writes to the image's fast-clear tracking state. We can fix this later if and when we address the larger issue. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30824>	2024-08-26 23:57:11 +00:00
Lionel Landwerlin	1f9c40a8d1	anv: explicitly disable BT pool allocations at device init The default state doesn't seem well defined (or kernel driver bug maybe?). Let's just set it to disabled on platforms where we're not using it. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Found-by: Chuansheng Liu <chuansheng.liu@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30841>	2024-08-26 10:34:31 +00:00
Caio Oliveira	31dfb04fd3	intel/brw: Remove long register file names The long names were originally meant to map to the HW encoding but nowadays the actual encoding values depend on gfx version, whether instruction is 3src, etc. Suggested by Ken. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30704>	2024-08-25 22:08:14 +00:00
Caio Oliveira	6bdf2de4d2	intel/brw: Remove unused ARF values and helpers These were used by old Gfx versions. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30704>	2024-08-25 22:08:14 +00:00
Caio Oliveira	72b687abb4	intel/brw: Make BAD_FILE the zero value for brw_reg_file Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30704>	2024-08-25 22:08:14 +00:00
Caio Oliveira	e8f921678a	intel/brw: Explicitly map brw_reg_file into hardware values For now this is a no-op, but will be useful when changing the enum to values that don't match the hardware. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30704>	2024-08-25 22:08:14 +00:00
Caio Oliveira	e7179232c9	intel/brw: Move encoding of Gfx11 3-src inside the inst helpers Create specific helper for register file encoding and handle it there. Use ad-hoc structs to let the macro take optional named arguments. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30704>	2024-08-25 22:08:14 +00:00
Caio Oliveira	d31c8bfb6f	intel/brw: Remove more uses of variable length arrays In these cases there's a clear bound we can use. In C++ this is a compiler extension and not compatible with zero initializing a regular struct -- which will happen in a later change. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30704>	2024-08-25 22:08:14 +00:00
Caio Oliveira	86c20e2910	intel/brw: Use a helper for common VEC pattern In the helper, instead of using the Variable Length Array, use a fixed size array to NIR_MAX_VEC_COMPONENTS. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30704>	2024-08-25 22:08:14 +00:00
Caio Oliveira	abc535a3b4	intel/brw: Remove unused variable Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30704>	2024-08-25 22:08:13 +00:00
Nanley Chery	23658920d1	anv,iris: Skip tex invalidate for clear conversion The hardware's clear color conversion feature requires invalidating the texture cache for every fast clear. We're no longer using the hardware feature, so we longer need the invalidation. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30646>	2024-08-23 15:28:34 +00:00
Nanley Chery	7b9400b7f7	intel/blorp: Don't use clear color conversion on gfx12 Instead of using the clear color conversion feature by the hardware, use software to write out the converted clear color pixel. When testing a patch which moves a state cache invalidate to occur after fast clears instead of before, this prevents the following failures on tgl/zink: * piglit.spec.arb_texture_cube_map_array.arb_texture_cube_map_array-cubemap * piglit.spec.ext_framebuffer_object.fbo-generatemipmap-formats Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30646>	2024-08-23 15:28:34 +00:00
Nanley Chery	b404ca0eb0	intel: Don't use HW clear color conversion on gfx11 The hardware's clear color conversion feature unfortunately requires invalidating the texture cache for every fast clear. To avoid the performance penalty that comes with the invalidation, avoid using the hardware feature and write out the converted clear color pixel ourselves. When testing a patch which moves a state cache invalidate to occur after fast clears instead of before, this prevents the following failures on icl/zink: * piglit.fast_color_clear.fcc-read-after-clear sample tex * piglit.spec.arb_clear_texture.arb_clear_texture-cube Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30646>	2024-08-23 15:28:34 +00:00
Nanley Chery	dfcd93d12f	intel/isl: Fix packing of SINT formats Prevents the next patch from failing many multisampled, signed integer rendering tests. For example: dEQP-VK.renderpass2.suballocation.multisample_resolve.r8_sint.samples_4 Cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30646>	2024-08-23 15:28:34 +00:00
Lionel Landwerlin	778cb59086	anv: optimize STATE_BYTE_STRIDE emission Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30803>	2024-08-23 10:52:19 +00:00
Lionel Landwerlin	195c5b68ba	anv: don't miss workaround for indirect draws Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30803>	2024-08-23 10:52:19 +00:00
Lionel Landwerlin	f25b500af4	anv: move conditional render predicate after gfx_flush_state Following up on `f8c0a99d52` ("anv: emit conditional after gfx state flushing"), this should have been applied everywhere. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `0147908a89` ("anv: predicate emission of STATE_BASE_ADDRESS") Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30803>	2024-08-23 10:52:19 +00:00
Tapani Pälli	5bf6602d23	anv: check if RT writes are happening for HasWriteableRT Fixes: `eebb6cd236` ("anv: stop using 3DSTATE_WM::ForceThreadDispatchEnable") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11749 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30785>	2024-08-23 06:28:00 +00:00
Lionel Landwerlin	a88898a28f	anv: optimize CLIP::MaximumVPIndex setting Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11746 Fixes: `982106e676` ("anv: only set 3DSTATE_CLIP::MaximumVPIndex once") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30762>	2024-08-23 05:45:03 +00:00
Kenneth Graunke	b97e10208c	intel/brw: Add a file parameter to idom_tree::dump() The other dump methods in this file also take a file parameter, defaulting to stderr. Dumping dot files to stdout is probably not what anybody really wanted. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30530>	2024-08-22 22:54:45 +00:00
Kenneth Graunke	bb4f05005e	intel/brw: Print blocks in brw_print_instructions_to_file() Useful when examining the control flow graph. For some reason, we printed this for the final assembly but not the IR. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30530>	2024-08-22 22:54:45 +00:00
Kenneth Graunke	2d73e42333	intel/brw: Fix OOB reads when printing instructions post-reg-alloc Post-register allocation, but before brw_fs_lower_vgrfs_to_fixed_grfs, we have registers with the VGRF file but they are actually fixed GRFs. brw_print_instructions_to_file() was seeing VGRFs and trying to access their size, but using bogus register numbers that could be out-of-bound. Detect when we're post-RA and avoid doing this. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30530>	2024-08-22 22:54:45 +00:00
Lionel Landwerlin	d9406658ed	brw: remove unused prog_data field Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30713>	2024-08-22 19:44:40 +00:00
Lionel Landwerlin	3769b58272	anv: move lowering of descriptor intrinsics to apply_layout Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30713>	2024-08-22 19:44:40 +00:00
Lionel Landwerlin	45117c0ed5	anv: simplify loading driver internal constants Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30713>	2024-08-22 19:44:39 +00:00
Lionel Landwerlin	7a55a930f6	anv: reuse common pipeline state for compute push allocations Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30713>	2024-08-22 19:44:39 +00:00
Eric Engestrom	d7f7aede15	intel/ci: don't trigger anv-jsl-full & anv-tgl-full on GL changes These are pure VK-CTS jobs, they don't run any GL tests. It doesn't matter right now because these two jobs are disabled, but when they get re-enabled, we'll want this to have been fixed. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30677>	2024-08-22 16:24:24 +00:00
Daniel Stone	cc507536db	ci/intel: Move manual/nightly jobs to postmerge stage Create a new stage called intel-postmerge and move the full and manual jobs over there, to avoid entanglement with the pre-merge jobs. Signed-off-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30784>	2024-08-22 15:35:18 +00:00
Daniel Stone	f1aab081b5	ci: Create new 'performance' stage Move all jobs doing performance testing to a separate stage. Signed-off-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30784>	2024-08-22 15:35:18 +00:00
Kenneth Graunke	6a292c2699	intel: Fix bad align_offset on global_constant_uniform_block_intel We were specifying align_offset = 64 and align_mul = 64, which is invalid. nir_combined_align() asserts that align_offset < align_mul. Our intention here is to perform cacheline-aligned (64B-aligned) block loads, so we should set align_mul = 64 and can leave align_offset = 0. Fixes: `fbafa9cabd` ("intel/nir: remove load_global_const_block_intel intrinsic") Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30755>	2024-08-21 20:44:57 +00:00
Ian Romanick	c96ceb50d0	intel/brw/xe2: Allow int64 conversions As far as I can tell from looking at the Bspec, MOV between integers of all sizes appears to be supported. shader-db: total instructions in shared programs: 17480631 -> 17480535 (<.01%) instructions in affected programs: 26284 -> 26188 (-0.37%) helped: 21 / HURT: 13 total cycles in shared programs: 897601907 -> 897664293 (<.01%) cycles in affected programs: 10929664 -> 10992050 (0.57%) helped: 48 / HURT: 45 fossil-db: Totals: Instrs: 140686824 -> 140686155 (-0.00%); split: -0.00%, +0.00% Cycle count: 21525129188 -> 21524717729 (-0.00%); split: -0.01%, +0.00% Spill count: 70778 -> 70776 (-0.00%) Fill count: 139172 -> 139168 (-0.00%) Max live registers: 47513859 -> 47513795 (-0.00%) Totals from 612 (0.11% of 549272) affected shaders: Instrs: 964441 -> 963772 (-0.07%); split: -0.09%, +0.02% Cycle count: 1215564312 -> 1215152853 (-0.03%); split: -0.09%, +0.06% Spill count: 16172 -> 16170 (-0.01%) Fill count: 37962 -> 37958 (-0.01%) Max live registers: 70749 -> 70685 (-0.09%) Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30700>	2024-08-21 20:16:00 +00:00
Ian Romanick	09cf9fe8ab	anv: Larger memory pools for huge shaders At least one ray tracing shader in cp2077 is over 4MB on Xe2. There isn't a memory pool large enough for the allocation, so the driver crashes instead. This commit adds 8MB and 16MB pools. I intend this as a stop gap fix. I would prefer to figure out why this shader is so much larger than on previous platforms. The shader in question has 3824 spills and 8625 fills. That is not good. I suspect dealing with that will also solve the problem, but that will require a bit more time. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11739 Suggested-by: Lionel Landwerlin Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30751>	2024-08-21 19:45:17 +00:00
Ian Romanick	0921dfa044	anv: Protect against OOB access to anv_state_pool::buckets Suggested-by: Paulo Zanoni Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30751>	2024-08-21 19:45:17 +00:00
Rohan Garg	29a2e5358d	anv: enable KHR_shader_relaxed_extended_instruction Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30726>	2024-08-21 14:13:46 +00:00
Francisco Jerez	71ca8529c5	intel/brw/gfx12.5+: Fix IR of sub-dword atomic LSC operations. We were currently emitting logical atomic instructions with a packed destination region for sub-dword LSC atomics, along the lines of: > untyped_atomic_logical(32) dst<1>:HF, ... However, these instructions use an LSC data size D16U32, which means that the 16b data on the return payload is expanded to 32b by the LSC shared function, so we were lying to the compiler about the location of the individual channels on the return payload, its execution masking, etc. This is why the hacks that manually set the 'inst->size_written' of the instruction were required. In some cases this worked, but any non-trivial manipulation of the instruction destination by lowering or optimization passes could have led to corruption, as has been reproduced in deqp-vk during lower_simd_width() for shaders that use 16-bit atomics in SIMD32 dispatch mode. Note that LSC sub-dword reads aren't affected by this because they use raw UD destinations and specify the actual bit size of the operation datatype as the immediate SURFACE_LOGICAL_SRC_IMM_ARG, which doesn't work for atomic operations since that immediate specifies the atomic opcode. Instead, have the logical operation implement the behavior of 16-bit destinations correctly instead of silently replacing the 16-bit region with an inconsistent 32-bit region -- This is done by emitting the MOV instructions used to pack the data from the UD temporary into the packed destination from the lower_logical_sends() pass instead of from the NIR translation pass. Fixes: `43169dbbe5` ("intel/compiler: Support 16 bit float ops") Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30683>	2024-08-21 02:33:12 +00:00

1 2 3 4 5 ...

12611 commits