fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-23 22:00:13 +01:00

Author	SHA1	Message	Date
Caio Oliveira	ace5daabbd	intel/compiler: Use -Werror=vla Acked-by: Dylan Baker <dylan.c.baker@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32965>	2025-02-11 11:25:48 +00:00
liuqiang	c317778c67	intel/brw: Remove redundant condition in components_read() DATA1 will be handled by the case reached in the fallthrough. Signed-off-by: liuqiang <liuqiang@kylinos.cn> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31782>	2025-02-11 10:33:42 +00:00
Caio Oliveira	ff44f4d278	intel/brw: Update outdated comments Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32536>	2025-02-11 09:13:28 +00:00
Caio Oliveira	5c55b29d1a	intel/brw: Rename a few remaining functions to remove fs prefix Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32536>	2025-02-11 09:13:28 +00:00
Caio Oliveira	c83ddaaa26	intel/brw: Rename fs_copy_prop_dataflow to brw_copy_prop_dataflow Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32536>	2025-02-11 09:13:28 +00:00
Caio Oliveira	cf3bb77224	intel/brw: Rename fs_visitor to brw_shader Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32536>	2025-02-11 09:13:28 +00:00
Caio Oliveira	352a63122f	intel/brw: Rename files brw_fs.cpp/h to brw_shader.cpp/h Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32536>	2025-02-11 09:13:28 +00:00
Caio Oliveira	6b471e4e26	intel/brw: Merge brw_fs_visitor.cpp into brw_fs.cpp Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32536>	2025-02-11 09:13:28 +00:00
Caio Oliveira	f8a979466b	intel/brw: Rename and move thread_payload types to own header Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32536>	2025-02-11 09:13:28 +00:00
Ian Romanick	1d485cc84f	brw/copy: Allow constant propagation of some 64-bit integers ADD, ASR, SHL, and SHR can mix D or UD sources with Q or UQ sources on Gfx20. If the constant will fit in 32-bits, the type is changed so the propagation can occur. No shader-db changes on any Intel platform. No fossil-db changes on any Intel platform other than Lunar Lake. Lunar Lake Totals: Instrs: 210778940 -> 209472782 (-0.62%); split: -0.63%, +0.01% Subgroup size: 14226752 -> 14227232 (+0.00%) Cycle count: 30614834794 -> 30573250444 (-0.14%); split: -0.26%, +0.12% Spill count: 507788 -> 504153 (-0.72%); split: -1.17%, +0.45% Fill count: 622824 -> 613848 (-1.44%); split: -1.96%, +0.52% Scratch Memory Size: 35826688 -> 35309568 (-1.44%); split: -1.67%, +0.23% Max live registers: 65506213 -> 65434861 (-0.11%) Totals from 126699 (17.93% of 706470) affected shaders: Instrs: 63615321 -> 62309163 (-2.05%); split: -2.09%, +0.04% Subgroup size: 2618160 -> 2618640 (+0.02%) Cycle count: 3141888676 -> 3100304326 (-1.32%); split: -2.52%, +1.19% Spill count: 454315 -> 450680 (-0.80%); split: -1.31%, +0.51% Fill count: 533584 -> 524608 (-1.68%); split: -2.29%, +0.61% Scratch Memory Size: 32182272 -> 31665152 (-1.61%); split: -1.86%, +0.26% Max live registers: 14773917 -> 14702565 (-0.48%) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33049>	2025-02-11 08:44:33 +00:00
Ian Romanick	6d594196a6	brw/copy: Use extract_imm in try_constant_propagate_value This is just a small refactor. Originally there was an extra commit on top of this. That commit didn't help generated code quality, so it was dropped. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33049>	2025-02-11 08:44:33 +00:00
Ian Romanick	ac4b93571c	brw/copy: Fix handling of offset in extract_imm The offset is measured in bytes. Some of the code here acted as though it were measured in src.type units. Also modify the assertion to check that all extracted bits come from data in the immediate value. Fixes: `580e1c592d` ("intel/brw: Introduce a new SSA-based copy propagation pass") Fixes: `da395e6985` ("intel/brw: Fix extract_imm for subregion reads of 64-bit immediates") Yes, I missed this error twice in code review. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33049>	2025-02-11 08:44:33 +00:00
Kenneth Graunke	d06c3e21ac	brw: Drop unnecessary mlen/header_size on virtual GET_BUFFER_SIZE op The logical send lowering code sets these, and is the code which -should- set these. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33297>	2025-02-08 01:07:22 +00:00
Kenneth Graunke	37a6278c9f	brw: Drop INTERPOLATE_AT mlen handling from size_read() FS_OPCODE_INTERPOLATE_AT_{SAMPLE,SHARED_OFFSET} never have a mlen set. They are lowered to SHADER_OPCODE_SEND in logical send lowering, at which point they acquire an mlen, but cease to be those opcodes. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33297>	2025-02-08 01:07:22 +00:00
Kenneth Graunke	ae60338142	brw: Lower MEMORY_FENCE and INTERLOCK in lower_logical_sends We teach lower_logical_sends to lower these to SHADER_OPCODE_SEND and drop all the corresponding generator and eu_emit code. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33297>	2025-02-08 01:07:22 +00:00
Kenneth Graunke	7b4e31b243	brw: Add latencies for HDC/RC memory fences We're about to start lowering these in the IR, at which point the scheduler will see SEND instructions with fence messages. Previously, we handled those in the generator, and didn't handle the virtual opcodes here, letting them fall through to the default case of 14 cycles. These new numbers are completely fabricated, matching the times we have for atomic operations. This is basically what we did for LSC atomics. While it may not be accurate, it's at least better than 14 cycles. Acked-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33297>	2025-02-08 01:07:22 +00:00
Kenneth Graunke	b9de19f917	brw: Eliminate the BTI source from MEMORY_FENCE/INTERLOCK opcodes Memory fences do not refer to an element of a binding table. Rather, the reason we had "BTI" in these opcodes was to distinguish what in modern terms are called UGM (untyped memory data cache) vs. SLM (cross-thread shared local memory) fences. Icelake and older platforms used the "data cache" SFID for both purposes, distinguishing them by having a special binding table index, 254, meaning "this is actually SLM access". This is where the notion that fences had BTIs came in. (In fact, prior to Icelake, separate SLM fences were not a thing, so BTI wasn't used there either.) To avoid confusion about BTI being involved, we choose a simpler lie: we have Icelake SLM fences target GFX12_SFID_SLM (like modern platforms would), even though it didn't really exist back then. Later lowering code sets it back to the correct Data Cache SFID with magic SLM binding table index. This eliminates BTI everywhere and an unnecessary source. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33297>	2025-02-08 01:07:22 +00:00
Kenneth Graunke	43d0ac9eb4	brw: Change destination of memory fences to UD type For some reason, we were using UW type for the destination of memory fences at the generator level, while in the IR we selected UD. There are some comments in the documentation for the message about it writing the notification register to the destination, which is 32-bit. Prior to Xe2, bits 31:16 were Reserved/MBZ. But on Xe2, all 32 bits are populated with actual data. I don't know whether this will fix anything in practice, but it seems like a better plan to use UD. Often we used UW types to avoid having the destination region of sends span too many registers, but we're in SIMD1 here, so it shouldn't matter. Acked-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33297>	2025-02-08 01:07:22 +00:00
Kenneth Graunke	c0a32af125	brw: Use correct builder size for MEMORY_FENCE/INTERLOCK virtual opcodes brw_memory_fence() overrides the instructions generated by the MEMORY_FENCE or INTERLOCK opcodes to be force_writemask_all with exec_size == 1. But the IR was emitting it in SIMD8 (regardless of dispatch width). Instead, just emit the IR as SIMD1/NoMask so the IR matches what we actually generate. Have size_written indicate that the entire destination is written, however, as it is ultimately going to be a SEND that writes a whole register. We were also using a UD register for the source of FS_OPCODE_SCHEDULING_FENCE when the generator overrides it to UW, so just specify UW in the IR as well so that they line up. Also add validation for MEMORY_FENCE/INTERLOCK that we've done the exec_size and masking right in the IR. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33297>	2025-02-08 01:07:22 +00:00
Kenneth Graunke	accef5e8f5	brw: Replace fs_inst::target field with logical FB read/write sources We can just specify this as a source to the logical FB read/write opcodes. Notably FB reads had no sources before; now they have one. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33297>	2025-02-08 01:07:22 +00:00
Kenneth Graunke	32dd722ff3	brw: Replace fs_inst::last_rt with a logical control source Rather than using a bit in the generic fs_inst data structure, we can simply set a source on our logical FB write messages. (We already do so for many other cases.) In the repclear shader, setting this wasn't actually having an effect, as we were setting it on a SHADER_OPCODE_SEND message which ignored it. (We had already correctly set the bit in the message descriptor.) Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33297>	2025-02-08 01:07:22 +00:00
Kenneth Graunke	fce01b8461	brw: Drop FB_WRITE_LOGICAL_SRC_DST_DEPTH source This was used for legacy depth passthrough on older hardware. Gfx9+ doesn't actually have dst depth as part of the message, which is the only hardware brw supports these days. It sure looks like we were setting it though... Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33297>	2025-02-08 01:07:22 +00:00
Kenneth Graunke	7390d6189c	brw: Replace fs_inst::pi_noperspective with a logical control source We already have logical pixel interpolator messages that get lowered to send messages. We can just add an extra boolean source to those opcodes rather than sticking a opcode-specific boolean in the generic fs_inst data structure. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33297>	2025-02-08 01:07:22 +00:00
Kenneth Graunke	168ac07ffd	brw: Eliminate fs_inst::shadow_compare brw_lower_logical_sends can just check for the TEX_LOGICAL_SRC_SHADOW_C source; we don't need a generic instruction bit for this. We used to have one because this was handled in the generator for older hardware before the advent of logical opcode lowering. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33297>	2025-02-08 01:07:22 +00:00
Kenneth Graunke	df836ee895	brw: Drop unused defines Nothing uses these. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33297>	2025-02-08 01:07:22 +00:00
Caio Oliveira	b50c925bd6	intel/brw: Fold simple_allocator into the shader This was originally turned into a separate struct for reuse between vec4 and fs backends, that's not needed anymore. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33334>	2025-02-06 08:33:03 -08:00
Caio Oliveira	f82bcd56fc	intel/brw: Add functions to allocate VGRF space Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33334>	2025-02-06 08:33:03 -08:00
Caio Oliveira	5c717e68ce	intel/brw: Pass fs_visitor around instead of the simple_allocator In preparation for getting rid of the simple_allocator. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33334>	2025-02-06 08:33:03 -08:00
Caio Oliveira	75b77382b8	intel/brw: Remove offsets and total_size from VGRF allocator Information was used for vec4 backend, not used here anymore. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33334>	2025-02-06 08:33:03 -08:00
Caio Oliveira	ea87bab4ce	intel/brw: Remove 'using namespace brw' directives Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33418>	2025-02-06 07:58:55 -08:00
Caio Oliveira	1ade9a05d8	intel/brw: Use brw prefix instead of namespace for analysis implementations Also drop the 'fs' prefix when applicable. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33048>	2025-02-05 21:47:07 +00:00
Caio Oliveira	2b92eb0b2c	intel/brw: Use brw prefix instead of namespace for dep analysis enum Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33048>	2025-02-05 21:47:07 +00:00
Caio Oliveira	e2f354587d	intel/brw: Merge brw_ir_analysis.h into brw_analysis.h Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33048>	2025-02-05 21:47:07 +00:00
Caio Oliveira	c943fb0c20	intel/brw: Move analysis passes without own file to brw_analysis.cpp Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33048>	2025-02-05 21:47:06 +00:00
Caio Oliveira	0f7eb96af8	intel/brw: Move idom_tree declaration to brw_analysis.h Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33048>	2025-02-05 21:47:06 +00:00
Caio Oliveira	0ebb75743d	intel/brw: Use brw_analysis prefix for performance analysis files Move declaration to the common header and rename definition file. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33048>	2025-02-05 21:47:06 +00:00
Caio Oliveira	6a23749332	intel/brw: Use brw_analysis prefix for def analysis file Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33048>	2025-02-05 21:47:06 +00:00
Caio Oliveira	e0614e8ea1	intel/brw: Use brw_analysis prefix for liveness analysis files Move declaration to the common header and rename definition file. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33048>	2025-02-05 21:47:06 +00:00
Caio Oliveira	e5369540ea	intel/brw: Add brw_analysis.h Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33048>	2025-02-05 21:47:06 +00:00
Alyssa Rosenzweig	bf48eae1f9	nir: drop printf_base_identifier superseded. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33380>	2025-02-05 20:33:15 +00:00
Alyssa Rosenzweig	41eabbadfa	intel: port to u_printf context + singleton this is required with vtn_bindgen2. fixes printf there. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33380>	2025-02-05 20:33:15 +00:00
Alyssa Rosenzweig	9429d001b9	intel/nir_lower_printf: modernize nir Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33380>	2025-02-05 20:33:15 +00:00
Alyssa Rosenzweig	03ff5b2c03	intel: drop nir_lower_printf calls this is now handled in vtn_bindgen2 for vtn path code. this does drop support from printf from GRL but that seems appropriate at this point. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33380>	2025-02-05 20:33:15 +00:00
Lionel Landwerlin	5c17299084	brw: enable A64 pulling of push constants This will be useful for pulling constants in device bound shaders. A64 allows us to put the constants anywhere. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32895>	2025-02-05 09:56:04 +00:00
Lionel Landwerlin	0808125914	brw/anv: rework push constants for mesh/task shaders Now using the same model as the compute shader. As a result we temporarily disable the use of the Inline register for providing push constants on Task & Mesh shaders. Since that register is also available on the compute shader we'll try to find a way to use the same mechanism for all 3 shaders in another MR and bring back that optimization. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32895>	2025-02-05 09:56:04 +00:00
Lionel Landwerlin	c08b437db7	brw: fixup scoreboarding for find_live_channels Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32895>	2025-02-05 09:56:03 +00:00
Caio Oliveira	92085e7bab	intel/brw: Remove 'fs' prefix from brw_from_nir functions Reviewed-by: Dylan Baker <dylan.c.baker@intel.com> Reviewed-by: Dylan Baker <None> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33330>	2025-02-03 23:08:11 +00:00
Caio Oliveira	1332d84500	intel/brw: Rename file brw_fs_nir.cpp to brw_from_nir.cpp Reviewed-by: Dylan Baker <dylan.c.baker@intel.com> Reviewed-by: Dylan Baker <None> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33330>	2025-02-03 23:08:11 +00:00
Lionel Landwerlin	41aa22a6b5	intel_clc: remove NIR output support Now unused Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33329>	2025-02-01 07:54:37 +00:00
Caio Oliveira	5ca23eff0b	intel/brw: Remove brw_gs_compile struct There were 4 fields: - key: now will be passed explicitly, so we can reuse the existing more general fs_visitor constructor; - input_vue_map: used only by the client code brw_compile_gs, so create it separatedly as a local variable; - two unsigned parameters: just put them inside a nested struct in the shader. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33228>	2025-02-01 02:44:29 +00:00

... 5 6 7 8 9 ...

4403 commits