fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-22 09:10:11 +01:00

Author	SHA1	Message	Date
Kenneth Graunke	1c1e79d75a	intel/brw: Copy the smaller payload in fixup_sends_duplicate_payload Sometimes one source can be a larger register than the other, especially since opt_register_coalesce can sometimes coalesce those sources into larger registers. Copy the smaller of mlen and ex_mlen. It's less copying. shader-db and fossil-db on Alchemist show 47 shaders affected with small 1-2 instruction improvements each, and no regressions. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27876>	2024-03-05 11:39:26 +00:00
Kenneth Graunke	91252c98a8	intel/brw: Add assertions that EOT messages live in g112+ The validator already catches this, but asserting here makes it easier to catch the problem earlier in a debugger. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27876>	2024-03-05 11:39:26 +00:00
Kenneth Graunke	f6ac6c94a9	intel/brw: Handle SHADER_OPCODE_SEND without src[3] in copy prop We construct some SENDs with only 3 sources (such as FB writes). This code could read out of bounds. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27876>	2024-03-05 11:39:26 +00:00
Kenneth Graunke	49606ab067	intel/brw: Avoid copy propagating any fixed registers into EOTs We were handling FIXED_GRF, but we probably also ought to handle ATTR (pushed inputs) and UNIFORM (pushed constants). Just check if file isn't VGRF to handle everything. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27876>	2024-03-05 11:39:26 +00:00
Kenneth Graunke	97bf3d3b2d	intel/brw: Replace CS_OPCODE_CS_TERMINATE with SHADER_OPCODE_SEND There's no need for special handling here, it's just a send message with a trivial g0 header and descriptor. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27924>	2024-03-05 11:16:20 +00:00
Kenneth Graunke	63d2aa4eb6	intel/brw: Mark FIND[_LAST]_LIVE_CHANNEL as not writing the flag brw_lower_find_live_channel doesn't actually write a flag register, but elk_find_live_channel notes that the flag was used on Gfx7. This allows more CSE on FIND[_LAST]_LIVE_CHANNEL. shader-db and fossil-db on Alchemist show minor reductions in cycles and instruction count, a few minor increases, but it doesn't seem to be a large effect in either direction. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27862>	2024-03-01 17:18:30 -08:00
Caio Oliveira	337641cfcc	intel/compiler: Fix SIMD lowering when instruction needs a larger SIMD lower_simd_width() can encounter an instruction that needs a larger SIMD; for example, SHADER_OPCODE_TXS_LOGICAL on Gfx4 needs at least SIMD16. In this case the builder needs to be at least as large as max_width, otherwise the group() setup will assert. It turns out this did not assert before "by accident", since it relied on the default fs_visitor builder, which had a dispatch width of 64, a bogus placeholder value that was not expected to be used. However, when we changed the code to remove that builder (and the bogus value), we created a new builder in the pass with the shader dispatch_width, which works fine except in the case where we want to "lower" the SIMD above the shader dispatch width. The fix is to also consider the already calculated max_width when creating the builder. Fixes: `5b8ec015f2` ("intel/compiler: Don't use fs_visitor::bld in remaining places") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10338 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27782>	2024-03-01 22:54:57 +00:00
Kenneth Graunke	ad37622a8f	intel/brw: Delete legacy texture opcodes We first generate the logical opcodes, and these days fully lower to SHADER_OPCODE_SEND. In the past, we lowered to a non-logical variant and handled that in the generator. These days, we were just using the non-logical opcodes as an awkward intermediate opcode change during the lowering...which isn't really necessary at all. This patch eliminates them by using the original logical opcodes. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27908>	2024-03-01 22:19:51 +00:00
Kenneth Graunke	19248f48eb	intel/brw: Allow CSE on TXF_CMS_W_GFX12_LOGICAL This was missed when adding the new XeHP variant of the opcode. Fixes: `261dd6c8` ("intel/compiler: Add new variant for TXF_CMS_W") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27908>	2024-03-01 22:19:51 +00:00
Kenneth Graunke	45a5e4c0c4	intel/brw: Delete SHADER_OPCODE_TXF_UMS Nothing seems to generate this anymore. I guess we always use CMS. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27908>	2024-03-01 22:19:51 +00:00
Kenneth Graunke	601ef12467	intel/brw: Delete SHADER_OPCODE_TXF_CMS[_LOGICAL] We always use the wide variant (_W) on hardware this compiler supports. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27908>	2024-03-01 22:19:50 +00:00
Kenneth Graunke	494eee1337	intel/brw: Change unit tests to use TEX_LOGICAL instead of TEX We're not really doing any fancy texturing here, just emitting a TEX instruction that writes multiple destination registers. I plan to remove the non-logical TEX instruction in the next commit, so we swap these over to use the logical version instead. It should work just as well for the purposes of the test. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27908>	2024-03-01 22:19:50 +00:00
Caio Oliveira	082735750b	intel/brw: Simplify usage of reg immediate helpers Use fs_reg and don't take the type as argument. In all uses the type passed is the type of the register. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27904>	2024-03-01 17:52:09 +00:00
Caio Oliveira	fb1d871714	intel/brw: Fold backend_reg into fs_reg Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27904>	2024-03-01 17:52:09 +00:00
Rohan Garg	73d98848fa	intel/compiler: Xe2+ can do URB load/store with a byte offset Thanks to Ken for suggesting this URB refactoring change and pointing out that the LSC can operate on the byte offset granularity. This should fix the geometry shader test cases where we have more than 32 vertices since previously we were failing to write the correct control data bits because of incorrect write mask. Shader-db results for Xe2: total instructions in shared programs: 153475 -> 153437 (-0.02%) instructions in affected programs: 1374 -> 1336 (-2.77%) helped: 11 HURT: 0 helped stats (abs) min: 3 max: 5 x̄: 3.45 x̃: 3 helped stats (rel) min: 1.67% max: 4.92% x̄: 3.23% x̃: 2.70% 95% mean confidence interval for instructions value: -3.92 -2.99 95% mean confidence interval for instructions %-change: -4.10% -2.36% Instructions are helped. total loops in shared programs: 140 -> 140 (0.00%) loops in affected programs: 0 -> 0 helped: 0 HURT: 0 total cycles in shared programs: 16002649 -> 16002329 (<.01%) cycles in affected programs: 9174 -> 8854 (-3.49%) helped: 11 HURT: 0 helped stats (abs) min: 22 max: 38 x̄: 29.09 x̃: 32 helped stats (rel) min: 2.62% max: 5.54% x̄: 3.78% x̃: 3.85% 95% mean confidence interval for cycles value: -33.56 -24.62 95% mean confidence interval for cycles %-change: -4.48% -3.08% Cycles are helped. total spills in shared programs: 52 -> 52 (0.00%) spills in affected programs: 0 -> 0 helped: 0 HURT: 0 total fills in shared programs: 94 -> 94 (0.00%) fills in affected programs: 0 -> 0 helped: 0 HURT: 0 total sends in shared programs: 4240 -> 4240 (0.00%) sends in affected programs: 0 -> 0 helped: 0 HURT: 0 LOST: 0 GAINED: 0 Rework: (Sagar) - Adjust offset/indirect offset calculation. - Add shader-db results - Always calculate dword index - Drop changes for indirect writes Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27602>	2024-03-01 16:11:30 +00:00
Caio Oliveira	97759ef139	intel/brw: Remove typedefs from fs_builder Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27866>	2024-02-29 21:14:13 -08:00
Caio Oliveira	0f5f3fddd4	intel/brw: Fold backend_instruction into fs_inst Since we are touching it, change fs_inst to use struct instead of class so its forward declaration is C compatible. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27866>	2024-02-29 21:14:13 -08:00
Caio Oliveira	e5c5a983f7	intel/brw: Move functions from backend_instruction into fs_inst Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27866>	2024-02-29 21:14:13 -08:00
Caio Oliveira	f5a593ade7	intel/brw: Use fs_inst in disasm_annotate() Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27866>	2024-02-29 21:14:13 -08:00
Caio Oliveira	db322554a7	intel/brw: Use fs_inst explicitly in various passes Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27866>	2024-02-29 20:47:48 -08:00
Caio Oliveira	692021cad7	intel/brw: Use fs_inst in cfg_t Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27866>	2024-02-29 20:47:48 -08:00
Caio Oliveira	d5ed82b97c	intel/brw: Hide the definition of cfg_t et al from C code This will make it easier to flatten the IR. We can revert this back later if we need to. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27866>	2024-02-29 20:47:48 -08:00
Caio Oliveira	1f975e7af7	intel/brw: Use C++ for brw_disasm_info.c This code uses cfg_t which we are going to rework a bit as part of flattening the IR types. It is easier if it can see C++ types for now. At the end we can change this back if needed. To avoid casting and be consistent with existing structs, use int for some offset parameters in the functions. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27866>	2024-02-29 20:47:48 -08:00
Caio Oliveira	6e33b39b46	intel/compiler: Remove nir_print_instr hack in disasm_info The compilers (brw and elk) static libraries depend only on idep_nir_headers instead of idep_nir. This was done to increase the parallelism in the build. One side effect is that consumers of the compilers must depend on idep_nir themselves to ensure nir symbols are resolved. Various intel tools don't use NIR directly, so don't depend on it, and only use a few functions of the compiler, that mostly don't depend on linking NIR functions except for the case of nir_print_instr. The current code adds a weak empty function to take its place in case it is not linked. This is sort of a hack because if we change the compiler in ways that use NIR differently, or we use different functions of the compiler in the tools, we will end up having to add other dummy definitions. A better solution here (suggested by Dylan) is to add the idep_nir to the list of dependencies of the compilers idep's. The static libraries of the compilers still don't depend directly on NIR, but any user of idep_compiler_* will get that dependency. Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27865>	2024-02-29 23:08:16 +00:00
Caio Oliveira	1ba5e9432d	intel/meson: Add dependencies for brw and elk Instead of link_with, use meson dependency for the compilers. Will be useful later to propagate some extra dependencies. Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27865>	2024-02-29 23:08:16 +00:00
Caio Oliveira	865ef36609	intel/brw: Remove brw_shader.h Find a better home for its existing content. Some functions are now just static functions at the usage sites. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27861>	2024-02-29 19:28:06 +00:00
Caio Oliveira	d9552fccf2	intel/brw: Remove extra stage_prog_data field in fs_visitor Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27861>	2024-02-29 19:28:06 +00:00
Caio Oliveira	634dff403f	intel/brw: Fold backend_shader into fs_visitor The base class was used when we had vec4, but now we can fold it with its only subclass. Declare fs_visitor now as a struct to be able to forward declare for C code without causing errors due to class/struct being mixed. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27861>	2024-02-29 19:28:05 +00:00
Caio Oliveira	f3e9a5c719	intel/brw: Move dump_* functions into fs_visitor Make them non-virtual and update the parameter to use fs_inst. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27861>	2024-02-29 19:28:05 +00:00
Caio Oliveira	20dfee69c3	intel/brw: Change cfg_t to refer to fs_visitor Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27861>	2024-02-29 19:28:05 +00:00
Caio Oliveira	1e3fbb1afe	intel/brw: Fold fs_instruction_scheduler into instruction_scheduler And use fs_inst instead of backend_instruction. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27861>	2024-02-29 19:28:05 +00:00
Caio Oliveira	559d94cd0d	intel/brw: Use fs_visitor instead of backend_shader in various passes And since we are touching them, rename a couple of passes to follow same name convention as existing ones. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27861>	2024-02-29 19:28:05 +00:00
Kenneth Graunke	f159a7943c	intel/brw: Delete brw_eu_util.c This was just a bunch of helpers for the Gfx4-6 fixed-function shaders. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27872>	2024-02-29 18:00:14 +00:00
Kenneth Graunke	655cb9c61f	intel/brw: Delete some swizzling functions Not needed in the align1 world, apparently. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27872>	2024-02-29 18:00:14 +00:00
Kenneth Graunke	bbcd35141b	intel/brw: Delete unnecessary brw_wm_prog_data fields Register blocks and interp_mode[] were for Gfx4-5. The binding table section doesn't seem to be used anymore, nor does color_outputs_written. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27872>	2024-02-29 18:00:14 +00:00
Kenneth Graunke	0eeeab16a8	intel/brw: Delete brw_wm_prog_key::line_aa Not used on modern hardware. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27872>	2024-02-29 18:00:14 +00:00
Kenneth Graunke	bfb12def74	intel/brw: Delete enum gfx6_gather_sampler_wa Only needed on Gfx6, which we don't support anymore. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27872>	2024-02-29 18:00:14 +00:00
Kenneth Graunke	5fbba530cf	intel/brw: Delete compiler->supports_shader_constants True for all drivers using this compiler. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27872>	2024-02-29 18:00:14 +00:00
Kenneth Graunke	485b2bca17	intel/brw: Delete constant_buffer_0_is_relative This was only for old kernels on old hardware. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27872>	2024-02-29 18:00:14 +00:00
Kenneth Graunke	eebd24680c	intel/brw: Delete SINCOS Only existed on Gfx4-5. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27872>	2024-02-29 18:00:14 +00:00
Kenneth Graunke	292e424162	intel/brw: Delete more unused compression stuff Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27872>	2024-02-29 18:00:14 +00:00
Kenneth Graunke	a18030305c	intel/brw: Delete SIMD4x2 URB opcodes Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27872>	2024-02-29 18:00:14 +00:00
Kenneth Graunke	288b966e3e	intel/brw: Delete legacy SFIDs This involves a little rework in the assembler to treat "math" as an opcode token rather than an SFID. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27872>	2024-02-29 18:00:14 +00:00
Kenneth Graunke	afae5e78ca	intel/brw: Delete more unused defines Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27872>	2024-02-29 18:00:14 +00:00
Kenneth Graunke	3202f3fdbe	intel/brw: Delete enum brw_urb_write_flags This was used by the vec4 backend. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27872>	2024-02-29 18:00:14 +00:00
Lionel Landwerlin	1de44b1951	anv: add pipeline/shader support for descriptor buffers Lowering/layout is pretty much the same as direct descriptors. The caveat is that, since the descriptor buffers are not visible from the binding tables, we can't promote anything to the binding table (except push descriptors). The reason for this is that nothing prevents an application from using both types of descriptors, and because descriptor buffers have visible address + capture replay, we can't merge the 2 types in the same virtual address space location (limited to 4Gb max, limited to 2Gb with binding tables). If we had the guarantee that both are not going to be used at the same time, we could consider a 2Gb VA for descriptor buffers. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22151>	2024-02-29 07:05:06 +00:00
Lionel Landwerlin	99047451c9	intel/fs: add plumbing for embedded samplers We can address samplers from 3 different locations: - binding table - dynamic state base address - bindless sampler base address (only Gfx11+) Here we allow samplers to be addressed from the dynamic state base address with the embedded sampler flag. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22151>	2024-02-29 07:05:06 +00:00
Caio Oliveira	803a1a5ada	intel/brw: Remove automatic_exec_sizes As Ken describes: "This was only used by legacy SF/Clip/FFGS programs." Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27691>	2024-02-28 05:45:39 +00:00
Caio Oliveira	dae59e7078	intel/brw: Remove runtime_check_aads_emit It was used for Gfx4 payload. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27691>	2024-02-28 05:45:39 +00:00
Caio Oliveira	35b07ab035	intel/brw: Use a single register set Different sets were needed for SIMD8/SIMD16 in old Gfx versions, but now we can use a single one regardless of the SIMD size. Suggested by Ken. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27691>	2024-02-28 05:45:39 +00:00
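
The "copy the smaller payload" idea from commit 1c1e79d75a can be illustrated with a minimal C++ sketch. This is not the Mesa implementation: SendPayloads, PayloadToCopy, pick_payload_to_copy and copy_cost are hypothetical names, and the sketch assumes the pass must copy one of a SEND's two payloads (lengths mlen and ex_mlen, counted in registers) into a fresh register. The tie-break for equal lengths is also an assumption.

```cpp
#include <cassert>

// Hypothetical stand-in for the two payload lengths of a split SEND,
// counted in registers. Not the real fs_inst layout.
struct SendPayloads {
    unsigned mlen;     // length of the first payload
    unsigned ex_mlen;  // length of the second (extended) payload
};

enum class PayloadToCopy { First, Second };

// Assumption: exactly one of the two payloads has to be copied to a new
// register. Choosing the shorter one means fewer copy instructions.
// On a tie the second payload is chosen (arbitrary choice for this sketch).
inline PayloadToCopy pick_payload_to_copy(const SendPayloads &p)
{
    assert(p.mlen > 0 && p.ex_mlen > 0);
    return p.ex_mlen <= p.mlen ? PayloadToCopy::Second
                               : PayloadToCopy::First;
}

// Number of registers that would be copied under the choice above.
inline unsigned copy_cost(const SendPayloads &p)
{
    return pick_payload_to_copy(p) == PayloadToCopy::Second ? p.ex_mlen
                                                            : p.mlen;
}
```

With mlen = 4 and ex_mlen = 1 the sketch copies the one-register second payload; with the lengths swapped it copies the first payload instead, so the cost is always the smaller of the two lengths.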


3297 commits