fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-22 13:08:09 +02:00

Author	SHA1	Message	Date
Ian Romanick	3e04990c68	elk: Increase the size of some structure fields in combine_constants Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details In very large shaders, first_use_ip, last_use_ip, and even (register) nr can overflow 16 bits. Increase the size of these fields. Some structure components are rearranged to promote better packing. Fixes: `2dad1e3abd` ("i965/fs: Add pass to combine immediates.") Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37482>	2025-09-22 20:02:25 +00:00
Ian Romanick	b7e1ac8309	brw: Increase the size of some structure fields in combine_constants In very large shaders, first_use_ip, last_use_ip, and even (register) nr can overflow 16 bits. Increase the size of these fields. used_in_single_block is moved earlier in the structure to promote better packing. Fixes: `2dad1e3abd` ("i965/fs: Add pass to combine immediates.") Closes: #9489 Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Tested-by: Tapani Pälli <tapani.palli@intel.com> Tested-by: @joostruis Tested-by: @Snoucher Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37482>	2025-09-22 20:02:25 +00:00
Calder Young	c5acf58fba	anv: Add support for AV1 film grain sythesis on Xe2+ Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37351>	2025-09-22 14:41:48 +00:00
Calder Young	1e8b96c40c	anv: Advertise only OUTPUT_COINCIDE_BIT for AV1 video decoding Intel HW does not support separate destination and reference output pictures when decoding AV1 video. The only exception is film grain, which the Vulkan spec already includes a caveat for. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37351>	2025-09-22 14:41:48 +00:00
Lucas Fryzek	6e29e13e78	anv: Update viewport/scissor state when count changes Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details We need to ensure that HW viewport and scissor state is updated when just the count is updated. Cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37487>	2025-09-22 13:28:25 +00:00
Caio Oliveira	f65fbb23e2	brw: Fix encoding of 3-src dst in Xe2+ Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Use FD20 macro that will account for the implicit LSB zero value and is already used for sources. For the new macro we need to use the entire bit-range of the field (55-51), so remove the adjustments we used to do prior to encoding and decoding. Fixes assertion in vkpeak (https://github.com/nihui/vkpeak) when running bf16 tests on BMG. And the code now will correctly apply the subreg_nr to the destination, e.g. a mad(32) gets splitted into two pieces, the generation would not fill out the upper-part of the register ``` mad(16) g13<1>BF g10<8,8,1>BF g12<8,8,1>BF g56<1,1,1>F { align1 1H A@5 }; -mad(16) g13<1>BF g10.16<8,8,1>BF g12.16<8,8,1>BF g57<1,1,1>F { align1 2H A@5 }; +mad(16) g13.16<1>BF g10.16<8,8,1>BF g12.16<8,8,1>BF g57<1,1,1>F { align1 2H A@5 }; ``` Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37236>	2025-09-18 18:21:25 +00:00
Yiwei Zhang	951767ce36	intel/ds: update GPU clock to be sequence-scoped when applicable When CPU clock is the same with the authoritative trace clock (normally default to CLOCK_BOOTTIME), perfetto drops the non-monotonic snapshots to ensure validity of the global source clock in the resolution graph. When they are different, the clocks are marked invalid and the rest of the clock syncs will fail during trace processing. There's no central daemon emitting consistent snapshots for synchronization between CPU and GPU clocks on behalf of renderstages and counters producers. The sequence-scoped clock (64 <= ID < 128) is unique per producer + writer pair within the tracing session. So we can use sequence-scoped clock for gpu clock whenever applicable, and fallback to use global clock for dynamic minor allocated >= 192. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37425>	2025-09-18 17:23:42 +00:00
Yiwei Zhang	7a1e952279	intel/ds: minor code clean up Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37425>	2025-09-18 17:23:42 +00:00
Yiwei Zhang	7689aca21f	intel/ds: simplify clock sync emit In short, perfetto doesn't require the initial clock snapshot to be earlier than the timestamp to be converted. So we don't have to do complex handling for it. With this change: - renderstage event requires clock sync, so we'd only emit clock snapshots on the traceq thread that handles the callbacks - drops redundant sync_timestamp calls as well as sync_gpu_ts tracking - no need to reset next_clock_sync_ns when tracing is disabled, since a snapshot is always emitted right after the initial interned data emit upon tracing start Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37425>	2025-09-18 17:23:42 +00:00
Yiwei Zhang	7795669953	intel/ds: VulkanApiEvent doesn't rely on interning data The object name is part of the VkDebugUtilsObjectName event messages. When the trace buffer is full and the ring buffer fill policy is chosen, the debug obj events can be overwritten (lost), which is why we need the RefreshSetDebugUtilsObjectNameEXT. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37425>	2025-09-18 17:23:42 +00:00
Alyssa Rosenzweig	804ced9047	intel: drop legacy flatshade handling Let mesa/st do the keying instead. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37447>	2025-09-18 14:14:11 +00:00
Alyssa Rosenzweig	36bd06ebab	intel: drop clamp_fragment_color handling This is all dead code since we weren't even seting the cap in iris/crocus! Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37447>	2025-09-18 14:14:11 +00:00
Alyssa Rosenzweig	957f326a10	brw: drop printf info plumbing unused since printf hashing. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37447>	2025-09-18 14:14:10 +00:00
Alyssa Rosenzweig	58fd54b56e	anv,hasvk: do not use unify_interfaces it's GLSL cruft we want to get rid of. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37447>	2025-09-18 14:14:10 +00:00
Alyssa Rosenzweig	bbf5bc8632	brw: cleanup int64 option set Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37447>	2025-09-18 14:14:09 +00:00
Alyssa Rosenzweig	168704c2fe	brw: hoist shared options out of the stage loop ideally we'd have no stage switching, but this is just a cleanup for now. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37447>	2025-09-18 14:14:09 +00:00
Alyssa Rosenzweig	0d7083d5bc	brw: drop indirection on compiler options I see no point, we allocate for every shader stage anyway. This is a bit simpler. I'm not a fan of the brw_compiler singleton at all but torching that is not on today's agenda. Flattening it a little bit very much is. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37447>	2025-09-18 14:14:08 +00:00
Alyssa Rosenzweig	2c161cc35d	brw: drop unused brw_kernel code unused since we dropped GRL. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37447>	2025-09-18 14:14:07 +00:00
Paulo Zanoni	25d26a89e3	isl: allow sparse with STC_CCS on DG2 Thanks to Nanley Chery for pointing out this possibility. v2: Make it simpler (Nanley). Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37419>	2025-09-17 21:42:58 +00:00
Paulo Zanoni	7dd66d6bb1	isl: allow sparse with CCS on Xe2 and newer When the auxiliary surface is handled by the hardware directly, there's nothing to bind besides the main pixels, so we can allow sparse without doing anything else. We can't do this in the exact same way with DG2 (which has_flat_ccs) because it uses the aux_state_tracking_buffer. v2: Fix spelling (Nanley). Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37419>	2025-09-17 21:42:58 +00:00
Paulo Zanoni	e7fd99c205	intel: rework the way sparse forces CCS/MCS/HIZ to be disabled We want to be a little more granular than just "aux surfaces are completely incompatible with sparse!", so have each of isl_surf_get_*_surf disable itself when sparse is used. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37419>	2025-09-17 21:42:58 +00:00
Tapani Pälli	7f63279307	anv: remove assert, group can have 0 shaders in it This seems to be equal assert with `febe90e109` as we hit this when launching Quake II RTX. Fixes: `69b6b4cb28` ("anv: add shader instruction emission") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37429>	2025-09-17 08:54:00 +00:00
Konstantin Seurer	ea51a67996	vulkan/bvh: Enable glsl extensions in meson Having a list of all enabled/used extensions in meson allows us to get rid of a lot of boilerplate in every bvh build shader. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35326>	2025-09-16 20:18:01 +00:00
Zhou Qiankang	b0528bcab1	anv: Use os_get_page_size for mmap offset alignment to work with page size other than 4K Instead of hardcoding 4096-byte page size in bo mapping/unmapping logic, use os_get_page_size() to determine the correct alignment for munmap() offset adjustments and address assertions. Signed-off-by: Zhou Qiankang <wszqkzqk@qq.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37389>	2025-09-16 10:25:56 +00:00
Georg Lehmann	714a149396	nir: remove unsigned upper bound config All config information is now either in nir->info or nir->options. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37361>	2025-09-16 09:24:04 +00:00
Lionel Landwerlin	a69853ce5e	brw: improve eot_reg computation in register allocate Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `c4c7ff3f8f` ("brw: enable register allocation to deal with multiple EOTs") Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37326>	2025-09-16 07:49:07 +00:00
Lionel Landwerlin	1f86a4ee37	brw: remove unused RT write code With `4fda724fd4` ("brw: Avoid invalid access when compacting out-of-bounds JIP/UIP") this stuff isn't needed anymore. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `fe38fb858c` ("brw: workaround broken indirect RT messages on Gfx11") Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37326>	2025-09-16 07:49:07 +00:00
Valentine Burley	8803388d15	ci: Update to Debian 13 (trixie) Switch containers from Debian 12 (bookworm) to Debian 13 (trixie). Trixie ships LLVM 19 by default, so we no longer need to add LLVM repos to install llvm-19. Notably, trixie also uses Python 3.13. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6994 Signed-off-by: Valentine Burley <valentine.burley@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35853>	2025-09-16 06:16:21 +00:00
Valentine Burley	92623d2447	imgui: Silence build warnings for imgui Avoid treating any warnings as errors in the third-party imgui code, and use Wno-error=stringop-overflow for code in Mesa. Suggested-by: @eric Signed-off-by: Valentine Burley <valentine.burley@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35853>	2025-09-16 06:16:19 +00:00
Francisco Jerez	5c68b351fe	intel/brw: Fix regression in brw_allocate_registers() compiling large shaders with throughput==0. The following Vulkan CTS tests that emit massive shaders were regressing after "intel/brw/xe3+: Select scheduler heuristic with best trade-off between register pressure and latency.": dEQP-VK.graphicsfuzz.cov-nested-loops-set-struct-data-verify-in-function dEQP-VK.graphicsfuzz.cov-dfdx-dfdy-after-nested-loops The reason is that they have so many nested loops that they cause the performance analysis utilization estimates to overflow the 32-bit floating-point variables used to calculate them, which causes our throughput estimate to underflow and equal zero for those shaders, which breaks the logic introduced in brw_allocate_registers() to select the scheduling variant with highest throughput, since none of the scheduling modes tried has better throughput than the initial value equal to zero of "best_perf". Instead use -INFINITY as initial value for "best_perf" so we always select a scheduling mode. This should have been caught by CI but oddly the tests above are showing up as "not run" on my last baseline runs, so this wasn't flagged as a regression for me. v2: Use -INFINITY instead of previous approach that used NaN (Ian). Fixes: `531a34c7dd` ("intel/brw/xe3+: Select scheduler heuristic with best trade-off between register pressure and latency.") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13884 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13885 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (v1) Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37322>	2025-09-15 21:10:47 +00:00
Nanley Chery	7c8e38ac67	anv: Rework locking for sparse binding with TR-TT When sparse binding functions submit batches, they may modify the exec_obj_index field of anv_bo structs. This field is used to ensure a unique list of buffers is sent to the kernel (i915). Add a lock in these functions to prevent multiple threads from modifying this field during the batch submission process. To avoid creating a deadlock, also rework the locking done in anv_queue_submit(). When playing the Monster Hunter Wilds Benchmark on a mesa build which enables slab allocation of batch buffers (`6f7a32ec92`), this avoids a sporadic assert failure: nsterHunterWilds.exe: ../../src/intel/vulkan/i915/anv_batch_chain.c:489: setup_execbuf_for_cmd_buffers: Assertion `execbuf->bos[idx] == first_batch_bo_real' failed. This issue was seemingly first introduced in `04bfe828db` ("anv/sparse: allow sparse resouces to use TR-TT as its backend") Backport-to: 25.2 Ref: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12582 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37307>	2025-09-15 17:45:15 +00:00
Nanley Chery	27167fdcb5	anv,hasvk: Take trace submission ID out of lock The Vulkan spec requires that access to the queue parameter be externally synchronized for vkQueueSubmit(). So, each submit call to a specific queue will have a unique ID. Backport-to: 25.2 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37307>	2025-09-15 17:45:15 +00:00
Dylan Baker	7b337e214d	anv: remove dead code This code cannot be reached, since we already checked for `!valid_samples` and returned `VK_ERROR_FEATURE_NOT_PRESET` in that case above, and have not altered `valid_samples` since. Fixes: `d5da6980d3` ("anv/sparse: don't support depth/stencil with sparse") CID: 1662063 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37341>	2025-09-12 23:20:35 +00:00
Sushma Venkatesh Reddy	5f10c1a8fb	intel/compiler: generalize workaround script name for broader applicability Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Renamed brw_nir_trig_workarounds.py to brw_nir_workarounds.py to reflect its expanded scope beyond just trignometric workarounds. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36990>	2025-09-12 22:32:46 +00:00
Sushma Venkatesh Reddy	fe1d84e083	intel/compiler: apply sqrt workaround for Horizon Forbidden West shader Added a workaround for a known shader in Horizon Forbidden West that causes visual corruption on Intel anv driver. The fix clamps fsqrt inputs using fmax(x, 1e-12) to avoid invalid values. Integrated the workaround via brw_nir_apply_sqrt_workarounds() and applied it conditionally in the Vulkan pipeline based on the shader's BLAKE3 hash. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12555 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36990>	2025-09-12 22:32:46 +00:00
Georg Lehmann	79d02047b8	intel: switch to new subgroup size info Reviewed-by: Iván Briano <ivan.briano@intel.com> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37258>	2025-09-12 21:05:17 +00:00
Georg Lehmann	95c2a65662	nir: remove unused shader_info param in nir_create_shader Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37258>	2025-09-12 21:05:17 +00:00
Caio Oliveira	c358842c1d	brw: Don't use individual rallocs for each instruction Move from a single ralloc allocation per instruction to contiguous blocks of allocations. Still use ralloc for those large blocks. Each ralloc allocation has at least 5 pointers of overhead, which would be about a third of the current brw_inst, and get worse as we try to pack brw_inst better. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36730>	2025-09-12 00:25:05 +00:00
Caio Oliveira	2506540566	brw: Repack brw_inst fields In Release build, goes from 72 to 64 bytes, and now fits in a single cacheline. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36730>	2025-09-12 00:25:05 +00:00
Caio Oliveira	8ded571ef4	brw: Allocate only brw_inst for BASE instructions Now that all the other kinds were added, all transforms to SEND will come from non-BASE kinds, so we don't need overallocate for BASE instructions. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36730>	2025-09-12 00:25:05 +00:00
Caio Oliveira	08c0f33874	brw: Add a generic LOGICAL instruction kind This kind of instruction doesn't have a special struct but will still be always allocated so that it can be lowered to SEND. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36730>	2025-09-12 00:25:05 +00:00
Caio Oliveira	df2b5fb03f	brw: Add brw_fb_write_inst Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36730>	2025-09-12 00:25:04 +00:00
Caio Oliveira	d06c0a370e	brw: Add brw_urb_inst Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36730>	2025-09-12 00:25:04 +00:00
Caio Oliveira	90967e7b16	brw: Add brw_load_payload_inst Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36730>	2025-09-12 00:25:03 +00:00
Caio Oliveira	388bac06ce	brw: Add brw_dpas_inst Fixed the types in brw_inst::bits so the struct is packed correctly. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36730>	2025-09-12 00:25:03 +00:00
Caio Oliveira	09a26526cc	brw: Add brw_mem_inst Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36730>	2025-09-12 00:25:02 +00:00
Caio Oliveira	f0f1e63f99	brw: Add brw_tex_inst Incorporate some "control sources" directly into the instruction. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36730>	2025-09-12 00:25:02 +00:00
Caio Oliveira	0fcce2722f	brw: Add brw_send_inst Move all the SEND specific fields from brw_inst into brw_send_inst. This new instruction kind will contain all variants of SENDs plus the virtual opcodes that were already relying on those SEND fields. Use the `as_send()` helper to go from a brw_inst into the brw_send_inst when applicable. Some of the code was changed to use the brw_send_inst type directly. Until other kinds are added, all the instructions are allocated the same amount of space as brw_send_inst. This ensures that all brw_transform_inst() calls are still valid. This will change after a few patches so that BASE instructions can use less memory. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36730>	2025-09-12 00:25:01 +00:00
Caio Oliveira	b27f6621ae	brw: Add initial support for different instruction kinds Prepare code for supporting subclasses of brw_inst for certain specialized kinds of instructions. This will allow - Move certain fields from brw_inst to the specialized one, reducing its size and making it easy to understand what applies to which instruction; - Move certain control sources into the specialized inst type, which currently take a full brw_reg to encode small integers. Reducing the overall sources we walk and care also might help the code in general. Next commits will add the new instruction kinds. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36730>	2025-09-12 00:25:01 +00:00
Caio Oliveira	339a4e8680	brw: Remove the extra function call when lowering samplers Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36730>	2025-09-12 00:25:00 +00:00

... 10 11 12 13 14 ...

15202 commits