fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-19 20:08:06 +02:00

Author	SHA1	Message	Date
Jianxun Zhang	42c3585ea1	isl: Reuse Xe2 modifiers on newer platforms We will reuse LNL and BMG modifiers on newer platforms until new modifiers show up. Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35776>	2025-10-01 14:51:53 -07:00
Caio Oliveira	d16d7ac470	intel/executor: Destroy syncobjs after using them Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37645>	2025-09-30 20:17:01 +00:00
Kenneth Graunke	937fa18bb9	iris/ci: Update trace checksums The difference here was 1-2 pixels. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36750>	2025-09-30 19:44:03 +00:00
Kenneth Graunke	3af4e63061	brw: Skip compilation of larger SIMDs when pressure is too high
This allows us to skip the entire backend compilation process for large SIMD widths when register pressure is high enough that we'd likely decide to prefer a smaller one in the end anyway. The hope is to make the same decisions as before, but with less CPU overhead.
We are making mostly the same decisions as before:
| API / Platform | Total Shaders | Changed | % Identical |
----------------------------------------------------------
| VK / Arc A770  |       905,525 |   1,157 |     99.872% |
| VK / Arc B580  |       788,127 |      53 |     99.993% |
| VK / Panther   |       786,333 |      13 |     99.998% |
| GL / Arc A770  |       308,618 |     269 |     99.913% |
| GL / Arc B580  |       264,066 |      13 |     99.995% |
| GL / Panther   |       273,212 |       0 |    100.000% |
Improves compile times on my i7-12700K:
| Game                      | Arc B580 | Arc A770 |
---------------------------------------------------
| Assassins Creed: Odyssey  |  -13.47% |  -10.98% |
| Borderlands 3 (DX12)      |  -10.05% |  -11.31% |
| Dark Souls 3              |  -21.06% |  -21.08% |
| Oblivion Remastered       |  -11.10% |   -9.82% |
| Phasmophobia              |  -32.73% |  -31.00% |
| Red Dead Redemption 2     |  -20.10% |  -14.38% |
| Total War: Warhammer III  |  -10.11% |  -14.44% |
| Wolfenstein Youngblood    |  -15.91% |  -13.47% |
| Shadow of the Tomb Raider |  -30.23% |  -25.86% |
It seems to have nearly no effect on compile times on Xe3 unfortunately, as only 1,014 shaders in fossil-db even fail SIMD32 compilation in the first place, and we want to let most of the "might succeed" cases through to the backend for throughput analysis.
Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36750>	2025-09-30 19:44:03 +00:00
Kenneth Graunke	248050b6d0	brw: Add a quick NIR-based register pressure estimate pass This tries to calculate an underestimate (lower bound) for the register pressure at various SIMD widths, by counting live values in the NIR shader. This fundamentally won't be accurate, but it can give us an idea of whether it's even worth trying a certain SIMD-width compile. Doing this at the NIR level means we: - Can use SSA structure rather than fuzzy liveness intervals - Can avoid the backend scheduler aggressively trying to hide latency, presenting an overinflated view of the register pressure - Have divergence information on-hand, making it easier to "scale up" - Can skip cloning and optimizing NIR for compute shader SIMD widths Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36750>	2025-09-30 19:44:03 +00:00
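The idea of counting live SSA values to get an optimistic pressure figure per SIMD width can be illustrated with a toy model. This is only a sketch under stated assumptions and not the pass added by the commit: the `Instr` type, the straight-line program, the cost model (a divergent value costs `simd_width // lanes_per_reg` registers, a convergent value costs one) and the `budget` threshold are all hypothetical.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class Instr:
    dest: str               # SSA value defined by this instruction
    srcs: Tuple[str, ...]   # SSA values read by this instruction
    divergent: bool         # True if the result may differ per lane


def estimate_pressure(instrs, simd_width, lanes_per_reg=8):
    """Optimistic peak register count for a straight-line SSA program.

    Hypothetical cost model: a divergent value needs
    simd_width // lanes_per_reg registers, a convergent value needs one.
    """
    # Index of the last instruction that reads each value.
    last_use = {}
    for i, ins in enumerate(instrs):
        for src in ins.srcs:
            last_use[src] = i

    cost = {}
    live = set()
    peak = 0
    for i, ins in enumerate(instrs):
        # Sources whose last use is this instruction are freed first,
        # which keeps the estimate on the optimistic side.
        live = {v for v in live if last_use[v] > i}
        cost[ins.dest] = max(1, simd_width // lanes_per_reg) if ins.divergent else 1
        live.add(ins.dest)
        peak = max(peak, sum(cost[v] for v in live))
        if ins.dest not in last_use:
            live.discard(ins.dest)  # never read, so it dies immediately
    return peak


def should_try_simd(instrs, simd_width, budget):
    """Skip a SIMD width when even the optimistic estimate exceeds the budget."""
    return estimate_pressure(instrs, simd_width) <= budget


PROGRAM = [
    Instr("a", (), True),
    Instr("b", (), False),
    Instr("c", ("a", "b"), True),
    Instr("d", ("c", "c"), True),
]
```

With this toy program, the estimate grows with the SIMD width only through the divergent values, which is why divergence information makes it easier to scale the figure up.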
Kenneth Graunke	5ebd766156	brw: Do most of NIR postprocessing before cloning for SIMD variants
We were doing a lot of NIR work repeatedly for each SIMD variant of compute and mesh shaders. Instead, do it once before cloning, and just do one final optimization loop and out-of-SSA for each.
fossil-db results on Arc B580:
Totals:
Instrs: 233771096 -> 233794024 (+0.01%); split: -0.01%, +0.02%
Subgroup size: 15922768 -> 15922736 (-0.00%); split: +0.00%, -0.00%
Send messages: 12095619 -> 12098234 (+0.02%); split: -0.00%, +0.02%
Loop count: 137562 -> 137523 (-0.03%)
Cycle count: 32600323744 -> 32667411252 (+0.21%); split: -0.06%, +0.27%
Spill count: 540908 -> 542027 (+0.21%); split: -0.07%, +0.28%
Fill count: 700938 -> 698983 (-0.28%); split: -0.73%, +0.45%
Scratch Memory Size: 37266432 -> 37304320 (+0.10%); split: -0.10%, +0.20%
Max live registers: 72691728 -> 72692987 (+0.00%); split: -0.00%, +0.00%
Non SSA regs after NIR: 67690309 -> 67688352 (-0.00%); split: -0.01%, +0.00%
Totals from 3576 (0.45% of 789301) affected shaders:
Instrs: 6932956 -> 6955884 (+0.33%); split: -0.41%, +0.74%
Subgroup size: 88816 -> 88784 (-0.04%); split: +0.09%, -0.13%
Send messages: 329168 -> 331783 (+0.79%); split: -0.02%, +0.81%
Loop count: 8753 -> 8714 (-0.45%)
Cycle count: 15153678820 -> 15220766328 (+0.44%); split: -0.14%, +0.58%
Spill count: 213751 -> 214870 (+0.52%); split: -0.18%, +0.71%
Fill count: 282616 -> 280661 (-0.69%); split: -1.82%, +1.13%
Scratch Memory Size: 13056000 -> 13093888 (+0.29%); split: -0.27%, +0.56%
Max live registers: 834757 -> 836016 (+0.15%); split: -0.11%, +0.26%
Non SSA regs after NIR: 995033 -> 993076 (-0.20%); split: -0.48%, +0.28%
Looking at a few of the shaders with substantial instruction count increases, it appears that it is largely due to more loops being unrolled, which is probably actually a good thing. The compile time impact of this patch appears to be negligible.
However, doing postprocessing before SIMD cloning allows us to examine the postprocessed SSA-form NIR for improvements in an upcoming patch.
Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36750>	2025-09-30 19:44:02 +00:00
Kenneth Graunke	0712c220ab	brw: Split brw_postprocess_nir() into two pieces brw_postprocess_nir contains a lot of stuff these days. The first part does a bunch of lowering and cleanup optimizations in SSA form. The second part does some post-optimization lowering and the out-of-SSA conversion. We may want to do additional work before the post-optimization/post-SSA phase. Splitting this allows us to insert such tasks in the "middle". For convenience, brw_postprocess_nir() becomes a wrapper which invokes both parts, so callers can continue working as they did until they have a reason to do otherwise. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36750>	2025-09-30 19:44:02 +00:00
Kenneth Graunke	71b513a1e9	brw: Lower certain subgroup size modes in brw_preprocess_nir This allows us to lower known subgroup size cases earlier, giving us some earlier optimization opportunities. We would need to know the actual SIMD width to handle certain cases, but we can just pass 0 here, which will lead to get_subgroup_size returning 0 - the same as leaving this unset. We can come back to that later during the per-SIMD-width postprocessing. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36750>	2025-09-30 19:44:02 +00:00
Kenneth Graunke	3e493e03cc	brw: Move "SSA form" printing to after divergence analysis is run We were printing the SSA form, then immediately running divergence analysis. This patch flips those, so we can see con/div in INTEL_DEBUG output for SSA form, which is really useful. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36750>	2025-09-30 19:44:02 +00:00
Kenneth Graunke	1b0808adf3	intel/nir: Make ffma peephole optimization preserve fp_fast_math flags float_controls2 may have marked these as needing to preserve NaN or other values. If so, our newly contracted ffma needs to as well. Fixes dEQP-VK.spirv_assembly.instruction.compute.float_controls2..input_args.mat_det_testedWithout_NotNan when nir_opt_algebraic is run after this pass. Cc: mesa-stable Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36750>	2025-09-30 19:44:02 +00:00
Ian Romanick	23bd356b42	brw/nir: nir_intrinsic_load_reloc_const_intel may not be scalar [v3] If the (NIR) destination is a register (i.e., not an SSA value), the destination of the BRW instruction will not be is_scalar. This occurs in some shaders in Final Fantasy XVI (and finalfantasytype0_1.rdc.2826e29da3722a83.1.foz). If the destination is not is_scalar, revert most of this code to the state previous to `f3593df877`. This means - Allocate a SIMD1 register and UNDEF it. - Emit a SIMD1 MOV_RELOC_IMM to that register. - Emit an additional MOV to expand the SIMD1 result. Closes: #12520 Fixes: `f3593df877` ("brw/nir: Treat load_reloc_const_intel as convergent") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37384>	2025-09-29 16:48:07 +00:00
Jordan Justen	be61c12f3e	anv: Use image view base-layer in can_fast_clear_color_att() We currently only support fast clearing the first layer of an image. Attachments use VkImageView which can specify a base-layer of the view for an image attachment. Fixes: `44351d67f8` ("anv: Change params of anv_can_fast_clear_color_view") Ref: https://projects.blender.org/blender/blender/issues/141181 Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37562>	2025-09-26 19:15:22 +00:00
José Roberto de Souza	141a225ca1	intel/brw: Use ASR over SHR for SHADER_OPCODE_ISUB_SAT src[1]/src0 is signed, and SHR on Xe2+ doesn't support signed data types, so let's switch to ASR, which does. Cc: mesa-stable Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37557>	2025-09-26 16:44:24 +00:00
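For readers unfamiliar with the distinction, the difference between a logical shift right (SHR) and an arithmetic shift right (ASR) on a signed 32-bit value can be sketched as follows. This only models generic shift semantics; it does not reproduce the ISUB_SAT lowering, and the helper names are made up for illustration.

```python
MASK32 = 0xFFFFFFFF


def shr32(x, n):
    """Logical shift right: treats the 32-bit pattern as unsigned, fills with zeros."""
    return (x & MASK32) >> n


def asr32(x, n):
    """Arithmetic shift right: treats the 32-bit pattern as signed, replicates the sign bit."""
    x &= MASK32
    if x & 0x80000000:
        x -= 1 << 32          # reinterpret the bit pattern as a negative integer
    return x >> n             # Python's >> on negative ints is arithmetic


# -8 >> 1 keeps its sign with ASR, but becomes a large positive number with SHR.
signed_result = asr32(-8, 1)
unsigned_result = shr32(-8, 1)
```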
José Roberto de Souza	c45f442d5c	intel/decode: Add support to new version of Xe KMD devcoredump with canonical addresses Customers suggested that Xe KMD should change all possible interfaces visible to users to canonical addresses; with that, we need some changes to keep the decode of devcoredump working. An old version of the tool will not be able to decode secondary batch buffers when parsing a new version of the file, but the new version of this tool will be able to parse both versions of the devcoredump file. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37570>	2025-09-26 16:15:53 +00:00
Hyunjun Ko	b7129a2085	anv/video: fix to set slice block size correctly for h265 decoding. Fixes dEQP-VK.video.encode.h265.resolution_change_dpb_layered_src_video_layout Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37412>	2025-09-26 12:27:59 +00:00
Simon McVittie	9d36bf891b	vulkan: Compute path to write into JSON manifests once, use it everywhere This reduces duplication: we only need to distinguish between Windows and Unix in one place. The previous code was inconsistent about using either the `platforms` option, or the `host_machine`. Following the logic described in commit `94379377` "lavapipe: build "Windows" check should use the host machine, not the `platforms` option.", I've assumed that checking the host machine is the more-correct version and used that. Signed-off-by: Simon McVittie <smcv@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37576>	2025-09-26 10:47:31 +00:00
Simon McVittie	be8cac52d3	vulkan: Consistently form driver library names as prefix + name + suffix This consistently uses `NAME.dll` on Windows, `libNAME.dylib` on Darwin derivatives such as macOS, and `libNAME.so` on Linux, *BSD and so on. It's also consistent about using the local variable name `icd_file_name` for this name in every Vulkan driver, which was already the case in many but not all drivers. Some of these drivers probably don't make sense (or don't work) on Windows and/or macOS, but if this is kept consistent for all drivers, it should avoid the need for driver-specific commits like commit `611e9f29e` "lavapipe: fix icd generation for windows", commit `951f3287` "lavapipe: set empty dll prefix", commit `13e7a39f` "lavapipe: fixes for macOS support", commit `7008e655` "radv: Update JSON generator if Windows" and so on, each time a driver is found to be relevant on more platforms than previously believed. Signed-off-by: Simon McVittie <smcv@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37576>	2025-09-26 10:47:31 +00:00
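The prefix + name + suffix rule described in this commit can be sketched as a small helper. This is an illustration rather than the Meson code from the commit: the function signature is hypothetical, the `system` strings are assumed to follow Meson's `host_machine.system()` values, and `vulkan_example` is a made-up driver name.

```python
def icd_file_name(name, system):
    """Form a driver library file name as prefix + name + suffix.

    `system` is assumed to be a lowercase host system identifier
    such as "windows", "darwin" or "linux".
    """
    if system == "windows":
        prefix, suffix = "", ".dll"
    elif system == "darwin":
        prefix, suffix = "lib", ".dylib"
    else:
        # Linux, *BSD and so on.
        prefix, suffix = "lib", ".so"
    return prefix + name + suffix


# Hypothetical driver name, shown for each platform family.
names = {s: icd_file_name("vulkan_example", s) for s in ("windows", "darwin", "linux", "freebsd")}
```

Keeping the rule in one place is what removes the need for per-driver platform fixes.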
Tapani Pälli	c8f47d7681	blorp: add missing pipecontrol after 3DSTATE_WM_HZ_OP for Xe2+ Backport-to: 25.2 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37547>	2025-09-26 10:07:18 +00:00
Mauro Rossi	7b50b8966b	intel/mda: Fix gnu-empty-initializer warning This also causes build errors on older Android prebuilt clang. Fixes: `bccc0fa9` ("intel/mda: Add code to produce mesa debug archives") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37586>	2025-09-26 08:30:16 +00:00
Tim Van Patten	f90e0f0797	intel: Convert getenv() to os_get_option() os_get_option() is a wrapper for getenv() that checks properties in Android. It should be a no-op for other OS but will allow full use of env vars in Android. The environment variable names are automatically renamed by os_get_option() and the order of precedence thus becomes: 1. getenv (non-Android) 2. debug.mesa.* (Android) 3. vendor.mesa.* (Android) 4. mesa.* (Android, as a fallback for older versions) Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37587>	2025-09-25 17:01:18 -06:00
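The order of precedence listed in this commit can be modelled with plain dictionaries. This is a sketch, not the C implementation of os_get_option(): the function below and its parameters are hypothetical, and the renaming rule (lowercase, underscores replaced by dots) is an assumption made only so the example runs; the commit message does not spell out the exact rule.

```python
ANDROID_PREFIXES = ("debug.mesa.", "vendor.mesa.", "mesa.")


def get_option(name, env, props, is_android):
    """Look up an option following the precedence described in the commit.

    env   -- mapping standing in for the process environment
    props -- mapping standing in for Android system properties
    """
    if not is_android:
        return env.get(name)                     # 1. getenv (non-Android)

    # Assumed renaming rule, for illustration only.
    key = name.lower().replace("_", ".")
    for prefix in ANDROID_PREFIXES:              # 2., 3., 4. in order
        value = props.get(prefix + key)
        if value is not None:
            return value
    return None


props = {"vendor.mesa.intel.debug": "from-vendor", "mesa.intel.debug": "from-fallback"}
android_value = get_option("INTEL_DEBUG", {}, props, is_android=True)
desktop_value = get_option("INTEL_DEBUG", {"INTEL_DEBUG": "from-env"}, props, is_android=False)
```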
Caio Oliveira	f011e5707d	brw: Identify if/break/endif special case before emission Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37147>	2025-09-25 06:36:10 +00:00
Caio Oliveira	f00fca998e	intel/mda: Add MDA_FILTER to select which archives to generate Matches if a name contains the filter value; multiple values can be separated by commas. Acked-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29146>	2025-09-24 23:08:45 -07:00
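The matching rule for MDA_FILTER (substring match, comma-separated values) can be sketched like this. The function name, the example archive names and the treatment of an empty filter are assumptions, not taken from the Mesa source.

```python
def matches_filter(name, filter_value):
    """Return True if `name` contains any of the comma-separated filter values."""
    patterns = [p for p in filter_value.split(",") if p]
    if not patterns:
        # Assumption: an empty or unset filter selects every archive.
        return True
    return any(p in name for p in patterns)


# Hypothetical archive names.
candidates = ["game.anv.mda.tar", "game.iris.mda.tar", "tool.anv.mda.tar"]
selected = [n for n in candidates if matches_filter(n, "iris,tool")]
```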
Caio Oliveira	34198545aa	intel/mda: Use archive filename as directory name instead of hardcoded "mda/" This changes debug_archiver to create files inside a directory named after the archive filename (e.g., "debug.anv.mda/") instead of the hardcoded "mda/" directory. This prevents conflicts if multiple mda.tar files are extracted in the same location. Acked-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29146>	2025-09-24 23:08:45 -07:00
Caio Oliveira	3ff37e97a0	intel/mda: Allow more toplevel directory names inside mda.tar files Will allow generating anv.tar with a unique "first directory name" instead of always "mda/", avoiding overlaps if the user untars the files. The name still must end with "mda/". Acked-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29146>	2025-09-24 23:08:45 -07:00
Caio Oliveira	a1d66c18c9	intel/mda: If MDA_PREFIX=timestamp use the actual timestamp as a prefix Acked-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29146>	2025-09-24 23:08:45 -07:00
Caio Oliveira	c21494576e	intel/mda: Add MDA_OUTPUT_DIR and MDA_PREFIX environment variable support Acked-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29146>	2025-09-24 23:08:45 -07:00
Caio Oliveira	dfdc883833	intel/mda: Add pager support Acked-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29146>	2025-09-24 23:08:45 -07:00
Caio Oliveira	ceb1baf939	intel/mda: Handle non-contiguous object versions in mda.tar files Allow archives to have interleaved entries where different object versions don't appear together. This gives more flexibility to the drivers when producing the mda.tar files. Acked-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29146>	2025-09-24 23:08:45 -07:00
Caio Oliveira	38b462e09a	intel/mda: Add -U and -Y diff options Make it more ergonomic to use the common unified and side-by-side diffs, and also optionally set the context lines (for -U) and the width (for -Y). This should cover most common cases. MDA_DIFF_COMMAND is still available for full control. Acked-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29146>	2025-09-24 23:08:45 -07:00
Caio Oliveira	ef3cafe44e	intel/mda: Add search/searchall commands Acked-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29146>	2025-09-24 23:08:45 -07:00
Caio Oliveira	44f39eea55	intel/mda: Add tool to inspect mesa debug archives The debug archive files are regular tar files, so they can be inspected by tar, and also used directly by file managers and editors. However a few common tasks are worth having already set up in the repository. This tool adds convenience to some of those tasks, including - Printing the last version of a shader representation; - Printing a `git-log`-like view of the changes of a shader; - Comparing two shaders, e.g. SIMD8 and SIMD16 shaders in Intel; - Comparing two specific versions of any shaders. See the "manual" inside the commit for more details. Acked-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29146>	2025-09-24 23:08:45 -07:00
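Because the debug archives are regular tar files, generic tooling is enough to look inside one. The sketch below builds a tiny in-memory tar with Python's standard `tarfile` module and lists its members. The member paths and contents are invented for illustration and do not reflect the real layout produced by the drivers.

```python
import io
import tarfile

# Invented member paths and contents; the real archive layout may differ.
DEMO_ENTRIES = [
    ("debug.anv.mda/shader0/NIR/000", "first version\n"),
    ("debug.anv.mda/shader0/NIR/001", "second version\n"),
]


def build_demo_archive(entries):
    """Write the given (path, text) pairs into an uncompressed in-memory tar."""
    buf = io.BytesIO()
    with tarfile.open(fileobj=buf, mode="w") as tar:
        for path, text in entries:
            data = text.encode("utf-8")
            info = tarfile.TarInfo(name=path)
            info.size = len(data)
            tar.addfile(info, io.BytesIO(data))
    return buf.getvalue()


def list_members(archive_bytes):
    """Return the member names of a tar archive held in memory."""
    with tarfile.open(fileobj=io.BytesIO(archive_bytes), mode="r") as tar:
        return [m.name for m in tar.getmembers()]


def read_member(archive_bytes, path):
    """Return the decoded contents of one member."""
    with tarfile.open(fileobj=io.BytesIO(archive_bytes), mode="r") as tar:
        return tar.extractfile(path).read().decode("utf-8")


archive = build_demo_archive(DEMO_ENTRIES)
```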
Caio Oliveira	6c90a9c8e3	anv: Create archive file when using INTEL_DEBUG=mda Acked-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29146>	2025-09-24 23:08:45 -07:00
Caio Oliveira	bce2cb1e24	anv: Refactor anv_shader_compile result handling Acked-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29146>	2025-09-24 23:08:45 -07:00
Caio Oliveira	9f6155e47d	brw: Also include the final disassembly in the debug archive This doesn't replace existing support for INTEL_DEBUG=shaders -- so both `shaders` and `mda` can be used. Acked-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29146>	2025-09-24 23:08:45 -07:00
Caio Oliveira	cdef824b7a	brw: Include some NIR states in the debug archive Acked-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29146>	2025-09-24 23:08:45 -07:00
Caio Oliveira	f82d85a685	brw: Use debug archive file with INTEL_DEBUG=mda Instead of dumping multiple files with the optimizer passes, write a single archive file with all the contents. The actual file is created by the drivers, so later commits will actually enable the feature in anv and iris. This removes the use of INTEL_DEBUG=optimizer (and the corresponding enum value) in brw. That environment variable is still used by ELK -- which currently doesn't support mda. Acked-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29146>	2025-09-24 23:08:45 -07:00
Caio Oliveira	bccc0fa984	intel/mda: Add code to produce mesa debug archives Uses the tar format to collect multiple output files. It can be inspected using the regular UNIX tools, but a later commit will add a specialized tool to perform common tasks. The tar implementation is enough to fulfill the current needs without adding a dependency. There's also a small test mostly to ensure scaffolding is there in case we need to expand the implementation. Acked-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29146>	2025-09-24 23:08:45 -07:00
Iván Briano	186cd59cf2	anv: use the color_map if present for calculating color_mask If the FS has writes to multiple color outputs, but there are not enough color attachments for them all, we may optimize out the exceeding ones. With VK_KHR_dynamic_rendering_local_read, we were not respecting the mapping from output to attachment set by the application, and the wrong writes were getting eliminated. Fixes future CTS tests: dEQP-VK.renderpasses.dynamic_rendering.primary_cmd_buff.local_read.remap_single_attachment* Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37531>	2025-09-24 21:47:42 +00:00
Iván Briano	66fcae5b1b	anv: fix FS output <-> attachment map building The array is first memset to OUTPUT_DISABLED, but if we iterate over MAX_RTS instead of the actual attachment count, we end up resetting any values not set by the application to the (probably identity) mapping that comes from the state. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37531>	2025-09-24 21:47:42 +00:00
Sushma Venkatesh Reddy	95669ad5df	intel/executor: Add examples for srnd srnd_edge_cases.lua is checking edge cases. srnd_randomized.lua is shared by Caio and it serves as a good example for understanding the randomness and probability of rounding. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36529>	2025-09-24 17:18:37 +00:00
Sushma Venkatesh Reddy	a1c5f1ccf6	intel/compiler: Validation for SRND instructions Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36529>	2025-09-24 17:18:37 +00:00
Sushma Venkatesh Reddy	fe6d364ca8	brw: Add assembler support for SRND Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36529>	2025-09-24 17:18:37 +00:00
Sushma Venkatesh Reddy	51f4a2572a	intel/compiler: Initial bits for SRND instruction Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36529>	2025-09-24 17:18:37 +00:00
Dylan Baker	1c930a505e	anv: don't attempt to memcpy if allocation fails Based on git history, these appear to be a subset of `anv_batch_emit_batch`, so I've structured the code similarly: if `anv_batch_emit_dwords` returns `nullptr`, we just move on without copying the memory. CID: 1665339 CID: 1664814 Reviewed-by: Iván Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37534>	2025-09-24 15:29:48 +00:00
Eric Engestrom	2f9fd1768a	intel/meson: generate spirv_info.h before compiling brw_spirv.c Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37544>	2025-09-24 10:23:18 +00:00
Lionel Landwerlin	e9910fa955	brw: fix type conversion in tex operation params Fix a bunch of tests in dEQP-VK.glsl.texture_gather.* on Xe2+ Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `bddfbe7fb1` ("brw/blorp: lower MCS fetching in NIR") Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37532>	2025-09-24 08:47:03 +00:00
Lionel Landwerlin	66e9a2f61d	anv: fix crash in ESO tests dEQP-VK.shader_object.misc.state.shaders.vert_geom_frag.geometry_streams.enabled dEQP-VK.shader_object.misc.state.shaders.vert_tess_geom_frag.geometry_streams.enabled Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `1d8847afcf` ("anv: fix streamout config comparison") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37535>	2025-09-24 00:34:50 +03:00
Iván Briano	f97b51186f	anv: intermediate RGB <-> RGBX copy for HIC For 3-component RGB images with OPTIMAL tiling, we need to create the surface as RGBX or RGBA. When a host image copy to/from this image happens, we calculate sizes and offsets based on the 4-component surface and blow past the end of the 3-component API provided buffer. Hilarity^WSegfault ensues. Ideally we'd calculate the right sizes and have the tiled copy functions handle the conversion, but they are format unaware and expect to just copy bytes in blocks of equal sizes from both sides. Handle this case by making an intermediate copy to/from linear RGB from/to linear RGBX, and pass that intermediate slice to the tiled copy functions. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36453>	2025-09-23 18:06:14 +00:00
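The intermediate linear RGB <-> RGBX copy can be pictured with a byte-level sketch. It assumes one byte per component and a caller-chosen padding byte; the function names are hypothetical, and the real code works on image slices inside the driver rather than on Python byte strings.

```python
def rgb_to_rgbx(rgb, pad=0xFF):
    """Expand tightly packed 3-byte pixels to 4-byte pixels with a padding byte."""
    if len(rgb) % 3 != 0:
        raise ValueError("RGB buffer length must be a multiple of 3")
    out = bytearray()
    for i in range(0, len(rgb), 3):
        out += rgb[i:i + 3]
        out.append(pad)
    return bytes(out)


def rgbx_to_rgb(rgbx):
    """Drop the fourth byte of every 4-byte pixel."""
    if len(rgbx) % 4 != 0:
        raise ValueError("RGBX buffer length must be a multiple of 4")
    return b"".join(rgbx[i:i + 3] for i in range(0, len(rgbx), 4))


# Two pixels: the application buffer holds 6 bytes, the surface copy needs 8.
api_buffer = bytes([1, 2, 3, 4, 5, 6])
intermediate = rgb_to_rgbx(api_buffer)
```

The size mismatch between the two buffers (6 bytes versus 8 here) is the reason a copy sized for the 4-component surface overruns the 3-component buffer.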
Iván Briano	5a18d8d867	anv: drop height_pitch parameter from anv_copy_image_memory Calculate the start of the memory pointer in the caller instead. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36453>	2025-09-23 18:06:14 +00:00
Iván Briano	c02a27bce1	anv: handle multiple aspects in vkCopyImageToImage Per VUID-VkCopyImageToImageInfo-srcImage-09069, srcImage and dstImage must have been created with identical image creation parameters, so we are not going to have copies from color <-> depth/stencil, but we can copy both D/S aspects of an image at the same time. Nothing says that we can't copy from one plane of a multiplanar image to another, so handle that case too (though nothing is currently testing it). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36453>	2025-09-23 18:06:14 +00:00


14714 commits