fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-25 01:48:18 +02:00

Author	SHA1	Message	Date
Caio Oliveira	4b4500ad35	brw/cmat: Store more information about cmat slices Store the cmat_description and packing_factor so that various functions don't need to extract and recalculate them. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34105>	2025-04-29 16:29:37 +00:00
Caio Oliveira	a7ff177a88	brw: Consider bfloat16 in lower simd width pass Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34105>	2025-04-29 16:29:37 +00:00
Caio Oliveira	2c31516b3e	brw: Consider bfloat16 in lower regioning pass Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34105>	2025-04-29 16:29:37 +00:00
Caio Oliveira	5936768ce0	brw: Consider bfloat16 in copy propagation Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34105>	2025-04-29 16:29:37 +00:00
Caio Oliveira	129c074811	brw: Implement support for BFloat16 ALU opcodes Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34105>	2025-04-29 16:29:37 +00:00
Caio Oliveira	a38960e8f3	brw, nir: Use glsl_base_type instead of nir_alu_type for @dpas_intel This will allow including types that don't have a nir_alu_type equivalent, like bfloat16. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34105>	2025-04-29 16:29:37 +00:00
Rohan Garg	9e5d7eb88d	compiler/types: add a bfloat16 type Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34105>	2025-04-29 16:29:36 +00:00
Caio Oliveira	3e0418ba02	intel/executor: Fix bfloat example for converting F to packed BF In floating point rules adding +0.0f preserves all values except for -0.0f, so what we want here is to add -0.0f. In the future we should add proper support for float immediates in the assembler. Fixes: `fafdd24285` ("intel/executor: Update bfloat example") Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34105>	2025-04-29 16:29:36 +00:00
Eric Engestrom	4227982326	ci: rename misleading -postmerge stages to -nightly These stages are for the jobs that are skipped in merge pipelines, automatically run in nightly pipelines, and are available to run manually in other pipelines. None of these ever run in post-merge pipelines. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34590>	2025-04-29 05:49:00 +00:00
Valentine Burley	10ea0002a6	ci/intel: Convert to using the new container based rootfs Signed-off-by: Valentine Burley <valentine.burley@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34451>	2025-04-28 20:08:32 +00:00
Ian Romanick	c2ac7fa77b	brw/cmod: Allow integer CMP to ADD propagation only for Z and NZ No shader-db changes on any Intel platform. v2: Add a note about integer types in the saturate handling path. fossil-db: All Intel platforms had similar results. (Lunar Lake shown) Totals: Instrs: 210743769 -> 210743727 (-0.00%) Cycle count: 30377699060 -> 30377700318 (+0.00%); split: -0.00%, +0.00% Totals from 36 (0.01% of 706776) affected shaders: Instrs: 17032 -> 16990 (-0.25%) Cycle count: 291716 -> 292974 (+0.43%); split: -0.01%, +0.44% Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34509>	2025-04-28 19:44:23 +00:00
Ian Romanick	e26270249b	brw/cmod: Don't propagate from CMP to possible Inf + (-Inf) Most of the churn in this commit is changing unit tests that were testing things that are now invalid. shader-db: All Intel platforms had similar results. (Lunar Lake shown) total instructions in shared programs: 17122204 -> 17122669 (<.01%) instructions in affected programs: 120669 -> 121134 (0.39%) helped: 0 / HURT: 124 total cycles in shared programs: 895602370 -> 895613210 (<.01%) cycles in affected programs: 17868974 -> 17879814 (0.06%) helped: 35 / HURT: 85 fossil-db: All Intel platforms had similar results. (Lunar Lake shown) Totals: Instrs: 210736518 -> 210743769 (+0.00%) Cycle count: 30377733040 -> 30377699060 (-0.00%); split: -0.00%, +0.00% Max live registers: 66056852 -> 66056966 (+0.00%) Totals from 1505 (0.21% of 706776) affected shaders: Instrs: 1890151 -> 1897402 (+0.38%) Cycle count: 48397408 -> 48363428 (-0.07%); split: -0.11%, +0.04% Max live registers: 256821 -> 256935 (+0.04%) Reviewed-by: Matt Turner <mattst88@gmail.com> Fixes: `020b0055e7` ("i965/fs: Propagate conditional modifiers from compares to adds") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34509>	2025-04-28 19:44:23 +00:00
Ian Romanick	0dab520a19	brw/cmod: Fix some errors when propagating from CMP to ADD.SAT When I originally wrote that code, I didn't understand what a jerk NaN can be. v2: Remove the brw_type_is_uint stuff. This function is currently only called for float types. In a later commit, integer types will be supported but only for NZ and Z conditions. Noticed by Matt. shader-db: All Intel platforms had similar results. (Lunar Lake shown) total instructions in shared programs: 17122197 -> 17122204 (<.01%) instructions in affected programs: 1691 -> 1698 (0.41%) helped: 0 / HURT: 4 total cycles in shared programs: 895602484 -> 895602370 (<.01%) cycles in affected programs: 912964 -> 912850 (-0.01%) helped: 2 / HURT: 2 fossil-db: All Intel platforms had similar results. (Lunar Lake shown) Totals: Instrs: 210736388 -> 210736518 (+0.00%) Cycle count: 30377728900 -> 30377733040 (+0.00%); split: -0.00%, +0.00% Totals from 130 (0.02% of 706776) affected shaders: Instrs: 169911 -> 170041 (+0.08%) Cycle count: 18021210 -> 18025350 (+0.02%); split: -0.00%, +0.02% Reviewed-by: Matt Turner <mattst88@gmail.com> Fixes: `020b0055e7` ("i965/fs: Propagate conditional modifiers from compares to adds") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34509>	2025-04-28 19:44:23 +00:00
Ian Romanick	8f0fd0e66e	brw/cmod: Remove special handling of NOT The previous commit converts any NOT that might have been affected by this path into a simple MOV. Those MOVs are handled by other paths. No shader-db or fossil-db changes on any Intel platform. v2: Fix a bad squash. Changes that were accidentally in this commit were supposed to be in the previous commit. Noticed by Ivan. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34509>	2025-04-28 19:44:23 +00:00
Ian Romanick	08fe7988d7	brw/algebraic: Convert some NOT to MOV On Xe platforms, many fragment shaders have patterns like: asr(8) g21<2>W g1.2<0,1,0>W 15D ... mov(8) g11<1>UW g21<16,8,2>UW ... not.nz.f0.0(8) null<1>D g11<8,8,1>W Converting the NOT.NZ to MOV.Z enables copy propagation to eliminate the original MOV. Then cmod propagation is able to eliminate the NOT-converted-to-MOV. It might be possible to cover this case by adding more opcodes to the list NOT can propagate to. The next commit will show that just converting to MOV is a better approach anyway. v2: Fix a bad squash. Changes that were supposed to be in this commit were accidentally in the next commit. Noticed by Ivan. shader-db: Meteor Lake, DG2, and Tiger Lake had similar results. (Meteor Lake shown) total instructions in shared programs: 20069804 -> 20065167 (-0.02%) instructions in affected programs: 592450 -> 587813 (-0.78%) helped: 2300 / HURT: 0 total cycles in shared programs: 884534032 -> 884496201 (<.01%) cycles in affected programs: 13064194 -> 13026363 (-0.29%) helped: 1285 / HURT: 790 LOST: 18 GAINED: 15 fossil-db: Meteor Lake, DG2, and Tiger Lake had similar results. (Meteor Lake shown) Totals: Instrs: 234506495 -> 234468664 (-0.02%) Cycle count: 24444825202 -> 24445710703 (+0.00%); split: -0.01%, +0.01% Max live registers: 42349793 -> 42349789 (-0.00%) Max dispatch width: 7131344 -> 7131744 (+0.01%); split: +0.05%, -0.04% Totals from 16673 (2.07% of 805781) affected shaders: Instrs: 6497669 -> 6459838 (-0.58%) Cycle count: 435877770 -> 436763271 (+0.20%); split: -0.54%, +0.74% Max live registers: 1122972 -> 1122968 (-0.00%) Max dispatch width: 151528 -> 151928 (+0.26%); split: +2.19%, -1.92% No shader-db or fossil-db on any other Intel platforms. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34509>	2025-04-28 19:44:23 +00:00
Ian Romanick	9ce869aef5	brw/cmod: Delete some stale comment text Stale like the mummified remains of Ötzi, The Iceman. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34509>	2025-04-28 19:44:23 +00:00
Ian Romanick	12a022cf45	brw/algebraic: Greatly simplify brw_opt_constant_fold_instruction brw_opt_constant_fold_instruction can either do nothing or replace the instruction with a MOV of an immediate value. Previously each opcode case would perform this replacement, and code at the bottom of the function would verify the results. It is much simpler if each opcode case calculates a result in a brw_reg, and code at the bottom of the function performs the replacement. There are two outlier cases that cannot use this pattern: MAD and BROADCAST. These cases simply return directly from the switch-statement after performing the replacement. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34707>	2025-04-28 18:33:42 +00:00
Tapani Pälli	ed9f135936	anv: put parenthesis to the set_sampler_size equation This fixes errors seen with some renderdoc captures failing to allocate descriptor sets. Fixes: `76096d04bb` ("anv: relax restriction on variable count descriptors") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34671>	2025-04-28 04:45:01 +00:00
Lionel Landwerlin	e60416b4e4	anv: use companion batch for operations with HIZ/STC_CCS destination We're currently crashing a couple of tests : dEQP-VK.pipeline.monolithic.depth.xfer_queue_layout.* deqp-vk: ../src/intel/blorp/blorp_blit.c:2935: blorp_copy: Assertion `blorp_copy_supports_blitter(batch->blorp, src_surf->surf, dst_surf->surf, src_surf->aux_usage, dst_surf->aux_usage)' failed. Tested on: dEQP-VK.api.copy_and_blit.copy_commands2.image_to_image_transfer_queue.all_formats.depth_stencil.* dEQP-VK.api.copy_and_blit.multiplanar_xfer.* dEQP-VK.pipeline.monolithic.depth.xfer_queue_layout.* Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `31eeb72e45` ("blorp: Add support for blorp_copy via XY_BLOCK_COPY_BLT") Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34023>	2025-04-24 14:47:40 +00:00
Tapani Pälli	765801fd9e	intel/dev: add note about PAT entries and Wa_18038669374 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34665>	2025-04-24 09:48:34 +00:00
Lionel Landwerlin	1f6cca0800	intel: fixup a few debugging option checks Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `ad328bc58d` ("intel: Switch uint64_t intel_debug to a bitset") Reviewed-by: Michael Cheng <michael.cheng@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34667>	2025-04-23 18:47:42 +00:00
Michael Cheng	3c267535ae	anv: Add new debug flag to show shader stage Add debug option to show current shader type being compiled within anv_shader_bin_create. Signed-off-by: Michael Cheng <michael.cheng@intel.com> Reviewed-by: Casey Bowman <casey.g.bowman@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34596>	2025-04-22 23:09:26 +00:00
Michael Cheng	ad328bc58d	intel: Switch uint64_t intel_debug to a bitset We are reaching our limit of adding flags to intel_debug (approaching 64 flags). Switch intel_debug to a bitset, which gives us almost "unlimited" bits to use in the future. v2(Michael Cheng): Fixed a few ci errors Signed-off-by: Michael Cheng <michael.cheng@intel.com> Reviewed-by: Casey Bowman <casey.g.bowman@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34596>	2025-04-22 23:09:26 +00:00
Michael Cheng	2a1aa129ed	intel: Switch debug flags to enums to prep for bitset conversion Refactored the existing debug flags to use an enum instead of hardcoded 1ull << N macros. This is a prep step before the eventual switch of intel_debug to a bitset. Using enums gives us cleaner indexing and avoids annoying shift overflow warnings. No functional changes yet. Signed-off-by: Michael Cheng <michael.cheng@intel.com> Reviewed-by: Casey Bowman <casey.g.bowman@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34596>	2025-04-22 23:09:26 +00:00
José Roberto de Souza	fcb6dfb29c	intel: Fix the MOCS values in XY_BLOCK_COPY_BLT for Xe2+ One more instruction where the MOCS value was split into two registers. Cc: mesa-stable Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34592>	2025-04-22 20:42:25 +00:00
José Roberto de Souza	161c412a82	intel: Fix the MOCS values in XY_FAST_COLOR_BLT for Xe2+ Xe2 changed the MOCS field in a few instructions; those now have one field for the MOCS index and another for the encryption enable bit, but ISL returns the combination of both, aka MEMORY_OBJECT_CONTROL_STATE. To minimize changes I have added 2 macros to extract the values from the value returned by isl. Of all the instructions changed, Mesa only makes use of two, so the other instruction will be handled in the next patch. Cc: mesa-stable Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34592>	2025-04-22 20:42:25 +00:00
Caio Oliveira	6901f74fbf	intel/executor: Reorganize -h and --help Using -h will show a summarized view of the options, functions and macros. Using --help will open `man` with the longer contents, which is more convenient to search and gives a little bit of formatting. This scheme is similar to what is done for git subcommands, e.g. `git commit -h` and `git commit --help`. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34268>	2025-04-22 17:14:22 +00:00
Lina Versace	1bf8542490	anv: Enable VK_EXT_external_memory_acquire_unmodified Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Change-Id: If0480721f7f1fceec093e4ab7b5c9b712eb62ba1 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32295>	2025-04-21 13:55:32 -07:00
Lina Versace	3613b9c4f7	anv: Fix comment about external queue transitions Not all images with DRM format modifiers use ANV_IMAGE_MEMORY_BINDING_PRIVATE. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Change-Id: Idc6bae70ec7080f96555a85dcdc0ead915b02935 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32295>	2025-04-21 13:55:27 -07:00
Lina Versace	e87a04c6c1	anv: Assert that only external images have private bindings Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Change-Id: If2f18d88d48f70a58e236080632e72afb94f5e0b Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32295>	2025-04-21 13:55:08 -07:00
Sagar Ghuge	0463e14b94	anv: Enable 64bit memory structure mode for RT Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kevin Chuang <kaiwenjon23@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33047>	2025-04-21 20:10:45 +00:00
Kevin Chuang	703f29874b	intel/bvh/debug: Adapt instance leaf dumping to support 64-bit RT Adding a boolean "enable_64b_rt" in anv_accel_struct_header for the interpret.py to properly decode anv_instance_leaf Signed-off-by: Kevin Chuang <kaiwenjon23@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33047>	2025-04-21 20:10:45 +00:00
Kevin Chuang	cbc8af4555	intel/bvh: Compile and adapt bvh shaders separately into Xe1/2 and Xe3+ This change separate the encode, header, and copy shader into versions for Xe1/2 and Xe3+, including adding compile options and handling 64bit version of instance leaf for Xe3+. Signed-off-by: Kevin Chuang <kaiwenjon23@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33047>	2025-04-21 20:10:45 +00:00
Sagar Ghuge	36433e932b	intel/rt: Update BVH instance leaf load for Xe3+ Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kevin Chuang <kaiwenjon23@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33047>	2025-04-21 20:10:45 +00:00
Sagar Ghuge	5cd0f4ba2f	intel/compiler: Update MemRay data structure to 64-bit Rework: (Kevin) - Fix miss_shader_index offset - Handle hit group index Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kevin Chuang <kaiwenjon23@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33047>	2025-04-21 20:10:45 +00:00
Kevin Chuang	7b526de18f	intel/compiler/rt: Calculate barycentrics on demand This commit moves the calculation of tri_bary out of brw_nir_rt_load_mem_hit_from_addr(), and only do the calculation on demand, since unorm_float_convert can be expensive. We do this for both Xe1/2 and Xe3+ for consistency. Signed-off-by: Kevin Chuang <kaiwenjon23@gmail.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33047>	2025-04-21 20:10:45 +00:00
Sagar Ghuge	afc23dffa4	intel/compiler: Update MemHit data structure to 64-bit version Rework (Kevin): - Fix inst leaf ptr - Handle 24bit unorm barycentric coord Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kevin Chuang <kaiwenjon23@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33047>	2025-04-21 20:10:45 +00:00
Kevin Chuang	40fb95d51a	intel/compiler: Use 24bits for hit_kind on Xe3+ For Xe3+, the upper 8 bits of the second dword of a potential hit is used to store hitGroupIndex0, which is stuffed by the HW. This hitGroupIndex0 will later be used by the HW again to reconstruct the whole hitGroupIndex when driver issues a TRACE_RAY_COMMIT. We were corrupting this hitGroupIndex0 at the driver by setting the whole dword to hit_kind, which will cause the HW to read a wrong hitGroupIndex and therefore invoke a wrong closest hit shader. The behavior can be seen in dEQP-VK.ray_tracing_pipeline.pipeline_no_null_shaders_flag.gpu.boxes.\* and dEQP-VK.ray_tracing_pipeline.pipeline_library.configurations.\* This commit changes the driver to only use lower 24bits to store the hit_kind, and leave the upper 8bits as is. Signed-off-by: Kevin Chuang <kaiwenjon23@gmail.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33047>	2025-04-21 20:10:45 +00:00
Sagar Ghuge	64fd66407b	intel/compiler: Pass around intel_device_info parameter in helper This will help us to handle code path separately for Xe3+ for updated 64bit memory data structure for RT. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kevin Chuang <kaiwenjon23@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33047>	2025-04-21 20:10:45 +00:00
Sagar Ghuge	6deb1950a4	anv: Update RT dispatch globals to use 64bit data structure Rework (Kevin) - Fix Hit/Miss/Resume shader group table value Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kevin Chuang <kaiwenjon23@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33047>	2025-04-21 20:10:45 +00:00
Sagar Ghuge	fcd5fe4a75	intel/genxml/xe3: Update 3STATE_BTD field Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kevin Chuang <kaiwenjon23@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33047>	2025-04-21 20:10:45 +00:00
Sushma Venkatesh Reddy	4084527876	intel/compiler: Always run opt_algebraic after descriptor_lowering This change ensures that `brw_opt_algebraic` is always executed after `brw_lower_send_descriptors` in `brw_opt.cpp`. By doing so, redundant logical operations are optimized, resulting in cleaner and more compact assembly output. fossil-db results on LNL: - Totals: - Instructions: 215857290 -> 215857028 (-0.00%) - Cycle count: 32008929636 -> 32008935384 (+0.00%); split: -0.00%, +0.00% - Max live registers: 66940643 -> 66940557 (-0.00%) - Affected shaders (104 out of 713963): - Instructions: 31090 -> 30828 (-0.84%) - Cycle count: 5955908 -> 5961656 (+0.10%); split: -0.16%, +0.26% - Max live registers: 10888 -> 10802 (-0.79%) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34615>	2025-04-19 07:05:54 +00:00
Iván Briano	949d2e507d	anv: expose promoted KHR_depth_clamp_zero_one Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34614>	2025-04-18 21:31:37 +00:00
Rohan Garg	a5033c54e7	anv: use the common function for detecting a mesh shader stage Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34604>	2025-04-18 10:08:22 +00:00
Rohan Garg	9b477eea19	intel/compiler: use a immediate when doing the shift We can pass immediates to SHL and don't need to allocate a separate register here. Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34604>	2025-04-18 10:08:22 +00:00
Konstantin Seurer	2dee1117b7	vulkan: Add a vk_device parameter to get_encode_key Useful for selecting different encoding options based on hardware generation. Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34273>	2025-04-17 20:20:40 +00:00
Caio Oliveira	fd0a7efb5a	spirv, nir: Delay calculation of shared_size when using explicit layout Move the calculation to nir_lower_vars_to_explicit_types(). This consolidates the check of shader_info::shared_memory_explicit_layout in a single place instead of in all drivers. This is motivated by SPV_KHR_untyped_pointers. Before that extension we had essentially two modes for shared memory variables - No layout decorations in the SPIR-V, and both internal layout and driver location was _given by the driver_. - Explicitly laid out, i.e. they are blocks, and decorated with Aliased. Because they all alias, we could assign them driver location directly to the start of the shared memory. With the untyped pointers extension, there's a third option, to be added by a later commit - Explicitly laid out, i.e. they are blocks, and NOT decorated with Aliased. Driver location is _given by the driver_. Blocks with and without Aliased can be mixed. The driver location of multiple blocks that don't alias depend on alignment that is driver-specific, which we can more easily do from the nir_lower_vars_to_explicit_types() that already has access to a function to obtain such value. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> (hk) Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> (v3dv) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (anv/hasvk) Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> (panvk) Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (radv) Reviewed-by: Rob Clark <robdclark@gmail.com> (tu) Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34139>	2025-04-17 19:13:17 +00:00
José Roberto de Souza	a96e280dfe	intel: Program XY_FAST_COLOR_BLT::Destination Mocs for gfx12 Copy engine is not used in gfx12 platforms on ANV but that is possible in Iris. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34560>	2025-04-17 18:11:44 +00:00
Rohan Garg	cbc1ec4f73	anv: re enable compression for CPS surfaces on platforms other than Xe I accidentally disabled compression on CPS surfaces marked as storage or color attachment for all platforms, when this should only be limited to Xe. Fixes: 80f9b6 ('anv: CPB surfaces that are used as color attachments or for stores cannot be compressed') Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34297>	2025-04-17 14:24:11 +00:00
Daniel Stone	8d08cde667	ci/piglit: Use structured tagging for Piglit Structured tagging (cf. mesa/mesa!33421) captures a checksum of the thing we think we're building, and verifies this through the chain. When we run container builds, we check that the tag we've captured in the CI variables matches the calculated checksum, to make sure the declared tags are consistent and we always have traceability. When we run tests, we check the tags again between what was declared in the CI variables and what we're actually running from the test container. This makes sure that we're always testing what we think we're testing. As a side advantage, the rule inheritance we need to make this work means that we can start doing more optional downloads via overlays, instead of pulling a whole container full of stuff we might not ever use. Signed-off-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34539>	2025-04-17 09:22:39 +00:00
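
The fix in 3e0418ba02 above rests on an IEEE 754 property of signed zeros: under the default rounding mode, adding +0.0 turns -0.0 into +0.0, while adding -0.0 leaves these inputs unchanged. The following is a minimal sketch of that property only, not of the executor example itself. It assumes Python floats (IEEE 754 doubles, rather than the single-precision values the commit deals with), and the helper names `bits` and `preserved_by_adding` are hypothetical, not part of Mesa.

```python
import math
import struct


def bits(x: float) -> int:
    """Return the raw 64-bit pattern of a Python float (IEEE 754 double)."""
    return struct.unpack("<Q", struct.pack("<d", x))[0]


def preserved_by_adding(x: float, addend: float) -> bool:
    """True if x + addend has exactly the same bit pattern as x."""
    return bits(x + addend) == bits(x)


# A few representative inputs, including both signed zeros.
samples = [0.0, -0.0, 1.5, -2.25, math.inf, -math.inf]

# Inputs whose bit pattern changes when +0.0 is added (only -0.0 does).
changed_by_pos_zero = [x for x in samples if not preserved_by_adding(x, 0.0)]

# Inputs whose bit pattern changes when -0.0 is added (none of the samples).
changed_by_neg_zero = [x for x in samples if not preserved_by_adding(x, -0.0)]
```

Comparing bit patterns rather than values matters here, because `-0.0 == 0.0` is true under ordinary floating-point comparison and would hide the difference.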

15285 commits