fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-20 04:48:07 +02:00

Author	SHA1	Message	Date
Lionel Landwerlin	0a17035b5c	u_trace: add support for indirect data Allows a driver to declare indirect arguments for its tracepoints and pass an address. u_trace will request a copy of the data which should be implemented on the command processor. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Co-Authored-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29944>	2024-08-03 16:03:00 +03:00
Lionel Landwerlin	cb27b9541b	u_trace: remove timestamp reference in allocations We want to reduce the buffer allocations for other type of data than timestamps. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29944>	2024-08-03 16:02:56 +03:00
Lionel Landwerlin	4347ccbe57	u_trace: rework tracepoint argument declaration We're about to add indirect arguments, having a better way to describe arguments (as capture/storage) will be useful. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29944>	2024-08-03 16:02:53 +03:00
Jordan Justen	58469620d3	intel/brw/validate: Convert access mask to be grf based Our validation code doesn't need to know which bytes are accessed. It only needs to know which grfs were accessed by an element. This also helps to easily handle the Xe2 register size change. Backport-to: 24.2 Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28479>	2024-08-02 22:18:51 +00:00
Jordan Justen	e62606b2ec	intel/brw/validate: Update dst grf crossing check for Xe2 Rework: * Update grf_size_shift calculation (s-b Ken) Backport-to: 24.2 Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28479>	2024-08-02 22:18:51 +00:00
Jordan Justen	f2800deacb	intel/brw/validate: Simplify grf span validation check by not using a mask Previously this check would create a mask of the bytes used in the grf, and then shift the mask. This worked well when there was 32 bytes in the register because a 64-bit uint64_t could easily detect that bytes were used in the next regiter. (The next register was the high 32-bits of the `access_mask` variable.) With Xe2, the register size becomes 64 bytes, meaning this strategy doesn't work. Instead of a mask, we can just check to see if more than 1 grfs are used during each loop iteration. (Suggested by Ken.) This will make it easier to extend for Xe2 in a follow on commit. Verified this with dEQP-VK.subgroups.arithmetic.compute.subgroupexclusivemul_u64vec4_requiredsubgroupsize on Xe2, which otherwise would cause the program to fail to validate because it assumed a grf was 32 bytes. Backport-to: 24.2 Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28479>	2024-08-02 22:18:51 +00:00
Hyunjun Ko	78ff100a52	anv: support h265 encoding Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27810>	2024-08-02 07:15:59 +00:00
Hyunjun Ko	eefa886b01	anv/video: initial support for h265 encoding Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27810>	2024-08-02 07:15:59 +00:00
Hyunjun Ko	3bd46afac1	anv/query: consider codec when querying the encoding status. Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27810>	2024-08-02 07:15:59 +00:00
Hyunjun Ko	52f678004f	intel/decoder: Handle HCP_PAK_INSERT_OBJECT Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27810>	2024-08-02 07:15:59 +00:00
Hyunjun Ko	46e02ee861	intel/genxml: adds a value of reference pic to HCP_SURFACE_STATE Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27810>	2024-08-02 07:15:59 +00:00
Hyunjun Ko	7f280e1e93	intel/genxml: fix some length of HCP_FQM_STATE Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27810>	2024-08-02 07:15:59 +00:00
Hyunjun Ko	663f9eb740	intel/genxml: Adds more VDENC commands Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27810>	2024-08-02 07:15:59 +00:00
Hyunjun Ko	3eb69b9577	intel/genxml: fix the length of VDENC_DS_REF_SURFACE_STATE Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27810>	2024-08-02 07:15:59 +00:00
Hyunjun Ko	e79cad5af0	intel/genxml: Add missing fields for HCP_SLICE_STATE Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27810>	2024-08-02 07:15:59 +00:00
Hyunjun Ko	e28a299863	anv: enable VK_KHR_video_encode_queue and VK_KHR_video_encode_h264 Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27810>	2024-08-02 07:15:59 +00:00
Dave Airlie	3fbcd95b20	anv/video: add mode costs for h264 encoding Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27810>	2024-08-02 07:15:59 +00:00
Hyunjun Ko	3ec8f7f995	anv/video: initial support for h264 encoding Co-authored-by: Dave Airlie <airlied@redhat.com> Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27810>	2024-08-02 07:15:59 +00:00
Hyunjun Ko	f6c3e82201	anv/video: implemnt VkGetEncodedVideoSessionParametersKHR Also add a stub for VkGetPhysicalDeviceVideoEncodeQualityLevelPropertiesKHR Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27810>	2024-08-02 07:15:59 +00:00
Hyunjun Ko	f25cf314b3	anv/video: remove unnecessary macros Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27810>	2024-08-02 07:15:59 +00:00
Hyunjun Ko	a660bd9471	anv/query: handle VK_QUERY_TYPE_VIDEO_ENCODE_FEEDBACK_KHR Anv supports VK_VIDEO_ENCODE_FEEDBACK_BITSTREAM_BUFFER_OFFSET_BIT_KHR and VK_VIDEO_ENCODE_FEEDBACK_BITSTREAM_BYTES_WRITTEN_BIT_KHR. Also add to handle the VK_QUERY_RESULT_WITH_STATUS_BIT_KHR flag. Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27810>	2024-08-02 07:15:59 +00:00
Hyunjun Ko	9425ba6f2b	intel/genxml: update VDENC instructions Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27810>	2024-08-02 07:15:59 +00:00
Hyunjun Ko	b97d440bc5	intel/genxml: change the length of MFX_QM_STATE Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27810>	2024-08-02 07:15:58 +00:00
Hyunjun Ko	5057a33fe3	intel/genxml: add a missing value for MFX_SURFACE_STATE Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27810>	2024-08-02 07:15:58 +00:00
Kenneth Graunke	8bca7e520c	intel/brw: Only force g0's liveness to be the whole program if spilling We don't actually need to extend g0's live range to the EOT message generally - most messages that end a shader are headerless. The main implicit use of g0 is for constructing scratch headers. With the last two patches, we now consider scratch access that may exist in the IR and already extend the liveness appropriately. There is one remaining problem: spilling. The register allocator will create new scratch messages when spilling a register, which need to create scratch headers, which need g0. So, every new spill or fill might extend the live range of g0, which would create new interference, altering the graph. This can be problematic. However, when compiling SIMD16 or SIMD32 fragment shaders, we don't allow spilling anyway. So, why not use allow g0? Also, when trying various scheduling modes, we first try allocation without spilling. If it works, great, if not, we try a (hopefully) less aggressive schedule, and only allow spilling on the lowest-pressure schedule. So, even for regular SIMD8 shaders, we can potentially gain the use of g0 on the first few tries at scheduling+allocation. Once we try to allocate with spilling, we go back to reserving g0 for the entire program, so that we can construct scratch headers at any point. We could possibly do better here, but this is simple and reliable with some benefit. Thanks to Ian Romanick for suggesting I try this approach. fossil-db on Alchemist shows some more spill/fill improvements: Totals: Instrs: 149062395 -> 149053010 (-0.01%); split: -0.01%, +0.00% Cycles: 12609496913 -> 12611652181 (+0.02%); split: -0.45%, +0.47% Spill count: 52891 -> 52471 (-0.79%) Fill count: 101599 -> 100818 (-0.77%) Scratch Memory Size: 3292160 -> 3197952 (-2.86%) Totals from 416541 (66.59% of 625484) affected shaders: Instrs: 124058587 -> 124049202 (-0.01%); split: -0.01%, +0.01% Cycles: 3567164271 -> 3569319539 (+0.06%); split: -1.61%, +1.67% Spill count: 420 -> 0 (-inf%) Fill count: 781 -> 0 (-inf%) Scratch Memory Size: 94208 -> 0 (-inf%) Witcher 3 shows a 33% reduction in scratch memory size, for example. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30319>	2024-08-01 16:37:34 -07:00
Kenneth Graunke	4ca4b064cf	intel/brw: Record g0 as live for sends with send_ex_desc_scratch set brw_send_indirect_split_message() implicitly reads g0 to construct the extended message descriptor for certain send messages when this is set. Record that liveness explicitly. Thanks to Francisco Jerez for reminding me about this use of g0. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30319>	2024-08-01 16:37:32 -07:00
Kenneth Graunke	9200fb966c	intel/brw: Record that SHADER_OPCODE_SCRATCH_HEADER uses g0 The generator code for emitting legacy scratch headers was implicitly using g0 as a source. But the IR wasn't indicating any usage of g0, which means the liveness isn't properly tracked at the IR level. It works because we reserve g0 as permanently live for the whole program. In order to stop doing that, we need to record it properly. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30319>	2024-08-01 16:37:31 -07:00
Kenneth Graunke	545f20419f	intel/brw: Delete fs_reg_alloc::discard_interference_graph() Unused since commit `50519598ff`. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30319>	2024-08-01 16:37:28 -07:00
Juston Li	ef58f2408f	anv/android: handle R8G8B8X8 as R8G8B8A8 Fall through to common vk_ahb_format_to_image_format() to handle R8G8B8X8 as R8G8B8A8. Fixes issues with querying for format feature support when its handled as R8G8B8. Signed-off-by: Juston Li <justonli@google.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30080>	2024-08-01 17:20:18 +00:00
Lionel Landwerlin	fafa0d5abb	anv: fix check on pipeline mode to track buffer writes We want to check the current mode of the pipeline, not the queue type (since graphics can toggle between 3D & gpgpu modes). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `455a13fb7f` ("anv: limit ANV_PIPE_RENDER_TARGET_BUFFER_WRITES to blorp operations using 3D") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11607 Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30469>	2024-08-01 12:20:52 +00:00
Sushma Venkatesh Reddy	0116430d39	intel/brw: Handle 16-bit sampler return payloads API requires samplers to return 32-bit even though hardware can handle 16-bit floating point, so we detect that case and make more efficient use of memory BW. This is helping improve performance of encode and decode tokens during LLM by at least 5% across multiple platforms. Thank you Kenneth Graunke for suggesting and guiding me throughout this implementation. Signed-off-by: Sushma Venkatesh Reddy <sushma.venkatesh.reddy@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30447>	2024-07-31 21:26:46 +00:00
Sushma Venkatesh Reddy	ddd9e043dc	intel/brw: Move get_nir_def() higher to avoid UNDEF While extending our backend to handle 16-bit sampler return payloads, we found that in piglit's arb_texture_view-rendering-formats, the SIMD8 FS was missing the sampling operation altogether. This was because we were first emitting the texturing instruction, and then calling get_nir_def(), which adds an UNDEF instruction when the destination is smaller than the 32-bit. So the texturing was dead code elimated. Fix this by calling get_nir_def() earlier. Thank you to Kenneth Graunke for suggesting and guiding me throughout this implementation. Signed-off-by: Sushma Venkatesh Reddy <sushma.venkatesh.reddy@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30447>	2024-07-31 21:26:46 +00:00
Caio Oliveira	52be72e676	intel: Let compiler set indirect_ubos_use_sampler This option is used for Gfx < 12, elk already set it to true, so set it in brw and change the drivers to not set it anymore. Because the dual-compiler support in Iris, the helper function there had to change to consult the right compiler value instead. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30393>	2024-07-31 19:26:20 +00:00
Lionel Landwerlin	eebb6cd236	anv: stop using 3DSTATE_WM::ForceThreadDispatchEnable Documentation says we should leave this field to the default value (Normal). Instead we set 3DSTATE_PS_EXTRA::PixelShaderHasUAV when we see that a fragment shader has side effects. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30408>	2024-07-31 18:18:14 +00:00
Juston Li	34031e3e3b	anv/android: remove unneeded ANB implicit import flags ANB is only used by Android WSI which uses explicit sync so these flags can be dropped. Signed-off-by: Juston Li <justonli@google.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29883>	2024-07-30 09:27:28 -07:00
Jianxun Zhang	c5ee7e9bdc	anv: Disable legacy CCS setup in binding (xe2) The condition of flat ccs and vram_only checker causes different aux usage at binding stage. The current design is reusing CCS_E on Xe2, so we want both Xe2 integrated and discreted GPUs behave the same way. Xe2 shouldn't need any special setup of CCS in the loop. Backport-to: 24.2 Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30111>	2024-07-29 01:42:27 +00:00
Jianxun Zhang	e054068787	anv: Disable compression on legacy modifiers (xe2) On pre-Xe2 platforms, the compression on these modifiers that don't support compression are enabled. The compressed will be resolved when needed. On Xe2+ we haven't support explicit resolve, so all the paths to resolves are prohibited now. But the code is still doing it, causing an assertion failure: Fixes: vkcube src/intel/vulkan/anv_private.h:5467: anv_image_get_fast_clear_type_addr: Assertion `device->info->ver < 20' failed. Backport-to: 24.2 Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30111>	2024-07-29 01:42:27 +00:00
Jianxun Zhang	49c91a4ea0	anv: Fix assertion failures on BMG (xe2) Fixes: `beb0ea2469` ("anv: Disable tracking fast clear and aux state (xe2)") crucible run func.first dEQP-VK.api.copy_and_blit.core.image_to_image. all_formats.color.2d_to_2d.a1r5g5b5_unorm_pack16. r16_uint.optimal_optimal dEQP-VK.pipeline.monolithic.multisample.misc.clear_attachments. r8g8b8a8_unorm_r16g16b16a16_sfloat_r16g16b16a16_sint_d32_sfloat_ s8_uint.16x.ds_resolve_sample_zero.whole_framebuffer src/intel/vulkan/anv_private.h:5491: anv_image_get_compression_state_addr: Assertion `device->info->ver < 20' failed. Backport-to: 24.2 Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30111>	2024-07-29 01:42:26 +00:00
Ian Romanick	fdb6afe71e	intel/elk: Fix undefined left shift of negative value in elk_texture_offset Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30333>	2024-07-26 17:18:08 -07:00
Ian Romanick	f3f4a057b9	intel/elk: Fix undefined left shift of large UW value in elk_imm_uw Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30333>	2024-07-26 17:18:06 -07:00
Ian Romanick	0e5ac7d6b0	intel/elk: Fix undefined left shift of negative value in update_uip_jip v2: Add comment and assertion to explain why the shift is safe. Suggested by Caio. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30333>	2024-07-26 17:18:04 -07:00
Ian Romanick	c2dda8c8e7	intel/elk: Fix undefined shift by 64 of uint64_t in elk_compute_first_urb_slot_required Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30333>	2024-07-26 17:18:01 -07:00
Ian Romanick	e6669467b8	intel/brw: Fix undefined left shift of negative value in brw_texture_offset When -fsanitize=shift is used, many instances of the following are produced: src/intel/compiler/brw_fs_nir.cpp:114:30: runtime error: left shift of negative value -1 Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30333>	2024-07-26 17:17:59 -07:00
Ian Romanick	4f24c2707f	intel/brw: Fix undefined left shift of large UW value in brw_imm_uw When -fsanitize=shift is used, 'ninja test' would fail in several Intel assembly tests (mul.asm and and.asm) with: src/intel/compiler/brw_reg.h:703:22: runtime error: left shift of 65532 by 16 places cannot be represented in type 'int' Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30333>	2024-07-26 17:17:56 -07:00
Ian Romanick	abb7c012ff	intel/brw: Fix undefined left shift of negative value in update_uip_jip When -fsanitize=shift is used, many instances of the following are produced: src/intel/compiler/brw_eu_compact.c:2244:50: runtime error: left shift of negative value -306 v2: Add comment and assertion to explain why the shift is safe. Suggested by Caio. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30333>	2024-07-26 17:17:53 -07:00
Ian Romanick	228e049db6	intel/brw: Fix undefined shift by 64 of uint64_t in brw_compute_first_urb_slot_required When -fsanitize=shift is used, many instances of the following are produced: src/intel/compiler/brw_compiler.h:1661:44: runtime error: shift exponent 64 is too large for 64-bit type 'long long unsigned int' I think this is an actual bug. It should check the sentinel value, but the sentinel value is 64. The shift by 64 is treated as a shift by 0. The varying 0 is explicitly filtered by the rest of the if-test. How does this work? Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30333>	2024-07-26 17:17:15 -07:00
Sushma Venkatesh Reddy	455deacbce	intel/brw: Fix DEBUG_OPTIMIZER Due to recent regression, adding INTEL_DEBUG=optimizer is dumping shader optimization pass details to console rather than to respective files. Thank you, Kenneth W Graunke for helping me figure this out. Fixes: `17b7e49089` ("intel/brw: Move out of fs_visitor and rename print instructions") Signed-off-by: Sushma Venkatesh Reddy <sushma.venkatesh.reddy@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30389>	2024-07-26 22:22:58 +00:00
José Roberto de Souza	eb5a3617e2	anv: Handle internal shader compilation failure Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30380>	2024-07-26 21:58:21 +00:00
José Roberto de Souza	196b3d7b5b	anv: Improve error message when pipeline creation fails during shader compilation Due the lack of SIMD8 in Xe2 platforms we are not able to compile a shader for dEQP-VK.protected_memory.stack.stacksize_1024 that fits into scratch space. So before this patch when such failure happened it would return VK_ERROR_OUT_OF_HOST_MEMORY error. So here when available include the compiler error string to better inform what the actual failure. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30380>	2024-07-26 21:58:21 +00:00
Jianxun Zhang	349e7a2919	intel/common: Remove blank lines in intel_set_ps_dispatch_state() (xe2) Backport-to: 24.2 Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29907>	2024-07-26 21:02:24 +00:00

1 2 3 4 5 ...

12463 commits