fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-22 02:40:11 +01:00

Author	SHA1	Message	Date
Boris Brezillon	b8174b21d2	panvk: Isolate CS specific bits in panvk_shader We are about to add FS specific info there, so let's make sure all the per-stage bits are part of a union and are conditionally filled/[de]serialized based on the shader type. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32540>	2025-04-10 13:17:53 +00:00
Boris Brezillon	8a16636444	panvk: Re-order things in panvk_deserialize_shader() Re-order things in panvk_deserialize_shader() to avoid declaring local variables for stuff we feed the panvk_shader with. The only exception is pan_shader_info, because we need to know the shader stage to call vk_shader_zalloc(), which if part of pan_shader_info. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32540>	2025-04-10 13:17:53 +00:00
Boris Brezillon	864ea81dcf	panvk/csf: Set invalidate_inherited_ctx only if the render pass is inherited Secondary command buffers don't necessarily inherit their render context. If we flush draws, we should only set invalidate_inherited_ctx when the render context is inherited, otherwise is messes up with the primary command buffer state. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32540>	2025-04-10 13:17:53 +00:00
Boris Brezillon	f3be0836b7	pan/bi: Pass an explicit sampleid to load_converted_output_pan Needed if we want to lower multisample input attachment loads to tile buffer loads. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32540>	2025-04-10 13:17:53 +00:00
Boris Brezillon	cdeda45282	pan/bi: Pass load_converted_output_pan target through a source This allows us to pass a dynamic render target which will be needed to support VK_KHR_dynamic_rendering_local_read. While at it, we also enable support for depth/stencil tile loads. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Mary Guillemard <mary.guillemard@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32540>	2025-04-10 13:17:53 +00:00
Boris Brezillon	2e8829f54a	pan/bi: Allow depth/stencil tile buffer access using LD_TILE LD_TILE has a .z_stencil modifier we can use to access the depth/stencil tile buffer. This will be needed for native depth/stencil input attachments support in panvk. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Mary Guillemard <mary.guillemard@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32540>	2025-04-10 13:17:53 +00:00
Boris Brezillon	08307ecf3c	panvk/jm: Don't force a preload if the previous batch didn't have draws We should only force a preload after a batch split if the batch we flush had draws, otherwise we might lose the effect of clears asked by the user. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32540>	2025-04-10 13:17:53 +00:00
Boris Brezillon	3669cc66c6	vulkan/state: Fix default input attachment map values When no input attachment location info is provided, the depth/stencil attachment are supposed to be NO_INDEX, not UNUSED, and we should also set the color_attachment_count to UNKNOWN. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32540>	2025-04-10 13:17:53 +00:00
Lionel Landwerlin	e321c438dc	anv: fix self dependency computation Some upcoming changes in the runtime will make it impossible to rely on the pipeline or runtime information to know whether a fragment shader has input attachments. Instead we gather that information at compile time and store it in our shader bind_map. At runtime we check whether the fragment shader has input attachments and whether those map to the runtime depth/stencil input attachments to set the 3DSTATE_PS_EXTRA::PixelShaderKillsPixel. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `d2f7b6d5a7` ("anv: implement VK_KHR_dynamic_rendering_local_read") Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32540>	2025-04-10 13:17:53 +00:00
Boris Brezillon	be2532fc00	vk/pass: Add input attachment location info For drivers using the render pass emulation provided by the runtime, it's important to express the mapping between depth/stencil/color attachments and input attachments using VkRenderingInputAttachmentIndexInfoKHR, otherwise those drivers have to special-case emulated render passes in their CmdBeginRendering() implementation. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32540>	2025-04-10 13:17:53 +00:00
Boris Brezillon	38e546c202	vulkan/state: Fix input attachment map state initialization/copy vk_dynamic_graphics_state_copy() is not copying the input attachment map, and color_attachment_count is not initialized in vk_dynamic_graphics_state_init_ial(). Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Acked-by: Mary Guillemard <mary.guillemard@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32540>	2025-04-10 13:17:53 +00:00
Corentin Noël	34a5f4ac7c	virgl: Use drmCloseBufferHandle instead of calling dmIoctl directly Makes the code a bit lighter. Signed-off-by: Corentin Noël <corentin.noel@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34437>	2025-04-10 12:55:51 +00:00
Corentin Noël	5144a4f56c	virgl: Close handle on resource info failure We just opened the GEM handle a few line before (or used drmPrimeFDToHandle to acquire it), on failure it is just better to close it. Signed-off-by: Corentin Noël <corentin.noel@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34437>	2025-04-10 12:55:51 +00:00
Caterina Shablia	e5bdb41200	panfrost: move the comment closer to what it's about Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34193>	2025-04-10 08:05:21 +00:00
Caterina Shablia	83383cb4b8	panfrost: require buffer_count and pushed_words to be passed to panfrost_emit_const_buf Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34193>	2025-04-10 08:05:21 +00:00
Alyssa Rosenzweig	59a3e12039	panfrost: do not push "true" UBOs Panfrost supports pushing uniforms to hardware uniform registers (RMU/FAU for Midgard/Bifrost respectively). Since OpenGL uniforms are lowered to UBO #0, it does this with a pass that pushes UBOs. That's good! The pass also pushes 'true' OpenGL UBOs, since they look the same in the backend at this point. This is where the trouble comes in: - True UBOs are allocated in GPU BOs, not CPU allocated buffers. That means it's write-combine memory, which we cannot read from efficiently (at least depending on coherency details that were never plumbed through panfrost.ko and unlikely to be replumbed now that panthor is the new hot stuff). So, pushing true UBOs reduces GPU overhead at the cost of tremendous CPU overhead. This is dubious... When I benchmarked this on MT8192 in early 2023, this pushing improved FPS in SuperTuxKart but hurt FPS in Dolphin. - True UBOs can be written on the GPU. In OpenGL, we have batch tracking infrastructure to sort this mess out in theory. What this means is that pushing UBOs requires us to flush writers AND STALL at draw-time. If this is ever hit, our performance is utterly trashed. But it gets worse. - True UBOs can be written in the same batch that reads them. For example, we could bind a buffer as a transform feedback buffer, do a draw with XFB, then rebind as a UBO and do a draw reading. This is where we collapse -- our logic will flush the writer, which is the same batch we were in the middle of enqueueing a draw to. When we try to push words, we'll crash with theatrics. This could be solved by smartening the batch tracking logic but it's not trivial by any means. So, pushing true UBOs on the CPU is broken and can hurt performance. Stop doing it! Long term, the solution will be to push on the GPU instead. This avoids all of these issues. This can be done with a compute kernel or with CSF instructions. The Vulkan driver will likely have to do this for performance, since pushing UBOs from the CPU is utterly broken in Vulkan for the above reasons. I have a branch somewhere doing this on v9 but I'm doing this on NIR time to unblock a core change that was crashing piglit due to this pile of unsoundness. Let's fix the correctness issues first, then someone can look at recovering performance later when we're not blocking unrelated work. Fixes corruption in Piglit test gles-3.0-transform-feedback-uniform-buffer-object, which writes a UBO with transform feedback. (I suspect the test still doesn't pass for the same reason it's broken on other tilers. But that's a better place to be than oodles of memory corruption.) According to CI, fixes spec@arb_uniform_buffer_object@rendering{-dsa}-offset. Cc: mesa-stable Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34193>	2025-04-10 08:05:21 +00:00
Caterina Shablia	2c75b6bb01	panfrost: update nr_uniform_buffers before dispatching XFB Currently nr_uniform_buffers will be whatever the previous draw set for its vertex shader, which is not what the XFB shader usually expects. Fixes: `c246af0d` ("panfrost: Only upload UBOs when needed") Cc: mesa-stable Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34193>	2025-04-10 08:05:21 +00:00
Caterina Shablia	6948ab727f	panfrost: don't overwrite push uniforms and sysvals UBO with user's UBO ss->info.ubo_mask includes the push+sysval UBO so if there's a user UBO bound at the same index as the push+sysval UBO, without this change we end up writing a descriptor for the user UBO at that index. Fixes: `3b3cd59f` ("panfrost: Launch transform feedback shaders") Cc: mesa-stable Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34193>	2025-04-10 08:05:21 +00:00
Alyssa Rosenzweig	f179f6952f	panfrost: invert and rename no_ubo_to_push flag only the GL driver actually wants this, neither panvk nor internal shaders do. Cc'd as a prereq to the next patch Cc: mesa-stable Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34193>	2025-04-10 08:05:21 +00:00
Samuel Pitoiset	2f00daf67a	radv: tidy up radv_emit_hw_ngg() Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34420>	2025-04-10 06:56:25 +00:00
Samuel Pitoiset	1290b38f57	radv: tidy up radv_emit_raster_state() Better isolation between configuration and emission. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34420>	2025-04-10 06:56:25 +00:00
Samuel Pitoiset	4b2d119d90	radv: reduce the number of emitted DWORDS for MSAA 8x user sample locs From 24 DWORDS to 16 DWORDS. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34420>	2025-04-10 06:56:25 +00:00
Samuel Pitoiset	c1ebf82700	radv: track redundant DB_RENDER_OVERRRIDE register writes on GFX12 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34420>	2025-04-10 06:56:25 +00:00
Samuel Pitoiset	7f5727b313	radv: use consecutive registers for PA_SC_WINDOW_SCISSOR_{TL,BR} For less DWORDS. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34420>	2025-04-10 06:56:25 +00:00
Samuel Pitoiset	32ea7df586	radv: move emitting more fb registers when rendering begins No need to delay the emission of these registers. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34420>	2025-04-10 06:56:25 +00:00
Samuel Pitoiset	001fa1cf11	radv: move the disable_trunc_coord drirc at instance/pdev level It no longer relies on enabled device features. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34425>	2025-04-10 06:36:09 +00:00
Samuel Pitoiset	65d717b45a	radv: remove an old workaround for D3D9 with DXVK 2.3.0 and older Proton 8.x+ uses this DXVK version but Proton 9.x+ is the default now. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34425>	2025-04-10 06:36:09 +00:00
Dave Airlie	0df8c4f20c	nvk: add hopper support to nv_push_dump Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34442>	2025-04-10 00:37:45 +00:00
Dave Airlie	725e84974e	nvk: add ADA compute class to nv_push_dump now we have the classes use them in the push dump. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34442>	2025-04-10 00:37:45 +00:00
Faith Ekstrand	a441bd3c7e	nvk: Call vk_device_finish() last in nvk_DestroyDevice() We initialize it first so we should destroy it last. Otherwise, things we call in our own destroy code might reference a dead vk_device and blow up. Fixes: `d2ea532528` ("vulkan: Use syncobj shim") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34444>	2025-04-10 00:18:47 +00:00
Patrick Lerda	f0cfc1bbdc	i915: fix draw_create_fragment_shader() related memory leak For instance, this issue is triggered with "piglit/bin/fcc-blit-between-clears -auto -fbo": Direct leak of 16400 byte(s) in 5 object(s) allocated from: #0 0xb720689a in __interceptor_calloc (/usr/lib/libasan.so.6+0xb289a) #1 0xaf10f896 in draw_create_fragment_shader ../src/gallium/auxiliary/draw/draw_fs.c:47 #2 0xaef64619 in i915_create_fs_state ../src/gallium/drivers/i915/i915_state.c:550 #3 0xae16a955 in ureg_create_shader ../src/gallium/auxiliary/tgsi/tgsi_ureg.c:2194 #4 0xae17f45f in ureg_create_shader_with_so_and_destroy ../src/gallium/auxiliary/tgsi/tgsi_ureg.h:150 #5 0xae17f45f in ureg_create_shader_and_destroy ../src/gallium/auxiliary/tgsi/tgsi_ureg.h:159 #6 0xae17f45f in util_make_fs_blit_zs ../src/gallium/auxiliary/util/u_simple_shaders.c:365 #7 0xaf13300e in blitter_get_fs_texfetch_depth ../src/gallium/auxiliary/util/u_blitter.c:1157 #8 0xaf13300e in util_blitter_cache_all_shaders ../src/gallium/auxiliary/util/u_blitter.c:1322 #9 0xaef6b738 in i915_create_context ../src/gallium/drivers/i915/i915_context.c:233 #10 0xacb33c49 in st_api_create_context ../src/mesa/state_tracker/st_manager.c:986 #11 0xac845740 in dri_create_context ../src/gallium/frontends/dri/dri_context.c:178 #12 0xac854d97 in driCreateContextAttribs ../src/gallium/frontends/dri/dri_util.c:631 #13 0xb6ce79a3 in dri2_create_context_attribs ../src/glx/dri2_glx.c:240 #14 0xb6c9606f in dri_common_create_context ../src/glx/dri_common.c:665 #15 0xb6ca4f00 in CreateContext ../src/glx/glxcmds.c:322 #16 0xb6ca5c0b in glXCreateNewContext ../src/glx/glxcmds.c:1449 Fixes: `1a69b50b3b` ("i915g: Fix point sprites.") Signed-off-by: Patrick Lerda <patrick9876@free.fr> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27570>	2025-04-09 23:54:52 +00:00
Patrick Lerda	5af5f508b1	i915: fix nir_to_tgsi() related memory leak For instance, this issue is triggered with "piglit/bin/glx-multithread-texture -auto -fbo": Direct leak of 256 byte(s) in 1 object(s) allocated from: #0 0xb71eda62 in __interceptor_realloc (/usr/lib/libasan.so.6+0xb2a62) #1 0xadd5a32f in tokens_expand ../src/gallium/auxiliary/tgsi/tgsi_ureg.c:239 #2 0xadd5a32f in get_tokens ../src/gallium/auxiliary/tgsi/tgsi_ureg.c:262 #3 0xadd62519 in copy_instructions ../src/gallium/auxiliary/tgsi/tgsi_ureg.c:2079 #4 0xadd62519 in ureg_finalize ../src/gallium/auxiliary/tgsi/tgsi_ureg.c:2129 #5 0xadd64bde in ureg_get_tokens ../src/gallium/auxiliary/tgsi/tgsi_ureg.c:2206 #6 0xade377d0 in nir_to_tgsi_options ../src/gallium/auxiliary/nir/nir_to_tgsi.c:4043 #7 0xade3da63 in nir_to_tgsi ../src/gallium/auxiliary/nir/nir_to_tgsi.c:3831 #8 0xaeb606c9 in i915_create_vs_state ../src/gallium/drivers/i915/i915_state.c:662 #9 0xac781a2c in st_create_common_variant ../src/mesa/state_tracker/st_program.c:720 #10 0xac78e8a4 in st_get_common_variant ../src/mesa/state_tracker/st_program.c:773 #11 0xac78fc10 in st_precompile_shader_variant ../src/mesa/state_tracker/st_program.c:1259 #12 0xac78fc10 in st_finalize_program ../src/mesa/state_tracker/st_program.c:1345 #13 0xac790b1a in st_program_string_notify ../src/mesa/state_tracker/st_program.c:1378 #14 0xace457a9 in _mesa_get_fixed_func_vertex_program ../src/mesa/main/ffvertex_prog.c:1397 #15 0xac5ef8db in update_program ../src/mesa/main/state.c:281 #16 0xac5f0ece in _mesa_update_state_locked ../src/mesa/main/state.c:560 #17 0xac5f1653 in _mesa_update_state ../src/mesa/main/state.c:593 #18 0xacdf9fe2 in _mesa_DrawArrays ../src/mesa/main/draw.c:1403 Fixes: `487a493325` ("i915g: Add support for per-vertex point size.") Signed-off-by: Patrick Lerda <patrick9876@free.fr> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27570>	2025-04-09 23:54:52 +00:00
Patrick Lerda	92802ea90a	i915: fix slab_create() related memory leaks For instance, this issue is triggered with "piglit/bin/fcc-blit-between-clears -auto -fbo": Direct leak of 836 byte(s) in 1 object(s) allocated from: #0 0xb71eb6f2 in malloc (/usr/lib/libasan.so.6+0xb26f2) #1 0xaefadc78 in slab_add_new_page ../src/util/slab.c:179 #2 0xaefadc78 in slab_alloc ../src/util/slab.c:221 #3 0xaef7d461 in i915_texture_transfer_map ../src/gallium/drivers/i915/i915_resource_texture.c:789 #4 0xac9e931e in pipe_texture_map ../src/gallium/auxiliary/util/u_inlines.h:555 #5 0xac9e931e in _mesa_map_renderbuffer ../src/mesa/main/renderbuffer.c:494 #6 0xad49c5e4 in readpixels_memcpy ../src/mesa/main/readpix.c:260 #7 0xad49c5e4 in _mesa_readpixels ../src/mesa/main/readpix.c:898 #8 0xad5d8cfe in st_ReadPixels ../src/mesa/state_tracker/st_cb_readpixels.c:568 #9 0xad4a0caf in read_pixels ../src/mesa/main/readpix.c:1199 #10 0xad4a0caf in _mesa_ReadnPixelsARB ../src/mesa/main/readpix.c:1216 #11 0xad4a155b in _mesa_ReadPixels ../src/mesa/main/readpix.c:1231 or "piglit/bin/fcc-read-to-pbo-after-clear -auto": Direct leak of 772 byte(s) in 1 object(s) allocated from: #0 0xb726b6f2 in malloc (/usr/lib/libasan.so.6+0xb26f2) #1 0xaf0adc88 in slab_add_new_page ../src/util/slab.c:179 #2 0xaf0adc88 in slab_alloc ../src/util/slab.c:221 #3 0xaf07aad7 in i915_buffer_transfer_map ../src/gallium/drivers/i915/i915_resource_buffer.c:75 #4 0xad10de74 in pipe_buffer_map_range ../src/gallium/auxiliary/util/u_inlines.h:398 #5 0xad10de74 in _mesa_bufferobj_map_range ../src/mesa/main/bufferobj.c:499 #6 0xad5677ce in _mesa_map_pbo_dest ../src/mesa/main/pbo.c:308 #7 0xad59be3b in _mesa_readpixels ../src/mesa/main/readpix.c:894 #8 0xad6d8cfe in st_ReadPixels ../src/mesa/state_tracker/st_cb_readpixels.c:568 #9 0xad5a0caf in read_pixels ../src/mesa/main/readpix.c:1199 #10 0xad5a0caf in _mesa_ReadnPixelsARB ../src/mesa/main/readpix.c:1216 #11 0xad5a155b in _mesa_ReadPixels ../src/mesa/main/readpix.c:1231 Fixes: `e7a73b75a0` ("gallium: switch drivers to the slab allocator in src/util") Signed-off-by: Patrick Lerda <patrick9876@free.fr> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27570>	2025-04-09 23:54:52 +00:00
Paulo Zanoni	fdbdfaed01	anv: add ANV_SYS_MEM_LIMIT for debugging system memory restrictions If you suspect a workload is failing because it needs more memory, you can set ANV_SYS_MEM_LIMIT=100 to give it all the memory available. This could make, for example, certain games start working (it really depends on how much RAM you have and how much the game wants). If you suspect a workload is too resource hungry, you can try to limit it with ANV_SYS_MEM_LIMIT=30 (or some other value) to see if it can deal with the more restricted environment and behave accordingly. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28513>	2025-04-09 22:48:18 +00:00
Paulo Zanoni	ec4b2ce664	anv: restore the old behavior of up to 75% of RAM for the system heap "We paid for sixteen gigs of RAM, so we gonna use the whole damn sixteen gigs of RAM!" - My Mom First, some history: The Anv 50%-or-75% rule was originally added in 2017 by `060a6434ec` ("anv: Advertise larger heap sizes"). When i915.ko started reporting memory sizes in its ioctls, it didn't impose any restrictions: 100% of SRAM was reported as available, so the restriction was in Mesa. When xe.ko was introduced, it only reported 50% of the SRAM as available through its ioctls, so commit `b571ae6e7a` ("intel: Make memory heaps consistent between KMDs") adapted the code to not take an extra 25% of the 50% that was already cut, and restricted i915.ko to 50% instead of the 50%-or-75%. In Kernel commit d2d5f6d57884 ("drm/xe: Increase the XE_PL_TT watermark"), xe.ko changed to reporting 100% of SRAM through its ioctls, so we adapted Mesa to do the right thing depending on which Kernel version was running. While this was all happening, we were discussing about which behavior was actually the best: restrict everything to 50% in order to avoid issues when many things are running in parallel, or keep the restriction only at 75% in order to allow high demanding workloads to make full use of the hardware. The way I see, if parallel applications are causing the system to run out of resources, the user always has the option to kill applications and use one thing at a time. On the other hand, if a single application needs more than 50% of the SRAM and we don't allow it in our heaps, the application will never work (unless, of course, the user patches Mesa). So in this commit we go back to allowing high-demanding applications to work by restoring the 50%-or-75% rule. This commit is especially useful in systems with integrated graphics, like LNL, where the option to upgrade RAM is not present. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28513>	2025-04-09 22:48:18 +00:00
Paulo Zanoni	02e896bc49	anv/xe: detect the newer xe.ko memory reporting model and act accordingly Kernel commit d2d5f6d57884 ("drm/xe: Increase the XE_PL_TT watermark") changed how xe.ko reportes memory: its ioctls now report 100% of the system RAM as available. Since our policy is to report 50% of the SRAM as available for the heaps, add some code to check the amount reported by xe.ko against the amount reported by the system, then act accordingly. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28513>	2025-04-09 22:48:18 +00:00
Paulo Zanoni	3db8931d4a	intel/i915: restrict the RAM size restrictions to Anv Before commit `b571ae6e7a` ("intel: Make memory heaps consistent between KMDs"), we had the following policy for reporting Sytem RAM memory sizes: - For OpenGL, we reported the total available RAM. - For Vulkan, we reported the total available RAM as: - 50% of the total RAM if the total RAM was <= 4GB, - 75% otherwise - In addition, the Memory Budget (for VK_EXT_memory_budget) is 90% of the "free" memory, which can be an extra 10% off of the 50% or 75%. When xe.ko was added, one key difference was noted: while i915.ko reported the "real" RAM memory sizes in its ioctls, xe.ko reported only 50% of the system RAM as available. Because of that (and other reasons, see this discussion on MR 28513), commit `b571ae6e7a` decided to unify the behavior by changing the Anv i915.ko rule to "always 50%" instead of "50% or 75%". This also changed the Iris rule to 50% instead of 100%. In my research, I couldn't find any reason why this restriction should also apply to Iris, so here we revert back to handling these size restrictions on Anv only. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28513>	2025-04-09 22:48:18 +00:00
Ian Romanick	cb69d019cf	brw/nir: Use offset() for all uses of offs in emit_pixel_interpolater_alu_at_offset This is necessary to appropriately uniformize the first component access of a convergent vector. Without this, this is produced: load_payload(16) %18:D, 0d, 0d NoMask group0 add(32) %21:F, %18+0.0:F, 0.5f add(32) %22:F, %18+2.0<0>:F, 0.5f This is the correct code: load_payload(16) %18:D, 0d, 0d NoMask group0 add(32) %21:F, %18+0.0<0>:F, 0.5f add(32) %22:F, %18+2.0<0>:F, 0.5f Without `38b58e286f`, the code generated was more incorrect, but happened to work for this test case: load_payload(16) %18:D, 0d, 0d NoMask group0 add(32) %21:F, %18+0.0<0>:F, 0.5f add(32) %22:F, %18+0.4<0>:F, 0.5f Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `38b58e286f` ("brw/nir: Fix source handling of nir_intrinsic_load_barycentric_at_offset") Closes: #12969 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34427>	2025-04-09 22:21:18 +00:00
Dave Airlie	b5d1b0d7e3	nvk: update nvidia class header files. This adds Ada/Hopper compute headers. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34440>	2025-04-09 22:03:18 +00:00
Caleb Callaway	64b5ee3001	intel/tools: fix 32b build for EU stall tool Fixes: `610ad8d3` ("intel/tools: create intel_monitor for sampling eu stalls") Reviewed-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34439>	2025-04-09 21:40:46 +00:00
Benjamin Lee	fdf43f9152	panfrost: add core mask driconf options Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details These options are equivalent to the corresponding panvk options. Signed-off-by: Benjamin Lee <benjamin.lee@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34374>	2025-04-09 21:09:20 +00:00
Benjamin Lee	e8cc44f4bd	panvk: add core mask driconf options These can be used to pin an application to specific cores. A tiler mask option is not included because there is no current hardware that includes more than one tiler. Signed-off-by: Benjamin Lee <benjamin.lee@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34374>	2025-04-09 21:09:20 +00:00
Benjamin Lee	d46e76be02	driconf: add uint64 type This is needed for panvk, where we want to expose uint64 core masks. The previous int parsing logic was technically UB rather than guaranteed truncate-on-overflow, but was likely compiled to truncate in practice. It is very unlikely that anyone was relying on this. Signed-off-by: Benjamin Lee <benjamin.lee@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34374>	2025-04-09 21:09:20 +00:00
Alyssa Rosenzweig	141f0ef4e4	asahi: fix i/a queries with tess fixes new CTS additions to KHR-GL46.pipeline_statistics_query_tests_ARB.functional_tess_queries Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33984>	2025-04-09 20:25:50 +00:00
Alyssa Rosenzweig	27d2bd5925	gallium: wire up asahi driver Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Eric Engestrom <eric@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33984>	2025-04-09 20:25:50 +00:00
Alyssa Rosenzweig	1a76310fb7	asahi: remove unstable uapi header now unused. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Eric Engestrom <eric@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33984>	2025-04-09 20:25:50 +00:00
Alyssa Rosenzweig	c64a2bbff5	asahi: port to stable uAPI Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Eric Engestrom <eric@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33984>	2025-04-09 20:25:49 +00:00
Alyssa Rosenzweig	3e110005a6	asahi: disable virtio gpu for upstream until the wire protocol is stable. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Eric Engestrom <eric@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33984>	2025-04-09 20:25:49 +00:00
Caio Oliveira	7457c4ecfd	brw: Make brw_range use half-open ranges Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34253>	2025-04-09 19:06:49 +00:00
Caio Oliveira	6509f8139d	brw: Use brw_range::last() to explicit get the last valid IP This is a preparation to change what is stored in brw_range::end. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34253>	2025-04-09 19:06:49 +00:00

... 22 23 24 25 26 ...

190082 commits