fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-07 13:38:06 +02:00

Author	SHA1	Message	Date
Georg Lehmann	0e21cd9e15	aco/gfx10+: work around non uniform ds_append wave64 result In wave64 for hw with native wave32, ds_append seems to be split in a load for the low half and an atomic for the high half, and other LDS instructions can be scheduled between the two. Which means the result of the low half is unusable because it might be out of date. I was only able to reproduce this issue in WGP mode, but be conservative and apply the workaround in CU mode too. Foz-DB Navi31: Totals from 13 (0.02% of 79395) affected shaders: Instrs: 7599 -> 7656 (+0.75%) CodeSize: 39708 -> 39972 (+0.66%) Latency: 83174 -> 83572 (+0.48%) InvThroughput: 8271 -> 8357 (+1.04%) Copies: 718 -> 717 (-0.14%) VALU: 3689 -> 3703 (+0.38%) SALU: 935 -> 965 (+3.21%) Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11921 Fixes: `45e935800a` ("aco: implement nir_shared_append/consume_amd") Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31301>	2024-09-23 13:17:58 +00:00
Patrick Lerda	b6b363c478	iris: fix iris_ensure_indirect_generation_shader() memory leak This change ensures that all these allocations are using the same memory context. For instance, this issue is triggered with: "piglit/bin/arb_shader_image_load_store-host-mem-barrier -auto -fbo": Indirect leak of 32816 byte(s) in 1 object(s) allocated from: #0 0x7f49a35447ef in __interceptor_malloc (/usr/lib64/libasan.so.6+0xb17ef) #1 0x7f49998e4b4f in ralloc_size ../src/util/ralloc.c:118 #2 0x7f49998e7521 in create_slab ../src/util/ralloc.c:801 #3 0x7f49998e7521 in gc_alloc_size ../src/util/ralloc.c:840 #4 0x7f49998e7d11 in gc_zalloc_size ../src/util/ralloc.c:868 #5 0x7f49999a6126 in nir_alu_instr_create ../src/compiler/nir/nir.c:682 #6 0x7f49999cba48 in clone_alu ../src/compiler/nir/nir_clone.c:217 #7 0x7f49999cc85a in clone_instr ../src/compiler/nir/nir_clone.c:456 #8 0x7f49999cee3a in clone_block ../src/compiler/nir/nir_clone.c:529 #9 0x7f49999cee3a in clone_cf_list ../src/compiler/nir/nir_clone.c:583 #10 0x7f49999d03be in clone_function_impl ../src/compiler/nir/nir_clone.c:660 #11 0x7f49999d13f7 in nir_function_impl_clone ../src/compiler/nir/nir_clone.c:678 #12 0x7f4999a0e2c5 in lower_call_function_impl ../src/compiler/nir/nir_functions.c:397 #13 0x7f4999a0e2c5 in function_link_pass ../src/compiler/nir/nir_functions.c:430 #14 0x7f4999a0e2c5 in function_link_pass ../src/compiler/nir/nir_functions.c:408 #15 0x7f4999a0e2c5 in nir_function_instructions_pass ../src/compiler/nir/nir_builder.h:108 #16 0x7f4999a0e2c5 in nir_link_shader_functions ../src/compiler/nir/nir_functions.c:452 #17 0x7f499ca30b8f in link_libintel_shaders ../src/gallium/drivers/iris/iris_program_cache.c:329 #18 0x7f499ca30b8f in iris_ensure_indirect_generation_shader ../src/gallium/drivers/iris/iris_program_cache.c:374 #19 0x7f499d185267 in gfx9_emit_indirect_generate ../src/gallium/drivers/iris/iris_indirect_gen.c:593 #20 0x7f499d119c79 in iris_upload_indirect_shader_render_state ../src/gallium/drivers/iris/iris_state.c:8744 #21 0x7f499fe86b01 in iris_indirect_draw_vbo ../src/gallium/drivers/iris/iris_draw.c:233 #22 0x7f499fe86b01 in iris_draw_vbo ../src/gallium/drivers/iris/iris_draw.c:343 #23 0x7f499a174e43 in tc_call_draw_indirect ../src/gallium/auxiliary/util/u_threaded_context.c:3828 #24 0x7f499a1557fe in batch_execute ../src/gallium/auxiliary/util/u_threaded_context.c:453 #25 0x7f499a1557fe in tc_batch_execute ../src/gallium/auxiliary/util/u_threaded_context.c:504 #26 0x7f499a167f26 in _tc_sync ../src/gallium/auxiliary/util/u_threaded_context.c:761 #27 0x7f499a168888 in tc_texture_map ../src/gallium/auxiliary/util/u_threaded_context.c:2783 #28 0x7f49986f2631 in pipe_texture_map ../src/gallium/auxiliary/util/u_inlines.h:556 #29 0x7f49986f2631 in _mesa_map_renderbuffer ../src/mesa/main/renderbuffer.c:494 #30 0x7f49991af7ca in readpixels_memcpy ../src/mesa/main/readpix.c:260 #31 0x7f49991af7ca in _mesa_readpixels ../src/mesa/main/readpix.c:898 #32 0x7f499931ee23 in st_ReadPixels ../src/mesa/state_tracker/st_cb_readpixels.c:575 #33 0x7f49991b40b5 in read_pixels ../src/mesa/main/readpix.c:1199 #34 0x7f49991b40b5 in _mesa_ReadnPixelsARB ../src/mesa/main/readpix.c:1216 #35 0x7f49991b4a20 in _mesa_ReadPixels ../src/mesa/main/readpix.c:1231 ... SUMMARY: AddressSanitizer: 323648 byte(s) leaked in 201 allocation(s). Fixes: `5438b19104` ("iris: enable generated indirect draws") Signed-off-by: Patrick Lerda <patrick9876@free.fr> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31313>	2024-09-23 12:47:11 +00:00
Samuel Pitoiset	5c897d00ef	radv: fix assigning mesh shader outputs when clip/cull distances are read in FS The per-primitive output offsets need to be recomputed. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31224>	2024-09-23 12:12:13 +00:00
Patrick Lerda	6e994fdb6e	i915: fix vertex atan regression This is a regression happening with the commit `87b99d5797` ("nir: use copysign for atan"). Indeed, the opcode "copysign" was generating an incompatible i915 sequence. For instance, this issue is triggered with "deqp-gles2 --deqp-case=dEQP-GLES2.functional.shaders.operator.angle_and_trigonometry.atan2.highp_float_vertex": deqp-gles2: ../src/compiler/nir/nir_lower_int_to_float.c:239: lower_alu_instr: Assertion `nir_alu_type_get_base_type(info->output_type) != nir_type_int && nir_alu_type_get_base_type(info->output_type) != nir_type_uint' failed. Fixes: `c4cec84231` ("nir/i915g/r300/nv30: skip marking varyings as flat in some drivers") Signed-off-by: Patrick Lerda <patrick9876@free.fr> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31315>	2024-09-23 11:46:40 +00:00
Connor Abbott	dbc4a2e30b	tu: Initial support for VK_KHR_calibrated_timestamps on a750 Starting with a750, the ALWAYS_ON counter is initialized from a loadable counter in CX power domain, which is never turned off except during a GPU reset. This means that timestamps should always be monotonic except if the GPU resets, in which case subsequent submits should return DEVICE_LOST anyway. Thus it should be good enough to satisfy the Vulkan requirement that vkCmdWriteTimestamp is monotonic. kgsl tries to synchronize the CX counter to the CPU counter, and additionally adds a synchronization ioctl to improve the accuracy. I'm not sure whether the former is really useful for us, but the latter should eventually be implemented in drm/msm. However for now we can expose the extension without any kernel support. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31100>	2024-09-23 07:17:01 -04:00
Juan A. Suarez Romero	c968c5a740	v3dv/ci: add new flake New flake for rpi4. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31310>	2024-09-23 09:30:21 +00:00
Valentine Burley	1494b2143d	freedreno/ci: Document some a630 EGL flakes Not related to the kernel uprev, unknown when they started appearing. Signed-off-by: Valentine Burley <valentine.burley@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31286>	2024-09-23 08:55:37 +00:00
Valentine Burley	4b51a2c9da	turnip/ci: Remove fixed test from a660 xfails It appears this was missed due to fractional runs, but the fix for this issue has already been merged. While the test hasn't run in the full runs yet, it should now be considered fixed, just like on other GPUs. Fixes: `812c8f6abe` ("tu: Treat partially-bound depth/stencil attachments as passthrough") Signed-off-by: Valentine Burley <valentine.burley@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31286>	2024-09-23 08:55:37 +00:00
Valentine Burley	28168d0971	freedreno/ci: Update expectations after Piglit uprev The expectations for the manual runs were missed during the uprev, update them now. Fixes: `213f5e9152` ("Uprev Piglit to e9ab30aeaed97b69868cf4d6d6a3f70f3b53c362") Signed-off-by: Valentine Burley <valentine.burley@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31286>	2024-09-23 08:55:36 +00:00
Valentine Burley	5b8f27d3d7	freedreno/ci: Uprev kernel to 6.11 The new kernel brings improved stability. Signed-off-by: Valentine Burley <valentine.burley@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31286>	2024-09-23 08:55:36 +00:00
Valentine Burley	b20983f9a8	freedreno/ci: Skip timing out test on a630 KHR-GL46.texture_swizzle.functional usually takes 50+ seconds and can time out on occasion. This isn't caused by the new kernel, it always took this long. Skip it for pre-merge jobs on a630. Signed-off-by: Valentine Burley <valentine.burley@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31286>	2024-09-23 08:55:36 +00:00
Iago Toral Quiroga	68014b0d9b	broadcom/compiler: skip small immediates optimization on vpm instructions total instructions in shared programs: 11164938 -> 10890641 (-2.46%) instructions in affected programs: 6557250 -> 6282953 (-4.18%) helped: 59134 HURT: 9752 Instructions are helped. total threads in shared programs: 431068 -> 431034 (<.01%) threads in affected programs: 68 -> 34 (-50.00%) helped: 0 Threads are HURT. total uniforms in shared programs: 3880437 -> 5308006 (36.79%) uniforms in affected programs: 2669367 -> 4096936 (53.48%) helped: 2 HURT: 74046 Uniforms are HURT. total max-temps in shared programs: 2244298 -> 2226555 (-0.79%) max-temps in affected programs: 463611 -> 445868 (-3.83%) helped: 17473 HURT: 8040 Max-temps are helped. total spills in shared programs: 4312 -> 4318 (0.14%) spills in affected programs: 0 -> 6 helped: 0 HURT: 2 total fills in shared programs: 6508 -> 6514 (0.09%) fills in affected programs: 0 -> 6 helped: 0 HURT: 2 total sfu-stalls in shared programs: 14794 -> 15143 (2.36%) sfu-stalls in affected programs: 1261 -> 1610 (27.68%) helped: 238 HURT: 586 Inconclusive result (value mean confidence interval and %-change mean confidence interval disagree). total inst-and-stalls in shared programs: 11179732 -> 10905784 (-2.45%) inst-and-stalls in affected programs: 6570407 -> 6296459 (-4.17%) helped: 59126 HURT: 9786 Inst-and-stalls are helped. total nops in shared programs: 273422 -> 183945 (-32.72%) nops in affected programs: 139446 -> 49969 (-64.17%) helped: 60679 HURT: 2277 Nops are helped. Reviewed-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31259>	2024-09-23 07:45:46 +00:00
Eric R. Smith	fd11bbbb90	panfrost: print human readable versions of some swizzle fields In traces produced with PAN_MESA_DEBUG, print swizzles in human readable form (like BGRA) as well as the raw decimal format we were printing before. This is purely a convenience feature for developers. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Mary Guillemard <mary.guillemard@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31242>	2024-09-21 09:18:55 -03:00
Faith Ekstrand	1b4e100779	nvk: Add an NVK_DEBUG=gart flag Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31295>	2024-09-20 18:15:13 -05:00
Faith Ekstrand	611b0bb73d	nvk: Silence a maybe-uninitialized warning Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31295>	2024-09-20 18:15:11 -05:00
Jason Macnak	6b83d49879	gfxstream: fix log levels in descriptor handling ... that potentially were accidentally promoted to info logs in aosp/3252215 which affects common hot path. Fixes: `6f0fff4634` ("gfxstream: guest: fully mesa-ify vulkan_enc") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31288>	2024-09-20 20:33:14 +00:00
Marek Olšák	58d5847fe3	radeonsi: don't use VS/PS/CS partial flushes if we use a TS event Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31291>	2024-09-20 19:49:45 +00:00
Marek Olšák	653bcd85e0	radeonsi: remove barriers around clears using aux_context.compute_resource_init Nothing else uses that context, so all barriers are unnecessary. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31291>	2024-09-20 19:49:45 +00:00
Marek Olšák	58b512ddd6	radeonsi: execute clears at resource allocation using compute instead of gfx This adds an additional aux_context, so that the gfx queue isn't stalled due to clearing buffers or initializing DCC. This aux context will only be used by resource_create, which will allow us to remove all barriers around the clears because there are no other users of those buffers on that context. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31291>	2024-09-20 19:49:45 +00:00
Marek Olšák	c99b55092f	radeonsi: move barriers out of si_execute_clears Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31291>	2024-09-20 19:49:45 +00:00
Marek Olšák	36c368d466	radeonsi: move si_execute_clears barrier code into separate functions Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31291>	2024-09-20 19:49:45 +00:00
Marek Olšák	0112fd7d40	radeonsi/aco: fix asm dumps to debug output via radeonsi_debug_disassembly=true Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31291>	2024-09-20 19:49:45 +00:00
Marek Olšák	997c39c268	radeonsi: clean up and make corrections to si_create_fmask_expand_cs Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31291>	2024-09-20 19:49:45 +00:00
Marek Olšák	799a0a980b	radeonsi: adjust GFX12 checks in si_compute.c Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31291>	2024-09-20 19:49:45 +00:00
Marek Olšák	40d9616bd3	radeonsi: don't pad esgs_vertex_stride if it's 0 so that we don't allocate any LDS for ES->GS varyings if it's unused. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31291>	2024-09-20 19:49:44 +00:00
Marek Olšák	02e9572335	radeonsi: wait for idle after end_query in si_test_blit_perf end_query writes the timestamp only when everything is finished, so the extra barrier only adds unnecessary overhead. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31291>	2024-09-20 19:49:44 +00:00
Marek Olšák	3527d9f81d	radeonsi: remove CB sync after FMASK and DCC decompression It's not needed according to docs. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31291>	2024-09-20 19:49:44 +00:00
Marek Olšák	a9eb83a15f	radeonsi: don't sync CS and PS before rendering if there are no FBO attachments because CB/DB don't read/write anything in that case Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31291>	2024-09-20 19:49:44 +00:00
Marek Olšák	58c72e9648	radeonsi: deduplicate code emitting VGT_FLUSH/PIPELINESTAT events Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31291>	2024-09-20 19:49:44 +00:00
Marek Olšák	d6f54a0551	radeonsi: count VS/PS/CS/L2 flushes in get_reduced_barrier_flags also it no longer counts PS flushes as VS flushes, which is just for the HUD Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31291>	2024-09-20 19:49:44 +00:00
Marek Olšák	15e320e970	radeonsi: don't sync VS and PS if they are idle Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31291>	2024-09-20 19:49:44 +00:00
Marek Olšák	17e994dab1	radeonsi: check and update compute_is_busy in get_reduced_barrier_flags Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31291>	2024-09-20 19:49:44 +00:00
Mike Blumenkrantz	ac912b3754	mesa: OVR_multiview_multisampled_render_to_texture this is automatically supported for anyone that supports OVR_multiview and EXT_multisampled_render_to_texture Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31235>	2024-09-20 18:54:26 +00:00
Mike Blumenkrantz	894b37e060	mesa: fix sample count handling for MSRTT this extension specifies error checking (which was not implemented) and also sample count clamping needs to be done since the samples specified are just min samples and not an exact param cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31235>	2024-09-20 18:54:26 +00:00
Boris Brezillon	157a4dc509	panvk/csf: Fix multi-layer rendering We assumed a tiler descriptor could handle 256 layers at a time, but it's actually limited to 8 on v10, so let's adjust the code to take that into account. Fixes: `5544d39f44` ("panvk: Add a CSF backend for panvk_queue/cmd_buffer") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11882 Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Tested-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31227>	2024-09-20 18:21:50 +00:00
Boris Brezillon	dbfaf15bc1	pan/genxml: Fix layer_offset definition on v9+ The layer offset is a 9-bit signed integer, not an 8-bit unsigned. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Tested-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31227>	2024-09-20 18:21:50 +00:00
Boris Brezillon	8822f5949c	pan/desc: Add layer_offset field to pan_tiler_context::valhall Compared to Bifrost, Valhall slightly improved layered rendering in that you no longer need one IDVS job per layer. But they didn't quite unleash things, because tiler descriptors can still only deal with a limited number of layers, forcing us to emit more than one IDVS/tiler descriptor per draw call if the number of layers exceeds this limit. In order to specify the starting point, a {layer_offset,internal_layer_index} field has been added, so we need to extend pan_tiler_context to pass this information and let the common logic adjust the framebuffer internal_layer_index accordingly. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Tested-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31227>	2024-09-20 18:21:50 +00:00
Boris Brezillon	6224a1e4d1	pan/decode: Interpret CS_BRANCH instructions panvk uses loops and conditional blocks. We need to follow these conditional branches if we want to dump the right amount of jobs. Following branching has the annoying side effect of repeating instructions, so we probably want to dump the CS and jobs separately at some point, but that's good enough for now. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Tested-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31227>	2024-09-20 18:21:50 +00:00
Boris Brezillon	1e0c502a77	panfrost: Don't turn 3D/cube images into 2D arrays Instead of special-casing 3D image handling in the gallium driver, use the actual image type and extend the compiler to deal with cube/3D image coordinates. This fixes panvk without resorting to the image type casting that was in place in the gallium driver. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Tested-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31227>	2024-09-20 18:21:50 +00:00
Boris Brezillon	e171579f51	vk/meta: Make sure texel is 32-bit in build_buffer_to_image_cs() Just like fragment stores, image stores expect 32-bit values (at least that's the case of the Bifrost backend), so make sure the value passed to write_img() is always 32-bit, even when convert_texel() doesn't touch the texel because the image view format matches the buffer format. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Tested-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31227>	2024-09-20 18:21:50 +00:00
Corentin Noël	b4900cd3f8	ci: Allow to pass the PIGLIT_RUNNER_OPTIONS variable When debugging piglit job failures in CI, it is sometimes useful to pass options to the runner. Allow doing this even in the guest when using crosvm. Signed-off-by: Corentin Noël <corentin.noel@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31284>	2024-09-20 17:42:04 +00:00
Faith Ekstrand	f36e5dbe60	nvk: Advertise VK_KHR_shader_float_controls2 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31170>	2024-09-20 17:09:14 +00:00
Nanley Chery	b3882c4488	intel: Avoid no-op calls to anv_image_clear_color Whenever we execute a fast-clear due to LOAD_OP_CLEAR, we decrease the number of layers to clear by one. We then enter the slow clear function and possibly exit without clearing if the layer count is zero. Unfortunately, we've already compiled the shader for slow clears by the time we exit. Skip the slow clear function if there are no layers to clear. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31167>	2024-09-20 16:34:37 +00:00
Nanley Chery	1c7fe9ad1b	anv: Support fast clears in anv_CmdClearColorImage At least two game traces make use of this path: TWWH3 and Factorio. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31167>	2024-09-20 16:34:37 +00:00
Nanley Chery	46d58583ff	anv: Move exec_ccs_op and exec_mcs_op higher up The next patch will use them in anv_CmdClearColorImage(). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31167>	2024-09-20 16:34:37 +00:00
Nanley Chery	03286117ef	anv: Move and rename anv_can_fast_clear_color_view It's no longer specific to image views. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31167>	2024-09-20 16:34:36 +00:00
Nanley Chery	44351d67f8	anv: Change params of anv_can_fast_clear_color_view Expand the scope to more than just image views. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31167>	2024-09-20 16:34:36 +00:00
Sil Vilerino	83fdbf8772	d3d12: Plumb H264/HEVC temporal_id from pipe params Reviewed-By: Pohsiang (John) Hsu <pohhsu@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31268>	2024-09-20 15:50:42 +00:00
Sil Vilerino	1b66866275	d3d12: d3d12_video_encoder_references_manager_h264 to use FrameDecodingOrderNumber as h264Pic->slice.frame_num Fixes: `da2cbfe3bf` ("d3d12: Video Encode H264 to use direct DPB from frontend") Reviewed-By: Pohsiang (John) Hsu <pohhsu@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31268>	2024-09-20 15:50:42 +00:00
Sil Vilerino	96bf8f5a7d	d3d12: H264 Encode - Set SPS.gaps_in_frame_num_value_allowed_flag=1 when num_temporal_layers > 1 Reviewed-By: Pohsiang (John) Hsu <pohhsu@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31268>	2024-09-20 15:50:42 +00:00
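
Commits 157a4dc509 and 8822f5949c describe splitting a layered draw across several tiler descriptors, because a v10 tiler descriptor handles at most 8 layers. The arithmetic can be sketched as below. This is an illustration only, not the panvk code: `MAX_LAYERS_PER_TILER_DESC`, `struct layer_chunk`, `tiler_desc_count()` and `tiler_desc_chunk()` are hypothetical names, and only the limit of 8 comes from the commit message.

```c
#include <assert.h>
#include <stdint.h>

/* Assumption taken from the commit message: a v10 tiler descriptor
 * covers at most 8 layers. The macro name is hypothetical. */
#define MAX_LAYERS_PER_TILER_DESC 8u

/* Hypothetical helper type: the layer range one descriptor covers. */
struct layer_chunk {
   uint32_t layer_offset; /* first layer covered by this descriptor */
   uint32_t layer_count;  /* number of layers covered, 1..8 */
};

/* Number of tiler descriptors needed to cover layer_count layers
 * (ceiling division by the per-descriptor limit). */
static uint32_t
tiler_desc_count(uint32_t layer_count)
{
   return (layer_count + MAX_LAYERS_PER_TILER_DESC - 1) /
          MAX_LAYERS_PER_TILER_DESC;
}

/* Layer range covered by descriptor number idx (0-based). */
static struct layer_chunk
tiler_desc_chunk(uint32_t layer_count, uint32_t idx)
{
   assert(idx < tiler_desc_count(layer_count));

   uint32_t offset = idx * MAX_LAYERS_PER_TILER_DESC;
   uint32_t remaining = layer_count - offset;

   struct layer_chunk chunk = {
      .layer_offset = offset,
      .layer_count = remaining < MAX_LAYERS_PER_TILER_DESC
                        ? remaining
                        : MAX_LAYERS_PER_TILER_DESC,
   };
   return chunk;
}
```

Under these assumptions, a 20-layer framebuffer needs three descriptors covering layers 0-7, 8-15 and 16-19. Each chunk's `layer_offset` plays the role of the starting point that 8822f5949c says must be passed through `pan_tiler_context`.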

195646 commits