fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-04 14:08:05 +02:00

Author	SHA1	Message	Date
Thong Thai	7d3c29dc60	frontends/va: fix some coverity scan reported issues Added some checks for NULL pointer dereferencing and loop bounds. v2: Use ARRAY_SIZE instead of magic numbers (@jenatali) Signed-off-by: Thong Thai <thong.thai@amd.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23598>	2023-06-23 20:31:21 +00:00
Caio Oliveira	dc93f205c1	meson: Explicitly add "check : false" to a couple instances of run_command In both cases there's code right after the execution to check the result and give a proper message. This gets rid of meson warning ``` WARNING: You should add the boolean check kwarg to the run_command call. It currently defaults to false, but it will default to true in future releases of meson. See also: https://github.com/mesonbuild/meson/issues/9300 ``` Reviewed-by: Eric Engestrom <eric@igalia.com> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23821>	2023-06-23 18:57:31 +00:00
Rhys Perry	d3e5e04a75	amd/drm-shim: use fixed-width types Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Eric Engestrom <eric@igalia.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9221 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23725>	2023-06-23 18:35:52 +00:00
Alyssa Rosenzweig	766535c867	agx: Implement vector live range splitting The SSA killer feature is that, under an "optimal" allocator, the number of registers used (register demand) is equal to the number of registers required (register pressure, the maximum number of variables simultaneously live at any point in the program). I put "optimal" in scare quotes, because we don't need to use the exact minimum number of registers as long as we don't sacrifice thread count or introduce spilling, and using a few extra registers when possible can help coalesce moves. Details-shmetails. The problem is that, prior to this commit, our register allocator was not well-behaved in certain circumstances, and would require an arbitrarily large number of registers. In particular, since different variables have different sizes and require contiguous allocation, in large programs the register file may become fragmented, causing the RA to use arbitrarily many registers despite having lots of registers free. The solution is vector live range splitting. First, we calculate the register pressure (the minimum number of registers that it is theoretically possible to allocate successfully), and round up to the maximum number of registers we will actually use (to give some wiggle room to coalesce moves). Then, we will treat this maximum as a bound, requiring that we don't use more registers than chosen. In the event that register file fragmentation prevents us from finding a contiguous sequence of registers to allocate a variable, rather than giving up or using registers we don't have, we shuffle the register file around (defragmenting it) to make room for the new variable. That lets us use a few moves to avoid sacrificing thread count or introducing spilling, which is usually a great choice. Android GLES3.1 shader-db results are as expected: some noise / small regressions for instruction count, but a bunch of shaders with improved thread count. The massive increase in register demand may seem weird, but this is the RA doing exactly what it's supposed to: using more registers if and only if they would not hurt thread count. Notice that no programs whatsoever are hurt for thread count, which is the salient part. total instructions in shared programs: 1781473 -> 1781574 (<.01%) instructions in affected programs: 276268 -> 276369 (0.04%) helped: 1074 HURT: 463 Inconclusive result (value mean confidence interval includes 0). total bytes in shared programs: 12196640 -> 12201670 (0.04%) bytes in affected programs: 1987322 -> 1992352 (0.25%) helped: 1060 HURT: 513 Bytes are HURT. total halfregs in shared programs: 488755 -> 529651 (8.37%) halfregs in affected programs: 295651 -> 336547 (13.83%) helped: 358 HURT: 9737 Halfregs are HURT. total threads in shared programs: 18875008 -> 18885440 (0.06%) threads in affected programs: 64576 -> 75008 (16.15%) helped: 82 HURT: 0 Threads are helped. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23832>	2023-06-23 17:37:41 +00:00
Alyssa Rosenzweig	72e6b683f3	agx/lower_parallel_copy: Lower 64-bit copies To 32-bit. This way we don't get into bad situations where we need to eg swap unaligned 64-bit values or something funny like that. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23832>	2023-06-23 17:37:41 +00:00
Alyssa Rosenzweig	bfdaab6512	agx: Validate predecessor information Including the new loop header? flag. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23832>	2023-06-23 17:37:41 +00:00
Alyssa Rosenzweig	923b966775	agx: Add loop header? flag This is useful for deciding whether we need to fix up phis in RA. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23832>	2023-06-23 17:37:41 +00:00
Alyssa Rosenzweig	a2dbe6b688	agx: Recollect stored vectors at their use This is Timur's cheesy solution to split-hell.shader_test. Seems to work ok here. Before: 94 inst, 588 bytes, 165 halfregs, 1 threads, 0 loops, 0:0 spills:fills After: 63 inst, 454 bytes, 129 halfregs, 1 threads, 0 loops, 0:0 spills:fills On Android GLES3.1 shader-db, a few shaders are helped a lot: total instructions in shared programs: 1781706 -> 1781473 (-0.01%) instructions in affected programs: 4284 -> 4051 (-5.44%) helped: 16 HURT: 2 Instructions are helped. total bytes in shared programs: 12197854 -> 12196640 (<.01%) bytes in affected programs: 29526 -> 28312 (-4.11%) helped: 20 HURT: 2 Bytes are helped. total halfregs in shared programs: 489007 -> 488755 (-0.05%) halfregs in affected programs: 945 -> 693 (-26.67%) helped: 7 HURT: 0 Halfregs are helped. total threads in shared programs: 18873216 -> 18875008 (<.01%) threads in affected programs: 5376 -> 7168 (33.33%) helped: 7 HURT: 0 Threads are helped. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23832>	2023-06-23 17:37:41 +00:00
Alyssa Rosenzweig	91d98975a6	agx: Extract coordinate register size calculation It will be used for image writes too, not just reads. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23832>	2023-06-23 17:37:41 +00:00
Asahi Lina	eef7fff852	asahi: Pass through surface sample count This makes PIPE_CAP_SURFACE_SAMPLE_COUNT do something, namely, explode with lots of fireworks. We'll have to figure out what's wrong, but at least now we aren't just not trying at all. Should not break anything as long as PIPE_CAP_SURFACE_SAMPLE_COUNT is not flipped on. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23832>	2023-06-23 17:37:41 +00:00
Asahi Lina	87bbaf680a	asahi: Disable PIPE_CAP_SURFACE_SAMPLE_COUNT This never worked, disable it. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23832>	2023-06-23 17:37:41 +00:00
Asahi Lina	af895692b3	asahi: Revert "Advertise ARB_texture_barrier" This reverts commit `9e67d3f237`. We do not, in fact, implement texture barriers. Texture barriers are supposed to allow non-overlapping rendering feedback loops. We cannot support that at non-tile boundaries when texture compression is enabled without some kind of downgrade path or other special handling. Fixes Emacs corruption on X/Glamor. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23832>	2023-06-23 17:37:41 +00:00
Alyssa Rosenzweig	b5fccfa197	agx: Fix discards Switch our frontends from generating sample_mask_agx to discard_agx, and switching from legalizing sample_mask_agx to lowering discard_agx to sample_mask_agx. This is a much easier problem and is done here in a way that is simple (and inefficient) but obviously correct. This should fix corruption in Darwinia. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23832>	2023-06-23 17:37:41 +00:00
Alyssa Rosenzweig	baf67144bd	agx: Update explanation of sample_mask behaviour We discovered today that these (probably) trigger depth/stencil testing, which has significant implications for the correct/performant use. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23832>	2023-06-23 17:37:41 +00:00
Alyssa Rosenzweig	942c206cd1	nir: Add discard_agx intrinsic sample_mask_agx corresponds directly to the hardware's 2-source instruction, but it's hard to use correctly and even harder to legalize after the fact, since it's responsible for not only discard but also late depth/stencil testing. For our various high-level lowering passes, it's easier to use a one-source discard (where we don't have to worry about sample masks), which the compiler will internally lower to the two-source instruction. Introduce such an instruction. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23832>	2023-06-23 17:37:41 +00:00
Samuel Pitoiset	0f8864e047	radv: adjust alignment of the preprocess buffer with DGC The preprocess buffer is the buffer used to generate the cmdbuf. It was aligned to 256 bytes but the correct alignment is actually ac_gpu_info::ib_alignment. Otherwise, if a DGC IB is executed like a IB1, this hits an assertion in radv_amdgpu_cs_submit() because the alignment is incorrect. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23764>	2023-06-23 17:17:08 +00:00
Samuel Pitoiset	06cdf222a6	radv: only dirty the active push constant stages with DGC It's unnecessary to dirty all stages. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23761>	2023-06-23 16:56:44 +00:00
Samuel Pitoiset	3b329e195e	radv: only dirty the index type when necessary with DGC This should only be needed for non-indexed draws and it's already dirty if the DGC binds an index buffer. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23761>	2023-06-23 16:56:44 +00:00
Samuel Pitoiset	2d97cc89fb	radv/amdgpu: dump all cs with RADV_DEBUG=noibs It was only dumping the oldest. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23646>	2023-06-23 16:35:22 +00:00
Samuel Pitoiset	8af705a856	radv/amdgpu: fix dumping cs with RADV_DEBUG=noibs The ib_buffer is NULL now. Fixes: `50e6b16855` ("radv/amdgpu: Use fallback submit for queues that can't use IBs.") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23646>	2023-06-23 16:35:21 +00:00
Matt Coster	a1e2e01f62	pvr: Correctly read dynamic state setup during blend constant setup Somewhat counterintuitively, dynamic_state.set contains the bits that have been loaded from static state, i.e. those that are _not_ dynamic. Signed-off-by: Matt Coster <matt.coster@imgtec.com> Reviewed-by: Karmjit Mahil <Karmjit.Mahil@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23590>	2023-06-23 15:38:43 +00:00
Boyuan Zhang	036d3dc066	radeonsi: disable H264HIGH10 profile Issue: H.264 high 10 profile is currently not supported, but is shown as supported in vainfo. Reason: Kernel reported capabilities for video encoder/decode doesn't consider the actual profile (only using reduced profile). Solution: Use kernel reported capabilities only for basic H.264/HEVC profiles. Other profiles (e.g. 10 bits) should be checked based on HW. Fixes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9242 Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Reviewed-by: Ruijing Dong <ruijing.dong@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23824>	2023-06-23 14:11:33 +00:00
Samuel Pitoiset	ae7721d163	radv: reserve more space in CS for SQTT Otherwise, it can hit an assertion. Fixes: `7893040f80` ("radv: Add stricter space checks.") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23826>	2023-06-23 13:51:13 +00:00
Alyssa Rosenzweig	bbdbab15fc	aco: Drop NIR parallel copy handling Backends never see these instructions. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Suggested-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23831>	2023-06-23 13:25:22 +00:00
Timur Kristóf	3b21c59fc3	aco: Remove unneeded stage related info fields. Cleanup of various fields with redundant information. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23597>	2023-06-23 12:49:05 +00:00
Timur Kristóf	bc971ba2c7	aco: Use aco_shader_info::hw_stage instead of guessing. With this change, ACO is going to rely on the caller to set the HW stage and will no longer guess it from the input shaders. This will help enable compiling merged shaders separately, but that will need further changes in instruction selection. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23597>	2023-06-23 12:49:05 +00:00
Timur Kristóf	6028c146d5	radv: Set aco_shader_info::hw_stage ACO will rely on this field instead of guessing the stage internally. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23597>	2023-06-23 12:49:05 +00:00
Timur Kristóf	016370b4f9	radeonsi: Set aco_shader_info::hw_stage ACO will rely on this field instead of guessing the stage internally. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23597>	2023-06-23 12:49:04 +00:00
Timur Kristóf	0fef6b95ca	aco: Add hw_stage field to aco_shader_info. Unused in this commit, but this is going to replace the shader stage selection inside ACO after the drivers set it correctly. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23597>	2023-06-23 12:49:04 +00:00
Timur Kristóf	05928f4200	aco: Use ac_hw_stage instead of aco-specific HWStage. The new ac_hw_stage is going to be used by drivers as well. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23597>	2023-06-23 12:49:04 +00:00
Timur Kristóf	cc2307008a	ac: Add ac_hw_stage enum. This is going to be shared between RADV, RadeonSI and ACO. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23597>	2023-06-23 12:49:04 +00:00
Diederik de Haas	231fa269ea	treewide: spelling fixes Debian's lintian tool flagged some spelling issues: assumtion -> assumption unkown -> unknown memeber -> member sucess -> success perfomance -> performance Signed-off-by: Diederik de Haas <didi.debian@cknow.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23618>	2023-06-23 12:20:59 +00:00
Lionel Landwerlin	a13ac83f1b	anv: fix utrace batch allocation The introduction of a workaround adding lots of MI_NOOPs broke our computation. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `b9aa66d5d0` ("anv: disable preemption for 3DPRIMITIVE during streamout") Reviewed-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23792>	2023-06-23 11:26:27 +00:00
Danylo Piliaiev	8e729a2f57	freedreno/decode: Correctly handle chip_id gpu_id is not decodable from chip_id in general case, so we should use chip_id to search for fd_dev_info and get GPU generation from that. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23828>	2023-06-23 10:31:07 +00:00
Danylo Piliaiev	3111a70a55	freedreno,ir3: Don't call fd_dev_64b more than necessary fd_dev_64b calls fd_dev_gen which after the last commit calls fd_dev_info that may scan through all hw definitions. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23828>	2023-06-23 10:31:07 +00:00
Danylo Piliaiev	00900b76e0	freedreno: Decouple GPU gen from gpu_id/chip_id gpu_id is obsolete, chip_id doesn't encode the GPU generation. Thus we have to manually specify the GPU gen instead of inferring it. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23828>	2023-06-23 10:31:07 +00:00
Danylo Piliaiev	7a8d92e25f	freedreno/perfcntrs: Link with libfreedreno_common Header from freedreno/common is used without linking with its implementation. It worked before because all called functions were header only, which would change soon. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23828>	2023-06-23 10:31:07 +00:00
Gert Wollny	f18afc886a	ci: Upref virglrenderer Update expectation too. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23768>	2023-06-23 10:00:49 +00:00
Gert Wollny	90bc0ccf4a	virgl/ci: Drop duplicate runs CTS GL 3.2 includes all the tests of previous versions. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23768>	2023-06-23 10:00:49 +00:00
Tatsuyuki Ishi	b69a1b4153	vulkan: Migrate shader module hash to BLAKE3. Shaders are the largest thing we hash now, so they benefit from a faster hash. Change the field name from `sha1` to `hash` to avoid tying the definition to a particular algorithm. This doubles down as a precaution against callers still assuming a 20-byte hash (in which case the compilation will error out). Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22571>	2023-06-23 09:28:04 +00:00
Tatsuyuki Ishi	e5173e62d7	util/blake3: Add blake3_hash typedef. This is more ergonomic than unsigned char hash[BLAKE3_OUT_LEN]. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22571>	2023-06-23 09:28:04 +00:00
Marek Olšák	0823ab43c5	Revert "egl: return correct error for EGL_KHR_image_pixmap" This reverts commit `5db031bf3e`. It crashes X after logging in on Ubuntu 20.04. Fixes: `5db031bf3e` - egl: return correct error for EGL_KHR_image_pixmap Reviewed-by: Boyuan Zhang <Boyuan.Zhang@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23740>	2023-06-23 06:50:08 +00:00
Gert Wollny	34163e19f7	r600/sfn: Don't clear clear group flag on vec4 that comes from TEX or FETCH If we consider clearing the group flag of a vec4 register that is used as source for some instruction we have to take into account that the parent of the register element may also be part of a group in the parent instruction. In this case we must not clear the group flag. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9118 Fixes: `f3415cb26a` (r600/sfn: copy propagate register load chains) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23813>	2023-06-23 06:16:30 +00:00
Hyunjun Ko	23d4e21d83	anv/video: fix to set U/V offset correctly. Fixes: `98c58a16ef` ("anv: add initial video decode support for h264.") Closes: mesa/mesa#9227 Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23819>	2023-06-23 09:35:42 +09:00
Timothy Arceri	d336bc3926	glsl: call nir_opt_find_array_copies() when linking shader-db results IRIS (BDW): total instructions in shared programs: 17883388 -> 17859658 (-0.13%) instructions in affected programs: 48100 -> 24370 (-49.33%) helped: 6 HURT: 0 helped stats (abs) min: 1450 max: 7028 x̄: 3955.00 x̃: 3387 helped stats (rel) min: 40.31% max: 51.92% x̄: 47.07% x̃: 48.96% 95% mean confidence interval for instructions value: -6613.28 -1296.72 95% mean confidence interval for instructions %-change: -52.73% -41.40% Instructions are helped. total cycles in shared programs: 866961809 -> 863521521 (-0.40%) cycles in affected programs: 9179396 -> 5739108 (-37.48%) helped: 6 HURT: 0 helped stats (abs) min: 252584 max: 972430 x̄: 573381.33 x̃: 495130 helped stats (rel) min: 21.80% max: 48.65% x̄: 35.01% x̃: 34.58% 95% mean confidence interval for cycles value: -917157.00 -229605.67 95% mean confidence interval for cycles %-change: -47.61% -22.40% Cycles are helped. total spills in shared programs: 20417 -> 15521 (-23.98%) spills in affected programs: 6966 -> 2070 (-70.28%) helped: 6 HURT: 0 total fills in shared programs: 25151 -> 21005 (-16.48%) fills in affected programs: 4374 -> 228 (-94.79%) helped: 6 HURT: 0 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9055 Fixes: `d75a36a9ee` ("glsl: remove do_copy_propagation_elements() optimisation pass") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23737>	2023-06-23 09:10:15 +10:00
Karol Herbst	570c263ea3	nir/load_libclc: run some opt passes for everybody Cuts down serialized size from 2850288 to 1377780 bytes. Reduces clinfo with Rusticl time by 40% for debug builds. (Old data, but the point stands) Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15996>	2023-06-22 21:02:57 +00:00
Karol Herbst	3a981acf55	rusticl/device: create helper context before loading libclc Some drivers (llvmpipe) postpone some screen initialization until the first context is created. Signed-off-by: Karol Herbst <git@karolherbst.de> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15996>	2023-06-22 21:02:57 +00:00
Lina Versace	98c8d7b7cf	venus: Fix detection of push descriptor set - Fix null deref. VkPipelineLayoutCreateInfo::pSetLayouts is allowed to contain VK_NULL_HANDLE. - The loop 'break' was misplaced. Fixes crash in dEQP-VK.pipeline.pipeline_library.graphics_library.fast.0_00_11_11 after VK_EXT_graphics_pipeline_library is enabled in a later patch. Fixes: `91966f2eff` ("venus: extend lifetime of push descriptor set layout") Signed-off-by: Lina Versace <linyaa@google.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Dawn Han <dawnhan@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23810>	2023-06-22 20:37:01 +00:00
Faith Ekstrand	f278b30e94	nir/opt_if: Use block_ends_in_jump Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23782>	2023-06-22 19:55:49 +00:00
Alyssa Rosenzweig	7ddfc43fdf	nir: Remove integer and 64-bit modifiers Now that Intel and R600 both do their own modifier propagation, the only backends that still lower modifiers in NIR are: * nir-to-tgsi * lima * etnaviv * a2xx The latter 3 backends do not support integers, and certainly do not support fp64. So they don't use these. TGSI in theory supports integer negate modifiers but NTT doesn't use them, so they're unused there too. Since they're unused, we remove NIR support for integer and 64-bit modifiers, leaving only 16/32-bit float modifiers. This will reduce the scope needed for a replacement to NIR modifiers, being pursued in !23089. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23782>	2023-06-22 19:55:49 +00:00
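
The "agx: Implement vector live range splitting" entry (766535c867) in the log above describes register pressure as the maximum number of registers simultaneously live at any point in the program, rounded up to a bound the allocator then has to respect. The C sketch below illustrates only that arithmetic. It is not the code from the commit: the `live_range` struct, the function names, and the rounding granule are all hypothetical and chosen for illustration.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical live range: the variable is live at program points
 * [start, end) and occupies `size` registers while it is live. */
struct live_range {
   unsigned start;
   unsigned end;
   unsigned size;
};

/* Number of registers live at a single program point. */
static unsigned
demand_at(const struct live_range *ranges, size_t count, unsigned point)
{
   unsigned live = 0;

   for (size_t i = 0; i < count; ++i) {
      if (ranges[i].start <= point && point < ranges[i].end)
         live += ranges[i].size;
   }

   return live;
}

/* Register pressure: the maximum of demand_at() over all program points.
 * Demand only increases where some range starts, so checking the start
 * point of every range is enough to find the maximum. */
static unsigned
register_pressure(const struct live_range *ranges, size_t count)
{
   unsigned pressure = 0;

   for (size_t i = 0; i < count; ++i) {
      assert(ranges[i].start < ranges[i].end);

      unsigned live = demand_at(ranges, count, ranges[i].start);
      if (live > pressure)
         pressure = live;
   }

   return pressure;
}

/* Round the pressure up to a multiple of `granule` to obtain the bound
 * the allocator must stay within. The granule is an assumed parameter. */
static unsigned
round_up_to(unsigned value, unsigned granule)
{
   assert(granule > 0);
   return ((value + granule - 1) / granule) * granule;
}
```

Under these assumptions, a caller would compute `register_pressure()` once per shader and pass the result through `round_up_to()` to pick the bound. The defragmenting shuffle that the commit message describes is deliberately left out of this sketch.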


173355 commits