fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-23 04:18:14 +02:00

Author	SHA1	Message	Date
Timothy Arceri	8317a37ea7	glsl: implement nir version of lower discard flow Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28005>	2024-03-07 04:02:45 +00:00
Teng, Jin Chung	ef45417690	d3d12: HEVC Encode - Query slice config mode based on user slice setting Queries D3D12_FEATURE_VIDEO_ENCODER_SUPPORT1 for HEVC setting D3D12_FEATURE_DATA_VIDEO_ENCODER_SUPPORT1.SubregionFrameEncoding as D3D12_VIDEO_ENCODER_FRAME_SUBREGION_LAYOUT_MODE_UNIFORM_PARTITIONING_SUBREGIONS_PER_FRAME or D3D12_VIDEO_ENCODER_FRAME_SUBREGION_LAYOUT_MODE_FULL_FRAME depending on the frontend number of slices requested. Doing this avoids d3d12_video_encoder_config_dirty_flag_slices from being set on every frame otherwise, triggering a reconstruction of the encoder objects on every frame on some platforms. Signed-off-by: Teng, Jin Chung <jin.chung.teng@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28007>	2024-03-07 03:43:10 +00:00
Jesse Natalie	cda6877cb6	nir_lower_tex_shadow: For old-style shadows, use vec4(result, 0, 0, 1) If the app requests a swizzle on the shadow sampler which doesn't just return the red channel or literal 0s/1s, we'll crash attempting to build the result vector. Use something that's probably valid. Cc: mesa-stable Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28001>	2024-03-07 01:15:46 +00:00
Mike Blumenkrantz	4b7bf9a6db	zink: update nvk baseline Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28026>	2024-03-07 00:21:05 +00:00
Sil Vilerino	2074da0c39	d3d12: Refactor graphics functions from context and blit to separate files Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27997>	2024-03-06 23:06:59 +00:00
Sil Vilerino	55e377e965	d3d12: Add partial media, compute, graphics support with CORE and GENERIC feature levels Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27997>	2024-03-06 23:06:59 +00:00
Sil Vilerino	0cd023bf6a	frontend/va: Use get_resources in VaDeriveImage for media only devices without get_surfaces support Reviewed-by: Thong Thai <thong.thai@amd.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27997>	2024-03-06 23:06:59 +00:00
Sil Vilerino	bf6a415841	frontend/va: Support media only post proc without compositor using shaders or surfaces Reviewed-by: Thong Thai <thong.thai@amd.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27997>	2024-03-06 23:06:59 +00:00
Lionel Landwerlin	0b6a2c24d6	anv: don't copy the null descriptor from the GPU memory Performance regression with vkd3d-proton. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `9506d3f338` ("anv: implement data write entry points for EXT_descriptor_buffer") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Tested-by: Felix DeGrood felix.j.degrood@intel.com Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28010>	2024-03-06 22:45:13 +00:00
Faith Ekstrand	d20b547e8e	nvk: Report official GPU names from NVIDIA when we have them Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28024>	2024-03-06 22:28:22 +00:00
Faith Ekstrand	1069b216ac	nouveau: Import g_nv_name_released.h from NVIDIA OGK Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28024>	2024-03-06 22:28:22 +00:00
Sil Vilerino	43b857a015	d3d12: HEVC encode - Update CQP using current frame type as per VA frontend change Fixes: `8c9445896f` ("frontends/va: Separate QP for I/P/B frames") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28018>	2024-03-06 22:09:45 +00:00
Sil Vilerino	f8274eea76	d3d12: H264 encode - Update CQP using current frame type as per VA frontend change Fixes: `8c9445896f` ("frontends/va: Separate QP for I/P/B frames") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28018>	2024-03-06 22:09:45 +00:00
Sil Vilerino	e3e593d721	d3d12: AV1 encode - Configure CQP using qp and new qp_inter parameters Fixes: `8c9445896f` ("frontends/va: Separate QP for I/P/B frames") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28018>	2024-03-06 22:09:45 +00:00
Vasily Khoruzhick	4762d03391	lima: update expected CI failures Backport-to: 23.3 Backport-to: 24.0 Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24855>	2024-03-06 21:43:43 +00:00
Vasily Khoruzhick	feccf4121b	lima: gpir: abort compilation if load_uniform intrinsic src isn't const GP supports indirect indexing of uniforms, but it's never been implemented in GPIR, so just abort compilation instead of crashing an app with an assertion failure. Backport-to: 23.3 Backport-to: 24.0 Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24855>	2024-03-06 21:43:43 +00:00
Vasily Khoruzhick	6998c48f77	lima: ppir: use dummy program if FS has empty body As per spec, any colors, or color components, associated with a fragment that are not written by the fragment shader are undefined. So we might as well just write vec4(1.0) to output, since HW doesn't allow us to have an empty FS. Backport-to: 23.3 Backport-to: 24.0 Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24855>	2024-03-06 21:43:43 +00:00
Vasily Khoruzhick	b999e41250	lima: ppir: always use vec4 for output register gl_FragDepth is a float, but the hardware still uses a vec4 register, .x component for depth and another component for stencil, so we have to always allocate a vec4 for output. Backport-to: 23.3 Backport-to: 24.0 Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24855>	2024-03-06 21:43:43 +00:00
Felix DeGrood	f6c908293e	iris: Increase target batch size to 128 KB Doubling batch size speeds up GFXBench Manhattan +0.5% by reducing batches / frame from 3 -> 2. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28002>	2024-03-06 21:22:17 +00:00
Danylo Piliaiev	a76fcebfc0	tu: Fix dynamic state not always being emitted We precompile static state and count it as dynamic, so we have to manually clear the bitset that tells which dynamic state is set, in order to make sure that future dynamic state will be emitted. The issue is that the framework remembers only a past REAL dynamic state and compares a new dynamic state against it, and not against our static state masqueraded as dynamic. Example: - Set dynamic state S with value A - Bind pipeline with dynamic state S - Draw - Bind pipeline with static state S with value B - Draw - Set dynamic state S with value A - Bind pipeline with dynamic state S - Draw Previously, at the last draw the dynamic state S was not dirty and the current dynamic state was equal to the past dynamic state, so it was not emitted, while the GPU used value B from the static pipeline. This fix, at the point of static pipeline binding, clears the bitset which tells that dynamic state S was previously set. This forces the next dynamic state to be re-emitted. Fixes broken rendering in Arma 3, and probably some other games running through DXVK. Fixes: `97da0a7734` ("tu: Rewrite to use common Vulkan dynamic state") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27961>	2024-03-06 20:57:35 +00:00
Lionel Landwerlin	6823ffe70e	anv: try to keep the pipeline in GPGPU mode when buffer transfer ops To avoid ping-ponging between 3D & GPGPU in the following sequence : vkCmdDispatch(...) vkCmdCopyBuffer(...) vkCmdDispatch(...) We can try to keep the pipeline in GPGPU mode when doing blorp buffer operations (we have blorp support for the CCS and can use the same shaders on RCS). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27956>	2024-03-06 20:33:12 +00:00
Lionel Landwerlin	194afe8416	anv/iris/blorp: use the right MOCS values for each engine There are multiple problems currently : - blorp blitter commands overwrite the protection value coming from the driver - anv & iris are using render target MOCS for compute commands Drivers already have the ability to pass the MOCS values so we choose to stick to that in this change. But now the driver needs to select the right MOCS depending on the engine the commands are going to run on. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27956>	2024-03-06 20:33:12 +00:00
Lionel Landwerlin	c40f14bb31	anv: fix incorrect ISL usage in buffer view creation We need to use the usage parameter. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `64f20cec28` ("anv: prepare image/buffer views for non indirect descriptors") Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27956>	2024-03-06 20:33:12 +00:00
Faith Ekstrand	33bf7ca710	nvk: Return os_page_size for minMemoryMapAlignment Fixes: `8017ac0e79` ("nvk: add some limits/features from binary driver.") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28019>	2024-03-06 19:21:20 +00:00
Gert Wollny	1882527f78	zink: decrease aggressiveness of increasing descriptor data space adaptive An increase by factor 10 with each re-allocation is a bit aggressive and we hit the available limit easily on lavapipe. By starting off with a larger initial scale, but decreasing this over time, this error can be avoided. Specifically with "spec@arb_shader_texture_lod@execution@tex-miplevel-selection *gradarb 1d" originally the buffer sizes would be 250, 2500, 25000, and 250000, with the patch it's 250, 4000, and 32000. v2: use minimum scale of 4 instead of 2 (Mike) v3: fix typo (Mike) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27977>	2024-03-06 18:50:46 +00:00
Gert Wollny	8e239dda41	zink: use only ZINK_BIND_DESCRIPTOR ZINK_BIND_RESOURCE_DESCRIPTOR and ZINK_BIND_SAMPLER_DESCRIPTOR are always used together, so that we can replace these two values with ZINK_BIND_DESCRIPTOR and use only one bit to represent the value. With that we can also remove the aliasing of ZINK_BIND_DESCRIPTOR with PIPE_BIND_CONST_BW. Fixes: `13c6ad0038` zink: use a single descriptor buffer for all non-bindless types Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28016>	2024-03-06 18:15:21 +00:00
Konstantin Seurer	b55580cab8	lavapipe/ci: Document ray query failures This is the same issue as RADV+emulate_rt has. (Except the jit timeout of course) Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25616>	2024-03-06 16:34:26 +00:00
Konstantin Seurer	c2646c6bbc	lavapipe: Advertise VK_KHR_ray_query Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25616>	2024-03-06 16:34:26 +00:00
Konstantin Seurer	32e86e1bff	lavapipe: Advertise VK_KHR_acceleration_structure Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25616>	2024-03-06 16:34:26 +00:00
Konstantin Seurer	09bf35e3c4	lavapipe: Advertise VK_KHR_deferred_host_operations Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25616>	2024-03-06 16:34:26 +00:00
Konstantin Seurer	ed6c0a7443	lavapipe: Implement VK_KHR_ray_query Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25616>	2024-03-06 16:34:26 +00:00
Konstantin Seurer	b69ae8b355	lavapipe: Add ray traversal code Basically the software implementation in radv_rt_common without traversal stack. Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25616>	2024-03-06 16:34:26 +00:00
Konstantin Seurer	897ccbd180	lavapipe: Implement VK_KHR_acceleration_structure Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25616>	2024-03-06 16:34:25 +00:00
Konstantin Seurer	ff09e95080	vulkan/cmd_queue: Implement CmdBuildAccelerationStructuresKHR This is needed for copying the arguments properly. Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25616>	2024-03-06 16:34:25 +00:00
Amber	48da361eb7	tu: wideLines support for a7xx. Passes dEQP-VK.clipping.clip_volume.clipped.wide_lines_* Signed-off-by: Amber Harmonia <amber@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27775>	2024-03-06 16:01:09 +00:00
Rhys Perry	beb07fafba	nir/search: fix nir_replace_instr() debug code Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27335>	2024-03-06 15:23:18 +00:00
Rhys Perry	a93bd52f4f	nir/lower_int64: allow 64-bit comparisons when lowering minmax RADV doesn't need these to be lowered. fossil-db (navi31): Totals from 1 (0.00% of 79242) affected shaders: Instrs: 28 -> 26 (-7.14%) CodeSize: 140 -> 128 (-8.57%) Latency: 605 -> 604 (-0.17%) Copies: 5 -> 6 (+20.00%) VALU: 14 -> 13 (-7.14%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27335>	2024-03-06 15:23:18 +00:00
Rhys Perry	b37804c8de	nir/algebraic: optimize 64-bit comparisons with zero'd halves to 32-bit These expect nir_lower_int64 to replace u2u64 to pack_64_2x32_split(, 0). fossil-db (navi31): Totals from 149 (0.19% of 79242) affected shaders: Instrs: 433095 -> 431830 (-0.29%); split: -0.29%, +0.00% CodeSize: 2165980 -> 2160284 (-0.26%); split: -0.27%, +0.00% SpillSGPRs: 689 -> 688 (-0.15%) Latency: 3801497 -> 3799901 (-0.04%); split: -0.05%, +0.01% InvThroughput: 1547916 -> 1546567 (-0.09%); split: -0.09%, +0.01% VClause: 4698 -> 4693 (-0.11%) SClause: 9981 -> 9977 (-0.04%); split: -0.05%, +0.01% Copies: 66148 -> 65431 (-1.08%); split: -1.09%, +0.01% PreSGPRs: 6732 -> 6729 (-0.04%) PreVGPRs: 7976 -> 7945 (-0.39%) VALU: 252936 -> 252336 (-0.24%) SALU: 51794 -> 51274 (-1.00%); split: -1.03%, +0.02% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27335>	2024-03-06 15:23:18 +00:00
Rhys Perry	417eb390c6	nir/algebraic: remove duplicated iand(ien, ine)/ior(ieq, ieq) patterns These don't seem useful, since they're already done in the early optimizations. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27335>	2024-03-06 15:23:18 +00:00
Rhys Perry	6952bb359c	nir/algebraic: don't create 64-bit min/max/ior if lowered fossil-db (navi31): Totals from 58 (0.07% of 79242) affected shaders: Instrs: 11692 -> 11304 (-3.32%) CodeSize: 65836 -> 62412 (-5.20%) VGPRs: 1320 -> 1344 (+1.82%) Latency: 51712 -> 50234 (-2.86%) InvThroughput: 10190 -> 10160 (-0.29%) Copies: 460 -> 688 (+49.57%) VALU: 6130 -> 5897 (-3.80%) SALU: 1231 -> 1284 (+4.31%); split: -0.32%, +4.63% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27335>	2024-03-06 15:23:18 +00:00
Daniel Schürmann	61854009f3	aco: rematerialize constants in every basic block during optimizer Totals from 16837 (21.25% of 79242) affected shaders: (GFX11) MaxWaves: 441634 -> 444546 (+0.66%); split: +0.66%, -0.00% Instrs: 25908303 -> 25838469 (-0.27%); split: -0.36%, +0.09% CodeSize: 133943168 -> 135446948 (+1.12%); split: -0.04%, +1.16% VGPRs: 985332 -> 977440 (-0.80%); split: -0.83%, +0.03% SpillSGPRs: 9133 -> 7535 (-17.50%); split: -17.74%, +0.24% SpillVGPRs: 1418 -> 1359 (-4.16%); split: -4.58%, +0.42% Scratch: 5047552 -> 5040640 (-0.14%) Latency: 204330340 -> 204179212 (-0.07%); split: -0.32%, +0.25% InvThroughput: 36584220 -> 36508856 (-0.21%); split: -0.40%, +0.19% VClause: 437847 -> 437344 (-0.11%); split: -0.34%, +0.22% SClause: 771311 -> 771013 (-0.04%); split: -0.42%, +0.38% Copies: 1774950 -> 1712070 (-3.54%); split: -4.46%, +0.91% Branches: 580595 -> 580478 (-0.02%); split: -0.03%, +0.01% PreSGPRs: 877017 -> 817549 (-6.78%) PreVGPRs: 852747 -> 846966 (-0.68%); split: -0.68%, +0.00% Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26875>	2024-03-06 15:02:21 +00:00
Rohan Garg	9baa57158d	intel/genxml: update PIPE_CONTROL so that we can decode it on the CCS Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28013>	2024-03-06 14:37:11 +00:00
Rhys Perry	3b28ba8239	aco: optimize for purely linear VGPR copies fossil-db: Totals from 2 (0.00% of 79242) affected shaders: Instrs: 1344 -> 1340 (-0.30%) CodeSize: 6968 -> 6952 (-0.23%) Latency: 4414 -> 4410 (-0.09%) InvThroughput: 1018 -> 1020 (+0.20%) Copies: 60 -> 56 (-6.67%) SALU: 40 -> 36 (-10.00%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27697>	2024-03-06 12:55:46 +00:00
Rhys Perry	8cd3a3a520	aco/tests: add tests for linear VGPR register allocation Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27697>	2024-03-06 12:55:46 +00:00
Rhys Perry	f9b37723d0	aco/ra: emit linear VGPR parallel copy separately Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27697>	2024-03-06 12:55:46 +00:00
Rhys Perry	d9b69a7cbf	aco/ra: disable live range splitting of linear vgprs These shouldn't happen anymore. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27697>	2024-03-06 12:55:46 +00:00
Rhys Perry	b7738de4f9	aco/ra: rework linear VGPR allocation We allocate them at the end of the register file and keep them separate from normal VGPRs. This is for two reasons: - Because we only ever move linear VGPRs into an empty space or a space previously occupied by a linear one, we never have to swap a normal VGPR and a linear one. This simplifies copy lowering. - As linear VGPR's live ranges only start and end on top-level blocks, we never have to move a linear VGPR in control flow. fossil-db (navi31): Totals from 5493 (6.93% of 79242) affected shaders: MaxWaves: 150365 -> 150343 (-0.01%) Instrs: 7974740 -> 7976073 (+0.02%); split: -0.06%, +0.08% CodeSize: 41296024 -> 41299024 (+0.01%); split: -0.06%, +0.06% VGPRs: 283192 -> 329560 (+16.37%) Latency: 64267936 -> 64268414 (+0.00%); split: -0.17%, +0.17% InvThroughput: 10954037 -> 10951735 (-0.02%); split: -0.09%, +0.07% VClause: 132792 -> 132956 (+0.12%); split: -0.06%, +0.18% SClause: 223854 -> 223841 (-0.01%); split: -0.01%, +0.01% Copies: 559574 -> 561395 (+0.33%); split: -0.24%, +0.56% Branches: 179630 -> 179636 (+0.00%); split: -0.02%, +0.02% VALU: 4572683 -> 4574487 (+0.04%); split: -0.03%, +0.07% SALU: 772076 -> 772111 (+0.00%); split: -0.01%, +0.01% VOPD: 1095 -> 1099 (+0.37%); split: +0.73%, -0.37% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27697>	2024-03-06 12:55:46 +00:00
Rhys Perry	2d49c79c7e	aco/ra: change get_reg_bounds() helper We will have a separate bounds for linear VGPRs. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27697>	2024-03-06 12:55:46 +00:00
Rhys Perry	a38bc9e165	aco/ra: move parallelcopy creation into helper This is almost a direct copy+paste into its own function. This is useful both for future work and to make the caller smaller. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27697>	2024-03-06 12:55:46 +00:00
Rhys Perry	a8b72082cf	aco/ra: constify various RegisterFile This makes it more obvious that these functions don't change it. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27697>	2024-03-06 12:55:45 +00:00

172173 commits