fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-19 00:38:06 +02:00

Author	SHA1	Message	Date
Harri Nieminen	f85f511a38	amd: fix typos in code Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22432>	2023-04-13 23:08:22 +00:00
Harri Nieminen	aea48a4ff1	amd: fix typos Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22432>	2023-04-13 23:08:22 +00:00
Qiang Yu	45826e42c5	ac,aco: move gfx10 ngg prim count zero workaround to nir To simplify both llvm and aco backend and remove unnecessary workaround code where prim count is known to be not zero. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22381>	2023-04-13 08:12:03 +00:00
Timur Kristóf	ba537ac25a	ac/llvm: Cover runtime 0 in GFX10 gs_alloc_req workaround. Previously, the workaround only covered compile-time zero, but this is insufficient and can cause GPU hangs in RadeonSI when NGG culling is enabled. Fix this by handling runtime zero in the workaround. Cc: mesa-stable Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22370>	2023-04-08 13:54:06 +00:00
Qiang Yu	2c78cbbfe1	ac/llvm: remove some unused code replaced by nir Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22304>	2023-04-07 03:42:25 +00:00
Marek Olšák	d3b03fedd8	amd: add initial code for gfx940 Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22158>	2023-04-06 15:00:53 +00:00
Qiang Yu	7be81a680b	ac/llvm: remove ac_build_opencoded_load_format Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22045>	2023-04-03 01:35:06 +00:00
Qiang Yu	1165758b8b	ac/llvm,radeonsi: remove abi->load_inputs implementation No nir_load_input in VS now. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22045>	2023-04-03 01:35:06 +00:00
Qiang Yu	531acf548a	ac/llvm: move ac_fixup_ls_hs_input_vgprs to amd common To be shared with radeonsi. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22045>	2023-04-03 01:35:06 +00:00
Qiang Yu	297f97a42b	ac/llvm: vs_rel_patch_id can also be fixed up It's currently used when LS store output to LDS. The LS/HS bug fix seems does not affect this case. But we'd better treat it as other fixed args. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22045>	2023-04-03 01:35:06 +00:00
Marek Olšák	e0d449dd40	amd: set the correct LLVM processor name for gfx1036 Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22084>	2023-03-29 20:36:09 +00:00
Marek Olšák	0b6a7cba0b	amd: rename GFX1036 -> RAPHAEL_MENDOCINO Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22084>	2023-03-29 20:36:09 +00:00
Qiang Yu	3df1c4455e	ac/llvm: implement float16 nir_op_pack_(s\|u)norm_2x16 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21552>	2023-03-28 19:57:11 +00:00
Marek Olšák	5d8f0c570e	amd/llvm: remove no-op code for vec3 loads in ac_build_tbuffer_load Formatted loads always support vec3, so this code didn't do anything. Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22117>	2023-03-27 22:38:07 +00:00
Marek Olšák	03c97b212e	amd/llvm: fix handling of unsupported vec3 loads on gfx6 VMEM loads promoted from vec3 to vec4 didn't trim the vector, thus returning vec4 on gfx6 and vec3 on later generations, which callers don't expect. SMEM loads were adding an extra component on gfx6, causing same issues. Fixes: `82919e2d` - amd: lower subdword UBO loads in NIR Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8693 Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22117>	2023-03-27 22:38:07 +00:00
Qiang Yu	0cd89a27ed	ac/llvm: add missing type convert for nir_load_buffer_amd Fixes: `afcbccb078` ("ac/llvm: implement ACCESS_USE_FORMAT_AMD as buffer_load/store_format") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22043>	2023-03-23 01:55:20 +00:00
Qiang Yu	5ddb46e963	ac/llvm: respect channel_type when ac_build_buffer_load Mainly for nir_load_smem_buffer_amd which pass i32 for this parameter. Fixes: `8030fbcf16` ("nir,ac/llvm: add nir_load_smem_buffer_amd") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22043>	2023-03-23 01:55:20 +00:00
Pierre-Eric Pelloux-Prayer	3f272fd15e	ac/llvm: fix build with LLVM 17 This builds with LLVM 12 -> 17 and a running a simple app seems to work. I couldn't test LLVM 11 because meson fails with: Looking for a fallback subproject for the dependency llvm (modules: bitwriter, engine, mcdisassembler, mcjit, core, executionengine, scalaropts, transformutils, instcombine, amdgpu, bitreader, ipo, native) Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8297 Cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22021>	2023-03-21 15:05:25 +00:00
Qiang Yu	719366c2b2	ac/llvm,radeonsi: lower nir_load_ring_tess_factors_amd No one implement this intrinsic in llvm, so remove the llvm entry too. This will be used in TCS nir tess factor write. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21437>	2023-03-16 04:33:30 +00:00
Timur Kristóf	819ba6f7ae	ac/llvm: Remove unused function ac_build_struct_tbuffer_load. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Acked-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16805>	2023-03-15 14:54:28 +00:00
Timur Kristóf	22ca8c8561	ac/llvm: Implement typed buffer load intrinsic. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Acked-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16805>	2023-03-15 14:54:27 +00:00
Marek Olšák	61da19a262	amd/llvm,radeonsi/gfx11: switch to using GDS_STRMOUT registers This is required by register shadowing (required by the new PAIRS packets), preemption, user queues, and we only have to wait for VS after streamout, not PS. This is how gfx11 streamout should have been done. Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21584>	2023-03-07 22:08:47 +00:00
Marek Olšák	f7076d129d	amd: add nir_intrinsic_xfb_counter_sub_amd and fix overflowed streamout offsets Fixes: `5ec79f9899` - ac/nir/ngg: nogs support streamout Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21584>	2023-03-07 22:08:47 +00:00
Marek Olšák	82919e2dcb	amd: lower subdword UBO loads in NIR This fixes broken subdword UBO loads with LLVM. It's only needed for LLVM, but it's done for both LLVM and ACO because the pass can be fully validated only with ACO and the Vulkan CTS right now. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19399>	2023-03-03 03:27:40 +00:00
Marek Olšák	1a424fee4a	ac/llvm: implement nir_op_unpack_32_4x8 Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19399>	2023-03-03 03:27:40 +00:00
Qiang Yu	e9616d1d2a	ac/llvm: only init outputs when fragment shader for radv LS pass output to TCS by reg is not enabled when LLVM. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21435>	2023-02-28 07:19:29 +00:00
Qiang Yu	822e756511	ac/llvm,radeonsi: lower fbfetch in abi Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21436>	2023-02-27 09:43:53 +08:00
Qiang Yu	5c44404b5f	ac/llvm,radeonsi: lower nir_load_barycentric_at_sample in abi RADV already did this in radv_lower_fs_intrinsics(). Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21436>	2023-02-27 09:39:41 +08:00
Marek Olšák	df6380ddc9	amd: implement conformant TRUNC_COORD behavior for gfx11 For testing, the conformant behavior can be enabled by setting conformant_trunc_coord to true manually and running this to enable the conformant behavior in hw: umr -w ..regTA_CNTL2 0x40000 The layer index rounding and TRUNC_COORD resetting workarounds can disabled in the shader compiler. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21525>	2023-02-24 21:27:24 +00:00
Marek Olšák	091268944d	amd,radeonsi: remove unused LLVM functions Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21525>	2023-02-24 21:27:24 +00:00
Georg Lehmann	ee47cc8256	amd,nir: remove byte_permute_amd intrinsic It's unused and if we ever want to use it again we should make it an alu opcode instead. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21445>	2023-02-22 20:13:52 +00:00
Daniel Schürmann	2bb369dd8d	nir: add assertions that loops don't have a Continue Construct Hoping that I didn't miss any, this should add assertions to all functions and passes which explicitly handle 'nir_loop'. Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13962>	2023-02-21 10:41:11 +00:00
Konstantin Seurer	0f709510f4	ac/llvm: Implement bvh64_intersect_ray_amd Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21268>	2023-02-17 17:04:47 +00:00
Timur Kristóf	450e173de0	ac/llvm: Change ac_build_tbuffer_load to take format and channel type. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21358>	2023-02-16 15:29:37 +00:00
Timur Kristóf	0ae778ca59	ac/llvm: Fix ac_build_buffer_load to work with more than 4 channels. LLVM is unable to select instructions for num_channels > 4, so we workaround that by manually splitting larger buffer loads. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21358>	2023-02-16 15:29:37 +00:00
Timur Kristóf	a2755fc203	ac/llvm: Fix buffer_load_amd with larger than 32-bit channel sizes. LLVM is unable to select instructions for larger than 32-bit channel types. Workaround by using i32 and casting to the correct type later. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21358>	2023-02-16 15:29:37 +00:00
Timur Kristóf	b5b0ded4c1	ac/llvm: Remove "structurized" argument and instead check vindex. Change ac_build_buffer_load_common and ac_build_tbuffer_load so the use structurized load when the vindex argument is not NULL. Adjust callers to match the new behaviour. This fixes the load_buffer_amd intrinsic with index source. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21358>	2023-02-16 15:29:37 +00:00
Rhys Perry	be6f30a0db	ac/llvm: let ring_offsets be accessed like a normal arg Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19202>	2023-02-06 14:25:16 +00:00
Konstantin Seurer	10ac51a52b	ac/llvm: Fix validation error with global io Fixes: `afd645f057` ("ac/llvm: remove LLVMBuildGEP usages") Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20521>	2023-02-05 12:16:05 +00:00
Ian Romanick	ea413e826b	nir: Eliminate nir_op_f2b Builds on the work of !15121. This gets to delete even more code because many drivers shared a lot of code for i2b and f2b. No shader-db or fossil-db changes on any Intel platform. v2: Rebase on `1a35acd8d9`. v3: Update a comment in nir_opcodes_c.py. Suggested by Konstantin. v4: Another rebase. Remove f2b stuff from Midgard. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20509>	2023-02-03 22:39:57 +00:00
Qiang Yu	f6b194b648	nir,ac/llvm,aco,radv,radeonsi: remove nir_export_vertex_amd Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20691>	2023-02-03 12:27:44 +00:00
Qiang Yu	f44872c7b6	nir,ac/llvm,aco: remove nir_export_primitive_amd Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20691>	2023-02-03 12:27:44 +00:00
Qiang Yu	601ad9e0a9	amd,radeonsi: implement nir_load_force_vrs_rates_amd in driver abi Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20691>	2023-02-03 12:27:43 +00:00
Qiang Yu	5fe4dd3d68	ac/llvm: implement nir_export_amd Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20691>	2023-02-03 12:27:43 +00:00
Marek Olšák	da7dfbe3b8	amd/llvm: fix LLVM 15 & 16 crashes in SelectionDAG.cpp Cc: stable Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21041>	2023-02-03 00:18:02 +00:00
Marek Olšák	84d59cdb59	amd: split GFX1103 into GFX1103_R1 and GFX1103_R2 Fixes: `caa09f66ae` - amd: add chip identification for gfx1100-1103 Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21041>	2023-02-03 00:18:01 +00:00
Marek Olšák	2ae08c3e8f	ac/llvm: remove llvm:: now that we use "using namespace llvm" Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20297>	2023-01-26 19:33:55 -05:00
Marek Olšák	a273f64f80	ac/llvm: run the IPSCCP pass AMDVLK runs it and it seems useful. https://en.wikipedia.org/wiki/Sparse_conditional_constant_propagation 58380 shaders in 35438 tests Totals: SGPRS: 2709080 -> 2709224 (0.01 %) VGPRS: 1592972 -> 1592808 (-0.01 %) Spilled SGPRs: 2420 -> 2420 (0.00 %) Spilled VGPRs: 1077 -> 1077 (0.00 %) Private memory VGPRs: 253 -> 253 (0.00 %) Scratch size: 1232 -> 1232 (0.00 %) dwords per thread Code Size: 61382088 -> 61356504 (-0.04 %) bytes Max Waves: 849293 -> 849308 (0.00 %) Outputs: 127090 -> 127090 (0.00 %) Patch Outputs: 579 -> 579 (0.00 %) Totals from affected shaders: SGPRS: 5400 -> 5544 (2.67 %) VGPRS: 6200 -> 6036 (-2.65 %) Spilled SGPRs: 0 -> 0 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 975824 -> 950240 (-2.62 %) bytes Max Waves: 1214 -> 1229 (1.24 %) Outputs: 232 -> 232 (0.00 %) Patch Outputs: 0 -> 0 (0.00 %) Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20297>	2023-01-26 19:33:43 -05:00
Marek Olšák	d05c3811cd	ac/llvm: run the LLVM sinking pass because LLVM will stop running it shader-db was run with the sinking pass disabled in LLVM. 58380 shaders in 35438 tests Totals: SGPRS: 2730768 -> 2730768 (0.00 %) VGPRS: 1592932 -> 1592928 (-0.00 %) Spilled SGPRs: 2687 -> 2687 (0.00 %) Spilled VGPRs: 551 -> 551 (0.00 %) Private memory VGPRs: 253 -> 253 (0.00 %) Scratch size: 700 -> 700 (0.00 %) dwords per thread Code Size: 61238872 -> 61238868 (-0.00 %) bytes Max Waves: 849209 -> 849209 (0.00 %) Outputs: 127090 -> 127090 (0.00 %) Patch Outputs: 579 -> 579 (0.00 %) Totals from affected shaders: SGPRS: 440 -> 440 (0.00 %) VGPRS: 396 -> 392 (-1.01 %) Spilled SGPRs: 0 -> 0 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 49880 -> 49876 (-0.01 %) bytes Max Waves: 105 -> 105 (0.00 %) Outputs: 14 -> 14 (0.00 %) Patch Outputs: 0 -> 0 (0.00 %) Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20297>	2023-01-26 19:33:17 -05:00
Timur Kristóf	c644461b71	radv, aco, ac: Implement pack_half_2x16_rtz_split. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15838>	2023-01-26 12:24:24 +00:00

1 2 3 4 5 ...

574 commits