fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-24 04:30:10 +01:00

Author	SHA1	Message	Date
Dave Airlie	3153d74207	ac/nir: account for view index in the user sgpr allocation. The view index user sgpr wasn't being accounted for properly, this refactors out the code to decide if it's required and then uses that info to account for it. Fixes: `180c1b924e` (ac/nir: Add shader support for multiviews.) Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2018-01-18 19:47:40 +00:00
Timothy Arceri	9248f72c4e	ac: tidy up array indexing logic Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-01-18 15:59:27 +11:00
Timothy Arceri	e2b9296146	ac: fix buffer overflow bug in 64bit SSBO loads Fixes: `441ee1e65b` "radv/ac: Implement Float64 SSBO loads" Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-01-18 10:26:58 +11:00
Timothy Arceri	409e15f26f	ac: fix nir_intrinsic_get_buffer_size for radeonsi Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-01-18 10:25:20 +11:00
Timothy Arceri	7898eb9a60	ac: rework load_tcs_{inputs,outputs} This shares more code and calls the new shared load_tess_varyings() abi so that the radeonsi nir path now supports tcs output loads. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-01-18 00:03:33 +11:00
Timothy Arceri	9622b445c8	ac/radeonsi: add tcs load outputs support The code to load outputs is essentially the same as load inputs so we make the interface more generic to maximise code sharing. We will make use of the new support in the following patch. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-01-18 00:03:33 +11:00
Samuel Pitoiset	05f73b9672	ac: set no-signed-zeros-fp-math when RADV_DEBUG="unsafemath" is used This is an optimisation that is recommended by Matt Arsenault, and used by RadeonSI, but it's not compatible with Vulkan. Note that AC_FLOAT_MODE_UNSAFE_FP_MATH includes the no signed zeros flag in LLVM. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-01-16 21:39:57 +01:00
Samuel Pitoiset	4f5318df2c	ac: set fast math flags when RADV_DEBUG="unsafemath" is used When that debug option is not used, we use the default float mode because the no signed zeros optimisation is not Vulkan compatible. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-01-16 21:39:55 +01:00
Samuel Pitoiset	ad2b3b2a9c	ac: replace llvm.AMDGPU.kilp by llvm.amdgcn.kill with LLVM 6 This also replaces llvm.AMDGPU.kilp by llvm.AMDGPU.kill with LLVM < 6. Similar to RadeonSI codepath. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-01-16 21:39:51 +01:00
Samuel Pitoiset	8045f01e2a	Revert "ac/shader: gather If TES reads TESSINNER or TESSOUTER" This can't work for two reasons: - TESSINNER/TESSOUTER are shader input values, so never translated to the intrinsic ops - the shader info pass scans the current stage but we want to know in TCS, if TES reads the tess factors. This fixes 6 regressions related to deqp-vk/tessellation/shader_input_output/tess_level_{inner,outer}_XXX_tes This reverts commit `5ba1a61648`.	2018-01-15 13:47:18 +01:00
Samuel Pitoiset	5842cb0df1	amd/common: fix loading InstanceID for tess on < GFX9 InstanceID is in VGPR2, not 1. One more failure that CTS didn't catch up... Reported-by: Alex Smith <asmith@feralinteractive.com> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-01-15 11:59:16 +01:00
Samuel Pitoiset	5ba1a61648	ac/shader: gather If TES reads TESSINNER or TESSOUTER This shouldn't be scanned in the pipeline. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-01-15 11:51:47 +01:00
Samuel Pitoiset	aebde47840	ac: remove ac_shader_variant_info::fs::output_mask Unused. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-01-15 11:48:42 +01:00
Timothy Arceri	e6378962ce	ac: add doubles support to isign Fixes a number of int64 piglit tests, for example: generated_tests/spec/arb_gpu_shader_int64/execution/built-in-functions/fs-sign-i64vec2.shader_test Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-01-14 11:40:03 +11:00
Timothy Arceri	741b21b713	ac/nir: fix translation of nir_op_b2i for doubles V2: just zero-extend the 32-bit value. Fixes a number of int64 piglet tests, for example: generated_tests/spec/arb_gpu_shader_int64/execution/conversion/frag-conversion-explicit-bool-int64_t.shader_test Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-01-14 11:40:03 +11:00
Dave Airlie	e37db93246	radv: trim buffer load result (fixes dota2) Running dota2 since the below commit crashes with an llvm assert. Trim the vector like the other user. This possible could also be avoided by not padding inside the load vec3->vec4. Fixes: `41c36c4549` (amd/common: use ac_build_buffer_load() for emitting UBO loads) Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	2018-01-12 00:41:55 +00:00
Timothy Arceri	30c1a93f6d	ac/nir: fix translation of nir_op_fsign for doubles Without this we end up with the llvm error message: "Both operands to a binary operator are not of the same type!" Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-01-12 09:29:18 +11:00
Timothy Arceri	7b971c828a	ac/nir: fix translation of nir_op_frcp for doubles Without this we end up with the llvm error message: "Both operands to a binary operator are not of the same type!" Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-01-12 09:29:18 +11:00
Timothy Arceri	24575c815c	ac/nir: fix translation of nir_op_frsq for doubles Without this we end up with the llvm error message: "Both operands to a binary operator are not of the same type!" Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-01-12 09:29:17 +11:00
Timothy Arceri	c797cd605a	ac: add load_patch_vertices_in() to the abi Fixes the follow test for radeonsi nir: tests/spec/arb_tessellation_shader/execution/quads.shader_test Also stops 8 other tests from crashing, they now just fail e.g. tcs-output-array-float-index-rd-after-barrier.shader_test Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-01-11 14:28:37 +11:00
Bas Nieuwenhuizen	67e09c8b45	ac/nir: Sanitize location_frac for local variables. If they were promoted from inputs/outputs, they could have a non-zero value left over, which messed with our store handling. Fixes: `06f05040eb` "radv: Link shaders." Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-01-11 00:56:52 +01:00
Samuel Pitoiset	41c36c4549	amd/common: use ac_build_buffer_load() for emitting UBO loads Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-01-10 19:02:27 +01:00
Samuel Pitoiset	7145b20afb	amd/common: bump the number of available user SGPRS to 32 on GFX9 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-01-10 12:35:08 +01:00
Samuel Pitoiset	d43f50c00b	amd/common: do not rely on the pipeline for the push constants logic It makes more sense to rely on nir_intrinsic_load_push_constant instead of the pipeline layout. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-01-10 12:31:54 +01:00
Samuel Pitoiset	9e2395faf5	amd/common: determine the ES type (VS or TES) for the GS on GFX9 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-01-10 12:31:49 +01:00
Timothy Arceri	f04d2ca0d9	ac: rework emit_barrier() to not segfault on radeonsi nir_to_llvm_context will always be NULL for radeonsi so we need work around this. Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-01-09 10:21:32 +11:00
Timothy Arceri	19f3141e6a	ac: add load_tess_level() to the abi Fixes the following piglit tests in radeonsi: vs-tcs-tes-tessinner-tessouter-inputs-quads.shader_test vs-tcs-tes-tessinner-tessouter-inputs-tris.shader_test vs-tes-tessinner-tessouter-inputs-quads.shader_test vs-tes-tessinner-tessouter-inputs-tris.shader_test v2: make use of si_shader_io_get_unique_index_patch() via the helper in the previous patch rather than shader_io_get_unique_index() Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> (v1) Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-01-09 10:21:32 +11:00
Samuel Pitoiset	08a5f4412a	radv: get InstanceID from VGPR1 (or VGPR2 for tess) instead of VGPR3 VGPR1 = InstanceID / StepRate0; // StepRate0 can be set to 1 Ported from RadeonSI. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-01-08 21:30:01 +01:00
Samuel Pitoiset	ec63ab39be	radv: enable denorms for 64-bit and 16-bit floats Similar to RadeonSI. This fixes: dEQP-VK.image.texel_view_compatible.graphic.basic.attachment_read.bc*r16g16b16a16_sfloat dEQP-VK.image.extended_usage_bit.attachment_write.r16_sfloat Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-01-05 09:51:33 +01:00
Samuel Pitoiset	7643c71527	amd/common: correctly detect if we need ring buffers When allocate_user_sgprs() was called, ctx->stage was actually unset and 0 is for the vertex shader. This doesn't change anything for now because of the spill support thing. Though, the number of user SGPRs has to be fixed for merged shaders on GFX9. It was broken before anyway. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-01-05 09:49:51 +01:00
Samuel Pitoiset	50cfad0298	amd/common: use ac_image_load when lod is zero This might decrease VGPR spilling, because we no longer have to use v4i32 for 2D fetches when level == 0. We now use v2i32 for those cases. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-01-05 09:49:45 +01:00
Timothy Arceri	14adf7853a	ac/radeonsi: add load_tess_coord() to the abi Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-01-05 11:58:55 +11:00
Timothy Arceri	9e1a3caf32	ac/radeonsi: add tcs_rel_ids to the abi Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-01-05 11:58:55 +11:00
Timothy Arceri	f93740efc1	ac: add {tcs,tes}_patch_id to the abi Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-01-05 11:58:55 +11:00
Timothy Arceri	b99ebaa4fd	ac: move some helpers to ac_llvm_build.c We will call these from the radeonsi NIR backend. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-01-05 11:58:55 +11:00
Timothy Arceri	2deb822075	ac: add store_tcs_outputs() to the abi Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-01-05 11:58:55 +11:00
Timothy Arceri	b104e7e172	ac: call load_tcs_input() via the abi This also enables some code sharing with tes. V2: drop type param and just use ctx->i32 Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-01-05 11:58:55 +11:00
Timothy Arceri	b09a3196e0	ac: add load_tes_inputs() to the abi V2: drop type param and just use ctx->i32 Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-01-05 11:58:55 +11:00
Samuel Pitoiset	a4d2782664	amd/common: scan if gl_PrimitiveID is used before translating to LLVM It makes more sense to move all scan stuff in the same place. Also, we don't really need to duplicate the uses_primid field for each stages. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-01-04 18:43:09 +01:00
Bas Nieuwenhuizen	c99426ea83	ac/nir: Handle loading data from compact arrays. Fixes: `f4e499ec79` "radv: add initial non-conformant radv vulkan driver" Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-01-04 00:14:23 +01:00
Samuel Pitoiset	3260a96c17	amd/common: rework set_userdata_location() and rename to set_loc() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-12-27 10:25:17 +01:00
Samuel Pitoiset	4221a816e2	amd/common: rename set_userdata_location_shader() to set_loc_shader() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-12-27 10:25:15 +01:00
Samuel Pitoiset	5081fd398e	amd/common: replace set_userdata_location_indirect() by set_loc_desc() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-12-27 10:25:13 +01:00
Samuel Pitoiset	f8202ef683	amd/common: rename radv_define_vs_user_sgprs_phase2() ... to set_vs_specific_input_locs(). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-12-27 10:25:11 +01:00
Samuel Pitoiset	9d5a1787ee	amd/common: rename radv_define_common_user_sgprs_phase2() ... to set_global_input_locs(). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-12-27 10:25:08 +01:00
Samuel Pitoiset	9a2393a510	amd/common: rename add_user_sgpr_array_argument() to add_array_arg() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-12-27 10:25:06 +01:00
Samuel Pitoiset	b6217bdbee	amd/common: replace add_sgpr_argument() by add_arg() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-12-27 10:25:04 +01:00
Samuel Pitoiset	32bbc9eb0f	amd/common: replace add_user_sgpr_argument() by add_arg() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-12-27 10:25:02 +01:00
Samuel Pitoiset	e946b5360d	amd/common: replace add_vgpr_argument() by add_arg() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-12-27 10:24:59 +01:00
Samuel Pitoiset	f1242a8976	amd/common: add new add_arg() helper for SGPRs/VGPRs arguments The idea is to clean up the add arguments logic. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-12-27 10:24:57 +01:00

1 2 3 4 5 ...

389 commits