fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-24 13:10:10 +01:00

Author	SHA1	Message	Date
Marek Olšák	854593b8eb	ac: clean up ac_build_indexed_load function interfaces Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-17 22:03:03 +02:00
Dave Airlie	4e93d6baae	radv: emit fmuladd instead of fma to llvm. For Vulkan SPIR-V the spec states fma() Inherited from OpFMul followed by OpFAdd. Matt says the backend will do the right thing depending on the hardware being compiled for, if you use the fmuladd intrinsic. Using the Mad Max pts test, on high settings at 4K: CHP: 55->60 HGDD: 46->50 LM: 55->60 No change on Stronghold. Thanks to Feral for spending the time to track this down. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Cc: "17.2" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-10-04 06:22:44 +01:00
Nicolai Hähnle	9ddc6e16a9	amd/common: remove ac_shader_abi::chip_class Redundant with the recently added ac_llvm_context::chip_class. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-09-29 11:37:03 +02:00
Nicolai Hähnle	6772452e4c	amd/common: remove has_ds_bpermute argument from ac_build_ddxy Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-09-18 11:25:18 +02:00
Nicolai Hähnle	3db86d86ed	amd/common: add chip_class to ac_llvm_context Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-09-18 11:25:18 +02:00
Nicolai Hähnle	e0af3bed2c	amd/common: round cube array slice in ac_prepare_cube_coords The NIR-to-LLVM pass already does this; now the same fix covers radeonsi as well. Fixes various tests of dEQP-GLES31.functional.texture.filtering.cube_array.combinations.* Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-09-18 11:25:18 +02:00
Bas Nieuwenhuizen	979978ee06	radv: Check for GFX9 for 1D arrays in image_size intrinsic. Only on GFX9 we implement them as 2D images. This fixes: dEQP-VK.image.image_size.1d_array.readonly_12x34 dEQP-VK.image.image_size.1d_array.readonly_1x1 dEQP-VK.image.image_size.1d_array.readonly_32x32 dEQP-VK.image.image_size.1d_array.readonly_7x1 dEQP-VK.image.image_size.1d_array.readonly_writeonly_12x34 dEQP-VK.image.image_size.1d_array.readonly_writeonly_1x1 dEQP-VK.image.image_size.1d_array.readonly_writeonly_32x32 dEQP-VK.image.image_size.1d_array.readonly_writeonly_7x1 dEQP-VK.image.image_size.1d_array.writeonly_12x34 dEQP-VK.image.image_size.1d_array.writeonly_1x1 dEQP-VK.image.image_size.1d_array.writeonly_32x32 dEQP-VK.image.image_size.1d_array.writeonly_7x1 Fixes: `1bcb953e16` "radv: handle GFX9 1D textures" Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-09-15 22:06:56 +02:00
Dave Airlie	aba441be44	radv/ac: bump params array for image atomic comp swap For the comp_swap case this was overflowing and crashing sometimes. Fixes: dEQP-VK.image.atomic_operations.compare_exchange.* Cc: "17.2" <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-09-13 17:17:02 +10:00
Dave Airlie	1bcb953e16	radv: handle GFX9 1D textures As GFX9 can't handle 1D depth textures, radeonsi and apparantly pro just update all 1D textures to 2D, and work around it. This ports the workarounds from radeonsi. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Cc: "17.2" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-09-13 08:40:41 +10:00
Connor Abbott	50967cd0b0	ac: move ac_to_integer() and ac_to_float() to ac_llvm_build.c We'll need to use ac_to_integer() for other stuff in ac_llvm_build.c. Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-09-08 04:24:02 +01:00
Dave Airlie	4cab214e76	radv/ac: use ac_get_type_size. Just moved to newly shared code. Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-09-08 04:15:50 +01:00
Dave Airlie	b880cd3b59	radv/gfx9: fix buffer size on gfx9. The VI sizing only applies to VI. This fixes: dEQP-VK.image.image_size.buffer.* Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-09-06 03:05:44 +01:00
Grazvydas Ignotas	29f46488cc	ac/nir: remove misleading condition location is never set to INTERP_SAMPLE, and Nicolai comments: "... that part is misleading. location refers to the base location, not the final location of the sample, and it can never be INTERP_SAMPLE." Suggested-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Signed-off-by: Grazvydas Ignotas <notasas@gmail.com>	2017-08-29 01:36:57 +03:00
Grazvydas Ignotas	2b4e31bc9b	ac/nir: silence maybe-uninitialized warnings These are likely false positives, but are also annoying because they show up on every "make install", which causes ac_nir_to_llvm to be rebuilt here. Initializing those variables to NULL should be harmless even when unnecessary. Signed-off-by: Grazvydas Ignotas <notasas@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-08-29 01:16:58 +03:00
Bas Nieuwenhuizen	180c1b924e	ac/nir: Add shader support for multiviews. It uses an user SGPR to pass the view index to the shaders, except for the fragment shader where we use layer=view (which comes in handy when we want to do the NV ext that allows us to execute pre-FS stages once instead of per view). Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-08-24 19:20:47 +02:00
Bas Nieuwenhuizen	3d5f29f5f9	ac/nir: Implement input attachments with layered rendering. Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-08-24 19:20:47 +02:00
Bas Nieuwenhuizen	43595db302	ac/nir: Cast sources of integer ops to int. The int32->float semantic conversion got dropped in a testcase, because the src was already float. On closer inspection I decided to add a few more casts for integer op operands to be safe too. Cc: 17.2 <mesa-stable@lists.freedesktop.org> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-08-24 19:20:47 +02:00
Bas Nieuwenhuizen	6bafb56df6	radv: Implement bc optimize. Seems like we actually enabled it already, but did not implement the shader part. With this patch we do. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-08-24 00:57:03 +02:00
Bas Nieuwenhuizen	a7f5545ede	ac/nir: refactor input variable iteration. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-08-24 00:57:03 +02:00
Dave Airlie	b040f51b61	ac/nir: fixup layer/viewport export for GFX9. GFX9 moved where the viewport index export goes. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Cc: "17.2" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-08-21 04:26:37 +01:00
Dave Airlie	4c02e2bd95	radv: disable texture gather workaround on gfx9. Not required anymore. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Cc: "17.2" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-08-17 02:24:36 +01:00
Connor Abbott	c12c2e40a3	ac/nir: fix saturate emission The .f32 was already getting added by emit_intrin_2f_param(). Noticed when enabling LLVM module verification. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-08-08 11:58:21 -07:00
Dave Airlie	3f389f75b6	radv: fix f16->f32 denorm handling for SI/CIK. (v2) This just copies the code from the -pro shaders, and fixes the tests on CIK. With this CIK passes the same set of conformance tests as VI. Fixes: `83e58b03` (radv: flush f32->f16 conversion denormals to zero. (v2)) Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-08-07 00:00:05 +01:00
Bas Nieuwenhuizen	341578a6ae	ac/nir: Add float cast before shadow comparator clamp. LLVM complained about passing an i32 to a float clamp. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Fixes: `0f9e32519b` "ac/nir: clamp shadow texture comparison value on VI" Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-08-02 08:43:13 +02:00
Dave Airlie	cb6f16dce9	radeon/ac: use ds_swizzle for derivs on si/cik. This looks like it's supported since llvm 3.9 at least, so switch over radeonsi and radv to using it, -pro also uses this. We can now drop creating lds for these operations as the ds_swizzle operation doesn't actually write to lds at all. Acked-by: Marek Olšák <marek.olsak@amd.com> (stable requested due to fixing radv CIK conformance tests) Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-08-02 00:12:01 +01:00
Connor Abbott	ddd9e11795	ac/nir: fix nir_op_unpack_64_2x32_split_y emission This was broken thanks to a typo in `b2367cf`. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-08-01 12:20:49 -07:00
Connor Abbott	6d731c5651	ac/nir: fix lsb emission This makes it match radeonsi. The LLVM backend itself will emit the correct instruction, but LLVM might do incorrect optimizations since it thinks the output is undefined when the input is 0, even though it's not supposed to be. We really need a new intrinsic, or for the backend to become smarter and recognize this pattern. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Bas Nieuwenhuizen <basni@google.com>	2017-08-01 12:20:49 -07:00
Dave Airlie	df61a05019	radv: handle 10-bit format clamping workaround. This fixes: dEQP-VK.api.copy_and_blit.core.blit_image.all_formats.* for a2r10g10b10 formats as destination on SI/CIK hardware. This adds support to the meta program for emitting 10-bit outputs, and adds 10-bit support to the fragment shader key. It also only does the int8/10 on SI/CIK. Fixes: `f4e499ec7` (radv: add initial non-conformant radv vulkan driver) Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-08-01 00:10:23 +01:00
Nicolai Hähnle	b7d36efc2d	ac/nir: implement load_frag_coord intrinsic Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-07-31 14:55:44 +02:00
Nicolai Hähnle	bcf85fcd9a	ac/nir: pass ac_llvm_context to unpack_param Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-07-31 14:55:44 +02:00
Nicolai Hähnle	1c64637c26	ac/nir,radeonsi: add and use ac_shader_abi::frag_pos v2: update for LLVMValueRefs in ac_shader_abi Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-07-31 14:55:43 +02:00
Nicolai Hähnle	f03c54e05a	ac/nir,radeonsi: add and use ac_shader_abi::{ancillary,sample_coverage} v2: update for LLVMValueRefs in ac_shader_abi Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-07-31 14:55:43 +02:00
Nicolai Hähnle	7de445377c	ac/nir,radv: move force_persample to ac_shader_info::force_persample Avoid accessing radv-specific structures during the meat of NIR-to-LLVM translation. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-07-31 14:55:43 +02:00
Nicolai Hähnle	0f9e32519b	ac/nir: clamp shadow texture comparison value on VI Needed for TC-compatible HTILE in radeonsi for test cases like piglit spec/arb_texture_rg/execution/fs-shadow2d-red-01.shader_test Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-07-31 14:55:42 +02:00
Nicolai Hähnle	ac2ab5acad	ac/nir: add always_vector argument to ac_build_gather_values_extended This simplifies a bunch of places that no longer need special treatment of value_count == 1. We rely on LLVM to optimize away the 1-element vector types. This fixes a bunch of bugs where 1-element arrays are indexed indirectly. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-07-31 14:55:42 +02:00
Nicolai Hähnle	e247357240	ac/nir,radeonsi: add ac_shader_abi::front_face v2: update for LLVMValueRefs in ac_shader_abi Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-07-31 14:55:42 +02:00
Nicolai Hähnle	28634ff7d3	ac/nir: pass ac_nir_context to emit_ddxy Allocating the ddxy_lds is considered to be part of the API shader translation and not part of the ABI. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-07-31 14:55:41 +02:00
Nicolai Hähnle	c5f3912e13	ac/nir: pass ac_nir_context to SSBO intrinsic handlers Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-07-31 14:55:41 +02:00
Nicolai Hähnle	b78eae6f2a	ac/nir: load buffer descriptors via ac_shader_abi::load_ssbo Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-07-31 14:55:40 +02:00
Nicolai Hähnle	aa66fec47e	ac/nir: pass ac_nir_context to emit_discard_if Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-07-31 14:55:40 +02:00
Nicolai Hähnle	4ba201ee36	ac/nir: extract shader_info->fs.can_discard from NIR shader info Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-07-31 14:55:40 +02:00
Nicolai Hähnle	9061dca872	ac/nir: handle old-style shadow tex instructions correctly The first element is only extracted for new-style shadow tex. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-07-31 14:55:39 +02:00
Nicolai Hähnle	07597632a5	ac/nir: whitespace fixes Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-07-31 14:55:39 +02:00
Nicolai Hähnle	ba06e8bbe8	ac/nir: use shader_info pass to determine whether instance_id is used This improves the separation of ABI and NIR translation. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-07-31 14:55:39 +02:00
Nicolai Hähnle	be0488a173	ac/nir: move setting shader_info->fs.writes_memory to radv-specific code Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-07-31 14:55:39 +02:00
Nicolai Hähnle	f37f9aed84	ac/nir: add image and write parameter to ac_shader_abi::load_sampler_desc Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-07-31 14:55:38 +02:00
Nicolai Hähnle	b36b6f76fa	ac/nir: add support for arrays-of-arrays to get_sampler_desc Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-07-31 14:55:38 +02:00
Nicolai Hähnle	35b7b3a80f	ac/nir: pass ac_nir_context to tex_fetch_ptrs and related functions Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-07-31 14:55:37 +02:00
Nicolai Hähnle	6ff5317589	ac/nir: add and use ac_shader_abi::load_sampler_desc Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-07-31 14:55:37 +02:00
Nicolai Hähnle	57fbf3f9eb	ac/nir: pass ac_nir_context to visit_tex and various related functions Get most of the churn out of the way before actually loading samplers via the ABI. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-07-31 14:55:37 +02:00

1 2 3 4 5 ...

262 commits