fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-20 04:48:07 +02:00

Author	SHA1	Message	Date
Alex Smith	ce4058dafd	radv/ac: Fix shared memory offset calculation The index passed to get_shared_memory_ptr is an attribute slot index, i.e. the index of a vec4 within LDS. Therefore this must be scaled by sizeof(vec4) to give the LDS byte offset. Fixes: `f4e499ec79` ("radv: add initial non-conformant radv vulkan driver") Signed-off-by: Alex Smith <asmith@feralinteractive.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> CC: <mesa-stable@lists.freedesktop.org>	2017-03-17 09:35:48 +01:00
James Legg	e88cac1df0	radv: Fix using more than 4 bound descriptor sets Avoid a buffer overflow in ac_nir_to_llvm.c's create_function when using more than 4 descriptor sets. radv claims support for 8. Cc: 17.0 <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-03-17 09:12:43 +01:00
Dave Airlie	7372e3cf5f	radv/ac: workaround regression in llvm 4.0 release LLVM 4.0 was released with a pretty messy regression, which will hopefully get fixed in the future. This workaround was proposed by Tom, and it fixes the CTS regressions here at least. I'm not sure if this will cause any major side effects, but correctness over speed and all that. radeonsi should possibly consider the same workaround until an llvm fix can be found. Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-03-15 09:51:53 +10:00
Dave Airlie	3ece76f03d	radv/ac: gather4 cube workaround integer This fix is extracted from amdgpu-pro shader traces. It appears the gather4 workaround for integer types doesn't work for cubes, so instead it forces a float scaled sample, then converts to integer. It modifies the descriptor before calling the gather. This also produces some ugly asm code for reasons specified in the patch; llvm could probably do better than dumping sgprs to vgprs. This fixes: dEQP-VK.glsl.texture_gather.basic.cube.rgba8* Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-03-15 09:51:53 +10:00
Jason Ekstrand	762a6333f2	nir: Rework conversion opcodes The NIR story on conversion opcodes is a mess. We've had way too many of them, naming is inconsistent, and which ones have explicit sizes was sort-of random. This commit re-organizes things and makes them all consistent: - All non-bool conversion opcodes now have the explicit size in the destination and are named <src_type>2<dst_type><size>. - Integer <-> integer conversion opcodes now only come in i2i and u2u forms (i2u and u2i have been removed) since the only difference between the different integer conversions is whether or not they sign-extend when up-converting. - Boolean conversion opcodes all have the explicit size on the bool and are named <src_type>2<dst_type>. Making things consistent also allows nir_type_conversion_op to be moved to nir_opcodes.c and auto-generated using mako. This will make adding int8, int16, and float16 versions much easier when the time comes. Reviewed-by: Eric Anholt <eric@anholt.net>	2017-03-14 07:36:40 -07:00
Dave Airlie	b8ee70384a	radv: setup llvm target data layout Ported from radeonsi, pointed out by Tom. "This prevents LLVM from using sext instructions for local memory offsets and allows the backend to fold immediate offsets into the instruction. This also prevents some incorrect code generation for ptrtoint and inttoptr instructions." Cc: "13.0 17.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Tom Stellard <tstellar@redhat.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-03-14 10:33:59 +10:00
Dave Airlie	e27fdbcb4c	radv/ac: move to new image intrinsics. This hooks up radv to the new image intrinsic builders. Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-03-13 09:44:53 +10:00
Emil Velikov	a1d186cb70	amd: remove shebang from python scripts Analogous to earlier commit(s). Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com>	2017-03-10 14:12:46 +00:00
Emil Velikov	f6180a5ab7	amd: remove execute bit from python scripts Analogous to earlier commit(s). Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com>	2017-03-10 14:12:46 +00:00
Fredrik Höglund	162beb2abb	radv/ac: fix multiple descriptor sets with dynamic buffers The dynamic_offset_offset in the descriptor set binding layout is relative to the dynamic_offset_start for the set in the pipeline layout. Cc: 17.0 <mesa-stable@lists.freedesktop.org> Signed-off-by: Fredrik Höglund <fredrik@kde.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-03-07 20:23:32 +01:00
Dave Airlie	03f5405fc2	amd/common: document PREDICATION OP 3 as 64-bit bool. This just documents some info for possible future use. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-03-07 15:20:01 +10:00
Dave Airlie	5c45d2051a	radv/ac: introduce i1true/i1false to context. This uses these in a few places, and fixes one or two cases which were using da as 32-bit instead of bool. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-03-07 08:17:03 +10:00
Dave Airlie	ca884aef86	radv/ac: handle Z export using new builder. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-03-07 08:17:03 +10:00
Dave Airlie	bf2be50774	radv/ac: move to using common ac_get_image_intr_name. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-03-07 08:17:03 +10:00
Dave Airlie	10ae83a9c2	radeonsi/ac: move get_image_intr_name to common This code is used in radv, so move to common build code. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-03-07 08:17:03 +10:00
Marek Olšák	7e1faa79d3	radeonsi: drop support for LLVM 3.6 & 3.7 They are too old. Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-03-06 14:13:04 +01:00
Marek Olšák	d5d74fe2b5	radeonsi: set the convergent attribute where needed Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-03-06 14:13:04 +01:00
Marek Olšák	ef883fc554	gallivm,ac: add LP_FUNC_ATTR_CONVERGENT Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-03-06 14:13:04 +01:00
Marek Olšák	9b08f044be	radeonsi: fix LLVM 3.9 - don't use non-matching attributes on declarations Call site attributes are used since LLVM 4.0. This also reverts commit `b19caecbd6` "radeon/ac: fix intrinsic version check", because this is the correct fix. Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-03-06 14:13:04 +01:00
Dave Airlie	2e73ccb485	radv/ac: use bitfield extract new intrinsics. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-03-06 15:27:33 +10:00
Dave Airlie	9c7309b09b	radv/ac: move to new kill build. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-03-06 15:27:33 +10:00
Dave Airlie	a2652719f3	radv/ac: move to using new export intrinsics. This uses the new code in build to do exports. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-03-06 15:27:33 +10:00
Dave Airlie	2830ece0fc	radv/ac: switch to new intrinsics for pkrtz and clamp. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-03-06 15:27:32 +10:00
Dave Airlie	b19caecbd6	radeon/ac: fix intrinsic version check Reported-by: 375gnu@gmail.com Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=100068 Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-03-06 06:05:58 +10:00
Marek Olšák	7f1446a8a1	ac: normalize build helper names s/emit/build/ Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-03-03 17:30:07 +01:00
Marek Olšák	8bde7fb3fc	ac: replace SI.vs.load.input with amdgcn.buffer.load.format Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-03-03 17:30:07 +01:00
Marek Olšák	94811dc66c	radeonsi: move SI.vs.load.input building into amd/common Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-03-03 17:30:07 +01:00
Marek Olšák	97e21cfa25	ac: replace llvm.SI.tbuffer.store with llvm.amdgcn.buffer.store if ADD_TID=0 ADD_TID doesn't work. Needs more investigation. v2: remove leftover dead code Reviewed-by: Dave Airlie <airlied@redhat.com> (v1)	2017-03-03 15:29:30 +01:00
Marek Olšák	8cfdbba6c7	ac: remove offen parameter from ac_build_buffer_store_dword Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-03-03 15:29:30 +01:00
Marek Olšák	27439dfdae	radeonsi: merge and simplify tbuffer_store functions Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-03-03 15:29:30 +01:00
Marek Olšák	d4324ddb89	radeonsi: replace AMDGPU.bfe.* with amdgcn.*bfe Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-03-03 15:29:30 +01:00
Marek Olšák	9c09592086	radeonsi: move kill intrinsic building into amd/common just a cleanup Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-03-03 15:29:30 +01:00
Marek Olšák	e729dc7c46	radeonsi: set readnone on reads from read-only memory	2017-03-03 15:29:30 +01:00
Marek Olšák	653ac0b389	radeonsi: replace SI.packf16 with amdgcn.cvt.pkrtz	2017-03-03 15:29:30 +01:00
Marek Olšák	4b2e5b9389	ac: replace old image intrinsics with new ones Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-03-03 15:29:30 +01:00
Marek Olšák	ad18d7f040	radeonsi: move image intrinsic building to amd/common Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-03-03 15:29:30 +01:00
Marek Olšák	2b3ebe307c	ac: replace SI.export with amdgcn.exp.* Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-03-03 15:29:30 +01:00
Marek Olšák	369f4a8726	radeonsi: move llvm.SI.export building to amd/common Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-03-03 15:29:30 +01:00
Marek Olšák	9af03318aa	ac: unify build_type_name_for_intr functions Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-03-03 15:29:30 +01:00
Marek Olšák	b5744310d4	gallivm, ac: add writeonly and inaccessiblememonly attributes Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-03-03 15:29:30 +01:00
Tobias Klausmann	6d600cf632	amd/common: Fix build with new ac_add_function_attr() Fix usage of ac_add_function_attr() and make it known! common/ac_nir_to_llvm.c: In function 'create_llvm_function': common/ac_nir_to_llvm.c:265:4: error: implicit declaration of function 'ac_add_function_attr' [-Werror=implicit-function-declaration] ac_add_function_attr(main_function, i + 1, AC_FUNC_ATTR_BYVAL); ^~~~~~~~~~~~~~~~~~~~ Signed-off-by: Tobias Klausmann <tobias.johannes.klausmann@mni.thm.de> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-03-01 23:53:38 +01:00
Marek Olšák	940da36a65	gallivm,ac: add function attributes at call sites instead of declarations They can vary at call sites if the intrinsic is NOT a legacy SI intrinsic. We need this to force readnone or inaccessiblememonly on some amdgcn intrinsics. This is only used with LLVM 4.0 and later. Intrinsics only used with LLVM <= 3.9 don't need the LEGACY flag. gallivm and ac code is in the same patch, because splitting would be more complicated with all the LEGACY uses all over the place. v2: don't change the prototype of lp_add_function_attr. Reviewed-by: Jose Fonseca <jfonseca@vmware.com> (v1)	2017-03-01 18:59:36 +01:00
Marek Olšák	408f370710	gallivm,ac: remove unused FUNC_ATTR_LAST enums Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2017-03-01 18:59:36 +01:00
Dave Airlie	e66be3d3bb	radv: fix txs for sampler buffers I messed this up when I wrote it, this fixes: dEQP-VK.memory.pipeline_barrier.uniform_texel_buffer. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Cc: "17.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-03-01 08:02:24 +10:00
Marek Olšák	8c838730d0	amd/common: fix ASICREV_IS_POLARIS11_M for Polaris12 Cc: 17.0 <mesa-stable@lists.freedesktop.org> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-02-28 21:44:30 +01:00
Bas Nieuwenhuizen	137b06b437	radv/ac: Use constants for immutable samplers. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-02-28 20:48:14 +01:00
Timothy Arceri	f0aaa4b3a4	radeon/ac: make ac_shader_binary_config_start() available externally The read config functions are different for r600 and radeonsi so we can't just share the one in amd common. So just share this instead. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-02-28 13:20:31 +11:00
Timothy Arceri	affc8314cb	radeon/ac: add llvm_ir_string to ac_shader_binary struct Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-02-28 13:20:31 +11:00
Bas Nieuwenhuizen	336b05c49a	radv/ac: Add integer->integer casts. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Acked-by: Edward O'Callaghan <funfunctor@folklore1984.net>	2017-02-26 19:59:27 +01:00
Marek Olšák	c7878b0167	ac: silence a warning trivial	2017-02-25 00:16:38 +01:00
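
The scaling described in ce4058dafd ("radv/ac: Fix shared memory offset calculation") can be sketched as follows. This is a minimal standalone illustration, not the code from that commit: the helper name `lds_byte_offset_from_slot` and the constant `VEC4_SIZE_BYTES` are hypothetical, and the sketch assumes a vec4 is four 32-bit floats (16 bytes).

```c
#include <assert.h>
#include <stdint.h>

/* Assumption: a vec4 is four 32-bit floats, i.e. 16 bytes. */
#define VEC4_SIZE_BYTES (4u * (uint32_t)sizeof(float))

/*
 * Hypothetical helper (not a Mesa function): converts an attribute slot
 * index, i.e. the index of a vec4 within LDS, into an LDS byte offset.
 * Using the slot index directly as a byte offset is the kind of error
 * the commit message describes.
 */
static uint32_t lds_byte_offset_from_slot(uint32_t slot_index)
{
    /* Guard against overflow of the 32-bit multiplication. */
    assert(slot_index <= UINT32_MAX / VEC4_SIZE_BYTES);
    return slot_index * VEC4_SIZE_BYTES;
}
```

With these assumptions, slot 0 maps to byte offset 0 and each subsequent slot advances the offset by one vec4, so an unscaled index would address the wrong location for every slot after the first.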


185 commits