fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-20 19:58:19 +02:00

Author	SHA1	Message	Date
Connor Abbott	2ec77f7a3c	ac/nir: fix 64-bit shifts NIR always makes the shift amount 32 bits, but LLVM asserts if the two sources aren't the same type. Zero-extend the shift amount to make LLVM happy. Signed-off-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-07-03 11:58:59 -07:00
Connor Abbott	7168425dd7	ac/nir: implement 64-bit packing and unpacking We implement the split opcodes, and tell NIR to lower the original ones. The lowering to LLVM is a little more complicated, but NIR can optimize the split ones a little better, and some NIR lowering passes that we might want to use (particularly for doubles) emit the split ones. This should fix pack/unpackDouble2x32, which seems to have been broken since we enabled the Float64 capability. It will also fix pack/unpackInt2x32 when we enable the Int64 capability. Fixes: `798ae37c` ("radv: Enable Float64 support.") Signed-off-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-07-03 11:58:58 -07:00
Bas Nieuwenhuizen	87d3349393	radv: Use v4i32 variant of llvm.SI.load.const. We apparently still used v16i8 .... As radeonsi doesn't use it with LLVM version checks I don't think we need them either. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-06-30 23:30:55 +02:00
Dave Airlie	ff422500cc	ac/nir: remove last remnants of v16i8 llvm doesn't need this workaround anymore. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-28 20:22:30 +01:00
Alex Smith	909184ac9c	ac/nir: Use correct LLVM intrinsics for atomic ops on imageBuffers The buffer intrinsics should be used instead of the image ones. Signed-off-by: Alex Smith <asmith@feralinteractive.com> Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-06-28 21:05:04 +02:00
James Legg	69a17da037	ac/nir: assert printfs will fit Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-06-28 21:05:04 +02:00
James Legg	6fc41bb4d5	ac/nir: Make intrinsic_name buffer long enough When using cmpswap on an image, it was being truncated to lvm.amdgcn.image.atomic.cmpswa, with the coords type missing entirely. v2: Add stable CC CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Grazvydas Ignotas <notasas@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-06-28 21:05:04 +02:00
Nicolai Hähnle	2ce126df3a	ac/nir: convert emit helpers to ac_llvm_context Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-27 10:28:30 +10:00
Nicolai Hähnle	58d496c8e2	ac/nir: remove unused nir_to_llvm_context::has_ddxy Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-27 10:28:30 +10:00
Nicolai Hähnle	6ecef25545	ac/nir: implement nir_op_f2b Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-27 10:28:30 +10:00
Nicolai Hähnle	dacf73e527	ac/nir: implement nir_op_{b2i,i2b} Booleans in NIR are ~0 for true, b2i returns 0/1. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-27 10:28:30 +10:00
Nicolai Hähnle	77d7764d5e	ac/nir: convert type helpers to ac_llvm_context Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-27 10:28:30 +10:00
Nicolai Hähnle	b7bd49158e	ac/llvm: fix type of second llvm.cttz.* parameter LLVM has required an i1 here for a long time. llvm.ctlz.* was fixed in commit `edd23e0606` ("ac/llvm: fix various findMSB bugs"). Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-27 10:28:30 +10:00
Nicolai Hähnle	e8ba03d32a	ac/shader_info: fix a comment Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-27 10:28:29 +10:00
Nicolai Hähnle	edfd3be77e	ac: add ac_llvm_context::v8i32 Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-27 10:28:29 +10:00
Nicolai Hähnle	331a574732	ac: add ac_llvm_context::{i,f}32_{0,1} Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-27 10:28:29 +10:00
Nicolai Hähnle	7bf8c944dc	ac: add ac_llvm_context::{i16, i64, f16, f64} Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-27 10:28:29 +10:00
Dave Airlie	6a68170c83	radv: handle primitive id input into fragment shader with no geom shader Fixes: dEQP-VK.pipeline.framebuffer_attachment.no_attachments dEQP-VK.pipeline.framebuffer_attachment.no_attachments_ms Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-26 08:45:30 +10:00
Dave Airlie	a563f611c3	radv: set prim_id for geometry shaders Noticed in passing. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-26 08:45:22 +10:00
Dave Airlie	4042892cee	radv: set use_prim_id for tess shaders correctly. Just noticed in passing. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-26 08:45:14 +10:00
Marek Olšák	0f827b51c0	radeonsi/gfx9: fix TC-compatible stencil compression Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-06-19 20:15:36 +02:00
Marek Olšák	064f07fef3	ac/sid.h: don't use parentheses in PKT3_RELEASE_MEM definition The parser skips the line if it contains parentheses. Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-06-19 20:15:36 +02:00
Marek Olšák	ed291cea3d	ac: parse EVENT_WRITE_EOP, RELEASE_MEM, WAIT_REG_MEM, NOWHERE Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-06-19 20:15:36 +02:00
Nicolai Hähnle	67e49a7f65	amd/common: fix off-by-one in sid_tables.py The very last entry in the sid_strings_offsets table ended up missing, leading to out-of-bounds reads and potential crashes. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-06-19 12:03:59 +02:00
Emil Velikov	84bf7e5ad6	ac: resolve conflicts introduced with "ac: remove amdgpu.h dependency" The commit did not add the relevant includes - in particular stdint.h and stdbool.h for the respective standard types. At the same time, the amdgpu_device_handle typedef redeclaration was off. Fixes: `81945ded0d` ("ac: remove amdgpu.h dependency") Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=101471 Cc: Mark Janes <mark.a.janes@intel.com> Cc: Gregor Münch <gr.muench@gmail.com> Reported-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reported-by: Mark Janes <mark.a.janes@intel.com> Reported-by: Gregor Münch <gr.muench@gmail.com> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-06-17 11:37:51 +01:00
Emil Velikov	81945ded0d	ac: remove amdgpu.h dependency Add a couple of forward declarations and drop the amdgpu.h requirement. With this we can build the r300 and r600 drivers without the need for amdgpu. v2: - Add amdgpu.h include in the C file (Marek) - Add a comment about pre C11 typedef redeclaration warning (Eric) Cc: Nicolai Hähnle <nicolai.haehnle@amd.com> Cc: Marek Olšák <marek.olsak@amd.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=101189 Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2017-06-16 12:41:44 +01:00
Dave Airlie	95c0591087	ac/gpu: drop duplicated code line. has_hw_decode is assigned twice. Pointed out by coverity. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-13 10:01:40 +10:00
Grazvydas Ignotas	19f6cc3cba	ac/nir: remove another unused variable Declared by each loop already. Trivial. Signed-off-by: Grazvydas Ignotas <notasas@gmail.com>	2017-06-08 00:02:42 +03:00
Grazvydas Ignotas	7dfa54399c	ac/nir: convert several ifs to a switch Also solve "outinfo may be used uninitialized" warning by putting in an unreachable(). Signed-off-by: Grazvydas Ignotas <notasas@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-06-08 00:02:26 +03:00
Grazvydas Ignotas	ae3262c1f2	ac/nir: mark some arguments const Most functions are only inspecting nir, so nir related arguments can be marked const. Some more can be done if/when some nir changes are accepted. Signed-off-by: Grazvydas Ignotas <notasas@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-06-08 00:02:02 +03:00
Dave Airlie	1ec4f008a2	ac/nir: move gpr counting inside argument handling. This just moves this code in here so it's cleaner. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-07 06:00:30 +01:00
Dave Airlie	7b46e2a74b	ac/nir: assign argument param pointers in one place. Instead of having the fragile code to do a second pass, just give the pointers you want params in to the initial code, then call a later pass to assign them. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-07 06:00:23 +01:00
Dave Airlie	b19cafd441	ac/nir: consolidate setting userdata location Just pass a pointer and increment inside the function, makes the code less error prone. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-07 05:59:57 +01:00
Eric Engestrom	63a8a88ac4	tree-wide: remove trailing backslash Simple search for a backslash followed by two newlines. If one of the newlines were to be removed, this would cause issues, so let's just remove these trailing backslashes. Signed-off-by: Eric Engestrom <eric.engestrom@imgtec.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2017-06-07 01:18:09 +01:00
Bas Nieuwenhuizen	ecdace80f4	ac/surface: Fix HTILE for radv. We always compute HTILE size using addrlib, even when not TC compatible. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlied <airlied@redhat.com>	2017-06-06 03:17:02 +02:00
Dave Airlie	0063da8393	radv: add some misc gfx9 pieces. This just adds the strings and includes the gfx9 register defs in some files that we need them in. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-06 09:43:21 +10:00
Nicolai Hähnle	dfc06d2fac	radv: use ac_surface data structures This is mostly mechanical changes of renaming types and introducing "legacy" everywhere. It doesn't use the ac_surface computation functions yet. Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-06-05 10:44:09 +10:00
Nicolai Hähnle	e07d5c7296	ac/surface/gfx6: explicitly support S8 surfaces This is needed by radv for dEQP-VK.renderpass.simple.stencil Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-06-05 10:43:29 +10:00
Dave Airlie	72f0830ecd	ac/nir: set workgroup size attribute to correct value. This ports: `55445ff189` from radeonsi radeonsi: tell LLVM not to remove s_barrier instructions LLVM 5.0 removes s_barrier instructions if the max-work-group-size attribute is not set. What a surprise. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-05 01:37:44 +01:00
Dave Airlie	68c812f699	ac: add new helper function to add an integer target dependent function attr. This is needed to add the max workgroup size attribute. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-05 01:37:29 +01:00
Leo Liu	ea79c0440c	amd/common: set vcn dec as hw decode as well Recommit after issue resolved by the previous patch. Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2017-05-29 14:32:29 -04:00
Leo Liu	0abc24723c	amd/common: add vcn dec ip info query for amdgpu version 3.17 Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-05-29 14:32:29 -04:00
Marek Olšák	e019ea8f4b	radeonsi: move building llvm.SI.load.const into ac_build_buffer_load Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-05-29 01:52:16 +02:00
Marek Olšák	e1942c970f	radeonsi: rename readonly_memory -> can_speculate This is more accurate. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-05-29 01:52:16 +02:00
Dave Airlie	e1409f7302	Revert "amd/common: add vcn dec ip info query" This reverts commit `524d4fff9e`. This commit breaks amdgpu on kernels with no DEC IP support. Caught by the airlied CI system.	2017-05-26 16:36:57 +10:00
Dave Airlie	ae1f32915b	Revert "amd/common: set vcn dec as hw decode as well" This reverts commit `50d322be2f`. A previous patch breaks amdgpu on non-vcn decode systems, but have to revert this first.	2017-05-26 16:36:38 +10:00
Leo Liu	50d322be2f	amd/common: set vcn dec as hw decode as well Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2017-05-25 11:40:20 -04:00
Leo Liu	524d4fff9e	amd/common: add vcn dec ip info query Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2017-05-25 11:40:20 -04:00
Leo Liu	c23ffafc50	radeon: rename has_uvd info to has_hw_decode Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2017-05-25 11:40:20 -04:00
Christian König	5318870f54	winsys/amdgpu: align VA allocations to fragment size v2 BOs larger than the minimum fragment size should have their VA aligned to at least the fragment size for optimal performance. v2: drop unused leftover from initial implementation Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-05-24 10:32:19 +02:00

323 commits