fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-17 07:28:05 +02:00

Author	SHA1	Message	Date
Marek Olšák	a2e9d9b4c1	ac/gpu_info: add has_read_registers_query Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-05-10 18:40:11 -04:00
Marek Olšák	9b1fdfc541	ac/gpu_info: add has_2d_tiling Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-05-10 18:40:10 -04:00
Marek Olšák	d26696283d	ac/gpu_info: add has_sparse_vm_mappings Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-05-10 18:40:08 -04:00
Marek Olšák	125adc92ad	ac/gpu_info: add has_unaligned_shader_loads Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-05-10 18:40:07 -04:00
Marek Olšák	e9c08bc658	ac/gpu_info: add has_indirect_compute_dispatch Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-05-10 18:40:03 -04:00
Marek Olšák	64265ac8d5	ac/gpu_info: add kernel_flushes_tc_l2_after_ib Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-05-10 18:40:01 -04:00
Marek Olšák	14c5a93bfa	ac/gpu_info: add has_format_bc1_through_bc7 Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-05-10 18:40:00 -04:00
Marek Olšák	2bd2c173e8	ac/gpu_info: add has_eqaa_surface_allocator Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-05-10 18:39:58 -04:00
Marek Olšák	e720cb6135	radeonsi: clean up the reset status query implementation Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-05-10 18:39:57 -04:00
Marek Olšák	3060f62340	ac/gpu_info: add has_bo_metadata Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-05-10 18:39:56 -04:00
Marek Olšák	09f1bab483	ac/gpu_info: add si_TA_CS_BC_BASE_ADDR_allowed Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-05-10 18:39:54 -04:00
Marek Olšák	8b58a14ef7	ac/gpu_info: add htile_cmask_support_1d_tiling Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-05-10 18:39:53 -04:00
Marek Olšák	b81149e258	ac/gpu_info: add kernel_flushes_hdp_before_ib Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-05-10 18:39:47 -04:00
Marek Olšák	912b0163dc	ac/surface: add EQAA support Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-05-10 18:34:31 -04:00
Marek Olšák	bdc3e410f7	ac/surface: unify common legacy and gfx9 fmask fields Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-05-10 18:26:33 -04:00
Marek Olšák	9bf3570fed	ac/surface/gfx6: compute FMASK together with the color surface instead of invoking FMASK computation separately. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-05-10 18:26:33 -04:00
Marek Olšák	276acda835	ac/surface/gfx9: fix a typo in CMASK RB/pipe alignment No change in behavior because it's always aligned. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-05-10 18:26:32 -04:00
Marek Olšák	6841845b00	ac: set correct LLVM processor names for Raven & Vega12 Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-05-10 18:26:32 -04:00
Marek Olšák	6f7f10d285	ac: sort raster configs Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-05-10 18:26:32 -04:00
Marek Olšák	e7b82a9978	ac: remove 1 RB raster config for Iceland Iceland always reports 2 RBs. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-05-10 18:26:32 -04:00
Marek Olšák	cb0f5cddcc	ac: move the Fiji kernel workaround for raster config out of the switch Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-05-10 18:26:32 -04:00
Marek Olšák	ce954ac6f3	ac: enable both RBs on Kaveri This can result in 2x increase in performance on non-harvested Kaveris. v2: don't do it on radeon Tested-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-05-10 18:26:32 -04:00
Nicolai Hähnle	c0acb596f4	amd/common: use llvm.amdgcn.wqm for explicit derivatives To comply with an upcoming change in LLVM, see https://reviews.llvm.org/D46051 Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-05-04 11:02:48 +02:00
Dave Airlie	8d3529872c	ac/nir: expand 64-bit vec3 loads to fix shuffling. If loading 64-bit vec3 values, a 4 component load would be followed by a 2 component load and the resulting shuffle would fail as it requires 2 4 components. This just expands the second results vector out to 4 components. This fixes 100 CTS tests: dEQP-VK.spirv_assembly.type.vec3.64 Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-05-01 05:58:14 +10:00
Marek Olšák	43f0a10051	radeonsi: add triple into si_compiler Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Tested-by: Benedikt Schemmer <ben at besd.de> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-27 17:56:04 -04:00
Marek Olšák	174e11c3f5	ac/surface: handle DCC subresource fast clear restriction on VI v2: require the previous level to be clearable for determining whether the last unaligned level is clearable Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-27 17:56:04 -04:00
Samuel Pitoiset	d38425ce87	ac: fix texture query LOD for 1D textures on GFX9 1D textures are allocated as 2D which means we only need one coordinate for texture query LOD. Fixes: `625dcbbc45` ("amd/common: pass address components individually to ac_build_image_intrinsic") Cc: 18.1 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-27 11:15:35 +02:00
Dave Airlie	a90c9f33cf	ac/radv/radeonsi: refactor harvest config register getters. This refactors the code out to share it between radv and radeonsi. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Acked-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-04-24 09:08:34 +10:00
Dave Airlie	f77caa7411	ac/radv/radeonsi: refactor max simd waves into common code. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-04-24 09:08:33 +10:00
Dave Airlie	899df55ee0	ac/radv/radeonsi: refactor raster_config default values getters. This just makes this common code between the two drivers. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-04-24 09:07:51 +10:00
Dave Airlie	5e2ef28390	ac/info: move gs table depth to common code. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-04-24 09:05:38 +10:00
Samuel Pitoiset	d136a5fad9	ac: fix the number of coordinates for ac_image_get_lod and arrays This fixes crashes for the following CTS: dEQP-VK.glsl.texture_functions.query.texturequerylod.* Cubemaps are the same as 2D arrays. Fixes: `625dcbbc45` ("amd/common: pass address components individually to ac_build_image_intrinsic") Cc: 18.1 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-23 21:48:38 +02:00
Samuel Pitoiset	e37e643589	ac: teach get_ac_sampler_dim() about subpass attachments Suggested by Nicolai. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-23 19:10:56 +02:00
Samuel Pitoiset	84fef802fb	ac/nir: add missing round_slice for 1D arrays This fixes a bunch of CTS fails with 1D arrays: dEQP-VK.glsl.texture_functions.texture.sampler1darray_ Fixes: `625dcbbc45` ("amd/common: pass address components individually to ac_build_image_intrinsic") Cc: 18.1 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-23 19:10:52 +02:00
Samuel Pitoiset	8f13975713	ac/nir: fix image dimension for subpass attachments For subpass attachments we need one more coordinate with the layer, so make them array types. This fixes a bunch of CTS fails with RADV. Fixes: `24fb3e6aa1` ("ac/nir: use ac_build_image_opcode for image intrinsics") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-04-20 18:44:51 +02:00
Samuel Pitoiset	dd069e9b41	ac/nir: handle nir_intrinsic_load_first_vertex like base_vertex This fixes a ton of CTS crashes. Fixes: `c366f422f0` ("nir: Offset vertex_id by first_vertex instead of base_vertex") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-04-20 17:07:38 +02:00
Nicolai Hähnle	24fb3e6aa1	ac/nir: use ac_build_image_opcode for image intrinsics So that we'll use the dimension-aware intrinsics in the future. Acked-by: Marek Olšák <marek.olsak@amd.com>	2018-04-20 09:30:07 +02:00
Nicolai Hähnle	74063431f1	radeonsi: generate image load/store/atomic ops using ac_build_image_opcode In preparation of dimension-aware LLVM image intrinsics. Acked-by: Marek Olšák <marek.olsak@amd.com>	2018-04-20 09:29:57 +02:00
Nicolai Hähnle	625dcbbc45	amd/common: pass address components individually to ac_build_image_intrinsic This is in preparation for the new image intrinsics. Acked-by: Marek Olšák <marek.olsak@amd.com>	2018-04-20 09:23:52 +02:00
Nicolai Hähnle	f931583828	amd/common: pass new enum ac_image_dim to ac_build_image_opcode This is in preparation for the new, dimension-aware LLVM image intrinsics. Acked-by: Marek Olšák <marek.olsak@amd.com>	2018-04-20 09:23:40 +02:00
Nicolai Hähnle	a807a9b215	ac/nir: fix atomic compare-and-swap The LLVM instruction returns { i32, i1 }, where the i1 indicates success. We're only interested in the first part, which is the loaded value. Fixes dEQP-GLES31.functional.compute.shared_var.atomic.compswap.* Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-04-20 09:21:40 +02:00
Marek Olšák	c6f1d36019	radeonsi: add support for VegaM Acked-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-18 14:45:33 -04:00
Bas Nieuwenhuizen	b0e3a9b19f	ac/nir: Make the GFX9 buffer size fix apply to image loads/atomics too. No clue how I missed those ... Fixes: `4503ff760c` "ac/nir: Add workaround for GFX9 buffer views." CC: <mesa-stable@lists.freedesktop.org> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=105320 Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-16 11:55:48 +02:00
Daniel Schürmann	4b0616e533	ac: handle subgroup intrinsics Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-04-14 01:03:15 +02:00
Daniel Schürmann	d5f7ebda3e	ac: add LLVM build functions for subgroup instrinsics Co-authored-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-04-14 01:03:09 +02:00
Daniel Schürmann	d19f20e793	ac: make ballot and umsb capable of 64bit inputs Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-04-14 00:52:22 +02:00
Marek Olšák	a3b785be4d	winsys/amdgpu: allow local BOs on APUs Local BOs ignore BO priorities, and we don't need those on APUs. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-04-13 12:31:04 -04:00
Bas Nieuwenhuizen	7eff8d7d35	ac/surface: Allow S swizzle for displayable surfaces. For dcn1 && < 64 bpp displayable surfaces, addrlib only accepts S swizzles. At the same time addrlib prefers D swizzles is allowed, so we can just allow S swizzles as fallback. Fixes: `b64b712558` "ac/surface/gfx9: request desired micro tile mode explicitly" Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-04-12 21:24:55 +02:00
Nicolai Hähnle	0630e52c9e	radeonsi: pass -O halt_waves to umr for hang debugging This will give us meaningful wave information in the case of a hang where shaders are still running in an infinite loop. Note that we call umr multiple times for different sections of the ddebug hang dump, and so the wave information will not necessarily match up between sections. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-04-11 12:44:24 +02:00
Marek Olšák	e29facff31	ac/surface: don't set the display flag for obviously unsupported cases (v2) This enables the tile swizzle for some cases of the displayable micro mode, and it also fixes an addrlib assertion failure on Vega. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2018-04-10 13:06:03 -04:00

1 2 3 4 5 ...

891 commits