fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-06-05 10:48:24 +02:00

Author	SHA1	Message	Date
Marek Olšák	2b18d67a1e	gallium/radeon: remove dead code creating LLVMTargetMachine This was for some old unsupported LLVM version. Only si_create_context creates the target machine now. r600g doesn't use this function. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-06-08 19:23:42 +02:00
Marek Olšák	a343ab55f7	radeonsi: don't enable scratch just for SGPR spills Diff from shader-db: Scratch: 3221504 -> 17408 (-99.46 %) bytes per wave v2: add "break;" Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-06-08 19:23:41 +02:00
Marek Olšák	95288277d5	Revert "radeonsi: allow direct hw MSAA resolve for scanout surfaces" This reverts commit `ffd54d1936`. No, it doesn't work. The test case is "glxgears -samples 2".	2016-06-08 19:21:55 +02:00
Marek Olšák	f39439d166	radeonsi: re-enable PBO ReadPixels acceleration disabled by `4f1cccf570` Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-06-08 00:22:45 +02:00
Marek Olšák	7c6e88b643	radeonsi: allow MSAA resolving into a texture that has DCC enabled Since DCC is enabled almost everywhere now, it's important not to disable this fast path. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-06-08 00:22:45 +02:00
Marek Olšák	9a472a3e0b	gallium/radeon: move DCC clearing into a separate function Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-06-08 00:22:45 +02:00
Marek Olšák	ffd54d1936	radeonsi: allow direct hw MSAA resolve for scanout surfaces No idea why this was disabled, but it works fine. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-06-08 00:22:45 +02:00
Marek Olšák	4be46c7d9d	radeonsi: don't allocate DCC for the temporary MSAA resolve surface Allocating it has no effect, but it adds overhead (useless DCC clear). Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-06-08 00:22:45 +02:00
Marek Olšák	c06246501e	radeonsi: don't enable DCC in the sampler if first_level doesn't have it If first_level > 0 and DCC is disabled for that level, let's skip DCC reads entirely. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-06-08 00:22:45 +02:00
Marek Olšák	00389100b6	winsys/amdgpu: enable DCC for mipmapped textures Also add dcc_fast_clear_size for clearing only the necessary subset of DCC. For no AA, it's equal to the size of the whole DCC level. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-06-08 00:22:45 +02:00
Marek Olšák	c65361763c	gallium/radeon: don't disable DCC because of SDMA We want to keep DCC enabled to save bandwidth. It was a bad idea to disable it here. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-06-08 00:22:45 +02:00
Marek Olšák	2fd74a05bb	radeonsi: don't flag renderbuffer feedback loop if DCC has just been disabled Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-06-08 00:22:45 +02:00
Marek Olšák	aa7fe70443	radeonsi: add per-level dcc_enabled flags Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-06-08 00:22:45 +02:00
Marek Olšák	60e93ddd06	radeonsi: compute DCC register parameters in si_emit_framebuffer_state This will get more complicated with mipmapped DCC or when DCC is enabled after allocation. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-06-08 00:22:45 +02:00
Marek Olšák	a01536a29f	gallium/radeon: add an assertion checking the validity of PIPE_BIND_SCANOUT Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-06-08 00:22:45 +02:00
Marek Olšák	d4d733e39d	gallium/radeon: don't allocate DCC for non-renderable texture formats R9G9B9E5 is the only uncompressed one hopefully. This fixes incorrect rendering not discovered (due to a lack of tests) until DCC mipmapping was enabled. Cc: 11.1 11.2 12.0 <mesa-stable@lists.freedesktop.org> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-06-08 00:22:45 +02:00
Nicolai Hähnle	b42bc90b6a	radeonsi: enable WQM in PS prolog when needed WQM is needed when the PS prolog computes a VGPR that is consumed by a shader with (implicit or explicit) derivatives. Depends on http://reviews.llvm.org/D20839 / LLVM r272063 for this to be effective (otherwise it's just a no-op). Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=95130 Cc: 12.0 <mesa-dev@lists.freedesktop.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-07 23:46:02 +02:00
Nicolai Hähnle	d3a584defe	tgsi/scan: add uses_derivatives (v2) v2: - TG4 does not calculate derivatives (Ilia) - also handle SAMPLE* instructions (Roland) Cc: 12.0 <mesa-stable@lists.freedesktop.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> (v1) Reviewed-by: Brian Paul <brianp@vmware.com> (v1) Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2016-06-07 23:45:17 +02:00
Tim Rowley	87f0a0448f	swr: fix provoking vertex Use rasterizer provoking vertex API. Fix rasterizer provoking vertex for tristrips and quad list/strips. v2: make provoking vertex tables static const Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2016-06-07 11:47:52 -05:00
Ilia Mirkin	71ad8a173f	gk104/ir: fix conditions for adding a texbar Sometimes a register source can actually be double- or even quad-wide. We must make sure that the inserted texbars take that width into account. Based on an earlier patch by Samuel Pitoiset. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Cc: "12.0 11.2" <mesa-stable@lists.freedesktop.org>	2016-06-07 10:18:13 -04:00
Nicolai Hähnle	8239da28e8	radeonsi: keep track of dirty descriptor sets Reduces CPU load for draw calls that change none or few of the descriptors. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-07 15:18:10 +02:00
Nicolai Hähnle	d152c73712	radeonsi: move si_descriptors into a per-context array Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-07 15:18:07 +02:00
Nicolai Hähnle	a29c4f9ebd	radeonsi: pass shader stage to si_disable_shader_image Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-07 15:18:05 +02:00
Nicolai Hähnle	4e0fb72786	radeonsi: access descriptor sets via local variables This will simplify moving them to a per-context array. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-07 15:18:02 +02:00
Nicolai Hähnle	ba4a2840c7	radeonsi: add si_set_rw_buffer to be used for internal descriptors So that callers outside of si_descriptors.c need to worry less about the details of descriptor handling. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-07 15:17:59 +02:00
Nicolai Hähnle	c615a055f4	radeonsi: pass shader stage to si_set_shader_image Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-07 15:17:57 +02:00
Nicolai Hähnle	e6612a3e68	radeonsi: pass shader stage to si_set_sampler_view Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-07 15:17:55 +02:00
Nicolai Hähnle	c32cd4b78d	radeonsi: move descriptor set begin_new_cs handling into a separate function Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-07 15:17:39 +02:00
Nicolai Hähnle	031b57bc2f	radeonsi: move enabled_mask out of si_descriptors This mask is irrelevant for the generic descriptor set handling, and having it outside simplifies subsequent changes slightly. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-07 15:17:23 +02:00
Marek Olšák	095803a37a	gallium/radeon: add support for sharing textures with DCC between processes v2: use a function for calculating WORD1 of bo metadata Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-06-07 11:12:26 +02:00
Marek Olšák	9e5b5fbde0	gallium/radeon: don't discard DCC if an external user can write to it We don't import textures with DCC now, but soon we will. v2: if we can't disable DCC for image writes, at least decompress DCC at bind time Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-06-07 11:12:26 +02:00
Dave Airlie	c6b14bafa4	i915: fix typo CAP. Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-06-07 18:31:14 +10:00
Ilia Mirkin	704bc0f0e9	nvc0: add support for VOTE tgsi opcodes Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2016-06-06 20:49:29 -04:00
Ilia Mirkin	edfa7a4b25	gallium: add PIPE_CAP_TGSI_VOTE for when the VOTE ops are allowed Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Dave Airlie <airlied@redhat.com>	2016-06-06 20:49:29 -04:00
Ilia Mirkin	30684b50d7	gallium: add VOTE_* opcodes to implement GL_ARB_shader_group_vote Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Dave Airlie <airlied@redhat.com>	2016-06-06 20:49:28 -04:00
Samuel Pitoiset	08ddfe7b2f	nv50/ir: use round toward 0 when converting doubles to integers Like floats, we should use the round toward 0 mode instead of the nearest one (which is the default) for doubles to integers. This fixes all arb_gpu_shader_fp64 piglits which convert doubles to integers (16 tests). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "11.2 12.0" <mesa-stable@lists.freedesktop.org>	2016-06-06 22:56:04 +02:00
Marek Olšák	00e6899ae5	gallium/radeon: don't re-set BO metadata after CMASK deallocation CMASK has no effect on metadata, because it's not sharable. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-06-06 22:50:55 +02:00
Marek Olšák	991cbfcb14	radeonsi: add a performance tweak for 4 SE parts Ported from Vulkan. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-06-06 22:50:55 +02:00
Marek Olšák	2802310c25	radeonsi: simplify PRIMGROUP_SIZE computation for tessellation Ported from Vulkan. v2: keep the comment Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-06-06 22:50:55 +02:00
Marek Olšák	014c8ec770	r600g: use hw MSAA resolve for non-trivial resolves This improves MSAA resolve performance. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-06-06 22:50:55 +02:00
Marek Olšák	6b449783f6	radeonsi: use hw MSAA resolve for non-trivial resolves This improves MSAA resolve performance. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-06-06 22:50:55 +02:00
Nicolai Hähnle	ec2b52e2d9	radeonsi: set descriptor dirty mask on shader buffer unbind Found randomly while skimming the code. This might have caused VM faults in robustness tests. Cc: 12.0 <mesa-stable@lists.freedesktop.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-06 21:43:18 +02:00
Charmaine Lee	627e975896	tgsi: fix mixed data type comparison in tgsi_point_sprite.c Cast the unsigned semantic index to integer datatype before comparing to max_generic, otherwise, max_generic which is initialized to -1 will be converted to unsigned int before the comparison, causing a wrong semantic index to be assigned to a shader output. Fixes the assert running TurboCAD_gl.trace. (VMware bug 1667265) Also tested with glretrace, mesa demos pointblast, spriteblast and pointcoord. v2: use the original max_generic variable but add the (int) cast to the semantic index, as suggested by Brian. Reviewed-by: Brian Paul <brianp@vmware.com>	2016-06-06 10:20:45 -06:00
Charmaine Lee	304b5a1446	svga: print shader linkage info when tgsi debug bit is on When TGSI debug flag is enabled, print the shader linkage info as well. Tested with mesa demos with SVGA_DEBUG=tgsi Reviewed-by: Brian Paul <brianp@vmware.com>	2016-06-06 10:20:45 -06:00
Lars Hamre	4163c71010	tgsi: use truncf in micro_trunc Switches to using truncf in micro_trunc. Fixes the following piglit tests (for softpipe): /spec/glsl-1.30/execution/built-in-functions/... fs-trunc-float fs-trunc-vec2 fs-trunc-vec3 fs-trunc-vec4 vs-trunc-float vs-trunc-vec2 vs-trunc-vec3 vs-trunc-vec4 /spec/glsl-1.50/execution/built-in-functions/... gs-trunc-float gs-trunc-vec2 gs-trunc-vec3 gs-trunc-vec4 Signed-off-by: Lars Hamre <chemecse@gmail.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2016-06-06 15:56:28 +02:00
Ilia Mirkin	092ec3920f	nv50,nvc0: fix BGR10_A2UI vertex format This is mostly academic as this is not reachable from GL, which only has the packed RGB10_A2UI vertex format. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "12.0" <mesa-stable@lists.freedesktop.org>	2016-06-05 15:13:46 -04:00
Samuel Pitoiset	be365f34f0	nvc0: do not clear surfaces bins in the validate function We should not call nouveau_bufctx_reset() inside a validate function. This only affects Fermi where images are aliased between 3D and CP. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "12.0" <mesa-stable@lists.freedesktop.org>	2016-06-05 19:02:59 +02:00
Samuel Pitoiset	43d3ecfb33	nvc0: re-validate images after launching a grid on Fermi Images invalidation is a bit weird on Fermi and there is already a hack which forces invalidating all images when launching a computer shader to help in fixing 3D<->CP interaction. However, we need to re-validate images for compute because nvc0_compute_invalidate_surfaces() will destroy the previous binding. This is not really good for performance purposes but this might be improved later. This fixes the following piglits: - spec/arb_compute_shader/execution/basic-uniform-access - spec/arb_compute_shader/execution/mutiple-texture-reading - spec/arb_compute_shader/execution/multiple-workgroups - spec/glsl-4.30/execution/built-in-functions/cs-* (207 tests) Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "12.0" <mesa-stable@lists.freedesktop.org>	2016-06-05 18:48:02 +02:00
Marek Olšák	3b44864ab7	radeonsi: fix images with level > 0 This should fix spec@arb_shader_image_load_store@level. Broken by: Commit: `95c5bbae66` radeonsi: set some image descriptor fields at bind time Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-06-05 17:00:14 +02:00
Ilia Mirkin	fd6bbc2ee2	nvc0: reduce overhead from always marking images dirty We would revalidate images when anything was touched at all. Which is unfortunate, since the state tracker does not use CSO's to reduce the workload. So instead implement a protocol to ensure that something has changed before revalidating all the images. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "12.0" <mesa-stable@lists.freedesktop.org>	2016-06-04 23:50:56 -04:00

1 2 3 4 5 ...

27608 commits