fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-30 05:18:16 +02:00

Author	SHA1	Message	Date
Marek Olšák	3fbf250dfa	gallium/pb_bufmgr_cache: use the new pb_cache module Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> Acked-by: Michel Dänzer <michel.daenzer@amd.com>	2015-12-11 15:25:12 +01:00
Marek Olšák	2b396eeed9	gallium/pb_cache: add a copy of cache bufmgr independent of pb_manager This simplified (basically duplicated) version of pb_cache_manager will allow removing some ugly hacks from radeon and amdgpu winsyses and flatten simplify their design. The difference is that winsyses must manually add buffers to the cache in "destroy" functions and the cache doesn't know about the buffers before that. The integration is therefore trivial and the impact on the winsys design is negligible. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> Acked-by: Michel Dänzer <michel.daenzer@amd.com>	2015-12-11 15:25:12 +01:00
Marek Olšák	1a24f443b4	radeonsi: implement fast stencil clear Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2015-12-11 15:25:12 +01:00
Marek Olšák	8ee96ce834	radeonsi: re-enable Hyper-Z for stencil Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2015-12-11 15:25:12 +01:00
Marek Olšák	99e63338fb	r600g: remove a Hyper-Z workaround that's likely not needed anymore FORCE_OFF == 0, no need to set that Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2015-12-11 15:25:12 +01:00
Marek Olšák	96e8d38ac4	r600g: re-enable Hyper-Z for stencil on Evergreen & Cayman Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2015-12-11 15:25:12 +01:00
Marek Olšák	d3c08309ab	gallium/radeon: fix Hyper-Z hangs by programming PA_SC_MODE_CNTL_1 correctly This is the recommended setting according to hw people and it makes Hyper-Z stable. Just the two magic states. This fixes Evergreen, Cayman, SI, CI, VI (using the Cayman code). Cc: 11.0 11.1 <mesa-stable@lists.freedesktop.org> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2015-12-11 15:25:12 +01:00
Marek Olšák	7c29bf26bb	radeonsi: don't use the CP DMA workaround on Fiji and newer Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2015-12-11 15:25:12 +01:00
Marek Olšák	787ada6bf6	radeonsi: apply the streamout workaround to Fiji as well Cc: 11.0 11.1 <mesa-stable@lists.freedesktop.org> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2015-12-11 15:25:12 +01:00
Marek Olšák	62d82193b8	radeonsi: also print hexadecimal values for register fields in the IB parser Reviewed-by: Michel Dänzer <michel.daenzer@amd.com Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com>	2015-12-11 15:25:12 +01:00
Marek Olšák	de887ba90c	radeonsi: implement RB+ for Stoney (v2) v2: fix dual source blending Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2015-12-11 15:25:12 +01:00
Marek Olšák	0f9519b938	radeonsi: don't call of u_prims_for_vertices for patches and rectangles Both caused a crash due to a division by zero in that function. This is an alternative fix. Cc: 11.0 11.1 <mesa-stable@lists.freedesktop.org> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com>	2015-12-11 15:25:12 +01:00
Marek Olšák	51603af390	radeonsi: use tgsi_shader_info::colors_written Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2015-12-11 15:25:11 +01:00
Marek Olšák	b5b87c4ed1	r600g: write all MRTs only if there is exactly one output (fixes a hang) This fixes a hang in piglit/arb_blend_func_extended-fbo-extended-blend-pattern_gles2 on REDWOOD. Cc: 11.0 11.1 <mesa-stable@lists.freedesktop.org> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2015-12-11 15:25:11 +01:00
Marek Olšák	eb4813a952	tgsi/scan: add flag colors_written This is a prerequisite for the following r600g fix. Cc: 11.0 11.1 <mesa-stable@lists.freedesktop.org> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2015-12-11 15:25:11 +01:00
Marek Olšák	37208c4fd7	Revert "radeonsi: disable DCC on Stoney" This reverts commit `32f05fadbb`. It turned out the problem with Stoney was caused by incorrect handling of a non-power-two VRAM size in the kernel driver. This is an optional BIOS setting and can be worked around by choosing a different VRAM size in the BIOS. Cc: 11.1 <mesa-stable@lists.freedesktop.org>	2015-12-11 15:25:11 +01:00
Roland Scheidegger	64c59b0624	draw: fix clipping with linear interpolated values and gl_ClipVertex Discovered this when working on other clip code, apparently didn't work correctly - the combination of linear interpolated values and using gl_ClipVertex produced wrong values (failing all such combinations in piglits glsl-1.30 interpolation tests, named interpolation-noperspective-XXX-vertex). Use the pre-clip-pos values when determining the interpolation factor to fix this. Noone really understands this code well, but everybody agrees this looks sane... This fixes all those failing tests (10 in total) both with the llvm and non-llvm draw paths, with no piglit regressions. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2015-12-11 02:21:39 +01:00
Dave Airlie	5362e53a06	r600: add missing return value check. Pointed out by coverity scan. Signed-off-by: Dave Airlie <airlied@redhat.com>	2015-12-11 09:37:20 +10:00
Jason Ekstrand	78b81be627	nir: Get rid of _indirect variants of input/output load/store intrinsics There is some special-casing needed in a competent back-end. However, they can do their special-casing easily enough based on whether or not the offset is a constant. In the mean time, having the _indirect variants adds special cases a number of places where they don't need to be and, in general, only complicates things. To complicate matters, NIR had no way to convdert an indirect load/store to a direct one in the case that the indirect was a constant so we would still not really get what the back-ends wanted. The best solution seems to be to get rid of the _indirect variants entirely. This commit is a bunch of different changes squashed together: - nir: Get rid of _indirect variants of input/output load/store intrinsics - nir/glsl: Stop handling UBO/SSBO load/stores differently depending on indirect - nir/lower_io: Get rid of load/store_foo_indirect - i965/fs: Get rid of load/store_foo_indirect - i965/vec4: Get rid of load/store_foo_indirect - tgsi_to_nir: Get rid of load/store_foo_indirect - ir3/nir: Use the new unified io intrinsics - vc4: Do all uniform loads with byte offsets - vc4/nir: Use the new unified io intrinsics - vc4: Fix load_user_clip_plane crash - vc4: add missing src for store outputs - vc4: Fix state uniforms - nir/lower_clip: Update to the new load/store intrinsics - nir/lower_two_sided_color: Update to the new load intrinsic NIR and i965 changes are Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> NIR indirect declarations and vc4 changes are Reviewed-by: Eric Anholt <eric@anholt.net> ir3 changes are Reviewed-by: Rob Clark <robdclark@gmail.com> NIR changes are Acked-by: Rob Clark <robdclark@gmail.com>	2015-12-10 12:25:16 -08:00
Patrick Rudolph	79bff488bc	gallium/util: return correct number of bound vertex buffers In case a state tracker unbinds every slot by a seperate pipe->set_vertex_buffers() call, starting from slot zero, the number of bound buffers would not reach zero at all. The current algorithm does not account for pre-existing holes in the buffer list. Unbinding all buffers at once or starting at the top-most slot results in correct behaviour. Calculating the correct number of bound buffers fixes a NULL pointer dereference in nvc0_validate_vertex_buffers_shared(). Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=93004 Signed-off-by: Patrick Rudolph <siro@das-labor.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "11.0 11.1" <mesa-stable@lists.freedesktop.org>	2015-12-10 13:55:53 -05:00
Michel Dänzer	b4a03e7f8f	clover: Fix build against LLVM 3.8 SVN >= r255078 Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2015-12-10 10:45:29 +09:00
Serge Martin	2b930327e8	freedreno: little clean up in fd_create_surface in order to avoid returing invalid adress if CALLOC_STRUCT return NULL. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-12-09 17:32:41 -05:00
Serge Martin	0149e7a944	freedreno: change to goto fail in fd_resource_transfer_map, like the others error cases Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-12-09 17:31:16 -05:00
Serge Martin	e63fec29a1	freedreno: fix bind_sampler_states when hwcso is NULL src/gallium/tests/trivial/compute.c expects samplers to be cleaned when the samplers list is NULL. Like in radeon, the function behave like when the number of samplers parameter is set to 0. [small s/hwsco/hwcso/ typo fix] Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-12-09 17:30:58 -05:00
Edward O'Callaghan	f32f80e19d	gallium/util: Make u_prims_for_vertices() safe Let us avoid trapping in hardware from a SIGFPE and instead assert on a zero divisor. Hint: This can occur if a PIPE_PRIM_? is not handled in u_prim_vertex_count() that results in ' info ' not being initialized in the expected manner. Further, we also fix a possibly NULL pointer dereference from ' info ' being NULL from a u_prim_vertex_count() call. Signed-off-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2015-12-09 22:51:56 +01:00
Patrick Rudolph	432a798cf5	nv50,nvc0: fix use-after-free when vertex buffers are unbound Always reset the vertex bufctx to make sure there's no pointer to an already freed pipe_resource left after unbinding buffers. Fixes use after free crash in nvc0_bufctx_fence(). Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=93004 Signed-off-by: Patrick Rudolph <siro@das-labor.org> [imirkin: simplify nvc0 fix, apply to nv50] Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "11.0 11.1" <mesa-stable@lists.freedesktop.org>	2015-12-09 13:38:15 -05:00
Andreas Boll	9246df2280	st/osmesa: Fix a typo in a comment s/suport/support/ Signed-off-by: Andreas Boll <andreas.boll.dev@gmail.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2015-12-09 18:29:18 +01:00
Brian Paul	aa9af32752	svga: initialize pipe_driver_query_info entries with a macro To be safe, set all the fields in case the enums ordering/values ever change. Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2015-12-09 09:43:47 -07:00
Dave Airlie	e307cfa7d9	radeonsi: handle loading doubles as geometry shader inputs. This adds the double code to the geometry shader input handling. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Cc: "11.0 11.1" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2015-12-09 17:04:04 +10:00
Dave Airlie	8c9e40ac22	radeonsi: handle doubles in lds load path. This handles loading doubles from LDS properly. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Cc: "11.0 11.1" <mesa-stable@lists.fedoraproject.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2015-12-09 17:03:38 +10:00
Dave Airlie	cce3864046	r600: handle geometry dynamic input array index This fixes: glsl-1.50/execution/geometry/dynamic_input_array_index.shader_test my profanity. We need to load the AR register with the value from the index reg Cc: "11.0 11.1" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2015-12-09 15:07:53 +10:00
Dave Airlie	38542921c7	r600g: fix geom shader input indirect indexing. This fixes: gs-input-array-vec4-index-rd The others run out of gprs unfortunately. Cc: "11.0 11.1" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2015-12-09 15:07:47 +10:00
Dave Airlie	e97ac006d7	r600g: fix outputing to non-0 buffers for stream 0. This fixes: arb_transform_feedback3-ext_interleaved_two_bufs_gs arb_transform_feedback3-ext_interleaved_two_bufs_gs_max transform-feedback-builtins If we are only emitting one ring, then emit all output buffers on it. Cc: "11.0 11.1" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2015-12-09 15:07:01 +10:00
Edward O'Callaghan	1f61447ce1	r600: Add ARB_copy_image support [airlied: update relnotes] Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	2015-12-09 14:41:46 +10:00
Edward O'Callaghan	d13ac27200	r600g: allow copying between compatible un/compressed formats See: `commit e82c527f1fc2f8ddc64954ecd06b0de3cea92e93` which is where a block in src maps to a pixel in dst and vice versa. e.g. DXT1 <-> R32G32_UINT DXT5 <-> R32G32B32A32_UINT Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	2015-12-09 14:40:32 +10:00
Ilia Mirkin	f920f8eb02	nv50/ir: fix cutoff for using r63 vs r127 when replacing zero The only effect here is a space savings - 822 programs in shader-db affected with the following overall change: total bytes used in shared programs : 44154976 -> 44139880 (-0.03%) Fixes: `641eda0c` (nv50/ir: r63 is only 0 if we are using less than 63 registers) Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "11.0 11.1" <mesa-stable@lists.freedesktop.org>	2015-12-08 23:15:29 -05:00
Ilia Mirkin	44260d9080	nv50/ir: prefer to color mad def and src2 with the same color This allows us to use the short encoding, and potentially fold immediates in later on. total instructions in shared programs : 6379731 -> 6367861 (-0.19%) total gprs used in shared programs : 728502 -> 728683 (0.02%) total local used in shared programs : 9904 -> 9904 (0.00%) total bytes used in shared programs : 44661008 -> 44154976 (-1.13%) local gpr inst bytes helped 0 51 7267 20306 hurt 0 232 125 274 Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-12-08 23:15:29 -05:00
Ilia Mirkin	c1c1248b94	nv50/ir: reduce degree limit on ops that can't encode large reg dests Operations that take immediates can only encode registers up to 64. This fixes a shader in a "Powered by Unity" intro. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-12-08 23:15:29 -05:00
Ilia Mirkin	99581ca393	nv50/ir: only unspill once ahead of a group of instructions We already semi-did this but the list of uses as unsorted, so it was unreliable. Sort the uses by bb and serial, and don't unspill for each instruction in a sequence. (And also don't unspill multiple times for a single instruction that uses the value in question multiple times.) This causes a minor reduction in generated instructions for shader-db (as few programs spill) but more importantly it brings determinism to each run's output. On SM10: total instructions in shared programs : 6387945 -> 6379359 (-0.13%) total gprs used in shared programs : 728544 -> 728544 (0.00%) total local used in shared programs : 9904 -> 9904 (0.00%) local gpr inst bytes helped 0 0 322 322 hurt 0 0 0 0 Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-12-08 23:15:29 -05:00
Ilia Mirkin	0f647bd65b	nv50/ir: check if the target supports the new offset before inlining Fixes: `abd326e81b` (nv50/ir: propagate indirect loads into instructions) Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=93300 Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-12-08 23:15:29 -05:00
Dave Airlie	a13b14930d	llvmpipe: fix fp64 inputs to geom shader. This fixes the fetching of fp64 inputs to the geometry shader, this fixes the recently posted piglit's arb_gpu_shader_fp64/execution/gs-fs-vs-double-array.shader_test arb_vertex_attrib_64bit/execution/gs-fs-vs-attrib-double-array.shader_test Reviewed-by: Roland Scheidegger <sroland@vmware.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	2015-12-09 13:56:39 +10:00
Eric Anholt	f61ceeb3fd	vc4: Enable MSAA. We still have several failures in the newly enabled tests in simulation: sRGB downsampling is done as if it was just linear, stencil blits are not supported on MSAA either, and derivatives are still not supported (breaking some MSAA simulation shaders). So, other than sRGB downsampling quality, things seem to be in good shape.	2015-12-08 10:09:52 -08:00
Eric Anholt	fc4a1bfb88	vc4: Add support for mapping of MSAA resources. The pipe_transfer_map API requires that we do an implicit downsample/upsample and return a mapping of that.	2015-12-08 09:49:56 -08:00
Eric Anholt	6b4dfd53ae	vc4: Add support for texel fetches from MSAA resources. This is the core of ARB_texture_multisample. Most of the piglit tests for GL_ARB_texture_multisample require GL 3.0, but exposing support for this lets us use the gallium blitter for multisample resolves. We can sometimes multisample resolve using just the RCL, but that requires that the blit is 1:1, unflipped, and aligned to tile boundaries.	2015-12-08 09:49:55 -08:00
Eric Anholt	a97b40dca4	vc4: Add support for multisample framebuffer operations. This includes GL_SAMPLE_COVERAGE, GL_SAMPLE_ALPHA_TO_ONE, and GL_SAMPLE_ALPHA_TO_COVAGE. I haven't implemented a dithering function yet, and gallium doesn't give me a good chance to do so for GL_SAMPLE_COVERAGE.	2015-12-08 09:49:54 -08:00
Eric Anholt	edc3305de7	vc4: Add a workaround for HW-2905, and additional failure I saw with MSAA. I only stumbled on this while experimenting due to reading about HW-2905. I don't know if the EZ disable in the Z-clear is actually necessary, but go with it for now.	2015-12-08 09:49:54 -08:00
Eric Anholt	edfd4d853a	vc4: Add support for drawing in MSAA.	2015-12-08 09:49:53 -08:00
Eric Anholt	e7c8ad0a6c	vc4: Add kernel RCL support for MSAA rendering.	2015-12-08 09:49:53 -08:00
Eric Anholt	568d3a8e32	vc4: Rename color_ms_write to color_write. I was thinking this was the only MSAA resolve thing, so it should be noted separately, but actually load/store general also do MSAA resolve.	2015-12-08 09:49:52 -08:00
Eric Anholt	bf92017ace	vc4: Allow RCL blits to the edge of the surface. The recent unaligned fix successfully prevented RCL blits that weren't aligned inside of the surface, but we also want to be able to do RCL blits for the whole surface when the width or height of the surface aren't aligned (we don't care what renders inside of the padding).	2015-12-08 09:49:52 -08:00

1 2 3 4 5 ...

25509 commits