fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-22 20:00:10 +01:00

Author	SHA1	Message	Date
Rob Clark	4c91930a25	freedreno: fix segfault when no color buffer bound Don't crash when no color buffer bound. Something caught when starting to run piglit, fixes a hanful of piglit tests. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-08-24 13:23:32 -04:00
Rob Clark	7eeab24344	freedreno/a3xx/compiler: cat4 cannot use const reg as src Category 4 instructions (rsq, rcp, sqrt, etc) seem to be unable to take a const register as src. In these cases we need to move the src to a temporary gpr first. This is the second case of such a restriction, where the instruction encoding appears to support a const src, but in fact the hw appears to ignore that bit. So split things out into a helper that can be re-used for any instructions which have this limitation. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-08-24 13:23:32 -04:00
Rob Clark	2effac5a67	freedreno/a3xx/compiler: use max_reg rather than file_count Our current (rather naive) register assignment is based on mapping different register files (INPUT, OUTPUT, TEMP, CONST, etc) based on the max register index of the preceding file. But in some cases, the lowest used register in a file might not be zero. In which case file_count[file] != file_max[file] + 1. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-08-24 13:23:32 -04:00
Rob Clark	aee1ed708a	freedreno/a3xx/compiler: handle saturate on dst Sometimes things other than color dst need saturating, like if there is a 'clamp(foo, 0.0, 1.0)'. So for saturated dst add the extra instructions to fix up dst. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-08-24 13:23:32 -04:00
Rob Clark	8b250bb8aa	freedreno/a3xx/compiler: fix CMP The 1st src to add.s needs (r) flag (repeat), otherwise it will end up: add.s dst.xyzw, tmp.xxxx -1 instead of: add.s dst.xyzw, tmp.xyzw, -1 Also, if we are using a temporary dst to avoid clobbering one of the src registers, we actually need to use that as the dst for the sel instruction. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-08-24 13:23:32 -04:00
Rob Clark	528bee59fe	freedreno/a3xx: some texture fixes Stop hard coding bits that indicate texture type (2d/3d/cube/etc). Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-08-24 13:21:59 -04:00
Rob Clark	fd59f3ea98	freedreno: update register headers resync w/ rnndb database Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-08-24 13:12:26 -04:00
Rob Clark	c2babfccb5	freedreno: add debug option to disable scissor optimization Useful for testing and debugging. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-08-24 13:11:50 -04:00
Rob Clark	ae1a3f1736	freedreno/a3xx: fix viewport on gmem->mem resolve Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-08-24 13:04:29 -04:00
Rob Clark	fbef4e795f	freedreno/a3xx: fix color inversion on mem->gmem restore Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-08-24 13:04:29 -04:00
Niels Ole Salscheider	288a252523	radeonsi: Handle additional PIPE_COMPUTE_CAP_* This patch adds support for: PIPE_COMPUTE_CAP_MAX_INPUT_SIZE PIPE_COMPUTE_CAP_MAX_LOCAL_SIZE Return the values reported by the closed source driver for now. Signed-off-by: Niels Ole Salscheider <niels_ole@salscheider-online.de> Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2013-08-23 17:00:01 -07:00
Niels Ole Salscheider	04349541cd	radeonsi: copy r600_get_timestamp Signed-off-by: Niels Ole Salscheider <niels_ole@salscheider-online.de> Reviewed-by: Marek Olšák <maraeo@gmail.com>	2013-08-23 16:59:55 -07:00
Niels Ole Salscheider	db6f4165f4	radeonsi: Implement PIPE_QUERY_TIMESTAMP Signed-off-by: Niels Ole Salscheider <niels_ole@salscheider-online.de> Reviewed-by: Marek Olšák <maraeo@gmail.com>	2013-08-23 16:59:44 -07:00
Michel Dänzer	237cb074cb	radeonsi: Fix y/z/w component values of TGSI_SEMANTIC_FOG pixel shader inputs They are defined as constant 0.0/0.0/1.0. Three more little piglits. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2013-08-22 16:12:17 +02:00
José Fonseca	c5f2cd6e41	trace: Handle null tokens. Used for example on stream out without geometry shader.	2013-08-22 12:14:15 +01:00
Chia-I Wu	b6037e734e	ilo: do not need last shader stage for 3DSTATE_SBE We have set up 3DSTATE_SBE (or 3DSTATE_SF on GEN6) in ilo_shader_select_kernel_routing(). There is no need to pass the last shader stage to the GPE function.	2013-08-22 15:18:29 +08:00
Chia-I Wu	627d7ca763	ilo: fix a potential issue with STATE_SIP Command length is ORed to the wrong place. Since the ORed value is zero, there is no real change.	2013-08-22 15:18:29 +08:00
Chia-I Wu	475d7ecce2	ilo: add GEN check to 3DSTATE_CLIP Assert that gen6_emit_3DSTATE_CLIP is for GEN 6 and 7.	2013-08-22 15:18:29 +08:00
Brian Paul	e4217396b7	svga: minor clean-ups in emit_hw_vs_vdecl()	2013-08-21 17:55:06 -06:00
Roland Scheidegger	ac1a2714c7	gallivm: implement better control of per-quad/per-element/scalar lod There's a new debug value used to disable per-quad lod optimizations in fragment shader (ignored for vs/gs as the results are just too wrong typically). Also trying to detect if a supplied lod value is really a scalar (if it's coming from immediate or constant file) in which case sampler code can use this to stay on per-quad-lod path (in fact for explicit lod could simplify even further and use same lod for both quads in the avx case but this is not implemented yet). Still need to actually implement per-element lod bias (and derivatives), and need to handle per-element lod in size queries. v2: fix comments, prettify. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-08-20 23:00:24 +02:00
Michel Dänzer	be301f707e	radeonsi: Always pre-load separate VGPRs for centroid vs. center interpolation The LLVM R600 backend currently always uses separate VGPRs for these. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=68162 (Centroid interpolation is identical to center interpolation without multisampling, so the shader hardware was only pre-loading one set of interpolation coefficients, and the pixel shader code was using uninitialized values as the centroid interpolation coefficients) Cc: mesa-stable@lists.freedesktop.org Tested-by: Laurent Carlier <lordheavym@gmail.com>	2013-08-20 18:50:28 +02:00
Michel Dänzer	5edcb682c9	radeonsi: Fix SPI_BARYC_CNTL register initialization The centroid / center interpolation related bits have different meanings as of SI. Fixes 7 centroid interpolation related piglit tests.	2013-08-20 18:50:10 +02:00
Chia-I Wu	ce87c51e9a	ilo: add ILO_DEBUG=flush When specified, ilo will print a line similar to cp flushed for render with 949+888 DWords (22.4%) because of frame end for every ilo_cp_flush() call.	2013-08-20 13:54:39 +08:00
Chia-I Wu	216a576e11	ilo: add ILO_DEBUG=draw It can print out pipe_draw_info and the dirty bits set, useful for debugging.	2013-08-20 13:54:38 +08:00
Vinson Lee	ff3cb378ad	r600g/sb: Move memsets of member structs to within constructor bodies. Silences "Uninitialized pointer field" defects reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Vadim Girlin <vadimgirlin@gmail.com>	2013-08-19 17:37:08 -07:00
Vinson Lee	b1d05eeb1f	radeonsi: Ensure fmask_format is initialized in release builds. Fixes "Uninitialized scalar variable" defect reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2013-08-19 09:19:19 -07:00
Christian König	5ddd840f5a	vl: add entrypoint to is_video_format_supported Signed-off-by: Christian König <christian.koenig@amd.com>	2013-08-19 10:21:15 +02:00
Christian König	a15cbabb8b	vl: add entrypoint to get_video_param Signed-off-by: Christian König <christian.koenig@amd.com>	2013-08-19 10:21:15 +02:00
Christian König	f2f7064e56	vl: rename pipe_video_decoder to pipe_video_codec Signed-off-by: Christian König <christian.koenig@amd.com>	2013-08-19 10:21:15 +02:00
Christian König	8e423ab984	vl: rename enum pipe_video_codec to pipe_video_format Signed-off-by: Christian König <christian.koenig@amd.com>	2013-08-19 10:21:15 +02:00
Christian König	53e20b8b41	vl: use a template for create_video_decoder Signed-off-by: Christian König <christian.koenig@amd.com>	2013-08-19 10:21:14 +02:00
Ilia Mirkin	a8346a2f52	nv50: allow non-nv12 buffers to be created, just pass them through to vl Since we expose non-NV12 formats as supported when there is no decoer profile selected, make sure that those formats are actually allowed to be allocated. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Tested-by: Emil Velikov <emil.l.velikov@gmail.com> Cc: "9.2" <mesa-stable@lists.freedesktop.org>	2013-08-17 17:58:36 +02:00
Marek Olšák	aafb0f9e06	radeonsi: fix feature support reporting broken by `21d9a1b5ef`	2013-08-17 02:49:00 +02:00
Marek Olšák	21d9a1b5ef	radeonsi: require LLVM 3.4 for MSAA	2013-08-17 01:48:25 +02:00
Marek Olšák	87b88f1dae	radeonsi: don't make scanout resources linear except for cursors The surface allocator understands the scanout flag just fine. This seems to improve performance for Ubuntu Unity on top of st/xorg and it fixes the cursor. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2013-08-17 01:48:25 +02:00
Marek Olšák	89ca4a00f5	radeonsi: remove useless code from tex_fetch_args The array slice has already been added to "address". Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2013-08-17 01:48:25 +02:00
Marek Olšák	5550554f1e	radeonsi: disable unbound colorbuffers Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2013-08-17 01:48:25 +02:00
Marek Olšák	356c041167	radeonsi: port texture improvements from r600g This started as an attempt to add support for MSAA texture transfers and MSAA depth-stencil decompression for the DB->CB copy path. It has gotten a bit out of control, but it's for the greater good. Some changes do not make much sense, they are there just to make it look like the other driver. With a few cosmetic modifications, r600_texture.c can be shared with a symlink. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2013-08-17 01:48:25 +02:00
Marek Olšák	4855acd461	radeonsi: implement texture fetching for compressed MSAA textures (v2) v2: use resource slots 16..31 for FMASK textures Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2013-08-17 01:48:25 +02:00
Marek Olšák	f671dfa8aa	radeonsi: add FMASK texture binding slots and resource setup (v2) v2: bind FMASK textures to shader resource slots 16..31 Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2013-08-17 01:48:25 +02:00
Marek Olšák	3c3feb38f4	radeonsi: implement FMASK decompression for MSAA texturing Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2013-08-17 01:48:25 +02:00
Marek Olšák	8c04f25360	radeonsi: scanout buffers cannot be a destination of MSAA resolve Resolving to scanout buffers just doesn't work. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2013-08-17 01:48:25 +02:00
Marek Olšák	2a4b2e2305	radeonsi: implement MSAA colorbuffer compression for rendering Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2013-08-17 01:48:25 +02:00
Marek Olšák	2f1c449415	radeonsi: implement uncompressed MSAA texturing This is glBlitFramebuffer support for MSAA surfaces as required by GL 3.0 and texturing as required by GL 3.2 and GL_ARB_texture_multisample. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2013-08-17 01:48:25 +02:00
Marek Olšák	f083f79751	radeonsi: disable alpha-to-coverage for integer colorbuffers Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2013-08-17 01:48:25 +02:00
Marek Olšák	6d4755a4d7	radeonsi: implement GL_SAMPLE_ALPHA_TO_ONE Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2013-08-17 01:48:25 +02:00
Marek Olšák	07955d4f2b	radeonsi: implement uncompressed MSAA rendering and color resolving This is basic MSAA support which should work with most apps. Some features are missing, those will be implemented by other commits. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2013-08-17 01:48:25 +02:00
Marek Olšák	c8e70e64ac	radeonsi: add flexible shader descriptor management and use it for sampler views It moves all sampler view descriptors to a buffer. It supports partial resource updates and it can also unbind resources (required for FMASK texturing). The buffer contains all sampler view descriptors for one shader stage, represented as an array. On top of that, there are N arrays in the buffer, which are used to emulate context registers as implemented by the previous ASICs (each array is a context). This uses the RCU synchronization approach to avoid read-after-write hazards as discussed in the thread: "radeonsi: add FMASK texture binding slots and resource setup" CP DMA is used to clear the descriptors at context initialization and to copy the descriptors from one context to the next. v2: - use PKT3_DMA_DATA on CIK (I'll test CIK later) - turn the bool CP DMA parameters into self-explanatory flags - add a nice simple API for packet emission to radeon_winsys.h - use 256 contexts, 128 causes texture corruption in openarena	2013-08-17 01:48:25 +02:00
Tom Stellard	764502b481	radeonsi/compute: Let the state tracker do all the flushing It shouldn't be necessary to call radeon_winsys::cs_flush() from radeonsi_launch_grid(), because the state tracker is responsible for flushing the pipeline at the appropriate time. The current behavior is also wrong, because radeonsi_launch_grid() submits packets to the compute ring, but when the state tracker calls pipe->flush() everything is submitted to the graphics ring. This has the potential to create a race condition. The downside of removing this flush is that the compute dispatch packets will be sent to the graphics ring rather than the compute ring. In the future we will need to come up with a way to detect 'compute' command streams and submit them to the appropriate ring. Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2013-08-17 01:48:25 +02:00
Ilia Mirkin	a2061eea0f	nv50: add vp3/vp4 support for mpeg2/vc1 h264/mpeg4 remain disabled for pre-nvc0, there's some minor bug/difference which causes the decoding to hang after some frames. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2013-08-16 09:48:47 +02:00

... 14 15 16 17 18 ...

11465 commits