fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-26 14:38:13 +02:00

Author	SHA1	Message	Date
Rob Clark	56ea2c4816	freedreno/a3xx: don't leak so much Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-08-24 13:58:01 -04:00
Rob Clark	9b9038496c	freedreno/a3xx/compiler: fix SGT/SLT/etc The cmps.f.* instruction doesn't actually seem to give a float 1.0 or 0.0 output. It either needs a cov.u16f16 or add.s + sel.f16. This makes SGT/SLT/etc more similar to CMP, so handle them in trans_cmp(). This fixes a bunch of piglit tests. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-08-24 13:23:32 -04:00
Rob Clark	572d4646f7	freedreno/a3xx/compiler: bit of re-arrange/cleanup It seems there are a number of cases where instructions have limitations about taking reading src's from const register file, so make get_unconst() a bit easier to use. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-08-24 13:23:32 -04:00
Rob Clark	d63bbac3a5	freedreno/a3xx/compiler: make compiler errors more useful We probably should get rid of assert() entirely, but at this stage it is more useful for things to crash where we can catch it in a debugger. With compile_error() we have a single place to set an error flag (to bail out and return an error on the next instruction) so that will be a small change later when enough of the compiler bugs are sorted. But re-arrange/cleanup the error/assert stuff so we at least get a dump of the TGSI that triggered it. So we see some useful output in piglit logs. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-08-24 13:23:32 -04:00
Rob Clark	4c91930a25	freedreno: fix segfault when no color buffer bound Don't crash when no color buffer bound. Something caught when starting to run piglit, fixes a hanful of piglit tests. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-08-24 13:23:32 -04:00
Rob Clark	7eeab24344	freedreno/a3xx/compiler: cat4 cannot use const reg as src Category 4 instructions (rsq, rcp, sqrt, etc) seem to be unable to take a const register as src. In these cases we need to move the src to a temporary gpr first. This is the second case of such a restriction, where the instruction encoding appears to support a const src, but in fact the hw appears to ignore that bit. So split things out into a helper that can be re-used for any instructions which have this limitation. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-08-24 13:23:32 -04:00
Rob Clark	2effac5a67	freedreno/a3xx/compiler: use max_reg rather than file_count Our current (rather naive) register assignment is based on mapping different register files (INPUT, OUTPUT, TEMP, CONST, etc) based on the max register index of the preceding file. But in some cases, the lowest used register in a file might not be zero. In which case file_count[file] != file_max[file] + 1. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-08-24 13:23:32 -04:00
Rob Clark	aee1ed708a	freedreno/a3xx/compiler: handle saturate on dst Sometimes things other than color dst need saturating, like if there is a 'clamp(foo, 0.0, 1.0)'. So for saturated dst add the extra instructions to fix up dst. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-08-24 13:23:32 -04:00
Rob Clark	8b250bb8aa	freedreno/a3xx/compiler: fix CMP The 1st src to add.s needs (r) flag (repeat), otherwise it will end up: add.s dst.xyzw, tmp.xxxx -1 instead of: add.s dst.xyzw, tmp.xyzw, -1 Also, if we are using a temporary dst to avoid clobbering one of the src registers, we actually need to use that as the dst for the sel instruction. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-08-24 13:23:32 -04:00
Rob Clark	528bee59fe	freedreno/a3xx: some texture fixes Stop hard coding bits that indicate texture type (2d/3d/cube/etc). Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-08-24 13:21:59 -04:00
Rob Clark	fd59f3ea98	freedreno: update register headers resync w/ rnndb database Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-08-24 13:12:26 -04:00
Rob Clark	c2babfccb5	freedreno: add debug option to disable scissor optimization Useful for testing and debugging. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-08-24 13:11:50 -04:00
Rob Clark	ae1a3f1736	freedreno/a3xx: fix viewport on gmem->mem resolve Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-08-24 13:04:29 -04:00
Rob Clark	fbef4e795f	freedreno/a3xx: fix color inversion on mem->gmem restore Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-08-24 13:04:29 -04:00
Niels Ole Salscheider	288a252523	radeonsi: Handle additional PIPE_COMPUTE_CAP_* This patch adds support for: PIPE_COMPUTE_CAP_MAX_INPUT_SIZE PIPE_COMPUTE_CAP_MAX_LOCAL_SIZE Return the values reported by the closed source driver for now. Signed-off-by: Niels Ole Salscheider <niels_ole@salscheider-online.de> Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2013-08-23 17:00:01 -07:00
Niels Ole Salscheider	04349541cd	radeonsi: copy r600_get_timestamp Signed-off-by: Niels Ole Salscheider <niels_ole@salscheider-online.de> Reviewed-by: Marek Olšák <maraeo@gmail.com>	2013-08-23 16:59:55 -07:00
Niels Ole Salscheider	db6f4165f4	radeonsi: Implement PIPE_QUERY_TIMESTAMP Signed-off-by: Niels Ole Salscheider <niels_ole@salscheider-online.de> Reviewed-by: Marek Olšák <maraeo@gmail.com>	2013-08-23 16:59:44 -07:00
Roland Scheidegger	ad9b5b9ae9	gallivm: fix min/mag switchover point for nearest/none mip filter Previously, the min/mag switchover point when using nearest/none mip filter was effectively -0.5 which can't be right. Looks like new OpenGL thinks it's ok if it's always 0.0 (older versions required 0.5 in some cases), let's hope everybody else thinks that's fine too. Refactor this slightly and get the per-quad/per-pixel min/mag decision values further down to sampling, though still only the first component is used yet. While here also fix code trying to skip lod bias application etc. when mipfilter is none, as this is still needed for determining min/mag filter. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-08-23 23:46:28 +02:00
Jon Severinsson	b47bde0079	gallium/osmesa: Link, not copy, the shared library to the LIB_DIR. Just like all other mesa libraries... CC: "9.2" <mesa-stable@lists.freedesktop.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2013-08-23 12:58:48 -07:00
Jon Severinsson	aeb9c9e4b0	gallium/osmesa: Always link with the c++ linker. Just like all other gallium targets... CC: "9.2" <mesa-stable@lists.freedesktop.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2013-08-23 12:58:45 -07:00
Jon Severinsson	c811190430	gallium/osmesa: Make and install an osmesa.pc. As of "2f142d59 build: Add --enable-gallium-osmesa flag." the pkgconfig file from classic osmesa is no longer installed when building gallium osmesa, so copy it to gallium osmesa and install the copy instead. CC: "9.2" <mesa-stable@lists.freedesktop.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2013-08-23 12:58:30 -07:00
Roland Scheidegger	bd0b6c5180	gallivm: do per-element lod for lod bias and explicit derivs too Except for explicit derivs with cube maps which are very bogus anyway. Just like explicit lod this is only used if no_quad_lod is set in GALLIVM_DEBUG env var. Minification is terrible on cpus which don't support true vector shifts (but should work correctly). Cannot do the min/mag filter decision (if they are different) per pixel though, only selecting different mip levels works. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-08-22 19:05:52 +02:00
Roland Scheidegger	33694a1800	gallivm: (trivial) fix int/uint border color clamping Just a copy & paste error. Fixes https://bugs.freedesktop.org/show_bug.cgi?id=68409. Note that the test passing before probably simply means it doesn't verify clamping of the border color itself as required by the OpenGL spec. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-08-22 19:05:52 +02:00
Roland Scheidegger	6ff9008544	gallivm: (trivial) fix linear aos sampling of 3d compressed formats block size depth is always 1 even for compressed formats (unless someone invents true 3d compressed formats at least which we can't represent). Nearest (and soa) path had it right. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-08-22 19:05:52 +02:00
Michel Dänzer	237cb074cb	radeonsi: Fix y/z/w component values of TGSI_SEMANTIC_FOG pixel shader inputs They are defined as constant 0.0/0.0/1.0. Three more little piglits. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2013-08-22 16:12:17 +02:00
José Fonseca	fb62388d6a	gallium: Support PIPE_FORMAT_R10G10B10A2_UINT. Same as PIPE_FORMAT_B10G10R10A2_UINT but without the swizzling. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2013-08-22 12:14:15 +01:00
José Fonseca	c5f2cd6e41	trace: Handle null tokens. Used for example on stream out without geometry shader.	2013-08-22 12:14:15 +01:00
Chia-I Wu	b6037e734e	ilo: do not need last shader stage for 3DSTATE_SBE We have set up 3DSTATE_SBE (or 3DSTATE_SF on GEN6) in ilo_shader_select_kernel_routing(). There is no need to pass the last shader stage to the GPE function.	2013-08-22 15:18:29 +08:00
Chia-I Wu	627d7ca763	ilo: fix a potential issue with STATE_SIP Command length is ORed to the wrong place. Since the ORed value is zero, there is no real change.	2013-08-22 15:18:29 +08:00
Chia-I Wu	475d7ecce2	ilo: add GEN check to 3DSTATE_CLIP Assert that gen6_emit_3DSTATE_CLIP is for GEN 6 and 7.	2013-08-22 15:18:29 +08:00
Matt Turner	2f142d596f	build: Add --enable-gallium-osmesa flag. The Gallium implementation is apparently not ready for regular consumption, so as much as I hate adding more build-time options, here's another. Acked-by: Brian Paul <brianp@vmware.com>	2013-08-21 23:07:10 -07:00
Brian Paul	e4217396b7	svga: minor clean-ups in emit_hw_vs_vdecl()	2013-08-21 17:55:06 -06:00
Roland Scheidegger	e6013e4bee	gallivm: unify sin and cos implementation The (complicated!) math is all identical, there's just minimal differences how sign bit is calculated plus there's an additional subtraction for the argument going into the polynomial for cos. The logic stays 100% the same (with a small exception, sign bit calculation for sin is minimally simplified, applying sign mask after xoring the arguments instead of applying it to each argument). Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-08-21 22:05:53 +02:00
Roland Scheidegger	275d2efeed	gallivm: add comment for bogus min/mag filter selection with nearest mip filter Detected this hunting some other bug, not sure if it really needs fixing but it is definitely wrong. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-08-21 22:05:52 +02:00
Roland Scheidegger	21d8fa2759	gallivm: fix rho calculation for 1d case Was using wrong (undefined) vector element (the elements are at 0/2 position, not 0/1). Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-08-21 22:05:52 +02:00
Rico Schüller	00fcdc81ff	vdpau/decode: Fix comment. Reviewed-by: Christian König <christian.koenig@amd.com>	2013-08-21 11:25:36 +02:00
Rico Schüller	d8d90ecf30	vl/query: Only support VDP_CHROMA_TYPE_420 for 12 bit formats. Reviewed-by: Christian König <christian.koenig@amd.com>	2013-08-21 11:25:10 +02:00
Roland Scheidegger	4b45b61fef	util: add avx2 and xop detection to cpu detection code Going to need this soon (not going to bother with avx2 intrinsics at this time but don't want to do workarounds for true vector shifts if llvm itself can use them just fine and won't need the gazillion instruction emulation). Not really tested other than my cpu returns 0 for these features... (I have no idea if llvm actually would emit avx2/xop instructions neither...) Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-08-20 23:00:24 +02:00
Roland Scheidegger	9299128bf2	gallivm: fix bogus aos path detection Need to check the wrap mode of the actually used coords not a fixed 2. While checking more than necessary would only potentially disable aos and not cause any harm I'm pretty sure for 3d textures it could have caused assertion failures (if s,t coords have simple filter and r not). Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-08-20 23:00:24 +02:00
Roland Scheidegger	fe92d7fab4	gallivm: do clamping of border color correctly for all formats Turns out it is actually very complicated to figure out what a format really is wrt range, as using channel information for determining unorm/snorm etc. doesn't work for a bunch of cases - namely compressed, subsampled, other. Also while here add clamping for uint/sint as well - d3d10 doesn't actually need this (can only use ld with these formats hence no border) and we could do this outside the shader for GL easily (due to the fixed texture/sampler relation) do it here too just so I can forget about it. v2: move border color clamping out of fetch texel. Also change it to clamp the whole border vector at once (and use vectorized load of border color), which saves a couple of instructions - needs some different handling of mixed signed/unsigned formats so skip the per channel stuff and just derive this from first channel except for special formats. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-08-20 23:00:24 +02:00
Roland Scheidegger	ac1a2714c7	gallivm: implement better control of per-quad/per-element/scalar lod There's a new debug value used to disable per-quad lod optimizations in fragment shader (ignored for vs/gs as the results are just too wrong typically). Also trying to detect if a supplied lod value is really a scalar (if it's coming from immediate or constant file) in which case sampler code can use this to stay on per-quad-lod path (in fact for explicit lod could simplify even further and use same lod for both quads in the avx case but this is not implemented yet). Still need to actually implement per-element lod bias (and derivatives), and need to handle per-element lod in size queries. v2: fix comments, prettify. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-08-20 23:00:24 +02:00
Ross Burton	76feef0823	build: fix out-of-tree builds in gallium/auxiliary The rules were writing files to e.g. util/u_indices_gen.py, but in an out-of-tree build this directory doesn't exist in the build directory. So, create the directories just in case. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Ross Burton <ross.burton@intel.com>	2013-08-20 10:35:14 -07:00
Michel Dänzer	be301f707e	radeonsi: Always pre-load separate VGPRs for centroid vs. center interpolation The LLVM R600 backend currently always uses separate VGPRs for these. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=68162 (Centroid interpolation is identical to center interpolation without multisampling, so the shader hardware was only pre-loading one set of interpolation coefficients, and the pixel shader code was using uninitialized values as the centroid interpolation coefficients) Cc: mesa-stable@lists.freedesktop.org Tested-by: Laurent Carlier <lordheavym@gmail.com>	2013-08-20 18:50:28 +02:00
Michel Dänzer	5edcb682c9	radeonsi: Fix SPI_BARYC_CNTL register initialization The centroid / center interpolation related bits have different meanings as of SI. Fixes 7 centroid interpolation related piglit tests.	2013-08-20 18:50:10 +02:00
Maarten Lankhorst	86751cbddf	gallium/osmesa: add same checks to OSMesaMakeCurrent as the other osmesa Fixes a opengl crash in wine. Cc: "9.2" <mesa-stable@lists.freedesktop.org> Signed-off-by: Maarten Lankhorst <maarten.lankhorst@canonical.com>	2013-08-20 12:36:17 +02:00
Maarten Lankhorst	603160d4c0	gallium/osmesa: link against static libglapi library too to get the gl exports This should fix missing symbols in a osmesa built against shared glapi osmesa build. All opengl exports were missing that are defined in the static glapi, so link against both to fix this. I could swear I've done this before, maybe there was a glitch in the matrix. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=47824 Cc: "9.2" <mesa-stable@lists.freedesktop.org> Signed-off-by: Maarten Lankhorst <maarten.lankhorst@canonical.com>	2013-08-20 10:44:53 +02:00
Chia-I Wu	ce87c51e9a	ilo: add ILO_DEBUG=flush When specified, ilo will print a line similar to cp flushed for render with 949+888 DWords (22.4%) because of frame end for every ilo_cp_flush() call.	2013-08-20 13:54:39 +08:00
Chia-I Wu	216a576e11	ilo: add ILO_DEBUG=draw It can print out pipe_draw_info and the dirty bits set, useful for debugging.	2013-08-20 13:54:38 +08:00
Vinson Lee	ff3cb378ad	r600g/sb: Move memsets of member structs to within constructor bodies. Silences "Uninitialized pointer field" defects reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Vadim Girlin <vadimgirlin@gmail.com>	2013-08-19 17:37:08 -07:00
Emil Velikov	b9d1173f2c	vl/buffers: consistent use on VL_MAX_SURFACES Reviewed-by: Christian König <christian.koenig@amd.com> Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com>	2013-08-19 18:32:08 +02:00

1 2 3 4 5 ...

19124 commits