fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-06 15:58:05 +02:00

Author	SHA1	Message	Date
Zack Rusin	8490d21cbe	tgsi/ureg: make the dst register match the src indirection In ureg src registers could have an indirect register that was either a temp or an addr register, while dst registers allowed only addr. That made moving between them a little difficult so make them behave the same way and allow temp's and addr registers as indirect files for both (tgsi supports it, just ureg didn't). Signed-off-by: Zack Rusin <zackr@vmware.com> Reviewed-by: José Fonseca <jfonseca@vmware.com>	2013-05-03 07:07:33 -04:00
Roland Scheidegger	23025ed15d	gallium: tgsi documentation updates and clarification for integer opcodes. A lot of them were missing. Others were moved from the Compute ISA to a new Integer ISA section as that seemed more appropriate. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-05-03 21:36:28 +02:00
Roland Scheidegger	ae507b6260	llvmpipe: get rid of depth swizzling. Eliminating this we no longer need to copy between linear and swizzled layout. This is probably not quite ideal since it's a bit more work for now, could do some optimizations by moving depth testing outside the fragment shader loop (but tricky for early depth test as we don't have neither the mask nor the interpolated z in the right order handy). The large amount of tile/untile code is no longer needed will be deleted in next commit. No piglit regressions. v2: change a forgotten LAYOUT_NONE to LAYOUT_LINEAR. v3: fix (bogus) uninitialized variable warnings, add comments, fix a bad type Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-05-03 21:36:20 +02:00
Lauri Kasanen	e495d88453	r600g: Correctly initialize the shader key, v2 Assigning a struct only copies the members - any padding is left as is. Thus this code: struct foo_t foo; foo = bar; leaves the padding of foo intact, ie uninitialized random garbage. This patch fixes constant shader recompiles by initializing the struct to zero. For completeness, memcpy is used to copy the key to the shader struct. NOTE: This is a candidate for the stable branches. Signed-off-by: Lauri Kasanen <cand@gmx.com> Reviewed-by: Marek Olšák <maraeo@gmail.com> Signed-off-by: Andreas Boll <andreas.boll.dev@gmail.com>	2013-05-03 19:28:57 +02:00
Lauri Kasanen	5ff81cfd86	st/xvmc/tests: Fix build failure, v2 v2: Removed extra libs as requested by Matt Turner. Signed-off-by: Lauri Kasanen <cand@gmx.com> Reviewed-by: Christian König <christian.koenig@amd.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Andreas Boll <andreas.boll.dev@gmail.com>	2013-05-03 19:14:54 +02:00
Andreas Boll	e62be5de53	scons: remove nouveau build One build system for linux/unix only drivers should be enough. Additionally the nouveau target was disabled anyway. Acked-by: Jose Fonseca <jfonseca@vmware.com>	2013-05-03 18:44:57 +02:00
Andreas Boll	4ca44f2c5e	scons: remove radeon build One build system for linux/unix only drivers should be enough. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=48694 Acked-by: Jose Fonseca <jfonseca@vmware.com>	2013-05-03 18:44:43 +02:00
Alex Deucher	4539f8e20a	r600g: don't emit surface_sync after FLUSH_AND_INV_EVENT It shouldn't be needed since the FLUSH_AND_INV_EVENT has already made sure the destination caches are flushed. Additionally, we didn't previously emit the surface_sync until this commit: http://cgit.freedesktop.org/mesa/mesa/commit/?id=e5e4c07e7964a3258ed02b530bcdc24c0650204b Emitting them together causes hangs in compute on cayman/TN and hangs in Heaven on evergreen. Note: this patch is a candidate for the 9.1 branch, but requires: http://cgit.freedesktop.org/mesa/mesa/commit/?id=156bcca62c9f4e79e78929f72bc085757f36a65a as well. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Marek Olšák <maraeo@gmail.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>	2013-05-03 10:55:05 -04:00
Vadim Girlin	41005d7bd2	r600g/sb: zero-initialize bytecode structs Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	2013-05-03 16:53:42 +04:00
Vadim Girlin	f92bd0958e	r600g/sb: fix constant propagation in gvn pass Fixes the bug that prevented propagation of literals in some cases. Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	2013-05-03 16:53:42 +04:00
Vadim Girlin	3c201a22ca	r600g/sb: don't run unnecessary passes Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	2013-05-03 16:53:42 +04:00
Vadim Girlin	48ba5712f5	r600g/sb: silence warnings with gcc 4.8 Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	2013-05-03 16:53:42 +04:00
Vadim Girlin	c49b6d7f27	r600g/sb: fix handling of interference sets in post_scheduler post_scheduler clears interference set for reallocatable values when the value becomes live first time, and then updates it to take into account modified order of operations, but this was not handled properly if the value appears first time as a source in copy operation. Fixes issues with webgl demo: http://madebyevan.com/webgl-water/ Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	2013-05-03 16:53:42 +04:00
Vadim Girlin	e16ef1f454	r600g/sb: fix allocation of indirectly addressed input arrays Some inputs may be preloaded into predefined GPRs, so we can't reallocate arrays with such inputs. Fixes issues with webgl demo: http://oos.moxiecode.com/js_webgl/snake/ Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	2013-05-03 16:53:41 +04:00
Vadim Girlin	a6fe055fa7	r600g/sb: use hex instead of binary constants This should fix build issues with GCC < 4.3 Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	2013-05-03 16:53:41 +04:00
Vadim Girlin	4ca67dbf0c	r600g: use old shader disassembler by default New disassembler is not completely isolated yet from further processing in r600g/sb that is not required for printing the dump, so it has higher probability to fail in case of any unexpected features in the bytecode. This patch adds "sbdisasm" flag for R600_DEBUG that allows to use new disassembler in r600g/sb for shader dumps when shader optimization is not enabled. If shader optimization is enabled, new disassembler is used by default. Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	2013-05-03 16:53:41 +04:00
Christian König	b4b3041132	radeon/uvd: enable interlaced buffers by default Kills tilling on UVD buffers, but we currently don't really need that. Signed-off-by: Christian König <christian.koenig@amd.com>	2013-05-03 11:00:21 +02:00
Christian König	85b0880a17	vl/idct: fix for commit `7d2f2a0c89` We still need the option for handling 3D textures as well. Should fix: https://bugs.freedesktop.org/show_bug.cgi?id=64143 Signed-off-by: Christian König <christian.koenig@amd.com>	2013-05-03 11:00:21 +02:00
Christian König	379753869d	vl/buffers: fix typo in function name Signed-off-by: Christian König <christian.koenig@amd.com>	2013-05-03 11:00:20 +02:00
Christian König	9c353ea293	radeon/uvd: fix some MPEG4 artifacts Still not perfect, but a step in the right direction. Signed-off-by: Christian König <christian.koenig@amd.com>	2013-05-03 11:00:20 +02:00
José Fonseca	abbbc9b667	draw: Update for u_assembled_primitive -> u_assembled_prim rename. Mesa build is too complex to rely on successful builds. On refactorings it is always a good idea to use git grep to prevent missing cases: $ git grep u_assembled_primitive src/gallium/auxiliary/draw/draw_pt_fetch_shade_pipeline_llvm.c: u_assembled_primitive(in_prim);	2013-05-03 08:35:17 +01:00
Chia-I Wu	8b2a967e32	st/egl: fix bulid errors on Android 4.2 The differences from the previous releases that affect st/egl are - logging macros are prefixed with an 'A' - dequeueBuffer() and enqueueBuffer() require an additoinal argument for fence fd, acquired from libsync Additionally, include gralloc_drm.h with extern "C".	2013-05-03 13:04:00 +08:00
Chia-I Wu	7346ab3b43	ilo: use u_reduced_prims_for_vertices() We do not need our own prim_count() anymore.	2013-05-03 11:59:10 +08:00
Chia-I Wu	f87dccdc19	util/prim: add u_reduced_prims_for_vertices() The function returns the number of reduced/tessellated primitives for the given vertex count. Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Acked-by: Zack Rusin <zackr@vmware.com>	2013-05-03 11:59:10 +08:00
Chia-I Wu	90d5190594	util/prim: assorted fixes for u_decomposed_prims_for_vertices() Switch to '>=' for comparisons, and it becomes obvious that the comparison for PIPE_PRIM_QUAD_STRIP was wrong. Add minimum vertex count check for PIPE_PRIM_LINE_LOOP. Return 1 for PIPE_PRIM_POLYGON with 3 vertices. Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Acked-by: Zack Rusin <zackr@vmware.com>	2013-05-03 11:59:10 +08:00
Chia-I Wu	30671cecc0	util/prim: use vertex count info in u_validate_pipe_prim() As a side effect, primitives with adjacency are now correctly validated. Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Acked-by: Zack Rusin <zackr@vmware.com>	2013-05-03 11:59:10 +08:00
Chia-I Wu	ddf0e3930f	util/prim: fix the name of the include guard It should be U_PRIM_H, not U_BLIT_H. Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Acked-by: Zack Rusin <zackr@vmware.com>	2013-05-03 11:59:10 +08:00
Chia-I Wu	5dd3bd70a1	draw: use u_assembled_prim() instead of u_assembled_primitive() The latter function is also removed as a result of the change. Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Acked-by: Zack Rusin <zackr@vmware.com>	2013-05-03 11:59:10 +08:00
Chia-I Wu	185692e72c	util/prim: clean up and add comments Move together (or add) functions to decompose/reduce/assemble a primitive, give them consistent names, and document them. Add u_prim_vertex_count() so that the vertex count information can be used elsewhere. u_assembled_primitive() will be removed in a folow-on commit. [olv: fix a warning when -Wold-style-declaration is enabled] Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Acked-by: Zack Rusin <zackr@vmware.com>	2013-05-03 11:58:57 +08:00
Chia-I Wu	64913002e4	util/prim: fix primitive trimming for triangles with adjacency Fix for PIPE_PRIM_TRIANGLES_ADJACENCY and PIPE_PRIM_TRIANGLE_STRIP_ADJACENCY. Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Acked-by: Zack Rusin <zackr@vmware.com>	2013-05-03 11:39:12 +08:00
Eric Anholt	573d8813fd	i965/vs: Add instruction scheduling. While this is ignorant of dependency control, it's still good for a 0.39% +/- 0.08% performance improvement on GLBenchmark 2.7 (n=548) v2: Rewrite as a subclass of the base class for the FS instruction scheduler, inheriting the same latency information. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2013-05-02 15:54:47 -07:00
Eric Anholt	3b00a6acac	i965: Move most of the FS instruction scheduler code to a general class. About half of this is shareable with the VS code. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2013-05-02 15:54:43 -07:00
Eric Anholt	ce22dd75b7	i965: Pull a couple of FS scheduling functions out to methods. These will get virtualized as we add VS scheduling support. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2013-05-02 15:54:39 -07:00
Eric Anholt	ee0223ba2a	i965: Move FS instruction scheduling to a non-FS-specific file. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2013-05-02 15:54:35 -07:00
Eric Anholt	ab04f3b2d7	i965: Share the register file enum between the two backends. I need this so I can look at vec4 and fs registers' files from the same .cpp file without namespaces. As far as I can tell we never rely on the particular numerical values of the files, though I thought it sounded like a good idea when doing the VS (it turns out having 0 be BAD_FILE is nicer). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2013-05-02 15:54:31 -07:00
Eric Anholt	63c8155b09	i965: Make dump_instructions be a virtual method of the visitor. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2013-05-02 15:54:26 -07:00
Eric Anholt	74e670d0a3	i965/vs: Do round-robin register allocation on gen6+ like we do in the FS. This will free instruction scheduling to make better choices. No statistically significant performance difference on GLB2.7 (n=93). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2013-05-02 15:54:09 -07:00
Rob Bradford	15e64de9e6	wayland: Make eglQueryBufferWL succeed for width and height requests too Following the addition of the EGL_WIDTH and EGL_HEIGHT this function should return EGL_TRUE for those requested attributes too.	2013-05-02 16:46:04 -04:00
Zack Rusin	396b861ceb	draw/gs: don't crash when vs/gs signatures don't match instead of crashing just fill zeros at the input slots that don't match, that's the mandated behavior and it avoids debug asserts. Signed-off-by: Zack Rusin <zackr@vmware.com> Reviewed-by: José Fonseca <jfonseca@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2013-05-02 02:43:42 -04:00
Zack Rusin	999cd79c9e	tgsi: allow negation of all integer types It's valid because we reuse certain arithmetic operations for both signed and unsigned types (e.g. uadd, umad, which have a bit unfortunate naming) Signed-off-by: Zack Rusin <zackr@vmware.com> Reviewed-by: José Fonseca <jfonseca@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2013-05-02 02:43:42 -04:00
Eric Anholt	1dfea559c3	i965: Fix SNB GPU hangs when a blorp batch is the first thing to execute. The GPU apparently goes looking for constants even though there are no shader stages enabled, and gets stuck because we haven't told it there are no constants to collect. If any other user of the 3D pipeline had run (even the Render accel of the X server!) since power on, then the in-GPU constant buffers would have been set up with some contents we didn't use, and we would succeed. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=56416 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Tested-by: Dave Airlie <airlied@redhat.com> NOTE: This is a candidate for the stable branches.	2013-05-02 11:27:37 -07:00
Tom Stellard	156bcca62c	r600g: Don't set the dest cache bits on surface sync for R600_CONTEXT_FLUSH_AND_INV We are already emitting a EVENT_TYPE_CACHE_FLUSH_AND_INV_EVENT packet when this flush flag is set, so flushing the dest caches with a SURFACE_SYNC should not be necessary. The motivation for this change is that emitting a SURFACE_SYNC packet with the CB bits set was causing compute shaders to hang on Cayman. Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2013-05-02 09:00:37 -07:00
Tom Stellard	5752be0cb7	r600g/compute: Fix build error in debug code Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2013-05-02 09:00:37 -07:00
Armin K	cd84353d57	radeon: Fix build with LLVM 3.3 Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2013-05-02 09:00:37 -07:00
Armin K	4742f9b00b	gallivm: Fix build with LLVM 3.3 Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2013-05-02 09:00:37 -07:00
Brian Paul	fcfbf4a19f	mesa: update comments, simplify code in vtxfmt.c Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-05-02 09:03:16 -06:00
Brian Paul	5dc0081ade	mesa: update GLvertexformat comments Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-05-02 09:03:16 -06:00
Brian Paul	200e09e393	mesa: remove GLvertexformat::EvalMesh1(), EvalMesh2() See previous commit comments. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-05-02 09:03:16 -06:00
Brian Paul	0f365b2d77	mesa: remove GLvertexformat::Rectf() As with the glDraw* functions, this doesn't have to be in GLvertexformat. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-05-02 09:03:16 -06:00
Brian Paul	49993a1a9d	mesa: simplify dispatch for glDraw* functions Remove all the glDraw* functions from the GLvertexformat structure. The point of that dispatch struct is to handle all the functions which dispatch differently depending on whether we're inside glBegin/End. glDraw* are never allowed inside glBegin/End so we can remove those entries. This simplifies the code paths and gets rid of quite a bit of code. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-05-02 09:03:16 -06:00

1 2 3 4 5 ...

56501 commits