fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-30 22:38:11 +02:00

Author	SHA1	Message	Date
Eric Anholt	62ea2461ed	vc4: Don't recompile the CS when the FS changes. The compiled_fs_id is a proxy for the vc4->prog.fs->input_slots[], but only the VS dereferences it. Drops 754 shaders from shader-db.	2016-08-04 08:48:27 -07:00
Eric Anholt	d577dbc201	vc4: Move FS inputs setup out to a helper function. It's a pretty big block, and I was about to make it bigger.	2016-08-04 08:48:27 -07:00
Michel Dänzer	67c5e843b9	vl/dri3: Destroy Present event context when destroying drawable v2 Without this, the X server may accumulate stale Present event contexts if a client performs several video decoding sessions using the same window. v2: Based on Chris Wilson's review: * Use xcb_discard_reply() instead of free(xcb_request_check()) Reviewed-and-Tested-by: Leo Liu <leo.liu@amd.com>	2016-08-04 15:45:43 +09:00
Eric Anholt	bc1fc9c985	vc4: Avoid generating a custom shader per level in glGenerateMipmaps(). We were baking in the LOD of the source level to each shader. Instead, pass it in as a uniform -- this requires storing it to a temp register, but that's better than compiling a ton of separate shaders: total instructions in shared programs: 115032 -> 115036 (0.00%) instructions in affected programs: 96 -> 100 (4.17%) LOST: 572	2016-08-03 10:55:54 -07:00
Eric Anholt	e97e9e62a1	vc4: Tell valgrind about BO allocations from mmap time to destroy. This helps in debugging memory pressure. It would be nice if we could tell valgrind about it all the way from allocation time to destroy, but we need a pointer to hand to VALGRIND_MALLOCLIKE_BLOCK.	2016-08-03 10:28:20 -07:00
Eric Anholt	a0671d67de	vc4: Fix a leak of the src[] array of VPM reads in optimization. Cc: "12.0" <mesa-stable@lists.freedesktop.org>	2016-08-03 10:25:09 -07:00
Eric Anholt	9f95690959	vc4: Fix leak of the bo_handles table.	2016-08-03 10:25:08 -07:00
Eric Anholt	02f8c444e8	vc4: Fix handling of UBO range offsets. The ranges are in units of bytes, not dwords. This wasn't caught by piglit tests because ttn tends to make one big uniform file, so we only had one UBO range with a src and dst offset of 0.	2016-08-03 10:25:08 -07:00
Eric Anholt	36b9eb82c1	vc4: Dump NIR at shader state creation time as well. I keep wanting to see this version of the NIR.	2016-08-03 10:25:08 -07:00
Marek Olšák	435d9595d3	r600g: use last_gfx_fence like radeonsi Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-08-03 17:46:46 +02:00
Marek Olšák	a6bfafa083	gallium/radeon: move last_gfx_fence from radeonsi to common code Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-08-03 17:46:46 +02:00
Marek Olšák	c15a9dec29	radeonsi: skip unnecessary si_update_shaders calls Small decrease in draw call overhead. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-08-03 17:46:46 +02:00
Marek Olšák	c2a0e99169	radeonsi: print the command line to VM fault reports (v2) v2: rebase on top of Brian's commit Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-08-03 17:46:46 +02:00
Marek Olšák	6573ad69ef	ddebug: print the command line to all logs (v2) for piglit with the pipelined hang detection mode v2: rebase on top of Brian's commit Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-08-03 17:46:46 +02:00
Marek Olšák	840353059a	ddebug: don't use fmemopen on non-Linux OS Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=97140 Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-08-03 17:46:46 +02:00
Marek Olšák	c88b309fd5	radeonsi: don't set the last parameter component of llvm.AMDGPU.cube LLVM doesn't use it. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-08-03 17:46:46 +02:00
Marek Olšák	42c5f839ad	radeonsi: use llvm.amdgcn.cube* if available Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-08-03 17:46:46 +02:00
Marek Olšák	1fb6e55eaf	radeonsi: use llvm.amdgcn.rsq.f64 if available Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-08-03 17:46:46 +02:00
Marek Olšák	db2d31dab1	radeonsi: use v_mad_f32 for fma v_fma_f32 runs at FP64 rate (= slow). Alien Isolation and F1 2015 seem to use fma for all d3d multiply-add instructions, which is silly. This tries to restore performance for those games. The main difference between v_mad_f32 and v_fma_f32 is that v_mad doesn't support denormals, which we don't enable anyway, because they are slow too. Also, there is code size reduction: Totals from affected shaders: VGPRS: 109796 -> 109808 (0.01 %) Spilled SGPRs: 29995 -> 30022 (0.09 %) Spilled VGPRs: 12 -> 13 (8.33 %) <-- it's just one shader going from 12 to 13 Code Size: 6667596 -> 6476356 (-2.87 %) bytes Max Waves: 26931 -> 26899 (-0.12 %) I've not actually tested real performance. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-08-03 17:46:46 +02:00
Tim Rowley	11072de368	swr: build swr with -fno-strict-aliasing swr rasterizer contains numerous data transfers between vectors and ordinary C types. Fixing for strict aliasing will take time. Reviewed-by: Matt Turner <mattst88@gmail.com>	2016-08-02 14:30:33 -05:00
Marek Olšák	6db93cd167	gallium/util: fix align64 it cut off the upper 32 bits Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net>	2016-08-01 23:28:14 +02:00
Matt Turner	be35c6ba92	draw: Avoid aliasing violations. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-08-01 12:09:17 -07:00
Matt Turner	8e68f35d32	r600g: Avoid aliasing violations. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-08-01 12:09:17 -07:00
Matt Turner	d2838f77ec	r300g: Avoid aliasing violation. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-08-01 12:09:17 -07:00
Matt Turner	16ff8f9ae8	gallium/auxiliary: Add u_bitcast.h header. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-08-01 12:09:17 -07:00
Brian Paul	13fa051356	auxiliary/os: add new os_get_command_line() function This can be used by the driver to get the command line which started the process. Will be used by the VMware driver for extra logging. For now, this is only implemented for Linux via /proc/self/cmdline and Windows via GetCommandLine(). Reviewed-by: Charmaine Lee <charmainel@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-08-01 12:20:19 -06:00
Charmaine Lee	c2b4942afc	svga: avoid redundant SetVertexBuffer/SetIndexBuffer commands at rebind This patch eliminates the redundant SetVertexBuffers and SetIndexBuffer commands that are emitted for rebind purpose. With this patch, the set commands will be skipped, but we will still reference the associated resources to allow the kernel to bring in the resources. Tested with Lightsmark2008, Valley, MTT glretrace, piglit, conform. Reviewed-by: Brian Paul <brianp@vmware.com>	2016-08-01 12:20:19 -06:00
Rob Clark	53b2b8bf6f	u_vbuf: fix potentially bogus assert There are cases where we hit u_vbuf path due to alignment or pitch- alignment restrictions, but for an output-format that u_vbuf does not support translating (yet the driver does support natively). In which case we hit the memcpy() path and don't care that u_vbuf doesn't understand it. Fixes crash with debug build of mesa in: dEQP-GLES3.functional.vertex_arrays.single_attribute.strides.fixed.user_ptr_stride17_components2_quads1 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=95000 Signed-off-by: Rob Clark <robdclark@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-08-01 13:42:11 -04:00
Eric Anholt	26ff7e373f	vc4: Zero-initialize the hardware sampler view structure. Fixes failure to initialize the force_first_level flag, causing failures in piglit levelclamp.	2016-07-31 19:23:03 -07:00
Roland Scheidegger	99a47391e4	Revert "gallium/util: fix resource leak" This reverts commit `d1fe26a628`. Replacing a resource leak with a segfault isn't the solution.	2016-07-30 18:18:09 +02:00
Eric Engestrom	d1fe26a628	gallium/util: fix resource leak CovID: 401540 Signed-off-by: Eric Engestrom <eric@engestrom.ch> Signed-off-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net>	2016-07-30 17:27:42 +02:00
francians@gmail.com	e713a9e613	freedreno/a4xx: fix comparison out of range warnings Signed-off-by: Francesco Ansanelli <francians@gmail.com> Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:25:42 -04:00
francians@gmail.com	43492c7f2c	freedreno/a3xx: fix comparison out of range warnings Signed-off-by: Francesco Ansanelli <francians@gmail.com> Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:25:31 -04:00
francians@gmail.com	089cc74b6a	freedreno/a2xx: fix comparison out of range warnings Signed-off-by: Francesco Ansanelli <francians@gmail.com> Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:25:16 -04:00
francians@gmail.com	3fa68fdc90	freedreno/ir3: init ir3_shader_key with memset() To silence missing initializers warning Signed-off-by: Francesco Ansanelli <francians@gmail.com> Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:24:59 -04:00
Eric Engestrom	a63bac9271	gallium/freedreno: move cast to avoid integer overflow Previously, the bitshift would be performed on a simple int (32 bits on most systems), overflow, and then be cast to 64 bits. CovID: 1362461 Signed-off-by: Eric Engestrom <eric@engestrom.ch> Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Eric Engestrom	3563c4d161	freedreno/a2xx: remove duplicate assignment CovID: 1362445, 1362446 Signed-off-by: Eric Engestrom <eric@engestrom.ch> Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	2d64a003c5	freedreno: defer flush_queue allocation Some apps, like warsow, create a bazillion contexts but don't render on most of them. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	4175606474	freedreno: add some hw query traces Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	e684c32d2f	freedreno: some locking Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	010e4b2d52	os: add pipe_mutex_assert_locked() Would be nice if we could also have lockdep, like in the linux kernel. But this is better than nothing. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	9f0eb69527	freedreno: drop needs_rb_fbd We need to emit RB_FRAME_BUFFER_DIMENSION once per batch.. tracking this in fd_context is wrong when the gmem code executes asynchronously from the flush_queue worker. But in fact we don't really need to track it at all. We cannot assume previous value at the beginning of the batch (because of other processes potentially using the GPU), so just drop the tracking and emit it in _tile_init(). Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	e6bfe1c773	freedreno: move needs_wfi into batch This is also used in gmem code, which executes from the "bottom half" (ie. from the flush_queue worker thread), so it cannot be in fd_context. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	0739bbceec	freedreno: a bit of micro-optimization Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	e1b1052700	freedreno: drop mem2gmem/gmem2mem query stages They weren't really used, and it gets somewhat more complicated to deal with if batches are flushed asynchronously (on another thread). So just drop them, and move _query_set_state(NULL) call into batch (so it is not happening on background thread). Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	00bed8a794	freedreno: threaded batch flush With the state accessed from GMEM+submit factored out of fd_context and into fd_batch, now it is possible to punt this off to a helper thread. And more importantly, since there are cases where one context might force the batch-cache to flush another context's batches (ie. when there are too many in-flight batches), using a per-context helper thread keeps various different flushes for a given context serialized. TODO as with batch-cache, there are a few places where we'll need a mutex to protect critical sections, which is completely missing at the moment. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	c44163876a	freedreno: track batch/blit types Add a bit of extra book-keeping about blits and back-blits (from resource shadowing). If the app uploads all mipmap levels, as opposed to uploading the first level and then glGenerateMipmap(), we can discard the back-blit (as opposed to being naive and shadowing the resource for each mipmap level). Also, after a normal blit, we might as well flush the batch immediately, since there is not likely to be further rendering to the surface. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	7f8fd02dc7	freedreno: re-order support for hw queries Push query state down to batch, and use the resource tracking to figure out which batch(es) need to be flushed to get the query result. This means we actually need to allocate the prsc up front, before we know the size. So we have to add a special way to allocate an un- backed resource, and then later allocate the backing storage. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	10baf05b2c	freedreno: use prsc for hw queries Switch to using a pipe_resource (rather than an fd_bo directly) for hw query result buffers. This is first step towards making queries work properly with reordered batches, since we'll need the additional dependency tracking to know which batches to flush. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	ba30096888	freedreno: support discarding previous rendering in special cases Basically, to "DCE" blits triggered by resource shadowing, in cases where the levels are immediately completely overwritten. For example, mid-frame texture upload to level zero triggers shadowing and back-blits to the remaining levels, which are immediately overwritten by glGenerateMipmap(). Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00

... 18 19 20 21 22 ...

29173 commits