fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-02-05 12:40:26 +01:00

Author	SHA1	Message	Date
Gurchetan Singh	c0773315af	virgl: quadruple command buffer size Tested running WebGL aquarium on Nvidia host (10,000 fishes) This moves us from 7 fps to 9 fps. After quadrupling, performance gains diminish. v2: Remove change ID (Erik) Tested-By: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com>	2018-11-30 12:20:06 +01:00
Lionel Landwerlin	37f9788e9a	anv: flush pipeline before query result copies Pipeline state pending bits should be taken into account when copying results. In the particular bug below, the results of the vkCmdCopyQueryPoolResults() command were being overwritten by the preceding vkCmdCopyBuffer() with the same destination buffer. This is because we copy the buffers using the 3D pipeline whereas we copy the query results using the command streamer. Those pieces of HW work in parallel and the results are somewhat undefined. v2: Unconditionally flush the pipeline before copying the results (Jason) v3: Wrap & expressions (Jason) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Suggested-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108894 Cc: mesa-stable@lists.freedesktop.org	2018-11-29 22:07:31 +00:00
Marek Olšák	39b20b7d4f	Revert "winsys/amdgpu: overallocate buffers for faster address translation on Gfx9" I didn't mean to push this. I don't think it makes any difference. This reverts commit `f737fe00a0`.	2018-11-29 14:46:06 -05:00
Roland Scheidegger	fbf95ce074	draw: fix infinite loop in line stippling The calculated length of a line may be infinite, if the coords we get are bogus. This leads to an infinite loop in line stippling. To prevent this, test for this explicitly (although technically on at least x86 sse it would actually work without the explicit test, as long as we use the int-converted length value). While here also get rid of some always-true condition. Note this does not actually solve the root cause, which is that the coords we receive are bogus after clipping. This seems a difficult problem to solve. One issue is that due to float arithmetic, clip w may become 0 after clipping if the incoming geometry is "sufficiently degenerate", hence x/y/z ndc (and window) coords will be all inf (or nan). Even with w not quite 0, I believe it's possible we produce values which are actually outside the view volume. (Also, x=y=z=w=0 coords in clipspace would be not considered subject to clipping, and similarly result in all NaN coords.) We just hope for now other draw stages (and rasterizers) can handle those relatively safely (llvmpipe itself should be sort of robust against this, certainly conversion to fixed point will produce garbage, it might fail a couple assertions but should neither hang nor crash otherwise). Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2018-11-29 18:39:40 +01:00
Józef Kucia	94bfb8bf38	nir: Fix assert in print_intrinsic_instr(). Signed-off-by: Józef Kucia <joseph.kucia@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-11-29 16:29:37 +00:00
Nicolai Hähnle	776b911365	amd/addrlib: update Mesa's copy of addrlib Update to the internal master as of 2018-11-15. This has a lot of gratuitous whitespace change, but on the plus side it's built using the same tooling that's used for AMDVLK, which should help going forward.	2018-11-29 13:18:24 +01:00
Nicolai Hähnle	621c107760	ac/surface/gfx9: let addrlib choose the preferred swizzle kind Our choices here are simply redundant as long as sin.flags is set correctly. (v2: - remove unused function parameter) Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-11-29 13:18:23 +01:00
Nicolai Hähnle	729ebdf07e	radv: remove dependency on addrlib gfx9_enum.h v2: - use SI_CONTEXT_REG_OFFSET Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-11-29 13:18:23 +01:00
Thomas Hellstrom	058f85d41c	winsys/svga: Fix a memory leak The ioctl.cap_3d member was never freed. Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Thomas Hellstrom <thellstrom@vmware.com> Reviewed-by: Sinclair Yeh <syeh@vmware.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-11-29 10:42:06 +01:00
Thomas Hellstrom	7fce3ca375	st/xa: Fix a memory leak Free the context after destruction. Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Thomas Hellstrom <thellstrom@vmware.com> Reviewed-by: Sinclair Yeh <syeh@vmware.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-11-29 10:42:06 +01:00
Samuel Pitoiset	cc7deb749c	radv: drop few useless state changes when doing color/depth decompressions Viewport/scissor don't need to be updated for array textures. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-29 10:18:55 +01:00
Samuel Pitoiset	6d4f65deea	radv: remove unused pending_clears param in the transition path Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-29 10:18:53 +01:00
Samuel Pitoiset	4b9df824f7	radv: optimize CmdClear{Color,DepthStencil}Image() for layered textures If all layers are bound we can perform a fast color or depth clear instead of iterating over all layers. This has the advantage of not trashing the framebuffer for nothing if we end up doing a fast clear when calling radv_clear_image_layer(), and clearing all layers in one shot is obviously faster. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-29 10:18:42 +01:00
Samuel Pitoiset	7484bc894b	radv: refactor the fast clear path for better re-use Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-29 10:18:42 +01:00
Samuel Pitoiset	f78ee19702	radv: simplify a check in emit_fast_color_clear() Currently only true if RADV_PERFTEST=dccmsaa is set. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-29 10:18:42 +01:00
Samuel Pitoiset	eca931a726	radv: add radv_can_fast_clear_{color,depth}() helpers For further optimisations. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-29 10:18:42 +01:00
Samuel Pitoiset	93f5ce8fa7	radv: add radv_image_view_can_fast_clear() helper Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-29 10:18:42 +01:00
Samuel Pitoiset	aeaf8dbd09	radv: add radv_image_can_fast_clear() helper Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-29 10:18:42 +01:00
Samuel Pitoiset	3e718db1ff	radv: remove useless check in emit_fast_color_clear() The driver doesn't support DCC/CMASK for mipmapped textures. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-29 10:18:42 +01:00
Vinson Lee	d0c7b079d0	freedreno: Fix autotools build. Fix build error. CXXLD pipe_msm.la ../../../../src/gallium/drivers/freedreno/.libs/libfreedreno.a(freedreno_batch.o): In function `batch_init': src/gallium/drivers/freedreno/freedreno_batch.c:54: undefined reference to `fd_device_version' src/gallium/drivers/freedreno/freedreno_batch.c:59: undefined reference to `fd_submit_new' src/gallium/drivers/freedreno/freedreno_batch.c:61: undefined reference to `fd_submit_new_ringbuffer' src/gallium/drivers/freedreno/freedreno_batch.c:64: undefined reference to `fd_submit_new_ringbuffer' src/gallium/drivers/freedreno/freedreno_batch.c:66: undefined reference to `fd_submit_new_ringbuffer' src/gallium/drivers/freedreno/freedreno_batch.c:70: undefined reference to `fd_submit_new_ringbuffer' Fixes: `b4476138d5` ("freedreno: move drm to common location") Fixes: `aa0fed10d3` ("freedreno: move ir3 to common location") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Rob Clark <robdclark@gmail.com>	2018-11-28 22:23:52 -08:00
Marek Olšák	075fd5d8f2	radeonsi: add memory management stress tests for GDS Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-28 20:20:27 -05:00
Marek Olšák	c1d3c08699	winsys/amdgpu: add support for allocating GDS and OA resources Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-28 20:20:27 -05:00
Marek Olšák	d7a4fa91f0	radeonsi: allow si_cp_dma_clear_buffer to clear GDS from any IB Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-28 20:20:27 -05:00
Marek Olšák	72b2b61d8c	winsys/amdgpu: use optimal VM alignment for CPU allocations Acked-by: Christian König <christian.koenig@amd.com>	2018-11-28 20:20:27 -05:00
Marek Olšák	27f9935075	winsys/amdgpu: use optimal VM alignment for imported buffers Window system buffers didn't use the optimal alignment. Acked-by: Christian König <christian.koenig@amd.com>	2018-11-28 20:20:27 -05:00
Marek Olšák	6b554d863f	winsys/amdgpu,radeon: pass vm_alignment to buffer_from_handle Acked-by: Christian König <christian.koenig@amd.com>	2018-11-28 20:20:27 -05:00
Marek Olšák	f737fe00a0	winsys/amdgpu: overallocate buffers for faster address translation on Gfx9 Sadly, the 3 games I tested (DeusEx:MD, DiRT Rally, DOTA 2) are unaffected by the overallocation, because I guess their buffers don't fall into the small range below a power-of-two size. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-28 20:20:27 -05:00
Marek Olšák	8c00f778fc	winsys/amdgpu: increase the VM alignment to the MSB of the size for Gfx9 Reviewed-by: Christian König <christian.koenig@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-28 20:20:27 -05:00
Marek Olšák	a2a6b06d48	winsys/amdgpu: use >= instead of > for VM address alignment Reviewed-by: Christian König <christian.koenig@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-28 20:20:27 -05:00
Marek Olšák	98f2312b4f	winsys/amdgpu: clean up code around BO VM alignment Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-28 20:20:27 -05:00
Marek Olšák	5f9ccf827e	winsys/amdgpu: optimize slab allocation for 2 MB amdgpu page tables - the slab buffer size increased from 128 KB to 2 MB (PTE fragment size) - the max suballocated buffer size increased from 64 KB to 256 KB, this increases memory usage because it wastes memory - the number of suballocators increased from 1 to 3 and they are layered on top of each other to minimize unused space in slabs The final increase in memory usage is: DeusEx:MD: 1.8% DOTA 2: 1.75% DiRT Rally: 0.2% The kernel driver will also receive fewer buffers.	2018-11-28 20:20:27 -05:00
Marek Olšák	cf6835485c	radeonsi: generalize the slab allocator code to allow layered slab allocators There is no change in behavior. It just makes it easier to change the number of slab allocators.	2018-11-28 20:20:27 -05:00
Marek Olšák	9576266a37	winsys/amdgpu: always reclaim/release slabs if there is not enough memory	2018-11-28 20:20:27 -05:00
Marek Olšák	015061beb3	radeonsi: fix is_oneway_access_only for bindless images	2018-11-28 20:20:27 -05:00
Marek Olšák	8c25ab1a23	radeonsi/nir: parse more information about bindless usage fill more tgsi_shader_info fields.	2018-11-28 20:20:27 -05:00
Marek Olšák	2a936f8afa	tgsi/scan: add more information about bindless usage radeonsi will use this.	2018-11-28 20:20:27 -05:00
Marek Olšák	fba91b5173	radeonsi: small cleanup for memory opcodes	2018-11-28 20:20:27 -05:00
Marek Olšák	709905cbb6	radeonsi: fix is_oneway_access_only for image stores We need to look at the Dst for image stores.	2018-11-28 20:20:27 -05:00
Marek Olšák	648dc52367	radeonsi: use structured buffer intrinsics for image views to stop using the workaround in si_make_buffer_descriptor.	2018-11-28 20:20:27 -05:00
Marek Olšák	442dae2693	radeonsi: clean up primitive binning enablement no change in behavior. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-28 20:20:27 -05:00
Dave Airlie	8eb8be3f54	virgl: fix undefined shift to use unsigned. Ported from virglrenderer. Signed-off-by: Dave Airlie <airlied@redhat.com>	2018-11-29 09:09:31 +10:00
Dave Airlie	2ddd44d941	r600: make suballocator 256-bytes align Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108311 Cc: <mesa-stable@lists.freedesktop.org>	2018-11-29 09:09:02 +10:00
Kenneth Graunke	f11780779f	intel/compiler: Use nir's info when checking uses_streams. Vulkan and Gallium don't use Mesa's gl_program data structure, so they can't poke at 'prog'. But we can simply use the copy of the shader info stored with the NIR shader, which is guaranteed to exist. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2018-11-28 13:35:29 -08:00
Jason Ekstrand	199a0353d6	nir/derefs: Add a nir_derefs_do_not_alias enum value This makes some of the code more clear. Reviewed-by: Thomas Helland <thomashelland90@gmail.com>	2018-11-28 14:29:25 -06:00
Gurchetan Singh	eb44c36cf1	egl: add missing #include <stddef.h> in egldevice.h Otherwise, I get this error: main/egldevice.h:54:13: error: ‘NULL’ undeclared (first use in this function) dev = NULL; ^~~~ with this config: ./autogen.sh --enable-gles1 --enable-gles2 --with-platforms='surfaceless' --disable-glx --with-dri-drivers="i965" --with-gallium-drivers="" --enable-gbm v3: Use stddef.h (Matt) v4: Modify commit message (Eric) Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-11-28 11:22:47 -08:00
Matt Turner	2d48d5116b	gallivm: Use nextafterf(0.5, 0.0) as rounding constant The common truncf(x + 0.5) fails for the floating-point value just less than 0.5 (nextafterf(0.5, 0.0)). nextafterf(0.5, 0.0) + 0.5, after rounding is 1.0, thus truncf does not produce the desired value. The solution is to add nextafterf(0.5, 0.0) instead of 0.5 before truncating. This works for all values. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2018-11-28 11:22:47 -08:00
Juan A. Suarez Romero	e2ad94d928	docs: update calendar, add news item and link release notes for 18.2.6 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2018-11-28 19:20:09 +01:00
Juan A. Suarez Romero	a53a280479	docs: add sha256 checksums for 18.2.6 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> (cherry picked from commit `cfd1f8b92c`)	2018-11-28 19:20:09 +01:00
Juan A. Suarez Romero	f6ab6e2867	docs: add release notes for 18.2.6 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> (cherry picked from commit `3e741344d7`)	2018-11-28 19:20:09 +01:00
Nicolai Hähnle	c02390f8fc	egl/wayland: rather obvious build fix Fixes: `ce74a7bb8d` ("egl/wayland: plug memory leak in drm_handle_device()") Fixes: `c59d3aa4b9` ("egl/wayland: bail out when drmGetMagic fails")	2018-11-28 18:30:36 +01:00


115447 commits
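
Commit 8eb8be3f54 (virgl) mentions fixing an undefined shift by using an unsigned operand. The listing does not show the affected code, so the following is only a generic sketch of that class of bug; the helper name bit_mask is hypothetical and not taken from virgl or virglrenderer.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: build a mask with a single bit set.
 * Writing (1 << bit) with a plain int literal overflows a 32-bit
 * signed int when bit == 31, which is undefined behavior in C.
 * Shifting an unsigned 32-bit value is well defined for bit < 32. */
static uint32_t bit_mask(unsigned bit)
{
    assert(bit < 32);
    return UINT32_C(1) << bit;
}
```

Calling bit_mask(31) yields 0x80000000 without relying on signed overflow. Sanitizers such as UBSan typically report the signed variant as a shift error.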