fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-04-24 02:50:39 +02:00

Author	SHA1	Message	Date
Axel Davy	f91f748fab	st/nine: Fix volumetexture dtor on ctor failure The dtor is called on allocation failure, thus we must check the volumes are allocated before trying to release them. Signed-off-by: Axel Davy <davyaxel0@gmail.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Cc: mesa-stable@lists.freedesktop.org	2018-12-23 08:14:50 +01:00
Axel Davy	1cc8192ad0	st/nine: Switch to presentation buffer if resize is detected This enables to match the window size on resize on all cases, as it only works currently with presentation buffers. Signed-off-by: Axel Davy <davyaxel0@gmail.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-12-23 08:14:50 +01:00
Axel Davy	c442dd7890	st/nine: Use helper to release swapchain buffers later This patch introduces a structure to release the present_handles only when they are fully released by the server, thus making "DestroyD3DWindowBuffer" actually release the buffer right away when called. Signed-off-by: Axel Davy <davyaxel0@gmail.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-12-23 08:14:50 +01:00
Rob Clark	51a44c3aac	freedreno/a6xx: fix 3d texture layout Maybe not 100% perfect, but seems to be a pretty good approximation of that. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-22 15:29:15 -05:00
Rob Clark	8f60f1381d	freedreno: update generated headers Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-22 15:28:50 -05:00
Rob Clark	be9ec158d7	freedreno/a6xx: improve setup_slices() debug msgs Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-22 15:28:24 -05:00
Rob Clark	2b497fc507	freedreno/a6xx: simplify special case for 3d layout This logic can be re-written as the two cases for 3d (ie. before/after the miplevel sizes start reducing) vs everything else. I think it is easier to read this way. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-22 15:27:57 -05:00
Rob Clark	d71a50f831	freedreno: combine fd_resource_layer_offset()/fd_resource_offset() We really only need this logic in one place. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-22 15:27:37 -05:00
Rob Clark	6667dde098	freedreno/ir3: don't treat all inputs/outputs as vec4 This was a hold-over from the early TGSI days, and mostly not needed with NIR. This avoids burning an entire 4 consecutive scalar regs for vec3 outputs, for example. Which fixes a few places that we were doing worse that we should on register usage. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-22 15:27:21 -05:00
Rob Clark	3453814622	freedreno/ir3: fix fallout of extra assert Fixes the following crash that happened after `d6110d4d` The problem happens if we first compile a "vanilla" shader with nothing lowered in NIR, which perform the final lowering passes on so->shader-> nir (including nir_lower_locals_to_regs()), and then later we have compile a shader with some lowering. The second time through we would have already done nir_lower_locals_to_regs(). Arguably this was already a bug, just one we hadn't noticed yet. Fixes: `d6110d4d54` intel/compiler: move nir_lower_bool_to_int32 before nir_lower_locals_to_regs Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-21 19:04:22 -05:00
Kenneth Graunke	626f2477ab	st/nir: Drop unused gl_program parameter in VS input handling helper. Nobody uses this, so let's drop it. This makes the helper callable from places without a gl_program. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-21 15:29:32 -08:00
Kenneth Graunke	3a78b46e59	st/nir: Gather info after applying lowering FS variant features DrawPixels lowering, for example, adds new varyings that need to be accounted for in inputs_read. The earlier info gathering at link time cannot account for this. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-21 15:29:30 -08:00
Kenneth Graunke	bcb6f19947	st/mesa: Combine the DrawPixels and Bitmap passthrough VS programs. They're now identical, so we can just compile it once. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-21 15:29:29 -08:00
Kenneth Graunke	80dd9dfe33	st/mesa: Don't open code the drawpixels vertex shader. Now that we always copy color, we can just use the util function. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-21 15:29:28 -08:00
Kenneth Graunke	ed1a356c5e	st/mesa: Drop !passColor optimization in drawpixels shaders. The glDrawPixels passthrough vertex shader copies position and texcoord vertex attributes to varying outputs. It also optionally copies a third gl_Color attribute, which sometimes is unnecessary. Until now, we've compiled separate variants of the shader, one of which does this extra copy, and the other of which doesn't. We have done this since 2007. But, the vertex shader runs for a whopping four vertices, and so the cost of a copying a single input to output is likely inconsequential. In theory, we could bind one fewer vertex element - but we always bind all three regardless. So, we don't even get that savings. This patch unifies the two, so we always copy the optional color, and save having to compile the variant. It also makes the VS input interface match up with the vertex element state without any dead (unused) input attributes. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-21 15:29:25 -08:00
Kenneth Graunke	42d31e0516	st/mesa: Drop dead 'passthrough_fs' field. Dead since 2015 (commit `5142564734`). Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-21 15:29:20 -08:00
Bas Nieuwenhuizen	bba5749484	radv: Fix wrongly positioned paren. Trivial. Fixes: `9f0bfbed11` "radv: Work around non-renderable 128bpp compressed 3d textures on GFX9."	2018-12-21 21:06:55 +01:00
Roland Scheidegger	171983dc89	gallivm: abort when trying to use non-existing intrinsic Whenever llvm removes an intrinsic (we're using), we're hitting segfaults due to llvm doing calls to address 0 in the jitted code instead. However, Jose figured out we can actually detect this with LLVMGetIntrinsicID(), so use this to abort, so we don't have to wonder what got broken. (Of course, someone still needs to fix the code to no longer use this intrinsic.) Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2018-12-21 17:37:00 +01:00
Roland Scheidegger	f3b1acff48	gallivm: don't use pavg.b intrinsic on llvm >= 6.0 This intrinsic disppeared with llvm 6.0, using it ends up in segfaults (due to llvm issuing call to NULL address in the jited shaders). Add code doing the same thing as the autoupgrade code in llvm so it can be matched and replaced back with a pavgb. While here, also improve lp_test_format, so it tests both with and without cache (as it was, it tested the cache versions only, whereas cache is actually disabled in llvmpipe, and in any case even with it enabled vertex and geometry shaders wouldn't use it). (Although at least for the unorm8 uncached fetch, the code is still quite different to what llvmpipe is using, since that would use unorm8x16 type, whereas the test code is using unorm8x4 type, hence disabling some intrinsic paths.) Fixes: `6f4083143b` ("gallivm: use llvm jit code for decoding s3tc") Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com>	2018-12-21 17:35:05 +01:00
Emil Velikov	f331419f26	glx/test: meson: assorted include fixes Swap '..' with the symbolic inc_glx and add glproto as dependency. That will pull the correct include, effectively fixing the tests on macOS. Fixes: `a47c525f32` ("meson: build glx") Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-12-12 19:24:14 +00:00
Emil Velikov	e139d7a8a3	glx: meson: wire up the dispatch-index-check test Accidentally dropped with earlier commit.! Fixes: `4ccb981673` ("meson: Use consistent style for tests") Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-12-12 19:07:52 +00:00
Emil Velikov	b44875e2dc	glx: meson: drop includes from a link-only library When producing the final libGL.so/libGLX_mesa.so we only link the local static helper lib (libglx). Thus there's no reason for the includes. Fixes: `a47c525f32` ("meson: build glx") Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-12-12 17:55:08 +00:00
Emil Velikov	9527f9ea26	TODO: glx: meson: build dri based glx tests, only with -Dglx=dri The library itself (libGL) is only built when -Dglx=dri, yet it's accompanying tests are build even with -Dglx=xlib. Adjust the guards, so we don't build the tests when they are not applicable v2: - Reword commit message (Dylan) - Drop build_by_default hunk (Dylan) Fixes: `a47c525f32` ("meson: build glx") Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-12-12 17:47:36 +00:00
Emil Velikov	2eedb79e1a	pipe-loader: meson: reference correct library The library is called libgalliumvl_stub - note singular. Fixes: `42ea0631f1` ("meson: build clover") Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-12-13 04:10:50 +00:00
Eric Anholt	17218a0406	vc4: Hook up perf_debug() output to GL_ARB_debug_output as well. This is the right channel to report these things, so that end-users don't need to know each driver's custom debug options.	2018-12-20 11:31:25 -08:00
Rhys Kidd	acc481ad79	vc4: Wire up core pipe_debug_callback This lets the driver use pipe_debug_message() for GL_ARB_debug_output. Signed-off-by: Rhys Kidd <rhyskidd@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2018-12-20 11:31:19 -08:00
Eric Anholt	ba36312fbd	v3d: Hook up perf_debug() output to GL_ARB_debug output as well. This is the right channel to report these things, so that end-users don't need to know each driver's custom debug options.	2018-12-20 11:31:19 -08:00
Rhys Kidd	d3991d2472	v3d: Wire up core pipe_debug_callback This lets the driver use pipe_debug_message() for GL_ARB_debug_output. Signed-off-by: Rhys Kidd <rhyskidd@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2018-12-20 11:31:16 -08:00
Eric Anholt	d80761b8f3	v3d: Drop shadow comparison state from shader variant key. The shadow state is now in the sampler.	2018-12-20 11:29:30 -08:00
Eric Anholt	0e2758daad	v3d: Fix simulator mode on i915 render nodes. i915 render nodes refuse the dumb ioctls, so the simulator would crash on the original non-apitrace shader-db. Replace them with direct i915 calls if we detect that we're on one of their gem fds.	2018-12-20 11:29:30 -08:00
Samuel Pitoiset	9606310081	radv: enable shaderStorageImageMultisample feature on GFX8+ Untested on older chips. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-20 18:01:19 +01:00
Samuel Pitoiset	6b976024a8	radv: add support for FMASK expand Original patch by Dave Airlie. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-20 18:01:17 +01:00
Samuel Pitoiset	fa16da53d8	radv: initialize FMASK for images in fully expanded mode The value depends on the number of samples. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-20 18:01:15 +01:00
Samuel Pitoiset	65d82c84d2	ac/nir: restrict fmask lookup to image load intrinsics We don't ever want to do the fmask lookup on a atomic or store, the fmask should have been decompressed if the surface has been moved to IMAGE_LAYOUT. Original patch by Dave Airlie. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-20 18:01:11 +01:00
Samuel Pitoiset	f45e43e156	spirv: add support for SpvCapabilityStorageImageMultisample Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-20 18:01:09 +01:00
Samuel Pitoiset	5b1ec10e4c	radv: compute optimal VM alignment for imported buffers This fixes GPU hangs on GFX9 with dEQP-VK.memory.external_memory_host.bind_image_memory_and_render.with_zero_offset.* Copied from RadeonSI. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-20 17:34:04 +01:00
Bas Nieuwenhuizen	9f0bfbed11	radv: Work around non-renderable 128bpp compressed 3d textures on GFX9. Exactly what title says, the new addrlib does not allow the above with certain dimensions that the CTS seems to hit. Work around it by not allowing the app to render to it via compat with other 128bpp formats and do not render to it ourselves during copies. Fixes: `776b911365` "amd/addrlib: update Mesa's copy of addrlib" Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-12-20 15:07:20 +01:00
Samuel Pitoiset	5c7935f8fc	radv: fix subpass image transitions with multiviews The driver needs to decompress all image layers if a fast depth/color clear has been performed. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-20 13:36:37 +01:00
Samuel Pitoiset	0a7e767e58	radv: drop the amdgpu-skip-threshold=1 workaround for LLVM 8 This workaround has been introduced by `135e4d434f` for fixing DXVK GPU hangs with many games. It is no longer needed since LLVM r345718. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-20 12:09:57 +01:00
Samuel Pitoiset	576040f2e5	ac/nir: remove the bitfield_extract workaround for LLVM 8 This workaround has been introduced by `3d41757788` and it is no longer needed since LLVM r346422. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-20 09:40:16 +01:00
Iago Toral Quiroga	d6110d4d54	intel/compiler: move nir_lower_bool_to_int32 before nir_lower_locals_to_regs The former expects to see SSA-only things, but the latter injects registers. The assertions in the lowering where not seeing this because they asserted on the bit_size values only, not on the is_ssa field, so add that assertion too. Fixes: `11dc130779` "nir: Add a bool to int32 lowering pass" CC: mesa-stable@lists.freedesktop.org Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-12-20 08:02:44 +01:00
Ilia Mirkin	1250383e36	st/mesa: remove sampler associated with buffer texture in pbo logic A long time ago, when this was first implemented, not having a sampler bound would cause problems on Fermi. I didn't work out the reasons, but the solution was simple -- just put the samplers back in. Since then, regular texturing paths appear to have lost their associated samplers which required a fuller investigation and fix in nouveau. Now that this is done, this code should no longer need a sampler state for fetching texels from a buffer texture. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-20 00:27:16 -05:00
Roland Scheidegger	6f4083143b	gallivm: use llvm jit code for decoding s3tc This is (much) faster than using the util fallback. (Note that there's two methods here, one would use a cache, similar to the existing code (although the cache was disabled), except the block decode is done with jit code, the other directly decodes the required pixels. For now don't use the cache (being direct-mapped is suboptimal, but it's difficult to come up with something better which doesn't have too much overhead.) Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2018-12-20 06:03:20 +01:00
Jason Ekstrand	ec1d5841fa	radv/query: Use 1-bit booleans in query shaders Fixes: `44227453ec` "nir: Switch to using 1-bit Booleans for almost..." Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Tested-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-19 16:36:40 -06:00
Jason Ekstrand	6896c91c10	radv/query: Add a nir_test_flag helper This is little more than an iadd_imm right now but it will help in the next commit where we refactor things further. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Tested-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-12-19 16:36:26 -06:00
Eduardo Lima Mitev	c2ebc38052	freedreno/ir3: Handle GL_NONE in get_num_components_for_glformat() An earlier patch that introduced the function failed to handle the case where an image format layout qualifier is not specified, which is allowed on desktop GL profiles. In these cases, nir_variable's image format is GL_NONE, and we don't need to print a debug message for those. Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Rob Clark <robdclark@gmail.com>	2018-12-19 22:49:05 +01:00
Eric Anholt	fcfb7f573c	v3d: Load and store aligned utiles all at once. This calls the expensive uif offset function once per utile, but it still gets us a 212.218% +/- 2.41216% (n=10) win on 1024x1024 glTexImage over calling it on each pixel.	2018-12-19 10:27:26 -08:00
Eric Anholt	7c56b7a6ea	v3d: Add a fallthrough path for utile load/store of 32 byte lines. Now that V3D has 8 byte per pixel formats exposed, we've got stride==32 utiles to load and store. Just handle them through the non-NEON paths for now.	2018-12-19 10:27:26 -08:00
Eric Anholt	f6a0f4f41e	vc4: Move the utile load/store functions to a header for reuse by v3d. These implementations of whole-utile load/stores would be the same for v3d, though the layouts of blocks of utiles has changed.	2018-12-19 10:27:26 -08:00
Eric Anholt	8ee752194c	v3d: Implement texture_subdata to reduce teximage upload copies. This lets us store the non-PBO glTexImage data directly into the tiled image without making an extra untiled memcpy for the gallium transfer. Improves 1024x1024 TexImage perf by ~19%, mostly from not thrashing around in the kernel mapping and unmapping the transfer's temporary area.	2018-12-19 10:27:26 -08:00

1 2 3 4 5 ...

98099 commits