fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-02-02 21:50:34 +01:00

Author	SHA1	Message	Date
Alyssa Rosenzweig	e3bcce01d9	panfrost: Disable shader-assisted indirect draws Although it is passing all of dEQP-GLES31, it is failing a few KHR-GLES31.* tests. It also has performance issues at the moment. Invert the existing noindirect debug flag to become an indirect debug flag. Set this flag for dEQP-GLES31 CI on G52, to make sure the code doesn't bit rot in the hope someone will pick this up later on. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12478> (cherry picked from commit `a7f7d74137`) Conflicts: src/gallium/drivers/panfrost/pan_screen.c src/panfrost/lib/pan_util.h	2021-08-24 20:11:11 -07:00
Alyssa Rosenzweig	4d3e06e55c	panfrost: Handle non-dithered clear colours In `b9c095cc2c` ("panfrost: Rewrite the clear colour packing code"), packing of clear colours was corrected to use the tilebuffer's fractional bits, fixing dithering of the clear colour with formats like RGB565. Unfortunately, that commit did so unconditionally. If the framebuffer is dithered, but dithering is disabled at the time of the clear, we would incorrectly dither the clear. This is a regression, as the old (broken) code passed the relevant CTS test. What's the catch? Depending on dither state, there are two formulas to pack tilebuffer colours. We need to handle both. Fixes KHR-GLES31.core.draw_buffers_indexed.color_masks. Fixes: `b9c095cc2c` ("panfrost: Rewrite the clear colour packing code") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12460> (cherry picked from commit `22538b89b3`) Conflicts: src/panfrost/lib/tests/test-clear.c src/panfrost/vulkan/panvk_cmd_buffer.c	2021-08-24 16:28:24 -07:00
Erik Faye-Lund	12f77679a6	lavapipe: fix reported subpixel precision for lines We have no reason to report a subpixel precision of 4 for lines; in fact LLVMpipe uses 8 subpixel bits for lines, similar to other primitives. But let's use the pipe-cap for this instead of hard-coding it. Fixes: `9fbf6b2abf` ("lavapipe: implement VK_EXT_line_rasterization") Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12277> (cherry picked from commit `a16f3963d3`)	2021-08-24 15:59:31 -07:00
Vinson Lee	76d1828b97	freedreno: Require C++17. Commit `3a772be026` ("freedreno: Add perfetto renderpass support") uses C++17 init-statement feature. GCC ../src/gallium/drivers/freedreno/freedreno_perfetto.cc: In lambda function: ../src/gallium/drivers/freedreno/freedreno_perfetto.cc:148:11: warning: init-statement in selection statements only available with ‘-std=c++17’ or ‘-std=gnu++17’ 148 \| if (auto state = tctx.GetIncrementalState(); state->was_cleared) { \| ^~~~ Clang ../src/gallium/drivers/freedreno/freedreno_perfetto.cc:148:11: warning: 'if' initialization statements are a C++17 extension [-Wc++17-extensions] if (auto state = tctx.GetIncrementalState(); state->was_cleared) { ^ Intel C++ Compiler ../src/gallium/drivers/freedreno/freedreno_perfetto.cc(148): error: expected a ")" if (auto state = tctx.GetIncrementalState(); state->was_cleared) { ^ Fixes: `3a772be026` ("freedreno: Add perfetto renderpass support") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5193 Suggested-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Signed-off-by: Vinson Lee <vlee@freedesktop.org> Acked-by: Rob Clark <robdclark@chromium.org> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12293> (cherry picked from commit `4fc2a6cbdb`)	2021-08-24 15:59:29 -07:00
Dave Airlie	cc3e149f80	vulkan/wsi/sw: wait for image fence before submitting to queue With hw devices, when you submit a present, implicit sync will make sure the work submitted to the gpu on the client will end up happening before the present work submitted on the server. However with sw paths there is no real GPU, the lavapipe fake GPU thread is client side only and presenting is done directly from the pixmap (or later shared pixmap). In order for this to make sense the wsi common code should wait for the fence on the image before queueing the submit to the server so that all client work has been flushed to the pixmap before the copy or present operation is submitted. Fixes: `8004fa9c95` ("vulkan/wsi: add sw support. (v2)") Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12502> (cherry picked from commit `0cddfba328`)	2021-08-24 15:59:26 -07:00
Samuel Pitoiset	56f21793de	radv: fix copying depth+stencil images on compute Using separate aspects is required. Fixes few CTS failures (dEQP-VK.api.copy_and_blit.*) when the compute path is forced in the driver. Note that CTS coverage of compute queue is rather limited. Cc: 21.2 mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12287> (cherry picked from commit `be6bdb0918`)	2021-08-24 15:59:12 -07:00
Timothy Arceri	8f18e97dd7	glsl: fix variable scope for instructions inside case statements Fixes: `665d75cc5a` ("glsl: Fix scoping bug in if statements.") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5247 Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12435> (cherry picked from commit `02b394023b`)	2021-08-24 15:59:12 -07:00
Connor Abbott	387039e4d0	ir3/ra: Handle huge merge sets It can happen that we create an enormous merge set, even larger than the entire register file, in which case find_best_gap() would loop infinitely. This seems to be triggered more often with IR3_SHADER_DEBUG=spillall, since it actually happened with a CTS test. Just bail out in that case. Fixes: `0ffcb19b9d` ("ir3: Rewrite register allocation") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12033> (cherry picked from commit `efb34d6ee6`)	2021-08-24 15:59:11 -07:00
Connor Abbott	8e4d6692f3	ir3/ra: Fix available bitset for live-through collect srcs When we mark live-through sources that are merged with the destination as killed, we kept the bitsets in sync, but we forgot to keep them in sync when unmarking them after allocating the destination. The result was that "available" wasn't correct for any instruction afterwards. This resulted in a bad register allocation with IR3_SHADER_DEBUG=spillall for a dEQP-VK test. While we're changing this, use ra_foreach_src(). Fixes: `0ffcb19b9d` ("ir3: Rewrite register allocation") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12033> (cherry picked from commit `70c22d3894`)	2021-08-24 15:59:11 -07:00
Jason Ekstrand	e64eeb5240	anv: Set CONTEXT_PARAM_RECOVERABLE to false We want the kernel to ban our context immediately instead of foolhardily attempting to recover. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: mesa-stable@lists.freedesktop.org Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12476> (cherry picked from commit `a6a449837b`)	2021-08-24 15:59:10 -07:00
Simon Ser	2a88afe58a	v3d: implement resource_get_param Prior to this commit, the stride, offset and modifier were fetched via WINSYS_HANDLE_TYPE_KMS. However we can't make such a query succeed if the buffer couldn't be imported to the KMS device. Instead, implement the resource_get_param hook to allow users to fetch this information without WINSYS_HANDLE_TYPE_KMS. A tiny helper function is introduced to compute the modifier of a resource. Signed-off-by: Simon Ser <contact@emersion.fr> Fixes: `7bcb223639` ("v3d, vc4: Fix dmabuf import for non-scanout buffers") Reported-by: Roman Stratiienko <r.stratiienko@gmail.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12370> (cherry picked from commit `8de086e12f`) Conflicts: src/broadcom/ci/piglit-v3d-rpi4-fails.txt	2021-08-19 10:08:54 -07:00
Simon Ser	5d99b0869a	vc4: implement resource_get_param Prior to this commit, the stride, offset and modifier were fetched via WINSYS_HANDLE_TYPE_KMS. However we can't make such a query succeed if the buffer couldn't be imported to the KMS device. Instead, implement the resource_get_param hook to allow users to fetch this information without WINSYS_HANDLE_TYPE_KMS. A tiny helper function is introduced to compute the modifier of a resource. Signed-off-by: Simon Ser <contact@emersion.fr> Fixes: `7bcb223639` ("v3d, vc4: Fix dmabuf import for non-scanout buffers") Reported-by: Roman Stratiienko <r.stratiienko@gmail.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12370> (cherry picked from commit `b1fbceac6f`)	2021-08-19 10:07:57 -07:00
Simon Ser	5aa0faf512	panfrost: implement resource_get_param Prior to this commit, the stride, offset and modifier were fetched via WINSYS_HANDLE_TYPE_KMS. However we can't make such a query succeed if the buffer couldn't be imported to the KMS device. Instead, implement the resource_get_param hook to allow users to fetch this information without WINSYS_HANDLE_TYPE_KMS. Signed-off-by: Simon Ser <contact@emersion.fr> Fixes: `4c092947df` ("panfrost: fail in get_handle(TYPE_KMS) without a scanout resource") Reported-by: Roman Stratiienko <r.stratiienko@gmail.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12370> (cherry picked from commit `99fc6f7271`)	2021-08-19 10:07:56 -07:00
Simon Ser	e26269535d	etnaviv: add stride, offset and modifier to resource_get_param Prior to this commit, the stride, offset and modifier were fetched via WINSYS_HANDLE_TYPE_KMS. However we can't make such a query succeed if the buffer couldn't be imported to the KMS device. Instead, extend the resource_get_param hook to allow users to fetch this information without WINSYS_HANDLE_TYPE_KMS. Signed-off-by: Simon Ser <contact@emersion.fr> Fixes: `9da901d2b2` ("etnaviv: fail in get_handle(TYPE_KMS) without a scanout resource") Reported-by: Roman Stratiienko <r.stratiienko@gmail.com> Reviewed-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12370> (cherry picked from commit `b5919b0b10`)	2021-08-19 10:07:56 -07:00
Erik Faye-Lund	099a7f6ff9	gallium/nir/tgsi: initialize file_max for inputs When this was rewritten to support Vulkan, we stopped initializing file_max to -1 in the case of no inputs. This causes the draw module to go down a needlessly pessimistic case, printing an error while we're at it. Fixes: `42b5cfdbd2` ("gallivm/nir: fix vulkan vertex inputs") Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12440> (cherry picked from commit `63529782d3`)	2021-08-19 10:07:55 -07:00
Erik Faye-Lund	33ace70ae1	gallium/nir/tgsi: fixup indentation This was using mixed tabs and spaces, let's fix that before we start modifying the code. Fixes: `42b5cfdbd2` ("gallivm/nir: fix vulkan vertex inputs") Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12440> (cherry picked from commit `4674698008`)	2021-08-19 10:07:54 -07:00
Ilia Mirkin	1a3180d595	mesa: don't return errors for gl_* GetFragData* queries There is nothing in the spec about this. BindFragDataLocation* is supposed to return an error, but not Get. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5221 Fixes: `59012c3133` ("mesa: Implement glGetFragDataLocation") Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12333> (cherry picked from commit `bce19b3a77`)	2021-08-19 10:07:53 -07:00
Mike Blumenkrantz	fa069379b4	nir/lower_vectorize_tess_levels: set num_components for vectorized loads this otherwise explodes when rewriting e.g., a single array component load to a vec4 Fixes: `f5adf27fb9` ("nir,radv: add and use nir_vectorize_tess_levels()") fixes zmike/mesa#94 Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12419> (cherry picked from commit `649251ad4e`)	2021-08-19 10:07:35 -07:00
Erik Faye-Lund	1133671ea3	gallivm: fix texture-mapping with 16-bit result 16bit integer support also implies using 16-bit results when sampling textures. Because we're returning the results in float SSA values instead of int, we need to bitcast back to integers before truncating the values. Fixes: `00ff60f799` ("gallivm: add 16-bit integer support") Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12413> (cherry picked from commit `45a61f1782`)	2021-08-19 10:07:35 -07:00
Mao, Marc	8e0d61e1c7	iris: declare padding for iris_vue_prog_key Otherwise with some compilers/environments (Android) padding may contain garbage and memcmp of the key will fail. Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12438> (cherry picked from commit `fae1e99a15`)	2021-08-19 10:07:34 -07:00
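The iris_vue_prog_key entry above turns on how memcmp treats padding bytes. The following is a minimal C sketch of that idea, not the actual iris change: the struct `example_key` and its fields are hypothetical and do not reflect the real key layout. It only shows that naming the padding as a member lets an ordinary initializer zero it, so a byte-wise comparison sees defined values.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical key for illustration; NOT the real iris_vue_prog_key layout. */
struct example_key {
    uint32_t program_id;
    uint8_t  flag;
    uint8_t  padding[3]; /* declared explicitly so initializers zero it */
};

/* With the padding named as a member, no hidden bytes should remain. */
static_assert(sizeof(struct example_key) == 8, "unexpected implicit padding");

static struct example_key make_key(uint32_t program_id, uint8_t flag)
{
    /* Members not named in a designated initializer, including
     * padding[], are zero-initialized. */
    struct example_key key = { .program_id = program_id, .flag = flag };
    return key;
}

/* Byte-wise comparison, as a hash-table key lookup would typically do. */
static bool keys_equal(const struct example_key *a, const struct example_key *b)
{
    return memcmp(a, b, sizeof(*a)) == 0;
}
```

If the three trailing bytes were left as compiler-inserted padding instead, their contents would be unspecified and two logically identical keys could compare unequal.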
Samuel Pitoiset	1225326046	radv: fix fast clearing depth images with mips on GFX10+ Found by inspection. Cc: 21.2 mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12325> (cherry picked from commit `b16f3261a7`)	2021-08-19 10:07:33 -07:00
Jason Ekstrand	384184ac70	intel/isl: Add a missing assert in isl_tiling_get_intratile_offset_sa Fixes: `a4dafe1fad` "intel/isl: Make the offset helpers four dimensional" Acked-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11765> (cherry picked from commit `eb7c28bf24`)	2021-08-17 10:33:47 -07:00
Jason Ekstrand	9c01036e77	intel/isl: Explicitly set offset_B = 0 in get_uncomp_surf for arrays The only user of this case is iris which initializes offset_B to 0 so there's no real bug here. However, it is unexpected from an API PoV. Fixes: `9946120d2b` "intel/isl: Add more cases to isl_surf_get_uncompressed_surf" Acked-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11765> (cherry picked from commit `3702406154`)	2021-08-17 10:33:47 -07:00
Roman Stratiienko	7afc784345	lima: Implement lima_resource_get_param() callback Currently stride, offset, modifier is obtained by invoking lima_resource_get_handle() with WINSYS_HANDLE_TYPE_KMS. Before commit `47f000c170` this path was working. Obtained handle was simply ignored by DRI frontend and only requested data used. After commit `47f000c170` such requests started to fail when DRI is initialized using KMSRO and resource has no scanout data. When lima_resource_get_param() is implemented, it will be used in a first place to obtain resource data. Fixes: `47f000c170` ("lima: fail in get_handle(TYPE_KMS) without a scanout resource") Signed-off-by: Roman Stratiienko <r.stratiienko@gmail.com> Reviewed-by: Simon Ser <contact@emersion.fr> Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12362> (cherry picked from commit `5ec6b6e9bb`)	2021-08-17 10:33:46 -07:00
Marcin Ślusarz	8d4f8e7e6a	glsl/opt_algebraic: disable invalid optimization When operators other than eq and ne are involved we can't really move operands around and negate them because such transformation may change the value of the whole expression. Some examples: For unsigned var: 0 >= 1u + var would eventually become 0xffffffff >= var, which would always evaluate to true, when original expression was true only for var == 0xffffffff. For signed var: 0 >= 1 + var would become -1 >= var, which would evaluate to false for var == 2147483647, when original expression evaluated to true (because signed overflow is defined to wrap around in glsl, 1 + 2147483647 == -2147483648, so 0 >= -2147483648). Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5226 Fixes: `34ec1a24d6` ("glsl: Optimize (x + y cmp 0) into (x cmp -y).") Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12359> (cherry picked from commit `89bc8ff408`)	2021-08-17 10:33:45 -07:00
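The two counterexamples in the glsl/opt_algebraic entry above can be checked with a short C sketch. This is an illustration only and is not code from the GLSL compiler; all function names are hypothetical. Because signed overflow is undefined in C, the GLSL wrap-around is emulated through unsigned arithmetic, and the conversion back to int32_t assumes the usual two's complement behaviour of mainstream compilers.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* GLSL-style wrapping signed add, emulated with unsigned arithmetic.
 * Assumption: converting an out-of-range value to int32_t wraps, as it
 * does on mainstream two's complement compilers. */
static int32_t wrap_add_i32(int32_t a, int32_t b)
{
    return (int32_t)((uint32_t)a + (uint32_t)b);
}

/* Unsigned case: original expression "0 >= 1u + var". */
static bool orig_unsigned(uint32_t var)  { return 0u >= (uint32_t)(1u + var); }
/* Unsigned case after the invalid rewrite: "0xffffffff >= var". */
static bool xform_unsigned(uint32_t var) { return 0xffffffffu >= var; }

/* Signed case: original expression "0 >= 1 + var". */
static bool orig_signed(int32_t var)  { return 0 >= wrap_add_i32(1, var); }
/* Signed case after the invalid rewrite: "-1 >= var". */
static bool xform_signed(int32_t var) { return -1 >= var; }

/* Checks the counterexamples quoted in the commit message. */
static void check_counterexamples(void)
{
    /* Unsigned: the original holds only for var == 0xffffffff,
     * while the rewritten form holds for every var. */
    assert(orig_unsigned(0xffffffffu));
    assert(!orig_unsigned(0u));
    assert(xform_unsigned(0u));

    /* Signed: 1 + 2147483647 wraps to -2147483648, so the original is
     * true, while the rewritten form is false. */
    assert(orig_signed(INT32_MAX));
    assert(!xform_signed(INT32_MAX));
}
```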
Alyssa Rosenzweig	1bfddefadf	panfrost: Rewrite the clear colour packing code At the beginning of a render pass, the hardware will fill the tilebuffer with an arbitrary 128-bit word. To implement colour clears, the driver must pack the API-specific clear colour according to the 128-bit layout of the tilebuffer. This layout depends only on the render target format. The existing code to handle this was based on loose guesswork. It works for the format / clear colour combinations tested in dEQP-GLES3, but it is severely deficient in the general case. It works by matching on the PIPE format of the render target (not the layout of the tilebuffer). For special cased PIPE formats, it open codes a buggy pack routine. Otherwise, it defaults to util_pack_color in the hope that will work. Since util_pack_color doesn't know anything about Mali tilebuffer layouts, that means it's defaulting to wrong behaviour. Now that we understand internal tilebuffer layouts, let's rewrite the packing code. Instead of matching PIPE formats, map the PIPE format to the internal tilebuffer layout using the common table, ensuring the mapping remains in sync with the render target descriptor. Then for blendable tilebuffer formats, pack using a common float -> fixed point path supporting optional sRGB translation. Raw formats use util_pack_color as before. For formats with less than 8 bits per channel, the new code uses the fractional bits of the fixed-point representation. This is required for correct dithering if the clear colour is not exactly representable in the final low precision format. 
In summary, at least the following bugs in the old code are fixed: * Swapped R/B channels with sRGB * Swapped R/B channels with some missing formats * Incorrect dithering with RGB565, RGB5_A1 Fixes the following test cases: dEQP-EGL.functional.wide_color.window_8888_colorspace_srgb dEQP-EGL.functional.wide_color.pbuffer_8888_colorspace_srgb dEQP-EGL.functional.wide_color.window_888_colorspace_srgb dEQP-EGL.functional.wide_color.pbuffer_888_colorspace_srgb Later in the series, unit tests are added for the new implementation. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12365> (cherry picked from commit `b9c095cc2c`) Conflicts: src/panfrost/lib/pan_util.h	2021-08-16 11:42:04 -07:00
Icecream95	8d5da6b0b4	panfrost: Only allow colour blit shaders to be killed Fixes timeouts in SuperTuxKart with the advanced rendering pipeline. Fixes: `d034461921` ("panfrost: Set allow_forward_pixel_to_be_killed for blit") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12267> (cherry picked from commit `0624346a20`)	2021-08-16 11:40:39 -07:00
Alyssa Rosenzweig	4637f20a34	panfrost: Fix leak of render node fd Transfer ownership of the render node fd to the panfrost_device (minor change to panvk), and then close the file descriptor for the render node bound to the panfrost_device when destroying the panfrost_device. Of all the users of panfrost_open_device, panvk is the only one that correctly closed the fd before. Accordingly, this fixes an fd leak in the Gallium driver (and performance counter utilities). This fix still applies to the Gallium driver when renderonly is in use-- although renderonly closes its own fd, the fd is _duplicated_ in panfrost_drm_winsys.c, so renderonly and panfrost must _both_ close their respective fd to fix the leak. This fixes a crash when running dEQP-EGL for more than two hours. dEQP-EGL creates a new screen for every test case and then immediately destroys it. If destroying a screen leaks the fd, this causes the number of open file descriptors to increase monotonically until the process ends. This will eventually hit the system limit for number of open files and abort the process. This bug was identified while attempting to run the OpenGL ES conformance tests via cts-runner, and then confirmed with `lsof`. With the fix, the number of file descriptors reported by `lsof \| wc -l` is now constant while running dEQP-EGL as expected. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12346> (cherry picked from commit `76377de99b`)	2021-08-16 11:40:38 -07:00
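The ownership rule described in the render node fd entry above can be sketched in C as follows. This is a simplified model and not the panfrost code: `example_device` and the helper names are hypothetical, and `/dev/null` stands in for a render node. It mirrors the described situation where the caller's fd is duplicated, so both the original and the duplicate must be closed.

```c
#include <assert.h>
#include <fcntl.h>
#include <unistd.h>

/* Hypothetical stand-in for a device object that owns its fd. */
struct example_device {
    int fd;
};

/* Takes ownership of fd: the device is now responsible for closing it. */
static void example_device_open(struct example_device *dev, int fd)
{
    assert(fd >= 0);
    dev->fd = fd;
}

/* Releases the fd the device owns. */
static void example_device_close(struct example_device *dev)
{
    if (dev->fd >= 0) {
        close(dev->fd);
        dev->fd = -1;
    }
}

/* Models one create/destroy cycle: the caller opens an fd, a duplicate
 * is handed to the device, and each side closes its own copy. */
static int create_and_destroy_screen(void)
{
    int caller_fd = open("/dev/null", O_RDONLY); /* stand-in for a render node */
    if (caller_fd < 0)
        return -1;

    int dup_fd = dup(caller_fd);
    if (dup_fd < 0) {
        close(caller_fd);
        return -1;
    }

    struct example_device dev;
    example_device_open(&dev, dup_fd);
    example_device_close(&dev); /* device closes the duplicate */
    close(caller_fd);           /* caller closes the original  */
    return 0;
}

/* Leak probe: open() returns the lowest unused descriptor number. */
static int lowest_free_fd(void)
{
    int fd = open("/dev/null", O_RDONLY);
    if (fd >= 0)
        close(fd);
    return fd;
}
```

Comparing `lowest_free_fd()` before and after many cycles plays the same role as the `lsof | wc -l` check mentioned in the commit message: if either close were missing, the value would climb.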
Eric Engestrom	50c489f2a0	isl: drop left-over comment Fixes: `cf9ff082b4` ("isl: Bring back isl_format_layout::bpb") Signed-off-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3674> (cherry picked from commit `773a70f9cb`)	2021-08-16 11:40:37 -07:00
Eric Engestrom	533a3ebdd0	Revert "python: Explicitly add the 'L' suffix on Python 3" This reverts commit `ad363913e6`. This code was added to be able to compare the output file while porting the script from python2 to python3, but this has long been finished and the extra complexity is not needed anymore. Signed-off-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3674> (cherry picked from commit `93cb3aca03`)	2021-08-16 11:40:29 -07:00
Alyssa Rosenzweig	ca9ab792c6	drm-shim: Support kernels with >4k pages mmap requires its offset is page aligned, but the current code only guarantees 4k alignment, causing drm-shim to break badly on kernels with >4k page sizes. This fixes drm-shim on my Apple M1, running bare metal Linux with 16k pages. It probably also fixes exotic PowerPC systems with 64k pages. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Zoltan Boszormenyi <zboszor@gmail.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Emma Anholt <emma@anholt.net> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12347> (cherry picked from commit `38f39cc1e2`)	2021-08-16 11:40:28 -07:00
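The alignment problem in the drm-shim entry above comes down to simple arithmetic, sketched here in C. This is not the drm-shim implementation; the helper names are hypothetical. It shows that an offset aligned to 4k is not necessarily aligned to a 16k page, and that the page size can be queried at run time with the POSIX `sysconf(_SC_PAGESIZE)` call instead of being assumed.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>
#include <unistd.h>

/* Round value up to the next multiple of alignment (a power of two). */
static uint64_t align_up(uint64_t value, uint64_t alignment)
{
    assert(alignment != 0 && (alignment & (alignment - 1)) == 0);
    return (value + alignment - 1) & ~(alignment - 1);
}

/* Page size of the running kernel, queried rather than hard-coded. */
static uint64_t runtime_page_size(void)
{
    long size = sysconf(_SC_PAGESIZE);
    /* The 4096 fallback is an assumption for this sketch only. */
    return size > 0 ? (uint64_t)size : 4096;
}

/* mmap requires its offset to be a multiple of the page size. */
static bool offset_is_page_aligned(uint64_t offset, uint64_t page_size)
{
    return (offset % page_size) == 0;
}
```

For example, offset 4096 is valid with 4k pages but not with 16k pages, whereas `align_up(offset, runtime_page_size())` is valid on either kernel.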
Michel Zou	9ed17f7465	radv: fix build with mingw Cc: 21.2 mesa-stable Reviewed-by: Joshua Ashton <joshua@froggi.es> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Closes #5092 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12178> (cherry picked from commit `e4c0a34bfe`)	2021-08-13 10:28:41 -07:00
Lionel Landwerlin	809b4d1356	anv/android: handle image bindings from gralloc buffers When creating an image out of a swapchain on Android, the android layer call will detect a VkBindImageMemorySwapchainInfoKHR in the pNext chain of the vkBindImageMemory2() call and add a VkNativeBufferANDROID in the chain. This is what we should use as backing memory for that image. v2: Fix a couple of obvious mistakes (Tapani) v3: Silence build warning (Lionel) Fix invalid object argument to vk_error() (Lionel) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `bc3c71b87a` ("anv: don't try to access Android swapchains") Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5180 Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12244> (cherry picked from commit `19b7bbba73`)	2021-08-13 10:28:40 -07:00
Vinson Lee	ac87f88cff	nir: Initialize evaluate_cube_face_index_amd dst.x. Fix defect reported by Coverity Scan. Uninitialized scalar variable (UNINIT) uninit_use: Using uninitialized value dst.x. Fixes: `a1a2a8dfda` ("nir: add AMD_gcn_shader extended instructions") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12290> (cherry picked from commit `8d679f4f4e`)	2021-08-13 10:28:39 -07:00
Axel Davy	f7e77b7708	util: Fix translate from block compressed to rgba Since `2b5178ee` util: Switch the non-block formats to unpacking rgba rows instead of rects, compressed formats define unpack_rgba_8unorm_rect instead of unpack_rgba_8unorm. Fixes the u_format_translate check to take this into account. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5201 Fixes: `2b5178ee` ("util: Switch the non-block formats to unpacking rgba rows instead of rects") Signed-off-by: Axel Davy <davyaxel0@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12315> (cherry picked from commit `6a0e703512`)	2021-08-13 10:28:38 -07:00
Roland Scheidegger	996946b3fe	aux/cso: try harder to keep cso state in sync on cso context unbind Before `a73cb106a6`, cso contexts were never reused, but now that they are we need to be extra careful that the state in the cso context and in the pipe context matches even after an unbind, since when the cso context is reused the state might otherwise get out of sync (as there is no concept of "initial state", basically cso always relied on the default values being the same both in cso and the drivers). This fixes some errors we've seen internally with lavapipe. Fixes: `a73cb106a6` ("aux/cso: split cso_destroy_context into unbind and a destroy functions") Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12261> (cherry picked from commit `513fb5438b`)	2021-08-12 10:06:57 -07:00
Ian Romanick	5fb386c48d	Revert "nir/algebraic: Convert some f2u to f2i" Per https://gitlab.freedesktop.org/mesa/mesa/-/issues/5178#note_1019666, the assumption fundamental to this optimization is false. Section 2.4.1 (Float to Integer) of Ivy Bridge PRMs describes the situation. The wording of the section is somewhat confusing (because it doesn't clearly delineate between signed and unsigned integers), but the last two rows of the table make it clear that F->UD conversion clamps negative float values to 0. All other hardware mentioned in that thread seems to behave the same way. The real problem is that, with hardware that behaves in this way, converting f2u(2147483648.0) to f2i(2147483648.0) changes the bit pattern that would be produced from 0x80000000 to 0x7fffffff. This reverts commit `ad05920258`. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12297> (cherry picked from commit `84d2e53789`)	2021-08-12 10:06:57 -07:00
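The bit-pattern difference named in the f2u/f2i revert above can be reproduced with a small software model in C. This is a sketch under stated assumptions, not NIR or hardware code: the function names are hypothetical, the conversions are modelled as clamping at both ends of the integer range, and the NaN result of 0 is a modelling choice made only to keep the C well defined.

```c
#include <assert.h>
#include <stdint.h>

static_assert(sizeof(float) == 4, "sketch assumes 32-bit float");

/* Model of a float -> unsigned conversion that clamps to [0, UINT32_MAX].
 * Negative inputs clamp to 0, as the commit message describes. */
static uint32_t f2u_clamped(float f)
{
    if (f != f)                 /* NaN: returning 0 is an assumption */
        return 0;
    if (f <= 0.0f)
        return 0;
    if (f >= 4294967296.0f)     /* 2^32 and above */
        return UINT32_MAX;
    return (uint32_t)f;
}

/* Model of a float -> signed conversion that clamps to
 * [INT32_MIN, INT32_MAX]. */
static int32_t f2i_clamped(float f)
{
    if (f != f)                 /* NaN: returning 0 is an assumption */
        return 0;
    if (f >= 2147483648.0f)     /* 2^31 and above */
        return INT32_MAX;
    if (f <= -2147483648.0f)
        return INT32_MIN;
    return (int32_t)f;
}
```

Under this model `f2u_clamped(2147483648.0f)` yields 0x80000000 while `f2i_clamped(2147483648.0f)` yields 0x7fffffff, which is the mismatch that made the rewrite invalid.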
Lionel Landwerlin	6d2727b2dc	nir/lower_shader_calls: remove empty phis This is confusing opt_cse. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `8dfb240b1f` ("nir: Add raytracing shader call lowering pass.") Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11953> (cherry picked from commit `01b0935d31`)	2021-08-12 10:06:57 -07:00
Marcin Ślusarz	bcf16071a7	nir/builder: invalidate metadata per function Fixes: `a62098fff2` ("nir: Add a helper for general instruction-modifying passes.") Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12324> (cherry picked from commit `e1b325f587`)	2021-08-12 10:06:57 -07:00
Icecream95	1e4967c184	pan/bi: Use the computed scale for fexp NaN propagation This makes pow(NaN, x) return NaN rather than 1.0. Fixes: `499397700c` ("pan/bi: Don't lower fpow") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5189 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12269> (cherry picked from commit `ee2bb57f1e`)	2021-08-12 10:06:57 -07:00
Alyssa Rosenzweig	0a1f12a488	nir/lower_mediump: Fix metadata in all passes Fixes: `fb29cef8dd` ("nir: add many passes that lower and optimize 16-bit input/outputs and samplers") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11732> (cherry picked from commit `9b57a81815`)	2021-08-12 10:06:57 -07:00
Alyssa Rosenzweig	4b0f88e7e5	nir/lower_mediump_io: Don't remap base unless needed Otherwise drivers that don't use 16-bit slots for varyings will get confused and have their driver_locations scribbled over. This has caused multiple problems for both Panfrost and Asahi this week. Given the only other user of the pass for varyings is radeonsi, which needs both together, I think this is the least controversial fix. Fixes: `fb29cef8dd` ("nir: add many passes that lower and optimize 16-bit input/outputs and samplers") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11732> (cherry picked from commit `03c18f7efc`)	2021-08-12 10:06:57 -07:00
Tapani Pälli	71876890b6	crocus: disable depth and d+s formats with memory objects This is similar to i965 commit `ba11f673a2`, we set depth and d+s formats unsupported for now. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12330>	2021-08-12 06:32:53 +10:00
Tapani Pälli	962178d5ab	crocus: take a reference to memobj bo in crocus_resource_from_memobj This is the same fix as commit `2d87ea3166` for iris driver. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12330>	2021-08-12 06:32:49 +10:00
Dave Airlie	51310d7c30	intel/vec4: sel.cond writes the flags on Gfx4 and Gfx5 This is the equivalent of idr's intel/fs: sel.cond writes the flags on Gfx4 and Gfx5 except for the vec4 backend. This fixes buggy rendering seen with crocus on a qt trace. v2 (idr): Trivial whitespace change. Add unit tests. v3: Fix type in comment in unit tests. Noticed by Jason and Priit. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Iron Lake total instructions in shared programs: 8183077 -> 8184543 (0.02%) instructions in affected programs: 198990 -> 200456 (0.74%) helped: 0 HURT: 1355 HURT stats (abs) min: 1 max: 8 x̄: 1.08 x̃: 1 HURT stats (rel) min: 0.29% max: 6.00% x̄: 0.99% x̃: 0.70% 95% mean confidence interval for instructions value: 1.04 1.12 95% mean confidence interval for instructions %-change: 0.96% 1.03% Instructions are HURT. total cycles in shared programs: 238967672 -> 238962784 (<.01%) cycles in affected programs: 4666014 -> 4661126 (-0.10%) helped: 406 HURT: 314 helped stats (abs) min: 4 max: 54 x̄: 22.46 x̃: 18 helped stats (rel) min: <.01% max: 12.80% x̄: 1.82% x̃: 0.65% HURT stats (abs) min: 2 max: 112 x̄: 13.48 x̃: 12 HURT stats (rel) min: <.01% max: 7.82% x̄: 0.81% x̃: 0.16% 95% mean confidence interval for cycles value: -8.60 -4.98 95% mean confidence interval for cycles %-change: -0.87% -0.49% Cycles are helped. GM45 total instructions in shared programs: 4986888 -> 4988354 (0.03%) instructions in affected programs: 198990 -> 200456 (0.74%) helped: 0 HURT: 1355 HURT stats (abs) min: 1 max: 8 x̄: 1.08 x̃: 1 HURT stats (rel) min: 0.29% max: 6.00% x̄: 0.99% x̃: 0.70% 95% mean confidence interval for instructions value: 1.04 1.12 95% mean confidence interval for instructions %-change: 0.96% 1.03% Instructions are HURT. 
total cycles in shared programs: 153577826 -> 153572938 (<.01%) cycles in affected programs: 4666014 -> 4661126 (-0.10%) helped: 406 HURT: 314 helped stats (abs) min: 4 max: 54 x̄: 22.46 x̃: 18 helped stats (rel) min: <.01% max: 12.80% x̄: 1.82% x̃: 0.65% HURT stats (abs) min: 2 max: 112 x̄: 13.48 x̃: 12 HURT stats (rel) min: <.01% max: 7.82% x̄: 0.81% x̃: 0.16% 95% mean confidence interval for cycles value: -8.60 -4.98 95% mean confidence interval for cycles %-change: -0.87% -0.49% Cycles are helped. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12330>	2021-08-12 06:31:23 +10:00
Ian Romanick	e71eb0f2ea	intel/fs: sel.cond writes the flags on Gfx4 and Gfx5 On Gfx4 and Gfx5, sel.l (for min) and sel.ge (for max) are implemented using a separate cmpn and sel instruction. This lowering occurs in fs_visitor::lower_minmax which is called very, very late... a long, long time after the first calls to opt_cmod_propagation. As a result, conditional modifiers can be incorrectly propagated across sel.cond on those platforms. No tests were affected by this change, and I find that quite shocking. After just changing flags_written(), all of the atan tests started failing on ILK. That required the change in cmod_propagation (and the addition of the prop_across_into_sel_gfx5 unit test). Shader-db results for ILK and GM45 are below. I looked at a couple before and after shaders... and every case that I looked at had experienced incorrect cmod propagation. This affected a LOT of apps! Euro Truck Simulator 2, The Talos Principle, Serious Sam 3, Sanctum 2, Gang Beasts, and on and on... :( I discovered this bug while working on a couple new optimization passes. One of the passes attempts to remove condition modifiers that are never used. The pass made no progress except on ILK and GM45. After investigating a couple of the affected shaders, I noticed that the code in those shaders looked wrong... investigation led to this cause. v2: Trivial changes in the unit tests. v3: Fix type in comment in unit tests. Noticed by Jason and Priit. v4: Tweak handling of BRW_OPCODE_SEL special case. Suggested by Jason.
Fixes: `df1aec763e` ("i965/fs: Define methods to calculate the flag subset read or written by an fs_inst.") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Tested-by: Dave Airlie <airlied@redhat.com> Iron Lake total instructions in shared programs: 8180493 -> 8181781 (0.02%) instructions in affected programs: 541796 -> 543084 (0.24%) helped: 28 HURT: 1158 helped stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 helped stats (rel) min: 0.35% max: 0.86% x̄: 0.53% x̃: 0.50% HURT stats (abs) min: 1 max: 3 x̄: 1.14 x̃: 1 HURT stats (rel) min: 0.12% max: 4.00% x̄: 0.37% x̃: 0.23% 95% mean confidence interval for instructions value: 1.06 1.11 95% mean confidence interval for instructions %-change: 0.31% 0.38% Instructions are HURT. total cycles in shared programs: 239420470 -> 239421690 (<.01%) cycles in affected programs: 2925992 -> 2927212 (0.04%) helped: 49 HURT: 157 helped stats (abs) min: 2 max: 284 x̄: 62.69 x̃: 70 helped stats (rel) min: 0.04% max: 6.20% x̄: 1.68% x̃: 1.96% HURT stats (abs) min: 2 max: 48 x̄: 27.34 x̃: 24 HURT stats (rel) min: 0.02% max: 2.91% x̄: 0.31% x̃: 0.20% 95% mean confidence interval for cycles value: -0.80 12.64 95% mean confidence interval for cycles %-change: -0.31% <.01% Inconclusive result (value mean confidence interval includes 0). GM45 total instructions in shared programs: 4985517 -> 4986207 (0.01%) instructions in affected programs: 306935 -> 307625 (0.22%) helped: 14 HURT: 625 helped stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 helped stats (rel) min: 0.35% max: 0.82% x̄: 0.52% x̃: 0.49% HURT stats (abs) min: 1 max: 3 x̄: 1.13 x̃: 1 HURT stats (rel) min: 0.12% max: 3.90% x̄: 0.34% x̃: 0.22% 95% mean confidence interval for instructions value: 1.04 1.12 95% mean confidence interval for instructions %-change: 0.29% 0.36% Instructions are HURT. 
total cycles in shared programs: 153827268 -> 153828052 (<.01%) cycles in affected programs: 1669290 -> 1670074 (0.05%) helped: 24 HURT: 84 helped stats (abs) min: 2 max: 232 x̄: 64.33 x̃: 67 helped stats (rel) min: 0.04% max: 4.62% x̄: 1.60% x̃: 1.94% HURT stats (abs) min: 2 max: 48 x̄: 27.71 x̃: 24 HURT stats (rel) min: 0.02% max: 2.66% x̄: 0.34% x̃: 0.14% 95% mean confidence interval for cycles value: -1.94 16.46 95% mean confidence interval for cycles %-change: -0.29% 0.11% Inconclusive result (value mean confidence interval includes 0). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12330>	2021-08-12 06:31:23 +10:00
Dave Airlie	c24604acb8	crocus: align staging resource pitch on gen4/5 to allow BLT usage. Aligning the pitch to 4 bytes allows the BLT engine to be used for transfers to/from these surfaces. Fixes: `f3630548f1` ("crocus: initial gallium driver for Intel gfx 4-7") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12330>	2021-08-12 06:31:23 +10:00
Dave Airlie	d9287281d1	crocus/blt: add pitch/offset checks to fix blt corruption I lost these in my conversion from i965 but they are necessary. This should fix corruption in qt fonts as seen in the minecraft launcher. Fixes: `f3630548f1` ("crocus: initial gallium driver for Intel gfx 4-7") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12330>	2021-08-12 06:31:23 +10:00
Bas Nieuwenhuizen	403e7213b2	radv: Use correct signedness in misalign test. Lots of the MAX2 args end up subtracting two unsigned numbers, which blows up when the result is negative. Fixes: `4c99d6ff54` ("radv: flush L2 for images affected by the pipe misaligned issue on GFX10+") Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12272> (cherry picked from commit `b2b1e8e40a`) Conflicts: src/amd/vulkan/radv_image.c	2021-08-10 11:16:50 -07:00
Marcin Ślusarz	b17e7ddbae	glsl: evaluate switch expression once v2: initialize test_val in constructor Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5185 Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Cc: mesa-stable Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12234> (cherry picked from commit `bdae3c366e`)	2021-08-10 10:36:54 -07:00
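
Note on `403e7213b2` (radv: Use correct signedness in misalign test): the commit message describes `MAX2` arguments that subtract two unsigned numbers and "blow up" when the result would be negative. The following is a minimal sketch of that general pitfall, not the actual radv code. The `MAX2` macro here is a local definition modelled on the usual max-of-two macro, and the function names `clamped_diff_unsigned` and `clamped_diff_signed` are hypothetical, chosen only for illustration.

```c
#include <stdint.h>

/* Local max-of-two macro for this sketch (an assumption, not copied from radv). */
#define MAX2(a, b) ((a) > (b) ? (a) : (b))

/* Problematic pattern: the subtraction happens in unsigned arithmetic, so a
 * difference that should be negative wraps around to a very large positive
 * value, and MAX2 then selects that wrapped value instead of 0. */
static inline uint32_t clamped_diff_unsigned(uint32_t a, uint32_t b)
{
    return MAX2(0u, a - b);
}

/* Safer pattern: convert to a signed type before subtracting, so a negative
 * difference stays negative and MAX2 clamps it to 0 as intended.
 * (Assumes both inputs fit in int32_t.) */
static inline int32_t clamped_diff_signed(uint32_t a, uint32_t b)
{
    return MAX2(0, (int32_t)a - (int32_t)b);
}
```

With `a = 2` and `b = 5`, the unsigned variant yields the wrapped value `2 - 5 mod 2^32` rather than 0, while the signed variant clamps to 0.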

131799 commits