fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-07 04:58:05 +02:00

Author	SHA1	Message	Date
Timothy Arceri	49025292fb	radeonsi: add config entry for Counter-Strike Global Offensive This fixes rendering issues with gun scopes which is rather important. Cc: "19.0" "19.1" <mesa-stable@lists.freedesktop.org> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=100239	2019-05-07 09:42:09 +10:00
Vasily Khoruzhick	d085920b64	lima/gpir: fix float uniform alignment issue If PIPE_CAP_PACKED_UNIFORMS is not set uniforms are vec4 aligned, so lima_nir_lower_uniform_to_scalar should use first channel of vec4 for float uniforms. Reviewed-by: Qiang Yu <yuq825@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-05-06 14:08:09 -07:00
Erik Faye-Lund	d84b85bc28	draw: flush when setting stream-out targets We need to re-prepare the middle-end state to pick up changes to this state to react correctly to pausing/resuming stream-out. So let's add a flush here. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Fixes: `ec8cbd79ac` "draw/softpipe: EXT_transform_feedback support (v2)" Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2019-05-06 22:42:37 +02:00
Erik Faye-Lund	ed53e61bec	llvmpipe: pass stream-out targets to draw-module early We currently set this state in the draw-module twice on each draw, but which trashes this state. So far that's not a problem, because we don't really do much from that function. But it turns out, we're going to have to do more; namely flush when the state changes. This will incur a large performance penalty due to the excessive setting. Instead, let's rely on the CSO caching making sure that llvmpipe_set_so_targets doesn't get called needlessly, and setup the state directly there instead. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2019-05-06 22:42:37 +02:00
Uros Bizjak	fc7649c4b7	doc: Update GL_KHR_robustness in features.txt for r600 glxinfo for Cypress XT [Radeon HD 5870] lists GL_KHR_robustness as supported extension. This was the last missing extension for GL 4.5, so Mark GL 4.5 as all DONE for r600. Reviewed-by: Dave Airlie <airlied@redhat.com>	2019-05-07 06:21:48 +10:00
Chia-I Wu	c7078397ca	virgl: do not use inline writes for subdata Inline writes skip transfer map/unamp at the cost of an extra copy on the data during execbuffer. That is generally a win for small transfers. But the heuristic to use inline writes based on buffer sizes rather than transfer sizes makes little sense. More importantly, inline writes miss optimizations that are done for buffer transfers. Let's just use transfers. Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-By: Gert Wollny <gert.wollny@collabora.com>	2019-05-06 10:31:56 -07:00
Chia-I Wu	898be8036d	virgl: rework queries virglrender has been changed such that - VIRGL_CCMD_GET_QUERY_RESULT is fenced - query buffers (PIPE_BIND_CUSTOM) are coherent We can check if a query is ready using DRM_IOCTL_VIRTGPU_WAIT, and also avoid a synchronized transfer to retrieve the query result. When running against an older virglrenderer, it falls back to the old behavior automatically. TF2 @ 640x480 for pts4.dem went from 17fps to 40fps on my testing machine. Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Gurchetan Singh <gurchetansingh@chromium.org>	2019-05-06 10:20:40 -07:00
Chia-I Wu	b4da53b0c3	virgl: export resource_is_busy from winsys Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Gurchetan Singh <gurchetansingh@chromium.org>	2019-05-06 10:20:38 -07:00
Samuel Pitoiset	c10808441c	radv: fix rowPitch for R32G32B32 formats on GFX9 The pitch is actually the number of components per row. We found the problem when we implemented some meta operations for these formats and the wrong pitch has been confirmed with a small test case. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108325 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-05-06 19:07:44 +02:00
Kenneth Graunke	a032a9665f	iris: Enable PIPE_CAP_SURFACE_REINTERPRET_BLOCKS This makes CompressedTexSubImage from a PBO source do proper GPU rendering to upload instead of stalling to map the PBO source on the CPU (then copying it on the CPU). Thanks Bas Nieuwenhuizen for pointing out that Vulkan includes this functionality, and to Jason Ekstrand for writing the code I adapted. Vulkan only supports a single layer, however, and this code tries to support multiple layers as long as it's miplevel 0. Improves performance in Sid Meier's Civilization VI: Average frame time (ms): -3.67423% +/- 1.46201% (n=5) 99th percentile frame time (ms): -5.09910% +/- 3.87874% (n=5)	2019-05-06 09:50:32 -07:00
Bas Nieuwenhuizen	8139efbbbd	radv: Use given stride for images imported from Android. Handled similarly as radeonsi. I checked the offsets are actually used. Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-05-06 15:36:39 +00:00
Erico Nunes	11602ccd5d	lima/ppir: abort compilation in case of unsupported intrinsic Currently ppir continues compilation when there is an unsupported intrinsic, resulting in a shader that will surely not work as intended. This is a problem during piglit runs as some tests don't compile properly due to this but actually still get submitted to the gpu and leave the system in an unstable state after executing, causing further tests to fail. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com>	2019-05-06 17:15:27 +02:00
Erico Nunes	60a128fe81	lima/ir: print names of unsupported intrinsics While lima still doesn't support some kinds of intrinsics, it is more helpful to display the name of the unsupported instr->intrinsic to make debugging easier. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com>	2019-05-06 17:15:06 +02:00
John Stultz	c7f2145b4b	mesa: Makefile.sources: Add nir_lower_fb_read.c to Makefile.sources list In commit `a99c360a46` (nir: add pass to lower fb reads), a new file was added that needs to also be added to the Makefile.sources list used by the Android and SCons build system. Cc: Rob Clark <robdclark@chromium.org> Cc: Emil Velikov <emil.l.velikov@gmail.com> Cc: Amit Pundir <amit.pundir@linaro.org> Cc: Sumit Semwal <sumit.semwal@linaro.org> Cc: Alistair Strachan <astrachan@google.com> Cc: Greg Hartman <ghartman@google.com> Cc: Tapani Pälli <tapani.palli@intel.com> Cc: Jason Ekstrand <jason@jlekstrand.net> Fixes: `a99c360a46` ("nir: add pass to lower fb reads") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Signed-off-by: John Stultz <john.stultz@linaro.org>	2019-05-06 11:29:26 +00:00
John Stultz	d04f44a459	mesa: Makefile.sources: Add ir3_nir_lower_load_barycentric_at_sample/offset to Makefile.sources In commit `2f0b9d2249` ("freedreno/ir3: lower load_barycentric_at_offset") a new file was added that needs to also be added to the Makefile.sources list used by Android and SCons build system. Cc: Rob Clark <robdclark@chromium.org> Cc: Emil Velikov <emil.l.velikov@gmail.com> Cc: Amit Pundir <amit.pundir@linaro.org> Cc: Sumit Semwal <sumit.semwal@linaro.org> Cc: Alistair Strachan <astrachan@google.com> Cc: Greg Hartman <ghartman@google.com> Cc: Tapani Pälli <tapani.palli@intel.com> Cc: Jason Ekstrand <jason@jlekstrand.net> Fixes: `2f0b9d2249` ("freedreno/ir3: lower load_barycentric_at_offset") Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Signed-off-by: John Stultz <john.stultz@linaro.org>	2019-05-06 11:29:26 +00:00
John Stultz	c935862127	mesa: android: freedreno: Fix build failure due to path change The ir3_nir_trig.py file was moved in a previous commit, `aa0fed10d3` (freedreno: move ir3 to common location), so update the Android.gen.mk file to match. Cc: Rob Clark <robdclark@chromium.org> Cc: Emil Velikov <emil.l.velikov@gmail.com> Cc: Amit Pundir <amit.pundir@linaro.org> Cc: Sumit Semwal <sumit.semwal@linaro.org> Cc: Alistair Strachan <astrachan@google.com> Cc: Greg Hartman <ghartman@google.com> Cc: Tapani Pälli <tapani.palli@intel.com> Cc: Jason Ekstrand <jason@jlekstrand.net> Fixes: `aa0fed10d3` ("freedreno: move ir3 to common location") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Signed-off-by: John Stultz <john.stultz@linaro.org>	2019-05-06 11:29:26 +00:00
Amit Pundir	88105375c9	mesa: android: freedreno: build libfreedreno_{drm,ir3} static libs Add libfreedreno_drm/ir3 to the build Cc: Rob Clark <robdclark@chromium.org> Cc: Emil Velikov <emil.l.velikov@gmail.com> Cc: Amit Pundir <amit.pundir@linaro.org> Cc: Sumit Semwal <sumit.semwal@linaro.org> Cc: Alistair Strachan <astrachan@google.com> Cc: Greg Hartman <ghartman@google.com> Cc: Tapani Pälli <tapani.palli@intel.com> Cc: Jason Ekstrand <jason@jlekstrand.net> Fixes: `b4476138d5` ("freedreno: move drm to common location") Fixes: `aa0fed10d3` ("freedreno: move ir3 to common location") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Signed-off-by: Amit Pundir <amit.pundir@linaro.org> [jstultz: Tweaked to add extra ir3 files from master] Signed-off-by: John Stultz <john.stultz@linaro.org>	2019-05-06 11:29:26 +00:00
Alistair Strachan	0fda3eac31	mesa: android: Remove unnecessary dependency tracking rules The current AOSP master build system breaks building mesa due to the following error: external/mesa3d/src/compiler/Android.glsl.gen.mk:94: error: writing to readonly directory: "external/mesa3d/src/compiler/glsl/ir.h" This error is bogus -- nothing "writes" to ir.h -- but the rule is unnecessary because the generated header that is a dependency of the non-generated header should be added to LOCAL_GENERATED_SOURCES and this will track if the dependency needs to be regenerated. (This change fixes a similar problem affecting nir.h too.) Cc: Rob Clark <robdclark@chromium.org> Cc: Emil Velikov <emil.l.velikov@gmail.com> Cc: Amit Pundir <amit.pundir@linaro.org> Cc: Sumit Semwal <sumit.semwal@linaro.org> Cc: Alistair Strachan <astrachan@google.com> Cc: Greg Hartman <ghartman@google.com> Cc: Tapani Pälli <tapani.palli@intel.com> Cc: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Signed-off-by: Alistair Strachan <astrachan@google.com> [jstultz: Forward ported and tweaked commit subject] Signed-off-by: John Stultz <john.stultz@linaro.org>	2019-05-06 11:29:25 +00:00
Bas Nieuwenhuizen	5692351264	radv: Implement cosited_even sampling. Apparently cosited_even was the required one instead of midpoint. This adds slight offset of 0.5 pixels to the coordinates (+ we need the image size to convert to normalized coords) Fixes: `91702374d5` "radv: Add ycbcr lowering pass." Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-05-06 11:09:30 +00:00
Michel Dänzer	28784e494e	Restore erroneously removed .gitignore entry for "build" directory It was removed in "delete autotools .gitignore files", but the build directory is created by scons. [Skip CI] Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-05-06 12:11:44 +02:00
Bas Nieuwenhuizen	5cbe12ad1b	radv: Disable subsampled formats. Broken on Polaris and since I discovered NV12 is not subsampled, but a 2-plane format I decided I don't really care. Work to do to re-enable: 1) Figure out which devices support it natively. 2) Write some software emulation for the others. Fixes: `52c1adda21` "radv: Add ycbcr format features." Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-05-06 09:53:37 +00:00
Timothy Arceri	1af72fa4d6	util/drirc: add workarounds for bugs in Doom 3: BFG This makes the game playable on radeonsi. Cc: "19.0" "19.1" <mesa-stable@lists.freedesktop.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=110143	2019-05-06 17:32:36 +10:00
Rob Clark	bdd273d873	freedreno: remove unused forward struct declaration Signed-off-by: Rob Clark <robdclark@chromium.org>	2019-05-04 13:59:56 -07:00
Alyssa Rosenzweig	6823873246	panfrost/midgard: iabs cannot run on mul Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-05-04 19:08:51 +00:00
Alyssa Rosenzweig	cdd9189aad	panfrost/midgard: Lower mixed csel (NIR) Basically, when the conditions of a csel diverge, we scalarize to avoid going into weird code paths during emit. We could be doing better, but this case can't occur organically from GLSL as far as I can, though it does fix lowered atan2. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-05-04 19:08:51 +00:00
Alyssa Rosenzweig	58a1e1f86c	panfrost/midgard: Fix RA when temp_count = 0 A previous commit by Tomeu aborted RA early, which solves the memory corruption issue, but then generates an incorrect compile. This fixes that. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-05-04 19:08:51 +00:00
Alyssa Rosenzweig	3d7874c699	panfrost/midgard: Fix integer selection Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-05-04 19:08:51 +00:00
Alyssa Rosenzweig	31f5a43bf0	panfrost: Support RGB565 FBOs Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-05-04 19:08:50 +00:00
Alyssa Rosenzweig	f8c7ffa07a	panfrost/midgard/disasm: Handle dest_override generalized Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-05-04 19:08:50 +00:00
Alyssa Rosenzweig	b6b534c733	panfrost/midgard/disasm: Stub out 64-bit Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-05-04 19:08:50 +00:00
Alyssa Rosenzweig	8c36ecd4b1	panfrost/midgard/disasm: Print 8-bit sources This handles the usual case. 8-bit register access parallels 16-bit access, but with one major caveat: in 8-bit mode, only half of the register file is actually (directly) accessible as sources. In particular, for each 16-bit integer register (hrN), we can only index a single 8-bit integer (qrN), corresponding to the lower 8-bits. To get the upper 8-bits, it is required to do an explicit shift. For example, to add the bytes of a 16-bit integer hr0.x and get the result as an 8-bit qr0, you'd need to do something like: ilsr hr1.x, hr0.x, #8 iadd qr0.x, qr0.x, qr1.x This scheme diverges from 32-bit registers, in that both the upper and lower halves of a 32-bit register are individually accessible as a pair of half registers. For contrast, to add the lower and upper 16-bits of a 32-bit integer r0.x, you can just: iadd hr0.x, hr0.x, hr1.x Since hr1.x = upper 16-bit of r0.x. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-05-04 19:08:50 +00:00
Alyssa Rosenzweig	2800e822a4	panfrost/midgard/disasm: Support 8-bit destination Meanwhile, we're forced to disable dest_override, since it's not yet clear how this interacts with other bitnesses (it'll likely need to be overhauled in any case). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-05-04 19:08:50 +00:00
Alyssa Rosenzweig	d42c37e494	panfrost/midgard: Rename ilzcnt8 -> iclz Per OpenCL. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-05-04 19:08:50 +00:00
Alyssa Rosenzweig	9559280fc3	panfrost/midgard: Fix crash on unknown op Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-05-04 19:08:50 +00:00
Alyssa Rosenzweig	96eed4e04b	panfrost/midgard/disasm: Fill in .int mod Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-05-04 19:08:50 +00:00
Alyssa Rosenzweig	7469df70c8	panfrost/midgard/disasm: Extend print_reg to 8-bit Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-05-04 19:08:50 +00:00
Alyssa Rosenzweig	055f6def30	panfrost/midgard/disasm: Catch mask errors We silently ignored certain bits of the mask, which causes issues when disassembly 8/64-bit ops. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-05-04 19:08:50 +00:00
Alyssa Rosenzweig	576a27fd55	panfrost/midgard: reg_mode_full -> reg_mode_32, etc In preparation for 8-bit and 64-bit operands, let's not reinforce the 32-bit-centric biases in the ISA. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-05-04 19:08:50 +00:00
Rob Clark	2da36dd0b6	freedreno/a6xx: deduplicate a few lines Signed-off-by: Rob Clark <robdclark@chromium.org>	2019-05-04 11:50:44 -07:00
Rob Clark	555ca49d2b	freedreno: add ubwc_enabled helper Since it is dependent on the tile mode (ie. disabled for smaller mipmap levels), we should handle it a similar way to fd_resource_level_linear(). The code previously mostly did the right thing because the old helper took the tile mode. Signed-off-by: Rob Clark <robdclark@chromium.org>	2019-05-04 11:50:44 -07:00
Rob Clark	62c0b02717	freedreno: move UBWC color offset to fd_resource_offset() Best to keep it encapsulated in the helper which returns layer/level offset (and actually use that helper everywhere) rather than spreading the logic around the code. Also add a helper to find UBWC offset, to complete the encapsulation. Signed-off-by: Rob Clark <robdclark@chromium.org>	2019-05-04 11:50:44 -07:00
Rob Clark	a871b5ffaa	freedreno/a6xx: buffer resources cannot be compressed Small cleanup. They are just an array of data and only ever linear/ uncompressed. Signed-off-by: Rob Clark <robdclark@chromium.org>	2019-05-04 11:50:44 -07:00
Rob Clark	05f5122d4a	freedreno: mark imported resources as valid If someone is importing a buffer, we can't really know the state of it's contents, so assume it is valid. Signed-off-by: Rob Clark <robdclark@chromium.org>	2019-05-04 11:50:44 -07:00
Rob Clark	11583dc655	freedreno/a6xx: UBWC support for images There are still some fallbacks we'll need to handle before we can enable UBWC by default. I think we may need to fallback to uncompressed if image atomic operations are used. And we still need to sort out how to handle image and sampler views of compressed resources if the image/ sampler view is using a format that does not support compression. (I think the latter should hopefully be uncommon outside of deqp/piglit.) But at least this gets us to the point where supertuxkart works properly with UBWC enabled ;-) Signed-off-by: Rob Clark <robdclark@chromium.org>	2019-05-04 11:50:44 -07:00
Rob Clark	857d9f3b02	freedreno/a6xx: UBWC fixes A few fixes that get UBWC working for the games/benchmarks where I noticed problems before (in particular and manhattan, and stk (modulo image support for UBWC when compute shaders are used for post-process effects): + fix the size of the UBWC meta buffer (ie, the offset to color pixel data) that is returned by ->fill_ubwc_buffer_sizes() + correct size/layout for 8 and 16 byte per pixel formats + limit the supported formats.. Note all formats that can be tiled can be compressed. Signed-off-by: Rob Clark <robdclark@chromium.org>	2019-05-04 11:50:44 -07:00
Rob Clark	6ffb58726b	freedreno: update generated headers Corrects tex state ubwc pitch/size Signed-off-by: Rob Clark <robdclark@chromium.org>	2019-05-04 11:50:44 -07:00
Rob Clark	fb1488a800	freedreno/a6xx: OUT_RELOC vs OUT_RELOCW fixes Signed-off-by: Rob Clark <robdclark@chromium.org>	2019-05-04 11:50:44 -07:00
Rob Clark	8c97b3c546	freedreno/ir3: remove assert Fixes dEQP-GLES31.functional.ubo.random.all_per_block_buffers.13 and .20 `ca3eb5db66` went from silently truncating the constant state, which was also the wrong thing to do, to an assert. Which then showed up in a couple of dEQPs. Actually there is nothing wrong with larger constant file so just drop the assert. Signed-off-by: Rob Clark <robdclark@chromium.org>	2019-05-04 11:50:44 -07:00
Karol Herbst	7f85283103	spirv/cl: support vload/vstore Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-05-04 12:27:51 +02:00
Karol Herbst	d11b807da5	nir: Add nir_op_vec helper with that we can simplify code where nir vectors are created v2: merge both lines in nir_vec Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-05-04 12:27:51 +02:00

1 2 3 4 5 ...

110773 commits