fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-06 18:08:40 +02:00

Author	SHA1	Message	Date
Kenneth Graunke	9c19d07b1c	anv: Fix wrong printf formatter %lu is for unsigned long, %zu is for size_t. Just cast the data.	2019-06-19 11:57:01 -05:00
Kenneth Graunke	bbbf7a538c	iris: Bail on queries for INTEL_NO_HW=1. We don't execute any of the commands to record snapshots, so we can't actually produce a real result. We do however need to avoid waiting on a syncpt which will never be signalled. So, just return 0.	2019-06-19 11:55:43 -05:00
David Riley	11e74daae5	virgl: Support VIRGL_BIND_SHARED Support a new virgl bind type for shared buffers. Signed-off-by: David Riley <davidriley@chormium.org> Reviewed-By: Gert Wollny <gert.wollny@collabora.com>	2019-06-19 07:28:47 -07:00
Lionel Landwerlin	bc62673dce	anv: write spirv-nir logs back to the application Using the existing VK_EXT_debug_report extension. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-06-19 15:45:52 +03:00
Connor Abbott	53a7649e5d	ac/nir: Set speculatable for buffer loads where allowed This brings the nir path in line with the TGSI path. Totals from affected shaders: SGPRS: 2984 -> 2984 (0.00 %) VGPRS: 2792 -> 2652 (-5.01 %) Spilled SGPRs: 0 -> 0 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 247380 -> 248072 (0.28 %) bytes LDS: 0 -> 0 (0.00 %) blocks Max Waves: 121 -> 132 (9.09 %) Wait states: 0 -> 0 (0.00 %) Most of the change came from DiRT: Showdown, and came from sinking SSBO loads. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2019-06-19 14:08:28 +02:00
Connor Abbott	77be5b2f88	nir: Use reorderable access flag No changes with radeonsi shader-db. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2019-06-19 14:08:28 +02:00
Connor Abbott	a1c737927c	nir: Add a helper to determine if an intrinsic can be reordered This is simple now, but we're going to be adding a few more conditions to this later. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2019-06-19 14:08:28 +02:00
Connor Abbott	6fc83c253f	st/nir: Use gl_nir_opt_access Nothing uses its results yet, that will come with the following commits. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2019-06-19 14:08:28 +02:00
Connor Abbott	f2d0e48ddc	glsl/nir: Add optimization pass for access flags Right now, this just deduces when we can arbitrarily reorder SSBO and image loads, matching the existing logic in radeonsi's TGSI->LLVM pass. This approach can't handle some things that nir_opt_copy_prop_vars can, but it can handle images, and with GCM it lets us hoist reads outside of loops. We can also pass this information to LLVM which lets it do its own optimizations on it. This is GLSL only as I haven't tested it on Vulkan yet, and it would probably need a few changes to work there. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2019-06-19 14:08:28 +02:00
Connor Abbott	c813c5776d	nir: Add reorderable memory access enum Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2019-06-19 14:08:28 +02:00
Connor Abbott	75063fbac5	nir/copy_prop_vars: Ignore volatile accesses The spec explicitly says that volatile writes can't be removed and volatile reads do not guarantee that the same value will still be around after the read, as if there were a barrier after each read/write. Just ignore them. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2019-06-19 14:08:28 +02:00
Connor Abbott	364996d70d	glsl/nir: Propagate access qualifiers We were completely ignoring these before, except for putting them on variables. While we're here, don't set access qualifiers when converting to bindless since glsl_to_nir will already have set a more accurate qualifier that includes any qualifiers on struct members that are dereferenced. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2019-06-19 14:08:27 +02:00
Connor Abbott	6f20643b47	nir: Allow qualifiers on copy_deref and image instructions In the next commit, we'll properly handle access qualifiers on struct members by propagating them to load/store instructions, but these instructions had no way to specify the qualifier. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2019-06-19 14:08:27 +02:00
Connor Abbott	3bf8981c51	ac,radeonsi: Always mark buffer stores as inaccessiblememonly inaccessiblememonly means that it doesn't modify memory accesible via normal LLVM pointers. This lets LLVM's dead store elimination, memcpy forwarding, etc. ignore functions with this attribute. We don't represent descriptors as pointers, so this property is always true of buffer and image stores. There are plans to represent descriptors via pointers, but this just means that now nothing is inaccessiblememonly, as LLVM will then understand loads/stores via its usual alias analysis. Radeonsi was mistakenly only setting it if the driver could prove that there were no reads, and then it was cargo-culted into ac_llvm_build and ac_llvm_to_nir. Rip it out of everything. statistics with nir enabled: Totals from affected shaders: SGPRS: 152 -> 152 (0.00 %) VGPRS: 128 -> 132 (3.12 %) Spilled SGPRs: 0 -> 0 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 9324 -> 9244 (-0.86 %) bytes LDS: 2 -> 2 (0.00 %) blocks Max Waves: 17 -> 17 (0.00 %) Wait states: 0 -> 0 (0.00 %) The only difference was a manhattan31 shader. Acked-by: Timothy Arceri <tarceri@itsqueeze.com> Acked-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-06-19 14:08:27 +02:00
Eric Engestrom	4db2c1e2fe	egl: add missing #include close() is in <unistd.h> Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2019-06-19 12:05:58 +00:00
Samuel Pitoiset	0a313cc285	radv: disable viewport clamping even if FS doesn't write Z This fixes new CTS dEQP-VK.pipeline.depth_range_unrestricted.*. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-06-19 11:18:50 +02:00
Samuel Pitoiset	e91c1ea06c	radv: implement compressed FMASK texture reads with RADV_PERFTEST=tccompatcmask This allows us to disable the FMASK decompress pass when transitioning from CB writes to shader reads. This will likely be improved and enabled by default in the future. No CTS regressions on GFX8 but a few number of multisample CTS failures on GFX9 (they look related to the small hint). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-06-19 10:06:39 +02:00
Samuel Pitoiset	a7f75377ab	radv: fix FMASK expand with SRGB formats Found while working on DCC for MSAA. Fixes: `6b976024a8` ("radv: add support for FMASK expand") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-06-19 07:53:53 +02:00
Tomeu Vizoso	0fcf73bc2d	panfrost: Move to use ralloc for some allocations We have some serious leaks, so plug some and also move to ralloc to limit the lifetime of some objects to that of their parent. Lots more such work to do. For some reason, this fixes: dEQP-GLES2.functional.lifetime.attach.deleted_output.texture_framebuffer Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-19 07:34:15 +02:00
Mathias Fröhlich	5743a36b2b	egl: Don't add hardware device if there is no render node v2. Do not offer a hardware drm backed egl device if no render node is available. The current implementation will fail on this egl device. On top it issues a warning that is actually missleading. There are finally more error paths that can fail on the way to a hardware backed egl device. Fixing all of them would kind of require opening the drm device and see if there is a usable driver associated with the device. The taken approach avoids a full probe and fixes at least this kind of problem on kvm virtualization hosts I observe here. Fixes: `dbb4457d98` ("egl: add EGL_EXT_device_drm support") Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Signed-off-by: Mathias Fröhlich <Mathias.Froehlich@web.de>	2019-06-19 07:17:23 +02:00
Christian Gmeiner	8dd26fa2f0	etnaviv: support GL_ARB_seamless_cubemap_per_texture Passes spec@amd_seamless_cubemap_per_texture@amd_seamless_cubemap_per_texture Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-By: Guido Günther <agx@sigxcpu.org>	2019-06-19 00:39:50 +02:00
Christian Gmeiner	a13efb3cdb	etnaviv: update headers from rnndb Update to etna_viv commit `a3bf0da`. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com>	2019-06-19 00:39:50 +02:00
Dave Airlie	378ea92bf6	radeonsi: fix undefined shift in macro definition Pointed out by coverity Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-06-19 08:32:36 +10:00
Dave Airlie	93ba356544	nouveau: fix frees in unsupported IR error paths. This is pointless in that we won't ever hit those paths in real life, but coverity complains. Fixes: `f014ae3c7c` ("nouveau: add support for nir") Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2019-06-19 08:32:19 +10:00
Rohan Garg	ad284f794c	panfrost: Move clearing logic into pan_job Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-18 12:32:43 -07:00
Chia-I Wu	98eda99ab8	virgl: fix sync issue regarding discard/unsync transfers GL_MAP_INVALIDATE_BUFFER_BIT cannot be treated as GL_MAP_INVALIDATE_RANGE_BIT naively. When we run into ptr = glMapBufferRange(buf, 0, size, GL_WRITE_BIT\|GL_MAP_INVALIDATE_BUFFER_BIT); memcpy(ptr, data1, size); glUnmapBuffer(buf); ptr = glMapBufferRange(buf, size, size, GL_WRITE_BIT\|GL_MAP_UNSYNCHRONIZED_BIT); memcpy(ptr, data2, size); glUnmapBuffer(buf); we never want data1 to be copy_transfer'ed. Because that would mean that data2 might overwrite valid data. Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Alexandros Frantzis alexandros.frantzis@collabora.com Fixes: `a22c5df079` ("virgl: Use buffer copy transfers to avoid waiting when mapping") Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2019-06-18 10:38:21 -07:00
Alyssa Rosenzweig	2a717f300b	panfrost: Enable sRGB Now that sRGB formats are supported for both rendering and sampling, advertise support. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-18 09:59:29 -07:00
Alyssa Rosenzweig	5aa51ba97f	panfrost: Disable AFBC on sRGB buffers The performance impact is slightly mitigated by tiling the render target, but it's undeniably still slow compared to AFBC. Unfortunately, it doesn't look like AFBC and sRGB play nice... Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-18 09:59:29 -07:00
Alyssa Rosenzweig	6585bb9f52	panfrost: Enable sRGB fixed-function blending For fixed-function, we have hardware to handle sRGB so we just set a flag. For blend shaders, it's rather more involved; this is currently unimplemented. Assert it out for now; we don't need it quite yet. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-18 09:59:29 -07:00
Alyssa Rosenzweig	4b137da409	panfrost: Specify sRGB in the render target Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-18 09:59:29 -07:00
Alyssa Rosenzweig	58c34e4a6c	panfrost: Implement sRGB texturing Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-18 09:59:29 -07:00
Alyssa Rosenzweig	31a4ef847c	panfrost: Add sRGB render target flag Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-18 09:59:29 -07:00
Alyssa Rosenzweig	01e1eecb95	panfrost: Implement tiled rendering We already can sample from Mali's linear/tiled encoding (the one from Utgard -- AFBC is mostly unrelated); let's be able to render to it as well. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-18 09:59:29 -07:00
Alyssa Rosenzweig	d50795109b	panfrost: Decode rendering block type A mode for rendering tiled/uncompressed was noticed, so we reshuffle the MFBD render target definitions to explicitly include block type. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-18 09:59:28 -07:00
Alyssa Rosenzweig	83c02a5ea9	panfrost: Refactor texture targets This combines the two cmdstream bits "is_3d" and "is_not_cubemap" into a single 2-bit texture target selection, noticing it's the same as the 2-bit selection in Midgard and Bifrost texturing ops. Accordingly, we share this definition and add the missing entry for 1D/buffer textures. This requires a nontrivial (but functionally similar) refactor of all parts of the driver to use the new definitions appropriately. Theoretically, this should add support for buffer textures, but that's obviously not tested and probably wouldn't work. While doing so, we notice the sRGB enable bit, which we document and decode as well here so we don't forget about it. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-18 09:59:28 -07:00
Rohan Garg	bfca21b622	panfrost: Figure out job requirements in pan_job.c Requirements for a job should be figured out in pan_job.c v2: [Alyssa] Fix early return Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-18 09:52:20 -07:00
Rohan Garg	debb85d1ec	panfrost: Reset job counters once the job is submitted Move the reset out of frame invalidation into job submission Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-18 09:52:20 -07:00
Rohan Garg	0f43a2ae8a	panfrost: Initial implementation of panfrost_job_submit Start fleshing out panfrost_job v2: [Alyssa: Remove unused variable, warning introduced] Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-18 09:52:01 -07:00
Gurchetan Singh	2daf3d8215	virgl_hw: add YUV support Add corresponding entries from p_format.h Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2019-06-18 09:18:58 -07:00
Gurchetan Singh	2480ce802a	virgl: sync to virglrenderer virgl_hw.h It's nice to keep these two files in sync, as they define guest userspace <---> host userspace communcation. Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2019-06-18 09:18:48 -07:00
Jason Ekstrand	58cb865313	anv: Make border colors the right size and alignment on HSW Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-06-18 16:07:08 +00:00
Lionel Landwerlin	51076eb87c	imgui: bump imgui memory editor copy Getting rid of a compiler warning : In file included from ../src/intel/tools/aubinator_viewer.cpp:225: ../src/imgui/imgui_memory_editor.h: In member function ‘void MemoryEditor::DisplayPreviewData(size_t, const u8, size_t, MemoryEditor::DataType, MemoryEditor::DataFormat, char, size_t) const’: ../src/imgui/imgui_memory_editor.h:637:16: warning: enumeration value ‘DataType_COUNT’ not handled in switch [-Wswitch] switch (data_type) ^ Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-06-18 15:34:13 +00:00
Alyssa Rosenzweig	9402970751	panfrost/midgard: Enable autovectorization Enable nir_opt_vectorize. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-18 06:44:13 -07:00
Connor Abbott	47e7c6961a	nir: add a vectorization pass This effectively does the opposite of nir_lower_alus_to_scalar, trying to combine per-component ALU operations with the same sources but different swizzles into one larger ALU operation. It uses a similar model as CSE, where we do a depth-first approach and keep around a hash set of instructions to be combined, but there are a few major differences: 1. For now, we only support entirely per-component ALU operations. 2. Since it's not always guaranteed that we'll be able to combine equivalent instructions, we keep a stack of equivalent instructions around, trying to combine new instructions with instructions on the stack. The pass isn't comprehensive by far; it can't handle operations where some of the sources are per-component and others aren't, and it can't handle phi nodes. But it should handle the more common cases, and it should be reasonably efficient. [Alyssa: Rebase on latest master, updating with respect to typeless moves] Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net>	2019-06-18 06:43:30 -07:00
Boris Brezillon	c3558868da	panfrost: Add support for TXS instructions This patch adds support for nir_texop_txs instructions which are needed to support the OpenGL textureSize() function. This is also needed to support RECT texture sampling which is currently lowered to 2D sampling + a TXS() instruction by the nir_lower_tex() helper. Changes in v2: * Split options for the 1st and 2nd tex lowering passes Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-18 06:36:07 -07:00
Boris Brezillon	5c17f84ae2	panfrost: Prepare things to support non-native texture ops We are about to add support for the TXS (texture size) op which is not implemented using a midgard texture instruction. Let's rename emit_tex() into emit_texop_native() and repurpose emit_tex() as a dispatcher. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-18 06:36:07 -07:00
Boris Brezillon	c57f7d0f15	panfrost: Move sysval upload logic out of panfrost_emit_for_draw() We're about to add more sysval types, and panfrost_emit_for_draw() is big enough, so let's move the sysval upload logic in a separate function. We also add one sub-function per sysval type to keep the panfrost_upload_sysvals() small/readable. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-18 06:36:07 -07:00
Boris Brezillon	bd49c8f0eb	panfrost: Make the sysval logic more generic We are about to add support for nir_texop_txs which requires adding a sysval/uniform containing the texture size. Let's change the emit_sysval_read() prototype to take a nir_instr object instead of a nir_intrinsic_instr one so we can re-use this function when emitting a sysval for a txs instruction. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-18 06:36:07 -07:00
Boris Brezillon	296c5fd25d	nir/lower_tex: Add a way to lower TXS(non-0-LOD) instructions The V3D driver has an open-coded solution for this, and we need the same thing for Panfrost, so let's add a generic way to lower TXS(LOD) into max(TXS(0) >> LOD, 1). Changes in v2: * Use == 0 instead of ! * Rework the minification logic as suggested by Jason * Assign cursor pos at the beginning of the function * Patch the LOD just after retrieving the old value Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-18 06:36:07 -07:00
Boris Brezillon	0e489fd360	nir/lower_tex: Update ->sampler_dim value before calling get_texture_size() get_texture_size() will create a txs instruction with ->sampler_dim set to the original tex->sampler_dim. The condition to call lower_rect() only checks the value of ->sampler_dim and whether lower_rect is requested or not. This leads to an infinite loop when calling nir_lower_tex() with the same options until it returns false. In order to avoid that, let's move the tex->sampler_dim patching before get_texture_size() is called. This way the txs instruction will have ->sampler_dim set to GLSL_SAMPLER_DIM_2D and nir_lower_tex() won't try to lower it on the subsequent passes. Changes in v2: * Add Jason R-b * Add a comment explaining why we patch ->sampler_dim at the beginning of the lower_rect() func Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-18 06:36:07 -07:00

1 2 3 4 5 ...

111970 commits