fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-03-04 06:00:31 +01:00

Author	SHA1	Message	Date
Marek Olšák	c315cb509d	radeonsi/gfx10: don't set PA_SC_TILE_STEERING_OVERRIDE if CLEAR_STATE sets it Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-19 17:23:38 -04:00
Marek Olšák	5a2e65be89	radeonsi: don't emit PKT3_CONTEXT_CONTROL on amdgpu Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-19 17:23:38 -04:00
Marek Olšák	8d0d753bd0	radeonsi: fix an assertion failure: assert(!res->b.is_shared) This only appears to happen on Raven2. Possible way to reproduce: resource_get_handle(WINSYS_HANDLE_TYPE_KMS) --> sets is_shared = true resource_get_handle(WINSYS_HANDLE_TYPE_DMABUF) --> fail Cc: 19.1 19.2 <mesa-stable@lists.freedesktop.org>	2019-08-19 17:23:38 -04:00
Marek Olšák	bdcbac9459	radeonsi: handle the use_ngg_streamout flag in si_update_ngg	2019-08-19 17:23:38 -04:00
Marek Olšák	a6b3ca1c70	radeonsi: move the tess factor ring size assertion to a place where it matters Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-19 17:23:38 -04:00
Marek Olšák	21217efdfe	ac/nir: set image=true when loading FMASK for images Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-19 17:23:38 -04:00
Christian Gmeiner	f52b9218ff	etnaviv: rs: add support for 64bpp clears Starting with HALTI2 the RS supports 64bpp clears. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Philipp Zabel <philipp.zabel@gmail.com> Reviewed-by: Jonathan Marek <jonathan@marek.ca>	2019-08-19 22:36:45 +02:00
Christian GMEINER	7492685b1b	etnaviv: update headers from rnndb Update to etna_viv commit c51353e. Signed-off-by: Christian GMEINER <christian.GMEINER@bachmann.info> Reviewed-by: Philipp Zabel <philipp.zabel@gmail.com> Reviewed-by: Jonathan Marek <jonathan@marek.ca>	2019-08-19 22:36:45 +02:00
Eric Anholt	1395503424	swrast: Make the fetch funcs table sparse. This shrinks the table, avoids needing to update the table with NULL entries on every MESA_FORMAT addition, and removes a surprising, non-unit-tested format number ordering dependency. Acked-by: Jose Fonseca <jfonseca@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2019-08-19 11:48:03 -07:00
Eric Anholt	c45c33a5a2	gallium: Remove manual defining of PIPE_FORMAT enum values. Now that SVGA doesn't have a table that has to be in PIPE_FORMAT order, we can let the enums have whatever values they naturally would without worrying about holes. Acked-by: Jose Fonseca <jfonseca@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2019-08-19 11:48:01 -07:00
Eric Anholt	84db6ba740	svga: Drop unsupported formats from the format table. Now that we're using the array initializers, we don't need to manually fill out all these stub entries. Produced with "sed -i '/.INVALID.INVALID.*INVALID/d' src/gallium/drivers/svga/svga_format.c" Acked-by: Jose Fonseca <jfonseca@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2019-08-19 11:43:02 -07:00
Eric Anholt	ef37da52c0	svga: Remove duplication in the format table. By using the [ ] = {} array initializer syntax, we no longer need the entries to be listed in PIPE_FORMAT_* value order. This means that people adding new gallium formats don't need to cargo-cult changes to this driver or regress that non-unit-tested requirement. While I'm here, drop the lines for formats that no longer exist (the numbered ones in the table). Acked-by: Jose Fonseca <jfonseca@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2019-08-19 11:42:55 -07:00
Eric Anholt	42efa789b5	svga: Factor out the format conversion table entry lookup. Seemed like a sensible cleanup, while I was looking at whether I could make the table sparse. To make the svga table not require fixups on every new gallium format, we may want to change how it's populated. Acked-by: Jose Fonseca <jfonseca@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2019-08-19 11:42:36 -07:00
Jason Ekstrand	5167e94f23	nir: Add more source types to nir_tex_instr_src_type Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 17:03:34 +00:00
Alyssa Rosenzweig	2bb4dc4054	pan/midgard: Compute liveness per-block Rather than using a regalloc based on live internals, computed hastily with repeated invocations of a forward-analysis pass, we switch to compute liveness information on a per-block basis. Within a given basic block, we compute liveness backwards with a linear-time algorithm; for common shaders, this may help RA terminate quicker. Across blocks, we use a work list (really a work set) and check if we're making progress. This isn't terribly efficient, but it gets the job done. Point is, we get the live_in/live_out for each block. From there, it's simple to rerun the linear-time update algorithm to compute the interference graph. The benefit of this technique is the ability to ignore "gaps" in liveness across intermediate blocks that are never executed. On simple shaders like the loops in glmark, this results in a minor reduction in register pressure. The motivation was a complex shader in Krita that failed register allocation due to an unfortunate interaction between texture pipeline registers and control flow. This shader now compiles successfully. total instructions in shared programs: 3439 -> 3438 (-0.03%) instructions in affected programs: 22 -> 21 (-4.55%) helped: 1 HURT: 0 total bundles in shared programs: 2077 -> 2076 (-0.05%) bundles in affected programs: 12 -> 11 (-8.33%) helped: 1 HURT: 0 total quadwords in shared programs: 3457 -> 3456 (-0.03%) quadwords in affected programs: 20 -> 19 (-5.00%) helped: 1 HURT: 0 total registers in shared programs: 341 -> 338 (-0.88%) registers in affected programs: 9 -> 6 (-33.33%) helped: 3 HURT: 0 helped stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 helped stats (rel) min: 33.33% max: 33.33% x̄: 33.33% x̃: 33.33% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	24c91bb54b	pan/midgard: Analyze load/store for swizzle propagation If there's a nontrivial swizzle fed into an extra (shortened) argument, we bail on copyprop. No glmark changes (since it doesn't use fancy texturing/loads). Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	9ae4d3653e	pan/midgard: Treat cubemaps "stores" as loads It's always been ambiguous which they are, but their primary register is their output, not their input; therefore, they are loads. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	20dd482668	pan/midgard: Clamp cubemap swizzle to XYXX Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	2788721cc4	pan/midgard: Clamp st_vary swizzle by number of components Same issue with liveness analysis. If we store out a vec3, we should not reference the .w component. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	edc8e41566	pan/midgard: Use type-appropriate swizzle for texture coordinate The texture coordinate for a 2D texture could be a vec2 or a vec3, depending if it's an array texture or not. If it's vec2 (non-array texture), we should not reference the z component; otherwise, liveness analysis will get very confused when z is never written. v2: Fix typo (Ilia). Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	2bcb3d9226	pan/midgard: Set mask for lowered read-hazard moves If we need to lower a move for a read from a vec2 texture coordinate, we shouldn't write zw, even incidentally. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	739e09c297	pan/midgard: Fix texw lowering with complex control flow Fixes shaders with control flow like: out = 0; if (A) { if (B) out = texture(A, ...) } else { out = texture(B, ...) } Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	6f1c8c148d	pan/midgard: Add mir_rewrite_index_dst_single helper Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	d68019ad1f	pan/midgard: Print predecessors in MIR Just as a sanity check. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	e3a418fe86	pan/midgard: Index blocks for printing Better than having pointers flying about. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	2f92479ffc	pan/midgard: Add mir_foreach_src This is repeated often enough. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	84580c6dbc	pan/midgard: Add mir_foreach_instr_in_block_rev Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	c8c4471a92	pan/midgard: Add mir_foreach_successor helper Now we should be able to walk the control-flow graph naturally. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	b8e526c520	pan/midgard: Add mir_foreach_predecessor utility It's ugly, but c'est la vie. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	b4b2e111f8	pan/midgard: Link exit block The exit block has been 'dangling' in the successors graph, so let's ensure it's linked in. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	07c960cac0	pan/midgard: Add mir_exit_block helper The exit block is gauranteed to be empty, signaling the end of the program. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	aeeeef1242	pan/midgard: Maintain block predecessor set While we already compute the successors array, for backwards data flow analysis, it is useful to walk the control flow graph backwards based on predecessors, so let's compute that information as well. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	4fa09329c1	pan/midgard: Use ralloc on ctx/blocks This will allow us to get some level of automatic memory management. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	b59b1793b8	pan/midgard: Shrink successors[] to 2 length A block can't have more. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Roman Stratiienko	fdd6151612	nir: Add missing dependency in Android.nir.gen.mk Fixes incremental build with Android Signed-off-by: Roman Stratiienko <roman.stratiienko@globallogic.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2019-08-19 09:53:18 +03:00
Connor Abbott	c550d367a7	ac/nir: Fix store_scratch with a non-full writemask By adding one more helper to ac_llvm_build, we can also easily keep vector stores together. Fixes the tests/spec/glsl-1.30/execution/fs-large-local-array-vec4.shader_test piglit test. Fixes: `74470baebb` ("ac/nir: Lower large indirect variables to scratch") Reviewed-by: Marek Olšák <marek.olsak@amd.com	2019-08-18 15:15:45 +02:00
Vasily Khoruzhick	0e394cda0d	glsl/standalone: init shader stage in init_gl_program() Otherwise lima standalone compiler fails when trying to compile fragment shader with: lima_compiler: ../src/compiler/nir/nir.c:55: nir_shader_create: Assertion `si->stage == stage' failed Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-08-17 11:14:40 -07:00
Jason Ekstrand	16edd02bfa	iris: Only request an input mask if the shader needs it Fixes: `aebca3961b` "iris: Fix handling of SIMD32 fragment shaders" Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-16 19:59:42 -05:00
Xiong, James	dcad15ff54	gallium: add back YVU support PIPE_FORMAT_YV12 is not handled so switching to PIPE_FORMAT_IYUV and adding back YVU support. Signed-off-by: James Xiong <james.xiong@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-16 13:24:49 -07:00
Erico Nunes	7a51abab42	lima: actually wait for bo in lima_bo_wait PIPE_TIMEOUT_INFINITE is unsigned and gets assigned to signed fields where it ends up as -1. When this reaches the kernel as a timeout it gets translated as no timeout, which cause the waiting functions to return immediately and not actually wait for a completion. This seems to cause unstable results with lima where even piglit tests randomly fail. Handle this by setting the signed max value in case of infinite timeout. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com>	2019-08-16 16:31:29 +02:00
Rhys Perry	0a790c3019	nir/algebraic: add a few masking-before-unpack optimizations Helps some Dawn of War 3 and F1 2017 shaders with ACO: Totals from affected shaders: SGPRS: 2136 -> 2128 (-0.37 %) VGPRS: 1624 -> 1628 (0.25 %) Spilled SGPRs: 0 -> 0 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 168068 -> 164332 (-2.22 %) bytes LDS: 44 -> 44 (0.00 %) blocks Max Waves: 222 -> 221 (-0.45 %) Wait states: 0 -> 0 (0.00 %) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-08-16 12:13:01 +01:00
Vasily Khoruzhick	861c2b8d31	lima: fix compilation of standalone compiler Fixes: e0aeee946004("lima: add summary report for shader-db") Reviewed-by: Qiang Yu <yuq825@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-08-15 16:59:51 -07:00
Bas Nieuwenhuizen	b9fb90e6d3	Revert "radv/gfx10: Enable DCC for storage images." Quite useless without DCC for LAYOUT_GENERAL. Fixes: `b4dad3afaa` Revert "radv: Do not decompress on LAYOUT_GENERAL." Acked-by: Dave Airlie <airlied@redhat.com>	2019-08-16 01:22:54 +02:00
Bas Nieuwenhuizen	b4dad3afaa	Revert "radv: Do not decompress on LAYOUT_GENERAL." Causes issues with a bunch of games with DXVK. Fixes: `50add1b33a` "radv: Do not decompress on LAYOUT_GENERAL." Acked-by: Dave Airlie <airlied@redhat.com>	2019-08-16 01:22:35 +02:00
Dave Airlie	f3af7886fe	mesa: add support for CET to x86/x86-64 asm files. Control-flow enforcement technology is a new instructions on x86 processors to denote where indirect jumps can land. Gcc auto adds the instruction (which encodes as a NOP on older CPUs) to entrypoints but assembler files need manual adding. This adds it to all the entry points in the mesa x86/x86-64 assembler files. This will only happen if mesa is built with the -fcf-protection flag to gcc as some distros are wanting to do. Acked-by: Eric Anholt <eric@anholt.net>	2019-08-16 09:00:35 +10:00
Alyssa Rosenzweig	78eda70892	pan/bifrost: Manually constant fold register class Fixes errors for some people building Mesa: ../src/panfrost/bifrost/bifrost_sched.c:32:31: error: initializer element is not constant const unsigned max_vec2_reg = max_primary_reg / 2; ../src/panfrost/bifrost/bifrost_sched.c:33:31: error: initializer element is not constant const unsigned max_vec3_reg = max_primary_reg / 4; // XXX: Do we need to align vec3 to vec4 boundary? ../src/panfrost/bifrost/bifrost_sched.c:34:31: error: initializer element is not constant const unsigned max_vec4_reg = max_primary_reg / 4; ../src/panfrost/bifrost/bifrost_sched.c:35:32: error: initializer element is not constant const unsigned max_registers = max_primary_reg + ../src/panfrost/bifrost/bifrost_sched.c:40:28: error: initializer element is not constant const unsigned vec2_base = primary_base + max_primary_reg; ../src/panfrost/bifrost/bifrost_sched.c:41:28: error: initializer element is not constant const unsigned vec3_base = vec2_base + max_vec2_reg; ../src/panfrost/bifrost/bifrost_sched.c:42:28: error: initializer element is not constant const unsigned vec4_base = vec3_base + max_vec3_reg; ../src/panfrost/bifrost/bifrost_sched.c:43:27: error: initializer element is not constant const unsigned vec4_end = vec4_base + max_vec4_reg; Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-15 19:06:35 +00:00
Erik Faye-Lund	18ab42644b	gallium/util: widen type before multiplication This method returns size_t, but the multiplication multiplies two integers, leading to overflow rather than type widening. Noticed by compiling with MSVC, which emits a warning. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-08-15 20:23:53 +02:00
Erik Faye-Lund	0091f62978	mesa: avoid warning on Windows On Windows, p_atomic_inc_return returns an unsigned long long rather than the type the pointer refers to, so let's make sure we cast the result to the right type. Otherwise, we'll trigger a warning about the wrong format-string for the type. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-08-15 20:23:49 +02:00
Erik Faye-Lund	544b088616	win32: unify strcasecmp definitions There was two incompatible definitions of strcasecmp, which lead to a compiler warning. Let's clean this up by only leaving one of them, and using that one all the time. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-08-15 20:23:44 +02:00
Erik Faye-Lund	ecd312be96	mesa/main: avoid warning when casting offset to pointer This generates a warning on some 64-bit systems, so let's cast to a properly sized integer first. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-08-15 20:23:39 +02:00

1 2 3 4 5 ...

105754 commits