fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-01-15 11:40:39 +01:00

Author	SHA1	Message	Date
Sagar Ghuge	02244bc515	iris: Pass isl_surf to fill_surface_state Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Suggested-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-20 00:50:45 -07:00
Sagar Ghuge	638a157e02	iris: Add infrastructure to support non coherent framebuffer fetch Create separate SURFACE_STATE for render target read in order to support non coherent framebuffer fetch on broadwell. Also we need to resolve framebuffer in order to support CCS_D. v2: Add outputs_read check (Kenneth Graunke) v3: 1) Import Curro's comment from get_isl_surf 2) Rename get_isl_surf method 3) Clean up allocation in case of failure Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-20 00:50:44 -07:00
Sagar Ghuge	61c0637afb	iris: Add helper functions to get tile offset All helper functions are ported from i965 driver. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-20 00:50:43 -07:00
Sagar Ghuge	7e816991cc	iris: Add helper function to get isl dim layout v2: Add missing space (Caio) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-20 00:50:41 -07:00
Sagar Ghuge	58471e20d2	iris: Add render target read entry in binding table This will be used in next patches for supporting non coherent framebuffer fetch on Broadwell. v2: Fix comment (Kenneth Graunke) v3: 1) Fix a few nits (Caio) 2) Add comment (Caio) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-20 00:50:31 -07:00
Kai Wasserbäch	1abe87383e	build: Bump C++ standard requirement to C++14 to fix FTBFS with LLVM 10 When building Mesa against a recent LLVM 10 with C++11, the build fails if the AMD common code is built as well due to "std::index_sequence" being undeclared. LLVM requires a minimum of C++14. Signed-off-by: Kai Wasserbäch <kai@dev.carbon-project.org> Acked-by: Eric Engestrom <eric@engestrom.ch>	2019-08-20 05:39:19 +00:00
Rob Herring	d0ec5d38f6	panfrost: Add madvise support to BO cache The kernel now supports madvise ioctl to indicate which BOs can be freed when there is memory pressure. Mark BOs purgeable when they are in the BO cache. The BOs must also be munmapped when they are in the cache or they cannot be purged. We could optimize avoiding the madvise ioctl on older kernels once the driver version bump lands, but probably not worth it given the other driver features also being added. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Signed-off-by: Rob Herring <robh@kernel.org>	2019-08-19 19:33:20 -05:00
Rob Herring	c45c2d7960	panfrost: Sync UAPI header from kernel Sync the panfrost_drm.h UAPI header with the latest from the kernel. This adds madvise ioctl and GPU feature params. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Signed-off-by: Rob Herring <robh@kernel.org>	2019-08-19 19:33:20 -05:00
Pierre-Eric Pelloux-Prayer	0f07d18e48	mesa: add ext_dsa GetMultiTexLevelParameterEXT Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-19 18:50:08 -04:00
Pierre-Eric Pelloux-Prayer	e8c5dc9c24	mesa: add EXT_dsa glCompressedMultiTex* functions display list support Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-19 18:50:07 -04:00
Pierre-Eric Pelloux-Prayer	1cb8e12717	mesa: add EXT_dsa glCompressedMultiTex* functions Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-19 18:50:05 -04:00
Pierre-Eric Pelloux-Prayer	a886025ef5	mesa: add EXT_dsa glCompressedTex* functions display list support Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-19 18:50:03 -04:00
Pierre-Eric Pelloux-Prayer	8c76221886	mesa: add EXT_dsa glCompressedTexture(Sub)Image1D/2D/3D functions Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-19 18:49:57 -04:00
Pierre-Eric Pelloux-Prayer	7df233d68d	mesa: refactor compressed_tex_sub_image function Combine compressed_tex_sub_image, compressed_tex_sub_image_error and compressed_tex_sub_image_no_error in a single function. The added "enum tex_mode mode" parameter allows to implement the DSA / non-DSA variants and their error/no_error combination. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-19 18:49:43 -04:00
Bas Nieuwenhuizen	6c5d983865	radv: Add Renoir support. Took the freedom to enable dfsm even though I don't have benchmark results yet, but it seems Raven-like. Rest is from radeonsi. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-08-19 22:34:11 +00:00
Marek Olšák	223b3174bd	radeonsi/nir: always lower ballot masks as 64-bit, codegen handles it This fixes KHR-GL45.shader_ballot_tests.ShaderBallotBitmasks. This solution is better, because the IR isn't dependent on wave32.	2019-08-19 17:23:38 -04:00
Marek Olšák	5d37194d43	radeonsi: remove the unsafemath debug option unlikely to be used in the future Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-19 17:23:38 -04:00
Marek Olšák	5586411de4	radeonsi/nir: fix counting shader inputs & outputs	2019-08-19 17:23:38 -04:00
Marek Olšák	452cb7055f	radeonsi/nir: fix assertion in si_nir_load_sampler_desc	2019-08-19 17:23:38 -04:00
Marek Olšák	1f8a661748	radeonsi: clean up si_llvm_context_set_tgsi Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-19 17:23:38 -04:00
Marek Olšák	43f8b5642b	radeonsi: allocate and resize global_buffers as needed Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-19 17:23:38 -04:00
Marek Olšák	c315cb509d	radeonsi/gfx10: don't set PA_SC_TILE_STEERING_OVERRIDE if CLEAR_STATE sets it Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-19 17:23:38 -04:00
Marek Olšák	5a2e65be89	radeonsi: don't emit PKT3_CONTEXT_CONTROL on amdgpu Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-19 17:23:38 -04:00
Marek Olšák	8d0d753bd0	radeonsi: fix an assertion failure: assert(!res->b.is_shared) This only appears to happen on Raven2. Possible way to reproduce: resource_get_handle(WINSYS_HANDLE_TYPE_KMS) --> sets is_shared = true resource_get_handle(WINSYS_HANDLE_TYPE_DMABUF) --> fail Cc: 19.1 19.2 <mesa-stable@lists.freedesktop.org>	2019-08-19 17:23:38 -04:00
Marek Olšák	bdcbac9459	radeonsi: handle the use_ngg_streamout flag in si_update_ngg	2019-08-19 17:23:38 -04:00
Marek Olšák	a6b3ca1c70	radeonsi: move the tess factor ring size assertion to a place where it matters Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-19 17:23:38 -04:00
Marek Olšák	21217efdfe	ac/nir: set image=true when loading FMASK for images Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-19 17:23:38 -04:00
Christian Gmeiner	f52b9218ff	etnaviv: rs: add support for 64bpp clears Starting with HALTI2 the RS supports 64bpp clears. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Philipp Zabel <philipp.zabel@gmail.com> Reviewed-by: Jonathan Marek <jonathan@marek.ca>	2019-08-19 22:36:45 +02:00
Christian GMEINER	7492685b1b	etnaviv: update headers from rnndb Update to etna_viv commit c51353e. Signed-off-by: Christian GMEINER <christian.GMEINER@bachmann.info> Reviewed-by: Philipp Zabel <philipp.zabel@gmail.com> Reviewed-by: Jonathan Marek <jonathan@marek.ca>	2019-08-19 22:36:45 +02:00
Eric Anholt	1395503424	swrast: Make the fetch funcs table sparse. This shrinks the table, avoids needing to update the table with NULL entries on every MESA_FORMAT addition, and removes a surprising, non-unit-tested format number ordering dependency. Acked-by: Jose Fonseca <jfonseca@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2019-08-19 11:48:03 -07:00
Eric Anholt	c45c33a5a2	gallium: Remove manual defining of PIPE_FORMAT enum values. Now that SVGA doesn't have a table that has to be in PIPE_FORMAT order, we can let the enums have whatever values they naturally would without worrying about holes. Acked-by: Jose Fonseca <jfonseca@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2019-08-19 11:48:01 -07:00
Eric Anholt	84db6ba740	svga: Drop unsupported formats from the format table. Now that we're using the array initializers, we don't need to manually fill out all these stub entries. Produced with "sed -i '/.INVALID.INVALID.*INVALID/d' src/gallium/drivers/svga/svga_format.c" Acked-by: Jose Fonseca <jfonseca@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2019-08-19 11:43:02 -07:00
Eric Anholt	ef37da52c0	svga: Remove duplication in the format table. By using the [ ] = {} array initializer syntax, we no longer need the entries to be listed in PIPE_FORMAT_* value order. This means that people adding new gallium formats don't need to cargo-cult changes to this driver or regress that non-unit-tested requirement. While I'm here, drop the lines for formats that no longer exist (the numbered ones in the table). Acked-by: Jose Fonseca <jfonseca@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2019-08-19 11:42:55 -07:00
Eric Anholt	42efa789b5	svga: Factor out the format conversion table entry lookup. Seemed like a sensible cleanup, while I was looking at whether I could make the table sparse. To make the svga table not require fixups on every new gallium format, we may want to change how it's populated. Acked-by: Jose Fonseca <jfonseca@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2019-08-19 11:42:36 -07:00
Jason Ekstrand	5167e94f23	nir: Add more source types to nir_tex_instr_src_type Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 17:03:34 +00:00
Alyssa Rosenzweig	2bb4dc4054	pan/midgard: Compute liveness per-block Rather than using a regalloc based on live internals, computed hastily with repeated invocations of a forward-analysis pass, we switch to compute liveness information on a per-block basis. Within a given basic block, we compute liveness backwards with a linear-time algorithm; for common shaders, this may help RA terminate quicker. Across blocks, we use a work list (really a work set) and check if we're making progress. This isn't terribly efficient, but it gets the job done. Point is, we get the live_in/live_out for each block. From there, it's simple to rerun the linear-time update algorithm to compute the interference graph. The benefit of this technique is the ability to ignore "gaps" in liveness across intermediate blocks that are never executed. On simple shaders like the loops in glmark, this results in a minor reduction in register pressure. The motivation was a complex shader in Krita that failed register allocation due to an unfortunate interaction between texture pipeline registers and control flow. This shader now compiles successfully. total instructions in shared programs: 3439 -> 3438 (-0.03%) instructions in affected programs: 22 -> 21 (-4.55%) helped: 1 HURT: 0 total bundles in shared programs: 2077 -> 2076 (-0.05%) bundles in affected programs: 12 -> 11 (-8.33%) helped: 1 HURT: 0 total quadwords in shared programs: 3457 -> 3456 (-0.03%) quadwords in affected programs: 20 -> 19 (-5.00%) helped: 1 HURT: 0 total registers in shared programs: 341 -> 338 (-0.88%) registers in affected programs: 9 -> 6 (-33.33%) helped: 3 HURT: 0 helped stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 helped stats (rel) min: 33.33% max: 33.33% x̄: 33.33% x̃: 33.33% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	24c91bb54b	pan/midgard: Analyze load/store for swizzle propagation If there's a nontrivial swizzle fed into an extra (shortened) argument, we bail on copyprop. No glmark changes (since it doesn't use fancy texturing/loads). Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	9ae4d3653e	pan/midgard: Treat cubemaps "stores" as loads It's always been ambiguous which they are, but their primary register is their output, not their input; therefore, they are loads. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	20dd482668	pan/midgard: Clamp cubemap swizzle to XYXX Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	2788721cc4	pan/midgard: Clamp st_vary swizzle by number of components Same issue with liveness analysis. If we store out a vec3, we should not reference the .w component. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	edc8e41566	pan/midgard: Use type-appropriate swizzle for texture coordinate The texture coordinate for a 2D texture could be a vec2 or a vec3, depending if it's an array texture or not. If it's vec2 (non-array texture), we should not reference the z component; otherwise, liveness analysis will get very confused when z is never written. v2: Fix typo (Ilia). Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	2bcb3d9226	pan/midgard: Set mask for lowered read-hazard moves If we need to lower a move for a read from a vec2 texture coordinate, we shouldn't write zw, even incidentally. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	739e09c297	pan/midgard: Fix texw lowering with complex control flow Fixes shaders with control flow like: out = 0; if (A) { if (B) out = texture(A, ...) } else { out = texture(B, ...) } Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	6f1c8c148d	pan/midgard: Add mir_rewrite_index_dst_single helper Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	d68019ad1f	pan/midgard: Print predecessors in MIR Just as a sanity check. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	e3a418fe86	pan/midgard: Index blocks for printing Better than having pointers flying about. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	2f92479ffc	pan/midgard: Add mir_foreach_src This is repeated often enough. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	84580c6dbc	pan/midgard: Add mir_foreach_instr_in_block_rev Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	c8c4471a92	pan/midgard: Add mir_foreach_successor helper Now we should be able to walk the control-flow graph naturally. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00
Alyssa Rosenzweig	b8e526c520	pan/midgard: Add mir_foreach_predecessor utility It's ugly, but c'est la vie. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-19 08:32:17 -07:00

... 17 18 19 20 21 ...

115447 commits