fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-23 22:00:13 +01:00

Author	SHA1	Message	Date
Francisco Jerez	e549e4f6c0	intel/fs/gen12: Fix Render Target Read header setup for new thread payload layout. In Gen12 the Poly 0 Info DWORD containing the Viewport Index and Render Target Index fields were moved from r0.0 to r1.1 in order to make room for dual-polygon dispatch. The render target message format was updated to expect that information in the same location, so we didn't need to make any changes for framebuffer fetch to work with SIMD8 and SIMD16 dispatch. Unfortunately that won't work with SIMD32, since the render target message header is assembled from r0 and r2 instead of r1, and the r2 thread payload wasn't updated with an additional copy of the same information. We need to fix things up manually instead. This avoids a handful of EXT_shader_framebuffer_fetch regressions in combination with SIMD32 fragment shaders. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2020-04-28 23:00:29 -07:00
Francisco Jerez	72324035fb	intel/fs/gen12: Work around dual-source blending hangs in combination with SIMD32. This applies the same work-around I commited as `b84fa0b31e` "intel/fs/gen11: Work around dual-source blending hangs in combination with SIMD32." to Gen12, which seems to suffer from the same hardware bug found empirically. The failure mode seems to be identical. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2020-04-28 23:00:28 -07:00
Francisco Jerez	d6ae079771	intel/fs/gen12: Fix hangs with per-sample SIMD32 fragment shader dispatch. The Gen12 docs are rather contradictory regarding the dispatch configurations supported by the fragment shader -- The same table present in previous generations seems to imply that only one dispatch mode can be enabled when doing per-sample shading, but a restriction documented in the 3DSTATE_PS_BODY page implies the opposite: That SIMD32 can only be used in combination with some other dispatch mode. The latter seems to match the behavior of real hardware as I could tell from my testing: A bunch of multisample test-cases that do per-sample shading hang if we only provide a SIMD32 shader. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2020-04-28 23:00:28 -07:00
Dylan Baker	35ee6b3d36	mesa: Follow OpenGL conversion rules for values that exceed storage size Section 2.2.2 (Data Conversions For State Query Commands) of the OpenGL 4.5 spec says: Following these steps, if a value is so large in magnitude that it cannot be represented by the returned data type, then the nearest value representable using that type is returned. The current code doesn't do the correct thing, because it truncates a long (potentially a 64bit values) to an int. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2828 Fixes: `53c36dfcfe` ("replace IROUND with util functions") Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4673>	2020-04-29 04:26:41 +00:00
Alyssa Rosenzweig	76c5688018	pan/bit: Add BITWISE test Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4790>	2020-04-29 00:30:05 +00:00
Alyssa Rosenzweig	844c3f94b5	pan/bit: Interpret BI_BITWISE No shifting yet. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4790>	2020-04-29 00:30:05 +00:00
Alyssa Rosenzweig	a077da6273	pan/bi: Handle iand/ior/ixor in NIR->BIR Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4790>	2020-04-29 00:30:05 +00:00
Alyssa Rosenzweig	ef9582738e	pan/bi: Pack BI_BITWISE Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4790>	2020-04-29 00:30:05 +00:00
Alyssa Rosenzweig	9b415bf6a0	pan/bi: Add bitwise modifiers Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4790>	2020-04-29 00:30:05 +00:00
Rob Clark	6de01faac5	freedreno/a6xx: invalidate tex state cache entries on rebind When a resource's backing bo changes, its seqno will be incremented. Which would result in a new tex state cache key, and nothing to clean up the old tex state until the sampler view/state is destroyed. But in some games, that may never happen, or at least not happen before we run out of memory. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2830 Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4744>	2020-04-29 00:08:57 +00:00
Rob Clark	ca05e6b04d	freedreno: rebind_resource() before bo changes This will matter in the next patch, where we need the original rsc->seqno. It means slight shuffling of where we call rebind_resource() in the `fd_try_shadow_resource()` path. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4744>	2020-04-29 00:08:57 +00:00
Rob Clark	d9e56d8a69	freedreno: rebind resource in all contexts If the resource is rebound, we need to invalidate in all contexts. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4744>	2020-04-29 00:08:57 +00:00
Rob Clark	f12188ff52	freedreno: optimize rebind_resource() Track how resources are used, ie. which state they may potentially dirty if the backing bo is changed/reallocated, to optimize rebind_resource(). This will be more important in a later patch when we hook up eviction of entries in a6xx tex state cache. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4744>	2020-04-29 00:08:57 +00:00
Rob Clark	1e18c58047	freedreno: mark more state dirty when rebinding resources Plus a bonus typo fix. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4744>	2020-04-29 00:08:57 +00:00
Rob Clark	bf97cc9221	freedreno: don't realloc idle bo's The `DISCARD_WHOLE_RESOURCE` is just a hint. And `rebind_resource()` is a bunch of faffing about (and going to get worse in a later patch), so let's not bother when the bo is already idle. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4744>	2020-04-29 00:08:57 +00:00
Rob Clark	938b6ed645	freedreno: small whitespace fix Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4744>	2020-04-29 00:08:57 +00:00
Jan Zielinski	a93b728bc6	gallium/swr: Fix crashes and failures in vertex fetch This commit fixes two problems: - In some cases SWR does not correctly report to Gallium which formats are supported. - Incorrect LLVM instructions are used in vertex fetch in some situations Reviewed-by: Krzysztof Raszkowski <krzysztof.raszkowski@intel.com> Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4788>	2020-04-28 23:53:08 +00:00
Rob Clark	de0d3d1726	freedreno/log-parser: support to read gzip'd logs ~50MB gzip'd log files are nicer than ~300MB uncompressed Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4750>	2020-04-28 23:31:58 +00:00
Rob Clark	f561e516c8	freedreno/a6xx: pre-calculate expected vsc stream sizes We should only rely on overflow detection for indirect draws, where we have no other option. This doesn't use quite the worst-possible-case sizes, which in practice seem to be ~20x larger than what is required. But instead uses roughly half of that. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4750>	2020-04-28 23:31:58 +00:00
Rob Clark	99d802ccc7	freedreno: add helper to estimate # of bins per pipe For vsc size calculation, we need to know the # of bins per pipe. Or at least the worst-case # of bins, assuming we don't eliminate an unused depth/ stencil buffer. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4750>	2020-04-28 23:31:58 +00:00
Rob Clark	a9c255d70c	freedreno/a6xx+tu: rename VSC_DATA/VSC_DATA2 These are the draw-stream and primitive-stream, so lets give them more descriptive names. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4750>	2020-04-28 23:31:58 +00:00
Rhys Perry	3ee3ad561a	aco: fix vgpr nir_op_vecn with sgpr operands Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4772>	2020-04-28 23:16:55 +00:00
Rhys Perry	c5eda3c746	aco: improve clamped integer addition disassembly workaround Make it work with 16-bit and GFX10. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4772>	2020-04-28 23:16:55 +00:00
Rhys Perry	4ed83e2f94	aco: add various GFX10 int16 opcodes Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4772>	2020-04-28 23:16:55 +00:00
Rhys Perry	43f2ba39ef	aco: fix sub-dword overwrite check in RA validator Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4772>	2020-04-28 23:16:55 +00:00
Rhys Perry	cca8d6ce06	aco: fix sub-dword out-of-bounds check in RA validator Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4772>	2020-04-28 23:16:55 +00:00
Rhys Perry	307aca83a2	aco: add missing adjust_max_used_regs() Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4772>	2020-04-28 23:16:55 +00:00
Rhys Perry	99ca96fbf5	aco: improve RA for uneven p_split_vector Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4772>	2020-04-28 23:16:55 +00:00
Rhys Perry	24116a8a56	aco: don't recurse in sub-dword get_reg_simple() Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4772>	2020-04-28 23:16:55 +00:00
Rhys Perry	09c584caeb	aco: split self-intersecting copies instead of swapping Example situation: v1 = {v0.hi, v1.lo} v0.hi = v1.hi The 4-byte copy's definition is completely used, but swapping it makes no sense. We have to split it to generate correct code: swap(v0.hi, v1.lo) swap(v0.hi, v1.hi) Found in dEQP-VK.spirv_assembly.type.vec3.i16.constant_composite_vert Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4772>	2020-04-28 23:16:55 +00:00
Rhys Perry	be4a34966c	aco: fix neighboring register check in get_reg_simple() Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4772>	2020-04-28 23:16:55 +00:00
Rhys Perry	fb59ed6bb9	aco: check alignment of non-subdword registers in get_reg_specified() When splitting a v6b vector into v1 and v2b components, we should ensure the v1 definition doesn't start at the upper half. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4772>	2020-04-28 23:16:55 +00:00
Rhys Perry	916cc3e231	aco: make RegisterFile::block() take a regclass Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4772>	2020-04-28 23:16:55 +00:00
Jason Ekstrand	b43366497b	anv: Claim VK_EXT_robustness2 support Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4767>	2020-04-28 22:55:25 +00:00
Jason Ekstrand	b07d26be65	anv: Handle null vertex buffer bindings Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4767>	2020-04-28 22:55:25 +00:00
Jason Ekstrand	fd817291c7	anv: Handle NULL descriptors Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4767>	2020-04-28 22:55:25 +00:00
Jason Ekstrand	ac581a06a4	nir/combine_stores: Handle volatile Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4767>	2020-04-28 22:55:25 +00:00
Jason Ekstrand	cb9292091b	nir/dead_write_vars: Handle volatile We can't remove volatile writes and we can't combine them with other volatile writes so all we can do is clear the unused bits. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4767>	2020-04-28 22:55:25 +00:00
Jason Ekstrand	ed67717167	nir/copy_prop_vars: Report progress when deleting self-copies Fixes: `62332d139c` "nir: Add a local variable-based copy prop..." Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4767>	2020-04-28 22:55:25 +00:00
Jason Ekstrand	d9af5277b3	nir/copy_prop_vars: Handle volatile better For deref_store, we can still delete invalid stores that write to statically OOB data. For everything, we need to make sure that we kill aliases of destinations even if it's volatile. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4767>	2020-04-28 22:55:25 +00:00
Jason Ekstrand	118f045fb7	vulkan: Update Vulkan XML and headers to 1.2.139 Acked-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4767>	2020-04-28 22:55:25 +00:00
Jason Ekstrand	76d2772472	anv: Allow all clear colors for texturing on Gen11+ Starting with Gen11, we have two indirect clear colors: An unconverted float/int version which is us used for rendering and a converted pixel value version which is used for texturing. Because the one used for texturing is stored as a single pixel of that color, it works no matter what format is being used. Because it's a simple HW indirect and doesn't involve copying surface states around, we can use it in the sampler without having to worry about surface states having out-of-date clear values. The result is that we can now allow any clear color when texturing. This cuts the number of resolves in a RenderDoc trace of Dota2 by 95% on Gen11+ (you read that right) and improves perf by 3.5%. It improves perf in a handful of other workloads by < 1%. Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4393>	2020-04-28 22:45:39 +00:00
Jason Ekstrand	e63c662c26	anv: Use anv_layout_to_aux_usage for color during render passes Previously, we tried to treat color image layouts as a special case during render passes. This is largely an artifact of history as our initial understanding of Vulkan placed much more emphasis on render passes than our current understanding. The only real practical use for magic layouts in the middle of a render pass, as far as I can tell, is to allow more clear colors to get passed through to input attachments. However, most apps aren't very creative with their clear colors and very few of them (none coming from DXVK) actually use render passes in any interesting way. Therefore, the risk of being able to pass fewer clear colors through to input attachments should be minimal. There are, however, three very big advantages to this change: 1. We are now consistent in our handling of aux usage and layouts between color and depth/stencil. 2. We are now actually following the layout guidelines from the app and aren't nearly as likely to see strange behavior due to us overriding the image layouts manually. 3. It's more obviously correct. While I think our old render pass code was probably correct, it was full of corner cases and it's very possible that it was behaving badly in weird ways. This follows the Vulkan API much more blindly and, as such, is more likely to be correct and behave the same as other implementations. Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4393>	2020-04-28 22:45:39 +00:00
Jason Ekstrand	30016f6e82	anv: Split color_attachment_compute_aux_usage in two In particular, we split out an anv_can_fast_clear_color_view helper which only cares about fast-clear and not aux_usage itself. Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4393>	2020-04-28 22:45:39 +00:00
Jason Ekstrand	3fe45a9b6c	anv: Rework depth_stencil_attachment_compute_aux_usage Instead of making it a function that pretends to choose aux usage (which isn't what it does at all), make it a function which returns whether or not we want to do a fast clear. This is far more accurate to the purpose of the function. Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4393>	2020-04-28 22:45:39 +00:00
Jason Ekstrand	26e6da90ab	anv: Refactor cmd_buffer_setup_attachments This commit just renames some things so that we use names for temporary variables which are more consistent with other places in the code-base. Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4393>	2020-04-28 22:45:39 +00:00
Jason Ekstrand	36a74835df	anv: Stop allowing non-zero clear colors in input attachments Previously, we bent over backwards to allow non-zero clear colors input attachments whenever we could. However, very few apps use input attachments and very few use non-zero clear colors. Getting rid of support for non-zero clear colors input attachments will allow us to treat them identically to textures which should help us simplify things a good bit. Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4393>	2020-04-28 22:45:39 +00:00
Jason Ekstrand	bf92e96d9c	anv: Disallow fast-clears which require format-reinterpretation In order to actually hit this case you have to be using a very odd color/view combination. The common cases of clear-to-zero and 0/1 clear colors with an sRGB view don't require any re-interpretation. This is probably better than always resolving whenever we have a format mismatch like we are today because that hits the sRGB case every time. Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4393>	2020-04-28 22:45:39 +00:00
Jason Ekstrand	20e72e435c	intel: Move swizzle_color_value from blorp to ISL Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4393>	2020-04-28 22:45:39 +00:00
Jason Ekstrand	814dc66935	anv: Allocate surface states per-subpass Instead of allocating surface states for attachments in BeginRenderPass, we now allocate them in begin_subpass. Also, since we're zeroing things, we can be a bit cleaner about or implementation and just fill out all those passes for which we have allocated surface states. Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4393>	2020-04-28 22:45:39 +00:00

... 36 37 38 39 40 ...

124869 commits