fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-21 15:28:18 +02:00

Author	SHA1	Message	Date
Zhang, Jianxun	bc42bbff4c	iris: Wa_14016820455 for GFX_VERx10 == 12.5 Reprogram SF CLIP viewport pointer by not skipping its dirty flag bit. Many thanks to Lin, Shuicheng <shuicheng.lin@intel.com>, Jerez Plata, Francisco <francisco.jerez.plata@intel.com>, Graunke, Kenneth W <kenneth.w.graunke@intel.com>, and others for their great help. Signed-off-by: Zhang, Jianxun <jianxun.zhang@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17171>	2022-06-22 22:22:50 +00:00
Jordan Justen	eaf2a35a76	iris/bufmgr: Use memory info from devinfo Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17075>	2022-06-22 00:30:49 +00:00
Mark Janes	7b74747854	iris: provide a callback to INTEL_MEASURE to clean up snapshots Snapshots are processed asynchronously by INTEL_MEASURE, but snapshot memory is allocated and associated with an iris batch. Provide a callback that will free snapshot memory after a batch is fully processed. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16571>	2022-06-16 02:58:08 +00:00
Jordan Justen	81d6ae31d6	anv, iris: Enable compute engine with INTEL_COMPUTE_CLASS=1 If this environment variable is set, then a detected compute engine will be used as described in docs/envvars.rst. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14395>	2022-06-15 08:58:20 +00:00
Jordan Justen	0c90c695f5	anv, iris: Add support for I915_ENGINE_CLASS_COMPUTE Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14395>	2022-06-15 08:58:20 +00:00
Emma Anholt	fa118be9ae	iris: Enable PIPE_CAP_LEGACY_MATH_RULES. Now that TTN hooks this up to use_legacy_math_rules, we can flip the switch and gallium nine can get the desired behavior from the hardware instead of emitting math workarounds. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16176>	2022-06-10 03:26:33 +00:00
Emma Anholt	cf265c6606	nir: Rename is_arb_asm to use_legacy_math_rules and document its meaning. On iris and crocus, this flag is used to set "alt mode" math on the shader as a whole. Some other drivers have a similar mode for DX9/ARB-program behavior, so document what it does so we can start using it. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16176>	2022-06-10 03:26:32 +00:00
Marek Olšák	ad8f9d5d58	gallium: rename PIPE_CAP_MAX_SHADER_BUFFER_SIZE -> *_UINT to imply the maximum of 4GB - 1. Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16881>	2022-06-07 00:17:58 -04:00
Marek Olšák	fd6b8999d7	gallium: rename PIPE_CAP_MAX_TEXTURE_BUFFER_SIZE->MAX_TEXEL_BUFFER_ELEMENTS_UINT to allow exposing 4G - 1. The "SIZE" was also a misnomer because it meant elements. This no longer clamps the size to INT_MAX in st/mesa. Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16881>	2022-06-07 00:17:58 -04:00
Marek Olšák	406cf871b2	gallium: rename PIPE_SHADER_CAP_MAX_CONST_BUFFER_SIZE to _BUFFER0_ UBOs will use a larger limit. Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16881>	2022-06-07 00:17:57 -04:00
Emma Anholt	8c4b88ee48	gallium+glsl: Remove EmitNoSat/PIPE_CAP_VERTEX_SHADER_SATURATE The drivers not setting it were: - nv30, which gets lowering using NIR's lower_fsat flag. - r300, which gets lowering using NIR's lower_fsat flag. - a2xx, which has was getting it optimized back to fsat anyway. This drops the check for the cap from gallium nine. While nine does have a non-nir path, I think it's safe to assume that if you have SM3 texturing, you can do fsat. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16823>	2022-06-07 02:38:42 +00:00
Nagappa Koppad, Basanagouda	a99e85db9e	iris:Duplicate DRM fd internally instead of reuse. Scenario we want to avoid is double close of DRM fd in iris driver. Signed-off-by: Nagappa Koppad, Basanagouda <basanagouda.nagappa.koppad@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6620 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16886>	2022-06-06 20:04:28 +00:00
Timothy Arceri	26ff49038c	gallium: remove PIPE_SHADER_CAP_MAX_UNROLL_ITERATIONS_HINT CAP This is used for the old, buggy and slow GLSL IR loop unrolling code. All drivers have now switched to the NIR unrolling code so here we remove the CAP. Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16366>	2022-06-04 16:11:49 +00:00
Erik Faye-Lund	8376fb0f33	iris: do not do STATIC_ASSERT on variables Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16670>	2022-06-03 07:14:43 +00:00
Jason Ekstrand	dfedeccc13	intel: Only set VectorMaskEnable when needed For cases with lots of very small primitives, this may improve performance because we're not executing those dead channels all the time. Shader-db reports no instruction or cycle-count changes. However, by hacking up the driver to report when this optimization triggers, it appears to affect about 10% of shader-db. v2 (Kenneth Graunke): Always enable VMask prior to XeHP for now, because using VMask on those platforms allows us to perform the eliminate_find_live_channel() optimization. However, XeHP doesn't seem to have packed fragment shader dispatch, so we lose that optimization regardless, and there's no reason not to avoid vmask. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/1054>	2022-05-27 21:52:48 +00:00
Kenneth Graunke	27314718a3	intel: Drop Wa_1409226450 (stall before instruction cache invalidation) Production Tigerlake and DG1 hardware shouldn't need this workaround. It was only needed on the very first steppings which never went public. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16575>	2022-05-19 21:31:45 +00:00
Lionel Landwerlin	1c077ca9c0	u_trace/anv/iris: drop cs argument for recording traces Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16605>	2022-05-19 19:04:28 +00:00
Kenneth Graunke	b8799a499e	iris: Add FLUSH_HDC to PIPE_CONTROL_CACHE_FLUSH_BITS This is considered a bottom-of-pipe flush bit. Fixes: `a969ad1ddf` ("iris: Demote DC flush to HDC flush in cache tracker") Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16565>	2022-05-17 22:35:06 +00:00
Lionel Landwerlin	66045acdf9	intel/perf: add max vfuncs New counters will use those from inside their read function to generate percentage numbers. v2: Forgot to update Iris (Lionel) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> (v1) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16144>	2022-05-17 19:55:10 +00:00
Vadym Shovkoplias	55c71217ec	driconf: Add a limit_trig_input_range option With this option enabled range of input values for fsin and fcos is limited to [-2pi : 2pi] by calculating the reminder after 2*pi modulo division. This helps to improve calculation precision for large input arguments on Intel. -v2: Add limit_trig_input_range option to prog_key to update shader cache (Lionel) Signed-off-by: Vadym Shovkoplias <vadym.shovkoplias@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16388>	2022-05-13 06:47:53 +00:00
Jason Ekstrand	62f0677223	iris: Set BindingTableEntryCount for compute shaders This may slightly increase perf somewhere because the hardware can now pre-cache binding tables. The real feature is that INTEL_DEBUG=bat now dumps out surface states for compute. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15759>	2022-05-11 23:47:08 +00:00
Jason Ekstrand	3c07c3e16d	shader_info: Make images_used a bitset Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15988>	2022-05-10 11:23:11 -05:00
Karol Herbst	d98b82a103	iris/cs: take buffer offsets into account for CL Sadly we pass in an offset, which the driver can't ignore Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16348>	2022-05-10 03:37:44 +00:00
Emma Anholt	e9b491f9b5	gallium: Remove now-unused shader caps. The only interesting ones here were LOWER_IF_THRESHOLD (which previously had connected to some lowering in GLSL that was broken in the face of side effects), and FMA (which turned GLSL IR's fma() into TGSI_OPCODE_FMA instead of MAD). Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8044>	2022-05-05 22:25:03 +00:00
Lionel Landwerlin	acf6bf88c0	iris: use new kernel uAPI to compute video memory v2: Use os_get_available_system_memory() when kernel memory region uAPI is not available (Lionel) Cc: 22.1 <mesa-stable> Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16210>	2022-05-02 22:57:06 +00:00
Jordan Justen	33456ae5a4	iris: Fix assertion meant to only target the clear-color stride Fixes: `2bc8c61fd0` ("iris: Return a 64B stride for clear color plane") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6398 Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16241>	2022-04-29 09:34:56 -07:00
Nanley Chery	b023f18bad	isl,iris: Add DG2 CCS modifier support for XeHP Cc: 22.1 <mesa-stable> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14521>	2022-04-28 20:02:14 +00:00
Anuj Phogat	ac441d0953	isl,iris: Add I915_FORMAT_MOD_4_TILED support for XeHP This patch adds Tile 4 modifier support to Mesa and allows Mesa to use Tile 4 on gen12-hp with GBM. Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Cc: 22.1 <mesa-stable> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14521>	2022-04-28 20:02:14 +00:00
Nanley Chery	2bc8c61fd0	iris: Return a 64B stride for clear color plane Although modifiers which use a clear color plane specify that the plane's pitch should be ignored, some kernels have been found to require 64-byte alignment. Cc: mesa-stable Fixes: `db475c81b7` ("iris: Return non-zero stride for clear color plane") Reported-by: Dongwon Kim <dongwon.kim@intel.com> Suggested-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14521>	2022-04-28 20:02:14 +00:00
David Heidelberg	c1e59bea05	ci: intel: Merge anv and iris into src/intel/ci This commit make simple adding tests which use both GL(ES) and VK. Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16048>	2022-04-27 12:35:13 +00:00
Paulo Zanoni	3532c374de	iris: fix race condition during busy tracking The Iris code that deals with implicit tracking is protected by bufmgr->bo_deps_lock. Before this patch, we hold this lock during update_batch_syncobjs() but don't keep it held until we actually submit the batch in the execbuf ioctl. This can lead to the following race condition: - Context C1 generates a batch B1 that signals syncobj S1. - Context C2 generates a batch B2 that depends on something that B1 from C1 is using, so we mark B2 as having to wait syncobj S1. - C2 calls submit_batch() before C1 does it. - The Kernel detects it was told to wait on syncobj S1 that was never even submitted, so it returns EINVAL to the execbuf ioctl. - We run abort() at the end of _iris_batch_flush(). - If DEBUG is defined, we also print: iris: Failed to submit batchbuffer: Invalid argument I couldn't figure out a way to reproduce this issue with real workloads, but I was able to write a small reproducer to trigger this. Basically it's a little GL program that has lots of contexts running in different threads submitting compute shaders that keep using the same SSBOs. I'll submit this as a piglit test. Edit: Tapani found a dEQP test case which fails intermintently without this fix, so I'm not sure a new Piglit is worth it now. The solution itself is quite simple: just keep bo_deps_lock held all the way from update_batch_syncobjs() until ioctl(). In order to make that easier we just call update_batch_syncobjs() a little later. We have to drop the lock as soon as the ioctl returns because removing the references on the buffers would trigger other functions to try to grab the lock again, leading to deadlocks. Thanks to Kenneth Graunke for pointing out this issue. This has also been confirmed to fix a dEQP test that was giving intermittent failures: dEQP-EGL.functional.sharing.gles2.multithread.random.images.copyteximage2d.12 v2: Move decode_batch() out, just to be safe (Jason). v3: Do it all after assembling validation_list (Ken). Cc: mesa-stable Fixes: `89a34cb845` ("iris: switch to explicit busy tracking") Tested-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14964>	2022-04-21 22:51:25 +00:00
Lionel Landwerlin	2ab57e056d	ci/iris: mark another test as flaky Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16032>	2022-04-19 14:27:26 +00:00
Erik Faye-Lund	7ca1253932	gallium: rename ldexp shader-cap This is no longer TGSI specific, so let's rename it to reflect reality. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15922>	2022-04-18 20:43:18 +00:00
Erik Faye-Lund	439c212a3c	gallium: rename dfracexp/dldexp shader-cap This is no longer TGSI specific, so let's rename it to reflect reality. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15922>	2022-04-18 20:43:18 +00:00
Erik Faye-Lund	3efd6d4bfe	gallium: rename dround shader-cap This is no longer TGSI specific, so let's rename it to reflect reality. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15922>	2022-04-18 20:43:18 +00:00
Erik Faye-Lund	9b545ea691	gallium: rename continue shader-cap This is no longer TGSI specific, so let's rename it to reflect reality. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15922>	2022-04-18 20:43:18 +00:00
Jason Ekstrand	c8df09ebd4	iris: More gracefully fail in resource_from_user_memory rusticl (and clover) would like to get a graceful fail here so they can fall back to a shadow copy instead of us asserting. We also start rejecting arrayed surface because isl doesn't allow selecting a QPitch yet. Even if it did, QPitch is horribly restrictive, even for linear surfaces, that it likely wouldn't be that useful. Fixes: `e81f3edf76` ("iris: Allow userptr on 1D and 2D images") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15903>	2022-04-13 19:18:54 +00:00
Jason Ekstrand	6ca328988f	iris: Don't leak scratch BOs Fixes: `4d219b0eb3` ("iris: implement scratch space!") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15897>	2022-04-13 15:56:50 +00:00
Rohan Garg	581035b3a9	iris: set a default EDSC flag anv sets the default EDSC flag, do the same for iris too Fixes: `5ae278da18` ("iris: use vtbl to avoid multiple symbols, fix state base address") Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15905>	2022-04-13 12:36:01 +00:00
Kenneth Graunke	b7111f89e8	iris: Add VF_CACHE_INVALIDATE to IRIS_DOMAIN_OTHER_WRITE flush bits Suggested by Francisco Jerez. Although including VF invalidation in the flush bits is strange, we believe this is the only way to guarantee that stream output has finished. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Kenneth Graunke	a969ad1ddf	iris: Demote DC flush to HDC flush in cache tracker FLUSH_HDC is sufficient to flush things out to L3, so we'd rather use that where possible. It's also emulated via DATA_CACHE_FLUSH on platforms where it isn't supported, so we can use it unconditionally. We still use DATA_CACHE_FLUSH for invalidating the data cache, and to flush the DC-tagged cachelines in L3 to be globally-observable. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Kenneth Graunke	1c8b4940eb	iris: Emit flushes for push constant source buffers Push constant loading is not coherent with L3 according to the document that describes the hardware change for the vertex buffer L3 Bypass Disable field. If we've updated a push constant buffer with say, a blorp_buffer_copy, we may need to flush both the render cache and the tile cache. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Kenneth Graunke	bbd5714a7e	iris: Use cache-tracker for draw count flushing We should be using the cache tracker for this. We can consider this access IRIS_DOMAIN_OTHER_READ now that it's the catch-all non-L3-coherent read-only access domain. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Kenneth Graunke	9c8874b9ab	iris: Add pre-draw flushing for stream output targets When stream output is active, we need to let the cache tracker know about any SO buffers, which we access via IRIS_DOMAIN_OTHER_WRITE. In particular, we may have written to those buffers via another mechanism, such as BLORP buffer copies. In that case, previous writes happened via IRIS_DOMAIN_RENDER_WRITE, in which case we'd need to flush both the render cache and the tile cache to make that data globally- observable before we begin writing via streamout, which is incoherent with the earlier mechanism. Fixes misrendering in Ryujinx. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6085 Fixes: `d8cb76211c` ("iris: Fix MOCS for buffer copies") Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Kenneth Graunke	43e3747eea	iris: Extend the cache tracker to handle L3 flushes and invalidates Most clients are L3-coherent these days. However, there are some notable exceptions, such as push constants, stream output, and command streamer memory reads and writes. With the advent of the tile cache, flushing the render or depth caches alone are no longer sufficient for memory to become globally-observable. For those, we need to flush the tile cache as well. However, we'd like to avoid that for L3-coherent clients, as it shouldn't be necessary, and is expensive. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Kenneth Graunke	8cd7e94eca	iris: Add a separate PIPE_CONTROL_L3_READ_ONLY_CACHE_INVALIDATE bit This will let us use it without performing a VF cache invalidation, should we want to do that. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Kenneth Graunke	b92cd58508	iris: Add an iris_is_domain_l3_coherent helper. The render, depth, sampler, and data (HDC) caches are all coherent with L3. We consider OTHER_READ and OTHER_WRITE to be non-coherent, as they're kitchen-sink domains which include non-L3-clients. Starting with Tigerlake, the VF cache is coherent with L3 (because we set the L3BypassDisable bit in the vertex/index buffer packets). Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Kenneth Graunke	536eee31d0	iris: Fix UBO cache tracking for the !indirect_ubos_use_sampler case On Tigerlake, we use the data cache for reading indirect UBOs instead of the sampler. But we still use the constant cache for direct UBO access, so unfortunately we may access it through two different domains. To work around this, we add a new domain for pull constants (UBOs), which will be either constant+texture or constant+data. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Kenneth Graunke	d39bd7ba70	iris: Split out an IRIS_DOMAIN_SAMPLER_READ domain from OTHER_READ The bulk of IRIS_DOMAIN_OTHER_READ domain usage was the 3D sampler, but there were also a few oddball cases like command streamer reads, blitter access, and so on. The sampler is definitely L3 coherent, but some off the more esoteric reads may not be, so I'd like to separate them, so that OTHER_READ can become a non-L3-coherent kitchen-sink domain. The sampler cases only need TEXTURE_CACHE_INVALIDATE, and can skip the CONSTANT_CACHE_INVALIDATE we had on IRIS_DOMAIN_OTHER_READ. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Kenneth Graunke	8e0ff0275d	iris: Use IRIS_DOMAIN_DEPTH_WRITE for read only depth/stencil. We were using IRIS_DOMAIN_OTHER_READ for read-only depth/stencil access in an attempt to avoid unnecessary flushing; IRIS_DOMAIN_DEPTH_WRITE could indicate read-write access. However, IRIS_DOMAIN_OTHER_READ is clearly the wrong domain. Depth and stencil data is read via the depth cache, while IRIS_DOMAIN_OTHER_READ currently corresponds to the sampler cache and constant cache together (although this will change in future patches). It's unclear whether this hack was useful. For now, just drop it and use the correct depth cache domain, even if it's marked as read-write. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00

1 2 3 4 5 ...

2291 commits