fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-24 21:50:12 +01:00

Author	SHA1	Message	Date
Erik Faye-Lund	55f6a2bb51	gallium: normalized_coords -> unnormalized_coords A lot of code zero-initializes pipe_sampler_state, and sets the states the non-zero fields manually. This means that normalized_coords is the "default" setting. However, setting normalized_coords to true isn't allways allowed, and we'd need to check PIPE_CAP_TEXRECT first. So it's not really the ideal default here. There's recently been found quite a bit of bugs in this area, where the state-tracker didn't properly lower texrects. Let's switch this around to avoid more bugs like this in the future. Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18971>	2022-10-10 10:20:02 +00:00
Tapani Pälli	1cf1a94f97	intel: revert preemption disable via VFG changes This register will not be whitelisted and this change will be done in kernel instead. This change reverts commits `d5d4604a`, `ddcd6b38`, `27c5b93d`. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18897>	2022-10-04 10:38:49 +00:00
Tapani Pälli	58829d9f11	iris: implement Wa_14016118574 After each 3DPRIMITIVE, we need to send a dummy post sync op if point or line list was used or if had only 1 or 2 vertices per primitive. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18746>	2022-09-23 12:27:05 +00:00
Jason Ekstrand	3417a0c4a2	iris: Support up to 128 textures This is required for OpenCL. I kind-of hate this patch. I really don't like GROUP_TEXTURE_LOW64 and GROUP_TEXTURE_HIGH64 but it was either that or I had to make all the used bitsets 128 which would have mean making them BITSET and that would have been a lot more churn. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16442>	2022-09-22 09:50:23 +00:00
Jason Ekstrand	06a0de492a	iris: Support up to 64 images Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16442>	2022-09-22 09:50:23 +00:00
Jason Ekstrand	c4ff82d958	iris: Split max #defines for textures/samplers/images Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16442>	2022-09-22 09:50:23 +00:00
Jason Ekstrand	c9c8134d76	iris: Stop looking at textures_used for samplers Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16442>	2022-09-22 09:50:23 +00:00
Illia Polishchuk	74658b01d2	driconf/Intel: Add lower_depth_range_rate option workaround for Homerun Clash misrendering issue Intel has different Z interpolation float point rounding than other mesa gpus For example gl_Position.z = 0.0 will be interpolated to gl_FragCoord.z = 0.5 for all gpus gl_FragCoord = -0.00000001 will be interpolated to gl_FragCoord.z = 0.4999999702 for Intel and rounded to gl_FragCoord.z = 0.5 for other gpus Games with LEQUAL depth func will fail depth test on Intel and will pass it on other gpus in such case This workaround lowers translated depth range and several gl_FragCoord.z coords with extra small difference will be translated to the same UINT16\UINT24\UINT32 value of an integer depth buffer Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7199 Signed-off-by: Illia Polishchuk <illia.a.polishchuk@globallogic.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18412>	2022-09-19 10:08:48 +00:00
Tapani Pälli	27c5b93d37	iris: disable preemption on VFG, Wa_14015207028 for DG2 This workaround disables batch level preemption for Polygon, Trifan and Lineloop primitive topologies. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18456>	2022-09-14 10:01:23 +00:00
Tapani Pälli	e37f534d7f	iris: implement Wa_14015946265 for DG2 SOL unit issues, wa is to send PC with CS stall after SO_DECL. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18409>	2022-09-07 04:38:05 +00:00
Jason Ekstrand	c52d5acf15	util,intel: Pull the bit packing helpers from genxml to a common header Reviewed-by: Eric Engestrom <eric@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18062>	2022-08-30 04:28:34 +00:00
Kenneth Graunke	fe0152e216	iris: Pass devinfo to iris_resource_level_has_hiz() This will let us enforce 8x4 alignment rules differently based on the specific hardware generation in question. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4674>	2022-08-17 01:20:25 +00:00
Sagar Ghuge	50802f96a8	iris: Handle new untyped dataport cache flush PIPE_CONTROL field Also while switching to GPGPU pipeline, make sure to flush the untyped dataport cache. HDC pipeline flush bit must be set if we are flushing untyped dataport L1 data cache. v2: Add utrace support (Lionel) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16905>	2022-08-05 10:44:22 +03:00
Sagar Ghuge	8aead60434	iris: Specify Untyped L1 cache policy for stateless accesses Set write back L1 cache policy in STATE_BASE_ADDRESS instruction for A64 messages. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Suggested-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16905>	2022-08-05 10:43:50 +03:00
Nanley Chery	6875e07538	iris: Dedent enum iris_depth_reg_mode Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17859>	2022-08-03 15:31:10 +00:00
Nanley Chery	a75cd15b94	iris: Make the D16 reg mode single-sampled Wa_14010455700 is dependent on the format and sample count, but our code to track whether or not it had been applied was only dependent on the format. As a result, we failed to enable the workaround when an app used a D16 2xMSAA buffer, then a D16 1xMSAA buffer right afterwards. Make the workaround tracking code sample-dependent to fix this. Cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17859>	2022-08-03 15:31:10 +00:00
Mykhailo Skorokhodov	6498328210	iris: Move Wa_1806527549 and enable by default Move Wa_1806527549 into `iris_init_render_context` and set HIZ_CHICKEN (7018h) bit = 1 by default for TGL. Cc: mesa-stable Signed-off-by: Mykhailo Skorokhodov <mykhailo.skorokhodov@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17778>	2022-08-02 16:33:10 +03:00
Nanley Chery	bec82bb436	iris: Use fill_surface_states for compressed resources In iris_create_surface, use the fill_surface_states helper function instead of an open-coded solution for compressed resources. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17598>	2022-07-22 16:33:37 +00:00
Nanley Chery	6c65e990b6	iris: Don't leak compressed resources in iris_create_surface Before this patch, we were leaking compressed resources in iris_create_surface. Specifically, when we failed to create an uncompressed ISL surface and view for a compressed resource, we didn't unreference the resource pointer we referenced into the pipe_surface. Fix this by delaying the pipe_surface initialization code to after attempting to create the uncompressed surface and view. Cc: 22.1 <mesa-stable> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17598>	2022-07-22 16:33:36 +00:00
Nanley Chery	bca601ffe9	iris: Don't leak surface states for compressed resources Before this patch, we were leaking surface states in iris_create_surface. Specifically, when we failed to create an uncompressed ISL surface and view for a compressed resource, we didn't free surface states we allocated for it. Fix this by attempting to create the uncompressed surface and view before we allocate the surface states. Cc: 22.1 <mesa-stable> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17598>	2022-07-22 16:33:36 +00:00
Lionel Landwerlin	2d1f021e16	intel/fs: Set NonPerspectiveBarycentricEnable when the interpolator needs it. [anholt: changed to make all drivers do the right thing by moving the payload barycentric check into the compiler] Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17381>	2022-07-19 01:25:47 +00:00
Chuansheng Liu	39f8c61f32	iris,anv: correct the max thread number for DG2+ Correct the max thread number for DG2+ platforms according to below bspec. Ref: Bspec: 47202 Cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Chuansheng Liu <chuansheng.liu@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17506>	2022-07-13 08:11:19 +00:00
Kenneth Graunke	0ce9d7b7c9	iris: Use PIPE_* defines rather than ones from main/config.h Gallium drivers shouldn't be including src/mesa/main headers, but we're picking up a rogue main/config.h via the compiler, so this code I ported over from i965 kept compiling. Use the PIPE_* defines instead so that we can stop including that. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17309>	2022-06-30 23:46:35 +00:00
Erik Faye-Lund	8376fb0f33	iris: do not do STATIC_ASSERT on variables Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16670>	2022-06-03 07:14:43 +00:00
Jason Ekstrand	dfedeccc13	intel: Only set VectorMaskEnable when needed For cases with lots of very small primitives, this may improve performance because we're not executing those dead channels all the time. Shader-db reports no instruction or cycle-count changes. However, by hacking up the driver to report when this optimization triggers, it appears to affect about 10% of shader-db. v2 (Kenneth Graunke): Always enable VMask prior to XeHP for now, because using VMask on those platforms allows us to perform the eliminate_find_live_channel() optimization. However, XeHP doesn't seem to have packed fragment shader dispatch, so we lose that optimization regardless, and there's no reason not to avoid vmask. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/1054>	2022-05-27 21:52:48 +00:00
Kenneth Graunke	27314718a3	intel: Drop Wa_1409226450 (stall before instruction cache invalidation) Production Tigerlake and DG1 hardware shouldn't need this workaround. It was only needed on the very first steppings which never went public. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16575>	2022-05-19 21:31:45 +00:00
Lionel Landwerlin	1c077ca9c0	u_trace/anv/iris: drop cs argument for recording traces Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16605>	2022-05-19 19:04:28 +00:00
Jason Ekstrand	62f0677223	iris: Set BindingTableEntryCount for compute shaders This may slightly increase perf somewhere because the hardware can now pre-cache binding tables. The real feature is that INTEL_DEBUG=bat now dumps out surface states for compute. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15759>	2022-05-11 23:47:08 +00:00
Karol Herbst	d98b82a103	iris/cs: take buffer offsets into account for CL Sadly we pass in an offset, which the driver can't ignore Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16348>	2022-05-10 03:37:44 +00:00
Rohan Garg	581035b3a9	iris: set a default EDSC flag anv sets the default EDSC flag, do the same for iris too Fixes: `5ae278da18` ("iris: use vtbl to avoid multiple symbols, fix state base address") Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15905>	2022-04-13 12:36:01 +00:00
Kenneth Graunke	a969ad1ddf	iris: Demote DC flush to HDC flush in cache tracker FLUSH_HDC is sufficient to flush things out to L3, so we'd rather use that where possible. It's also emulated via DATA_CACHE_FLUSH on platforms where it isn't supported, so we can use it unconditionally. We still use DATA_CACHE_FLUSH for invalidating the data cache, and to flush the DC-tagged cachelines in L3 to be globally-observable. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Kenneth Graunke	1c8b4940eb	iris: Emit flushes for push constant source buffers Push constant loading is not coherent with L3 according to the document that describes the hardware change for the vertex buffer L3 Bypass Disable field. If we've updated a push constant buffer with say, a blorp_buffer_copy, we may need to flush both the render cache and the tile cache. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Kenneth Graunke	bbd5714a7e	iris: Use cache-tracker for draw count flushing We should be using the cache tracker for this. We can consider this access IRIS_DOMAIN_OTHER_READ now that it's the catch-all non-L3-coherent read-only access domain. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Kenneth Graunke	43e3747eea	iris: Extend the cache tracker to handle L3 flushes and invalidates Most clients are L3-coherent these days. However, there are some notable exceptions, such as push constants, stream output, and command streamer memory reads and writes. With the advent of the tile cache, flushing the render or depth caches alone are no longer sufficient for memory to become globally-observable. For those, we need to flush the tile cache as well. However, we'd like to avoid that for L3-coherent clients, as it shouldn't be necessary, and is expensive. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Kenneth Graunke	8cd7e94eca	iris: Add a separate PIPE_CONTROL_L3_READ_ONLY_CACHE_INVALIDATE bit This will let us use it without performing a VF cache invalidation, should we want to do that. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Kenneth Graunke	536eee31d0	iris: Fix UBO cache tracking for the !indirect_ubos_use_sampler case On Tigerlake, we use the data cache for reading indirect UBOs instead of the sampler. But we still use the constant cache for direct UBO access, so unfortunately we may access it through two different domains. To work around this, we add a new domain for pull constants (UBOs), which will be either constant+texture or constant+data. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Kenneth Graunke	d39bd7ba70	iris: Split out an IRIS_DOMAIN_SAMPLER_READ domain from OTHER_READ The bulk of IRIS_DOMAIN_OTHER_READ domain usage was the 3D sampler, but there were also a few oddball cases like command streamer reads, blitter access, and so on. The sampler is definitely L3 coherent, but some off the more esoteric reads may not be, so I'd like to separate them, so that OTHER_READ can become a non-L3-coherent kitchen-sink domain. The sampler cases only need TEXTURE_CACHE_INVALIDATE, and can skip the CONSTANT_CACHE_INVALIDATE we had on IRIS_DOMAIN_OTHER_READ. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Kenneth Graunke	8e0ff0275d	iris: Use IRIS_DOMAIN_DEPTH_WRITE for read only depth/stencil. We were using IRIS_DOMAIN_OTHER_READ for read-only depth/stencil access in an attempt to avoid unnecessary flushing; IRIS_DOMAIN_DEPTH_WRITE could indicate read-write access. However, IRIS_DOMAIN_OTHER_READ is clearly the wrong domain. Depth and stencil data is read via the depth cache, while IRIS_DOMAIN_OTHER_READ currently corresponds to the sampler cache and constant cache together (although this will change in future patches). It's unclear whether this hack was useful. For now, just drop it and use the correct depth cache domain, even if it's marked as read-write. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Jason Ekstrand	cde1be0b84	iris: Handle range tracking for global bindings The moment something is bound globally, the whole thing is valid. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15777>	2022-04-06 20:16:55 +00:00
Jason Ekstrand	187923c2eb	iris: Account for BO offsets in iris_set_global_binding() Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15777>	2022-04-06 20:16:55 +00:00
Francisco Jerez	6cc09699cd	iris: Remove remaining history flushes. This removes a couple of remaining history flushes which were open-coded instead of using the iris_flush_and_dirty_for_history() helper. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15738>	2022-04-04 10:32:31 -07:00
Mike Blumenkrantz	e219351457	iris: assert that samplerview base_array_layer is zero for hw < skl this codepath is broken for older hardware Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15610>	2022-03-29 22:12:30 +00:00
Anuj Phogat	5cc4075f95	anv, iris: Add Wa_16011411144 for DG2 v2: Use CS_STALL instead of FLUSH_ENABLE in Iris (Lionel) Add missing CS_STALL after SO_BUFFER change in Anv (Lionel) Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> (v1) Reviewed-by: Francisco Jerez <currojerez@riseup.net> Cc: 22.0 <mesa-stable> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14947>	2022-03-17 14:18:02 +00:00
Jason Ekstrand	12d815bcac	intel/guardband: Take min/max instead of total size Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 13:13:45 -05:00
Kenneth Graunke	8b9045e7a4	intel: Use 3DSTATE_BINDING_TABLE_POOL_ALLOC exclusively on Gfx11+ On Icelake and later, we can use a new 3DSTATE_BINDING_TABLE_POOL_ALLOC command to update the location of the binder (buffer containing binding table entries), rather than having to move Surface State Base Address via a STATE_BASE_ADDRESS command. This has less stalling and also means our surface addresses can remain relative to a fixed 4GB address range, meaning we don't have to re-stream them any time the binder changes. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14507>	2022-03-09 09:18:59 +00:00
Kenneth Graunke	e3a0e97300	intel: Limit Wa_1607854226 to Gfx12.0 only This workaround is needed on all Gfx12.0 parts, but doesn't appear to be necessary on XeHP. The other drivers do not appear to be applying this workaround on those parts. As further evidence, we accidentally added the 3DSTATE_BINDING_TABLE_POOL_ALLOC commands after switching back to GPGPU mode, which would be an incorrect way to implement the workaround, and things seem to be working. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14507>	2022-03-09 09:18:59 +00:00
Kenneth Graunke	ab47cad4fb	iris: Rename surface_base_address to binder_address in a few places On Gfx11+, we're going to stop changing Surface State Base Address and instead start changing the Binding Table Pool address instead. So, rename a few things to track the last binder address, which is what we're actually changing, regardless of how we program it. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14507>	2022-03-09 09:18:59 +00:00
Kenneth Graunke	db34c71513	iris: Use more efficient binding table pointer formats on Icelake+. Skylake and older use a 15:5 binding table pointer format, which means our binder can be at most 64kB in size. Each binding table within the binder must be aligned to 32B. XeHP uses a new 20:5 binding table format, which allows us to increase the binder size to 1MB while retaining the nice 32B alignment. Larger binders mean fewer stalls as we update the base address for the binder. Icelake and Tigerlake can either use the 15:5 format or an 18:8 format. 18:8 mode requires the base of each binding table to be aligned to 256B instead of 32B, but it gives us a maximum binder size of 512kB. We can store 64 binding table entries in a 256B chunk (256B / 4B = 64), but only 8 entries in a 32B chunk (32B / 4B = 8). Assuming that most binding tables have fewer than 64 entries, this means that with the 18:8 format, we're likely to be able to fit 2048 (512KB / 256B) tables into a a buffer before needing to allocate a new one and stall. Technically, the old format could also store 2048 binding tables per buffer as well (64KB / 32B = 2048). However, tables that needed more than 8 entries would need multiple 32B chunks. A single table would take multiple aligned chunks, while with the larger 256B format, it could fit in a single one. This cuts binder resets by 6.3% on a Shadow of Mordor benchmark trace. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14507>	2022-03-09 09:18:59 +00:00
Kenneth Graunke	e6b7e74308	iris: Set MI_FLUSH_DW::PostSyncOperation correctly The MI_FLUSH_DW post-sync operation uses the same encoding as the PIPE_CONTROL one so we can use the same helper. Write PS Depth Count is not supported, of course, as the blitter has no depth pipeline. This means that we can write the timestamp register from the blitter. Fixes: `604d97671b` ("iris: Add support for flushing the blitter (hackily)") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15157>	2022-02-24 21:42:16 +00:00
Paulo Zanoni	d10fd5b7c9	iris: fix register spilling on compute shaders on XeHP XeHP scratch space is handled differently. Commit `ae18e1e707` implemented support for it, but handled it differently between render and compute shaders: it calculates scratch_addr differently and doesn't pin the buffer on compute. Make it work on compute shaders by calling pin_scratch_space() from iris_compute_walker(), which fixes both the address and the pinning. This commit can be verified by the two-year-old-but-still-unreviewed Piglit MR 234. You can also verify this by running a very simple compute shader with INTEL_DEBUG=spill_fs. References: https://gitlab.freedesktop.org/mesa/piglit/-/merge_requests/234 Fixes: `ae18e1e707` ("iris: Add support for scratch on XeHP") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15070>	2022-02-22 22:16:57 +00:00

1 2 3 4 5 ...

833 commits