fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-22 22:10:10 +01:00

Author	SHA1	Message	Date
Anuj Phogat	9c421d6b47	iris/icl: Add WA_2204188704 to disable pixel shader panic dispatch Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-03-28 19:59:59 +00:00
Anuj Phogat	e0f4359ec1	iris/icl: Set Enabled Texel Offset Precision Fix bit h/w specification requires this bit to be always set. See Mesa commit `5eb173304b`. Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-03-28 19:59:59 +00:00
Danylo Piliaiev	c8abe03f3b	i965,iris,anv: Make alpha to coverage work with sample mask From "Alpha Coverage" section of SKL PRM Volume 7: "If Pixel Shader outputs oMask, AlphaToCoverage is disabled in hardware, regardless of the state setting for this feature." From OpenGL spec 4.6, "15.2 Shader Execution": "The built-in integer array gl_SampleMask can be used to change the sample coverage for a fragment from within the shader." From OpenGL spec 4.6, "17.3.1 Alpha To Coverage": "If SAMPLE_ALPHA_TO_COVERAGE is enabled, a temporary coverage value is generated where each bit is determined by the alpha value at the corresponding sample location. The temporary coverage value is then ANDed with the fragment coverage value to generate a new fragment coverage value." Similar wording could be found in Vulkan spec 1.1.100 "25.6. Multisample Coverage" Thus we need to compute alpha to coverage dithering manually in shader and replace sample mask store with the bitwise-AND of sample mask and alpha to coverage dithering. The following formula is used to compute final sample mask: m = int(16.0 * clamp(src0_alpha, 0.0, 1.0)) dither_mask = 0x1111 * ((0xfea80 >> (m & ~3)) & 0xf) \| 0x0808 * (m & 2) \| 0x0100 * (m & 1) sample_mask = sample_mask & dither_mask Credits to Francisco Jerez <currojerez@riseup.net> for creating it. It gives a number of ones proportional to the alpha for 2, 4, 8 or 16 least significant bits of the result. GEN6 hardware does not have issue with simultaneous usage of sample mask and alpha to coverage however due to the wrong sending order of oMask and src0_alpha it is still affected by it. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109743 Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2019-03-25 13:54:55 -07:00
Chris Wilson	db99d02fce	iris: Push heavy memchecker code to DEBUG Invoking VALGRIND_CHECK_MEM_IS_DEFINED pulls in enough code to convince gcc to not inline __gen_uint and results in a lot of packing code ending up out-of-line with lots of stack copying. To ameliorate this, only insert the check inside the packer if DEBUG is defined and instead perform the validation checking before submitting the batch to the kernel. This should give accurate results if --trace-origins=yes is used, and failing that we can recompile in full debug mode to check on insertion. Improve drawoverhead baseline by 25% with a default build with valgrind-dev installed (with effectively no loss of vg coverage). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-03-22 10:38:03 -07:00
Kenneth Graunke	66c100a8d6	iris: Skip resolves and flushes altogether if unnecessary Improves drawoverhead baseline scores by 1.17x.	2019-03-21 20:28:17 -07:00
Rafael Antognolli	93123417dd	iris: Track fast clear color. v2: Update tracked clear color when we update the surface state. v3: Update all aux surface states when updating the clear color. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-03-20 16:46:26 -07:00
Rafael Antognolli	131b42f0aa	iris: Implement fast clear color. If all the restrictions are satisfied, do a fast clear instead of regular clear. v2: - add perf_debug() when we can't fast clear (Ken) - improve comment: s/miptree/resource/ (Ken) - use swizzle_color_value from blorp (Ken) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-03-20 16:46:25 -07:00
Rafael Antognolli	a8b5ea8ef0	iris: Add function to update clear color in surface state. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-03-20 16:46:25 -07:00
Rafael Antognolli	34d00b4410	iris: Use the clear depth when emitting 3DSTATE_CLEAR_PARAMS. Take the clear depth into account when IRIS_DIRTY_DEPTH_BUFFER is marked as dirty. Also update the blorp surface clear color. v2: Use a single if (zres && zres->aux.bo) (Ken). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-03-20 16:46:25 -07:00
Tapani Pälli	3e534489ec	iris: mark switch case fallthrough CID: 1444103 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-03-20 08:21:50 +02:00
Rafael Antognolli	9c63ec26ea	iris: Enable HiZ for multisampled depth surfaces. Fix this check so that we can get a HiZ aux buffer for multisampled surfaces as well. Also make sure we don't try to emit a sampler view surface state for multisampled depth sufaces with HiZ enabled, as the sampler can't HiZ for multisampled buffers and isl would assert. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-03-18 22:21:30 -07:00
Kenneth Graunke	d5974aeeae	iris: Slightly better bounds on buffer sizes	2019-03-18 01:39:43 -07:00
Kenneth Graunke	d75f84cb65	iris: Fix write enable in pinning of depth/stencil resources We may bind new Z/S buffers (which come via the framebuffer CSO, triggering IRIS_DIRTY_DEPTH_BUFFER), but with writes disabled. The next draw may enable Z or S writes (which come via the ZSA CSO, triggering IRIS_DIRTY_WM_DEPTH_STENCIL), which requires us to update our pin to have the write flag. So, update pinning if either dirty flag changes. To clarify, pass cso_zsa to the pinning function rather than pulling the random values out of ice->state, which unfortunately have to exist for the resolve code since iris_depth_stencil_alpha_state only exists in iris_state.c.	2019-03-11 15:04:08 -07:00
Kenneth Graunke	863e810a19	iris: Refactor depth/stencil buffer pinning into a helper. This avoids the code duplication that caused me to put things in the wrong place in the previous commit. One used to have extra flushes, but we moved those out so now these are identical and can be easily shared.	2019-03-11 15:04:08 -07:00
Kenneth Graunke	9302414f8b	iris: Move depth/stencil flushes so they actually do something Commit `d6dd57d43c` (iris: Add missing depth cache flushes) added the depth/stencil flushes to the wrong place. I meant to add them to the iris_upload_dirty_render_state code that emits the packets, but I accidentally added them to the nearly identical looking code in iris_restore_render_saved_bos. This meant we missed the actual flushing at draw time, but instead did pointless flushing on the first draw in a batch where things are already flushed anyway. This commit moves them to iris_resolve.c, next to the depth prepares, similar to what we do for color buffers. i965 does them elsewhere, but I'm not sure why - this seems like the most consistent place.	2019-03-11 15:04:08 -07:00
Kenneth Graunke	04ff2e3fbb	iris: Fix TES gl_PatchVerticesIn handling. 1. If we switch the TCS for one with a different number of output vertices, then the TES's gl_PatchVerticesIn value will change. We need to re-upload in this case. For now, re-emit constants whenever the TCS/TES are swapped out. 2. If there is no TCS, then we can't grab gl_PatchVerticesIn from the TCS info. Since it's a passthrough, we can just use the primitive's patch count (like the TCS gl_PatchVerticesIn does). Fixes KHR-GL45.tessellation_shader.single.max_patch_vertices and KHR-GL45.tessellation_shader.tessellation_control_to_tessellation_evaluation.gl_PatchVerticesIn. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-03-11 14:07:16 -07:00
Kenneth Graunke	2f51cb5e67	iris: Rework default tessellation level uploads Now that we've added a system value uploading mechanism, we may as well reuse the same system for default tessellation levels. This simplifies the state upload code a bit. Also fixes: KHR-GL45.tessellation_shader.tessellation_control_to_tessellation_evaluation.gl_tessLevel Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-03-11 14:07:12 -07:00
Kenneth Graunke	f36794d1f0	iris: Fix backface stencil write condition A bit too much search and replace here.	2019-03-10 14:52:53 -07:00
Sagar Ghuge	bca28deb46	iris: Track last VS URB entry size Return immediately if last VS URB entry size is good enough for BLORP operation v2: Fix comments (Caio) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Suggested-by: Kenneth Graunke<kenneth@whitecape.org> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-03-08 10:01:39 -08:00
Sagar Ghuge	d0a8fba69a	iris: Refactor code to share 3DSTATE_URB_* packet v2: 1) Set IRIS_DIRTY_URB bit (Caio) 2) Get rid of unnecessary function (Caio) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Suggested-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-03-08 10:01:39 -08:00
Kenneth Graunke	809a81ec3a	iris: Properly support alpha and luminance-alpha formats For texturing, we map alpha formats to the corresponding red format, as many alpha formats are outright missing, and red is more efficient when sampling anyway. When rendering to A8_UNORM, we use that format directly, so the image gets the shader output's .a/.w channel, rather than the .r/.x channel. All other A* formats are non-renderable, so we can't do much and just mark them as unsupported for rendering. Fortunately, GL only requires rendering to A8_UNORM, so that works out. According to Andre Heider and Timur Kristóf, this fixes font rendering in Witcher 1 (via nine). Andre also reported that it fixes Unigine Heaven (presumably via nine). v2: Use the same swizzle for both sampler views and "render targets". BLORP expects the read swizzle, and will take the inverse when setting up the destination swizzle (and actually applying it in the shaders). We ignore the format swizzle when setting up normal rendering SURFACE_STATEs, which is necessary because it would be an illegal shader channel select combination. Thanks to Jason Ekstrand for pointing out that BLORP took an inverse swizzle. Tested-by: Timur Kristóf <timur.kristof@gmail.com> Tested-by: Andre Heider <a.heider@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-03-07 11:39:27 -08:00
Kenneth Graunke	fbc51c4c95	iris: Defer uploading sampler state tables until draw time Gallium might call us multiple times to bind subsets of the samplers, at which point we'd recreate the table a bunch of times. It doesn't really buy us anything to do it here - even if we defer to draw time, the dirty tracking ensures we'll only do it on the first draw after a bind_sampler_states() call. We now use the number of samplers specified by the shader instead of the binding count. If this number changes, we flag sampler state as dirty so we re-upload a table with the right number of entries. This also fixes a bug where ice->state.need_border_colors was never unset, so once something needed border colors, the pool would always be pinned in all future batches. v2: Explicitly flag sampler states as dirty, rather than assuming that bind_sampler_states() will be called if the program texture count changes. While this may be true for st/mesa, it isn't the case for Gallium HUD. Tested-by: Timur Kristóf <timur.kristof@gmail.com> Tested-by: Andre Heider <a.heider@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-03-07 11:39:27 -08:00
Kenneth Graunke	9caabd6c5f	iris: Plumb through ISL_SWIZZLE_IDENTITY in buffer surface emitters Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-03-07 11:39:27 -08:00
Kenneth Graunke	4787bc944a	isl: Add a swizzle parameter to isl_buffer_fill_state() This is necessary for legacy texture buffer object formats, where we'll need to use a swizzle to fake e.g. luminance. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-03-07 11:39:27 -08:00
Kenneth Graunke	744b8e1c12	iris: Fix MOCS for blits and clears I915_MOCS_CACHED is the wrong value. Expose mocs() and use that.	2019-03-06 18:04:53 -08:00
Jose Maria Casanova Crespo	ffa9082c40	iris: setup EdgeFlag Vertex Element when needed. If Vertex Shader uses EdgeFlag the hardware request that it is setup as the last VERTEX_ELEMENT_STATE. If SGVS are add at draw time we need to also reconfigure the last 3DSTATE_VF_INSTANCING so its VertexElementIndex points to the new Vertex Element that contains the EdgeFlag. So if draw parameters or edgeflag are not used the CSO generated at iris_create_vertex_element is sent directly in the batches. But if edge flag is used we adjust last VERTEX_ELEMENT_STATE and last 3DSTATE_VF_INSTANCING using their alternative edge flag version we generate at iris_create_vertex_element and store at the CSO. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-03-06 22:19:08 +00:00
Jose Maria Casanova Crespo	4122665dd9	iris: Enable ARB_shader_draw_parameters support Additional VERTEX_ELEMENT_STATE are used to store basevertex and baseinstance and drawid updating the DWordLength of the 3DSTATE_VERTEX_ELEMENTS command. This passes all piglit tests for spec.draw_parameters. tests and VK-GL-CTS KHR-GL45.shader_draw_parameters_tests.* tests. Now we only mark a dirty_update when parameters are changed or when we have an indirect draw. We enable PIPE_CAP_DRAW_PARAMETERS on Iris. There is no edge flag support in the Vertex Elements setup. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-02-26 13:28:38 -08:00
Jordan Justen	bd0ad651e0	iris: Always use in-tree i915_drm.h Ref: `f1374805a8` "drm-uapi: use local files, not system libdrm" Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-02-24 21:06:40 -08:00
Rafael Antognolli	4f191feb0c	iris: Pin HiZ buffers when rendering.	2019-02-21 10:26:12 -08:00
Kenneth Graunke	2cddc953cd	iris: some initial HiZ bits	2019-02-21 10:26:12 -08:00
Kenneth Graunke	56f1fe3eac	iris: pin the buffers	2019-02-21 10:26:12 -08:00
Kenneth Graunke	d8f3bc1c4c	iris: actually use the multiple surf states for aux modes	2019-02-21 10:26:12 -08:00
Kenneth Graunke	77a1070d36	iris: Initial import of resolve code	2019-02-21 10:26:12 -08:00
Kenneth Graunke	3efd5299af	iris: Fill out SURFACE_STATE entries for each possible aux usage	2019-02-21 10:26:12 -08:00
Jordan Justen	d0996d5fab	iris: Emit default L3 config for the render pipeline Signed-off-by: Jordan Justen <jordan.l.justen@intel.com>	2019-02-21 10:26:12 -08:00
Kenneth Graunke	51ddc40084	iris: Always emit at least one BLEND_STATE	2019-02-21 10:26:12 -08:00
Kenneth Graunke	d6dd57d43c	iris: Add missing depth cache flushes	2019-02-21 10:26:12 -08:00
Kenneth Graunke	bf23e79629	iris: Set HasWriteableRT correctly A bit of irritating state cross dependency here, but nothing too hard	2019-02-21 10:26:12 -08:00
Kenneth Graunke	d612cd1bf8	iris: Set 3DSTATE_WM::ForceThreadDispatchEnable The Vulkan driver only sets this if color writes are disabled, which is more conservative - but would require us to inspect blend state. (If color writes are enabled, we don't need to force anything, because the internal signal is already correct. But it shouldn't hurt to do so.)	2019-02-21 10:26:12 -08:00
Kenneth Graunke	27d751cdd8	iris: Drop XXX about alpha testing I was misreading i965 - the 3DSTATE_WM::PixelShaderKillsPixel bit from Gen < 8 needed all of this, but the 3DSTATE_PS_EXTRA bit only needs prog_data->uses_kill.	2019-02-21 10:26:12 -08:00
Kenneth Graunke	15341778ba	iris: rework num textures to util_lastbit	2019-02-21 10:26:12 -08:00
Kenneth Graunke	732c3a90a4	iris: Fix bug in bound vertex buffer tracking res might be NULL, at which point this is an unbind.	2019-02-21 10:26:11 -08:00
Kenneth Graunke	4bfd12bbf7	iris: minor tidying	2019-02-21 10:26:11 -08:00
Kenneth Graunke	b1bacbf038	iris: Unreference some more things on state module teardown	2019-02-21 10:26:11 -08:00
Caio Marcelo de Oliveira Filho	4fd1f70e62	iris: always include an extra constbuf0 if using UBOs In st_nir_lower_uniforms_to_ubo() all UBO access in the shader have its index incremented to open room for uniforms in constbuf0. So if we use UBOs, we always need to include the extra binding entry in the table. To avoid doing this checks both when compiling the shader and when assigning binding tables, store the num_cbufs in iris_compiled_shader. Fixes a bunch of tests from Piglit and CTS that use UBOs but don't use uniforms or system values. Note that some tests fitting this criteria were passing because the UBOs were moved to be push constants (avoiding the problem). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-02-21 10:26:11 -08:00
Kenneth Graunke	61798e3c88	iris: CS stall on VF cache invalidate workarounds See commit `31e4c9ce40` in i965.	2019-02-21 10:26:11 -08:00
Kenneth Graunke	0f677b0d87	iris: Don't enable smooth points when point sprites are enabled dEQP-GLES3.functional.rasterization.fbo.rbo_multisample_*.primitives.points	2019-02-21 10:26:11 -08:00
Kenneth Graunke	3b336a1513	iris: Allow sample mask of 0 I think this was an attempt to work around various sample mask bugs I had early on. It's not correct. A sample mask of 0 is legal and means to disable all samples. Fixes dEQP-GLES31.functional.texture.multisample..sample_mask*	2019-02-21 10:26:11 -08:00
Kenneth Graunke	7f318bf2ac	iris: Delete genx->bound_vertex_buffers This is actually stored in ice->state, as it isn't gen-specific	2019-02-21 10:26:11 -08:00
Kenneth Graunke	7ed1383c0a	iris: Fix surface states for Gen8 lowered-to-untype images We have to use SURFTYPE_BUFFER and ISL_FORMAT_RAW for these.	2019-02-21 10:26:11 -08:00

1 2 3 4 5 ...

400 commits