fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-21 13:18:09 +02:00

Author	SHA1	Message	Date
Sagar Ghuge	c401186762	intel: Track stencil aux usage on Gen12+ Enable stencil compression enable and control surface enable bit if stencil buffer lossless compression is enabled. v2: Remove unnecessary GEN_GEN check (Nanley Chery) v3: (Nanley Chery) - Change commit subject tag from intel/isl to intel - Keep assignment order correct Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2019-10-29 14:46:15 -07:00
Sagar Ghuge	53d472df24	intel/blorp: Add helper function for stencil buffer resolve On Gen12+, Stencil buffer's lossless compression should be resolved with WM_HZ_OP packet. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2019-10-29 14:46:15 -07:00
Sagar Ghuge	ce208be2d8	intel/blorp: Assign correct view while clearing depth stencil We never saw any failures regarding this typo but it's good to assign correct stencil view while constructing blorp_params. Fixes: `0cabf93b80` "intel/blorp: Add an entrypoint for clearing depth and stencil" Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2019-10-29 14:46:15 -07:00
Sagar Ghuge	4287e0a4e4	genxml/gen12: Add Stencil Buffer Resolve Enable bit Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2019-10-29 14:46:15 -07:00
Jason Ekstrand	52aa7f3e05	anv: Reduce the minimum number of relocations The original value of 256 was under the assumption that you're a batch buffer which is likely going to have a large number of relocations. However, pipeline objects on Gen7 will have at most 6 relocations (one per shader stage and one for the workaround BO) so this is a lot of per-pipeline wasted space. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-10-29 20:27:52 +00:00
Jason Ekstrand	a3153162a9	anv: Delay allocation of relocation lists The old relocation list code always allocated 256 relocations and a hash set up-front without knowing whether or not we really need them. In particular, in the softpin case, this is two fairly large allocations that we don't need to be making. Also, for pipeline objects on haswell where we don't have softpin, we don't need relocations unless scratch is used so this is extra data per-pipeline. Instead, we should do it on-demand. This shaves 3.5% off of a cpu-limited example running with the Dawn WebGPU implementation. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-10-29 20:27:52 +00:00
Plamena Manolova	4fe2317601	anv: Implement new way for setting streamout buffers. For gen12 we set the streamout buffers using 4 separate commands instead of 3DSTATE_SO_BUFFER. Signed-off-by: Plamena Manolova <plamena.manolova@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-10-29 19:21:20 +00:00
Plamena Manolova	665b81e29a	genxml: Add 3DSTATE_SO_BUFFER_INDEX_* instructions For gen12 we set the streamout buffers using 4 separate commands instead of 3DSTATE_SO_BUFFER. Signed-off-by: Plamena Manolova <plamena.manolova@intel.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net>	2019-10-29 19:19:58 +00:00
Plamena Manolova	f9ad73cdfd	anv: Set depthBounds to true in anv_GetPhysicalDeviceFeatures. Add depth bounds testing to the list of supported physical device features. Signed-off-by: Plamena Manolova <plamena.manolova@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2019-10-29 16:05:33 +00:00
Plamena Manolova	e6c8750278	genxml: Change 3DSTATE_DEPTH_BOUNDS bias. The bias for the 3DSTATE_DEPTH_BOUNDS instruction should be 2 not 1. Signed-off-by: Plamena Manolova <plamena.manolova@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2019-10-29 16:05:33 +00:00
Lionel Landwerlin	1a2246a5e0	intel/perf: update ICL configurations A few equations/programming changes for ICL. v2: Fix a couple of issues in naming and floating/integer operations (Ken) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2019-10-29 13:00:26 +02:00
Caio Marcelo de Oliveira Filho	e2155158e9	anv: Fix output of INTEL_DEBUG=bat for chained batches The anv_batch_bo contents are linked one to another, and when printing we have to start with the first of those. Since in `u_vector` new elements are added to the head, to get the first element we need the vector's tail. Fixes: `32ffd90002` ("anv: add support for INTEL_DEBUG=bat") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-10-28 19:34:54 -07:00
Eric Engestrom	cd04b63c00	loader: default to iris for all future PCI IDs The existing "fallback" code didn't actually do anything, so this removes it, and instead we just always fallback to `iris` for future PCI IDs. Suggested-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-10-28 23:21:39 +00:00
Eric Engestrom	ea8116908c	anv: add a couple printflike() annotations Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>	2019-10-28 23:17:16 +00:00
Sagar Ghuge	366fcbf2d8	intel/isl: Support lossless compression with multisamples GEN12 adds the ability to losslessly compress each sample plane in a multisampled buffer that uses MCS compression. v2: Remove unnecessary assertion (Nanley Chery) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2019-10-28 14:02:01 -07:00
Sagar Ghuge	e80bca6895	intel/blorp: Use isl_aux_usage_has_mcs instead of comparing Depending on MCS_CSS or MCS we can emit blorp blit shaders. As we support MCS_CSS and MCS, it makes sense to use isl_aux_usage_has_mcs function. v2: Fix commit message (Nanley Chery) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2019-10-28 14:02:01 -07:00
Sagar Ghuge	2f0fbe06e6	intel/isl: Don't reconfigure aux surfaces for MCS If aux for MCS is already configured, don't configure again. v2: Fix missing period in commit message (Nanley Chery) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2019-10-28 14:02:01 -07:00
Nanley Chery	8e7644e48f	intel/blorp: Satisfy clear color rules for HIZ_CCS Store the converted depth value into two dwords. Avoids regressing the piglit test "fbo-depth-array depth-clear", when HIZ_CCS sampling is enabled in a later commit. Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-10-28 10:47:06 -07:00
Nanley Chery	0aa308f420	intel: Fix and use HIZ_CCS write through mode Write through to the CCS if the surface is used as a texture and can be sampled by the HW with CCS. Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-10-28 10:47:06 -07:00
Nanley Chery	5425fcf2cb	intel/blorp: Satisfy HIZ_CCS fast-clear alignments Prevent the piglit test, amd_vertex_shader_layer-layered-depth-texture-render, from regressing in in a future commit. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-10-28 10:47:06 -07:00
Nanley Chery	6451008e8b	intel: Refactor blorp_can_hiz_clear_depth() Prepare this function to be used in iris and to handle new Gen12 behavior. Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-10-28 10:47:06 -07:00
Nanley Chery	cc99d0adc0	isl: Add isl_surf_supports_hiz_ccs_wt() Add a helper to determine if an ISL surface supports the write-through mode of HIZ_CCS. Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-10-28 10:47:06 -07:00
Nanley Chery	04e5f7e8a9	intel/blorp: Treat HIZ_CCS like HiZ Allow it in depth buffer instructions but disable it for blits. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-10-28 10:47:06 -07:00
Nanley Chery	cc415f911f	intel/blorp: Assert against HiZ in surface states Avoid unexpected behavior if the caller happens to pass in a HiZ aux usage. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-10-28 10:47:05 -07:00
Nanley Chery	c50f8b2fc9	intel: Support HIZ_CCS in isl_surf_get_ccs_surf Add an extra aux parameter which will be filled out with CCS if the first two isl_surf parameters fit the requirements for HiZ_CCS. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-10-28 10:47:05 -07:00
Nanley Chery	e2e67b3f11	isl: Reduce assertions during aux surf creation Return false more often to reduce the burden on the caller. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-10-28 10:47:05 -07:00
Nanley Chery	6670e07a6e	intel: Enable CCS_E for R24_UNORM_X8_TYPELESS on TGL+ While this format isn't listed in BSpec: 53911, other documentation and empirical evidence suggest that it's fine to remap it to R32_FLOAT. I've filed a bug for the BSpec page. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-10-28 10:47:05 -07:00
Nanley Chery	f93bc14618	intel: Use 3DSTATE_DEPTH_BUFFER::ControlSurfaceEnable Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-10-28 10:47:05 -07:00
Jason Ekstrand	ab994ecae6	intel/isl: Support HIZ_CCS in emit_depth_stencil_hiz v2. Remove undocumented CCS_E-only mode for depth. (Nanley) Co-authored-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-10-28 10:47:05 -07:00
Nanley Chery	6312328a61	intel: Use RENDER_SURFACE_STATE::DepthStencilResource Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-10-28 10:47:05 -07:00
Jordan Justen	5d34a9975f	intel: Update alignment restrictions for HiZ surfaces. v2 (Nanley): * Maintain a chronological ordering for HiZ alignments. Suggested by Ken. Co-authored-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-10-28 10:47:05 -07:00
Nanley Chery	d5fb9cccdc	intel/blorp: Disable depth testing for slow depth clears We'll start doing slow depth clears more often on HIZ_CCS buffers in a future commit. Reduce the performance impact by making them use less bandwidth. From the Depth Test section of the BSpec: This function is enabled by the Depth Test Enable state variable. If enabled, the pixel's ("source") depth value is first computed. After computation the pixel's depth value is clamped to the range defined by Minimum Depth and Maximum Depth in the selected CC_VIEWPORT state. Then the current ("destination") depth buffer value for this pixel is read. and from the Depth Buffer Updates section of the BSpec: If depth testing is disabled or the depth test passed, the incoming pixel's depth value is written to the Depth Buffer. Taken together, it's clear that depth testing isn't necessary to perform a depth buffer clear. Mark Janes and I analyzed this patch with frameretrace and a depthrange piglit test. I disabled HiZ to ensure we'd get slow depth clears. We've observed the bandwidth consumption by the depth buffer access to be cut ~50% on BDW and SKL during depth clears. On a more graphically intensive workload, the Shadowmapping Sascha benchmark, I took the average of 3 runs on a BDW with a display resolution of about 1920x1200 (minus some desktop environment decorations). I measured a 22.61% FPS improvement when HiZ is disabled. v2. The BSpec doesn't mandate this behavior, update comment accordingly. (Ken) Fixes: `bc4bb5a7e3` ("intel/blorp: Emit more complete DEPTH_STENCIL state") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-10-28 10:47:05 -07:00
Nanley Chery	e655eed531	intel: Enable CCS_E for some formats on Gen12 In ISL: Update the format table to add CCS_E support for some 8BPP formats, some 16BPP formats, and R10G10B10A2_UNORM_SRGB. In the helper for determining CCS_E support, we return false for some 16BPP formats because they aren't properly handled in blorp_copy(). In BLORP: Allow the new and non-problematic formats for CCS_E-enabled copies. v2. Update other fields for A1B5G5R5_UNORM and A4B4G4R4_UNORM in table. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> (v1)	2019-10-28 10:47:05 -07:00
Nanley Chery	126c9562d9	isl: Redefine the CCS layout for Gen12 The CCS could be described in a number of ways, but this format was chosen to minimize churn in the drivers. We may decide on an different direction in the future. v2. Increase alignment for display surfaces. (Nanley) Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> (v1) Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2019-10-28 10:47:05 -07:00
Nanley Chery	1e91280242	isl: Add and use isl_tiling_flag_to_enum() Use a helper that will automatically handle Gen12's CCS tiling when creating a CCS isl_surf. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2019-10-28 10:47:05 -07:00
Nanley Chery	22be1447bb	isl/drm: Map HiZ and CCS tilings to Y In the function which translates ISL tilings to i915 tilings, map ISL's HiZ and CCS tilings to Y instead of NONE (linear). The HW docs describe HiZ and pre-Gen12 CCS surfaces as being Y-tiled in memory. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2019-10-28 10:47:05 -07:00
Jason Ekstrand	901bed5122	intel/isl: Update surf_fill_state for gen12 v2 (Nanley): * Avoid driver churn for now. * Include some media compression changes. Co-authored-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2019-10-28 10:47:05 -07:00
Jason Ekstrand	caf4cc548e	intel/isl/fill_state: Separate aux_mode handling from aux_surf v2. Avoid driver churn for now. (Nanley) Co-authored-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2019-10-28 10:47:05 -07:00
Jason Ekstrand	a1e0b21061	intel/isl: Add new aux modes available on gen12 v2. Add media compression. (Nanley) Co-authored-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2019-10-28 10:47:05 -07:00
Nanley Chery	8af1853331	anv/private: Modify aux slice helpers for Gen12 CCS The isl_surf structs for Gen12's CCS won't describe how many slices in the main surface can be compressed. All slices will be compressable if CCS is enabled, so lookup the main surface's logical dimension. v2. Add a space before a `?`. (Jordan) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2019-10-28 10:47:05 -07:00
Nanley Chery	ba52cd7ab2	intel/blorp: Don't assert aux slices match main slices This isn't accurate enough for HiZ which can have a discontiguous range of supported aux slices. This also won't work with the plan to represent Gen12 CCS as a single slice surface. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2019-10-28 10:47:05 -07:00
Jason Ekstrand	4021a3925c	intel/blorp: Use surf instead of aux_surf for image dimensions Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2019-10-28 10:47:05 -07:00
Nanley Chery	d90bffaef8	intel/blorp: Halve the Gen12 fast-clear/resolve rectangle Update their dimensions according to the Bspec. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2019-10-28 10:47:05 -07:00
Rafael Antognolli	43b48ee752	intel/blorp/gen12: Set FWCC when storing the clear color. From "Render Target Fast Clear" description for Gen12: "SW must store clear color using MI_STORE_DATA_IMM with ForceWriteCompletionCheck bit set." From Instruction_MI_STORE_DATA_IMM, bitfield 10 (when set to 1): "Following the last write from this command, Command Streamer will wait for all previous writes are completed and in global observable domain before moving to next command." We use 4 SDIs to store the clear color (one per channel). From the description, it looks to me that setting that flag only on the last SDI should be enough. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2019-10-28 10:47:05 -07:00
Nanley Chery	07e16221d9	isl: Round up some pitches to 512B for Gen12's CCS Gen12's CCS requires that the main surface have a pitch aligned to 512B. v2. Provide a BSpec citation. (Ken) Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-10-28 10:47:05 -07:00
Nanley Chery	300d77c2fa	anv/cmd_buffer: Don't assume CCS_E includes CCS_D There's no longer a clear-only compression mode of CCS on Gen12+. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-10-28 10:47:05 -07:00
Nanley Chery	4f0b5f9732	anv/image: Disable CCS_D on Gen12+ Clear-only compression no longer exists on TGL. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2019-10-28 10:47:04 -07:00
Nanley Chery	a94cb6503f	isl: Disable CCS_D on Gen12+ Clear-only compression no longer exists on TGL. v2. Add BSpec reference. (Sagar) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2019-10-28 10:47:04 -07:00
Nanley Chery	0eaf293b47	anv/formats: Disable I915_FORMAT_MOD_Y_TILED_CCS on TGL+ The format of the CCS has changed. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2019-10-28 10:47:04 -07:00
Nanley Chery	d0fcc2dd50	anv: Properly allocate aux-tracking space for CCS_E add_aux_state_tracking_buffer() actually checks the aux usage when determining how many dwords to allocate for state tracking. Move the function call to the point after the CCS_E aux usage is assigned. Fixes: `de3be61801` ("anv/cmd_buffer: Rework aux tracking") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-10-28 10:47:04 -07:00

1 2 3 4 5 ...

4798 commits