fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-28 01:18:15 +02:00

Author	SHA1	Message	Date
Tapani Pälli	728ebcdec2	iris/android: fix build and link with libmesa_intel_perf Fixes: `0fd4359733` "iris/perf: implement routines to return counter info" Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-22 10:01:14 +03:00
Jason Ekstrand	951cf94521	nir: Add explicit signs to image min/max intrinsics This better matches all the other atomic intrinsics such as those for SSBOs and shared variables where the sign is part of the intrinsic opcode. Both generators (GLSL and SPIR-V) know the sign from the type of the image variable or handle. In SPIR-V, signed min/max are separate opcodes from unsigned. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-08-21 17:19:55 +00:00
Sagar Ghuge	fe0e9db797	iris: Enable non coherent framebuffer fetch on broadwell v2: Use GEN_GEN in iris_state (Kenneth Graunke) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-20 00:50:58 -07:00
Sagar Ghuge	57ce422e20	iris: Free resource if failed to allocate surface state Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-20 00:50:55 -07:00
Sagar Ghuge	02244bc515	iris: Pass isl_surf to fill_surface_state Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Suggested-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-20 00:50:45 -07:00
Sagar Ghuge	638a157e02	iris: Add infrastructure to support non coherent framebuffer fetch Create separate SURFACE_STATE for render target read in order to support non coherent framebuffer fetch on broadwell. Also we need to resolve framebuffer in order to support CCS_D. v2: Add outputs_read check (Kenneth Graunke) v3: 1) Import Curro's comment from get_isl_surf 2) Rename get_isl_surf method 3) Clean up allocation in case of failure Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-20 00:50:44 -07:00
Sagar Ghuge	61c0637afb	iris: Add helper functions to get tile offset All helper functions are ported from i965 driver. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-20 00:50:43 -07:00
Sagar Ghuge	7e816991cc	iris: Add helper function to get isl dim layout v2: Add missing space (Caio) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-20 00:50:41 -07:00
Sagar Ghuge	58471e20d2	iris: Add render target read entry in binding table This will be used in next patches for supporting non coherent framebuffer fetch on Broadwell. v2: Fix comment (Kenneth Graunke) v3: 1) Fix a few nits (Caio) 2) Add comment (Caio) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-20 00:50:31 -07:00
Jason Ekstrand	16edd02bfa	iris: Only request an input mask if the shader needs it Fixes: `aebca3961b` "iris: Fix handling of SIMD32 fragment shaders" Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-16 19:59:42 -05:00
Jordan Justen	0f5be81edd	iris: Expose aux buffer as 2nd plane w/modifiers Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-13 15:20:47 -07:00
Jordan Justen	246eebba4a	iris: Export and import surfaces with modifiers that have aux data The DRI interface for modifiers with aux data treats the aux data as a separate plane of the main surface. When the dri layer requests the plane associated with the aux data, we save the required information into the dri aux plane image. Later when the image is used, the dri plane image will be available in the pipe_resource structure's `next` field. Therefore in iris, we reconstruct the aux setup from this separate dri plane image when the image is used. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-13 15:20:47 -07:00
Kenneth Graunke	99c8eb997d	iris: Do proper format checks for Y+CCS modifier support We need to ensure that the DRI image format supports CCS. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2019-08-13 15:20:47 -07:00
Jordan Justen	51f941c20c	iris: Create single bo for surfaces with modifiers and aux data Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-13 15:20:47 -07:00
Jordan Justen	2c7b577e13	iris: Split iris_resource_alloc_aux to enable aux modifiers Reworks: * If the aux-state is not ISL_AUX_STATE_AUX_INVALID, then use memset even when memset_value is zero. The hiz buffer initial aux-state will be set to invalid, and therefore we can skip the memset. But, for CCS it will be set to ISL_AUX_STATE_PASS_THROUGH, and therefore the aux data must be cleared to 0 with the memset. Previously we would use BO_ALLOC_ZEROED with the CCS aux data, so this memset wasn't required. Now, the CCS aux data may be part of the main surface. We prefer to not use BO_ALLOC_ZEROED excessively, so the memset is needed for the CCS case. (Nanley) Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-13 15:20:46 -07:00
Jordan Justen	aad36dfd16	iris: Add aux offset into hiz_address This is not currently required because the hiz buffer is in a separate buffer, and therefore the offset is 0. If we combine the aux buffer with the main surface buffer, then the hiz offset may become non-zero. Suggested-by: Nanley Chery <nanley.g.chery@intel.com> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-13 15:20:39 -07:00
Jordan Justen	fc12fd05f5	iris: Implement pipe_screen::resource_get_param Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-13 01:12:30 -07:00
Rafael Antognolli	a1a499e7fe	iris/gen11: Emit SLICE_HASH_TABLE when pipes are unbalanced. If the pixel pipes have a different number of subslices, emit a slice hashing table that will ensure proper workload distribution. v2: Don't need to set the mask - it's mbo (Ken). v3: Don't keep a reference to the resource used for emitting the table (Ken).	2019-08-12 16:19:08 -07:00
Jason Ekstrand	134607760a	intel/compiler: Fill a compiler statistics struct This commit is all annoying plumbing work which just adds support for a new brw_compile_stats struct. This struct provides a binary driver readable form of the same statistics we dump out to stderr when we INTEL_DEBUG is set with a shader stage. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-08-12 22:56:07 +00:00
Francisco Jerez	026773397b	iris/gen9: Optimize slice and subslice load balancing behavior. See "i965/gen9: Optimize slice and subslice load balancing behavior." for the rationale. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-12 13:17:58 -07:00
Tapani Pälli	d4b574f26a	iris: reorder arguments as expected by the function CID: 1452262 Fixes: `b4c54894bb` "iris: Handle vertex shader with window space position" Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com>	2019-08-12 13:08:26 +03:00
Tapani Pälli	590ba15d6e	iris/android: move iris_query.c to 'per gen' LIBIRIS_SRC_FILES Fixes Iris build on Android. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2019-08-12 10:06:36 +03:00
Kenneth Graunke	0f3768bc5d	iris: Free query on error path CID: 1452276	2019-08-11 14:04:31 -07:00
Kenneth Graunke	661be3fef9	iris: Add missing 'break' We don't want to fall through to unreachable(). CID: 1452277	2019-08-11 14:04:31 -07:00
Kenneth Graunke	f1dba99639	iris: minor restyling	2019-08-10 00:16:45 -07:00
Mark Janes	9c597514d4	iris/query: enable amd performance monitors Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-09 19:28:34 -07:00
Mark Janes	469af7fdc9	iris/perf: get monitor results Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-09 19:28:32 -07:00
Mark Janes	1cb4fc184f	iris/perf: add begin/end hooks Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-09 19:28:24 -07:00
Mark Janes	8c4c346665	iris/perf: add delete query Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-09 19:28:17 -07:00
Mark Janes	aca42759ff	iris/perf: implement iris_create_monitor_object This is the first call that provides the iris context to the monitor implementation. On the first call, use the iris context to initialize the monitor context. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-09 19:28:14 -07:00
Mark Janes	0fd4359733	iris/perf: implement routines to return counter info With this commit, Iris will report that AMD_performance_monitor is supported, and will allow the caller to query the available metrics. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-09 19:28:03 -07:00
Greg V	c0dc5c1859	meson: define ETIME to ETIMEDOUT if not present Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-08-08 21:44:33 +01:00
Rhys Perry	c52c54a746	anv,i965,iris: deduplicate setting of total_shared v5: add patch Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-08-08 12:10:39 -05:00
Mark Janes	2446f5cfd8	intel/perf: move perf-related constants to common location The perf subsystem needs several macro definitions that were duplicated in Iris and i965 headers. Place these macros within perf, if the perf implementation contains the only references to the values. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-07 21:33:55 -07:00
Bas Nieuwenhuizen	5a26f528cb	meson,i965: Link with android deps when building for android. The DBG marco in brw_blorp.c ends up calling an android log function: error: undefined reference to '__android_log_print' v2: On suggestion from Lionel, hang the Android dependency onto a new libintel_common dependency. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-08-07 15:34:46 +02:00
Danylo Piliaiev	b4c54894bb	iris: Handle vertex shader with window space position Iris advertises support for PIPE_CAP_TGSI_VS_WINDOW_SPACE_POSITION so let's actually implement it. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=110657 Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-06 20:25:35 +00:00
Kenneth Graunke	382f92a814	iris: Increase BATCH_SZ to 64kB This seems to improve performance by roughly ~1% across the board. Thanks to Rafael Antognolli and Dan Walsh for their help tuning.	2019-08-06 09:09:26 -07:00
Kenneth Graunke	64b73b770b	iris: Fix bad external BO hash table and zombie list interactions A while ago, we started deferring GEM object closure and VMA release until buffers were idle. This had some unforeseen interactions with external buffers. We keep imported buffers in hash tables, so if we have repeated imports of the same GEM object, we map those to the same iris_bo structure. This is critical for several reasons. Unfortunately, we broke this assumption. When freeing a non-idle external buffer, we would drop it from the hash tables, then move it to the zombie list. If someone reimported the same GEM object, we would not find it in the hash tables, and go ahead and make a second iris_bo for that GEM object. But the old iris_bo would still be in the zombie list, and so we would eventually call GEM_CLOSE on it - closing a BO that should have still been live. To work around this, we defer removing a BO from the hash tables until it's actually fully closed. This has the strange effect that an external BO may be on the zombie list, and yet be resurrected before it can be properly cleaned up. In this case, we remove it from the list so it won't be freed. Fixes severe instability in Weston, which was hitting EINVALs and ENOENTs from execbuf2, due to batches referring to a GEM object that had been closed, or at least had its VMA torched. Fixes: `457a55716e` ("iris: Defer closing and freeing VMA until buffers are idle.")	2019-08-05 08:53:41 -07:00
Kenneth Graunke	48e5a99d86	iris/bufmgr: Move iris_bo_reference into hash_find_bo, rename it Everybody importing an external buffer was looking it up in the hash table, then referencing it. We can just do that in the helper instead, which also gives us a convenient spot to stash extra code shortly.	2019-08-05 08:53:07 -07:00
Jason Ekstrand	aebca3961b	iris: Fix handling of SIMD32 fragment shaders The brw_wm_prog_data_dispatch_grf_start_reg and _prog_offset helpers read the _NPixelDispatchEnable fields from 3DSTATE_PS to figure out which bits to pull out of the prog data and stuff where. Therefore, they need to be called with the final set of _NPixelDispatchEnable bits after we've done the workaround for SIMD32 and 16x MSAA. Otherwise, if you end up with a somewhat odd combination of enables, the GRF start reg and KSP data ends up in the wrong slots. In particular, running SIMD32-only is broken but several other combinations are as well. Fixes: `5445c176e2` "iris: Disable SIMD32 when using a 16x MSAA..." Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-03 22:24:40 +00:00
Timothy Arceri	06ec14d692	iris: bump compat profile support to 4.6 All of the current piglit compat profile tests pass. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-02 18:56:53 +10:00
Kenneth Graunke	18c2e09dc7	gallium: Implement GL_EXT_shader_samples_identical via a new capability This exposes the textureSamplesIdenticalEXT function in GLSL. We enable it for iris and radeonsi, because their compilers already have support for this. Tested on Intel Kabylake and AMD Vega 64. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-01 23:38:54 -07:00
Mark Janes	49465f1330	iris/screen: use initialization routine for gen_device_info Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-08-01 16:39:48 -07:00
Mark Janes	7852fe5415	intel/common: provide common ioctl routine i965 links against libdrm for drmIoctl, but anv and iris both re-implement this routine to avoid the dependency. intel/dev also needs an ioctl wrapper, so lets share the same implementation everywhere. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-08-01 16:38:40 -07:00
Timothy Arceri	2afedfaf9a	iris: add support for gl_ClipVertex in tess eval shaders Required for OpenGL compat support. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-01 16:12:37 -07:00
Timothy Arceri	00b5bf2d72	iris: add support for gl_ClipVertex in geometry shaders This will enable us to support the OpenGL compat profile. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-01 16:12:27 -07:00
Kenneth Graunke	b61f17d362	iris: Skip emitting 3DSTATE_INDEX_BUFFER if possible We were emitting 3DSTATE_INDEX_BUFFER on every indexed draw, even if back-to-back draws referred to the same index buffer. This improves drawoverhead scores in the DrawElements cases by about 10%, by giving us even more minimal batches.	2019-07-31 15:14:10 -07:00
Kenneth Graunke	3a22a8bf49	iris: Skip repeated depth buffer disables. Often times, the depth buffer is entirely disabled, but color render targets change. For example, GenerateMipmaps will change the color render target for each miplevel, but there is no depth buffer. In the Civilization VI benchmark, this drops the median number of 3DSTATE_DEPTH_BUFFER etc. packets emitted per frame from 472 to 34.	2019-07-30 19:47:41 -07:00
Sagar Ghuge	587a497529	iris: Enable EXT_texture_shadow_lod Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-07-30 10:42:20 -07:00
Kenneth Graunke	44e713eddb	iris: Fix SO offset to be 32-bit in DrawTransformFeedback handling We accidentally started copying a full 64-bit value rather than copying a 32-bit offset and zeroing the top 32-bits. This caused us to compute bogus vertex counts which could lead to GPU hangs in some cases. Thanks to Clayton Craft for catching the regressions! Fixes: `0e24d10ff5` ("iris: Use gen_mi_builder to handle CS ALU operations.")	2019-07-29 16:38:19 -07:00

1 2 3 4 5 ...

1128 commits