fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-16 18:28:05 +02:00

Author	SHA1	Message	Date
Kenneth Graunke	140f53e646	Revert "nir: replace lower_ffma and fuse_ffma with has_ffma" This reverts commit `939ddf3f67`. Intel has a separate pass for fusing FFMAs selectively. We split these flags in commit `1b72c31e1f` and the reasoning still stands. The patch being reverted was just a cleanup, so there should be no issue with reverting it. Acked-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6849>	2020-09-24 13:11:50 -07:00
Jonathan Marek	4c71cda9ab	vulkan/wsi/display: add option for display fence to signal syncobj To avoid having a separate "wsi_fence" path in the driver, make it so wsi fences can signal a syncobj. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Acked-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6707>	2020-09-24 13:20:00 +00:00
Marek Olšák	939ddf3f67	nir: replace lower_ffma and fuse_ffma with has_ffma Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6756>	2020-09-24 12:29:11 +00:00
Marek Olšák	771aad3027	nir: split lower_ffma into lower_ffma16/32/64 AMD wants different behavior for each bit size Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6756>	2020-09-24 12:29:11 +00:00
Jason Ekstrand	9750164c09	nir: Rename get_buffer_size to get_ssbo_size This makes it explicit that this intrinsic is only for SSBOs. For the v3dv driver, we'll be adding a get_ubo_size intrinsic and we want to be able to distinguish between the two. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6812>	2020-09-22 13:34:12 +00:00
Lionel Landwerlin	10540c2f6f	anv: implement VK_KHR_copy_commands2 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6801>	2020-09-22 11:37:12 +00:00
Lionel Landwerlin	cc3bf00cc2	intel/compiler: fixup Gen12 workaround for array sizes We didn't handle the case of NULL images/textures for which we should return 0. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `397ff2976b` ("intel: Implement Gen12 workaround for array textures of size 1") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3522 Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6729>	2020-09-21 21:20:09 +00:00
Jason Ekstrand	f63ffc18e7	intel/fs/swsb: SCHEDULING_FENCE only emits SYNC_NOP It's not really unordered in the sense that it can still stall on ordered things and we don't need a SYNC_NOP for that because it is a SYNC_NOP. However, it also doesn't count when computing instruction distances. Fixes: `18e72ee210` "intel/fs: Add FS_OPCODE_SCHEDULING_FENCE" Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6781>	2020-09-20 14:43:40 +00:00
Jason Ekstrand	a163cba1e3	anv: Fix the target_bo assertion in anv_reloc_list_add Fixes: `b75a92e201` "anv: assert that the target bo is valid..." Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3528 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6759>	2020-09-17 07:25:05 -05:00
Mike Blumenkrantz	b75a92e201	anv: assert that the target bo is valid when adding a reloc list this catches some undefined behavior like e.g., using a stale descriptorset that references deleted bos, which I would absolutely never do Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6747>	2020-09-16 19:58:16 +00:00
Mike Blumenkrantz	355f241d41	anv: improve error message when failing to open device path Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6747>	2020-09-16 19:58:16 +00:00
Gert Wollny	80cde3ad55	intel/compiler: Set lower_uniform_to_ubo compiler flag Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6316>	2020-09-16 10:07:42 +00:00
Lionel Landwerlin	320d80f2ce	intel/genxml: make sure test assert are compiled in Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6678>	2020-09-15 06:14:34 +00:00
Jordan Justen	20a4235c4c	anv, iris: Set MediaSamplerDOPClockGateEnable for gen12+ This has been shown to help performance on TGL and DG1. This could be applied to gen9+, but we still need to show if it helps with those platforms. Rework: * Make change in src/intel/vulkan/genX_cmd_buffer.c too. (Ken) * Keep mask as 3 for gen < 12 Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6684>	2020-09-11 17:40:03 -07:00
Nanley Chery	e3101c96bb	anv/image: Disable multi-layer CCS_E on TGL+ Anv doesn't do multi-layer fast-clear tracking, but TGL may add fast-clears to multiple layers. Disable CCS_E for image arrays on TGL+ until anv gets more clear color tracking abilities. With this change, anv+TGL now passes: * dEQP-VK.multiview.readback_implicit_clear.15_15_15_15 * dEQP-VK.multiview.readback_implicit_clear.8_1_1_8 * dEQP-VK.multiview.readback_implicit_clear.1_2_4_8_16_32 * dEQP-VK.multiview.renderpass2.readback_implicit_clear.15_15_15_15 * dEQP-VK.multiview.renderpass2.readback_implicit_clear.8_1_1_8 * dEQP-VK.multiview.renderpass2.readback_implicit_clear.1_2_4_8_16_32 v2. Mention HSD 14010672564. (Sagar) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6528>	2020-09-11 23:35:28 +00:00
Nanley Chery	5cd6235958	blorp: Drop trailing whitespace in blorp_clear.c Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6650>	2020-09-10 22:09:28 +00:00
Nanley Chery	1eff389bd3	blorp: Fix alignment test for HIZ_CCS_WT fast-clears Remove the extra logical ORs in the ternary operation. Fixes: `5425fcf2cb` ("intel/blorp: Satisfy HIZ_CCS fast-clear alignments") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6650>	2020-09-10 22:09:28 +00:00
Marcin Ślusarz	69e65b9cf0	intel/tools: fix possible randomly increased verbosity of error2aub Found by Coverity. Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6667>	2020-09-10 12:16:58 +00:00
Marcin Ślusarz	18eb853ac8	intel/compiler: quiet Coverity warnings Coverity complains about possible out-of-bounds write & read, because it thinks that "loc + i" can be bigger than sizes of the 2 used arrays. It's not obvious from the code it cannot happen, so add asserts here. Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6667>	2020-09-10 12:16:58 +00:00
Marcin Ślusarz	5b6fd2a314	intel/tools: handle ftell errors Found by Coverity, as "argument cannot be negative", referring to fread's 2nd argument. Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6667>	2020-09-10 12:16:58 +00:00
Marcin Ślusarz	46a82aa3a6	intel/tools: fix possible memory leak in the error path Found by Coverity. Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6667>	2020-09-10 12:16:58 +00:00
Marcin Ślusarz	5ea0b6a9c6	intel/compiler: initialize remaining fields of various classes These variables seem to be initialized before being used, so this patch is not fixing any bug, but leaving them uninitialized may become a bug after some refactoring. These classes were affected: fs_reg_alloc, fs_visitor, fs_generator, instruction_scheduler. Found by Coverity. Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6667>	2020-09-10 12:16:58 +00:00
Marcin Ślusarz	40b964dc8f	intel/compiler: remove unused fs_validator::param_size Found by Coverity as an uninitialized variable. Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6667>	2020-09-10 12:16:58 +00:00
Marcin Ślusarz	d3cd24999f	anv: fix minor gen_ioctl(I915_PERF_IOCTL_CONFIG) error handling issue Found by Coverity. Fixes: `2001a80d4a` ("anv: Implement VK_KHR_performance_query") Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6667>	2020-09-10 12:16:58 +00:00
Nanley Chery	80a8bb590a	intel/common: Drop unused gen_aux_map_add_image Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6486>	2020-09-09 20:02:03 +00:00
Nanley Chery	1ddb1a6cf9	intel/common: Add get_aux_map_format_bits() Add a function suitable for planar YUV surfaces. For these surfaces, drivers remap each plane to an RGB-formatted surface. Enable drivers to pass the plane index and the original YUV format to get the right aux-map format bits. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6486>	2020-09-09 20:02:03 +00:00
Nanley Chery	9efd71466d	intel/isl: Add YUV format info for the aux-map * Define ISL equivalents for the P010, P012, and P016 formats. * Add aux-map encodings for the YUV formats iris will soon support. v2. Replace &&'s with ||'s in isl_format_is_planar() (Lionel) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6486>	2020-09-09 20:02:03 +00:00
Nanley Chery	1ea3a54366	intel/isl: Support ISL_AUX_USAGE_MC in surface states Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6486>	2020-09-09 20:02:03 +00:00
Nanley Chery	e15543b71a	intel/isl: Describe I915_FORMAT_MOD_Y_TILED_GEN12_MC_CCS Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6486>	2020-09-09 20:02:03 +00:00
Jason Ekstrand	3bd7c3c9db	intel/nir: Call validate_ssa_dominance at both ends of the NIR compile This invokes it before we go into the optimization/lowering pass and then right before we go out of SSA. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5288>	2020-09-08 19:44:01 +00:00
Eric Anholt	f3b33a5a35	nir: Add a range_base+range to nir_intrinsic_load_ubo(). For UBO accesses to be the same performance as classic GL default uniform block uniforms, we need to be able to push them through the same path. On freedreno, we haven't been uploading UBOs as push constants when they're used for indirect array access, because we don't know what range of the UBO is needed for an access. I believe we won't be able to calculate the range in general in spirv given casts that can happen, so we define a [0, ~0] range to be "We don't know anything". We use that at the moment for all UBO loads except for nir_lower_uniforms_to_ubo, where we now avoid losing the range information that default uniform block loads come with. In a departure from other NIR intrinsics with a "base", I didn't make the base be something you have to add to the src[1] offset. This keeps us from needing to modify all drivers (particularly since the base+offset thing can mean needing to do addition in the backend), makes backend tracking of ranges easy, and makes the range calculations in load_store_vectorizer reasonable. However, this could definitely cause some confusion for people used to the normal NIR base. Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6359>	2020-09-08 18:20:51 +00:00
Marcin Ślusarz	64b0b7c274	intel/compiler: fix typo in a comment Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6602>	2020-09-04 17:38:25 +00:00
Marcin Ślusarz	95ce619680	intel/compiler: print dispatch width when shader fails to compile Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6602>	2020-09-04 17:38:25 +00:00
Marcin Ślusarz	e5f735a986	intel/compiler: move extern C functions out of namespace brw brw_compile_gs and brw_compile_tcs are extern C functions, but are defined inside of brw namespace, which somehow works but confuses Eclipse CDT's code analysis. Move these functions out of brw namespace and fix references to objects from brw namespace. Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6602>	2020-09-04 17:38:25 +00:00
Marcin Ślusarz	d4c6e3f196	intel/compiler: use the same name for nir shaders in brw_compile_* functions Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6602>	2020-09-04 17:38:25 +00:00
Marcin Ślusarz	0dda209406	intel/compiler: match brw_compile_* declarations with their definitions Current state confuses Eclipse CDT's code analysis. Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6602>	2020-09-04 17:38:25 +00:00
Marek Olšák	ac55b1a9a6	nir: get ffma support from NIR options for nir_lower_flrp This also fixes the inverted last parameter of nir_lower_flrp in most drivers. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6599>	2020-09-04 17:06:22 +00:00
Marcin Ślusarz	663c4d5377	intel/fs: add hint how to get more info when shader validation fails Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6559>	2020-09-04 12:09:22 +00:00
Marcin Ślusarz	ab07926057	intel: add INTEL_DEBUG=shaders Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6559>	2020-09-04 12:09:22 +00:00
Jason Ekstrand	21fbffc542	anv: Set alignments on UBO/SSBO root derefs This doesn't really do anything for us today. One day, I suppose we could use it to do something with wide loads with non-uniform offsets. The big reason to do this is to get better testing to make sure that NIR doesn't blow up on the deref paths. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6472>	2020-09-03 18:02:51 +00:00
Jason Ekstrand	a7a0315d7f	intel/nir: Stop using nir_lower_vars_to_scratch Instead, we do a limited indirect deref lowering and then use nir_lower_vars_to_explicit_types and nir_lower_explicit_io to lower it as if it were SSBO or global memory access. Among other things, this should enable pointer arithmetic on local variables. Fun! The only shader-db change from this change on ICL was a few tiny cycle count changes in 7 Aztec Ruins compute shaders. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5909>	2020-09-03 14:26:49 +00:00
Jason Ekstrand	38a83a3048	nir/lower_indirect_derefs: Add a threshold Instead of always lowering everything, we add a threshold such that if the total indirected array size (AoA size) is above that threshold, it won't lower. It's assumed that the driver will sort things out somehow by, for instance, lowering to scratch. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5909>	2020-09-03 14:26:49 +00:00
Jason Ekstrand	c897cd0278	intel/compiler: Handle all indirect lowering choices in brw_nir.c Since everything flows through NIR and we're doing all of our indirect deref lowering there now, there's no reason to keep making those decisions in brw_compiler and stuffing them in the GLSL compiler structs. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5909>	2020-09-03 14:26:49 +00:00
Jason Ekstrand	fe18a0fd45	intel/nir: Lower load_num_work_groups to 32-bit if needed For OpenCL-style kernels, this builtin is 64-bit. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6570>	2020-09-02 20:38:22 +00:00
Jason Ekstrand	5799da47c7	intel/fs: Use a single untyped surface read for load_num_work_groups There's no good reason to split this into three. Sure, CS indirects are only guaranteed by the spec to be DWORD aligned, but that's all untyped surface reads require anyway. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6570>	2020-09-02 20:38:22 +00:00
Jason Ekstrand	8e8701b43a	intel/fs: Don't copy-propagate stride=0 sources into ddx/ddy This can come up if, for instance, the shader does a derivative of a uniform or flat input. Ideally, NIR would use divergence analysis to get rid of the derivative in this case but it doesn't right now. This fixes a crash in F1 2017. Cc: mesa-stable@lists.freedesktop.org Reported-by: Marcin Ślusarz <marcin.slusarz@intel.com> Tested-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6564>	2020-09-02 20:31:32 +00:00
Jason Ekstrand	5ee3242837	anv: Patch constant data pointers into shaders with using softpin When we have softpin, we know the address of the shader constant data at shader upload time because it's sitting at the end of the shader. This commit changes ANV to use patch constants to embed the address in the shader and patch the right address in at upload time. This allows us to avoid having to set up a UBO binding on-the-fly for shader constants. This commit uses an A64 message but it's quite possible that we could also use an A32 message and make the dataport do the 64-bit add for us. However, load_global is what we have right now so it was easier to just use that. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6244>	2020-09-02 19:48:44 +00:00
Jason Ekstrand	a5acf3d8c3	anv: Properly cache brw_stage_prog_data::relocs Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6244>	2020-09-02 19:48:44 +00:00
Jason Ekstrand	91becd84ae	intel/fs: Add support for a new load_reloc_const intrinsic Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6244>	2020-09-02 19:48:44 +00:00
Jason Ekstrand	8d8a3815ef	intel/eu: Add a mechanism for emitting relocatable constant MOVs Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6244>	2020-09-02 19:48:44 +00:00

5850 commits