fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-01-31 13:30:42 +01:00

Author	SHA1	Message	Date
Iago Toral Quiroga	7ad692d8e2	anv: do not subtract the base layer to compute depth in 3DSTATE_DEPTH_BUFFER According to the PRM description of the Depth field: "This field specifies the total number of levels for a volume texture or the number of array elements allowed to be accessed starting at the Minimum Array Element for arrayed surfaces" However, ISL defines array_len as the length of the range [base_array_layer, base_array_layer + array_len], so it already represents a value relative to the base array layer like the hardware expects. v2: Depth is defined as a U11-1 field, so subtract 1 from the actual value (Jason) This fixes a number of new CTS tests that would crash otherwise: dEQP-VK.pipeline.render_to_image.* Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-03-02 09:04:03 +01:00
Iago Toral Quiroga	64bf78270d	isl: document the meaning of the array_len field in isl_view Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-03-02 09:03:42 +01:00
Jason Ekstrand	d5b355ce5f	i965: Move intel_debug.h to intel/common/gen_debug.h This is shared between the Vulkan and GL drivers as it's a requirement of the back-end compiler. However, it doesn't really belong in the compiler. We rename the file to match the prefix of the other stuff in common and because libdrm defines an intel_debug.h and this avoids a pile of possible name conflicts. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2017-03-01 16:14:03 -08:00
Jason Ekstrand	8048c1953c	i965: Reduce cross-pollination between the DRI driver and compiler Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-03-01 16:14:03 -08:00
Jason Ekstrand	e647c4fbd9	util/build-id: Return a pointer rather than copying the data We're about to use the build-id as the starting point for another SHA1 hash in the Intel Vulkan driver, and returning a pointer is far more convenient. Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-03-01 15:31:44 -08:00
Jason Ekstrand	e3d33a23e6	anv: Properly handle destroying NULL devices and instances Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: "17.0 13.0" <mesa-dev@lists.freedesktop.org>	2017-03-01 15:31:44 -08:00
Mauro Rossi	3f2cb699cf	android: vulkan: add support for libmesa_vulkan_util The following changes are implemented: Add src/vulkan/Android.mk to build libmesa_vulkan_util Android.mk: add src/vulkan to SUBDIR to build new module intel/vulkan: fix libmesa_vulkan_util,vk_enum_to_str.h dependencies Add -o OUTPUT_PATH option in src/vulkan/util/gen_enum_to_str.py script Use -o OUTPUT_PATH option in automake generation rules for vk_enum_to_str.{c,h} Fixes: `e9dcb17` "vulkan/util: Add generator for enum_to_str functions" Fixes: `8e03250` "vulkan: Combine wsi and util makefiles" Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> [Emil Velikov] - Move parser within main() - Use --outdir instead of -o Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2017-02-28 01:24:41 +01:00
Emil Velikov	3935690d58	automake: anv: add missing include $(top_srcdir)/src/vulkan/util Otherwise we'll fail to find the header and `make distcheck` will bail. Fixes: `e9dcb17962` ("vulkan/util: Add generator for enum_to_str functions") Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2017-02-28 14:08:17 +00:00
Jason Ekstrand	76c8327e6e	anv: Bump advertised version to 1.0.42 We've been following the spec changes. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-02-27 09:44:46 -08:00
Dave Airlie	f695735ed6	vulkan/wsi/radv: add initial prime support (v1.1) This is a complete rewrite of my previous rfc patches. This adds the ability to present to a different GPU that rendering using a driver side operation that can copy from the tiled to linear shared image. This does prime support completely in the swapchain present code, and each queue has a precreated command buffer for each image and for the each queue family. This means presenting should work on graphics and compute queues and transfer in the future. v1.1: initialise needs_linear_copy in swapchain. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Mike Lothian <mike@fireburn.co.uk> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-02-27 05:42:16 +10:00
Emil Velikov	93369aa928	blorp: automake: add TODO to the tarball Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Andreas Boll <andreas.boll.dev@gmail.com>	2017-02-24 17:37:00 +00:00
Emil Velikov	ab6fa871ef	anv: automake: add TODO to the tarball Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Andreas Boll <andreas.boll.dev@gmail.com>	2017-02-24 17:36:59 +00:00
Jason Ekstrand	261092f7d4	anv: Enable MSAA compression This just enables basic MSAA compression (no fast clears) for all multisampled surfaces. This improves the framerate of the Sascha "multisampling" demo by 76% on my Sky Lake laptop. Running Talos on medium settings with 8x MSAA, this improves the framerate in the benchmark by 80%. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-02-23 12:10:42 -08:00
Jason Ekstrand	42b10b175d	anv/blorp/clear_subpass: Only set surface clear color for fast clears Not all clear colors are valid. In particular, on Broadwell and earlier, only 0/1 colors are allowed in surface state. No CTS tests are affected outright by this because, apparently, the CTS coverage for different clear colors is pretty terrible. However, when multisample compression is enabled, we do hit it with CTS tests and this commit prevents regressions when enabling MCS on Broadwell and earlier. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: "13.0 17.0" <mesa-stable@lists.freedesktop.org>	2017-02-23 12:10:42 -08:00
Pohjolainen, Topi	042cc201f2	intel/isl: Apply render target alignment constraints for MCS v2: Instead of having the same block in isl_gen7,8,9.c add it once into isl.c::isl_choose_image_alignment_el() instead. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2017-02-23 12:10:42 -08:00
Lionel Landwerlin	34e29b2ebd	intel/isl: add MCS width constraint 16 samples v3 (Jason Ekstrand): Add a comment explaining why Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-02-23 12:10:42 -08:00
Jason Ekstrand	3885375195	intel/isl: Return surface creation success from aux helpers The isl_surf_init call that each of these helpers make can, in theory, fail. We should propagate that up to the caller rather than just silently ignoring it. Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-02-23 12:10:42 -08:00
Samuel Iglesias Gonsálvez	a9c488f285	isl/state: fix assert on raw buffer surface state minimum size From IVB PRM, SURFACE_STATE::Height: "For typed buffer and structured buffer surfaces, the number of entries in the buffer ranges from 1 to 2^27 . For raw buffer surfaces, the number of entries in the buffer is the number of bytes which can range from 1 to 2^30." The minimum value is 1, according to the spec. The spec quote was already added into the code by `028f6d8317`. Fixes crashing tests under: dEQP-VK.robustness.buffer_access.* Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-02-23 11:46:47 +01:00
Jason Ekstrand	1bd0e9ca33	anv/Makefile: Gather all the genX files into one place While we're here, we also fix the alphabetization of the list of genx_* files. Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2017-02-22 15:07:18 -08:00
Dylan Baker	8e03250fcf	vulkan: Combine wsi and util makefiles Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-02-22 13:12:02 -08:00
Dylan Baker	e9dcb17962	vulkan/util: Add generator for enum_to_str functions This adds a python generator to produce enum_to_str functions for Vulkan from the vk.xml API description. It supports extensions as well as core API features, and the generator works with both python2 and python3. Signed-off-by: Dylan Baker <dylanx.c.baker@intel.com> Acked-by: Matt Turner <mattst88@gmail.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net>	2017-02-22 13:12:02 -08:00
Jason Ekstrand	f31ed6d0cd	anv: Take a device parameter in anv_state_flush This allows the helper to check for llc instead of having to do it manually at all the call sites. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-02-21 12:26:35 -08:00
Jason Ekstrand	f408971deb	anv: Pull all clflushing into a clflush_range helper All this cache line address calculation stuff is tricky. Let's not duplicate it more places than we have to. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-02-21 12:26:35 -08:00
Jason Ekstrand	16b187c8bb	anv: Remove the unused state_pool_emit macro Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-02-21 12:26:35 -08:00
Jason Ekstrand	f9d7d27d6d	anv: Rename clflush_range and state_clflush It's a bit shorter and easier to work with. Also, we're about to add a helper called clflush which does the clflush but without any memory fencing. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-02-21 12:26:35 -08:00
Jason Ekstrand	075ed20614	intel/blorp: Explicitly flush all allocated state Found by inspection. However, I expect it fixes real bugs when using blorp from Vulkan on little-core platforms. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: "13.0 17.0" <mesa-stable@lists.freedesktop.org>	2017-02-21 12:26:35 -08:00
Jason Ekstrand	b6b03329af	anv: Put everything about queries in genX_query.c Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-02-21 12:26:35 -08:00
Jason Ekstrand	965fad0e8b	anv/Makefile: alphabetize Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-02-21 12:26:35 -08:00
Jason Ekstrand	40087bcb51	anv/query: Perform CmdResetQueryPool on the GPU This fixes a some rendering corruption in The Talos Principle Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: "13.0 17.0" <mesa-stable@lists.freedesktop.org>	2017-02-21 12:26:35 -08:00
Jason Ekstrand	dc9abd0e6b	genxml: Make MI_STORE_DATA_IMM more consistent Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: "13.0 17.0" <mesa-stable@lists.freedesktop.org>	2017-02-21 12:26:35 -08:00
Jason Ekstrand	3788cd3239	anv/query: clflush the bo map on non-LLC platforms Found by inspection Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: "13.0 17.0" <mesa-stable@lists.freedesktop.org>	2017-02-21 12:26:35 -08:00
Jason Ekstrand	8582ab2d6e	anv: Add an invalidate_range helper This is similar to clflush_range except that it puts the mfence on the other side to ensure caches are flushed prior to reading. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: "13.0 17.0" <mesa-stable@lists.freedesktop.org>	2017-02-21 12:26:35 -08:00
Emil Velikov	9807e9dea6	anv: remove unused anv_dispatch_table dtable Fixes: `4c9dec80ed` ("anv: Get rid of the ANV_CALL macro") Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2017-02-21 18:31:04 +00:00
Emil Velikov	e776e0385c	anv: remove unneeded extern C notation Analogous to previous commit - never used in any C++ code. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2017-02-21 18:28:18 +00:00
Dave Airlie	0a44a680ff	vulkan/wsi/x11: add support to detect if we can support rendering (v3) This adds support to radv_GetPhysicalDeviceXlibPresentationSupportKHR and radv_GetPhysicalDeviceXcbPresentationSupportKHR to check if the local device file descriptor is compatible with the descriptor retrieved from the X server via DRI3. This will stop radv binding to an X server until we have prime support in place. Hopefully apps use this API before trying to render things. v2: drop unneeded function, don't leak memory. (jekstrand) v3: also check in surface_get_support callback. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-02-20 12:53:52 +10:00
Jason Ekstrand	5f02c2a054	anv/TODO: Check off Storage Image Without Format The code for this landed a few days ago.	2017-02-17 14:18:34 -08:00
Matt Turner	656e30b686	anv: Use build-id for pipeline cache UUID. The --build-id=... ld flag has been present since binutils-2.18, released 28 Aug 2007. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-02-15 13:59:51 -08:00
Lionel Landwerlin	0fcb92c17d	anv: wsi: report presentation error per image request vkQueuePresentKHR() takes VkPresentInfoKHR pointer and includes a pResults fields which must holds the results of all the images requested to be presented. Currently we're not filling this field. Also as a side effect we probably want to go through all the images rather than stopping on the first error. This commit also makes the QueuePresentKHR() implementation return the first error encountered. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Cc: "17.0" <mesa-stable@lists.freedesktop.org>	2017-02-15 11:43:05 +00:00
Jason Ekstrand	bfbb362601	anv: Use vk_foreach_struct for handling extension structs Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-02-14 16:15:39 -08:00
Jason Ekstrand	f434a60a53	anv: Implement the Skylake stencil PMA optimization Unfortunately, this doesn't substantially improve the performance of any known apps. With Dota 2 on my Sky Lake gt4, it seems help by somewhere between 0% and 1% but there's enough noise that it's hard to get a clear picture. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2017-02-14 14:18:55 -08:00
Jason Ekstrand	d665c51eea	genxml: Add the CACHE_MODE_0 register on gen9 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-02-14 14:18:55 -08:00
Jason Ekstrand	028e1137e6	anv/pipeline: Be smarter about depth/stencil state It's a bit hard to measure because it almost gets lost in the noise, but this seemed to help Dota 2 by a percent or two on my Broadwell GT3e desktop. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2017-02-14 14:18:55 -08:00
Jason Ekstrand	215fed7318	anv/pipeline: Make a copy of VkPipelineDepthStencilStateCreateinfo Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2017-02-14 14:18:55 -08:00
Jason Ekstrand	e8d52dab48	anv: Add support for the PMA fix on Broadwell This helps Dota 2 on Broadwell by 8-9%. I also hacked up the driver and used the Sascha "shadowmapping" demo to get some results. Setting uses_kill to true dropped the framerate on the demo by 25-30%. Enabling the PMA fix brought it back up to around 90% of the original framerate. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2017-02-14 14:18:55 -08:00
Jason Ekstrand	62bba4ba2d	genxml: Add the CACHE_MODE_1 register on gen8 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-02-14 14:18:55 -08:00
Jason Ekstrand	6ce8592836	anv: Disable stencil writes when both write masks are zero Vulkan doesn't have a stencilWriteEnable bit like it does for depth. Instead, you have a stencil mask. Since the stencil mask is handled as dynamic state, we have to handle it later during command buffer construction. This, combined with a later commit, seems to help Dota2 on my Broadwell GT3e desktop by a couple percent because it allows the hardware to move the depth and stencil writes to early in more cases. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2017-02-14 14:18:55 -08:00
Jason Ekstrand	114c281e70	anv/entrypoints: Only generate entrypoints for supported features This changes the way anv_entrypoints_gen.py works from generating a table containing every single entrypoint in the XML to just the ones that we actually need. There's no reason for us to burn entrypoint table space on a bunch of NV extensions we never plan to implement. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-02-14 14:18:55 -08:00
Connor Abbott	6319bfc2a6	anv: fix Get*MemoryRequirements for !LLC Even though we supported both coherent and non-coherent memory types, we effectively forced apps to use the coherent types by accident. Found by inspection, only compile tested. Signed-off-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Cc: "17.0" <mesa-stable@lists.freedesktop.org>	2017-02-14 13:05:44 -08:00
Alex Smith	924a8cbb40	anv: Add support for shaderStorageImageWriteWithoutFormat This allows shaders to write to storage images declared with unknown format if they are decorated with NonReadable ("writeonly" in GLSL). Previously an image view would always use a lowered format for its surface state, however when a shader declares a write-only image, we should use the real format. Since we don't know at view creation time whether it will be used with only write-only images in shaders, create two surface states using both the original format and the lowered format. When emitting the binding table, choose between the states based on whether the image is declared write-only in the shader. Tested on both Sascha Willems' computeshader sample (with the original shaders and ones modified to declare images writeonly and omit their format qualifiers) and on our own shaders for which we need support for this. Signed-off-by: Alex Smith <asmith@feralinteractive.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-02-14 08:16:52 -08:00
Jason Ekstrand	2c30918581	anv/apply_pipeline_layout: Set image.write_only to false This makes our driver robust to changes in spirv_to_nir which would set this flag on the variable. Right now, our driver relies on spirv_to_nir not setting var->data.image.write_only for correctness. Any patch which implements the shaderStorageImageWriteWithoutFormat will need to effectively revert this commit. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-02-14 08:16:45 -08:00

... 277 278 279 280 281 ...

15202 commits