fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-23 10:48:08 +02:00

Author	SHA1	Message	Date
Jason Ekstrand	f195d40eca	anv/device: Add a helper for querying whether a BO is busy This is a bit more efficient than using GEM_WAIT with a timeout of 0. Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk>	2017-04-05 21:17:11 -07:00
Emil Velikov	a6840efc09	anv: provide required gem stubs for the tests Introduce stubs to anv_gem_stub.c that match the anv_gem.c ones. Otherwise we may get link-time errors, when building the tests. v2: Introduce all the missing stubs at once. Cc: Jason Ekstrand <jason@jlekstrand.net> Cc: Vinson Lee <vlee@freedesktop.org> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=100574 Fixes: `c964f0e485` ("anv: Query the kernel for reset status") Fixes: `651ec926fc` ("anv: Add support for 48-bit addresses") Fixes: `060a6434ec` ("anv: Advertise larger heap sizes") Signed-off-by: Emil Velikov <emil.velikov@collabora.com> --- I've intentionally kept the order the same identical to the anv_gem.c. This way we can easily grep & diff in the future ;-)	2017-04-05 17:54:38 +01:00
Jason Ekstrand	060a6434ec	anv: Advertise larger heap sizes Instead of just advertising the aperture size, we do something more intelligent. On systems with a full 48-bit PPGTT, we can address 100% of the available system RAM from the GPU. In order to keep clients from burning 100% of your available RAM for graphics resources, we have a nice little heuristic (which has received exactly zero tuning) to keep things under a reasonable level of control. Reviewed-by: Kristian H. Kristensen <krh@bitplanet.net>	2017-04-04 18:33:52 -07:00
Jason Ekstrand	651ec926fc	anv: Add support for 48-bit addresses This commit adds support for using the full 48-bit address space on Broadwell and newer hardware. Thanks to certain limitations, not all objects can be placed above the 32-bit boundary. In particular, general and state base address need to live within 32 bits. (See also Wa32bitGeneralStateOffset and Wa32bitInstructionBaseOffset.) In order to handle this, we add a supports_48bit_address field to anv_bo and only set EXEC_OBJECT_SUPPORTS_48B_ADDRESS if that bit is set. We set the bit for all client-allocated memory objects but leave it false for driver-allocated objects. While this is more conservative than needed, all driver allocations should easily fit in the first 32 bits of address space and keeps things simple because we don't have to think about whether or not any given one of our allocation data structures will be used in a 48-bit-unsafe way. Reviewed-by: Kristian H. Kristensen <krh@bitplanet.net>	2017-04-04 18:33:52 -07:00
Jason Ekstrand	439da38d18	anv: Replace anv_bo::is_winsys_bo with a uint32_t flags Reviewed-by: Kristian H. Kristensen <krh@bitplanet.net>	2017-04-04 18:33:52 -07:00
Jason Ekstrand	5d1ba2cb04	anv/blorp: Align vertex buffers to 64B This fixes issues seen when adding support for full 48-bit addresses. The 48-bit addresses themselves have nothing to do with it other than that it caused the kernel to place buffers slightly differently so they interacted differently with the caches. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: "13.0 17.0" <mesa-stable@lists.freedesktop.org>	2017-04-04 18:33:52 -07:00
Jason Ekstrand	c964f0e485	anv: Query the kernel for reset status When a client causes a GPU hang (or experiences issues due to a hang in another client) we want to let it know as soon as possible. In particular, if it submits work with a fence and calls vkWaitForFences or vkQueueWaitIdle and it returns VK_SUCCESS, then the client should be able to trust the results of that rendering. In order to provide this guarantee, we have to ask the kernel for context status in a few key locations. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-04-04 18:33:52 -07:00
Jason Ekstrand	82573d0f75	anv: Check for device loss at the end of WaitForFences It's possible that the device could have been lost while we were waiting. We should let the user know if this has happened. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-04-04 18:33:51 -07:00
Jason Ekstrand	c6f69eea6a	anv/pipeline: Properly handle unset gl_Layer and gl_ViewportIndex When the shader does not set one of these values, they are supposed to get a default value of 0. We have hardware bits in 3DSTATE_CLIP for this but haven't been setting them. This fixes the intermittent failure of dEQP-VK.geometry.layered.3d.render_to_default_layer. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: "13.0 17.0" <mesa-stable@lists.freedesktop.org>	2017-04-04 18:33:51 -07:00
Jason Ekstrand	0817110969	anv: Implement VK_KHR_incremental_present Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Daniel Stone <daniels@collabora.com>	2017-04-03 13:51:08 -07:00
Jason Ekstrand	f82b6c6272	vulkan/wsi: Plumb present regions through the common code Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Acked-by: Dave Airlie <airlied@redhat.com>	2017-04-03 13:51:08 -07:00
Tapani Pälli	3535b87a1a	anv: change BLOCK_POOL_MEMFD_SIZE to 1GB This allows us to run 32bit Vulkan apps on Android, ftruncate call would fail on 2GB (max size being 2GB - 1). Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-03-31 08:43:28 +03:00
Craig Stout	1da7a11de8	anv/cmd_buffer: fix host memory leak push_constants must be free'd. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=100452 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Cc: "17.0 13.0" <mesa-stable@lists.freedesktop.org>	2017-03-29 14:32:32 -07:00
Jason Ekstrand	9aba81b160	anv/batch_chain: Handle another OOM in cmd_buffer_execbuf Found by inspection while rebasing other patches. Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2017-03-29 09:39:49 -07:00
Jason Ekstrand	f3673db3d6	anv/cmd_buffer: Refactor flush_pipeline_select_* While having the _3d and _gpgpu versions is nice, there's no reason why we need to have duplicated logic for tracking the current pipeline. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2017-03-28 14:57:09 -07:00
Jason Ekstrand	6baae9625d	anv: Flush caches prior to PIPELINE_SELECT on all gens The programming note that says we need to do this still exists in the SkyLake PRM and, from looking at the bspec, seems like it may apply to all hardware generations SNB+. Unfortunately, this isn't particularly clear cut since there is also language in the bspec that says you can skip the flushing and stall to get better throughput. Experimentation with the "Car Chase" benchmark in GL seems to indicate that some form of flushing is still needed. This commit makes us do the full set of flushes regardless of hardware generation. We can always reduce the flushing later. Reported-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Cc: "17.0 13.0" <mesa-stable@lists.freedesktop.org>	2017-03-28 14:57:08 -07:00
Jason Ekstrand	0fe3dcce4c	anv/cmd_buffer: Fix bad indentation A bunch of code was indented in such a way that it looked like it went with the if statement above but it definitely didn't. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Cc: "17.0 13.0" <mesa-stable@lists.freedesktop.org>	2017-03-28 14:57:06 -07:00
Jason Ekstrand	01a65dc43b	anv/cmd_buffer: Apply flush operations prior to executing secondaries This fixes rendering issues in the Vulkan port of skia on some hardware. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: "13.0 17.0" <mesa-stable@lists.freedesktop.org>	2017-03-28 14:56:55 -07:00
Jason Ekstrand	9319ef96fd	anv/blorp: Use anv_get_layerCount everywhere Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: "13.0 17.0" <mesa-stable@lists.freedesktop.org>	2017-03-28 14:41:48 -07:00
Jason Ekstrand	1b8fa8dd79	anv: Make anv_get_layerCount a macro Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: "13.0 17.0" <mesa-stable@lists.freedesktop.org>	2017-03-28 14:41:47 -07:00
Chad Versace	6cbc13d94c	intel: Fix requests for exact surface row pitch (v2) All callers of isl_surf_init() that set 'min_row_pitch' wanted to request an exact row pitch, as evidenced by nearby asserts, but isl lacked API for doing so. Now that isl has an API for that, update the code to use it. v2: Assert that isl_surf_init() succeeds because the callers assume it. [for jekstrand] Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> (v1) Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> (v1) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (v2)	2017-03-28 09:44:44 -07:00
Xu Randy	004468de14	anv/blorp: Fix a crash in CmdClearColorImage We should use anv_get_layerCount() to access layerCount of VkImageSubresourceRange in anv_CmdClearColorImage and anv_CmdClearDepthStencilImage, which handles the VK_REMAINING_ARRAY_LAYERS (~0) case. Test: Sample multithreadcmdbuf from LunarG can run without crash Signed-off-by: Xu Randy <randy.xu@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: "13.0 17.0" <mesa-stable@lists.freedesktop.org>	2017-03-27 07:43:17 -07:00
Samuel Iglesias Gonsálvez	c4c02471f4	anv: enable sampling from fast-cleared images on SKL A resolve is not needed on Skylake in this case. We were forcing a resolve because we set the input_aux_usage to ISL_AUX_USAGE_NONE. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2017-03-27 06:32:24 +02:00
Iago Toral Quiroga	129fd58131	anv/query: handle out of host memory without crashing in compute_query_result() We don't need to make the caller (CmdCopyQueryPoolResults) aware of the problem since compute_query_result() only emits state. The caller is also expected to hit OOM in this scenario right after calling this function, but it is already handling it safely. Fixes: dEQP-VK.api.out_of_host_memory.cmd_copy_query_pool_results Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2017-03-24 09:39:44 +01:00
Iago Toral Quiroga	ddb2bb3ed4	anv/pipeline: make FragCoord include sample positions when sample shading We need to know if sample shading has been requested during shader compilation since that affects the way fragment coordinates are computed. Notice that the semantics of fragment coordinates only depend on whether sample shading has been requested, not on whether more than one sample will actually be produced (that is, minSampleShading and rasterizationSamples do not affect this behavior). Because this setting affects the code we generate for the shader, we also need to include it in the WM prog key. Notice we don't need to alter the OpenGL code because it doesn't ever use this behavior, so the key's value is always false (the default). Fixes: dEQP-VK.glsl.builtin_var.fragcoord_msaa.* Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-03-24 08:11:53 +01:00
Iago Toral Quiroga	023ea3772d	nir/lower_wpos_center: support adding sample position to fragment coordinate According to section 14.6 of the Vulkan specification: "When sample shading is enabled, the x and y components of FragCoord reflect the location of the sample corresponding to the shader invocation." So add a boolean parameter to the lowering pass to select this behavior when we need it. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-03-24 08:11:53 +01:00
Iago Toral Quiroga	4da1832c00	anv: return VK_ERROR_DEVICE_LOST immediately when device is known to be lost If we know the device has been lost we should return this error code for any command that can report it before we attempt to do anything with the device. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-03-24 08:11:53 +01:00
Iago Toral Quiroga	50c8d2c1f7	anv/device: keep track of 'device lost' state The Vulkan specs say: "A logical device may become lost because of hardware errors, execution timeouts, power management events and/or platform-specific events. This may cause pending and future command execution to fail and cause hardware resources to be corrupted. When this happens, certain commands will return VK_ERROR_DEVICE_LOST (see Error Codes for a list of such commands). After any such event, the logical device is considered lost. It is not possible to reset the logical device to a non-lost state, however the lost state is specific to a logical device (VkDevice), and the corresponding physical device (VkPhysicalDevice) may be otherwise unaffected. In some cases, the physical device may also be lost, and attempting to create a new logical device will fail, returning VK_ERROR_DEVICE_LOST." This means that we need to track if a logical device has been lost so we can have the commands referenced by the spec return VK_ERROR_DEVICE_LOST immediately. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-03-24 08:11:53 +01:00
Iago Toral Quiroga	70194c9f1a	anv/device: return VK_ERROR_DEVICE_LOST for errors during queue submissions So that we don't have to do things like rolling back address relocations in case that we ran into OOM after computing them, etc Also, make sure that if the queue submission comes with a fence, we set it up correctly so it behaves according to the spec after returning VK_ERROR_DEVICE_LOST. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-03-24 08:11:53 +01:00
Dylan Baker	4ee675d537	anv: Remove dead prototype from entrypoints Spotted by Emil. v2: - Add this patch Signed-off-by: Dylan Baker <dylanx.c.baker@intel.com>	2017-03-22 16:22:00 -07:00
Dylan Baker	860beb99a6	anv: use cElementTree in anv_entrypoints_gen.py It's written in C rather than pure python and is strictly faster, the only reason not to use it is that its classes cannot be subclassed. Signed-off-by: Dylan Baker <dylanx.c.baker@intel.com>	2017-03-22 16:22:00 -07:00
Dylan Baker	9050138af7	anv: don't use Element.get in anv_entrypoints_gen.py This has the potential to mask errors, since Element.get works like dict.get, returning None if the element isn't found. I think the reason that Element.get was used is that vulkan has one extension that isn't really an extension, and thus is missing the 'protect' field. This patch changes the behavior slightly by replacing get with explicit lookup in the Element.attrib dictionary, and using xpath to only iterate over extensions with a "protect" attribute. Signed-off-by: Dylan Baker <dylanx.c.baker@intel.com>	2017-03-22 16:22:00 -07:00
Dylan Baker	4d4697f868	anv: use dict.get in anv_entrypoints_gen.py Instead of using an if and a check, use dict.get, which does the same thing, but more succinctly. Signed-off-by: Dylan Baker <dylanx.c.baker@intel.com>	2017-03-22 16:22:00 -07:00
Dylan Baker	96a5f2a5ac	anv: anv_entrypoints_gen.py: use reduce function. Reduce is its own reward. Signed-off-by: Dylan Baker <dylanx.c.baker@intel.com>	2017-03-22 16:22:00 -07:00
Dylan Baker	dd3830d11b	anv: anv-entrypoints_gen.py: rename hash to cal_hash. hash is reserved name in python, it's the interface to access an object's hash protocol. Signed-off-by: Dylan Baker <dylanx.c.baker@intel.com>	2017-03-22 16:22:00 -07:00
Dylan Baker	8211e3e60d	anv: Generate anv_entrypoints header and code in one command This produces the header and the code in one command, saving the need to call the same script twice, which parses the same XML file. Signed-off-by: Dylan Baker <dylanx.c.baker@intel.com>	2017-03-22 16:22:00 -07:00
Dylan Baker	383032c700	anv: anv_entrypoints_gen.py: directly write files instead of piping This changes the output to be written as a file rather than being piped. This had one critical advantage, it encapsulates the encoding. This prevents bugs where a symbol (generally unicode like © [copyright]) is printed and the system being built on doesn't have a unicode locale. v2: - Update Android.mk v3: - Don't generate both files at once - Fix Android.mk - drop --outdir, since the filename is passed in as an argument Signed-off-by: Dylan Baker <dylanx.c.baker@intel.com>	2017-03-22 16:22:00 -07:00
Dylan Baker	a2a2bad2e2	anv: convert C generation to template in anv_entrypoints_gen.py This produces a file that is identical except for whitespace, there is a table that has 8 columns in the original and is easy to do with prints, but is ugly using mako, so it doesn't have columns; the data is not inherently tabular. Signed-off-by: Dylan Baker <dylanx.c.baker@intel.com>	2017-03-22 16:22:00 -07:00
Dylan Baker	0d8e22c5e4	anv: convert header generation in anv_entrypoints_gen.py to mako This produces an identical file except for whitespace. Signed-off-by: Dylan Baker <dylanx.c.baker@intel.com>	2017-03-22 16:22:00 -07:00
Dylan Baker	ba1085c694	anv: Update "do not edit" comments with proper filename This does two things, first it updates both the .h and the .c file to have the same do not edit string. Second, it uses __file__ to ensure that even if the file is moved or renamed that the name will be correct. One thing to note is the use of '{{' and '}}' in the C template. This is to instruct python to print a literal '{' and '}' respectively, rather than treating the contents as a formatter specifier. v3: - add this patch Signed-off-by: Dylan Baker <dylanx.c.baker@intel.com>	2017-03-22 16:22:00 -07:00
Dylan Baker	ed9339bf26	anv: split main into two functions in anv_entrypoints_gen.py This is groundwork for the next patches, it will allow porting the header and the code to mako separately, and will also allow both to be run simultaneously. Signed-off-by: Dylan Baker <dylanx.c.baker@intel.com>	2017-03-22 16:22:00 -07:00
Dylan Baker	1610b3dede	anv: don't pass xmlfile via stdin anv_entrypoints_gen.py It's slow, and has the potential for encoding issues. v2: - pass xml file location via argument - update Android.mk Signed-off-by: Dylan Baker <dylanx.c.baker@intel.com>	2017-03-22 16:22:00 -07:00
Dylan Baker	8017da8dd2	anv: make constants capitals in anv_entrypoints_gen.py Again, it's standard python style. Signed-off-by: Dylan Baker <dylanx.c.baker@intel.com>	2017-03-22 16:22:00 -07:00
Dylan Baker	08a6d3b4ba	anv: Use python style in anv_entrypoints_gen.py These are all fairly small cleanups/tweaks that don't really deserve their own patch. - Prefer comprehensions to map() and filter(), since they're faster - replace unused variables with _ - Use 4 spaces of indent - drop semicolons from the end of lines - Don't use parens around if conditions - don't put spaces around brackets - don't import modules as caps (ET -> et) - Use docstrings instead of comments v2: - Replace comprehensions with multiplication Signed-off-by: Dylan Baker <dylanx.c.baker@intel.com>	2017-03-22 16:22:00 -07:00
Dylan Baker	abd72f2e35	anv: anv_entrypoints_gen.py: use a main function This is just good practice. Signed-off-by: Dylan Baker <dylanx.c.baker@intel.com>	2017-03-22 16:22:00 -07:00
Jason Ekstrand	7ab03ba725	anv/device: Move push descriptor query handling The query is a properties query so it needs to be handled in GetPhysicalDeviceProperties2, not GetPhysicalDeviceFeatures2. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-03-22 09:44:54 -07:00
Jason Ekstrand	c942faf8f3	anv/image: Return early when unbinding an image Found by inspection. Reviewed-by: Chad Versace <chadversary@chromium.org> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Cc: "17.0 13.0" <mesa-stable@lists.freedesktop.org>	2017-03-22 09:44:54 -07:00
Kenneth Graunke	0c3fbf8028	i965: Drop AUB_TRACE_* stuff. This was used for aubdumping (deleted a while ago) and INTEL_DEBUG=bat decoding (deleted recently). While we're changing parameters, delete the wrapper macro and make the actual function brw_state_batch instead of __brw_state_batch. This subsumes a patch by Emil Velikov to drop this from BLORP. Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2017-03-21 13:49:18 -07:00
Xu,Randy	57595cb073	anv/genX: Solve the vkCreateGraphicsPipelines crash The crash is due to NULL pColorBlendState, which is legal if the pipeline has rasterization disabled or if the subpass of the render pass the pipeline is created against does not use any color attachments. Test: Sample subpasses from LunarG can run without crash Signed-off-by: Xu,Randy <randy.xu@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Cc: "17.0 13.0" <mesa-stable@lists.freedesktop.org>	2017-03-20 08:31:18 +02:00
Jason Ekstrand	1d5f4f46da	genxml: Make MI_STORE_DATA_IMM have a single 64-bit data field This is way more convenient than having two separate dword fields. Reviewed-By: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-03-17 15:31:19 -07:00

4700 commits