fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-27 10:08:13 +02:00

Author	SHA1	Message	Date
Rafael Antognolli	c032cae9ff	genxml: Rename "Function Enable" to "Enable". Rename that field name on genxml for: - 3DSTATE_GS - gen6+ - 3DSTATE_DS - gen7+ - 3DSTATE_HS - gen7+ Signed-off-by: Rafael Antognolli <rafael.antognolli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-05-03 16:41:07 -07:00
Rafael Antognolli	5b4223dc8e	genxml: Clip guardbands are float, not int. This makes genxml create the right struct types, and generate the right batch commands. Signed-off-by: Rafael Antognolli <rafael.antognolli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-05-03 16:41:07 -07:00
Rafael Antognolli	4266c372d9	genxml: 3DSTATE_VS rename Function Enable to Enable. Signed-off-by: Rafael Antognolli <rafael.antognolli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-05-03 16:41:07 -07:00
Kenneth Graunke	da299b7df3	genxml: Make "Reorder Mode" fields consistent. Both GS and SOL have these fields. Some were ReorderEnable = true, some were ReorderMode = REORDER_TRAILING, and some were just TRAILING. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2017-05-03 16:41:07 -07:00
Rafael Antognolli	872ffb2221	genxml: Add alias for MOCS. Use an alias, so we can set the same value as the #define's. v3: - Call it "SO Buffer MOCS" to follow the most common naming scheme. - Add alias for gen7 and gen75 too (Ken). Signed-off-by: Rafael Antognolli <rafael.antognolli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-05-03 16:41:02 -07:00
Rafael Antognolli	b5e652fc83	genxml: Add missing field values to 3DSTATE_SBE. Fill out "Attribute Active Component Format" possible values. Signed-off-by: Rafael Antognolli <rafael.antognolli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-05-03 16:41:02 -07:00
Rafael Antognolli	273a10b3f1	genxml: Update xml for 3DSTATE_SF. - Normalize "Anti-Aliasing Enable" - Add "Multisample Rasterization Mode" constants - Rename "Use Point Width on Vertex" to "Vertex" - Rename "Use Point Width from State" to "State" Signed-off-by: Rafael Antognolli <rafael.antognolli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-05-03 16:41:02 -07:00
Rafael Antognolli	3f155ab290	genxml: Rename clip enable property. There are two variants: - Clip Enable - CLIP Enable (on gen6) Rename everything to Clip Enable. Signed-off-by: Rafael Antognolli <rafael.antognolli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-05-03 16:41:02 -07:00
Louis-Francis Ratté-Boulianne	e0aa2bd9cb	genxml: Fill out Gen4, Gen45 and Gen5 XML Add some more details to Gen4 and Gen45 and add what is needed in Gen5 XML. This commit overwrite the previous work done on Gen4 and Gen45 as it contains more instructions and fixes some mistakes. However, comments (dword boundaries) are lost in the process. v3: - Set the type of some fields, instead of prefix. Also fix the SAMPLER_BORDER_COLOR_STATE fields of gen5.xml. Signed-off-by: Louis-Francis Ratté-Boulianne <lfrb@collabora.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2017-05-03 16:40:52 -07:00
Jason Ekstrand	4201cc2dd3	anv: Implement VK_KHX_external_semaphore_fd This implementation allocates a 4k BO for each semaphore that can be exported using OPAQUE_FD and uses the kernel's already-existing synchronization mechanism on BOs. Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-05-03 15:09:46 -07:00
Jason Ekstrand	ef2e427d78	anv: Pull the guts of cmd_buffer_execbuf into a helper Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-05-03 15:09:46 -07:00
Jason Ekstrand	975c0f339f	anv: Implement VK_KHX_external_semaphore Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-05-03 15:09:46 -07:00
Jason Ekstrand	298e054d0c	anv: Implement VK_KHX_external_semaphore_capabilities This just stubs things out. Real external semaphore support will come with VK_KHX_external_semaphore_fd. Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-05-03 15:09:46 -07:00
Jason Ekstrand	65aa89e75f	anv: Add a real semaphore struct It's just a dummy for now, but we'll flesh it out as needed for external semaphores. Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-05-03 15:09:46 -07:00
Jason Ekstrand	f8d7c23e1f	anv: Trivially implement multiDrawIndirect Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2017-05-03 11:25:46 -07:00
Jason Ekstrand	272b7e7d25	anv: Enable VK_KHX_multiview and SPV_KHR_multiview Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2017-05-03 11:25:46 -07:00
Jason Ekstrand	3dbd7737d4	anv/cmd_buffer: Emit instanced draws for multiple views Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2017-05-03 11:25:46 -07:00
Jason Ekstrand	32abb0e13c	anv/cmd_buffer: Pull indirect draw parameter loading into a helper Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2017-05-03 11:25:46 -07:00
Jason Ekstrand	0db7070330	anv/pipeline: Add shader lowering for multiview v2 (Jason Ekstrand): - Take a view_mask rather than a whole subpass - Build the view mask into the VS shader key Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2017-05-03 11:25:46 -07:00
Jason Ekstrand	ca5bdfdfc6	anv/pipeline: Add a subpass field to anv_pipeline This simplifies the code a variety of places. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2017-05-03 11:25:46 -07:00
Jason Ekstrand	c4549e05aa	anv/pipeline: Call nir_gather_info later We want to insert more lowering code that may insert system values and we need to gather info after that lowering. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2017-05-03 11:25:46 -07:00
Jason Ekstrand	dcb6a68bb4	anv: Move shader hashing to anv_pipeline Shader hashing is very closely related to shader compilation. Putting them right next to each other in anv_pipeline makes it easier to verify that we're actually hashing everything we need to be hashing. The only real change (other than the order of hashing) is that we now hash in the shader stage. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2017-05-03 11:25:46 -07:00
Jason Ekstrand	d6b8106eea	anv/pass: Store the per-subpass view mask Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2017-05-03 11:25:46 -07:00
Jason Ekstrand	e997f548de	anv: Add the KHX_multiview boilerplate Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2017-05-03 11:25:46 -07:00
Jason Ekstrand	0bed97006f	anv/nir: Delete the apply_dynamic_offsets prototype That pass hasn't existed since `dd4db84640` but the prototype stuck around for no reason. Reviewed-by: Elie Tournier <elie.tournier@collabora.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2017-05-03 11:25:46 -07:00
Samuel Iglesias Gonsálvez	f57e234fdd	i965/vec4: don't modify regioning parameters to the sources of DF align1 instructions The regioning parameters are now properly set by convert_to_hw_regs() and we don't need to fix them in the generator. That latter fix previously done in the generator was strictly speaking wrong for any non-identity regions. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Cc: "17.1" <mesa-stable@lists.freedesktop.org> Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2017-05-03 15:32:39 +02:00
Samuel Iglesias Gonsálvez	aaeb1c99be	i965/vec4: fix register width for DF VGRF and UNIFORM On gen7, the swizzles used in DF align16 instructions works for element size of 32 bits, so we can address only 2 consecutive DFs. As we assumed that in the rest of the code and prepare the instructions for this (scalarize_df()), we need to set it to two again. However, for DF align1 instructions, a width of 2 is wrong as we are not reading the data we want. For example, an uniform would have a region of <0, 2, 1> so it would repeat the first 2 DFs, when we wanted to access to the first 4. This patch sets the default one to 4 and then modifies the width of align16 instruction's DF sources when we translate the logical swizzle to the physical one. v2: - Remove conditional (Curro). Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Cc: "17.1" <mesa-stable@lists.freedesktop.org> Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2017-05-03 15:32:39 +02:00
Samuel Iglesias Gonsálvez	7f728bce81	i965/vec4: fix vertical stride to avoid breaking region parameter rule From IVB PRM, vol4, part3, "General Restrictions on Regioning Parameters": "If ExecSize = Width and HorzStride ≠ 0, VertStride must be set to Width * HorzStride." In next patch, we are going to modify the region parameter for uniforms and vgrf. For uniforms that are the source of DF align1 instructions, they will have <0, 4, 1> regioning and the execsize for those instructions will be 4, so they will break the regioning rule. This will be the same for VGRF sources where we use the vstride == 0 exploit. As we know we are not going to cross the GRF boundary with that execsize and parameters (not even with the exploit), we just fix the vstride here. v2: - Move is_align1_df() (Curro) - Refactor exec_size == width calculation (Curro) Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Cc: "17.1" <mesa-stable@lists.freedesktop.org> Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2017-05-03 15:32:39 +02:00
Jason Ekstrand	6ef1bd4fa5	anv/tests: Create a dummy instance as well as device This fixes crashes caused by `35e626bd0e` which made us start referencing the instance in the allocators. With this commit, the tests now happily pass again. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=100877 Tested-by: Vinson Lee <vlee@freedesktop.org>	2017-05-01 17:06:40 -07:00
Chad Versace	85ca563b58	anv: Drop 'x11' prefix from non-X11 WSI funcs Drop it from x11_anv_wsi_image_create and x11_anv_wsi_image_free. The functions are used by Wayland WSI too. Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2017-04-28 08:54:45 -07:00
Jason Ekstrand	ebd1bd6998	anv: Alphabetize KHR extensions Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>	2017-04-28 07:41:03 -07:00
Jason Ekstrand	032861693e	anv: Move queues, events, and semaphores to their own file Things are about to get more complicated, especially as far as semaphores are concerned. Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-04-27 20:08:46 -07:00
Jason Ekstrand	9bd1f03487	anv: Implement VK_KHX_external_memory_fd This commit just exposes the memory handle type. There's interesting we need to do here for images. So long as the user doesn't set any crazy environment variables such as INTEL_DEBUG=nohiz, all of the compression formats etc. should "just work" at least for opaque handle types. v2 (chadv): - Rebase. - Fix vkGetPhysicalDeviceImageFormatProperties2KHR when handleType == 0. - Move handleType-independency comments out of handleType-switch, in vkGetPhysicalDeviceExternalBufferPropertiesKHX. Reduces diff in future dma_buf patches. Co-authored-with: Chad Versace <chadversary@chromium.org> Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-04-27 20:08:46 -07:00
Jason Ekstrand	818b857914	anv: Use the BO cache for DeviceMemory allocations Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-04-27 20:08:46 -07:00
Jason Ekstrand	494d6f65a7	anv/allocator: Add a BO cache This cache allows us to easily ensure that we have a unique anv_bo for each gem handle. We'll need this in order to support multiple-import of memory objects and semaphores. v2 (Jason Ekstrand): - Reject BO imports if the size doesn't match the prime fd size as reported by lseek(). Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-04-27 20:08:46 -07:00
Jason Ekstrand	5d25ac6a4b	anv: Implement VK_KHX_external_memory This is the trivial implementation that just exposes the extension string but exposes zero external handle types. Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-04-27 20:08:46 -07:00
Chad Versace	354ca7a1d4	anv: Implement VK_KHX_external_memory_capabilities This is a complete but trivial implementation. It's trivial becasue We support no external memory capabilities yet. Most of the real work in this commit is in reworking the UUIDs advertised by the driver. v2 (chadv): - Fix chain traversal in vkGetPhysicalDeviceImageFormatProperties2KHR. Extract VkPhysicalDeviceExternalImageFormatInfoKHX from the chain of input structs, not the chain of output structs. - In vkGetPhysicalDeviceImageFormatProperties2KHR, iterate over the input chain and the output chain separately. Reduces diff in future dma_buf patches. Co-authored-with: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Chad Versace <chadversary@chromium.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-04-27 20:08:46 -07:00
Jason Ekstrand	d4d9258b61	anv/physical_device: Rename uuid to pipeline_cache_uuid We're about to have more UUIDs for different things so this one really needs to be properly labeled. Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-04-27 20:08:46 -07:00
Jason Ekstrand	02767cb4ff	anv: Refactor device_get_cache_uuid into physical_device_init_uuids Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-04-27 20:08:46 -07:00
Jason Ekstrand	35e626bd0e	anv: Set EXEC_OBJECT_ASYNC when available Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-04-27 20:08:46 -07:00
Jason Ekstrand	bd3a9813b9	anv/cmd_buffer: Use the device allocator for QueueSubmit The command is really operating on a Queue not a command buffer and the nearest object to that with an allocator is VkDevice. Reviewed-by: Chad Versace <chadversary@chromium.org> Cc: "17.0 17.1" <mesa-dev@lists.freedesktop.org>	2017-04-27 20:08:46 -07:00
Jason Ekstrand	c43b4bc85e	anv: Don't place scratch buffers above the 32-bit boundary This fixes rendering corruptions in DOOM. Hopefully, it will also make Jenkins a bit more stable as we've been seeing some random failures and GPU hangs ever since turning on 48bit. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=100620 Fixes: `651ec926fc` "anv: Add support for 48-bit addresses" Tested-by: Grazvydas Ignotas <notasas@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: "17.1" <mesa-stable@lists.freedesktop.org>	2017-04-27 02:04:57 -07:00
Rafael Antognolli	6a40ccec4b	genxml: Fix gen_pack_header.py crash when field type is invalid. Just return earlier in that case. Also set prefix to an empty string, so we don't get to use it undefined. Signed-off-by: Rafael Antognolli <rafael.antognolli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-04-24 15:14:12 -07:00
Rafael Antognolli	9670124e31	genxml: Make BLEND_STATE command support variable length array. We need to emit BLEND_STATE, which size is 1 + 2 * nr_draw_buffers dwords (on gen8+), but the BLEND_STATE struct length is always 17. By marking it size 1, which is actually the size of the struct minus the BLEND_STATE_ENTRY's, we can emit a BLEND_STATE of variable number of entries. For gen6 and gen7 we set length to 0, since it only contains BLEND_STATE_ENTRY's, and no other data. With this change, we also change the code for blorp and anv to emit only the needed BLEND_STATE_ENTRY's, instead of always emitting 16 dwords on gen6-7 and 17 dwords on gen8+. v2: - Use designated initializers on blorp and remove 0 from initialization (Jason) - Default entries to disabled on Vulkan (Jason) - Rebase code. Signed-off-by: Rafael Antognolli <rafael.antognolli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-04-24 15:14:10 -07:00
Rafael Antognolli	4ace73b1f6	genxml: Fix python crash when no dwords are found. If the 'dwords' dict is empty, max(dwords.keys()) throws an exception. This case could happen when we have an instruction that is only an array of other structs, with variable length. v2: - Add another clause for empty dwords and make it work with python 3 (Dylan) - Set the length to 0 if dwords is empty, and do not declare dw Signed-off-by: Rafael Antognolli <rafael.antognolli@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-04-24 15:14:08 -07:00
Rafael Antognolli	19720405d5	genxml: Remove unused parameter. 'start' parameter from Group.emit_pack_function() is useless. Signed-off-by: Rafael Antognolli <rafael.antognolli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-04-24 15:14:05 -07:00
Rafael Antognolli	1ea41163eb	intel/aubinator: Correctly read variable length structs. Before this commit, when a group with count="0" is found, only one field is added to the struct representing the instruction. This causes only one entry to be printed by aubinator, for variable length groups. With this commit we "detect" that there's a variable length group (count="0") and store the offset of the last entry added to the struct when reading the xml. When finally reading the aubdump file, we check the size of the group and whether we have variable number of elements, and in that case, reuse the last field to add the remaining elements. Signed-off-by: Rafael Antognolli <rafael.antognolli@intel.com> Tested-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2017-04-24 15:13:51 -07:00
Nanley Chery	50134cede1	isl/format: Update the R16G16B16X16_FLOAT entry The section of the PRM mentioned in the code comment above this table says that this format supports the render target write message. Internal documentation says that this format also supports alpha blending. As a side effect, this allows CCS_D buffers to be created for images with this format. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Signed-off-by: Nanley Chery <nanley.g.chery@intel.com>	2017-04-24 13:30:50 -07:00
Nanley Chery	b1066f7365	anv/pass: Delete anv_pass::subpass_attachments This field has no users. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Signed-off-by: Nanley Chery <nanley.g.chery@intel.com>	2017-04-24 13:30:50 -07:00
Francisco Jerez	58324389be	intel/fs: Take into account amount of data read in spilling cost heuristic. Until now the spilling cost calculation was neglecting the amount of data read from the register during the spilling cost calculation. This caused it to make suboptimal decisions in some cases leading to higher memory bandwidth usage than necessary. Improves Unigine Heaven performance by ~4% on BDW, reversing an unintended FPS regression from my previous commit `147e71242c` with n=12 and statistical significance 5%. In addition SynMark2 OglCSDof performance is improved by an additional ~5% on SKL, and a Kerbal Space Program apitrace around the Moho planet I can provide on request improves by ~20%. Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Plamena Manolova <plamena.manolova@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-04-24 11:01:40 -07:00

... 270 271 272 273 274 ...

15202 commits