fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-06-10 18:38:27 +02:00

Author	SHA1	Message	Date
Francisco Jerez	79d08ed3d2	anv: Fix uniform and storage buffer offset alignment limits. This fixes a regression in a bunch of image store vulkan CTS tests from commit `ad38ba1134`, which started using OWORD block read messages to implement UBO loads. The reason for the failure is that we were giving bogus buffer alignment limits to the application (1B), so the CTS would happily come back with descriptor sets pointing at not even word-aligned uniform buffer addresses. Surprisingly the sampler messages used to fetch pull constants before that commit were able to cope with the non-texel aligned addresses, but the dataport messages used to fetch pull constants after that commit and the ones used to access storage buffers (before and after the same commit) aren't as permissive with unaligned addresses. Cc: <mesa-stable@lists.freedesktop.org> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=99097 Reported-by: Mark Janes <mark.a.janes@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-12-16 14:12:54 -08:00
Kenneth Graunke	e0c1ec3b09	genxml: Make Gen8 3DSTATE_DS SIMD8 enable work like Gen9+. This will let us avoid ifdefs. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2016-12-14 14:59:06 -08:00
Kenneth Graunke	000b563a1b	genxml: Rename "DS Function Enable" to "Function Enable". This makes Gen7/7.5 match Gen8-9. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2016-12-14 14:59:06 -08:00
Chad Versace	72ffe8318d	anv: Reject VkMemoryAllocateInfo::allocationSize == 0 The Vulkan 1.0.33 spec says "allocationSize must be greater than 0". Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2016-12-14 12:04:58 -08:00
Grazvydas Ignotas	b58d1eecc6	intel/aubinator: fix 32bit shift overflow warning Doesn't look like this can work on 32bit, just rids of annoying warning. Signed-off-by: Grazvydas Ignotas <notasas@gmail.com> Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2016-12-11 20:04:15 +01:00
Grazvydas Ignotas	3a1b15c392	anv: fix release build unused variable warnings Signed-off-by: Grazvydas Ignotas <notasas@gmail.com> Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2016-12-11 20:03:14 +01:00
Edward O'Callaghan	efe9d1cde3	anv: Clean up some unused variables Following on from the spirit of commit `011e5570f`. Signed-off-by: Edward O'Callaghan <funfunctor@folklore1984.net> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-12-10 11:59:59 +11:00
Jordan Justen	d6526d7247	intel/blorp_blit: Add split_blorp_blit_debug switch Enabling this debug switch causes surface shrinking to happen by default, and lowers the surface size limit which causes blorp blits to be split. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-12-07 09:00:49 -08:00
Jordan Justen	da381ae647	intel/blorp_blit: Enable splitting large blorp blits Detect when the surface sizes are too large for a blorp blit. When it is too large, the blorp blit will be split into a smaller operation and attempted again. For gen7, this fixes the cts test: ES3-CTS.gtf.GL3Tests.framebuffer_blit.framebuffer_blit_functionality_multisampled_to_singlesampled_blit It will also enable us to increase our renderable size from 8k x 8k to 16k x 16k. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-12-07 09:00:49 -08:00
Jordan Justen	efea8e7244	intel/blorp_blit: Move RGB=>R conversion to follow blit splitting In blorp_copy, when RGB surfaces are copied, we convert the destination surface to a Red only surface, but 3 times as wide. This introduces an implicit restriction of "mod 3" for the destination width. It is easier to handle the blorp split buffer offsetting with the original RGB surface, and do the RGB=>R after this. Suggested-by: Jason Ekstrand <jason@jlekstrand.net> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-12-07 09:00:49 -08:00
Jordan Justen	edf3113aed	intel/blorp_blit: Adjust blorp surface parameters for split blits If try_blorp_blit() previously returned that a blit was too large, shrink_surface_params() will be used to update the surface parameters for the smaller blit so the blit operation can proceed. v2: * Use double instead of float. (Jason) Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-12-07 09:00:49 -08:00
Jordan Justen	12e0a6e259	intel/blorp_blit: Split blorp blits if they are too large We rename do_blorp_blit() to try_blorp_blit(), and add a return error if the surface size for the blit is too large. Now, do_blorp_blit() is rewritten to try to split the blit into smaller operations if try_blorp_blit() fails. Note: In this commit, try_blorp_blit() will always attempt to blit and never return an error, which matches the previous behavior. We will enable the size checking and splitting in a future commit. The motivation for this splitting is that in some cases when we flatten an image, it's dimensions grow, and this can then exceed the programmable hardware limits. An example is w-tiled+MSAA blits. v2: * Use double instead of float. (Jason) Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-12-07 09:00:49 -08:00
Jordan Justen	b74d4f6ca0	intel/blorp_blit: Create structure for src & dst coordinates This will be useful for splitting blits into smaller sizes. We also make the coordinates of type double rather than float. Since we will be splitting and scaling the coordinates, we might require extra precision in the calculations. v2: * Use double instead of float. (Jason) Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-12-07 09:00:49 -08:00
Nanley Chery	72db1570b4	anv/TODO: Document sampling from HiZ Acked-by: Jason Ekstrand <jason@jlekstrand.net>	2016-12-06 14:51:30 -08:00
Jason Ekstrand	eb7b51d62a	genxml/gen9: Change the default of MI_SEMAPHORE_WAIT::RegisterPoleMode We would really like it to be false as that's what you get on hardware that doesn't have RegisterPoleMode (Sky Lake for example). While we're at it, we change it to a boolean. This fixes dEQP-VK.synchronization.smoke.events on Broxton. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: "13.0" <mesa-stable@lists.freedesktop.org>	2016-12-06 11:35:13 -08:00
Jason Ekstrand	c5d664f9dc	anv/pipeline: Call nir_lower_constant_initializers Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2016-12-05 15:40:09 -08:00
Jason Ekstrand	0291bf4db2	Revert "i965: use nir_lower_indirect_derefs() for GLSL" This reverts commit `9404439a75`. I didn't intend to push it and it breaks clip and cull distance.	2016-12-05 15:21:20 -08:00
Timothy Arceri	9404439a75	i965: use nir_lower_indirect_derefs() for GLSL This moves the nir_lower_indirect_derefs() call into brw_preprocess_nir() so thats is called by both OpenGL and Vulkan and removes that call to the old GLSL IR pass lower_variable_index_to_cond_assign() We want to do this pass in nir to be able to move loop unrolling to nir. There is a increase of 1-3 instructions in a small number of shaders, and 2 Kerbal Space program shaders that increase by 32 instructions. Shader-db results BDW: total instructions in shared programs: 8705873 -> 8706194 (0.00%) instructions in affected programs: 32515 -> 32836 (0.99%) helped: 3 HURT: 79 total cycles in shared programs: 74618120 -> 74583476 (-0.05%) cycles in affected programs: 528104 -> 493460 (-6.56%) helped: 47 HURT: 37 LOST: 2 GAINED: 0	2016-12-05 14:00:35 -08:00
Ilia Mirkin	fda1d0187d	anv: expose support for VK_KHR_sampler_mirror_clamp_to_edge This is already supported in genX_state.c, expose the extension string. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2016-11-30 20:49:04 -05:00
Jason Ekstrand	27433b26b1	anv/cmd_buffer: Actually use the stencil dimension In an attempt to fix 3DSTATE_DEPTH_BUFFER for stencil-only cases, I accidentally kept setting the SurfaceType to 2D in the stencil-only case thanks to a copy+paste error. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2016-11-30 17:42:42 -08:00
Ville Syrjälä	676c0cf287	anv: Prefer in-tree headers to out-of-tree headers Set the include paths to consider in-tree headers before out-of-tree headers. Avoids the build failing due to stale headers being present in $prefix. Previosuly 'make -ki install' or something similar was required to update the out-of-tree headers to allow the build to succeed. Also avoids having to rebuild the entire thing after every 'make install'. Cc: Rob Clark <robdclark@gmail.com> Cc: Jason Ekstrand <jason.ekstrand@intel.com> Signed-off-by: Ville Syrjälä <ville.syrjala@linux.intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Chad Versace <chadversary@chromium.org>	2016-11-30 20:01:00 +02:00
Kristian H. Kristensen	d3d7cab812	aubinator: Add support for enum types Signed-off-by: Kristian H. Kristensen <hoegsberg@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-29 22:02:49 -08:00
Kristian H. Kristensen	7fc659d8d5	intel/genxml: Fix ksp for INTERFACE_DESCRIPTOR_DATA This one was split across two dwords as "Kernel Start Pointer" and "Kernel Start Pointer High", which looks like it works when the driver only accesses "Kernel Start Pointer". This breaks, of course, with BO offsets > 4G. Signed-off-by: Kristian H. Kristensen <hoegsberg@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-29 22:02:49 -08:00
Kristian H. Kristensen	99e573b4e0	intel/genxml: Use enum 3D_Logic_Op_Function where applicable Signed-off-by: Kristian H. Kristensen <hoegsberg@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-29 22:02:49 -08:00
Kristian H. Kristensen	374d19ac00	intel/genxml: Use blend function and factor enums where applicable Signed-off-by: Kristian H. Kristensen <hoegsberg@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-29 22:02:49 -08:00
Kristian H. Kristensen	09fe8ad010	intel/genxml: Use enum 3D_Vertex_Component_Control where applicable Signed-off-by: Kristian H. Kristensen <hoegsberg@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-29 22:02:49 -08:00
Kristian H. Kristensen	54e71e5851	intel/genxml: Use enum 3D_Stencil_Operation where applicable Signed-off-by: Kristian H. Kristensen <hoegsberg@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-29 22:02:49 -08:00
Kristian H. Kristensen	193c1b72e0	intel/genxml: Use enum SURFACE_FORMAT where applicable Signed-off-by: Kristian H. Kristensen <hoegsberg@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-29 22:02:49 -08:00
Kristian H. Kristensen	0799022bf9	intel/genxml: Use enum 3D_Prim_Topo_Type where applicable Signed-off-by: Kristian H. Kristensen <hoegsberg@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-29 22:02:49 -08:00
Kristian H. Kristensen	993babc014	intel/genxml: Use 3D_Compare_Function for gen8+ test functions When the state fields where shuffled around for gen8, the compare function enums were downgraded to just uints. Change them to enum 3D_Compare_Function. Signed-off-by: Kristian H. Kristensen <hoegsberg@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-29 22:02:49 -08:00
Kristian H. Kristensen	fc2225b1af	intel/genxml: Emit genxml enums as C enums The previous commits got rid of any clashes between #defines and enum values and we can now emit the genxml enums as debugger friendly C enums. Signed-off-by: Kristian H. Kristensen <hoegsberg@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-29 22:02:49 -08:00
Kristian H. Kristensen	8fc74b879e	intel/genxml: Remove duplicate COMPAREFUNCTION values These values were defined both as an enum and as inline values. Remove the inline values and reference the 3D_Compare_Function enum instead. Signed-off-by: Kristian H. Kristensen <hoegsberg@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-29 22:02:49 -08:00
Kristian H. Kristensen	5814fc1bb7	intel/genxml: Allow referencing enums in type attributes This lets us reference enums in the type attribute of a field. Signed-off-by: Kristian H. Kristensen <hoegsberg@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-29 22:02:49 -08:00
Kristian H. Kristensen	3b6b6f6463	anv: Emit cherryview SF state without including gen9_pack.h Cleaner this way and we avoid including gen9_pack.h when we compile with gen8_pack.h. We also avoid the if (cherryview) condition for non-gen8 gens that don't need it. Signed-off-by: Kristian H. Kristensen <hoegsberg@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-29 22:02:49 -08:00
Kristian H. Kristensen	908febcf21	anv: Don't include two different pack headers The batch chain logic only needs the pre-gen8 size of MI_BATCH_BUFFER_START, which seems like something we can make a special case for. The other two gen7 references, MI_BATCH_BUFFER_END and MI_NOOP, are the same on all gens. Signed-off-by: Kristian H. Kristensen <hoegsberg@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-29 22:02:49 -08:00
Kristian H. Kristensen	be9c2ab23b	intel/genxml: Move enums above structs We'll need to define them before we can reference them in structs and instructions. Enums have no dependencies, so move them first in the file. Signed-off-by: Kristian H. Kristensen <hoegsberg@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-29 22:02:49 -08:00
Kristian H. Kristensen	ce26486115	genxml: Add values for Barycentric Interpolation Mode Signed-off-by: Kristian H. Kristensen <hoegsberg@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-29 22:02:49 -08:00
Ilia Mirkin	ed0b3cbd09	anv: remove per-sample shading from TODO This was done some time ago. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2016-11-30 00:17:56 -05:00
Ilia Mirkin	be92b3f49d	anv: clean up VkPhysicalDeviceFeatures list Remove duplicate .alphaToOne, add missing .shaderResourceMinLod, and reorder a few entries to match their vulkan.h order. All the sparse features are still left out entirely. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2016-11-30 00:17:56 -05:00
Ilia Mirkin	7a8def8c18	anv: bump the texture gather offset limits This matches what NVIDIA and AMD hardware expose, as well as what Intel hardware supports. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-29 07:44:01 -08:00
Jason Ekstrand	f469235a6e	anv/cmd_buffer: Remove the 1-D case from the HiZ QPitch calculation The 1-D special case doesn't actually apply to depth or HiZ. I discovered this while converting BLORP over to genxml and ISL. The reason is that the 1-D special case only applies to the new Sky Lake 1-D layout which is only used for LINEAR 1-D images. For tiled 1-D images, such as depth buffers, the old gen4 2-D layout is used and the QPitch should be in rows. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Cc: "13.0" <mesa-stable@lists.freedesktop.org>	2016-11-28 20:17:29 -08:00
Jason Ekstrand	d4ef87c1bb	anv/cmd_buffer: Set the correct surface type for depth/stencil Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2016-11-28 20:17:16 -08:00
Ilia Mirkin	e6847f24f0	anv: enable drawIndirectFirstInstance This was already piped through in the CmdDraw(Indexed)Indirect handling. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-28 19:32:14 -08:00
Ilia Mirkin	d2280a007a	anv: expose depthBiasClamp, it is already set The gen7/8_cmd_buffer logic already sets the clamp, and it's piped through via the dynamic state. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-11-28 19:32:14 -08:00
Ilia Mirkin	e2c669a56b	anv: bump maxFramebufferLayers to 2048 This matches maxImageArrayLayers, as well as the same setting in the GL frontend. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-11-28 19:32:14 -08:00
Ilia Mirkin	76b97d544e	anv: enable storage image extended formats These are all regularly available in desktop GL, so the backend fully supports them. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-28 19:32:14 -08:00
Ilia Mirkin	a34f89c5e6	anv: expose imageCubeArray functionality This appears to be fully supported already. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-28 19:32:13 -08:00
Dave Airlie	eaf0768b8f	radv: set maxFragmentDualSrcAttachments to 1 Reported-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "13.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-11-29 13:27:26 +10:00
Jason Ekstrand	6bc8bef1a1	intel/aubinator: Pull useful information from the AUB header This commit does two things. One is to pull useful and/or interesting information from the AUB file header and display it as a header above your decoded batches. Second, it is now capable of pulling the PCI ID from the AUB file comment left by intel_aubdump. This removes the need to use the --gen flag all the time. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2016-11-28 16:45:09 -08:00
Jason Ekstrand	da5ebeffdf	intel/aubinator: Wait to setup decoders until we parse the aub header This requires that a few more state bits become global. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2016-11-28 16:45:09 -08:00

1 2 3 4 5 ...

1132 commits