fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-01 14:38:06 +02:00

Author	SHA1	Message	Date
Jason Ekstrand	f7736ccf53	anv: Add valid_bufer_usage to the memory type metadata Instead of returning valid types as just a number, we now walk the list and check the buffer's usage against the usage flags we store in the new anv_memory_type structure. Currently, valid_buffer_usage == ~0. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Cc: "17.1" <mesa-stable@lists.freedesktop.org>	2017-05-23 16:46:34 -07:00
Jason Ekstrand	92325a7efc	anv: Determine the type of mapping based on type metadata Before, we were just comparing the type index to 0. Now we actually look the type up in the table and check its properties to determine what kind of mapping we want to do. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Cc: "17.1" <mesa-stable@lists.freedesktop.org>	2017-05-23 16:46:32 -07:00
Jason Ekstrand	c1f4343807	anv: Set up memory types and heaps during physical device init Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Cc: "17.1" <mesa-stable@lists.freedesktop.org>	2017-05-23 16:46:30 -07:00
Jason Ekstrand	eceaf7e234	anv: Predicate 48bit support on gen >= 8 This doesn't matter right now since it only affects whether or not we set the kernel bit but, if we ever do anything else based on it, we'll want it to be correct per-gen. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Cc: "17.1" <mesa-stable@lists.freedesktop.org>	2017-05-23 16:46:27 -07:00
Jason Ekstrand	4eecd534f0	anv/image: Get rid of the memset(aux, 0, sizeof(aux)) hack Up until now, we've been memsetting the auxiliary surface to 0 at BindImageMemory time to ensure that it is properly initialized. However, this isn't correct because apps are allowed to freely alias memory between different images and buffers so long as they properly track whether or not a particular image is valid and, if it isn't, transition from UNINITIALIZED to something else before using it. We now implement those transitions so we can drop the hack. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Cc: "17.1" <mesa-stable@lists.freedesktop.org>	2017-05-23 16:46:22 -07:00
Jason Ekstrand	cc45c4bb80	anv: Handle transitioning depth from UNDEFINED to other layouts Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Cc: "17.1" <mesa-stable@lists.freedesktop.org>	2017-05-23 16:46:20 -07:00
Jason Ekstrand	75edecf502	anv: Handle color layout transitions from the UNINITIALIZED layout This causes dEQP-VK.api.copy_and_blit.resolve_image.partial.* to start failing due to test bugs. See CL 1031 for a test fix. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Cc: "17.1" <mesa-stable@lists.freedesktop.org>	2017-05-23 16:46:03 -07:00
Axel Davy	7e04ae74d4	st/nine: Fix a regression and syntax cleanup A few cleanups and in particular initializing properly the new pipe_draw_info fields. This should fix the regression caused by `330d0607ed` Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=101088 Signed-off-by: Axel Davy <axel.davy@ens.fr> Tested-by: Edmondo Tommasina <edmondo.tommasina@gmail.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2017-05-24 00:40:43 +02:00
Ian Romanick	7009955281	mesa: Remove GL_APPLE_vertex_array_object stubs Mark the functions 'exec="skip"' in the XML instead. libGL will still have the functions, but the driver won't try to use them. I verified that this commit works with piglit's 'object-namespace-pollution glClear vertex-array' on x64 with a driver built from mesa-12.0.3 tag. In fairness, this test also works with a libGL built from `7927d03`. I believe it continues to work because on non-Windows platforms we generate some extra, dummy dispatch functions that can be used when a driver requests a function unknown to libGL. This was done to provide some "forward" compatibility with drivers that need more functions. This doesn't work on Windows because the Windows calling convention is for the callee to clean up the stack. That's the theory anyway. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-05-23 15:02:29 -07:00
Marek Olšák	0781b58b3a	gallium/radeon: pipe AMDGPU_INFO_NUM_VRAM_CPU_PAGE_FAULTS into gallium HUD Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2017-05-23 23:29:16 +02:00
Rob Clark	1db28fbbea	freedreno/ir3: switch to NIR by default Now that we lower vars to regs, we no longer regress for anything that does complex dereferences. (With tgsi, derefers are already lowered before tgsi_to_nir, but not with glsl_to_nir.) In fact it actually fixes a few things to bypass tgsi. So make NIR the default (finally!) Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-05-23 12:26:35 -04:00
Rob Clark	caa64b24ce	freedreno/ir3: lower arrays to regs Instead of using load/store_var intrinsics, which can have complex derefs in the case of multi-dimensional arrays, lower these to regs and handle the direct/indirect loads in get_src() and stores in put_dst(). This should let us switch to using nir by default. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-05-23 12:26:35 -04:00
Rob Clark	232fc99544	freedreno/ir3: add put_dst() Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-05-23 12:26:35 -04:00
Rob Clark	2bbd425adb	freedreno/ir3: code-motion Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-05-23 12:26:35 -04:00
Rob Clark	90dade300f	freedreno/ir3: fix cmdline compiler standalone_compiler_cleanup() frees the glsl types, among other things, so it needs to come after nir->ir3. But since we exit after dumping the disassembly, it is easier to just not call it at all. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-05-23 12:26:35 -04:00
Rob Clark	1059dc9165	freedreno/ir3: add missing nir_opt_copy_prop_vars() pass Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-05-23 12:26:35 -04:00
Rob Clark	c712a637b9	freedreno/ir3: need different compiler options for a5xx vertex_id_zero_based differs.. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-05-23 12:26:35 -04:00
Rob Clark	4531e67c47	freedreno/a5xx: remove copapasta from a4xx Won't ever hit this w/ a420 gpu, so this is dead code. Need to get astc working to know whether to rip this out entirely or not. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-05-23 12:26:35 -04:00
Rob Clark	0c2e0f15b8	freedreno: only support SSBOs with nir tgsi_to_nir does not support them. Note that compute shaders already force nir. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-05-23 12:26:35 -04:00
Rob Clark	444b4b40f9	freedreno/a5xx: add some missing texture formats Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-05-23 12:26:35 -04:00
Rob Clark	6ccbbd8d05	freedreno/a5xx: provoking vertex Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-05-23 12:26:35 -04:00
Rob Clark	d7f296de26	freedreno: update generated headers Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-05-23 12:26:35 -04:00
Rob Clark	6f65a1a211	nir/lower-atomics-to-ssbo: remove atomic_uint arrays too Maybe there is a better way to do this. But by the time we get to assigning uniform locs, we want the atomic_uint's to all be gone, otherwise we assert in st_glsl_attrib_type_size(). Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-05-23 12:26:34 -04:00
Rob Clark	5f6c034f82	nir/lower-atomics-to-ssbo: fix num_components Fixes some piglits like arb_shader_atomic_counters-active-counters Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-05-23 12:26:34 -04:00
Timothy Arceri	a363fa0c99	radeon: pass flags that can change shaders to disk_cache_create() I wasn't sure if I should filter the flags so that we only use flags that actually change the shader output. To avoid manual updates we just pass in everything for now. Reviewed-by: Eduardo Lima Mitev <elima@igalia.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-05-23 09:09:43 +10:00
Timothy Arceri	0bbcfbfc0b	util/disk_cache: add new driver_flags param to cache keys This will be used for things such as adding driver specific environment variables to the key. Allowing us to set environment vars that change the shader and not have the driver ignore them if it finds existing shaders in the cache. Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2017-05-23 09:09:43 +10:00
Jose Fonseca	d970f773f4	u_format_test: Ignore S3TC errors. This prevents spurious failures when libtxc-dxtn-s2tc is installed. Note: lp_test_format doesn't need any change since we were already ignoring S3TC failures there. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Rhys Kidd <rhyskidd@gmail.com>	2017-05-22 21:00:06 +01:00
Nanley Chery	d132bb36ce	docs: Document ASTC extension support for SKL and BXT v2: Remove the '+' after bxt Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Signed-off-by: Nanley Chery <nanley.g.chery@intel.com>	2017-05-22 11:13:53 -07:00
Nanley Chery	d6150bd764	i965: Enable ASTC HDR for Broxton This platform passes the following GLES3 tests: ES3-CTS.functional.texture.compressed.astc.endpoint_value_hdr_cem_* Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Signed-off-by: Nanley Chery <nanley.g.chery@intel.com>	2017-05-22 11:13:53 -07:00
Nanley Chery	52a6fd9871	intel/isl: Add ASTC HDR to format lists and helpers Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Signed-off-by: Nanley Chery <nanley.g.chery@intel.com>	2017-05-22 11:13:53 -07:00
Bas Nieuwenhuizen	b2c5e69942	radv: Add compute HTILE fast clear. Not really what the fast depth clear does, no matter whether you use EXPCLEAR or not. Seems the fast clear using the DB HW always touches the main buffer. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-05-22 20:07:21 +02:00
Bas Nieuwenhuizen	df91abfe5a	radv: Use correct clear words for HTILE. Did some RE'ing what several HTILE words give when read from a descriptor with HTILE compression enabled. Seems to align with -pro usage for D16 too. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-05-22 20:07:21 +02:00
Bas Nieuwenhuizen	0b26f0ee4f	radv: Add queue masks for htile usage determination. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-05-22 20:07:21 +02:00
Bas Nieuwenhuizen	0628580eff	radv: Specify semantics of HTILE layout helpers. And correct implementation to specify only what we support. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-05-22 20:07:21 +02:00
Bas Nieuwenhuizen	62e182acd0	radv: Don't use a separate can_expclear. We never use EXPCLEAR clears. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-05-22 20:07:21 +02:00
Ian Romanick	7174e3f22b	mesa: GL_ARB_shader_subroutine is not optional in core profile text data bss dec hex filename 7038459 235248 37280 7310987 6f8e8b 32-bit i965_dri.so before 7038227 235248 37280 7310755 6f8da3 32-bit i965_dri.so after 6681438 303400 50608 7035446 6b5a36 64-bit i965_dri.so before 6681254 303400 50608 7035262 6b597e 64-bit i965_dri.so after Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2017-05-22 10:51:26 -07:00
Benedikt Schemmer	b026f45bdd	drirc: Add allow_glsl_builtin_variable_redeclaration for Dead Island Riptide Definitive Edition Signed-off-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2017-05-22 19:32:07 +02:00
Marek Olšák	8c069a6a06	gallium/radeon: add a query for monitoring Gallium thread load Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2017-05-22 19:23:39 +02:00
Marek Olšák	2beb31bd7c	radeonsi/gfx9: compile shaders with +xnack so that LLVM doesn't allocate SGPRs where XNACK is. Cc: 17.1 <mesa-stable@lists.freedesktop.org> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-05-22 19:23:39 +02:00
Rhys Kidd	499f45163a	vc4: Remove dead code in vc4_dump_surface_msaa() Coverity caught the use of dead code copy-paste for found_colors[] and num_found_colors. CID: 1341850 Signed-off-by: Rhys Kidd <rhyskidd@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2017-05-22 09:50:22 -07:00
Lionel Landwerlin	30dc56bb5b	egl/wayland: verify event queue was allocated We're already verified that 'window' wasn't NULL, I'm guessing this allocation error is about the newly created queue. CID: 1409754 Fixes: `03dd9a88b0` ("egl/wayland: Use per-surface event queues") Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Daniel Stone <daniels@collabora.com>	2017-05-22 15:44:38 +01:00
Timothy Arceri	4eb0411ed7	mesa: add APPLE_vertex_array_object stubs APPLE_vertex_array_object support was removed in `7927d0378f`. However it turns out we can't remove the functions because this can cause issues when libglapi is used together with DRI drivers built prior to said commit Fixes: `7927d0378f` ("mesa: drop APPLE_vertex_array_object support") Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2017-05-22 14:56:51 +10:00
Timothy Arceri	3ceae88642	glsl: set mask via initialisation list rather than in constructor body Potentially more efficient as it may avoid the struct being initialised twice. Also add var to the initialisation list while we are here. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2017-05-22 14:21:55 +10:00
Vladislav Egorov	cf164d9e97	ralloc: Use strnlen() inside of strncat() If the str is long or isn't null-terminated, strlen() could take a lot of time or even crash. I don't know why was it used in the first place, maybe for platforms without strnlen(), but strnlen() is already used inside of ralloc_strndup(), so this change should not additionally break anything. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-05-22 12:34:28 +10:00
Vladislav Egorov	4a47247523	glcpp: Skip unnecessary line continuations removal Overwhelming majority of shaders don't use line continuations. In my shader-db only shaders from the Talos Principle and Serious Sam used them, less than 1% out of all shaders. Optimize for this case, don't do any copying if no line continuation was found. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-05-22 12:34:28 +10:00
Vladislav Egorov	b8e792ee25	glcpp: Avoid unnecessary strcmp() strcmp() is slow. Initiate comparison with "__LINE__" or "__FILE__" only if the identifier starts with '_', which is rare. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-05-22 12:34:28 +10:00
Thomas Helland	1575a8146a	main: Move hashLockMutex/hashUnlockMutex to header and inline Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-05-22 09:19:24 +10:00
Thomas Helland	f203a9f7d1	main: Use _mesa_HashLock/UnlockMutex consistently This is shorter and easier on the eyes. At the same time this also ensures that we are always asserting that the table pointer is not NULL. Currently that was not done for all situations. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-05-22 09:17:37 +10:00
Thomas Helland	90dfcc6b32	util: Change the pointer hashing function Use our knowledge that pointers are at least 4 byte aligned to remove the useless digits. Then shift by 6, 10, and 14 bits and add this to the original pointer, effectively folding in the entropy of the higher bits of the pointer into a 4-bit section. Stopping at 14 means we can add the entropy from 18 bits, or at least a 600Kbyte section of memory. Assuming that ralloc allocates from a linearly allocated heap less than this we can make a very efficient pointer hashing function for our usecase. Even if we are not on an architecture that is 4 byte aligned, there is still a high big chance that the thing we are allocating is at least 8 bytes in size, so even then we will have entropy into the third bit. The 4 bit increment on the shifts is chosen rather arbitrarily; if we had chosen a 3 bit increment we would need to add another xor to cover a decently sized memorypool. Increasing it to 5 bits would spread our entropy more, possibly hurting us with more collisions on hash tables of size less than 32. With a hash table of size 16 there are a max of 11 entries, and we can assume that with such a small table collisions are not that painfull. This allows us to hash the whole 32 or 64 bit pointer at once, instead of running FNV1a, looping through each byte and doing increments, decrements, muls, and xors on every byte. This cuts _mesa_hash_data from 1.5 % on profiles, to making _mesa_hash_pointer show up with a 0.09% share. Collisions on insertion actually seems to be ever so slightly lower with this hash function, as found by printing a loop counter and sorting the data. perf stat shows a 1.5% reduction in instruction count, and a 5% reduction in stalled cycles. Shader-db runtime goes from 225 to 220 seconds. No instruction-count changes in shader-db, but there are some minor changes in cycle-count that is likely caused by nir walking a set in some of its passes, and this causing a different ordering. That might eventually lead to a difference in register allocation. However, the effect is a net positive; total cycles in shared programs: 24739550 -> 24738482 (-0.00%) cycles in affected programs: 374468 -> 373400 (-0.29%) helped: 178 HURT: 49 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2017-05-22 09:17:37 +10:00
Philipp Zabel	1586768e74	vulkan/wsi/wayland: Fix proxy wrappers for swapchain recreation Before the swapchain event queue is destroyed, all proxy objects that reference it must be dropped. Otherwise we risk a use-after-free if a frame callback event or buffer release events are received afterwards. This happens when an application destroys and recreates a swapchain in FIFO mode between two frames without using the VkSwapchainCreateInfoKHR::oldSwapchain mechanism to keep the old swapchain until after the next redraw. Fixes: `5034c61558` ("vulkan/wsi/wayland: Use proxy wrappers for swapchain") Signed-off-by: Philipp Zabel <philipp.zabel@gmail.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Cc: mesa-stable@lists.freedesktop.org	2017-05-20 17:00:08 +01:00

1 2 3 4 5 ...

92248 commits