fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-01-16 02:50:19 +01:00

Author	SHA1	Message	Date
Gurchetan Singh	bf0ca99ec7	virgl: access caps in a less verbose way in virgl_is_format_supported Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com>	2019-08-08 16:21:57 -07:00
Alyssa Rosenzweig	5a898e2a65	pan/midgard: Disassemble load/store barrel shift Arm assembly intensifies. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-08 15:49:12 -07:00
Eric Engestrom	525a917c6c	util/anon_file: const string param Fixes: `c0376a1234` ("util: add anon_file.h for all memfd/temp file usage") Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Tested-by: Eric Anholt <eric@anholt.net> Tested-by: Andreas Baierl <ichgeh@imkreisrum.de>	2019-08-08 22:02:54 +01:00
Eric Engestrom	8a028b0df2	util/anon_file: drop unused #include Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Tested-by: Eric Anholt <eric@anholt.net> Tested-by: Andreas Baierl <ichgeh@imkreisrum.de>	2019-08-08 22:02:54 +01:00
Eric Engestrom	60af7f5a81	util/anon_file: add missing #include Fixes: `c0376a1234` ("util: add anon_file.h for all memfd/temp file usage") Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Tested-by: Eric Anholt <eric@anholt.net> Tested-by: Andreas Baierl <ichgeh@imkreisrum.de>	2019-08-08 22:02:54 +01:00
Greg V	ac1561088d	intel/perf: use MAJOR_IN_SYSMACROS/MAJOR_IN_MKDEV Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Fixes: `134e750e16` ("i965: extract performance query metrics")	2019-08-08 21:44:33 +01:00
Greg V	0233372581	util: fix cpuset support on FreeBSD Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-08-08 21:44:33 +01:00
Greg V	c00ee00031	i965/tiled_memcpy: avoid creating bswap32 if it exists as a macro (e.g. on FreeBSD) Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-08-08 21:44:33 +01:00
Greg V	7b520dc74f	anv: add MAP_POPULATE fallback define for portability FreeBSD does not have MAP_POPULATE Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-08-08 21:44:33 +01:00
Greg V	2be3f16600	anv: remove unused Linux-specific include Fixes: `4201cc2dd3` ("anv: Implement VK_KHX_external_semaphore_fd") Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-08-08 21:44:33 +01:00
Greg V	c0dc5c1859	meson: define ETIME to ETIMEDOUT if not present Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-08-08 21:44:33 +01:00
Roman Stratiienko	28061e0ab0	lima: Fix Android.mk 1. Update LOCAL_SRC_FILES according to commit `54434fe670` ("lima/gpir: Rework the scheduler"). 2. Add libpanfrost_shared.a dependency. 3. Generate lima_nir_algebraic.c with Android.mk Fixes Android build error introduced by commit `5adfc8602c` ("lima/ppir: move sin/cos input scaling into NIR") Signed-off-by: Roman Stratiienko <roman.stratiienko@globallogic.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Acked-by: Qiang Yu <yuq825@gmail.com>	2019-08-08 17:47:22 +00:00
Roman Stratiienko	26a01a6797	Add libpanfrost_shared to Android build 1. Add missing directory to ./Android.mk 2. Fix ./src/panfrost/Android.shared.mk Signed-off-by: Roman Stratiienko <roman.stratiienko@globallogic.com> Reviewed-by: Icenowy Zheng <icenowy@aosc.io> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Acked-by: Qiang Yu <yuq825@gmail.com>	2019-08-08 17:47:22 +00:00
Rhys Perry	c52c54a746	anv,i965,iris: deduplicate setting of total_shared v5: add patch Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-08-08 12:10:39 -05:00
Rhys Perry	024a46a407	anv: use derefs for shared memory access vkpipeline-db for my Skylake GPU: total instructions in shared programs: 8847602 -> 8847896 (<.01%) instructions in affected programs: 10165 -> 10459 (2.89%) helped: 8 HURT: 2 total cycles in shared programs: 1606273555 -> 1606251634 (<.01%) cycles in affected programs: 2201803 -> 2179882 (-1.00%) helped: 7 HURT: 3 The shaders with more instructions is due to a loop over a shared array in Three Kingdoms being unrolled (and creating a lot of nested ifs). Not sure if that's good or bad. One of the shaders with worse cycles is only worse by 0.04% and the other two are the shaders with loops unrolled. v2: add patch v4: don't set spirv_options.shared_addr_format v4: move comment concerning the shared address format used and NULL v4: add vkpipeline-db results v5: rename to nir_lower_vars_to_explicit_types v5: move setting of total_shared to outside brw_compile_cs v6: set shared_addr_format v6: formatting changes Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> (v5) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-08-08 12:10:39 -05:00
Rhys Perry	fd73ed1bd7	nir: add nir_lower_to_explicit() v2: use glsl_type_size_align_func v2: move get_explicit_type() to glsl_types.cpp/nir_types.cpp v2: use align() instead of util_align_npot() v2: pack arrays a bit tighter v2: rename mem_* to field_* v2: don't attempt to handle when struct offsets are already set v2: use column_type() instead of recreating it v2: use a branch instead of \|= in nir_lower_to_explicit_impl() v2: assign locations to variables and update shared_size and num_shared v2: allow the pass to be used with nir_var_{shader_temp,function_temp} v4: rebase v5: add TODO v5: small formatting changes v5: remove incorrect assert in get_explicit_type() v5: rename to nir_lower_vars_to_explicit_types v5: correctly update progress when only variables are updated v5: rename get_explicit_type() to get_explicit_shared_type() v5: add comment explaining how get_explicit_shared_type() is different v5: update cast strides v6: update progress when lowering nir_var_function_temp variables v6: formatting changes v6: add more detailed documentation comment for get_explicit_shared_type v6: rename get_explicit_shared_type to get_explicit_type_for_size_align v7: fix comment in nir_lower_vars_to_explicit_types_impl() Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> (v5) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-08-08 12:10:39 -05:00
Rhys Perry	8bd2e138f5	nir/lower_explicit_io: add nir_var_mem_shared support v2: require nir_address_format_32bit_offset instead v3: don't call nir_intrinsic_set_access() for shared atomics Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-08-08 12:10:39 -05:00
Erik Faye-Lund	1e21bb4123	mesa: avoid warning on Windows On Windows, p_atomic_inc_return returns an unsigned long long rather than the type the pointer refers to, so let's make sure we cast the result to the right type. Otherwise, we'll trigger a warning about the wrong format-string for the type. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Acked-by: Eric Engestrom <eric@engestrom.ch>	2019-08-08 18:20:29 +02:00
Erik Faye-Lund	e0a740c633	mesa/main: cast away constness This avoids a warning about implicitly casting away the constness of the pointer. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Acked-by: Eric Engestrom <eric@engestrom.ch>	2019-08-08 18:20:29 +02:00
Erik Faye-Lund	75097114d9	spirv: fixup signature This avoids a warning on some compiler, complaining about implicitly casting the function-pointer. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Fixes: `d482a8f` "spirv: Update the OpenCL.std.h header" Acked-by: Eric Engestrom <eric@engestrom.ch>	2019-08-08 18:20:29 +02:00
Lucas Stach	68c24b09c2	etnaviv: remember data offset into BO Imported resources might not start at offset 0 into the buffer object. Make sure to remember the offset that is provided with the handle on import. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Philipp Zabel <p.zabel@pengutronix.de> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com>	2019-08-08 16:11:34 +02:00
Danylo Piliaiev	b8842bc312	i965: Emit a dummy MEDIA_VFE_STATE before switching from GPGPU to 3D There is an object-level preemption workaround which requires this. However, even without object-level preemption, we seem to have issues with geometry flickering when 3D and compute are combined in the same batch and this appears to fix it. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=110395 Suggested-by: Jason Ekstrand <jason@jlekstrand.net> Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: mesa-stable@lists.freedesktop.org	2019-08-08 13:39:15 +00:00
Bas Nieuwenhuizen	23a9d20997	radv: Avoid VEGA/RAVEN scissor bug in binning. CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Dave Airlie <airlied@redhat.com>	2019-08-08 14:08:21 +02:00
Bas Nieuwenhuizen	4a3f987afd	radv: Avoid binning RAVEN hangs. Mirroring radeonsi. CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Dave Airlie <airlied@redhat.com>	2019-08-08 14:08:21 +02:00
Bas Nieuwenhuizen	66ecc3eac8	radv: Fix off by one for S_028C48_MAX_ALLOC_COUNT. Reviewed-by: Dave Airlie <airlied@redhat.com>	2019-08-08 14:08:21 +02:00
Jan Zielinski	207026d29e	swr/rasterizer: modernize thread TLB Reviewed-by: Alok Hota <alok.hota@intel.com>	2019-08-08 12:33:21 +02:00
Jan Zielinski	387599a661	swr/rasterizer: Refactor events collection mechanism Several improvements and cleanups in events and statstics mechanisms Reviewed-by: Alok Hota <alok.hota@intel.com>	2019-08-08 11:15:07 +02:00
Jan Zielinski	ff75c35846	swr/rasterizer: improvements in simdlib 1. fix build issues with MSVC 2019 compiler The MSVC 2019 compiler seems to have an issue with optimized code-gen when using the _mm256_and_si256() intrinsic. Only disable use of integer vpand on buggy versions MSVC 2019. Otherwise allow use of integer vpand intrinsic. 2. Remove unused vec/matrix functionality Reviewed-by: Alok Hota <alok.hota@intel.com>	2019-08-08 10:53:47 +02:00
Jan Zielinski	b55a93fdd4	swr/rasterizer: Events are now grouped and enabled by knobs All events are now grouped as follows: -Framework (i.e. ThreadStart) [always ON] -Api (i.e. SwrSync) [always ON] -Pipeline [default ON] -Shader [default ON] -SWTag [default OFF] -Memory [default OFF] Reviewed-by: Alok Hota <alok.hota@intel.com>	2019-08-08 10:33:25 +02:00
Jan Zielinski	982d99490f	swr/rasterizer: do not mark tiles dirty until actually rendered Reviewed-by: Alok Hota <alok.hota@intel.com>	2019-08-08 10:16:20 +02:00
Jan Zielinski	4f04f260d9	swr/rasterizer: enable size accumulation in mem stats Small refactoring is also performed Reviewed-by: Alok Hota <alok.hota@intel.com>	2019-08-08 10:16:20 +02:00
Jan Zielinski	365ad367f1	swr/rasterizer: enable using AOS vertex data format Reviewed-by: Alok Hota <alok.hota@intel.com>	2019-08-08 10:16:20 +02:00
Iago Toral Quiroga	fb9f7872e7	v3d: handle wait requirement when retrieving query results correctly Reviewed-by: Eric Anholt <eric@anholt.net>	2019-08-08 08:36:52 +02:00
Iago Toral Quiroga	0f2d1dfe65	v3d: use the GPU to record primitives written to transform feedback We can use the PRIMITIVE_COUNTS_FEEDBACK packet to write various primitive counts to a buffer, including the number of primives written to transform feedback buffers, which will handle buffer overflow correctly. There are a couple of caveats with this: Primitive counters are reset when we emit a 'Tile Binning Mode Configuration' packet, which can happen in the middle of a primitives query, so we need to read the buffer when we submit a job and accumulate the counts in the context so we don't lose them. We also need to do the same when we switch primitive type during transform feedback so we can compute the correct number of recorded vertices from the number of primitives. This is necessary so we can provide an accurate vertex count for draw from transform feedback. v2: - When computing the number of vertices for a primitive, pass in the base primitive, since that is what the hardware will count. - No need to update primitive counts when switching primitive types if the base primitives are the same. - Log perf warning when mapping the primitive counts BO for readback (Eric). - Only emit the primitive counts packet once at job end (Eric). - Use u_upload mechanism for the primitive counts buffer (Eric). - Use the XML to generate indices into the primitive counters buffer (Eric). Fixes piglit tests: spec/ext_transform_feedback/overflow-edge-cases spec/ext_transform_feedback/query-primitives_written-bufferrange spec/ext_transform_feedback/query-primitives_written-bufferrange-discard spec/ext_transform_feedback/change-size base-shrink spec/ext_transform_feedback/change-size base-grow spec/ext_transform_feedback/change-size offset-shrink spec/ext_transform_feedback/change-size offset-grow spec/ext_transform_feedback/change-size range-shrink spec/ext_transform_feedback/change-size range-grow spec/ext_transform_feedback/intervening-read prims-written Reviewed-by: Eric Anholt <eric@anholt.net>	2019-08-08 08:36:52 +02:00
Iago Toral Quiroga	cf8986bce0	gallium/util: add a helper to compute vertex count from primitive count v2: - Only compute vertex counts for base primitives. - Add a unit test (Eric) Reviewed-by: Eric Anholt <eric@anholt.net>	2019-08-08 08:36:52 +02:00
Iago Toral Quiroga	9eb8699e0f	v3d: be more explicit about the query types supported Reviewed-by: Eric Anholt <eric@anholt.net>	2019-08-08 08:36:52 +02:00
Iago Toral Quiroga	9b316ab57a	v3d: generate packet unpack functions These were not being compiled because of the lack of __gen_unpack_address. v2: - Shift raw address correctly (Eric). Reviewed-by: Eric Anholt <eric@anholt.net>	2019-08-08 08:36:52 +02:00
Iago Toral Quiroga	5ffb8b1716	v3d: add header guards in v3d_packet_helpers.h Reviewed-by: Eric Anholt <eric@anholt.net>	2019-08-08 08:36:52 +02:00
Tomeu Vizoso	e7eac8a1e8	panfrost: Print errors from kernel Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-08 07:42:52 +02:00
Tomeu Vizoso	7c8434889d	panfrost: Mark buffers as PANFROST_BO_HEAP What we call GROWABLE in Mesa corresponds to the HEAP BO flag in the kernel. These buffers cannot be memory mapped in the CPU side at the moment, so make sure they are also marked INVISIBLE. This allows us to allocate a big heap upfront (16MB) without actually reserving space unless it's needed. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-08 07:42:52 +02:00
Tomeu Vizoso	19afd41e65	panfrost: Mark BOs as NOEXEC Unless a BO has the EXECUTABLE flag, mark it as NOEXEC. v2: - Rework version detection (Alyssa). Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-08 07:42:52 +02:00
Tomeu Vizoso	9398932c2d	panfrost: Take into account flags when looking up in the BO cache This will be useful right now so we avoid retrieving a non-executable buffer when a executable one is needed. As we support more flags, this logic will need to be extended to consider the different trade-offs to be made when matching BO specifications to BOs in the cache. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-08 07:42:52 +02:00
Tomeu Vizoso	950b5fc596	panfrost: Allocate shaders in their own BOs Instead of all shaders being stored in a single BO, have each shader in its own. This removes the need for a 16MB allocation per context, and allows us to place transient blend shaders in BOs marked as executable (before they were allocated in the transient pool, which shouldn't be executable). v2: - Store compiled blend shaders in a malloc'ed buffer, to avoid reading from GPU-accessible memory when patching (Alyssa). - Free struct panfrost_blend_shader (Alyssa). - Give the job a reference to regular shaders when emitting (Alyssa). v3: - Split out the allocation flags change (Rob). Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-08 07:42:52 +02:00
Tomeu Vizoso	5804d75b9c	util/hash_table: Fix hashing in clears on 32-bit Some hash functions (eg. key_u64_hash) will attempt to dereference the key, causing an invalid access when passed DELETED_KEY_VALUE (0x1) or FREED_KEY_VALUE (0x0). When in 32-bit arch a 64-bit key value doesn't fit into a pointer, so hash_table_u64 internally use a pointer to a struct containing the 64-bit key value. Fix _mesa_hash_table_u64_clear() to handle the 32-bit case by creating a temporary hash_key_u64 to pass to the hash function. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Suggested-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Cc: Samuel Pitoiset <samuel.pitoiset@gmail.com> Cc: Nicolai Hähnle <nicolai.haehnle@amd.com>	2019-08-08 07:42:52 +02:00
Tapani Pälli	aba57b11ee	anv: support GetSwapchainGrallocUsage2ANDROID for Android New function supports gralloc1 usage flags that get set separately for producer and consumer. As we still need to support old method too, let's share common code and use android_convertGralloc0To1Usage helper. Bump the VK_ANDROID_native_buffer version to indicate support for the new call. Changes were tested on Android Celadon P with Basemark GPU and various Sascha Willems Vulkan demos. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-08-08 05:08:01 +00:00
Mark Janes	51c3ab618b	st/mesa: eliminate unnecessary redirection Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-07 21:33:56 -07:00
Mark Janes	61c54a8878	intel/perf: fix debug typo Misspelling was seen with INTEL_DEBUG=perfmon. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-07 21:33:56 -07:00
Mark Janes	2df1ab4d48	intel/perf: make gen_perf_query_object private Encapsulate the details of this structure within the perf implemenation. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-07 21:33:56 -07:00
Mark Janes	deea3798b6	intel/perf: make perf context private Encapsulate the details of this data structure. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-07 21:33:56 -07:00
Mark Janes	1f4f421ce0	intel/perf: print debug information INTEL_DEBUG=perfmon will iterate over the perf queries, printing information about the state of each query. Some of this information will be private to intel/perf, and needs to a dump routine that can be called from i965. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-07 21:33:56 -07:00

... 24 25 26 27 28 ...

115447 commits