fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-06 07:18:17 +02:00

Author	SHA1	Message	Date
George Kyriazis	8c83d2d371	swr: Support simd16 vertex shaders Supporting simd16 vertex shaders involves packing the output of the fetch shader appropriately, especially the vertexID buffers that have to be formatted in one simd16 register, needed by the VS. As part of this support, we needed to remove the 2nd JitManager, since it was not accounting for vector width correctly. USE_SIMD16_SHADERS is also split into two defines. The additional one (USE_SIMD16_VS) controls the width of the vertex shader (VS), while the original one (USE_SIMD16_SHADERS) controls overall front end width. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-01-19 16:52:42 -06:00
George Kyriazis	1874d95a8e	swr/rast: changed jit debug magic number Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-01-19 16:52:41 -06:00
George Kyriazis	c719f62621	swr/rast: Added ICLAMP builder function Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-01-19 16:52:41 -06:00
George Kyriazis	f192502001	swr/rast: Jit debug work Properly validate DLL matches OBJ for jitted function Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-01-19 16:52:41 -06:00
George Kyriazis	3c405e32b0	swr/rast: silence generated file warnings Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-01-19 16:52:40 -06:00
George Kyriazis	fe107e3c17	swr/rast: jit shader lib debug work Create shader_lib during build, link with shaders at DLL generation time Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-01-19 16:52:40 -06:00
George Kyriazis	0cd9ad98a3	swr/rast: AVX-512 changes to enable 16-wide VS Add a new define (USE_SIMD16_VS), to denote calling a 16-wide vertex shader. This is needed because the mesa driver can do 16-wide shaders, but rasty cannot yet, so we need to distinguish. Create a new VertexID entry (VertexID16) for the USE_SIMD16_VS case, since we need to format the vertex id in a way that is digestible by the 16-wide VS Disabled for now. To be enabled in a future checkin when driver work is complete. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-01-19 16:52:40 -06:00
George Kyriazis	3140e714d2	swr/rast: x86 autogenerated macro work Add name argument to x86 autogenerated macros. Add useful variable names for DCL_inputVec implementation. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-01-19 16:52:39 -06:00
George Kyriazis	4cd6e2ebfd	swr/rast: Shorten some filenames in shader and fetch dump files Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-01-19 16:52:39 -06:00
George Kyriazis	3936044d07	swr/rast: work supporting optimizations in Debug builds. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-01-19 16:52:38 -06:00
George Kyriazis	c4a42f5add	swr/rast: Add debugging type support for function types. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-01-19 16:52:38 -06:00
George Kyriazis	e9e7f3ce0a	swr/rast: Shader debugging work - Move debug .ll files to JIT_CACHE_DIR - Don't link against jitter SRGBLut table, add global data to shader that needs it. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-01-19 16:52:34 -06:00
George Kyriazis	34bbcb5052	swr/rast: Debug Symbols work Added support for Fetch / Sample / LD functions Added DLL link to JitCache implementation Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-01-19 16:52:30 -06:00
George Kyriazis	01ab218bbc	swr/rast: Initial work for debugging support. Adds ability to step into jitted llvm IR in Visual Studio. - Updated llvm type generation script to also generate corresponding debug types. - New module pass inserts debug metadata into the IR for each function Disabled by default. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-01-19 16:52:22 -06:00
George Kyriazis	4660e13152	swr/rast: Add private state parameter in fetcher Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-01-19 16:48:41 -06:00
George Kyriazis	079ae3c48d	swr/rast: Added missing define for Linux/gcc + ZeroMemory() macro definition for non win32-compilation in common/os.h Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-01-19 16:48:41 -06:00
George Kyriazis	70f8eac603	swr/rast: Fix one more invalid object format for windows. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-01-19 16:48:41 -06:00
Bas Nieuwenhuizen	61a790409e	radv: Always re-emit the sample position offset user SGPR. The user SGPR location can change between pipelines, so we need to emit it again to the pottentially changed SGPR index. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-01-19 23:35:12 +01:00
Bas Nieuwenhuizen	dbf1e918cd	radv: emit pa_sc_mode_cntl_0 with multisample state. We don't have the meta kludge with 0 viewports anymore, so we can always enable them. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-01-19 23:35:12 +01:00
Kenneth Graunke	c7dcee58b5	i965: Avoid problems from referencing orphaned BOs after growing. Growing the batch/state buffer is a lot more dangerous than I thought. A number of places emit multiple state buffer sections, and then write data to the returned pointer, or save a pointer to brw->batch.state.bo and then use it in relocations. If each call can grow, this can result in stale map references or stale BO pointers. Furthermore, fences refer to the old batch BO, and that reference needs to continue working. To avoid these woes, we avoid ever swapping the brw->batch.*.bo pointer, instead exchanging the brw_bo structures in place. That way, stale BO references are fine - the GEM handle changes, but the brw_bo pointer doesn't. We also defer the memcpy until a quiescent point, so callers can write to the returned pointer - which may be in either BO - and we'll sort it out and combine the two properly in the end. v2/v3: - Handle stale pointers in the shadow copy case, where realloc may or may not move our shadow copy to a new address. - Track the partial map explicitly, to avoid problems with buffer reuse where multiple map modes exist (caught by Chris Wilson). v4: - Don't use realloc in the CPU shadow case, it isn't safe. Fixes: `2dfc119f22` "i965: Grow the batch/state buffers if we need space and can't flush." Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> [v3] Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk>	2018-01-19 11:30:10 -08:00
Kenneth Graunke	8a5bc304ff	i965: Rename 'aux' to 'prog_data' in program cache. 'aux' is a very generic name, suggesting it can be a bunch of things. However, it's always the brw_*_prog_data structure. So, call it that. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2018-01-19 11:29:47 -08:00
Chuck Atkins	a4be2bcee2	swr: allow a single swr architecture to be builtin Part 2 of 2 (part 1 is autoconf changes, part 2 is C++ changes) When only a single SWR architecture is being used, this allows that architecture to be builtin rather than as a separate libswrARCH.so that gets loaded via dlopen. Since there are now several different code paths for each detected CPU architecture, the log output is also adjusted to convey where the backend is getting loaded from. This allows SWR to be used for static mesa builds which are still important for large HPC environments where shared libraries can impose unacceptable application startup times as hundreds of thousands of copies of the libs are loaded from a shared parallel filesystem. Based on an initial implementation by Tim Rowley. v2: Refactor repetitive preprocessor checks to reduce code duplication v3: Formatting changes per Bruce C. Also delay screen creation until end to avoid leaks when failure conditions are hit. Signed-off-by: Chuck Atkins <chuck.atkins@kitware.com> Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com> CC: Tim Rowley <timothy.o.rowley@intel.com> Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-01-19 13:16:00 -06:00
Chuck Atkins	2ed8b6f827	swr: (autoconf) allow a single swr architecture to be builtin Part 1 of 2 (part 1 is autoconf changes, part 2 is C++ changes) When only a single SWR architecture is being used, this allows that architecture to be builtin rather than as a separate libswrARCH.so that gets loaded via dlopen. Since there are now several different code paths for each detected CPU architecture, the log output is also adjusted to convey where the backend is getting loaded from. This allows SWR to be used for static mesa builds which are still important for large HPC environments where shared libraries can impose unacceptable application startup times as hundreds of thousands of copies of the libs are loaded from a shared parallel filesystem. Based on an initial implementation by Tim Rowley. v2: Fix comment placement pointed out by Bruce C. Signed-off-by: Chuck Atkins <chuck.atkins@kitware.com> Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com> CC: Tim Rowley <timothy.o.rowley@intel.com> Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-01-19 13:15:54 -06:00
Greg V	8ff8c82630	swr: fix clang 5 null cast warning Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-01-19 16:15:56 +00:00
Gert Wollny	ea89843b3d	mesa/program: Fix -Wunused-param warning v2: Don't annotate, but remove the unused ctx parameter Signed-off-by: Gert Wollny <gw.fossdev@gmail.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-01-19 15:45:57 +00:00
Gert Wollny	81d8a0f4a4	mesa/program/prog_execute.c: Silence -Wunused-param v2: Don't annotate, but remove the unused ctx parameter Signed-off-by: Gert Wollny <gw.fossdev@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-01-19 15:45:57 +00:00
Gert Wollny	fef4b16523	mesa: Make numSamples an unsigned int As a followup to the previous patch propagate the change of numSamples from int to unsigned to gl_config::samples and consequently fix some -Wsign-compare warnings. Signed-off-by: Gert Wollny <gw.fossdev@gmail.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-01-19 15:45:57 +00:00
Gert Wollny	d0e37599ab	gallium: Make (num_)samples an unsigned int According to the ARB_multisample num_samples is a non-negative integer. Consequently define it as such, fail in glx/choose_visual if a negative number is given. v2: split patch into gallium and mesa part Signed-off-by: Gert Wollny <gw.fossdev@gmail.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-01-19 15:45:57 +00:00
Andres Gomez	7a2c87177a	docs: correct a typo in releasing instructions Cc: Emil Velikov <emil.velikov@collabora.com> Cc: Juan A. Suarez Romero <jasuarez@igalia.com> Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-01-19 15:25:53 +02:00
Andres Gomez	7760566ab7	docs: move untar line in basic testing instructions for coherence For scons, windows/mingw dealing with LLVM_CONFIG is done before untarring. This is also more convenient for copy and paste. Cc: Emil Velikov <emil.velikov@collabora.com> Cc: Juan A. Suarez Romero <jasuarez@igalia.com> Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-01-19 15:25:39 +02:00
Andres Gomez	bd8537fa71	docs: add a notice whenever a release is the final in a series Cc: Emil Velikov <emil.velikov@collabora.com> Cc: Juan A. Suarez Romero <jasuarez@igalia.com> Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-01-19 15:25:27 +02:00
Andres Gomez	b910eec489	docs: add final release note for 17.2.8 Cc: Emil Velikov <emil.velikov@collabora.com> Cc: Juan A. Suarez Romero <jasuarez@igalia.com> Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-01-19 15:25:20 +02:00
Andres Gomez	4b50cfef44	docs: add final release note for 17.1.10 Cc: Emil Velikov <emil.velikov@collabora.com> Cc: Juan A. Suarez Romero <jasuarez@igalia.com> Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-01-19 15:20:57 +02:00
Grazvydas Ignotas	e6abc613e2	st/vdpau: release held lock in error path Signed-off-by: Grazvydas Ignotas <notasas@gmail.com> Reviewed-by: Christian König <christian.koenig@amd.com> Cc: mesa-stable@lists.freedesktop.org	2018-01-19 13:30:22 +02:00
Juan A. Suarez Romero	302ff82434	docs: update calendar, add news and link release notes to 17.3.3 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2018-01-19 10:46:18 +01:00
Juan A. Suarez Romero	059db12097	docs: add sha256 checksums for 17.3.3 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> (cherry picked from commit `bc1503b13f`)	2018-01-19 10:46:18 +01:00
Juan A. Suarez Romero	3205a45fc3	docs: add release notes for 17.3.3 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> (cherry picked from commit `80f5f279b3`)	2018-01-19 10:46:18 +01:00
Samuel Iglesias Gonsálvez	7109a1fe13	anv: avoid segmentation fault due to vk_error() vk_error() is a macro that calls __vk_errorf() with instance == NULL. Then, __vk_errorf() passes a pointer to instance->debug_report_callbacks to vk_debug_error(), which segfaults as this pointer is invalid but not NULL. Fixes: `e5b1bd6ab8` "vulkan: move anv VK_EXT_debug_report implementation to common code." Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2018-01-19 09:39:05 +01:00
Bas Nieuwenhuizen	32170d87e3	ac/nir: Fix vector extraction if source vector has >4 elements. v2: Add forgotten argument and start offset. Fixes: `91074bb11b` "radv/ac: Implement Float64 SSBO stores." Tested-by: Timothy Arceri <tarceri@itsqueeze.com> Acked-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-01-19 02:00:28 +01:00
Bas Nieuwenhuizen	f4211e6f93	ac/nir: Use correct 32-bit component writemask for 64-bit SSBO stores. Fixes: `91074bb11b` "radv/ac: Implement Float64 SSBO stores." Tested-by: Timothy Arceri <tarceri@itsqueeze.com> Acked-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-01-19 02:00:14 +01:00
Bas Nieuwenhuizen	4a9fd90e1e	ac/nir: Fix TCS output LDS offsets. When a channel was not set we also did not increase the LDS address, while that obviously should happen. The output loading code was inadvertently fixed which resulted in a mismatch causing the SaschaWillems tessellation demo to result in corrupt rendering. Fixes: `7898eb9a60` "ac: rework load_tcs_{inputs,outputs}" Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-01-19 01:54:59 +01:00
Bas Nieuwenhuizen	bd5c942cef	radv: Use correct bindings for inputRate in key generation. The bindings also have an index field. Fixes: `49d035122e` "radv: Add single pipeline cache key." Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=104677 Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-01-19 01:54:59 +01:00
Bas Nieuwenhuizen	b1444c9ccb	radv: Implement VK_ANDROID_native_buffer. Passes dEQP-VK.api.smoke.* dEQP-VK.wsi.android.* with android-cts-7.1_r12 . Unlike the initial anv implementation this does use syncobjs instead of waiting on the CPU. This is missing meson build coverage for now. One possible todo is that linux 4.15 now has a sycall that allows us to export amdgpu fence to a sync_file, which allows us not to force all fences and semaphores to use syncobjs. However, I had trouble with my kernel crashing regularly with NULL pointers, and I'm not sure how beneficial it is in the first place given that intel uses syncobjs for all fences if available. Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-01-19 01:43:55 +01:00
Bas Nieuwenhuizen	a3e241ed07	radv: Add create image flag to not use DCC/CMASK. If we import an image, we might not have space in the buffer for CMASK, even though it is compatible. Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-01-19 01:43:55 +01:00
Bas Nieuwenhuizen	e344cd8178	radv: Generate VK_ANDROID_native_buffer. Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-01-19 01:43:55 +01:00
Bas Nieuwenhuizen	0f89f9b8eb	radv: Replace an assert with unreachable. Otherwise we get uninitialized variable warnings for es_vgpr_comp_cnt. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-01-19 00:38:45 +01:00
Bas Nieuwenhuizen	e417ab212b	radv: Remove DCC check on CS resolve dst image. Gives a warning when the assert is disabled, and not even necessarily true. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-01-19 00:38:45 +01:00
George Kyriazis	f76ca91ae0	gallivm: support avx512 (16x32) in interleave2_half lp_build_interleave2_half was not doing the right thing for avx512-style 16-wide loads. This path is hit in the swr driver with a 16-wide vertex shader. It is called from lp_build_transpose_aos, when doing texel fetches and the fetched data needs to be transposed to one component per output register. Special-case the post-load swizzle operations for avx512 16x32 (16-wide 32-bit values) so that we move the xyzw components correctly to the outputs. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2018-01-18 17:07:06 -06:00
Brian Paul	9e6efdd177	vbo: fix VBO optimization regression The optimization in change `8e4efdc895` ("vbo: optimize some display list drawing") missed the loopback case. This is used when the glBegin/End primitive doesn't have a uniform set of vertex attributes. The new Piglit gl-1.0-dlist-materials test hits this. So check the aligned_vertex_buffer_offset(list) value and adjust the buffer offset accordingly. We also need to remove the 'start == 0' assertion in the loopback code since it no longer applies. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2018-01-18 15:07:17 -07:00
Dylan Baker	26bde1e354	meson: ensure that xmlpool_options.h is generated for targets that need it Currently a couple of gallium targets race with xmlpool_options.h being generated, don't do that. Signed-off-by: Dylan Baker <dylan.c.baker@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2018-01-18 13:31:47 -08:00

1 2 3 4 5 ...

99305 commits