fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-04-13 04:40:43 +02:00

Author	SHA1	Message	Date
Adam Jackson	bc1bc6f512	glx: Lower GLX opcode lookup into SendMakeCurrentRequest Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Signed-off-by: Adam Jackson <ajax@redhat.com>	2017-11-13 10:39:33 -05:00
Jason Ekstrand	93200ea26d	aubinator: Don't skip the first field in each subgroup The previous iteration algorithm would advance the field pointer right after we advance the group. This meant that you would end up with skipping the first field of the group. In the common case, where the only field is a struct (e.g. 3DSTATE_VERTEX_BUFFERS), it would get skipped entirely. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-11-13 07:37:23 -08:00
Jason Ekstrand	74a9e51696	intel/genxml: Delete empty groups They serve no purpose other than to just fill empty space in the packet so each dword has something. Just disallowing empty groups is a bit easier on some of the tools. This does not change the generated packing headers in any way. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-11-13 07:37:23 -08:00
Jason Ekstrand	54a6f7eaca	anv: Don't crash on invalid heap sizes when the PCI ID is overriden	2017-11-13 07:37:23 -08:00
Alex Smith	4122d00846	nir/spirv: tg4 requires a sampler Gather operations in both GLSL and SPIR-V require a sampler. Fixes gathers returning garbage when using separate texture/samplers (on AMD, was using an invalid sampler descriptor). Signed-off-by: Alex Smith <asmith@feralinteractive.com> Cc: "17.2 17.3" <mesa-stable@lists.freedesktop.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-11-13 13:38:18 +00:00
Alex Smith	e9eb3c4753	spirv: Use correct type for sampled images We should use the result type of the OpSampledImage opcode, rather than the type of the underlying image/samplers. This resolves an issue when using separate images and shadow samplers with glslang. Example: layout (...) uniform samplerShadow s0; layout (...) uniform texture2D res0; ... float result = textureLod(sampler2DShadow(res0, s0), uv, 0); For this, for the combined OpSampledImage, the type of the base image was being used (which does not have the Depth flag set, whereas the result type does), therefore it was not being recognised as a shadow sampler. This led to the wrong LLVM intrinsics being emitted by RADV. Signed-off-by: Alex Smith <asmith@feralinteractive.com> Cc: "17.2 17.3" <mesa-stable@lists.freedesktop.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-11-13 13:37:50 +00:00
Alejandro Piñeiro	157c9a1341	spirv: add DO NOT EDIT warning on generated spirv_info.c Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com>	2017-11-13 13:28:44 +01:00
Thomas Hellstrom	54a58b2856	loader/dri3: Improve dri3 thread-safety It turned out that with recent changes that call into dri3 from glFinish(), it appears like different thread end up waiting for X events simultaneously, causing deadlocks since they steal events from eachoter and update the dri3 counters behind eachothers backs. This patch intends to improve on that. It allows at most one thread at a time to wait on events for a single drawable. If another thread intends to do the same, it's put to sleep until the first thread finishes waiting, and then it rechecks counters and optionally retries the waiting. Threads that poll for X events never pulls X events off the event queue if there are other threads waiting for events on that drawable. Counters in the dri3 drawable structure are protected by a mutex. Finally, the mutex we introduce is never held while waiting for the X server to avoid unnecessary stalls. This does not make dri3 drawables completely thread-safe but at least it's a first step. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=102358 Fixes: `d5ba75f888` "st/dri2 Plumb the flush_swapbuffer functionality through to dri3" Signed-off-by: Thomas Hellstrom <thellstrom@vmware.com> Acked-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-11-13 12:43:39 +01:00
Juan A. Suarez Romero	2b72ab58e5	etnaviv: automake,meson: include common_3d.xml.h in the sources lists v2: include the file also in the meson.build (Eric Engestrom). Fixes: `f1e1c60ff6` ("etnaviv: Update from rnndb") Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com>	2017-11-13 12:26:59 +01:00
Tapani Pälli	41f7de477c	egl: EXT_pixel_format_float plumbing Patch adds support and capability to match with new surface attribute, component type. Currently no configs with floating point type are exposed. With this change, following dEQP test starts to pass: dEQP-EGL.functional.choose_config.color_component_type_ext.dont_care dEQP-EGL.functional.choose_config.color_component_type_ext.fixed dEQP-EGL.functional.choose_config.color_component_type_ext.float Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com> Reviewed-by: Adam Jackson <ajax@redhat.com>	2017-11-13 12:40:26 +02:00
Samuel Pitoiset	934b77f2fe	radv: add unlikely() around radv_save_descriptors() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-11-13 11:05:40 +01:00
Samuel Pitoiset	305745457c	radv: optimize calling radv_cmd_buffer_trace_emit() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-11-13 11:05:38 +01:00
Samuel Pitoiset	957d42271b	radv: optimize calling radv_save_pipeline() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-11-13 11:05:36 +01:00
Samuel Pitoiset	ebab5c8ff4	radv: use vk_zalloc instead of vk_alloc+memset Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-11-13 11:05:35 +01:00
Samuel Pitoiset	0f68208f1d	radv: remove unnecessary memset() in radv_AllocateCommandBuffers() This should not be needed, if the allocation fails an error is returned and the host should handle it. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-11-13 11:05:32 +01:00
Samuel Pitoiset	66da4c75bc	radv: remove useless initializations in radv_create_cmd_buffer() There is a memset() above. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-11-13 11:05:30 +01:00
Samuel Pitoiset	3d95fde661	radv: remove useless memset() in radv_CreateFence() All radv_fence fields are initialized here. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-11-13 11:05:28 +01:00
Samuel Pitoiset	cd64a4f705	radv: use vk_error() everywhere an error is returned For consistency and it might help for debugging purposes. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-11-13 11:05:26 +01:00
Samuel Pitoiset	4e16c6a41e	radv: make radv_emit_framebuffer_state() static Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-11-13 11:04:25 +01:00
Samuel Pitoiset	be01197d8d	radv: do not emit the framebuffer when restoring a pass Instead just dirty RADV_CMD_DIRTY_FRAMEBUFFER and it will be re-emitted if necessary before the next draw. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-11-13 11:04:22 +01:00
Samuel Pitoiset	f87c58dde3	radv: prefetch VBO descriptors at the right place Just after the vertex shader. This seems to give a minor boost for, at least, Serious Sam Fusion 2017 and Dawn of War 3. I don't see any real impacts with The Talos Principle. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-11-13 11:03:16 +01:00
Samuel Pitoiset	9444a34f4a	radv: add radv_emit_prefetch_TC_L2_async() helper Will be used for VBO descriptors prefetching. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-11-13 11:03:13 +01:00
Samuel Pitoiset	36c2e46328	radv: rename radv_emit_shaders_prefetch() to radv_emit_prefetch() For consistency because this function will also prefetch VBO descriptors. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-11-13 11:03:11 +01:00
Iago Toral Quiroga	456e10944f	glsl/linker: use without_array() to retrieve type This is what we do in the condition too, so it makes sense. v2: Only compute without_array() once (Ilia). Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2017-11-13 09:22:26 +01:00
Dave Airlie	bec716e844	radv: emit esgs ring size in one place. This register is the same on all gpus so far, so emit it in one place and also for the pre-gfx9 gpus set the value in the pipeline creation. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-11-13 07:17:09 +00:00
Dave Airlie	031e591923	radv: move calculating vs out info regs into pipeline. This moves some calculations of register values into the pipeline construction, it saves looking at outinfo in the cmd buffer emit. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-11-13 07:16:53 +00:00
Rob Clark	4a9aad96aa	freedreno/a5xx: fix SSBO emit for non-zero offset Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-11-12 12:29:00 -05:00
Rob Clark	5f25ab4fee	freedreno/a5xx: remove obsolete comment Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-11-12 12:29:00 -05:00
Rob Clark	8fcee858d5	freedreno/ir3: don't create split/fo if only writing .x In case an instruction only writes one register, and it is .x, we can skip the extra level of fanout indirection. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-11-12 12:28:59 -05:00
Rob Clark	e7b2719f69	freedreno/a5xx: indirect grids Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-11-12 12:28:59 -05:00
Rob Clark	471aa1b6d0	freedreno/a5xx: add global size compute cap Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-11-12 12:28:59 -05:00
Rob Clark	62981bbe65	freedreno/ir3: turn on std430 packing Seems to fix dEQP compute related tests.. and matches what i965 does, so perhaps there is some assumption that std430 packing is on by default somewhere in NIR?	2017-11-12 12:28:59 -05:00
Rob Clark	bedbe7f90c	freedreno/a5xx: image support	2017-11-12 12:28:59 -05:00
Rob Clark	819a613ae3	freedreno/ir3: moar better scheduler Add a new pass that inserts additional dependencies, rather than simply relying on SSA srcs added in the nir->ir3 frontend. This makes it easier to deal with barriers, but the additional false deps also lets us deal properly with ensuring a write depends on all previous reads. Since conversion to barrier instructions is lossy (ie. just knowing the instruction doesn't tell us enough about what other instructions the barrier applies to), use barrier_class/barrier_conflict fields in the ir3_instruction to retain this information. This could probably be relaxed somewhat by considering which array/ buffer/image variable is being referenced. Ie. a write to buffer A can overtake a read from buffer B, if B is not coherent. (right?) Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-11-12 12:28:59 -05:00
Rob Clark	15ea8d128a	freedreno/ir3: move macros I want to add a growable array to ir3_instruction, so we can append false dependencies for purposes of scheduling barriers, atomics, and dealing with write after read hazards. Just code motion preparing for next patch. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-11-12 12:28:59 -05:00
Rob Clark	9edfc369c0	freedreno/ir3: image support Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-11-12 12:28:59 -05:00
Rob Clark	eaae81058c	freedreno/ir3: shared variable support Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-11-12 12:28:59 -05:00
Rob Clark	dd75abc6f3	freedreno/ir3: some SSBO cleanups/fixes Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-11-12 12:28:59 -05:00
Rob Clark	2f8bdf2e2b	freedreno/ir3: split out INSTR4F instructions Atomic instructions take a different # of src args depending on .g or .l variant, split these out into different helpers with INSTR*F() helper macro that lets you specify instruction flag. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-11-12 12:28:59 -05:00
Rob Clark	0038deb256	freedreno/ir3: cat6 encoding fixes Instruction encoding/decoding fixes needed for images, shared variables, etc. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-11-12 12:28:59 -05:00
Rob Clark	4e9a6c6868	freedreno/ir3: add barriers Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-11-12 12:28:59 -05:00
Rob Clark	4c711f4d18	freedreno/ir3: invert is_same_type_mov() logic Some instructions (like barriers) have no dst, which causes problems with dereferencing a NULL dst. Flip the logic around to reject opc's that can't be a type of move first, to filter out those instructions. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-11-12 12:28:59 -05:00
Rob Clark	6da5130074	freedreno/ir3: add cat7 instructions Needed for memory and execution barriers. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-11-12 12:28:59 -05:00
Rob Clark	33f5f63b8f	freedreno/ir3: add SSBO get_buffer_size() support Somehow I overlooked this when adding initial SSBO support. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-11-12 12:28:59 -05:00
Rob Clark	b267a08404	freedreno/ir3: extract helper for common consts User consts and driver consts such as UBO addresses and immediates are handled the same for all shader stages, so split out a shared helper for these, to make it easier to add more. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-11-12 12:28:59 -05:00
Rob Clark	13fe1feb62	freedreno: add image view state tracking It is unfortunate that image state isn't a real CSO, since (at least for a4xx/a5xx) it is a combination of sampler and "SSBO" image state, and it would be useful to pre-compute the state block "register" values rather than doing it at emit time. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-11-12 12:28:59 -05:00
Rob Clark	12c1c3ab23	freedreno: update generated headers Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-11-12 12:28:59 -05:00
Rob Clark	0006b860ce	mesa/st/nir: assign driver_location for images Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-11-12 12:28:59 -05:00
Rob Clark	ecbe1e976f	st/program: fix compute shader nir references In case the IR is NIR, the driver takes reference to the nir_shader. Also, because there are no variants, we need to clone the shader, instead of sharing the reference with gl_program, which would result in a double free in _mesa_delete_program(). Signed-off-by: Rob Clark <robdclark@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-11-12 12:28:59 -05:00
Rob Clark	5009dc55f2	freedreno/ir3: rename ir3_compile -> ir3_context Having both an ir3_compile (which was really context for compiling a single shader variant) and ir3_compiler (which is the compiler object that compiles all variants, ie. basically holds the RA regset) is a bit confusing. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-11-12 12:28:59 -05:00

1 2 3 4 5 ...

89862 commits