fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-01-01 05:20:09 +01:00

Author	SHA1	Message	Date
Kenneth Graunke	4c4d9e4f03	glsl: Fix program interface queries relating to interface blocks. This fixes 555 dEQP tests (using the nougat-cts-dev branch), Piglit's arb_program_interface_query/arb_program_interface_query-resource-query, and GL45-CTS.program_interface_query.separate-programs-{tess-control, tess-eval,geometry}. Only one dEQP program interface failure remains. I would have liked to split this up into several distinct changes, but I wasn't sure how to do that given thet tangled nature of these issues. So, the issues: * We need to treat interface blocks declared as an array of instances as a single block - removing the outer array. The resource list entry's name should not include the array length. Properties such as GL_ARRAY_SIZE should refer to the variable inside the block, not the interface block's array properties. * We need to do this prefixing even for structure variables. * We need to do this for built-ins (such as gl_PerVertex.gl_Position). * After interface array unwrapping, any variable which is an array should have [0] appended. It doesn't matter if it's a TCS/TES/GS input or TCS output - that looked like an attempt to unwrap for per-vertex variables, but that didn't consider per-patch variables, and as far as I can tell there's nothing to justify this. Several Mesa developers have suggested that Issue 16 contradicts the main specification, but I believe that it doesn't - the main spec just isn't terribly clear. The main ARB_program_interface query spec says: "* For an active interface block not declared as an array of block instances, a single entry will be generated, using the block name from the shader source. * For an active interface block declared as an array of instances, separate entries will be generated for each active instance. The name of the instance is formed by concatenating the block name, the "[" character, an integer identifying the instance number, and the "]" character." Issue 16 says that built-ins should be named "gl_PerVertex.gl_Position", but several people suggested the second bullet above means that it should be named "gl_PerVertex[array length].gl_Position". There are two important things to note. Those bullet points say "an active interface block", while the others say "variable" or "active shader storage block member". They also don't mention applying the rules recursively (unlike the other bullets). Both suggest that these rules apply to blocks themselves, not members of blocks. In fact, for GL_UNIFORM_BLOCK queries, we do have "block[0]", "block[1]", ... resource list entries - so those rules are real, and actually used. So if they don't apply to block members, then how should members be named? Unfortunately, I don't see any rules outside of issue 16 - where the rationale is very unclear. I hope to clarify the spec in the future. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-12-19 15:43:09 -08:00
Kenneth Graunke	ad6d1d70ad	glsl: Drop bogus is_vertex_input from add_shader_variable(). stage_mask is a bitmask of shader stages, so the proper comparison would be (1 << MESA_SHADER_VERTEX), not MESA_SHADER_VERTEX itself. But we only care for structure types, and VS inputs cannot be structs. So we can just drop this entirely. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2016-12-19 15:40:47 -08:00
Kenneth Graunke	37d63b50b1	mesa/get: Convert stencil values to TYPE_UINT. These are listed as Z+ in the GL spec, and often have values of 0xFFFFFFFF. For glGetFloat, we should return 4294967295.0 rather than -1.0. Similarly, for glGetInteger64v, we should return 0xFFFFFFFF, not the sign extended 0xFFFFFFFFFFFFFFFF. Fixes 6 dEQP tests matching the pattern dEQP-GLES3.functional.state_query.integers.stencilvaluemask*getfloat when run in a single process (with state reset code happening between tests, which makes dEQP set the stencil value mask to 0xFFFFFFFF). Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2016-12-19 11:33:40 -08:00
Kenneth Graunke	9f93afb9a5	mesa/get: Add TYPE_UINT for casting through a GLuint. The "State Tables" section of the OpenGL specification lists many values as belonging to Z+ (non-negative integers), not Z (all integers). For ordinary glGetInteger queries, this doesn't matter. However, when accessing Z+ values via glGetFloat or glGetInteger64, we need to treat the source value as an unsigned value. Otherwise, we'll produce a negative number when bit 31 is set. This commit merely adds the plumbing. It doesn't convert any values. v2: Gotta catch 'em all (add missing cases caught by Ilia) Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2016-12-19 11:33:40 -08:00
Kenneth Graunke	78a391ed83	mesa/get: Make GetFloat/GetDouble of TYPE_INT_N not normalize things. GetFloat of integer valued things is supposed to perform a simple int -> float conversion. INT_TO_FLOAT is not that. Instead, it converts [-2147483648, 2147483647] to a normalized [-1.0, 1.0] float. This is only used for COMPRESSED_TEXTURE_FORMATS, which nobody in their right mind would try and access via glGetFloat(), but we may as well fix it. Found by inspection. v2: Gotta catch 'em all (fix another case of this caught by Ilia) Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2016-12-19 11:33:40 -08:00
Michel Dänzer	52098fada7	Revert "cso: don't release sampler states that are bound" This reverts commit `6dc96de303`. No longer necessary with the previous change. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-12-19 17:51:38 +09:00
Michel Dänzer	95eb5e4eed	cso: Make sanitize_hash safe for samplers Remove currently bound sampler states from the hash table before pruning entries from the hash table, so they cannot accidentally be deleted by the pruning. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-12-19 17:51:34 +09:00
Michel Dänzer	745e2eaaec	cso: Store hash key in struct cso_sampler Preparation for following changes, no functional change intended. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-12-19 17:51:31 +09:00
Michel Dänzer	9e14238647	cso: Optimize cso_save/restore_fragment_samplers Only copy/memset the pointers that actually need to be. v2: * Cast info->nr_samplers to int for calculating delta (Nicolai) Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-12-19 17:50:21 +09:00
Michel Dänzer	5e70f80c99	cso: Store pointers to struct cso_sampler in struct sampler_info Preparation for following changes, no functional change intended. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-12-19 17:50:17 +09:00
Michel Dänzer	3d661a12be	cso: Don't restore nr_samplers in cso_restore_fragment_samplers If info->nr_samplers > ctx->nr_fragment_samplers_saved, the assignment would prevent cso_single_sampler_done from unbinding the no longer used samplers from the driver, which could result in use-after-free. This is probably unlikely to happen in practice though. Cc: "12.0 13.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-12-19 17:50:08 +09:00
Liu Zhiquan	e2610bf165	EGL/android: Enhance pbuffer implementation Some dri drivers will pass multiple bits in buffer_mask parameter to droid_image_get_buffer(), more than the actual supported buffer type combination. For such case, will go through all the bits, and will not return error when unsupported buffer is requested, only return error when the allocation for supported buffer failed. v2: coding style and log changes v3: coding style changes and update patch format Signed-off-by: Liu Zhiquan <zhiquan.liu@intel.com> Signed-off-by: Long, Zhifang <zhifang.long@intel.com> Reviewed-by: Tomasz Figa <tfiga@chromium.org>	2016-12-19 08:26:32 +02:00
Bas Nieuwenhuizen	1d529cba02	radv: Use correct workgroup size limits. Not sure where the 16k comes from, but pretty sure 2k is the max. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2016-12-18 22:18:14 +01:00
Dave Airlie	6229994ab7	radv: expose the compute queue v2: Don't expose the SDMA queue and use the CIK check also in the second if. (Bas) Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-12-18 20:52:55 +01:00
Bas Nieuwenhuizen	442735d35d	radv: Only emit PFP ME syncs for DMA on the GFX queue. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2016-12-18 20:52:51 +01:00
Bas Nieuwenhuizen	f2523ebf52	radv: Create an empty CS per ring type. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2016-12-18 20:52:47 +01:00
Bas Nieuwenhuizen	accc5fc026	radv: Don't enable CMASK on compute queues. We can't fast clear on compute queues. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2016-12-18 20:52:41 +01:00
Bas Nieuwenhuizen	bfee9866ea	radv: Use RELEASE_MEM packet for MEC timestamp query. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2016-12-18 20:52:37 +01:00
Bas Nieuwenhuizen	9b0efc98ba	radv: Implement indirect dispatch for the MEC. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2016-12-18 20:52:33 +01:00
Bas Nieuwenhuizen	3a559029e2	radv: update vkCmdUpdateBuffer for the MEC. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2016-12-18 20:52:29 +01:00
Bas Nieuwenhuizen	b3499557a2	radv: Implement cache flushing for the MEC. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2016-12-18 20:52:26 +01:00
Dave Airlie	72aaa83f4b	radv: add semaphore support Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-12-18 20:52:26 +01:00
Dave Airlie	d270b5fac3	radv: pass queue index into winsys submission This is so we can submit on separate queues if needed Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-12-18 20:52:26 +01:00
Dave Airlie	d0e6fb0574	radv: init compute queue and avoid initing transfer queues Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-12-18 20:52:26 +01:00
Bas Nieuwenhuizen	71dabe1c16	radv/winsys: Make WaitIdle queue aware. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2016-12-18 20:52:20 +01:00
Dave Airlie	d028bd7b55	radv/meta: update header info Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-12-18 20:52:20 +01:00
Dave Airlie	4bd666a319	radv: hook compute clears into clear image api. These aren't used yet but we will want to use them when we implement a separate compute queue. Signed-off-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-12-18 20:52:20 +01:00
Dave Airlie	f11ea8779d	radv: clear image implementation for compute queue Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-12-18 20:52:20 +01:00
Dave Airlie	9839ce282b	radv/meta: split clear image out into a separate layer clear function This will make it easier to add support for clears on compute queues. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-12-18 20:52:20 +01:00
Dave Airlie	ef5f59c9a9	radv: implement image->image copies using compute shader This is required for having a separate compute queue, we probably can't use this on GFX queue due to DCC. v2: Set coord_components = 2 for itoi texture fetch. (Bas) Signed-off-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-12-18 20:52:20 +01:00
Dave Airlie	983af3a6d1	radv: add a compute shader implementation for buffer to image This implements the reverse of the current buffer->image path and can be used when we need to do image transfer on compute queues This just adds the code turned off as we don't support separate computes queues yet, and we don't want to use this path on the GFX queues for DCC reasons. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-12-18 20:52:20 +01:00
Bas Nieuwenhuizen	35cf08ef64	radv: Use correct pitch for views with different block size. Needed when accessing a comrpessed texture as R32G32B32A32 from a shader. This was not encountered previously, as we used the CB for the reinterpretation, which does not use this pitch. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2016-12-18 20:52:15 +01:00
Dave Airlie	94a7434bbc	radv: Store queue family in command buffers. v2: Added helper (Bas) Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-12-18 20:52:15 +01:00
Dave Airlie	c20701f4be	radv: start fixing up queue allocate for multiple queues v2: Fix error handling and zero init the device (Bas) Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-12-18 20:52:15 +01:00
Dave Airlie	59c9a131f4	radv/winsys: start adding support for DMA/compute queue Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-12-18 20:52:15 +01:00
Bas Nieuwenhuizen	86cb418bd4	radv/winsys: Expose number of compute/dma rings. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2016-12-18 20:52:08 +01:00
Rob Clark	2c0dfd48f0	freedreno/a5xx: border color support Not 100% sure it works if you have border color in VS.. but it might be right. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-18 13:49:45 -05:00
Rob Clark	939486d3d3	freedreno/a5xx: use MRT0 to import linear zs A bit of a hack, but we need to do this until we can do tiled zs in sysmem (and associated tile/until blits for transfer_map). Fixes xonotic and glmark2 "refract", when reorder wasn't enabled. (reorder would paper over the issue by avoiding the extra round- trip to system memory and back to gmem. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-18 13:48:10 -05:00
Rob Clark	bea8602e5b	freedreno: fdN_gmem_restore_format() is not gen specific Refactor out into a common helper, since this is the same across generations when we need equiv z/s gmem restore format. Next patch needs this in a5xx, rather than creating yet another helper push this into core. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-18 13:48:03 -05:00
Rob Clark	6f93c75a47	freedreno/a5xx: cargo-cult end-batch sequence more faithfully Fixes some issues at least with GMEM bypass mode, where we'd sometimes end up with some FS quads not hitting memory. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-18 13:47:54 -05:00
Rob Clark	d35022f24d	freedreno/a5xx: misc fix Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-18 13:47:47 -05:00
Rob Clark	651f2655a8	freedreno/a5xx: fix (at least some) vtx formats Swap/component-order doesn't seem to be quite what that is. At least blob was always setting it to XYZW ('11') but we weren't. Causing problems w/ formats like sint16.. Hard-coding this instead at least seems to get glamor working. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-18 13:47:38 -05:00
Rob Clark	2540226f66	freedreno/a5xx: more formats Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-18 13:47:31 -05:00
Rob Clark	c768461c1f	freedreno/a5xx: fixup caps Might not be 100% accurate, mostly just copy from a4xx to get started. We are defn lying about occlusion query at this point (not implemented yet) but need it to expose anything higher than gl1.4 (glamor needs gl2.1) Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-18 13:47:18 -05:00
Rob Clark	abcf8f5b58	freedreno/a5xx: fix random faults on first sysmem draw Not sure what this event is, but blob writes it.. and it seems to solve random write faults at mystery address that would sometimes happen on first BYPASS draw. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-18 13:47:08 -05:00
Rob Clark	54537fa1dc	freedreno: update generated headers Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-18 13:47:00 -05:00
Rob Clark	5e632b3a83	freedreno/a5xx: fix stride/size for mem->gmem blits <brownpaperbag>these should be the in-GMEM dimensions</brownpaperbag> Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-18 13:46:48 -05:00
Dave Airlie	0f2e9a8986	radv/winsys: consolidate request->fence code Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-12-17 16:30:16 +01:00
Dave Airlie	7ad1c24e2a	radv: handle fence allocation failing Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-12-17 16:29:57 +01:00
Bas Nieuwenhuizen	b2b4f7248b	radv: Don't bail out on pipeline create failure. The spec says we have to try to create all, and only set failed pipelines to VK_NULL_HANDLE. If one of them fails, we have to return an error, but as far as I can see, the spec does not care which of the suberrors. Fixes dEQP-VK.api.object_management.alloc_callback_fail_multiple.compute_pipeline dEQP-VK.api.object_management.alloc_callback_fail_multiple.graphics_pipeline Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2016-12-17 11:41:53 +01:00

... 95 96 97 98 99 ...

92185 commits