If a shader declares a uniform with an explicit location in one stage but
an implicit one in another, the explicit location should be used. The patch
marks implicit uniforms as explicit if they were explicit in a previous stage.
This makes sure that we don't treat them as implicit later when assigning
locations.
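A rough sketch of the rule in plain C (the struct and function here are
illustrative stand-ins, not the actual linker code):

#include <stdbool.h>

/* Illustrative stand-in for the linker's per-variable data. */
struct uniform_decl {
   bool explicit_location;
   int location;
};

/* When the same uniform is seen again in another stage, an explicit
 * location from either declaration wins, and the merged variable is
 * treated as explicit from then on so it is skipped when implicit
 * locations are assigned. */
static void
merge_uniform_location(struct uniform_decl *merged,
                       const struct uniform_decl *other_stage)
{
   if (other_stage->explicit_location && !merged->explicit_location) {
      merged->explicit_location = true;
      merged->location = other_stage->location;
   }
}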
Fixes the following CTS test:
ES31-CTS.explicit_uniform_location.uniform-loc-implicit-in-some-stages3
v2: move check to cross_validate_globals (Timothy)
Signed-off-by: Tapani Pälli <tapani.palli@intel.com>
Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com>
The RS and hardware binding tables are only supported on the 3D
pipeline and can lead to corruption if left enabled during a GPGPU
workload. Disable them when switching to the GPGPU (or media) pipeline
and re-enable them when switching back to the 3D pipeline.
Reviewed-by: Matt Turner <mattst88@gmail.com>
Reviewed-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>
This hardware bug can supposedly lead to a hang on IVB and VLV.
Reviewed-by: Matt Turner <mattst88@gmail.com>
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
AFAIK brw_emit_select_pipeline() is only called once during context
init on Gen4-5, at which point the pipeline is likely to be already
idle, so it may just happen to work by luck regardless of the MI_FLUSH.
Reviewed-by: Matt Turner <mattst88@gmail.com>
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
Switching the current pipeline while it isn't completely idle, or while
the read and write caches aren't flushed, can lead to corruption. Fixes
misrendering of at least the following Khronos CTS test:
ES31-CTS.shader_image_load_store.basic-allTargets-store-fs
The stall and flushes are no longer required on Gen8+.
v2: Emit PIPE_CONTROL with a non-zero post-sync op before the write
cache flush on SNB due to a hardware bug. (Ken)
Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=93323
Reviewed-by: Matt Turner <mattst88@gmail.com>
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
This hardware bug can cause a hang on context restore while the
current pipeline is set to GPGPU (BDWGFX HSD 1909593). In addition to
clearing the valid bit, mark the CC state as dirty to make sure that
the CC indirect state pointer is re-emitted when we switch back to the
3D pipeline.
Reviewed-by: Matt Turner <mattst88@gmail.com>
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
This will be used on Gen8+ to make sure that the color calculator
state pointers are re-emitted when switching back to the 3D pipeline
after some GPGPU workload, due to a hardware workaround. There are
other state bits already defined that could be used to achieve the
same effect, but they all cause a ton of unrelated state to be
re-emitted (e.g. BRW_NEW_STATE_BASE_ADDRESS), so just define a new
one; state bits are cheap.
Reviewed-by: Matt Turner <mattst88@gmail.com>
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
We were just hard-coding everything to a vec4. This meant we weren't
handling shadow samplers at all, and integer textures were getting the
wrong return type.
Reduces local memory usage in a lot of Metro 2033 Redux and a few KSP
shaders:
total local used in shared programs : 54116 -> 30372 (-43.88%)
Probably a modest advantage for execution, but it's an important
prerequisite for dropping some of the TGSI optimizations done by the
state tracker.
Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>
Previously we were treating any indirect temp array usage to mean that
everything should end up in lmem. The MemoryOpt pass would clean a lot
of that up later, but in the meantime we would lose a lot of
opportunities for optimization.
This helps a lot of Metro 2033 Redux and a handful of KSP shaders:
total instructions in shared programs : 6288373 -> 6261517 (-0.43%)
total gprs used in shared programs : 944051 -> 945131 (0.11%)
total local used in shared programs : 54116 -> 54116 (0.00%)
A typical case is for register usage to double and for instructions to
halve. A future commit can also reduce local memory usage with better
packing.
Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>
Indirect constbuf indexing works by using very large offsets. However,
if an indirect constbuf index load is const-propagated, it becomes a very
large const offset. Take that into account when legalizing the SSA by
moving the high part of that offset into the file index. Also disallow
very large (or small) indices on most other instructions.
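A rough sketch of the legalization idea in plain C (the helper and the
64 KiB-per-slot stride are assumptions for illustration, not the exact
nv50/ir encoding):

#define CONSTBUF_STRIDE 0x10000u   /* assumed 64 KiB window per constbuf slot */

/* A const-propagated indirect index shows up as one huge flat offset;
 * fold its high part back into the constant-buffer file index and keep
 * only the in-buffer part as the instruction's offset. */
static void
split_const_offset(unsigned flat_offset, int *file_index, unsigned *offset)
{
   *file_index += (int)(flat_offset / CONSTBUF_STRIDE);  /* high part -> slot */
   *offset      = flat_offset % CONSTBUF_STRIDE;         /* low part stays encodable */
}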
This fixes regressions in ubo_array_indexing/*-two-arrays piglit tests.
Fixes: abd326e81b (nv50/ir: propagate indirect loads into instructions)
Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>
If this is the command buffer's first call to vkBeginCommandBuffer, we must
*initialize* the command buffer's state. Otherwise, we must *reset* its
state. In both cases, let's use anv_ResetCommandBuffer.
From the Vulkan 1.0 spec:
If a command buffer is in the executable state and the command buffer
was allocated from a command pool with the
VK_COMMAND_POOL_CREATE_RESET_COMMAND_BUFFER_BIT flag set, then
vkBeginCommandBuffer implicitly resets the command buffer, behaving
as if vkResetCommandBuffer had been called with
VK_COMMAND_BUFFER_RESET_RELEASE_RESOURCES_BIT not set. It then puts
the command buffer in the recording state.
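At the API level this is just the normal begin path; a minimal usage
sketch (not driver code, cmd_buffer is an assumed handle):

#include <vulkan/vulkan.h>

/* Beginning an executable command buffer whose pool was created with
 * VK_COMMAND_POOL_CREATE_RESET_COMMAND_BUFFER_BIT implicitly resets it
 * before putting it back into the recording state. */
static void
begin_recording(VkCommandBuffer cmd_buffer)
{
   const VkCommandBufferBeginInfo begin_info = {
      .sType = VK_STRUCTURE_TYPE_COMMAND_BUFFER_BEGIN_INFO,
   };
   vkBeginCommandBuffer(cmd_buffer, &begin_info);
}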
vkResetCommandPool currently destroys its command buffers. The Vulkan
1.0 spec requires that it only reset them:
Resetting a command pool recycles all of the resources from all of
the command buffers allocated from the command pool back to the
command pool. All command buffers that have been allocated from the
command pool are put in the initial state.
I'm not sure about the consequences of this bug, but it's definitely
dangerous.
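For reference, the API-level expectation is simply (usage sketch; device
and command_pool are assumed handles):

/* Every command buffer allocated from the pool goes back to the initial
 * state; none of them is freed. */
vkResetCommandPool(device, command_pool, 0 /* no RELEASE_RESOURCES */);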
This applies to SI, CIK, VI.
Cc: 11.0 11.1 <mesa-stable@lists.freedesktop.org>
Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>
For those formats that support hw mipmap generation, use the
DXGenMips command. Otherwise, fall back to the mipmap generation utility.
Tested with piglit, OpenGL apps (Heaven, Turbine, Cinebench)
v2: - make sure the texture surface was created with the render target bind flag
    - set relocation flag to SVGA_RELOC_WRITE for the texture surface
Reviewed-by: Brian Paul <brianp@vmware.com>
Reviewed-by: Jose Fonseca <jfonseca@vmware.com>
The actual increment of the num-generate-mipmap counter will be done
in a subsequent patch when hw generate mipmap is supported.
Reviewed-by: Brian Paul <brianp@vmware.com>
Reviewed-by: Jose Fonseca <jfonseca@vmware.com>
This patch adds a new interface to support hardware mipmap generation.
PIPE_CAP_GENERATE_MIPMAP is added to allow a driver to specify
whether this new interface is supported; if not, the state tracker will
fall back to mipmap generation by rendering/texturing.
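The new hook looks roughly like this (the return type and parameter list
here are an approximation, not the authoritative definition):

/* Returns TRUE if the driver generated the mipmaps, FALSE if the state
 * tracker should fall back to the rendering/texturing path. */
boolean (*generate_mipmap)(struct pipe_context *ctx,
                           struct pipe_resource *resource,
                           enum pipe_format format,
                           unsigned base_level,
                           unsigned last_level,
                           unsigned first_layer,
                           unsigned last_layer);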
v2: add PIPE_CAP_GENERATE_MIPMAP to the disabled section for all drivers
v3: add format to the generate_mipmap interface to allow mipmap generation
using a format other than the resource format
v4: fix return type of trace_context_generate_mipmap()
Reviewed-by: Brian Paul <brianp@vmware.com>
Reviewed-by: Roland Scheidegger <sroland@vmware.com>
Reviewed-by: Jose Fonseca <jfonseca@vmware.com>
The OpenGL specification for bitfieldExtract() says:
The result will be undefined if <offset> or <bits> is negative, or if
the sum of <offset> and <bits> is greater than the number of bits
used to store the operand.
Therefore passing bits=32, offset=0 is legal and defined in GLSL.
But the earlier SM5 ubfe/ibfe opcodes are specified to accept a bitfield width
ranging from 0-31. As such, Intel and AMD instructions read only the low 5 bits
of the width operand, making them unable to implement the GLSL-specified
behavior directly.
This commit adds ubfe/ibfe operations from SM5 and a lowering pass for
bitfield_extract to handle the trivial case of <bits> = 32 as
bitfieldExtract:
bits > 31 ? value : bfe(value, offset, bits)
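For reference, the intended result for the unsigned case in plain C
(illustrative only, not the actual lowering code):

#include <stdint.h>

static uint32_t
bitfield_extract_ref(uint32_t value, unsigned offset, unsigned bits)
{
   if (bits > 31)
      return value;                               /* trivial case added by the lowering */
   return (value >> offset) & ((1u << bits) - 1); /* what ubfe computes */
}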
Fixes:
ES31-CTS.shader_bitfield_operation.bitfieldExtract.uvec3_0
Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=92595
Reviewed-by: Connor Abbott <cwabbott0@gmail.com>
Tested-by: Marta Lofstedt <marta.lofstedt@intel.com>
The OpenGL specification for bitfieldInsert() says:
The result will be undefined if <offset> or <bits> is negative, or if
the sum of <offset> and <bits> is greater than the number of bits
used to store the operand.
Therefore passing bits=32, offset=0 is legal and defined in GLSL.
But the earlier SM5 bfi opcode is specified to accept a bitfield width
ranging from 0-31. As such, Intel and AMD instructions read only the low
5 bits of the width operand, making them unable to implement the
GLSL-specified behavior directly.
This commit fixes the lowering of bitfield_insert to handle the trivial
case of <bits> = 32 as
bitfieldInsert:
bits > 31 ? insert : bfi(bfm(bits, offset), insert, base)
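Similarly, a plain-C reference for the unsigned case (illustrative only,
not the actual lowering code):

#include <stdint.h>

static uint32_t
bitfield_insert_ref(uint32_t base, uint32_t insert, unsigned offset, unsigned bits)
{
   if (bits > 31)
      return insert;                                    /* trivial case handled by the lowering */
   uint32_t mask = ((1u << bits) - 1) << offset;        /* bfm(bits, offset) */
   return (base & ~mask) | ((insert << offset) & mask); /* bfi(mask, insert, base) */
}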
Fixes:
ES31-CTS.shader_bitfield_operation.bitfieldInsert.uint_2
ES31-CTS.shader_bitfield_operation.bitfieldInsert.uvec4_3
Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=92595
Reviewed-by: Connor Abbott <cwabbott0@gmail.com>
Tested-by: Marta Lofstedt <marta.lofstedt@intel.com>
We check that a bunch of raster operations are disabled in
blit_copy_pixels(). We also need to check that color logicop is
disabled.
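Something along these lines, assuming the helper reports failure with
GL_FALSE (the field name is an assumption from memory, not verified
against the tree):

/* In addition to the existing raster-op checks, bail out of the fast
 * blit path when color logic ops are enabled. */
if (ctx->Color.ColorLogicOpEnabled)
   return GL_FALSE;   /* fall back to the general CopyPixels path */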
Reviewed-by: Marek Olšák <marek.olsak@amd.com>
The whole point of AMD_pinned_memory is that applications don't have to map
buffers via OpenGL, but they're still allowed to, so make sure we don't break
the link between the buffer object and user memory unless explicitly instructed
to.
Reviewed-by: Marek Olšák <marek.olsak@amd.com>