fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-27 03:38:12 +02:00

Author	SHA1	Message	Date
Kristian Høgsberg Kristensen	7b348ab8a0	anv: Fix rebase error	2016-03-05 14:33:50 -08:00
Kristian Høgsberg Kristensen	34326f46df	anv: Turn pipeline cache on by default Move the environment variable check to cache creation time so we block both lookups and uploads if it's turned off.	2016-03-05 13:54:24 -08:00
Kristian Høgsberg Kristensen	f2b37132cb	anv: Check if shader is present before uploading to cache Between the initial check that returns NO_KERNEL and compiling the shader, other threads may have added the shader to the cache. Before uploading the kernel, check again (under the mutex) that the compiled shader still isn't present.	2016-03-05 13:54:24 -08:00
Kristian Høgsberg Kristensen	30bbe28b7e	anv: Always use point size from the shader There is no API for setting the point size and the shader is always required to set it. Section 24.4: "If the value written to PointSize is less than or equal to zero, or if no value was written to PointSize, results are undefined." As such, we can just always program PointWidthSource to Vertex. This simplifies anv_pipeline a bit and avoids trouble when we enable the pipeline cache and don't have writes_point_size in the prog_data.	2016-03-05 13:54:24 -08:00
Kristian Høgsberg Kristensen	6139fe9a77	anv: Also cache the struct anv_pipeline_binding maps This is state that we generate when compiling the shaders and we need it for mapping resources from descriptor sets to binding table indices.	2016-03-05 13:50:07 -08:00
Kristian Høgsberg Kristensen	584f39c65e	anv: Don't re-upload shaders when merging Using anv_pipeline_cache_upload_kernel() will re-upload the kernel and prog_data when we merge caches. Since the kernel and prog_data are already in the program_stream, use anv_pipeline_cache_add_entry() instead to only add the entry to the hash table.	2016-03-05 13:50:07 -08:00
Kristian Høgsberg Kristensen	626559ed37	anv: Add anv_pipeline_cache_add_entry() This function will grow the cache to make room and then add the entry.	2016-03-05 13:50:07 -08:00
Kristian Høgsberg Kristensen	07441c344c	anv: Rename anv_pipeline_cache_add_entry() to 'set' This function is a helper that unconditionally sets a hash table entry and expects the cache to have enough room. Calling it 'add_entry' suggests it will grow the cache as needed.	2016-03-05 13:50:07 -08:00
Kristian Høgsberg Kristensen	87967a2c85	anv: Simplify pipeline cache control flow a bit No functional change, but the control flow around searching the cache and falling back to compiling is a bit simpler.	2016-03-05 13:50:07 -08:00
Kristian Høgsberg Kristensen	2b29342fae	anv: Store prog data in pipeline cache stream We have to keep it there for the cache to work, so let's not have an extra copy in struct anv_pipeline too.	2016-03-05 13:50:07 -08:00
Kristian Høgsberg Kristensen	37c5e70253	anv: Rename 'table' to 'hash_table' in anv_pipeline_cache A little less ambiguous.	2016-03-05 13:50:07 -08:00
Kristian Høgsberg Kristensen	c028ffea70	anv: Serialize as much pipeline cache as we can We can serialize as much as the application asks for and just stop once we run out of memory. This lets applications use a fixed amount of space for caching and still get some benefit.	2016-03-05 13:50:07 -08:00
Kristian Høgsberg Kristensen	cd812f086e	anv: Use 1.0 pipeline cache header The final version of the pipeline cache header adds a few more fields.	2016-03-05 13:50:07 -08:00
Kristian Høgsberg Kristensen	26ed943eb9	anv: Fix shader key hashing This was copied from inline code to a helper and wasn't updated to hash a pointer instead.	2016-03-05 13:50:07 -08:00
Kristian Høgsberg Kristensen	3baf8af947	anv: Remove excess whitespace	2016-03-05 13:50:07 -08:00
Kristian Høgsberg Kristensen	ab36eae5e7	anv: Remove left-over bits of sparse-descriptor code	2016-03-05 13:50:07 -08:00
Jason Ekstrand	1afdfc3e6e	anv/pipeline: Implement the depth compare EQUAL workaround on gen8+	2016-03-05 09:59:28 -08:00
Jason Ekstrand	7c1660aa14	anv: Don't allow D16_UNORM to be combined with stencil Among other things, this can cause the depth or stencil test to spuriously fail when the fragment shader uses discard.	2016-03-05 09:59:28 -08:00
Jason Ekstrand	9a90176d48	anv/pipeline: Calculate the correct max_source_attr for 3DSTATE_SBE	2016-03-05 09:59:28 -08:00
Brian Paul	a4678311be	st/mesa: 78-column wrapping in st_extensions.c Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2016-03-05 09:21:05 -07:00
Brian Paul	9e6a6bd575	gallium/util: add new comments, assertions in u_debug_refcnt.c Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2016-03-05 09:20:34 -07:00
Brian Paul	b6a607b221	gallium/util: update comments and URL in u_debug_refcnt.c Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2016-03-05 09:20:28 -07:00
Brian Paul	cbca6964e2	gallium/util: make stream variable static in u_debug_refcnt.c Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2016-03-05 09:20:23 -07:00
Brian Paul	fb0abedce7	gallium/util: re-indent u_debug_refcnt.[ch] Wrap comments to 78 columns, etc. Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2016-03-05 09:20:14 -07:00
Brian Paul	a7ba29f6d8	gallium/tests: silence warning in compute.c compute.c: In function ‘launch_grid’: compute.c:435:20: warning: assignment discards ‘const’ qualifier from pointer target type [enabled by default] info.input = input; ^ Maybe the pipe_grid_info::input field should be const void *? Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2016-03-05 09:15:44 -07:00
Timothy Arceri	31943e6ba5	glsl: replace remaining tabs in link_varyings.cpp Reviewed-by: Thomas Helland <thomashelland90@gmail.com>	2016-03-05 20:50:10 +11:00
Timothy Arceri	e2415e8467	glsl: replace remaining tabs in link_uniforms.cpp Reviewed-by: Thomas Helland <thomashelland90@gmail.com>	2016-03-05 20:50:05 +11:00
Jordan Justen	81f30e2f50	anv/hsw: Move query code to genX file for Haswell This fixes many CTS cases, but will require an update to the kernel command parser register whitelist. (The CS GPRs and TIMESTAMP registers need to be whitelisted.) Signed-off-by: Jordan Justen <jordan.l.justen@intel.com>	2016-03-05 01:08:07 -08:00
Timothy Arceri	3322cb7b8d	docs: mark align layout qualifier as DONE Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2016-03-05 19:39:13 +11:00
Timothy Arceri	037f68d81e	glsl: apply align layout qualifier rules to block offsets From Section 4.4.5 (Uniform and Shader Storage Block Layout Qualifiers) of the OpenGL 4.50 spec: "The align qualifier makes the start of each block member have a minimum byte alignment. It does not affect the internal layout within each member, which will still follow the std140 or std430 rules. The specified alignment must be a power of 2, or a compile-time error results. The actual alignment of a member will be the greater of the specified align alignment and the standard (e.g., std140) base alignment for the member's type. The actual offset of a member is computed as follows: If offset was declared, start with that offset, otherwise start with the next available offset. If the resulting offset is not a multiple of the actual alignment, increase it to the first offset that is a multiple of the actual alignment. This results in the actual offset the member will have. When align is applied to an array, it affects only the start of the array, not the array's internal stride. Both an offset and an align qualifier can be specified on a declaration. The align qualifier, when used on a block, has the same effect as qualifying each member with the same align value as declared on the block, and gets the same compile-time results and errors as if this had been done. As described in general earlier, an individual member can specify its own align, which overrides the block-level align, but just for that member." Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2016-03-05 19:39:07 +11:00
Timothy Arceri	5a27fefffe	glsl: parse align layout qualifier Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2016-03-05 19:39:01 +11:00
Timothy Arceri	22b0082b9d	docs: mark explicit byte offsets as DONE Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com>	2016-03-05 19:38:55 +11:00
Timothy Arceri	802262c0af	glsl: use explicit offset when lowering buffer access Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com>	2016-03-05 19:38:49 +11:00
Timothy Arceri	96527c3cf2	glsl: copy explicit offset to uniform storage Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com>	2016-03-05 19:38:44 +11:00
Timothy Arceri	e12a49ac12	glsl: update comment on offset field The old comment was for the location not the offset, we now use the field for block members so mention that also. Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com>	2016-03-05 19:38:39 +11:00
Timothy Arceri	9f24f42c49	glsl: add offset to glsl interface type In this patch we also copy the offset value from the ast and implement offset linking rules by adding it to the record_compare() function. From Section 4.4.5 (Uniform and Shader Storage Block Layout Qualifiers) of the GLSL 4.50 spec: "Two blocks linked together in the same program with the same block name must have the exact same set of members qualified with offset and their integral-constant-expression values must be the same, or a link-time error results." Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com>	2016-03-05 19:38:34 +11:00
Timothy Arceri	8abed7f185	glsl: apply compile-time rules for the offset layout qualifier This implements the rules for the offset qualifier on block members. From Section 4.4.5 (Uniform and Shader Storage Block Layout Qualifiers) of the GLSL 4.50 spec: "The offset qualifier can only be used on block members of blocks declared with std140 or std430 layouts." ... "It is a compile-time error to specify an offset that is smaller than the offset of the previous member in the block or that lies within the previous member of the block." ... "The specified offset must be a multiple of the base alignment of the type of the block member it qualifies, or a compile-time error results." Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com>	2016-03-05 19:38:30 +11:00
Timothy Arceri	6f45484ac7	glsl: enable offset layout qualifier for ARB_enhanced_layouts Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com>	2016-03-05 19:38:26 +11:00
Timothy Arceri	1824ff1c2a	glsl: reject invalid input layout qualifiers Global in validation is already handled, this will do the validation for variables, blocks and block members. This fixes some CTS tests for the new enhanced layouts transform feedback qualifiers. V2: add some more valid input flags Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2016-03-05 19:07:09 +11:00
Timothy Arceri	bd53cc7b45	glsl: only apply default stream to output blocks This is needed to allow invalid qualifier checks on inputs. Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2016-03-05 19:07:04 +11:00
Timothy Arceri	78d3098c05	glsl: rework parsing of blocks Previously interface blocks were given the global default flags of uniform blocks. This meant we could not check for invalid qualifiers on interface blocks because they always contained invalid flags. This changes parsing so that interface blocks now get an empty set of layouts. Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2016-03-05 19:07:00 +11:00
Timothy Arceri	d244986bf2	glsl: don't apply uniform/buffer layouts to interface blocks In the following patch we will stop setting these layouts by default on interface blocks, so we need to do this to avoid hitting the assert. Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2016-03-05 19:06:56 +11:00
Nanley Chery	4e75f9b219	anv: Implement VK_REMAINING_{MIP_LEVELS,ARRAY_LAYERS} v2: Subtract the baseMipLevel and baseArrayLayer (Jason) Signed-off-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2016-03-04 21:25:23 -08:00
Kenneth Graunke	4ba7ad6cc1	i965: Only magnify depth for 3D textures, not array textures. When BaseLevel > 0, we magnify the dimensions to fill out the size of miplevels [0..BaseLevel). In particular, this was magnifying depth, thinking that the depth doubles at each level. This is perfectly reasonable for 3D textures, but dead wrong for array textures. Changing the depth != 1 condition to a target == GL_TEXTURE_3D check should make this only happen in the appropriate cases. Fixes about 32 dEQP tests: - dEQP-GLES31.functional.texture.gather.*.level_{1,2} Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com> Cc: mesa-stable@lists.freedesktop.org	2016-03-04 21:25:08 -08:00
Jason Ekstrand	c1436e80ef	anv/meta_clear: Set the right number of dynamic states	2016-03-04 19:18:20 -08:00
Juan A. Suarez Romero	2f76a9924e	i965/vec4: add opportunistic behaviour to opt_vector_float() opt_vector_float() transforms several scalar MOV operations into a single vectorial MOV. This is done when those MOVs cover all the components of the destination register. So something like: mov vgrf3.0.xy:D, 0D mov vgrf3.0.w:D, 1065353216D mov vgrf3.0.z:D, 0D is transformed into: mov vgrf3.0:F, [0F, 0F, 0F, 1F] But there are cases where not all the components are written. For example, in: mov vgrf2.0.x:D, 1073741824D mov vgrf3.0.xy:D, 0D mov vgrf3.0.w:D, 1065353216D mov vgrf4.0.xy:D, 1065353216D mov vgrf4.0.w:D, 0D mov vgrf6.0:UD, u4.xyzw:UD Neither the vgrf3 nor the vgrf4 .z component is written, so the optimization is not applied. But it could be applied anyway with the components covered, using a writemask to select the ones written. So we could transform it into: mov vgrf2.0.x:D, 1073741824D mov vgrf3.0.xyw:F, [0F, 0F, 0F, 1F] mov vgrf4.0.xyw:F, [1F, 1F, 0F, 0F] mov vgrf6.0:UD, u4.xyzw:UD This commit does precisely that: opportunistically apply opt_vector_float() when possible. total instructions in shared programs: 7124660 -> 7114784 (-0.14%) instructions in affected programs: 443078 -> 433202 (-2.23%) helped: 4998 HURT: 0 total cycles in shared programs: 64757760 -> 64728016 (-0.05%) cycles in affected programs: 1401686 -> 1371942 (-2.12%) helped: 3243 HURT: 38 v2: change vectorize_mov() signature (Matt). v3: take into account predicates (Juan). v4 [mattst88]: Update shader-db numbers. Fix some whitespace issues. Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2016-03-04 19:16:52 -08:00
Jason Ekstrand	cc57efc67a	anv/pipeline: Fix depthBiasEnable on gen7 The first time I tried to fix this, I set the wrong fields.	2016-03-04 17:56:12 -08:00
Jason Ekstrand	653261285e	anv/cmd_buffer: Reset the state streams when resetting the command buffer	2016-03-04 17:54:29 -08:00
Jason Ekstrand	f700d16a89	anv/cmd_buffer: Include Haswell in set_subpass	2016-03-04 17:54:29 -08:00
George Kyriazis	feb71117ae	st/xlib: Don't destroy screen on XCloseDisplay() screen may still be used by other resources that are not yet freed. To correctly fix this there will be a need to account for resources differently, but this quick fix is not any worse than the original code that leaked screens anyway. Reviewed-by: Brian Paul <brianp@vmware.com>	2016-03-04 18:14:46 -07:00
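
The offset rule quoted in the 037f68d81e entry above (take the greater of the align qualifier and the base alignment, start from the declared offset or the next available one, then round up) can be sketched roughly as follows. This is only an illustration of the quoted spec text, not the code from the commit: the helper names `align_up` and `member_offset`, and the convention that an `align_qualifier` of 0 means "no align given", are assumptions made for this example.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical helper: round offset up to the next multiple of alignment. */
static uint32_t align_up(uint32_t offset, uint32_t alignment)
{
    return (offset + alignment - 1) / alignment * alignment;
}

/*
 * Hypothetical helper illustrating the quoted rule.
 *   next_available  - first free byte after the previous member
 *   has_offset      - whether the member carries an explicit offset
 *   declared_offset - that explicit offset (ignored if !has_offset)
 *   align_qualifier - value of the align qualifier, 0 if none (assumption)
 *   base_alignment  - std140/std430 base alignment of the member's type
 */
static uint32_t member_offset(uint32_t next_available, bool has_offset,
                              uint32_t declared_offset,
                              uint32_t align_qualifier,
                              uint32_t base_alignment)
{
    /* The spec requires a power of two; a compiler would report an error. */
    assert(align_qualifier == 0 ||
           (align_qualifier & (align_qualifier - 1)) == 0);
    assert(base_alignment > 0);

    /* Actual alignment is the greater of the two alignments. */
    uint32_t actual_align = align_qualifier > base_alignment
                          ? align_qualifier : base_alignment;

    /* Start from the declared offset if present, else the next free byte. */
    uint32_t start = has_offset ? declared_offset : next_available;

    return align_up(start, actual_align);
}
```

For instance, a member with a base alignment of 4, `align = 16`, and a next available offset of 4 would land at offset 16 under this sketch.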


86267 commits
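
The v2 note in the 4e75f9b219 entry ("Subtract the baseMipLevel and baseArrayLayer") hints at how the VK_REMAINING_* sentinels are resolved: the remaining count is the image's total minus the base index. A minimal sketch of that arithmetic follows. The function name `resolve_count` and the macro `REMAINING` are hypothetical stand-ins chosen for this example; the Vulkan headers define VK_REMAINING_MIP_LEVELS and VK_REMAINING_ARRAY_LAYERS as `(~0U)`, which is the value assumed here.

```c
#include <assert.h>
#include <stdint.h>

/* Stand-in for VK_REMAINING_MIP_LEVELS / VK_REMAINING_ARRAY_LAYERS. */
#define REMAINING (~0u)

/*
 * Hypothetical helper: turn a subresource range count into a concrete
 * number of levels or layers.
 *   total - levels (or layers) the image has
 *   base  - baseMipLevel (or baseArrayLayer) of the range
 *   count - levelCount (or layerCount), possibly the sentinel
 */
static uint32_t resolve_count(uint32_t total, uint32_t base, uint32_t count)
{
    assert(base < total);
    /* The sentinel means "everything from base to the end". */
    return count == REMAINING ? total - base : count;
}
```

With a 10-level image and a base level of 3, the sentinel resolves to 7 levels, while an explicit count is passed through unchanged.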