Dynamic color/blend state can be NULL when we're not rendering to any
color targets (only outputting to depth and/or stencil). Initialize
cmd_buffer->cb_state to NULL so we can reliably detect whether it has
been set or not.
The lower_phis_to_scalar() pass recurses through the instruction
dependence graph to determine whether all the sources of a given
instruction are scalarizable. To prevent cycles, it temporarily marks
the phi instruction before recursing into it, then updates the entry
with the resulting value. However, it does not account for the entry
having been moved during the recursion (e.g. by a table rehash), hence
causing a use-after-free and a crash.
This patch fixes this by reloading the entry corresponding to the 'phi'
after recursing and before updating its value.
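As a sketch of the pattern (hypothetical helper names, not the actual
pass code):

    struct entry *entry = table_search(phi);
    entry->value = true;                /* temporary mark to break cycles */

    /* May recurse into this phi's sources, inserting entries and
     * possibly rehashing the table, which moves existing entries. */
    bool scalarizable = all_srcs_scalarizable(phi);

    entry = table_search(phi);          /* reload: old pointer may be stale */
    entry->value = scalarizable;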
The crash can be reproduced ~20% of the time with the dEQP test:
dEQP-GLES3.functional.shaders.loops.while_constant_iterations.nested_sequence_fragment
Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>
Now that Jason's LOAD_PAYLOAD improvements have landed, we don't need
this. Passing 1 for the number of header registers already takes care
of setting force_writemask_all on the header copy.
Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>
Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
See the corresponding code in brw_vs_surface_state.c.
v2: const more things (requested by Topi Pohjolainen)
Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>
Reviewed-by: Ben Widawsky <ben@bwidawsk.net>
We used to store the GS dispatch mode in brw_gs_prog_data while
separately storing the VS dispatch mode in brw_vue_prog_data::simd8.
This patch introduces an enum to represent all possible dispatch modes,
and stores it in brw_vue_prog_data::dispatch_mode, unifying the two.
Based on a suggestion by Matt Turner.
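The enum has roughly this shape (a sketch; the values are meant to
match the hardware's dispatch mode encodings):

    enum shader_dispatch_mode {
       DISPATCH_MODE_4X1_SINGLE = 0,
       DISPATCH_MODE_4X2_DUAL_INSTANCE = 1,
       DISPATCH_MODE_4X2_DUAL_OBJECT = 2,
       DISPATCH_MODE_SIMD8 = 3,
    };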
Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>
Reviewed-by: Ben Widawsky <ben@bwidawsk.net>
The documentation makes it pretty clear that we shouldn't use this:
"Under normal conditions SW shall specify DMask, as the GS stage
will provide a Dispatch Mask appropriate to SIMD4x2 or SIMD8 thread
execution (as a function of dispatch mode). E.g., for SIMD4x2
execution, the GS stage will generate a Dispatch Mask that is equal
to what the EU would use as the Vector Mask. For SIMD8 execution
there is no known usage model for use of Vector Mask (as there is
for PS shaders)."
I also managed to find descriptions of DMask and VMask, in the "State
Register" (sr0.2/3) field descriptions:
"Dispatch Mask (DMask). This 32-bit field specifies which channels
are active at Dispatch time."
"Vector Mask (VMask). This 32-bit field contains, for each 4-bit
group, the OR of the corresponding 4-bit group in the dispatch
mask."
SIMD4x2 shaders process one or two vec4 values, with each 4-bit group
corresponding to xyzw channel enables (either all on, or all off).
Thus, DMask = VMask in SIMD4x2 mode. But in SIMD8 mode, 4-bit groups
are meaningless, so it just messes up your values.
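As an illustration of the quoted derivation (not driver code):

    /* Each 4-bit group of VMask is the OR-reduction of the matching
     * 4-bit group of DMask. */
    static uint32_t
    vmask_from_dmask(uint32_t dmask)
    {
       uint32_t vmask = 0;
       for (unsigned i = 0; i < 32; i += 4) {
          if ((dmask >> i) & 0xf)
             vmask |= 0xfu << i;
       }
       return vmask;
    }

In SIMD8 mode, a dispatch mask with only two active channels
(0b00000011) produces a VMask of 0b00001111, enabling two channels
that were never dispatched.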
Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>
Reviewed-by: Ben Widawsky <ben@bwidawsk.net>
GLSL IR vs. NIR shader-db results for SIMD8 vertex shaders on Broadwell:
total instructions in shared programs: 2742062 -> 2681339 (-2.21%)
instructions in affected programs: 1514770 -> 1454047 (-4.01%)
helped: 5813
HURT: 1120
The gained programs are ARB vertex programs that were previously going
through the vec4 backend. Now that we have prog_to_nir, ARB vertex
programs can go through the scalar backend so they show up as "gained" in
the shader-db results.
Acked-by: Kenneth Graunke <kenneth@whitecape.org>
Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>
Acked-by: Matt Turner <mattst88@gmail.com>
It's incomplete -- it wasn't filling in ReferenceType, so it was
producing garbage in the disassembly. Furthermore, it seems impossible
to get the jump information through this interface.
The solution to the function size problem is to effectively book-keep
the machine code start and end addresses while JIT'ing.
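A generic sketch of that book-keeping (all names hypothetical):

    struct jit_func_extents {
       const void *start;   /* first byte of emitted machine code */
       const void *end;     /* one past the last byte */
    };

    static void
    note_code_emitted(struct jit_func_extents *e,
                      const void *addr, size_t size)
    {
       e->start = addr;
       e->end = (const char *)addr + size;
    }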
This supports the three Vulkan border color types for float color
formats. The support for integer formats is a little trickier, as we
don't know the format of the texture at this time.
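For reference, the three float border color types map onto fixed RGBA
values, roughly (a sketch indexed by the Vulkan enum):

    static const float border_colors[][4] = {
       [VK_BORDER_COLOR_FLOAT_TRANSPARENT_BLACK] = { 0, 0, 0, 0 },
       [VK_BORDER_COLOR_FLOAT_OPAQUE_BLACK]      = { 0, 0, 0, 1 },
       [VK_BORDER_COLOR_FLOAT_OPAQUE_WHITE]      = { 1, 1, 1, 1 },
    };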
When calculating the binding table index for non-constant sampler
array indexing, we need to add the base binding table index, which is
a constant within the generated code. Often this base is zero, so we
can avoid a redundant instruction in that case.
It looks like nothing in shader-db is doing non-constant sampler array
indexing so this patch doesn't make any difference but it might be
worth having anyway.
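i.e., something along these lines (hypothetical helper names):

    /* The ADD of the constant base is redundant when the base is zero. */
    if (base_binding_table_index != 0)
       sampler_reg = emit_add(sampler_reg,
                              imm_ud(base_binding_table_index));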
Reviewed-by: Matt Turner <mattst88@gmail.com>
Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>
Acked-by: Ben Widawsky <ben@bwidawsk.net>
Previously when generating the send instruction for a sample
instruction with an indirect sampler it would use the destination
register as a temporary store. This breaks when used in combination
with the opt_sampler_eot optimisation because that forces the
destination to be null. This patch fixes that by avoiding the temp
register altogether.
The reason the temporary register was needed was that it was trying
to ensure the binding table index doesn't overflow a byte by and'ing
it with 0xff. The result is then or'd with sampler_index<<8. This patch
instead just and's the whole thing with 0xfff. This will ensure that a
bogus sampler index won't overflow into the rest of the message
descriptor but unlike the previous code it won't ensure that the
binding table index doesn't overflow into the sampler index. It
doesn't seem like that should matter very much though because if the
shader is generating a bogus sampler index then it's going to just get
garbage out either way.
Instead of doing sampler_index<<8 | (sampler_index+base_table_index),
the new code avoids one operation by doing
sampler_index*0x101 + base_table_index, which should be equivalent.
However if we wanted to avoid the multiply for some reason we could do
this by adding an extra or instruction still without needing the
temporary register.
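Spelling out the equivalence (plain C, with sampler_index and base
standing in for the runtime values):

    /* sampler_index * 0x101 + base
     *   = (sampler_index << 8) + sampler_index + base
     *   = (sampler_index << 8) | (sampler_index + base)
     * provided sampler_index + base < 0x100, so the low-byte addition
     * never carries into the sampler index field. */
    uint32_t old_style = (sampler_index << 8) | ((sampler_index + base) & 0xff);
    uint32_t new_style = (sampler_index * 0x101 + base) & 0xfff;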
This fixes a number of Piglit tests on Skylake that were using
indirect samplers such as:
spec@arb_gpu_shader5@execution@sampler_array_indexing@fs-simple
Reviewed-by: Matt Turner <mattst88@gmail.com>
Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>
Acked-by: Ben Widawsky <ben@bwidawsk.net>
Tested-by: Anuj Phogat <anuj.phogat@gmail.com>
According to the bspec, you're supposed to emit a PIPE_CONTROL with a CS
stall and a render target flush prior to changing STATE_BASE_ADDRESS. A
little experimentation, however, shows that this is not enough. It also
appears as if you have to flush the texture cache after changing the
base address or things won't propagate properly.
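So the sequence ends up something like this (hypothetical emit helpers;
the PIPE_CONTROL bits are the ones named above):

    struct pipe_control flush = { 0 };
    flush.render_target_cache_flush = true;
    flush.cs_stall = true;
    emit_pipe_control(batch, &flush);

    emit_state_base_address(batch, new_base);

    struct pipe_control inval = { 0 };
    inval.texture_cache_invalidate = true;
    emit_pipe_control(batch, &inval);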
We were returning the most recently freed BO, without checking if it
was idle yet. This meant that we generally stalled immediately on the
previous frame when generating a new one. Instead, allocate new BOs
when the *oldest* BO is still busy, so that the cache scales with how
much is needed to keep some frames outstanding, as originally
intended.
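A sketch of the reuse policy (list helpers hypothetical): the free list
is kept in order, and we only reuse from it once its head -- the oldest
BO -- has gone idle.

    struct cached_bo *oldest = free_list_head(cache);
    if (oldest && !bo_is_busy(oldest))
       return reuse_bo(cache, oldest);
    /* Cache is still hot; grow it rather than stalling. */
    return allocate_fresh_bo(size);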
Note that if you don't have some throttling happening, this means that
you can accidentally run the system out of memory. The kernel is now
applying some throttling on all execs, to hopefully avoid this.
This commit allows us to create a whole new surface state buffer when
the old one runs out of room. We simply re-emit the state base address for
the new state, re-emit binding tables, and keep going.
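Conceptually (hypothetical names):

    /* On overflow, swap in a fresh buffer, re-point the GPU at it, and
     * re-emit binding tables since their offsets are relative to the
     * new buffer. */
    if (stream_out_of_space(&cmd->surface_state_stream)) {
       swap_in_new_block(&cmd->surface_state_stream);
       emit_state_base_address(cmd);
       cmd->dirty |= DIRTY_ALL_BINDING_TABLES;
    }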
Before, we were emitting surface states up-front when binding tables were
updated. Now, we wait to emit the surface states until we emit the binding
table. This makes meta simpler and should make it easier to deal with
swapping out the surface state buffer.
AFAICT, there is no real way to make sure a send message with EOT is
properly ignored by compaction, nor can I see a way to actually encode
EOT while compacting. Before the single-send optimization we'd always
bail because we hit the is_immediate && !is_compactable_immediate case.
However, with single send, is_immediate is not true, and so we end up
trying to compact the uncompactable.
Without this, any compacting single send instruction will hang because the EOT
isn't there. I am not sure how I didn't hit this when I originally enabled the
optimization. I didn't check if some surrounding code changed.
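The guard amounts to something like this sketch:

    /* An instruction with EOT set must never be compacted, since the
     * compacted encoding has no way to express EOT. */
    if (brw_inst_eot(devinfo, src))
       return false;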
I know Neil and Matt were both looking into this. I did a quick search and
didn't see any patches out there to handle this. Please ignore if this has
already been sent by someone. (Direct me to it and I will review it).
Reported-by: Neil Roberts <neil@linux.intel.com>
Reported-by: Mark Janes <mark.a.janes@intel.com>
Tested-by: Mark Janes <mark.a.janes@intel.com>
Signed-off-by: Ben Widawsky <ben@bwidawsk.net>
Reviewed-by: Matt Turner <mattst88@gmail.com>
We need to make sure we use the right index into the dynamic offset
array. Dynamic descriptors may or may not be present in different
stages, so to get the right offset we need to compute the index at
vkCreateDescriptorSetLayout time.
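A sketch of that computation at layout-creation time (layout field
names hypothetical):

    uint32_t dynamic_offset_count = 0;
    for (uint32_t b = 0; b < pCreateInfo->bindingCount; b++) {
       const VkDescriptorSetLayoutBinding *binding =
          &pCreateInfo->pBindings[b];
       if (binding->descriptorType == VK_DESCRIPTOR_TYPE_UNIFORM_BUFFER_DYNAMIC ||
           binding->descriptorType == VK_DESCRIPTOR_TYPE_STORAGE_BUFFER_DYNAMIC) {
          layout->binding[b].dynamic_offset_index = dynamic_offset_count;
          dynamic_offset_count += binding->descriptorCount;
       }
    }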
Pure integer formats cannot be sampled with linear tex / mip filters; in
GL, such a setup would make the texture incomplete. We shouldn't rely on
the state tracker to filter that out, though; just return all zeros
instead of dying in the lerp.
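Conceptually the fallback is just (a sketch using gallium-style names):

    /* Pure-integer texels can't be lerped; return zeros instead of
     * crashing. */
    if (util_format_is_pure_integer(format) &&
        (min_filter == PIPE_TEX_FILTER_LINEAR ||
         mip_filter == PIPE_TEX_MIPFILTER_LINEAR)) {
       memset(rgba, 0, 4 * sizeof(float));
       return;
    }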
Reviewed-by: Jose Fonseca <jfonseca@vmware.com>