fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-01-27 05:30:24 +01:00

Author	SHA1	Message	Date
Ilia Mirkin	71a489633b	gm107/ir: add ATOM and CCTL support Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2016-01-20 19:37:34 -05:00
Ilia Mirkin	57b0025814	gm107/ir: set LD/ST address width bit Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2016-01-20 19:37:34 -05:00
Ilia Mirkin	2e533ab74b	gk110/ir: fix double-wide vm address	2016-01-20 19:37:34 -05:00
Ilia Mirkin	8c2dfe05c5	gk110/ir: add OP_CCTL handling	2016-01-20 19:37:33 -05:00
Ilia Mirkin	7d9a97d6be	gk110/ir: add atomic op emission, fix gmem loads Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2016-01-20 19:37:33 -05:00
Chad Versace	5ce5a7d021	anv/image: Stop including gen8_pack.h in common file	2016-01-20 15:42:17 -08:00
Chad Versace	8ab527de03	isl: Add a README Most of the file-level comment in isl.h is moved to the README.	2016-01-20 15:24:40 -08:00
Roland Scheidegger	dc8b9bd0aa	llvmpipe: warn about illegal use of objects in different contexts Doing that is clearly a bug. We can't quite assert as st/mesa may hit this, but increase at least visibility of it a bit. (For the non-refcounted objects it would be illegal too, but we can't detect that unless we'd store the context ourselves. Plus, those don't tend to cause random crashes at context or object destruction time... So just sampler views, surfaces and so targets for now.) Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2016-01-21 00:09:55 +01:00
Roland Scheidegger	e925ec8811	llvmpipe,i915: add back NEW_RASTERIZER dependency when computing vertex info I removed this mistakenly in `2dbc20e456`. I actually thought it should not be necessary and a piglit run didn't show any differences, but this shouldn't have been in there. draw_prepare_shader_outputs() is in fact dependent on NEW_RASTERIZER. The new polygon-mode-facing test indeed shows why this is necessary, there's lots of invalid reads and writes with valgrind (also crashes without valgrind), because the pre-pipeline vertex size doesn't match the post-pipeline vertex size (note this won't help much with stages which don't have the prepare hook which can grow the vertex size, in particular the wide point stage, but this isn't used by llvmpipe). The test still won't pass, of course, but it is only usage of uninitialized values now, which is much less dangerous... (Albeit I'm pretty sure for i915 it really is not needed anymore as it doesn't care about the extra outputs and doesn't call draw_prepare_shader_outputs().) Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2016-01-21 00:09:55 +01:00
Ilia Mirkin	dc3ac418bf	nv50/ir: don't flip SHL(ADD) into ADD(SHL) if ADD sources have modifiers Fixes: `31fde8fa` (nv50/ir: flip shl(add, imm) into add(shl, imm)) Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2016-01-20 18:03:36 -05:00
Kristian Høgsberg Kristensen	7b7a7c2bfc	vk: Make maxSamplerAllocationCount more reasonable We can't allocate 4 billion samplers. Let's go with 64k.	2016-01-20 14:36:52 -08:00
Ilia Mirkin	3a63576168	gk110/ir: fix load from shared memory It was accidentally using the store opcode. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2016-01-20 17:16:09 -05:00
Ilia Mirkin	9f23007a7a	gk110/ir: add partial BAR support This is enough for the plain TGSI BARRIER implementation. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2016-01-20 17:16:09 -05:00
Kristian Høgsberg Kristensen	8ef002dd7a	vk/tests: Add stub for anv_gem_get_bit6_swizzle()	2016-01-20 13:47:40 -08:00
Kristian Høgsberg Kristensen	420e8664cb	vk/tests: Add isl include path	2016-01-20 13:47:40 -08:00
Kenneth Graunke	b76e4458f9	nir/spirv/glsl450: Use fabs not iabs in ldexp. This was just wrong.	2016-01-20 12:18:02 -08:00
Tapani Pälli	f1152c3455	Revert "glsl: move uniform calculation to link_uniforms" This reverts commit `4475d8f916`.	2016-01-20 22:04:46 +02:00
Kristian Høgsberg Kristensen	947ebd9c71	isl: Add ish.h to libsil_la_SOURCES	2016-01-20 12:03:46 -08:00
Jason Ekstrand	21b2d87408	nir/spirv/glsl450: Implement FrexpStruct	2016-01-20 11:36:41 -08:00
Jason Ekstrand	c7896d1868	spirv/nir/glsl450: Use vtn_create_ssa_value to create SSA values	2016-01-20 11:36:26 -08:00
Jason Ekstrand	e45748bade	anv/device: Default to scalar GS on BDW+	2016-01-20 11:16:44 -08:00
Jason Ekstrand	34f9a5f301	nir/spirv: Pull texture dimensionality out of the image when available	2016-01-20 11:11:30 -08:00
Jason Ekstrand	59ef7c6507	anv/meta: fix UpdateBuffer in the case where we do multiple updates	2016-01-20 07:56:48 -08:00
Jason Ekstrand	a0516cfbac	anv/meta: Fix a finishme	2016-01-20 07:33:41 -08:00
Tapani Pälli	4475d8f916	glsl: move uniform calculation to link_uniforms Patch moves uniform calculation to happen during link_uniforms, this is possible with help of UniformRemapTable that has all the reserved locations. Location assignment for implicit locations is changed so that we utilize also the 'holes' that explicit uniform location assignment might have left in UniformRemapTable, this makes it possible to fit more uniforms as previously we were lazy here and wasting space. Fixes following CTS tests: ES31-CTS.explicit_uniform_location.uniform-loc-mix-with-implicit-max ES31-CTS.explicit_uniform_location.uniform-loc-mix-with-implicit-max-array v2: code cleanups, increment NumUniformRemapTable correctly, fix find_empty_block to work properly and add some more comments. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Marta Lofstedt <marta.lofstedt@intel.com>	2016-01-20 07:24:39 +02:00
Timothy Arceri	0a6a05c8ea	glsl: add missing explicit_image_format flag to has_layout() Fixes piglit regression after fixes to duplicate layout rules. Previously catching multiple layouts was relying on the code meant to catch duplicates within a single layout(...), this change triggers the rules for multiple layouts. Cc: Mark Janes <mark.a.janes@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2016-01-20 15:45:56 +11:00
Jason Ekstrand	c7203aa621	nir/spirv: Move OpPhi handling to vtn_cfg.c Phi handling is somewhat intrinsically tied to the CFG. Moving it here makes it a bit easier to handle that. In particular, we can now do SSA repair after we've done the phi node second-pass. This fixes 6 CTS tests.	2016-01-19 19:00:00 -08:00
Jason Ekstrand	891564adb9	nir/spirv: Handle OpLine and OpNoLine in foreach_instruction This way we don't have to explicitly handle them everywhere.	2016-01-19 19:00:00 -08:00
Kenneth Graunke	e79f8a4926	nir: Lower ldexp to arithmetic. This is a port of Matt's GLSL IR lowering pass to NIR. It's required because we translate SPIR-V directly to NIR, bypassing GLSL IR. I haven't introduced a lower_ldexp flag, as I believe all current NIR consumers would set the flag. i965 wants this, vc4 doesn't implement this feature, and st_glsl_to_tgsi currently lowers ldexp unconditionally anyway. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2016-01-19 18:10:30 -08:00
Kenneth Graunke	b3cc10f3b2	nir: Let nir_opt_algebraic rules contain unsigned constants > INT_MAX. struct.pack('i', val) interprets `val` as a signed integer, and dies if `val` > INT_MAX. For larger constants, we need to use 'I' which interprets it as an unsigned value. This patch makes us use 'I' for all values >= 0, and 'i' for negative values. This should work in all cases. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2016-01-19 18:10:30 -08:00
Jason Ekstrand	eb2a119da2	anv/meta: Implement UpdateBuffer	2016-01-19 16:53:35 -08:00
Jason Ekstrand	0ae1bd321e	anv/meta: Implement CmdFillBuffer	2016-01-19 16:53:35 -08:00
Jason Ekstrand	46eef31311	anv/meta_clear: Call emit_clear directly in ClearImage Using the load op means that we end up with recursive meta. We shouldn't be doing that.	2016-01-19 16:53:35 -08:00
Jason Ekstrand	6325a75011	anv/meta_clear: Do save/restore in actual entry points	2016-01-19 16:53:35 -08:00
Jason Ekstrand	56dbf13045	anv: Add support for VK_WHOLE_SIZE several places	2016-01-19 16:53:35 -08:00
Kenneth Graunke	549be68258	nir/spirv/glsl450: Implement Frexp.	2016-01-19 16:46:03 -08:00
Roland Scheidegger	b21973acaa	llvmpipe: turn depth clears into full depth/stencil clears for d24x8 formats If we have a d24x8 format, there is no stencil. Therefore, we can always clear these bits too, which means this will be some kind of memset rather than read-modify-write. This is good for some 7% increase or so in gears with huge window size - seems to have a bigger effect if things aren't in caches. Of course, any real app won't spend nearly as much time comparatively in clearing depth buffer in the first place, so the speedup will be much lower. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2016-01-20 01:45:56 +01:00
Kenneth Graunke	68c9ca1a94	nir/spirv/glsl450: Blindly implement Atan2. This is untested and probably broken. We already passed the atan2 CTS tests before implementing this opcode. Presumably, glslang or something was giving us a plain Atan opcode instead of Atan2. I don't know why.	2016-01-19 16:14:05 -08:00
Kenneth Graunke	2ab3efa0ad	nir/spirv/glsl450: Implement Atan.	2016-01-19 16:14:05 -08:00
Kenneth Graunke	bc9d9bc2e3	nir/spirv/glsl450: Implement Asin and Acos.	2016-01-19 16:14:05 -08:00
Francisco Jerez	f8ac314cc2	i965: Implement compute sampler state atom. Fixes a number of GLES31 CTS failures and hangs on various hardware: ES31-CTS.texture_gather.plain-gather-depth-2d ES31-CTS.texture_gather.plain-gather-depth-2darray ES31-CTS.texture_gather.plain-gather-depth-cube ES31-CTS.texture_gather.offset-gather-depth-2d ES31-CTS.texture_gather.offset-gather-depth-2darray ES31-CTS.layout_binding.sampler2D_layout_binding_texture_ComputeShader ES31-CTS.layout_binding.sampler2DArray_layout_binding_texture_ComputeShader ES31-CTS.explicit_uniform_location.uniform-loc-types-samplers ES31-CTS.compute_shader.resources-texture Some of them were actually passing by luck on some generations even though we weren't uploading sampler state tables explicitly for the compute stage, most likely because they relied on the cached sampler state left from previous rendering to be close enough. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=92589 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=93312 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=93325 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=93407 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=93725 Reported-by: Marta Lofstedt <marta.lofstedt@intel.com> Reviewed-by: Marta Lofstedt <marta.lofstedt@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2016-01-19 16:11:04 -08:00
Francisco Jerez	9e4c8acd78	i965: Trigger CS state reemission when new sampler state is uploaded. This reuses the NEW_SAMPLER_STATE_TABLE state bit (currently only used on pre-Gen7 hardware) to signal that the sampler state tables have changed in order to make sure that the GPGPU interface descriptor is updated. Reviewed-by: Marta Lofstedt <marta.lofstedt@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2016-01-19 16:11:04 -08:00
Kenneth Graunke	4fc018576b	glsl: Don't abbreviate tessellation shader stage names. I have a patch that writes shaders as .shader_test files, and it uses this function to create the headers (i.e. [vertex shader]). [tess ctrl shader] isn't a valid shader_runner header - it's spelled out as [tessellation control shader]. There's no real reason to abbreviate it, so spell it out. v2: Rebase on Rob's patches to move the code. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2016-01-19 14:57:42 -08:00
Timothy Arceri	11fc7ad62e	mesa: remove link validation that should be done elsewhere Even if re-linking fails rendering shouldn't fail as the previous succesfully linked program will still be available. It also shouldn't be possible to have an unlinked program as part of the current rendering state. This fixes a subtest in: ES31-CTS.sepshaderobjs.StateInteraction This change should improve performance on CPU limited benchmarks as noted in commit `d6c6b186cf`. >From Section 7.3 (Program Objects) of the OpenGL 4.5 spec: "If a program object that is active for any shader stage is re-linked unsuccessfully, the link status will be set to FALSE, but any existing executables and associated state will remain part of the current rendering state until a subsequent call to UseProgram, UseProgramStages, or BindProgramPipeline removes them from use. If such a program is attached to any program pipeline object, the existing executables and associated state will remain part of the program pipeline object until a subsequent call to UseProgramStages removes them from use. An unsuccessfully linked program may not be made part of the current rendering state by UseProgram or added to program pipeline objects by UseProgramStages until it is successfully re-linked." "void UseProgram(uint program); ... An INVALID_OPERATION error is generated if program has not been linked, or was last linked unsuccessfully. The current rendering state is not modified." V2: apply the rule to both core and compat. Cc: Tapani Pälli <tapani.palli@intel.com> Cc: Brian Paul <brianp@vmware.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-01-20 09:35:04 +11:00
Timothy Arceri	6a660a5f5d	glsl: allow multiple layout qualifiers for a single declaration From the ARB_shading_language_420pack spec: "More than one layout qualifier may appear in a single declaration. If the same layout-qualifier-name occurs in multiple layout qualifiers for the same declaration, the last one overrides the former ones." The parser was already failing correctly when the extension is not available but testing for duplicates within a single layout qualifier was still causing this to fail when available as both cases share the same function for merging. Here we add a parameter to differentiate between the two uses and apply it to the duplicate test. Acked-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>	2016-01-20 08:06:50 +11:00
Timothy Arceri	564009986f	glsl: update parser to allow duplicate default layout qualifiers In order to only create a single node for each default declaration we add a new boolean parameter to the in/out merge function to only create one once we reach the rightmost layout qualifier. From the ARB_shading_language_420pack spec: "More than one layout qualifier may appear in a single declaration. If the same layout-qualifier-name occurs in multiple layout qualifiers for the same declaration, the last one overrides the former ones." Acked-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>	2016-01-20 08:06:45 +11:00
Timothy Arceri	a0a93470e3	glsl: move default layout qualifier rules out of the parser Acked-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>	2016-01-20 08:06:40 +11:00
Timothy Arceri	fd612e4547	glsl: split layout_defaults into specific types This will allow merging of duplicate layout qualifiers as allowed by ARB_shading_language_420pack Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>	2016-01-20 08:06:35 +11:00
Timothy Arceri	c8b8c578d1	glsl: allow duplicate layout-qualifier-names This is added by ARB_enhanced_layouts although it doesn't fit into any of the six main changes so we enable this independently. From the ARB_enhanced_layouts spec: "More than one layout qualifier may appear in a single declaration. Additionally, the same layout-qualifier-name can occur multiple times within a layout qualifier or across multiple layout qualifiers in the same declaration" Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>	2016-01-20 08:06:29 +11:00
Matt Turner	866a6bf9f7	i965/vec4: Spaces around operators.	2016-01-19 12:12:38 -08:00

... 96 97 98 99 100 ...

82384 commits