fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-07 00:38:48 +02:00

Author	SHA1	Message	Date
Tapani Pälli	39cdf1621e	glsl: type check between switch init-expression and case Patch adds a type check between switch init-expression and case label and performs a implicit signed->unsigned type conversion when possible. v2: add GLSL spec reference, do implicit conversion if possible (Matt) Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=79724 Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-06-17 08:13:28 +03:00
Tobias Klausmann	5357c14da4	nv50/ir: Remove NV50_SEMANTIC_VIEWPORTINDEX Use TGSI_SEMANTIC_VIEWPORT_INDEX for the last consumer. Signed-off-by: Tobias Klausmann <tobias.johannes.klausmann@mni.thm.de> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2014-06-16 23:08:32 -04:00
Tobias Klausmann	cd01e1667a	docs: update GL3.txt, relnotes: mark GL_ARB_viewport_array as done for nvc0 Signed-off-by: Tobias Klausmann <tobias.johannes.klausmann@mni.thm.de> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2014-06-16 23:08:32 -04:00
Tobias Klausmann	a2cb3a4a4f	nvc0: implement multiple viewports/scissors, enable ARB_viewport_array Signed-off-by: Tobias Klausmann <tobias.johannes.klausmann@mni.thm.de> [imirkin: mark things dirty on ctx switch, 3d blit] Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2014-06-16 23:08:03 -04:00
Ilia Mirkin	af05270ccf	nv50: make sure to mark first scissor dirty after blit Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "10.2" <mesa-stable@lists.freedesktop.org>	2014-06-16 23:08:03 -04:00
Kenneth Graunke	49659ad90c	i965: Use 8x4 aligned rectangles for HiZ operations on Broadwell. Like on Haswell, we need to use 8x4 aligned rectangle primitives for hierarchical depth buffer resolves and depth clears. See the comments in brw_blorp.cpp's brw_hiz_op_params() constructor. (The Broadwell documentation confirms that this is still necessary.) This patch makes the Broadwell code follow the same behavior as Chad and Jordan's Gen7 BLORP code. Based on a patch by Topi Pohjolainen. This fixes es3conform's framebuffer_blit_functionality_scissor_blit test, with no Piglit regressions. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Cc: "10.2" <mesa-stable@lists.freedesktop.org>	2014-06-16 17:23:21 -07:00
Kenneth Graunke	fa35b272a0	i965: Make INTEL_DEBUG=mip print out whether HiZ is enabled. We only enable HiZ for miplevels which are aligned on 8x4 blocks. When debugging HiZ failures, it's useful to know whether a particular miplevel is using HiZ or not. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2014-06-16 17:22:29 -07:00
Jordan Justen	380dd3be02	glsl/cs: Fix local_size_y and local_size_z flags.q.local_size has 3 bits. One each for x, y and z. Fixes piglit's: * spec/ARB_compute_shader/linker/mismatched_local_work_sizes * spec/ARB_compute_shader/compiler/default_local_size.comp * spec/ARB_compute_shader/compiler/work_group_size_too_large * spec/ARB_compute_shader/compiler/gl_WorkGroupSize_matches_layout.comp This was regressed in `738c9c3c`. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>	2014-06-16 09:54:52 -07:00
Jordan Justen	539cd92476	main/extensions: Only parse MESA_EXTENSION_OVERRIDE once Previously, we would parse MESA_EXTENSION_OVERRIDE each time a context was created. Now we will save the results of that parsing and use it during context initialization. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2014-06-16 09:54:52 -07:00
Jordan Justen	ac3e2efeff	main/extensions: Build list of extensions that can't be disabled This will allow us to utilize the early MESA_EXTENSION_OVERRIDE parsing at the later extension string initialization step. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2014-06-16 09:54:52 -07:00
Jordan Justen	863f57ee1b	main/extensions: Create extra extensions override string This will allow us to utilize the early MESA_EXTENSION_OVERRIDE parsing at the later extension string initialization step. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2014-06-16 09:54:52 -07:00
Jordan Justen	10e03b4401	i965/cs: Use override structure rather than separate env var In `25268b93`, we added a new environment variable (INTEL_COMPUTE_SHADER) to allow some constant values to be upgraded for the ARB_compute_shader extension. Now, we can look to see if the extension was enabled via the MESA_EXTENSION_OVERRIDE environment variable. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2014-06-16 09:54:52 -07:00
Jordan Justen	f5ca8c1972	main/extensions: Add early extension override structures During the early one_time_init phase of context creation, we initialize two global gl_extensions structures. We read the MESA_EXTENSION_OVERRIDE environment variable, and store positive and negative overrides in two structures: * struct gl_extensions _mesa_extension_override_enables * struct gl_extensions _mesa_extension_override_disables These are filled before the driver initializes extensions and constants, therefore the driver can make adjustments based on the desired overrides. This can be useful during development of a new extension where the extension is only partially ready. The driver can't actually advertise support for the extension, but if it sees that the override is set for the extension, then it can expose more supported parts of the extension, such as upgrading context constants. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2014-06-16 09:54:52 -07:00
Jordan Justen	8be64fb570	main/extensions: Create a context-less set_extensions function We will add new gl_extensions structures that capture the environment variable extension overrides and are available early in context creation. This will allow a driver to take actions during its initialization based on the extension overrides. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2014-06-16 09:54:52 -07:00
Jordan Justen	f2280eeba5	main/extensions: Don't advertise unknown extensions overrides with (-) Previously setting: MESA_EXTENSION_OVERRIDE=-GL_MESA_ham_sandwich Would cause Mesa to advertise support for the GL_MESA_ham_sandwich extension, even though the override specifically asked for it to be disabled. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2014-06-16 09:54:52 -07:00
Marek Olšák	41060a6095	radeonsi: fixup sizes of shader resource and sampler arrays This was wrong for a very long time. I wonder if the array size has any effect on anything. Reviewed-by: Christian König <christian.koenig@amd.com>	2014-06-16 16:55:57 +02:00
José Fonseca	7889469663	scons: Link libGL.so against xcb-dri2. Fixing undefined xcb_dri2_* symbols. Trivial.	2014-06-16 11:24:21 +01:00
Michel Dänzer	d6fd8a9771	r600g/radeonsi: Remove default case from PIPE_COMPUTE_CAP_* switch This way, the compiler warns about unhandled caps. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2014-06-16 15:56:29 +09:00
Tapani Pälli	5cb8fdb397	docs: update ARB_explicit_uniform_location status + modify release notes for 10.3 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Petri Latvala <petri.latvala@intel.com>	2014-06-16 06:49:59 +03:00
Tapani Pälli	f3750a2c86	Enable GL_ARB_explicit_uniform_location in the drivers. v2: enable also for i915 (Ian) Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Petri Latvala <petri.latvala@intel.com>	2014-06-16 06:49:59 +03:00
Tapani Pälli	e8fb8b1bb3	glsl: parser changes for GL_ARB_explicit_uniform_location Patch adds a preprocessor define for the extension and stores explicit location data for uniforms during AST->HIR conversion. It also sets layout token to be available when having the extension in place. v2: change parser check to require GLSL 330 or enabling GL_ARB_explicit_attrib_location (Ian) v3: fix the check and comment in AST->HIR (Petri) Signed-off-by: Tapani Pälli <tapani.palli@intel.com>	2014-06-16 06:49:59 +03:00
Tapani Pälli	8381f0f0c3	glsl: add enable bit for ARB_explicit_uniform_location Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-06-16 06:49:59 +03:00
Tapani Pälli	73f7c8636d	mesa: support inactive uniforms in glUniform* functions Support inactive uniforms that have explicit location set in glUniform* functions. v2: remove unnecessary extension check, use new define (Ian) Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-06-16 06:49:59 +03:00
Tapani Pälli	dd2a6519b9	glsl/linker: assign explicit uniform locations Patch refactors the existing uniform processing so explicit locations are taken in to account during variable processing. These locations are temporarily stored in gl_uniform_storage before actual locations are set. UNMAPPED_UNIFORM_LOC marks unset location so that we can use 0 as a valid explicit location. When locations are set, UniformRemapTable is first populated with uniforms that have explicit location set (inactive and active ones), rest are put after explicit location slots. v2: introduce define for locations that have not been set yet (Ian) Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-06-16 06:49:59 +03:00
Tapani Pälli	eca9d16048	glsl/linker: initialize explicit uniform locations Patch initializes the UniformRemapTable for explicit locations. This needs to happen before optimizations to make sure all inactive uniforms get their explicit locations correctly. v2: fix initialization bug, introduce define for inactive uniforms (Ian) Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-06-16 06:49:59 +03:00
Tapani Pälli	dadc3d04f0	glsl: add glsl_type::uniform_locations() helper function This function calculates the number of unique values from glGetUniformLocation for the elements of the type. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-06-16 06:49:59 +03:00
Tapani Pälli	bfe42ddd99	mesa: add new enum MAX_UNIFORM_LOCATIONS Patch adds new implementation dependent value required by the GL_ARB_explicit_uniform_location extension. Default value for user assignable locations is calculated as sum of MaxUniformComponents for each stage. v2: fix descriptor in get_hash_params.py (Petri) v3: simpler formula for calculating initial value (Ian) Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-06-16 06:49:59 +03:00
Tapani Pälli	d1a64aad16	mesa: add enable bit for ARB_explicit_uniform_location Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-06-16 06:49:59 +03:00
Tapani Pälli	bd5f1202fb	glapi: add GL_ARB_explicit_uniform_location Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-06-16 06:49:59 +03:00
Kenneth Graunke	5d8e246ac8	i965/vec4: Use the sampler for pull constant loads on Broadwell. We've used the LD sampler message for pull constant loads on earlier hardware for some time, and also were already using it for the FS on Broadwell. This patch makes us use it for Broadwell VS/GS as well. I believe that when I wrote this code in 2012, we still used the data port in some cases, and I somehow neglected to convert it while rebasing. Improves performance in GLBenchmark 2.7 Egypt by 416.978% +/- 2.25821% (n = 17). Many other applications should benefit similarly: this speeds up uniform array access in the VS, which is commonly used for skinning shaders, among other things. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Tested-by: Ben Widawsky <ben@bwidawsk.net> Cc: "10.2" <mesa-stable@lists.freedesktop.org>	2014-06-15 16:51:05 -07:00
Kenneth Graunke	847abaccc0	i965: Add missing newlines to a few perf_debug messages. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com> Cc: "10.2" <mesa-stable@lists.freedesktop.org>	2014-06-15 16:51:05 -07:00
Kenneth Graunke	d053a05ef3	i965: Drop Broadwell perf_debugs about missing MOCS that aren't missing. I actually added MOCS support for these things, but forgot to delete the corresponding perf_debug() warnings. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com> Cc: "10.2" <mesa-stable@lists.freedesktop.org>	2014-06-15 16:51:05 -07:00
Kenneth Graunke	7f256c1c70	i965: Add missing MOCS setup for 3DSTATE_INDEX_BUFFER on Broadwell. Somehow I missed this when adding all of the other MOCS values. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com> Cc: "10.2" <mesa-stable@lists.freedesktop.org>	2014-06-15 16:51:05 -07:00
Kenneth Graunke	d0575d98fc	i965/vec4: Fix dead code elimination for VGRFs of size > 1. When faced with code such as: mov vgrf31.0:UD, 960D mov vgrf31.1:UD, vgrf30.xxxx:UD The dead code eliminator didn't consider reg_offsets, so it decided that the second instruction was writing was writing to the same register as the first one, and eliminated the first one. But they're actually different registers. This fixes INTEL_DEBUG=shader_time for vertex shaders. In the above code, vgrf31.0 represents the offset into the shader_time buffer where the data should be written, and vgrf31.1 represents the actual time data. With a completely undefined offset, results were...unexpected. I think this is probably one of the few cases (maybe only case) where we generate multiple MOVs to a large VGRF. Normally, we just use them as texturing results; the other SEND-from-GRF uses a size 1 VGRF. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=79029 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com> Cc: mesa-stable@lists.freedesktop.org	2014-06-15 16:51:05 -07:00
Kenneth Graunke	d6a7a2606e	i965: Add SHADER_OPCODE_SHADER_TIME_ADD to dump_instructions() decode. "shader_time_add" is a lot more informative than "op152". Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-06-15 16:51:04 -07:00
Vinson Lee	4133c7126c	glsl: Fix clang mismatched-tags warnings with glsl_type. Fix clang mismatched-tags warnings introduced with commit `4f5445a45d`. ./glsl_symbol_table.h:37:1: warning: class 'glsl_type' was previously declared as a struct [-Wmismatched-tags] class glsl_type; ^ ./glsl_types.h:86:8: note: previous use is here struct glsl_type { ^ ./glsl_symbol_table.h:37:1: note: did you mean struct here? class glsl_type; ^~~~~ Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-06-15 13:34:18 -07:00
Vinson Lee	32c5544860	mesa/drivers: Fix clang constant-logical-operand warnings. This patch fixes several clang constant-logical-operand warnings such as the following. ../../../../../src/mesa/tnl_dd/t_dd_tritmp.h:130:32: warning: use of logical '\|\|' with constant operand [-Wconstant-logical-operand] if (DO_TWOSIDE \|\| DO_OFFSET \|\| DO_UNFILLED \|\| DO_TWOSTENCIL) ^ ~~~~~~~~~~~ ../../../../../src/mesa/tnl_dd/t_dd_tritmp.h:130:32: note: use '\|' for a bitwise operation if (DO_TWOSIDE \|\| DO_OFFSET \|\| DO_UNFILLED \|\| DO_TWOSTENCIL) ^~ \| Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-06-14 23:21:43 -07:00
Chris Forbes	4191cc4861	glsl: Correct more typos Signed-off-by: Chris Forbes <chrisf@ijw.co.nz>	2014-06-15 12:55:16 +12:00
Tom Stellard	ac26a562ed	radeon/compute: Always report at least 1 compute unit Some apps will abort if they detect 0 compute units. This fixes crashes in some OpenCV tests.	2014-06-13 21:32:34 -04:00
Jason Ekstrand	ffe609cc69	meta_blit: properly compute texture width for the CopyTexSubImage fallback Cc: "10.2" <mesa-stable@lists.freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-06-13 13:09:21 -07:00
Rob Clark	06e9536e5f	freedreno/a3xx: vtx formats Add support for more vertex buffer formats. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-06-13 15:20:34 -04:00
Rob Clark	ba6a490bbc	freedreno: update generated headers Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-06-13 15:20:34 -04:00
Rob Clark	3394900dd3	freedreno: try for more squarish tile dimensions Worth about ~0.5fps in xonotic, for example. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-06-13 15:20:34 -04:00
Rob Clark	6aeeb706d2	freedreno: fix for null textures Some apps seem to give us a null sampler/view for texture slots which come before the last used texture slot. In particular 0ad triggers this. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-06-13 15:20:34 -04:00
Roland Scheidegger	2ea8e2fccf	llvmpipe: increase number of queries which can be binned simultaneously to 64 Gallium (but not OpenGL) does allow nesting of queries, but there's no limit specified (d3d10 has no limit neither). Nevertheless, for practical purposes we need some limit in llvmpipe, otherwise we'd need more complex handling of queries as we need to keep track of all binned queries (this only affects queries which gather data past setup). A limit of 16 is too small though, while 64 would suffice. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2014-06-13 20:08:39 +02:00
Bruno Jiménez	03aab2af16	radeon/compute: Implement PIPE_COMPUTE_CAP_MAX_COMPUTE_UNITS v2: Add RADEON_INFO_ACTIVE_CU_COUNT as a define, as suggested by Tom Stellard Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2014-06-13 10:59:30 -04:00
Neil Roberts	b8d15ca5e8	Remove _mesa_is_type_integer and _mesa_is_enum_format_or_type_integer The comment for _mesa_is_type_integer is confusing because it says that it returns whether the type is an “integer (non-normalized)” format. I don't think it makes sense to say whether a type is normalized or not because it depends on what format it is used with. For example, GL_RGBA+GL_UNSIGNED_BYTE is normalized but GL_RGBA_INTEGER+GL_UNSIGNED_BYTE isn't. If the normalized comment is just a mistake then it still doesn't make much sense because it is missing the packed-pixel types such as GL_UNSIGNED_INT_5_6_5. If those were added then it effectively just returns type != GL_FLOAT. That function was only used in _mesa_is_enum_format_or_type_integer. This function effectively checks whether the format is non-normalized or the type is an integer. I can't think of any situation where that check would make sense. As far as I can tell neither of these functions have ever been used anywhere so we should just remove them to avoid confusion. These functions were added in `9ad8f431b2`. Reviewed-by: Brian Paul <brianp@vmware.com>	2014-06-13 15:54:46 +01:00
Bruno Jiménez	2a0dffa0c9	clover: query driver for the max number of compute units Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2014-06-12 19:09:32 -04:00
Bruno Jiménez	8f4d37889c	gallium: Add PIPE_COMPUTE_CAP_MAX_COMPUTE_UNITS Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2014-06-12 19:08:06 -04:00
Bruno Jiménez	4f70d83089	r600g/compute: solve a bug introduced by `2e01b8b440` That commit made possible that the items could be one just after the other when their size was a multiple of ITEM_ALIGNMENT. But compute_memory_prealloc_chunk still looked to leave a gap between items. Resulting in that we got an infinite loop when trying to add an item which would left no space between itself and the next item. Fixes piglit test: cl-custom-r600-create-release-buffer-bug And the test for alignment I have just sent: http://lists.freedesktop.org/archives/piglit/2014-June/011135.html Sorry about this. Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2014-06-12 15:52:08 -04:00

1 2 3 4 5 ...

63453 commits