fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-26 03:48:12 +02:00

Author	SHA1	Message	Date
Chia-I Wu	2811dba1d0	ilo: simplify setting of shader samplers and views Remove the special path that unbinds all samplers/views not in the range. Just make another call to unbind them.	2013-08-07 18:10:32 +08:00
Chia-I Wu	186dab5b8f	ilo: correctly check for stencil ref change I intended to do a memcmp(), not a memcpy()...	2013-08-07 18:00:46 +08:00
Zack Rusin	12522041d6	draw: fix slot detection Nowadays -1 for slots means that the semantic is not present, so we need to store it in a signed variables, otherwise <0 comparisons are pointless. Fixes http://bugzilla.eng.vmware.com/show_bug.cgi?id=67811 (at least with softpipe, edgeflags don't work wit llvmpipe) Signed-off-by: Zack Rusin <zackr@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2013-08-06 20:23:57 -04:00
Laurent Carlier	2572e3b4a1	gallivm: Fix build - Remove TargetOptions.RealignStack for llvm>=3.4 Since llvm -3.4svn r187618, TargetOptions doesn't provide RealignStack, so only enable it with llvm<3.4 This option must now be specified using function attributes, see LLVM commit r187618 Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2013-08-06 15:31:48 -07:00
Christoph Bumiller	2daf974cfe	nvc0: don't access array out of bounds on unexpected sample count	2013-08-06 22:29:33 +02:00
Emil Velikov	07c8f7a6f8	nv50: handle pure integer vertex attributes And as a side effect fix a crash in the following piglit test: general/attribs GL3 Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Cc: "9.2 and 9.1" mesa-stable@lists.freedesktop.org	2013-08-06 22:25:26 +02:00
Samuel Pitoiset	31caddb8d9	nvc0: implement MP performance counters for nvc0:nvd9	2013-08-06 22:24:30 +02:00
Samuel Pitoiset	9dcd7888e6	nvc0: implement compute support for nvc0 Tested on nvc0, nvc1, nvcf and nvd9.	2013-08-06 22:22:49 +02:00
Samuel Pitoiset	981b589101	nvc0: add more MP counters for nve4	2013-08-06 22:22:34 +02:00
Michel Dänzer	46b6f79fea	radeonsi: Number of SGPRs retrieved from LLVM already includes VCC Fixes spurious 'Assertion `num_sgprs <= 104' failed.' with shaders using all 104 SGPRs. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Christian König <christian.koenig@amd.com>	2013-08-06 12:50:01 +02:00
Vinson Lee	b57c1e4b86	llvmpipe: Do not need to free anything if there is no geometry shader. If gs is null, then freeing state->shader.tokens would result in a null dereference. Fixes "Dereference after null check" defect reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2013-08-05 21:54:20 -07:00
Vinson Lee	60b567ee59	nvc0: Initialize ptr for unexpected sample_count on release builds. Fixes "Uninitialized pointer read" defect reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2013-08-05 21:53:39 -07:00
Vinson Lee	8e850f2feb	draw: Change slot from unsigned to int. unfilled_stage::face_slot is of type int. Fixes "Unsigned compared against 0" defect reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2013-08-05 17:40:19 -07:00
Vinson Lee	8294d969e1	postprocess: Check ppq is null before calling pp_free_bos. pp_free_bos dereferences ppq without a null check. Fixes "Dereference before null check" defect reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2013-08-05 17:27:38 -07:00
Zack Rusin	a9cb914f49	draw: add back separate input assembler the issue is that stream output is run before the pipeline, which means that unless we decompose the primitives before the so then things crash. we could convert the entire stream output code into a pipeline stage but it will take a bit, so for now fix the crashes by simply re-adding the old input assembler which is run before the SO. Signed-off-by: Zack Rusin <zackr@vmware.com>	2013-08-03 02:57:40 -04:00
Zack Rusin	c9c211fae1	draw: implement proper primitive assembler as a pipeline stage we used to have a face primitive assembler that we ran after if the gs was missing but we had adjacency primitives in the pipeline, lets convert it to a pipeline stage, which allows us to use it to inject outputs (primitive id) into the vertices. it's also a lot cleaner because the decomposition is already handled for us. Signed-off-by: Zack Rusin <zackr@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2013-08-03 00:38:58 -04:00
Zack Rusin	8a94d15fba	draw: fix front face injection Inject front face only if the fragment shader uses it and propagate through all channels because otherwise we'll need to figure out the exact swizzle that the fs expects and it's just simpler to make sure all the components within the front face register are correctly set. Signed-off-by: Zack Rusin <zackr@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2013-08-03 00:36:39 -04:00
Brian Paul	4c9f12d69c	tgsi: remove unneeded File == TGSI_FILE_INPUT test We're already in an "if (File == TGSI_FILE_INPUT)" block at that point.	2013-08-05 10:25:08 -06:00
Brian Paul	3e4b5c6c9c	tgsi: clean up tgsi_scan_shader() function Replace "fulldecl->Semantic.Name/Index" with semName/semIndex. Simplify if/else logic for TGSI_FILE_OUTPUT code. Remove old comment. Fix indentation. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-08-05 10:11:33 -06:00
Zack Rusin	95829e2029	llvmpipe: fix frontface behavior again Lets make sure the frontface is 1 for front and -1 for back. Discussed with Roland and Jose. Signed-off-by: Zack Rusin <zackr@vmware.com>	2013-08-02 22:21:29 -04:00
Vinson Lee	0794f638ee	r600g/sb: Dump correct value for CND. Fixes "Copy-paste error" reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Vadim Girlin <vadimgirlin@gmail.com>	2013-08-04 13:49:17 -07:00
Ilia Mirkin	8edb79f1ef	nv50: fix some h264 interlaced decoding on vp2 Some videos specify mb_adaptive_frame_field_flag instead of field_pic_flag. This implies that the pic height needs to be halved, and this field needs to be passed to the VP engine. Cc: "9.2" mesa-stable@lists.freedesktop.org Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2013-08-03 12:52:04 +02:00
Zack Rusin	bff0d87668	llvmpipe: don't interpolate front face or prim id The loop was iterating over all the fs inputs and setting them to perspective interpolation, then after the loop we were creating extra output slots with the correct interpolation. Instead of injecting bogus extra outputs, just set the interpolation on front face and prim id correctly when doing the initial scan of fs inputs. Signed-off-by: Zack Rusin <zackr@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2013-08-02 20:12:53 -04:00
Zack Rusin	8e77e5e543	draw: make sure clipping works with injected outputs clipping would drop the extra outputs because it always used the number of standard vertex shader outputs, without geometry shader or extra outputs. The commit makes sure that clipping with geometry shaders which have more outputs than the current vertex shader and with extra outputs correctly propagates the entire vertex. Signed-off-by: Zack Rusin <zackr@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-08-02 20:11:18 -04:00
Zack Rusin	d6b3a193d4	draw: inject frontface info into wireframe outputs Draw module can decompose primitives into wireframe models, which is a fancy word for 'lines', unfortunately that decomposition means that we weren't able to preserve the original front-face info which could be derived from the original primitives (lines don't have a 'face'). To fix it allow draw module to inject a fake face semantic into outputs from which the backends can figure out the original frontfacing info of the primitives. Signed-off-by: Zack Rusin <zackr@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-08-02 20:11:18 -04:00
Zack Rusin	05487ef88d	draw: stop crashing with extra shader outputs Draw sometimes injects extra shader outputs (aa points, lines or front face), unfortunately most of the pipeline and llvm code didn't handle them at all. It only worked if number of inputs happened to be bigger or equal to the number of shader outputs plus the extra injected outputs. In particular when running the pipeline which depends on the vertex_id in the vertex_header things were completely broken. The patch adjust the code to correctly use the total number of shader outputs (the standard ones plus the injected ones) to make it all stop crashing and work. Signed-off-by: Zack Rusin <zackr@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2013-08-02 20:11:18 -04:00
Zack Rusin	2e46a1dcb3	draw: use the vertex size Instead of using the magical 4 use the above computed vertex size. Doesn't change the behavior, just makes the code a bit cleaner. Signed-off-by: Zack Rusin <zackr@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-08-02 20:11:18 -04:00
Zack Rusin	da1a74f673	draw/llvm: add some extra debugging output when dumping shader outputs it's nice to have the integer values of the outputs, in particular because some values are integers. Signed-off-by: Zack Rusin <zackr@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-08-02 20:11:18 -04:00
Zack Rusin	36096af026	tgsi: detect prim id and front face usage in fs Adding code to detect the usage of prim id and front face semantics in fragment shaders. Signed-off-by: Zack Rusin <zackr@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-08-02 20:11:18 -04:00
Zack Rusin	2da1daaa4e	tgsi: add ucmp to the list of opcodes we forgot to add ucmp to the list of opcodes, so it was never generated for ureg. Signed-off-by: Zack Rusin <zackr@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-08-02 19:08:39 -04:00
Zack Rusin	2d15f4746b	llvmpipe: make the front-face behavior match the gallium spec The spec says that front-face is true if the value is >0 and false if it's <0. To make sure that we follow the spec, lets just subtract 0.5 from our value (llvmpipe did 1 for frontface and 0 otherwise), which will get us a positive num for frontface and negative for backface. Signed-off-by: Zack Rusin <zackr@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-08-02 15:50:16 -04:00
Christoph Bumiller	957a2014f9	r600g: honour semantic index in fragment color exports Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2013-08-02 13:32:49 +02:00
Roland Scheidegger	e7ed70a52e	gallivm: obey clarified shift behavior llvm shifts are undefined for shift counts exceeding (or matching) bit width, so need to apply a mask for the tgsi shift instructions. v2: only use mask for the tgsi shift instructions, not for the build shift helpers. None of the internal callers need this behavior, and while llvm can optimize away the masking for constants there are legitimate cases where it might not be able to do so even if we know that shift count must be smaller than type width (currently all such callers do not use the build shift helpers). Reviewed-by: Zack Rusin <zackr@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-08-02 03:49:57 +02:00
Roland Scheidegger	7a72bef47e	tgsi: obey clarified shift behavior c shifts are undefined for shift counts exceeding (or matching) bit width, so need to apply a mask (on x86 it actually would usually probably work as shifts do masking on int domain shifts - unless some auto-vectorizer would come along at last as simd domain does not mask the shift count). Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-08-02 03:49:57 +02:00
Roland Scheidegger	606132b4de	gallium: clarify shift behavior with shift count >= 32 Previously, nothing was said what happens with shift counts exceeding bit width of the values to shift. In theory 3 behaviors are possible: 1) undefined (classic c definition) 2) just shift out all bits (so result is zero, or -1 potentially for ashr) 3) mask the shift count to bit width - 1 API's either require 3) or are ok with 1). In particular, GLSL (as well as a couple uninteresting legacy GL extensions) is happy with undefined, whereas both OpenCL and d3d10 require 3). Consequently, most hw also implements 3). So, for simplicity we just specify that 3) is required rather than saying undefined and then needing state trackers to work around it. Also while here specify shift count as a vector, not scalar. As far as I can tell this was a doc bug, neither state trackers nor drivers used scalar shift count. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-08-02 03:49:57 +02:00
Samuel Pitoiset	ef6d5ee9f3	nvc0: properly align NVE4_COMPUTE_MP_TEMP_SIZE MP_TEMP_SIZE must be aligned to 0x8000, while TEMP_SIZE on NVE4_3D must be aligned to 0x20000, so perform both alignments to be sure we allocate enough space (actually the bo will most likely use 128 KiB pages and not aligning to that would be a waste anyway). Cc: "9.2" mesa-stable@lists.freedesktop.org	2013-07-31 21:40:38 +02:00
Roland Scheidegger	b1ed7202df	gallivm: use nearest rounding for float->unorm24 conversion Previously we were using truncation, which gives the correct result only for numbers in [0.5-1.0] range (because there's no mantissa bits to do any rounding there). This is frequently hit (and probably only used there) when converting fragment depth to depth format (d24s8 etc.) or otherwise dealing with depth format. v2: as spotted by Jose, get rid of extra type (src_type is already unsigned). Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-07-31 17:09:02 +02:00
Brian Paul	fdbd6a5033	gallium/util: reformat, comment util_get_offset() Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2013-07-31 06:53:48 -06:00
Brian Paul	30f1770cb1	gallium/util: comments, var renaming in u_inlines.h The variable 'usage' was being used for two different things. Sometimes for PIPE_USAGE_x and other times for PIPE_TRANSFER_x. This renames usage to access when we're talking about PIPE_TRANSFER_x flags. Plus, add a bunch of comments to remind us what's going on. Also, use unsigned for PIPE_TRANSFER_x bitmask to be consistent with other places. And add a missing const qualifier. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2013-07-31 06:53:48 -06:00
Brian Paul	365f38f3df	softpipe: use new softpipe_resource_data() accessor We should probably be using map()/unmap() when accessing resource data, but this is a little better. v2: assert that the resource is not a display target, per Jose. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2013-07-31 06:53:48 -06:00
Brian Paul	99c42d11a2	softpipe: don't ignore pipe_constant_buffer::buffer_offset This was never a problem since the Mesa state tracker always gives us a user-space constant buffer with buffer_offset=0. But if another state tracker ever gave us a "HW" constant buffer with non-zero buffer_offset we'd mis-render. Also, use the correct buffer size. And move an assertion to the top of the function. Reviewed-by: Marek Olšák <maraeo@gmail.com> Reviewed-by: José Fonseca <jfonseca@vmware.com>	2013-07-31 06:53:48 -06:00
Brian Paul	089ef37eab	gallium/docs: clarify definition of PIPE_CAP_USER_CONSTANT_BUFFERS, etc The cap means _can_ accept user-space constant buffers; it doesn't mean _only_ accepts user-space constant buffers. v2: also update the PIPE_CAP_USER_VERTEX_BUFFERS and PIPE_CAP_USER_INDEX_BUFFERS descriptions as well. Per Jose. Reviewed-by: Marek Olšák <maraeo@gmail.com> Reviewed-by: José Fonseca <jfonseca@vmware.com>	2013-07-31 06:53:48 -06:00
Marek Olšák	7568a89500	st/dri: add a new driconf option disable_shader_bit_encoding for Unigine Now Unigine Heaven 3.0 finally works with r600g. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2013-07-30 23:31:30 +02:00
Marek Olšák	0f6a7cb00c	mesa,glsl,st/dri: add a new driconf option force_glsl_version for Unigine See documentation in mtypes.h. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2013-07-30 23:31:28 +02:00
Marek Olšák	bc4f0b6bac	st/dri: remove driOptionCache from dri_context in favor of dri_screen There is no reason to have this duplicated. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2013-07-30 23:31:24 +02:00
Marek Olšák	dda936e057	st/dri: move enabling postprocessing to dri_screen The driconf options are global. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2013-07-30 23:31:24 +02:00
Marek Olšák	772070527f	st/dri: remove more unused driconf options vblank_mode is read by dri_util.c and falls under the "dri2" driver name, which is not connected to the actual Mesa/Gallium driver in any way. Reviewed-by: Brian Paul <brianp@vmware.com>	2013-07-30 23:31:24 +02:00
Marek Olšák	83dbe61ea4	st/dri: implement the driconf option force_s3tc_enable properly Reviewed-by: Brian Paul <brianp@vmware.com>	2013-07-30 23:31:24 +02:00
Marek Olšák	f27f3a4b15	driconf: remove the unused option allow_large_textures Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2013-07-30 23:31:23 +02:00
Marek Olšák	2acc27cc6d	st/dri: support the driconf option disable_blend_func_extended This is needed for Unigine. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2013-07-30 23:31:23 +02:00

1 2 3 4 5 ...

18948 commits