fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-23 00:20:09 +01:00

Author	SHA1	Message	Date
Chia-I Wu	02496cd2b6	ilo: fold gen6_translate_index_size into the caller There is only one caller so fold it.	2013-08-08 13:10:36 +08:00
Chia-I Wu	1c19d0bb81	ilo: fold gen6_translate_depth_format into the caller There is only one caller so fold it.	2013-08-08 13:02:17 +08:00
Courtney Goeltzenleuchter	c2c5366ff2	ilo: Call GPE emit functions directly. Eliminate pipeline and GPE function vectors and have the pipeline functions call the GPE emit functions directly.	2013-08-08 11:39:21 +08:00
Courtney Goeltzenleuchter	4bc9daf923	ilo: move emit functions so that they can be inlined.	2013-08-08 11:39:21 +08:00
Tom Stellard	d0c13fba17	r300g/compiler/tests: Pass the required LDFLAGS when building the test program CC: "9.2 <mesa-stable@lists.freedesktop.org>"	2013-08-07 17:28:19 -07:00
Tom Stellard	d691ba4d94	r300g/compiler/tests: Fix segfault CC: "9.2" <mesa-stable@lists.freedesktop.org>	2013-08-07 17:27:23 -07:00
Chia-I Wu	79b868fea1	ilo: speed up 3DSTATE_VERTEX_BUFFERS emission a bit Ignore vbuffer_mask which does not gain us anything.	2013-08-07 23:13:50 +08:00
Chia-I Wu	7ce3cbaacf	ilo: skip state emission when reducing sampler count When the number of sampler states bound is reduced, we are good to keep referencing the old SAMPLER_STATE array and skip emitting a new one.	2013-08-07 23:13:44 +08:00
Chia-I Wu	2811dba1d0	ilo: simplify setting of shader samplers and views Remove the special path that unbinds all samplers/views not in the range. Just make another call to unbind them.	2013-08-07 18:10:32 +08:00
Chia-I Wu	186dab5b8f	ilo: correctly check for stencil ref change I intended to do a memcmp(), not a memcpy()...	2013-08-07 18:00:46 +08:00
Zack Rusin	12522041d6	draw: fix slot detection Nowadays -1 for slots means that the semantic is not present, so we need to store it in a signed variables, otherwise <0 comparisons are pointless. Fixes http://bugzilla.eng.vmware.com/show_bug.cgi?id=67811 (at least with softpipe, edgeflags don't work wit llvmpipe) Signed-off-by: Zack Rusin <zackr@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2013-08-06 20:23:57 -04:00
Christoph Bumiller	2daf974cfe	nvc0: don't access array out of bounds on unexpected sample count	2013-08-06 22:29:33 +02:00
Emil Velikov	07c8f7a6f8	nv50: handle pure integer vertex attributes And as a side effect fix a crash in the following piglit test: general/attribs GL3 Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Cc: "9.2 and 9.1" mesa-stable@lists.freedesktop.org	2013-08-06 22:25:26 +02:00
Samuel Pitoiset	31caddb8d9	nvc0: implement MP performance counters for nvc0:nvd9	2013-08-06 22:24:30 +02:00
Samuel Pitoiset	9dcd7888e6	nvc0: implement compute support for nvc0 Tested on nvc0, nvc1, nvcf and nvd9.	2013-08-06 22:22:49 +02:00
Samuel Pitoiset	981b589101	nvc0: add more MP counters for nve4	2013-08-06 22:22:34 +02:00
Michel Dänzer	46b6f79fea	radeonsi: Number of SGPRs retrieved from LLVM already includes VCC Fixes spurious 'Assertion `num_sgprs <= 104' failed.' with shaders using all 104 SGPRs. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Christian König <christian.koenig@amd.com>	2013-08-06 12:50:01 +02:00
Vinson Lee	b57c1e4b86	llvmpipe: Do not need to free anything if there is no geometry shader. If gs is null, then freeing state->shader.tokens would result in a null dereference. Fixes "Dereference after null check" defect reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2013-08-05 21:54:20 -07:00
Vinson Lee	60b567ee59	nvc0: Initialize ptr for unexpected sample_count on release builds. Fixes "Uninitialized pointer read" defect reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2013-08-05 21:53:39 -07:00
Zack Rusin	95829e2029	llvmpipe: fix frontface behavior again Lets make sure the frontface is 1 for front and -1 for back. Discussed with Roland and Jose. Signed-off-by: Zack Rusin <zackr@vmware.com>	2013-08-02 22:21:29 -04:00
Vinson Lee	0794f638ee	r600g/sb: Dump correct value for CND. Fixes "Copy-paste error" reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Vadim Girlin <vadimgirlin@gmail.com>	2013-08-04 13:49:17 -07:00
Ilia Mirkin	8edb79f1ef	nv50: fix some h264 interlaced decoding on vp2 Some videos specify mb_adaptive_frame_field_flag instead of field_pic_flag. This implies that the pic height needs to be halved, and this field needs to be passed to the VP engine. Cc: "9.2" mesa-stable@lists.freedesktop.org Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2013-08-03 12:52:04 +02:00
Zack Rusin	bff0d87668	llvmpipe: don't interpolate front face or prim id The loop was iterating over all the fs inputs and setting them to perspective interpolation, then after the loop we were creating extra output slots with the correct interpolation. Instead of injecting bogus extra outputs, just set the interpolation on front face and prim id correctly when doing the initial scan of fs inputs. Signed-off-by: Zack Rusin <zackr@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2013-08-02 20:12:53 -04:00
Zack Rusin	d6b3a193d4	draw: inject frontface info into wireframe outputs Draw module can decompose primitives into wireframe models, which is a fancy word for 'lines', unfortunately that decomposition means that we weren't able to preserve the original front-face info which could be derived from the original primitives (lines don't have a 'face'). To fix it allow draw module to inject a fake face semantic into outputs from which the backends can figure out the original frontfacing info of the primitives. Signed-off-by: Zack Rusin <zackr@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-08-02 20:11:18 -04:00
Zack Rusin	2d15f4746b	llvmpipe: make the front-face behavior match the gallium spec The spec says that front-face is true if the value is >0 and false if it's <0. To make sure that we follow the spec, lets just subtract 0.5 from our value (llvmpipe did 1 for frontface and 0 otherwise), which will get us a positive num for frontface and negative for backface. Signed-off-by: Zack Rusin <zackr@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-08-02 15:50:16 -04:00
Christoph Bumiller	957a2014f9	r600g: honour semantic index in fragment color exports Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2013-08-02 13:32:49 +02:00
Samuel Pitoiset	ef6d5ee9f3	nvc0: properly align NVE4_COMPUTE_MP_TEMP_SIZE MP_TEMP_SIZE must be aligned to 0x8000, while TEMP_SIZE on NVE4_3D must be aligned to 0x20000, so perform both alignments to be sure we allocate enough space (actually the bo will most likely use 128 KiB pages and not aligning to that would be a waste anyway). Cc: "9.2" mesa-stable@lists.freedesktop.org	2013-07-31 21:40:38 +02:00
Brian Paul	365f38f3df	softpipe: use new softpipe_resource_data() accessor We should probably be using map()/unmap() when accessing resource data, but this is a little better. v2: assert that the resource is not a display target, per Jose. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2013-07-31 06:53:48 -06:00
Brian Paul	99c42d11a2	softpipe: don't ignore pipe_constant_buffer::buffer_offset This was never a problem since the Mesa state tracker always gives us a user-space constant buffer with buffer_offset=0. But if another state tracker ever gave us a "HW" constant buffer with non-zero buffer_offset we'd mis-render. Also, use the correct buffer size. And move an assertion to the top of the function. Reviewed-by: Marek Olšák <maraeo@gmail.com> Reviewed-by: José Fonseca <jfonseca@vmware.com>	2013-07-31 06:53:48 -06:00
Marek Olšák	4dfe1a0df5	Revert "r300g: Give CLIP_DISABLE another try" This reverts commit `e866bd1ade`. https://bugs.freedesktop.org/show_bug.cgi?id=57875 Cc: mesa-stable@lists.freedesktop.org	2013-07-30 22:36:20 +02:00
Jonathan Charest	4f8048bb5a	r600g/compute: Added missing address space checking of kernel parameters To have non-static buffers in local memory, it is necessary to pass them as arguments to the kernel. For r600, the correct lds size must be set to the SQ_LDS_ALLOC register. The correct size is the clover size plus the size reported by the compiler. Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2013-07-30 07:09:16 -07:00
Maarten Lankhorst	e847b5ae06	nvc0: force use of correct firmware file Signed-off-by: Maarten Lankhorst <maarten.lankhorst@canonical.com>	2013-07-28 12:06:57 +02:00
Christoph Bumiller	5c37039797	nv50,nvc0: s/uint16/uint32 for constant buffer offset Looks like a thinko, "Hey, constant buffers can be at most 64 KiB in size, offset can't be larger." But it can, of course. I think piglit lacks a test for UBO and BindBufferRange that tests if it actually works.	2013-07-24 20:46:38 +02:00
Tom Stellard	4e90bc9a12	gallium: Add PIPE_CAP_ENDIANNESS Cc: mesa-stable@lists.freedesktop.org [ Francisco Jerez: Fix "PIPE_ENDIAN_SMALL" in the documentation, define PIPE_ENDIAN_NATIVE. ]	2013-07-22 22:43:17 +02:00
Zack Rusin	7bae56c5c2	llvmpipe: Ensure FTZ/DAZ flags are set on deferred draw flushes. Tested-by: José Fonseca <jfonseca@vmware.com>	2013-07-22 18:11:39 +01:00
José Fonseca	2a650611be	llvmpipe: Remove lp_rast_get_num_threads(). Never called. Trivial.	2013-07-22 18:08:39 +01:00
Zack Rusin	f59cb67376	llvmpipe/tests: update arith test to check for edge cases Test infs, zeros and nans with our arith functions to assure correct/defined behavior with those values. Signed-off-by: Zack Rusin <zackr@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2013-07-19 16:29:18 -04:00
Roland Scheidegger	4ef19f7fec	llvmpipe: clamp inputs for srgb render buffers Usually with fixed point renderbuffers clamping is done as part of conversion. However, since we blend in float format, we essentially skip all conversion steps pre-blend but since this is still a fixed point renderbuffer we must still clamp the inputs in this case. Makes no difference for piglit though. Obviously we could skip this if fragment color clamping is enabled, but a) this is deprecated in OpenGL (d3d never had it) and b) we don't support it natively so it gets baked into the shader. Also add some comment about logic ops being broken for srgb, luckily no test tries to do that as there's no easy fix... Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Reviewed-by: Zack Rusin <zackr@vmware.com>	2013-07-18 19:04:20 +02:00
Roland Scheidegger	e57b98bad3	llvmpipe: fix blending with SRC_ALPHA_SATURATE with some formats without alpha We were fixing up the blend factor to ZERO, however this only works correctly with fixed point render buffers where the input values are clamped to 0/1 (because src_alpha_saturate is min(As, 1-Ad) so can be negative with unclamped inputs). Haven't seen any failure anywhere due to that with fixed point SNORM buffers (which clamp inputs to -1/1) but it should apply there as well (snorm blending is rare, even opengl 4.3 doesn't require snorm rendertargets at all, d3d10 requires them but they are not blendable). Doesn't look like piglit hits this though (some internal testing hits the float case at least). (With legacy OpenGL we could theoretically still use the fixup to zero if the fragment color clamp is enabled, but we can't detect that easily since we don't support native clamping hence it gets baked into the shader.) Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Reviewed-by: Zack Rusin <zackr@vmware.com>	2013-07-18 19:03:35 +02:00
Marek Olšák	0d7f087483	r600g: use WAIT_3D_IDLE before using CP DMA I broke this with `7948ed1250` for r700 at least.	2013-07-18 14:27:34 +02:00
Jonathan Gray	0b405f364f	r300g: make use of gallium's os_get_process_name() Lets the code compile on non Linux systems. Signed-off-by: Jonathan Gray <jsg@jsg.id.au> Signed-off-by: Marek Olšák <maraeo@gmail.com>	2013-07-18 14:04:48 +02:00
Ilia Mirkin	fbdae1ca41	nv50: H.264/MPEG2 decoding support via VP2, available on NV84-NV96, NVA0 Adds H.264 and MPEG2 codec support via VP2, using firmware from the blob. Acceleration is supported at the bitstream level for H.264 and IDCT level for MPEG2. Known issues: - H.264 interlaced doesn't render properly - H.264 shows very occasional artifacts on a small fraction of videos - MPEG2 + VDPAU shows frequent but small artifacts, which aren't there when using XvMC on the same videos Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2013-07-18 07:52:32 +02:00
Vadim Girlin	07baf9cfd1	r600g/sb: improve alu packing on cayman Scheduler/register allocator in r600-sb was developed and optimized on evergreen (VLIW-5) hardware, so currently it's not optimal for VLIW-4 chips. This patch should improve performance on cayman gpus due to better alu packing, but also it tends to increase register usage, so overall positive effect on performance has to be proven by real benchmarks yet. Some results with bfgminer kernel on cayman: source bytecode: 60 gprs, 3905 alu groups, sbcl before the patch: 45 gprs, 4088 alu groups, sbcl with this patch: 55 gprs, 3474 alu groups. Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	2013-07-17 18:29:56 +04:00
Vadim Girlin	ba7fa4c4c9	r600g/sb: fix handling of new multislot instructions on cayman Ex-scalar instructions that became multislot on cayman do replicate result to all channels - handle them similar to DOT4. Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	2013-07-17 18:27:31 +04:00
Vadim Girlin	033eec4145	r600g/sb: fix debug dump code in scheduler Update the stale debug code for other changes related to debug output. Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	2013-07-17 18:27:31 +04:00
Vadim Girlin	44ebe7291c	r600g/sb: fix initial register allocation Mark values that are members of the 'same register' constraint as preallocated in ra_init pass, this will prevent incorrect reallocation in scheduler in some cases. Should fix https://bugs.freedesktop.org/show_bug.cgi?id=66713 Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	2013-07-17 18:27:30 +04:00
Vadim Girlin	f0d881106a	r600g/sb: move chip & class name functions to sb_context Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	2013-07-17 18:27:30 +04:00
Vadim Girlin	96efa4cdf4	r600g/sb: fix handling of PS in source bytecode on cayman Actually PS doesn't make sense for cayman and isn't even mentioned in cayman docs, but llvm backend currently uses it in bytecode and, assuming that hw seems to be mostly ok with it, this will allow sb to parse such source bytecode correctly. Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	2013-07-17 18:27:30 +04:00
Vinson Lee	81d3881367	r600g/sb: Initialize ra_checker member variables. Fixes "Uninitialized scalar field" defect reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org>	2013-07-17 18:27:30 +04:00
Roland Scheidegger	dc1cc928ed	llvmpipe: support sRGB framebuffers Just use the new conversion functions to do the work. The way it's plugged in into the blend code is quite hacktastic but follows all the same hacks as used by packed float format already. Only support 4x8bit srgb formats (rgba/rgbx plus swizzle), 24bit formats never worked anyway in the blend code and are thus disabled, and I don't think anyone is interested in L8/L8A8. Would need even more hacks otherwise. Unless I'm missing something, this is the last feature except MSAA needed for OpenGL 3.0, and for OpenGL 3.1 as well I believe. v2: prettify a bit, use separate function for packing. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-07-16 01:54:51 +02:00

... 16 17 18 19 20 ...

11465 commits