fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-08 15:38:09 +02:00

Author	SHA1	Message	Date
Paul Berry	bd8d257ef3	glsl/linker: fix varying packing for non-flat integer varyings. Commit `dfb57e7` (glsl: Fix error checking on "flat" keyword to match GLSL ES 3.00, GLSL 1.50) relaxed the rules for integral varyings: they only need to be declared as "flat" if they are fragment shader inputs. This allowed for the possibility of a vertex shader output being a non-flat integer, provided that it was not matched to a fragment shader input. A non-contrived situation where this might arise is if a vertex shader generates some integral outputs which are consumed by transform feedback, but not by the fragment shader. Unfortunately, lower_packed_varyings assumes that all integral varyings are flat, regardless of whether they are consumed by the fragment shader. As a result, attempting to create a non-flat integral vertex output of a size that required packing (i.e. a size other than ivec4 or uvec4) would cause an assertion failure in lower_packed_varyings. This patch prevents the assertion failure by forcing vertex shader outputs to be "flat" whenever they are not consumed by the fragment shader. This should have no effect on rendering since the "flat" keyword only affects the behaviour of fragment shader inputs. Fixes piglit test "spec/EXT_transform_feedback/nonflat-integral". NOTE: This is a candidate for the 9.1 release branch. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> (cherry picked from commit `7862bde8af`)	2013-05-10 16:41:29 -07:00
Chris Forbes	509054eb25	mesa: don't memcmp() off the end of a cache key. Reported-by: `per` in #intel-gfx The size of the cache key varies, so store the actual size as well as the key blob itself, rather than just assuming it's the same as the size passed in. NOTE: This is a candidate for stable branches. V2: Don't leave silly holes in structure; use unsigned instead of GLuint. V3: Fix missing case for `last` match. Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Paul Berry <stereotype441@gmail.com> (cherry picked from commit `c4629ad3f9`)	2013-05-10 16:41:29 -07:00
Brian Paul	df4e6650e3	gallium/u_blitter: fix is_blit_generic_supported() stencil checking Don't check if there's sampler support for stencil if we're not going to actually blit/copy stencil values. Fixes the case where we mistakenly said we can't support a blit of depth values from S8Z24 to X8Z24. Also, rename the is_stencil variable to dst_has_stencil to improve readability. NOTE: This is a candidate for the stable branches. Reviewed-by: Marek Olšák <maraeo@gmail.com> Reviewed-by: José Fonseca <jfonseca@vmware.com> (cherry picked from commit `de99b6d117`)	2013-05-10 16:41:29 -07:00
Alexander Monakov	cc53944c26	Honor GLX_DONT_CARE in MATCH_MASK NOTE: This is a candidate for stable branches. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=47478 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=62999 Bugzilla: http://bugs.winehq.org/show_bug.cgi?id=26763 (cherry picked from commit `9cda356004`)	2013-05-10 16:41:29 -07:00
Kenneth Graunke	acc3561cca	i965: Fix stencil write enable flag in 3DSTATE_DEPTH_BUFFER on Gen7+. ctx->Stencil.WriteMask is a statically sized array of 3 elements. Checking it against 0 actually is a NULL check, and can never fail, which meant that we always said stencil writes were enabled. Use the new core Mesa derived state flag to fix this. NOTE: This is a candidate for stable branches. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Paul Berry <stereotype441@gmail.com> (cherry picked from commit `01bd29d681`)	2013-05-10 16:41:28 -07:00
Paul Berry	671e4e6b9e	i965: Reduce code duplication in handling of depth, stencil, and HiZ. This patch consolidates duplicate code in the brw_depthbuffer and gen7_depthbuffer state atoms. Previously, these state atoms contained 5 chunks of code for emitting the _3DSTATE_DEPTH_BUFFER packet (3 for Gen4-6 and 2 for Gen7). Also a lot of logic for determining the appropriate buffer setup was duplicated between the Gen4-6 and Gen7 functions. This refactor splits the code into three separate functions: brw_emit_depthbuffer(), which determines the appropriate buffer setup in a mostly generation-independent way, brw_emit_depth_stencil_hiz(), which emits the appropriate state packets for Gen4-6, and gen7_emit_depth_stencil_hiz(), which emits the appropriate state packets for Gen7. Tested using Piglit on Gen5-7 (no regressions). v2: Re-word some comments. Fix an assertion that incorrectly prohibited packed depth/stencil formats on Gen6 (these are allowed provided that HiZ is disabled). Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (cherry picked from commit `41e4bccc75`)	2013-05-10 16:41:28 -07:00
Kenneth Graunke	ae79402dba	mesa: Add new ctx->Stencil._WriteEnabled derived state flag. i965 needs to know whether stencil writes are enabled in several places, and gets the test wrong sometimes. While we could create a function to compute this, it seems generally useful enough to warrant a new piece of derived state. Also, all the plumbing is already in place. NOTE: This is a candidate for stable branches. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Paul Berry <stereotype441@gmail.com> (cherry picked from commit `1e3235d36e`)	2013-05-10 16:41:28 -07:00
Marek Olšák	2708dc5e88	radeonsi: add more cases for copying unsupported formats to resource_copy_region Ported from r600g commit: `8891b2f9c9` Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com> NOTE: This is a candidate for the 9.1 branch. (cherry picked from commit `ff01e0db0e`)	2013-05-10 16:41:28 -07:00
Paul Berry	c5a1eabaf2	glsl: Fix array indexing when constant folding built-in functions. Mesa constant-folds built-in functions by using a miniature GLSL interpreter (see ir_function_signature::constant_expression_evaluate_expression_list()). This interpreter had a bug in its handling of array indexing, which caused expressions like "m[i][j]" (where m is a matrix) to be handled incorrectly. Specifically, it incorrectly treated j as indexing into the whole matrix (rather than indexing just into the vector m[i]); as a result the offset computed for m[i] was lost and m[i][j] was treated as m[j][0]. Fixes piglit tests inverse-mat[234].{vert,frag}. NOTE: This is a candidate for the 9.1 and 9.0 branches. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=57436 (cherry picked from commit `7d4f1e6467`)	2013-05-10 16:41:28 -07:00
Michel Dänzer	7c6472410a	radeonsi: Handle arbitrary 2-byte formats in resource_copy_region Fixes mplayer -vo vdpau OSD. NOTE: This is a candidate for the 9.1 branch. Reported-by: Igor Vagulin <igor.vagulin@gmail.com> Reviewed-by: Christian König <christian.koenig@amd.com> Tested-by: Christian König <christian.koenig@amd.com> (cherry picked from commit `c6efb4870b`)	2013-05-10 16:41:28 -07:00
Maarten Lankhorst	09f5ee9918	nvc0: Fix fd leak in nvc0_create_decoder NOTE: This is a candidate for the 9.0 and 9.1 branches. Signed-off-by: Maarten Lankhorst <maarten.lankhorst@canonical.com> (cherry picked from commit `6d20c646d6`)	2013-05-10 16:41:28 -07:00
Aras Pranckevicius	46ac963a23	GLSL: fix lower_jumps to report progress properly A fix for lower_jumps progress reporting, very much like similar in `c1e591eed`. NOTE: This is a candidate for stable branches. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> (cherry picked from commit `b2eee0869f`)	2013-05-10 16:41:28 -07:00
Eric Anholt	ee561e0927	i965/fs: Clean up the setup of gen4 simd16 message destinations. I think this makes it much more obvious what's going on here. NOTE: This is a candidate for the 9.1 branch. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (cherry picked from commit `8edc7cbe64`)	2013-05-10 16:41:28 -07:00
Eric Anholt	724269bb32	i965/fs: Do CSE on gen7's varying-index pull constant loads. This is our first CSE on a regs_written() > 1 instruction, so it takes a bit of extra fixup. Reduces the number of loads on kwin's Lanczos shader from 12 to 2. v2: Fix compiler warning (false positive on possibly-uninitialized variable) Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=61554 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (v1) NOTE: This is a candidate for the 9.1 branch. (cherry picked from commit `9f43b84928`)	2013-05-10 16:41:27 -07:00
Eric Anholt	f523c0fb21	i965/fs: Avoid inappropriate optimization with regs_written > 1. Right now we don't have anything with regs_written() > 1 and !inst->mlen, but that's about to change. NOTE: This is a candidate for the 9.1 branch. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (cherry picked from commit `bc0e1591f6`)	2013-05-10 15:40:28 -07:00
Eric Anholt	52bf09d52c	i965: Make the constant surface interface take a normal byte size. This puts the rounding-up logic into the function itself instead of all the callers having to manage it. Also drop an "unused" comment in gen4, as the stride is used for texbos (and will be for uniforms soon). NOTE: This is a candidate for the 9.1 branch. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (cherry picked from commit `2f41a60145`)	2013-05-10 13:43:11 -07:00
Eric Anholt	7f2a65d896	i965/fs: Move varying uniform offset computation into the helper func. I'm going to want to change the math for gen7 using sampler LD instructions in a way that gets CSE to occur like we'd hope. NOTE: This is a candidate for the 9.1 branch. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (cherry picked from commit `8c694dfe64`)	2013-05-10 13:43:11 -07:00
Eric Anholt	d61b1fdad6	i965/fs: Remove creation of a MOV instruction that's never used. We weren't inserting it into the list, so it did nothing. This line was replaced by the MOV/MUL block above. NOTE: This is a candidate for the 9.1 branch. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (cherry picked from commit `59e858861c`)	2013-05-10 13:43:11 -07:00
Haixia Shi	627e2669ab	ACTIVE_UNIFORM_MAX_LENGTH should include 3 extra characters for arrays. If the active uniform is an array, then the length of the uniform name should include the three extra characters for the "[0]" suffix, which is required by the GL 4.2 spec to be appended to the uniform name in glGetActiveUniform(). This avoids the situation where the output buffer does not have enough space to hold the "[0]" suffix, resulting in an incomplete array specification like "foobar[0". NOTE: This is a candidate for the 9.1 branch. Change-Id: I41e87ba347a7169eec8c575596cc3416adbe0728 Signed-off-by: Haixia Shi <hshi@chromium.org> Reviewed-by: Stéphane Marchesin <marcheu@chromium.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> (cherry picked from commit `bc0cc2944f`)	2013-05-10 13:43:11 -07:00
Brian Paul	44d35d70e3	mesa: remove platform checks around __builtin_ffs, __builtin_ffsll Use the __builtin_ffs, __builtin_ffsll functions whenever we have GCC, not just for specific platforms. Fixes Solaris build. Note: This is a candidate for the stable branches. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=62868 Signed-off-by: Brian Paul <brianp@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com> (cherry picked from commit `95df2b2883`)	2013-05-10 13:43:11 -07:00
Ian Romanick	f5887e4d3f	mesa: Note that patch `0967c36` shouldn't actually get picked to the 9.1 branch The code didn't apply cleanly due to a number of refactors, so a different solution was needed. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2013-05-10 13:43:11 -07:00
Chris Forbes	34a4fc5989	i965/fs: Don't try to use bogus interpolation modes pre-Gen6. Interpolation modes other than perspective-barycentric-pixel-center (and their associated coefficients in the WM payload) only exist in Gen6 and later. Unfortunately, if a varying was declared as `centroid`, we would blindly read the nonexistent values, and so produce all manner of bad behavior -- texture swimming, snow, etc. Fixes rendering in Counter-Strike Source and Team Fortress 2 on Ironlake. Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Paul Berry <stereotype441@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Tested-by: Jordan Justen <jordan.l.justen@intel.com> (cherry picked from commit `79f786f936`)	2013-05-08 15:39:25 -07:00
Ian Romanick	f81eea3f1f	docs: Add 9.1.2 release md5sums Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2013-04-30 15:25:57 -07:00
Ian Romanick	8c2981b8e0	docs: 9.1.2 release notes Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2013-04-30 15:18:53 -07:00
Ian Romanick	f9abbcacaa	mesa: Bump version to 9.1.2 Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2013-04-30 15:17:47 -07:00
Chris Forbes	251c87d884	i965/vs: Fix Gen4/5 VUE map inconsistency with gl_ClipVertex This is roughly a backport of Eric's commit `0967c362`. We avoided assigning a slot in the VUE map for gl_ClipVertex, but left the bit set in outputs_written, producing horrible confusion further down the pipe. Mostly fixes rendering in source games, and probably in Freespace 2 SCP. No Piglit regressions on Ironlake. Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> V2: Mask out the bit, not its index. Strangely, the game still worked with that wrong, but rendering of pretty much anything else was completely trashed. Reviewed-by: Paul Berry <stereotype441@gmail.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Tested-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2013-04-30 07:16:02 +12:00
Adam Jackson	3cff41c7e4	linux: Don't emit a .note.ABI-tag section anymore (#26663) We don't support pre-2.6 kernels anyway - the install docs say 2.6.28 for DRI - and apparently this confuses ld.so's sorting when multiple libGLs are installed. Just remove it. Note: this is a candidate for the stable branches. Acked-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Adam Jackson <ajax@redhat.com> (cherry picked from commit `904b03824b`)	2013-04-27 17:22:51 +10:00
Alex Deucher	e78b553195	r600g: disable hyperz by default on 9.1 There are too many cases where we end up with lockups. Once we sort out the remaining issues on master, they can be backported and hyperz can be re-enabled on 9.1 Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>	2013-04-22 12:16:51 -04:00
Tom Stellard	f0440493c2	r300g: Fix bug in OMOD optimization https://bugs.freedesktop.org/show_bug.cgi?id=60503 NOTE: This is a candidate for the stable branches. (cherry picked from commit `c6a86fb563`)	2013-04-12 09:35:00 -07:00
Carl Worth	4f44146226	i965: Avoid segfault in gen6_upload_state This fixes a bug introduced in commit `258453716f` and triggered whenever "rb" is NULL. Fixes at least one cause of bug #59445: [SNB/IVB/HSW Bisected]Oglc draw-buffers2(advanced.blending.none) segfault https://bugs.freedesktop.org/show_bug.cgi?id=59445 (Segfaults are still possible in that test case, but they have been present since before commit `258453716f`, which is what's being fixed here.) Reviewed-by: Eric Anholt <eric@anholt.net> [jordan.l.justen@intel.com: fixes Anomaly Warzone Earth crash at title screen] Tested-by: Jordan Justen <jordan.l.justen@intel.com>	2013-04-10 19:48:56 -07:00
Ian Romanick	39bb794aba	mesa: Note that patch `dbf94d1` shouldn't actually get picked to the 9.1 branch Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2013-04-08 15:01:07 -07:00
Ian Romanick	c18d48da41	glsl: Add missing bool case in glsl_type::get_scalar_type Since the case was missing, bvec4->get_scalar_type() would return bvec4, but vec4->get_scalar_type() would return float. NOTE: This is a candidate for stable branches. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> (cherry picked from commit `c770faea0a`)	2013-04-08 14:49:58 -07:00
Martin Andersson	830bc1cbe6	r600g: Use virtual address for PIPE_QUERY_SO* in r600_emit_query_end Virtual address is used for PIPE_QUERY_SO* queries in r600_emit_query_begin, but not in r600_emit_query_end. This will trigger a GPU fault when one of those queries is made and virtual address is enabled. Note: this is a candidate for the 9.1 branch Signed-off-by: Alex Deucher <alexander.deucher@amd.com> (cherry picked from commit `92855bcc95`)	2013-04-08 14:49:53 -07:00
Eric Anholt	c589071fb2	mesa: Disable validate_ir_tree() on release builds. Since half of ir_validate uses asserts() (the other using printf() then abort()), there's not much use to calling it in a release build. Cuts 6.3% of the startup time of TF2. NOTE: This is a candidate for the stable branches. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (cherry picked from commit `712bac1f41`)	2013-04-08 14:49:48 -07:00
Marek Olšák	80092d8869	mesa: handle HALF_FLOAT like FLOAT in get_tex_rgba NOTE: This is a candidate for the stable branches. Reviewed-by: Brian Paul <brianp@vmware.com> Tested-by: Brian Paul <brianp@vmware.com> (cherry picked from commit `b2a4573c14`)	2013-04-08 14:49:44 -07:00
Matt Turner	c7720a24be	mesa: Implement TEXTURE_IMMUTABLE_LEVELS for ES 3.0. NOTE: This is a candidate for the 9.1 branch. Fixes piglit's texture-immutable-levels test. Reported-by: Marek Olšák <maraeo@gmail.com> Reviewed-by: Brian Paul <brianp@vmware.com> (cherry picked from commit `12dc4be8a6`)	2013-04-05 19:01:10 -07:00
Adam Jackson	82ac970d37	glx: Build with VISIBILITY_CFLAGS in automake Note: This is a candidate for the stable branches. Signed-off-by: Adam Jackson <ajax@redhat.com> (cherry picked from commit `38aa8ec937`)	2013-04-05 19:01:09 -07:00
Michel Dänzer	e0af764882	radeonsi: Emit pixel shader state even when only the vertex shader changed Fixes random failures with piglit glsl-max-varyings. NOTE: This is a candidate for the 9.1 branch. Reviewed-by: Christian König <christian.koenig@amd.com> (cherry picked from commit `032e5548b3`)	2013-04-05 19:01:09 -07:00
Kenneth Graunke	0c5fa7ae0e	i965: Don't use texture swizzling to force alpha to 1.0 if unnecessary. Commit `33599433c7` began setting the texture swizzle mode to XYZ1 for RED, RG, and RGB textures in order to force alpha to 1.0 in case we actually stored the texture as RGBA. This had an unforeseen performance implication: the shader precompile assumes that the texture swizzle mode will be XYZW for non-shadow sampler types. By setting it to XYZ1, this means every shader used with a RED, RG, or RGB texture has to be recompiled. This is a very common case. Unfortunately, there's no way to improve the precompile, since RGBA textures still need XYZW, and there's no way to know by looking at the shader source what texture formats might be used. However, we only need to smash alpha to 1.0 if the texture's memory format actually has alpha bits. If not, the sampler already returns 1.0 for us without any special swizzling. XRGB8888, for example, is a very common case where this occurs. This partially fixes a performance regression since commit `33599433c7`. More work is required to fully fix it in all cases. This at least helps Warsow. NOTE: This is a candidate for the 9.1 branch. Reviewed-by: Carl Worth <cworth@cworth.org> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> (cherry picked from commit `d86efc075e`)	2013-04-05 19:01:09 -07:00
Maarten Lankhorst	725c671d61	radeon/llvm: Do not link against libgallium when building statically. NOTE: This is a candidate for the 9.1 branch. Tested-by: Vincent Lejeune <vljn@ovi.com> Signed-off-by: Maarten Lankhorst <maarten.lankhorst@canonical.com> (cherry picked from commit `7c3d8301af`)	2013-04-05 19:01:09 -07:00
Andreas Boll	4205bd4b9b	gallium/egl: fix out-of-tree build Taken from downstream: http://anonscm.debian.org/gitweb/?p=pkg-xorg/lib/mesa.git;a=blob;f=debian/patches/15-fix-oot-build.diff;h=7040999a22d3937d0578cfd85ee2c71d7dc614bb;hb=refs/heads/ubuntu%2B1 NOTE: This is a candidate for the 9.1 branch. Acked-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com> (cherry picked from commit `182895c4e6`)	2013-04-05 19:01:09 -07:00
Andreas Boll	6e8f8a959b	osmesa: fix out-of-tree build Taken from downstream: http://anonscm.debian.org/gitweb/?p=pkg-xorg/lib/mesa.git;a=blob;f=debian/patches/14-fix-osmesa-build.diff;h=00581d0e1833c5492d9050e1bf3d5e658cad782e;hb=refs/heads/ubuntu%2B1 v2: Move the added line immediately after -I$(top_srcdir)/src/mapi NOTE: This is a candidate for the 9.1 and 9.0 branches. Acked-by: Kenneth Graunke <kenneth@whitecape.org> (v1) Reviewed-by: Matt Turner <mattst88@gmail.com> (cherry picked from commit `92e6260c19`)	2013-04-05 19:01:09 -07:00
Andreas Boll	0c0e72f756	build: Enable x86 assembler on Hurd. Taken from downstream: http://anonscm.debian.org/gitweb/?p=pkg-xorg/lib/mesa.git;a=blob;f=debian/patches/10-hurd-configure-tweaks.diff;h=984e17df1b8afdf8e4b36bee96aa5ab6a5691021;hb=refs/heads/ubuntu%2B1 Thanks to Pino Toscano. v2: Don't bother with x86_64. AFAICT GNU/Hurd doesn't support it so far. NOTE: This is a candidate for stable branches. Acked-by: Kenneth Graunke <kenneth@whitecape.org> (v1) Acked-by: Matt Turner <mattst88@gmail.com> (cherry picked from commit `06fff296e9`)	2013-04-05 19:01:09 -07:00
Andreas Boll	60e5696de3	mesa: use ieee fp on s390 and m68k Taken from downstream: http://anonscm.debian.org/gitweb/?p=pkg-xorg/lib/mesa.git;a=blob;f=debian/patches/02_use-ieee-fp-on-s390-and-m68k.patch;h=d3d6c1d7fec3c72ecf320706167deb61c52636c3;hb=refs/heads/ubuntu%2B1 Fixes Debian bug #349437. Patch written by David Nusinow. NOTE: This is a candidate for stable branches. Acked-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Matt Turner <mattst88@gmail.com> (cherry picked from commit `7962f28c43`)	2013-04-05 19:01:09 -07:00
Roland Scheidegger	7067d65e56	gallivm: fix return opcode handling in main function of a shader If we're in some conditional or loop we must not return, or the code after the condition is never executed. (v2): And, we also can't just continue as nothing happened, since the mask update code would later check if we actually have a mask, so we need to remember that there was a return in main where we didn't exit (to illustrate this, a ret in a if clause would cause a mask update which is still ok as we're in a conditional, but after the endif the mask update code would drop the mask hence bringing execution back to pixels which should have their execution mask set to zero by the ret). Thanks to Christoph Bumiller for figuring this out. This fixes https://bugs.freedesktop.org/show_bug.cgi?id=62357. Note: This is a candidate for the stable branches. Reviewed-by: Jose Fonseca <jfonseca@vmware.com> (cherry picked from commit `5af7b45986`)	2013-04-05 19:01:09 -07:00
Andreas Boll	4999f0a84e	radeon/llvm: Link against libgallium.la to fix an undefined symbol Ported from downstream: http://anonscm.debian.org/gitweb/?p=pkg-xorg/lib/mesa.git;a=blob;f=debian/patches/119-libllvmradeon-link.patch;h=ee47f8a07dbf33c32f8b57faed923680ed6648fb;hb=refs/heads/ubuntu%2B1 Fixes a regression introduced with `f70c385351` NOTE: This is a candidate for the 9.1 branch. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=62434 Signed-off-by: Maarten Lankhorst <maarten.lankhorst@canonical.com> (cherry picked from commit `36320bfa54`)	2013-04-05 19:01:08 -07:00
Maarten Lankhorst	70f7138754	gallium/build: Fix visibility CFLAGS in automake v2: Andreas Boll <andreas.boll.dev@gmail.com> - Fix formatting - use one CFLAG per line NOTE: This is a candidate for the 9.1 branch. Signed-off-by: Maarten Lankhorst <m.b.lankhorst@gmail.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=59238 Reviewed-by: Andreas Boll <andreas.boll.dev@gmail.com> (cherry picked from commit `f70c385351`)	2013-04-05 19:01:08 -07:00
Paul Berry	0756ab9c85	i965: Apply depthstencil alignment workaround when doing fast clears. Fast depth clears have the same depth/stencil alignment requirements as other drawing operations. Therefore, we need to call brw_workaround_depthstencil_alignment() from both the clear and drawing paths. Without this fix, we get image corruption if the following conditions hold: (a) the first ever drawing operation to a depth miplevel (or the first drawing operation after having used the texture for sampling) is a clear, (b) the depth miplevel has a size that is eligible for fast depth clears, and (c) the depth miplevel has an offset within the miptree that isn't 8x8 aligned. Fixes piglit "depthstencil-render-miplevels" tests with size 273. NOTE: This is a candidate for stable branches Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net> (cherry picked from commit `c5d5827951`)	2013-04-05 19:01:08 -07:00
Kenneth Graunke	6e6dcd451e	i965: Make INTEL_DEBUG=shader_time use the RAW surface format. Untyped Atomic Operation messages are illegal for non-RAW formats. The IVB hardware proceeds happily (after all, who cares what the format of the surface is if you're doing untyped ops on it?), but later hardware apparently doesn't. The simulator for gen7 does complain, though. v2: Rebase against updates to previous patches. (by anholt) NOTE: This is a candidate for the 9.1 branch. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Paul Berry <stereotype441@gmail.com> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> (cherry picked from commit `91df4d746b`)	2013-04-05 19:01:08 -07:00
Kenneth Graunke	0d9f849ddf	i965: Specialize SURFACE_STATE creation for shader time. This is basically a copy and paste of gen7_create_constant_surface, but with the parameters filled in to offer a simpler interface. It will diverge shortly. I didn't bother adding it to the vtable for now since shader time is only exposed on Gen7+. v2: Replace tabs in the new code (by anholt) Add back dropped memset() and add a comment about HSW channel selects. NOTE: This is a candidate for the 9.1 branch. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Paul Berry <stereotype441@gmail.com> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> (cherry picked from commit `125b34cffb`)	2013-04-05 19:01:08 -07:00
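
Note on commit `509054eb25` (mesa: don't memcmp() off the end of a cache key): the message says the cache key size varies, so the actual size is stored next to the key blob. A minimal sketch of that idea follows. The struct and function names are hypothetical and are not taken from the Mesa source.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical cache entry; field names are illustrative only. */
struct cache_item {
   const void *key;
   size_t key_size;   /* actual size of the stored key blob */
};

/* Returns true only when both size and contents match. */
static bool
cache_key_matches(const struct cache_item *item,
                  const void *key, size_t key_size)
{
   assert(item != NULL && key != NULL);
   /* Compare sizes first so memcmp() never reads past the stored blob. */
   return item->key_size == key_size &&
          memcmp(item->key, key, key_size) == 0;
}
```

Checking the size before calling memcmp() means a longer lookup key can never cause a read beyond the end of a shorter stored key.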
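
Note on commit `acc3561cca` (i965: Fix stencil write enable flag): the message points out that comparing a statically sized array against 0 is a pointer check that can never fail. The sketch below shows the distinction with a stand-in struct. Only the 3-element array size comes from the commit message; the struct, the function, and the `face` parameter are assumptions for illustration, and this is not the derived-state code the commit actually uses.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical stand-in for the stencil state. */
struct stencil_state {
   unsigned WriteMask[3];
};

/*
 * Writing `s->WriteMask != 0` would compare the array's address with
 * NULL, which is always true. Testing an element checks the mask value.
 */
static bool
stencil_writes_enabled(const struct stencil_state *s, unsigned face)
{
   assert(face < 3);
   return s->WriteMask[face] != 0;
}
```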
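
Note on commit `c5a1eabaf2` (glsl: Fix array indexing when constant folding built-in functions): the message describes m[i][j] being evaluated as m[j][0] because the offset for m[i] was lost. The arithmetic can be sketched as below, assuming column-major storage where each column m[i] holds `rows` components. The function name is hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/*
 * Flat offset of m[col][row] when each column stores `rows` components.
 * The column offset (col * rows) must be kept when applying the second
 * index; dropping it yields row * rows, i.e. the element m[row][0].
 */
static size_t
matrix_element_offset(size_t rows, size_t col, size_t row)
{
   assert(row < rows);
   return col * rows + row;
}
```

For a mat4, m[2][1] is at offset 9, whereas the behaviour described in the commit message would have read offset 4, which is m[1][0].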
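
Note on commit `627e2669ab` (ACTIVE_UNIFORM_MAX_LENGTH should include 3 extra characters for arrays): the length calculation described in the message can be sketched as follows. The helper name is hypothetical; the sketch assumes the reported length includes the NUL terminator.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical helper: buffer size needed to return a uniform's name. */
static size_t
active_uniform_name_length(const char *name, bool is_array)
{
   assert(name != NULL);
   size_t len = strlen(name) + 1;   /* name plus NUL terminator */
   if (is_array)
      len += 3;                     /* room for the "[0]" suffix */
   return len;
}
```

With this calculation an array uniform named "foobar" needs 10 bytes, enough for "foobar[0]" plus the terminator, so the truncated "foobar[0" case from the commit message cannot occur.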
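
Note on commit `251c87d884` (i965/vs: Fix Gen4/5 VUE map inconsistency with gl_ClipVertex): the V2 remark "Mask out the bit, not its index" refers to a common bitfield slip. A minimal sketch follows, with a hypothetical function name and a 64-bit field assumed for illustration; only the name outputs_written comes from the commit message.

```c
#include <assert.h>
#include <stdint.h>

/*
 * Clear one slot from a bitfield of written outputs.
 * Correct:   outputs_written & ~(1 << slot)
 * Incorrect: outputs_written & ~slot   (masks with the index itself)
 */
static uint64_t
clear_output_bit(uint64_t outputs_written, unsigned slot)
{
   assert(slot < 64);
   return outputs_written & ~(UINT64_C(1) << slot);
}
```

For outputs_written = 0xFF and slot 3, the correct form gives 0xF7, while masking with the index would give 0xFC and clear two unrelated bits.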


55318 commits