fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-05 00:58:05 +02:00

Author	SHA1	Message	Date
Ilia Mirkin	47c19a5819	nvc0: change logic for centering of eng2d blit when downsampling We want to center the sample. The old code may have been correct given the limited values of ms_x/y, but the new logic should be more intuitive. Note that ms_x can only be 1/2 and ms_y can only be 0/1. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2014-04-19 13:23:32 -04:00
Ilia Mirkin	6d5c3c8260	nv50: use 2d blit when src/dst have same number of samples The 2D engine should be usable in more cases, but this fixes MS blits between textures with the same MS settings. Otherwise a single sample is selected to be the target texel value. This allows other tests to work that render to a RB and then blit that to a texture for input into a shader that uses sampler2DMS to verify it. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2014-04-19 13:23:32 -04:00
Ilia Mirkin	2d2e60bdee	gallium/docs: fix PIPE_CAP_ENDIANNESS delimiter, remove trailing spaces Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2014-04-19 13:23:32 -04:00
Petri Latvala	b45f65e760	mesa: update glext.h to version 20140313 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-04-18 14:30:57 -07:00
Kenneth Graunke	a1273a07ed	i965/fs: Implement fs_inst::force_sechalf support on Broadwell. Back when I originally wrote this code, force_sechalf was only used for Gen4 code, so I didn't bother hooking it up. However, it's used more generally these days. In particular, we use it for computing gl_SamplePosition. Fixes Piglit's spec/ARB_sample_shading/builtin-gl-sample-position tests. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=77222 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2014-04-18 11:57:33 -07:00
Chris Forbes	92840aabf7	glsl: Allow explicit binding on atomics again As of `943b2d52bf`, layout(binding) on an atomic would fail the assertion here. Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-18 10:35:05 -07:00
Alex Deucher	7489f3eeda	radeonsi: fix num banks selection on SI for dma setup (v2) The number of banks varies based on the tile mode index just like CIK. Bug: https://bugs.freedesktop.org/show_bug.cgi?id=77533 v2: fix ordering for nbanks calculation for consistency Signed-off-by: Alex Deucher <alexander.deucher@amd.com>	2014-04-18 13:24:12 -04:00
Matt Turner	f770123f58	i965/fs: Reduce restrictions on interference in register coalescing. We previously only allowed coalescing registers that interfere (i.e., whose live ranges overlap) if the destination register's live range was entirely inside the source's live range. This is unnecessary -- we only need to check for interfering writes in the intersection of their live ranges. total instructions in shared programs: 1639470 -> 1638453 (-0.06%) instructions in affected programs: 84751 -> 83734 (-1.20%) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-18 09:16:19 -07:00
Matt Turner	55de1c035c	i965/fs: Give up in interference check if we see a WHILE. Rather than any old control flow. Muchnick's algorithm just checks for interfering writes between the MOV and the end of the program. Handling this when you have backward branches is hard, so don't, but there's no reason to bail if you see forward branches. instructions in affected programs: 4270 -> 4248 (-0.52%) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-18 09:16:19 -07:00
Matt Turner	5ff1e446d4	i965/fs: Simplify interference scan in register coalescing. We were starting at the beginning of the instruction list, rather than with the MOV instruction itself. This allows us to coalesce after control flow. Excluding the shaders from an unreleased title, the shader-db results: total instructions in shared programs: 1603791 -> 1594215 (-0.60%) instructions in affected programs: 678772 -> 669196 (-1.41%) GAINED: 5 LOST: 0 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-18 09:16:19 -07:00
Matt Turner	04a4e43eb2	i965/fs: Unindent can_coalesce_vars(). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-18 09:16:19 -07:00
Matt Turner	a975b2f55c	i965/fs: Recognize nop-MOV instructions early. And avoid rewriting other instructions unnecessarily. Removes a few self-moves we weren't able to handle because they were components of a large VGRF. instructions in affected programs: 830 -> 826 (-0.48%) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-18 09:16:19 -07:00
Matt Turner	ef6127ff69	i965/fs: Only sweep NOPs if register coalescing made progress. Otherwise there's nothing to do. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-18 09:16:19 -07:00
Marek Olšák	352e06ddea	r600g,radeonsi: don't skip the context flush if a fence should be returned Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=77589	2014-04-18 13:33:57 +02:00
Brian Paul	744d2a225d	svga: fix comment for emit_adjusted_vertex_attribs()	2014-04-17 16:15:37 -06:00
Brian Paul	cb34575e19	svga: compute need_swvfetch in svga_create_vertex_elements_state() This saves us doing it at state validation time. Reviewed-by: Matthew McClure <mcclurem@vmware.com>	2014-04-17 11:31:15 -07:00
Brian Paul	851645a3e7	svga: add VS code to set attribute W component to 1 There's a few 3-component vertex attribute formats that have no equivalent SVGA3D_DECLTYPE_x format. Previously, we had to use the swtnl code to handle them. This patch lets us use hwtnl for more vertex attribute types by fetching 3-component attributes as 4-component attributes and explicitly setting the W component to 1. This lets us handle PIPE_FORMAT_R16G16B16_SNORM/UNORM and PIPE_FORMAT_R8G8B8_UNORM vertex attribs without using the swtnl path. Fixes piglit normal3b3s GL_SHORT test. Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2014-04-17 11:29:33 -07:00
Brian Paul	615a356ee3	svga: implement support for signed byte vertex attributes There's no SVGA3D_DECLTYPE that directly corresponds to PIPE_FORMAT_R8G8B8_SNORM. Previously, we used the swtnl fallback path to handle this but that's slow and causes invariance issues. Now we fetch the attribute as SVGA3D_DECLTYPE_UBYTE4N and insert some extra VS instructions to remap the attributes from the range [0,1] to the range[-1,1]. Fixes Sauerbraten sw fallback. Fixes piglit normal3b3s-invariance test. Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2014-04-17 11:29:33 -07:00
Brian Paul	52faafa174	svga: move translated vertex declaration types into svga_velems_state Now only translate the formats once in svga_create_vertex_elements_state(). And rename the array and use the proper SVGA3dDeclType type. Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2014-04-17 11:29:32 -07:00
Brian Paul	0f5add1959	Revert "svga: add work-around for Sauerbraten Z fighting issue" This reverts commit `c875d6e57a`. Conflicts: src/gallium/drivers/svga/svga_context.c This work-around will no longer be needed after the next patch which properly supports signed-byte vertex attributes. Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2014-04-17 11:29:32 -07:00
Brian Paul	7c7ab5434a	svga: use new inst_token_setp() helper function Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2014-04-17 11:29:32 -07:00
Brian Paul	8e131576ee	svga: use new inst_token_predicated() helper function Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2014-04-17 11:29:32 -07:00
Kenneth Graunke	71846a943f	i965: Retype pre-Gen6 varying pull load destination to UW. This sets up the proper execution mask for sends in SIMD16 mode. Fixes Piglit's glsl-fs-normalmatrix, glsl-fs-uniform-array-2, glsl-fs-uniform-array-6, and glsl-fs-uniform-array-7 on Ironlake, which regressed when I enabled SIMD16 pull parameter support in commit `b207e88b25`. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2014-04-17 10:54:00 -07:00
Anuj Phogat	ee10e893cb	mesa: Fix error condition for multisample proxy texture targets Fixes failures in Khronos OpenGL CTS test proxy_textures_invalid_samples Cc: <mesa-stable@lists.freedesktop.org> Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-17 10:26:39 -07:00
Anuj Phogat	1d350b9e22	i965: Add glBlitFramebuffer to commands affected by conditional rendering Fixes failures in Khronos OpenGL CTS test conditional_render_test9 Cc: <mesa-stable@lists.freedesktop.org> Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-17 10:26:39 -07:00
Anuj Phogat	8ed42ddd7d	swrast: Add glBlitFramebuffer to commands affected by conditional rendering Cc: <mesa-stable@lists.freedesktop.org> Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2014-04-17 10:26:05 -07:00
Anuj Phogat	48fc2703e5	i965: Fix component mask and varying_to_slot mapping for gl_ViewportIndex gl_ViewportIndex doesn't get its own varying slot. It is stored in VARYING_SLOT_PSIZ.z. This patch fixes the issue for both gen7 and gen8 because gen7_upload_3dstate_so_decl_list() is shared between them. Fixes failures in OpenGL Khronos CTS test transform_feedback_builtins. Makes new piglit test glsl-1.50-transform-feedback-builtins pass for 'gl_ViewportIndex'. Cc: <mesa-stable@lists.freedesktop.org> Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-17 10:08:28 -07:00
Anuj Phogat	7928b9c249	i965: Fix component mask and varying_to_slot mapping for gl_Layer gl_Layer doesn't get its own varying slot. It is stored in VARYING_SLOT_PSIZ.y. This patch fixes the issue for both gen7 and gen8 because gen7_upload_3dstate_so_decl_list() is shared between them. Fixes failures in OpenGL Khronos CTS test transform_feedback_builtins. Makes new piglit test glsl-1.50-transform-feedback-builtins pass for 'gl_Layer'. Cc: <mesa-stable@lists.freedesktop.org> Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-17 10:08:28 -07:00
Anuj Phogat	969b461c2b	i965: Put an assertion to check valid varying_to_slot[varying] Cc: <mesa-stable@lists.freedesktop.org> Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-17 10:08:28 -07:00
Darren Powell	bc86690f13	radeonsi: Added Diag Handler to receive LLVM Error messages Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2014-04-17 19:37:58 -04:00
Marek Olšák	9f9ab8ec0d	winsys/radeon: remove some unused code Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2014-04-17 13:54:19 +02:00
Marek Olšák	8b966bcaf2	winsys/radeon: remove is_handle_added array Use index -1 if a buffer is not added. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2014-04-17 13:54:19 +02:00
Marek Olšák	b0fca0a378	winsys/radeon: remove local variable reloc from radeon_get_reloc Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2014-04-17 13:54:18 +02:00
Marek Olšák	3384a41aa9	winsys/radeon: remove parameter reloc from radeon_get_reloc Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2014-04-17 13:54:18 +02:00
José Fonseca	75e487538d	util: Add __declspec(noreturn) to _debug_assert_fail(). Mostly for consistency; as MSVC's static source code analysis doesn't seem to rely on assertions, but instead on different kind of source annotations( http://msdn.microsoft.com/en-us/library/hh916383.aspx ). Reviewed-by: Brian Paul <brianp@vmware.com>	2014-04-17 09:56:49 +01:00
José Fonseca	a2b89c4ae1	auxiliary/os,auxiliary/util: Fix the `‘noreturn’ function does return` warning. Now that _debug_assert_fail() has the noreturn attribute, it is better that execution truly never returns. Not just for sake of silencing the warning, but because the code at the return IP address may be invalid or lead to inconsistent results. This removes support for the GALLIUM_ABORT_ON_ASSERT debugging environment variable, but between the usefulness of GALLIUM_ABORT_ON_ASSERT and better static code analysis I think better static code analysis wins. Reviewed-by: Brian Paul <brianp@vmware.com>	2014-04-17 09:56:48 +01:00
José Fonseca	97fa9cd220	scons: Enable building through Clang Static Analyzer. Same intent as commit `a45a50a482`, but this the C compiler is detected via C-preprocessor macros, similar to how autotools do it, as that seems to be the most reliable method. Reviewed-by: Brian Paul <brianp@vmware.com>	2014-04-17 09:56:48 +01:00
Maarten Lankhorst	74f19445cc	gallium glsl: Fix crash with piglit fs-deref-literal-array-of-structs.shader_test This allows the following shader code to work without a weird crash: struct Foo { int value[1]; }; int actual_value = Foo[2](Foo(int[1](100)), Foo(int[1](200)))[i].value[0]; Signed-off-by: Maarten Lankhorst <maarten.lankhorst@canonical.com>	2014-04-17 10:34:10 +02:00
Maarten Lankhorst	49d26a277d	nouveau/vdec: small fixes to h264 handling nouveau_vp3_inter_sizes requires sliec_count as argument just as the other places that call it from h264 code do. Hopefully fixes something. Fix the status_vp code to allow status == 0 too, when processing hasn't started yet. set h264->second_field correctly.	2014-04-17 10:30:39 +02:00
Thomas Hellstrom	09cd376353	st/xa: Cache render target surface Otherwise it will trick the gallium driver into thinking that the render target has actually changed (due to different pipe_surface pointing to same underlying pipe_resource). This is really badness for tiling GPUs like adreno. This also appears to fix a rendering error with Motif on vmwgfx. Why that is is still under investigation. Based on an idea by Rob Clark. Cc: "10.0 10.1" <mesa-stable@lists.freedesktop.org> Signed-off-by: Thomas Hellstrom <thellstrom@vmware.com> Reviewed-by: Rob Clark <robclark@freedesktop.org>	2014-04-17 09:56:28 +02:00
Rob Clark	a45ae814d1	st/xa: scissor to help tilers Keep track of the maximal bounds of all the operations and set scissor accordingly. For tiling GPU's this can be a big win by reducing the memory bandwidth spent moving pixels from system memory to tile buffer and back. You could imagine being more sophisticated and splitting up disjoint operations. But this simplistic approach is good enough for the common cases. Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com>	2014-04-17 09:42:06 +02:00
Rob Clark	3c52013273	st/xa: remove unneeded args Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com>	2014-04-17 09:40:42 +02:00
Iago Toral Quiroga	cda5e0c25e	glsl: Small optimization for constant conditionals Once the relevant branch has been identified do not iterate over the instructions in the branch, do a linked list insertion instead to avoid the loop. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-16 23:39:57 -07:00
Iago Toral Quiroga	4472ab9e6d	glsl: Fix incorrect indentation. Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-16 23:22:24 -07:00
Chris Forbes	d1b6f67110	meta: Clip src/dest rects in BlitFramebuffer, using the scissor Fixes piglit's fbo-blit-stretch test on drivers which use the meta path. (i965: should fix Broadwell, but also fixes Sandybridge/Ivybridge/Haswell since this test falls off the blorp path now due to format conversion) V2: Use scissor instead of just mangling the rects, to avoid texcoord rounding problems. (Thanks Marek) V3: Rebase on Eric's CTSI meta changes; re-add _mesa_update_state in the CTSI path so that _mesa_clip_blit sees the correct bounds. Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=77414 Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Tested-by: Anuj Phogat <anuj.phogat@gmail.com>	2014-04-17 18:11:24 +12:00
Samuel Iglesias Gonsalvez	9927180714	mesa: fix check for dummy renderbuffer in _mesa_FramebufferRenderbufferEXT() According to the spec: <renderbuffertarget> must be RENDERBUFFER and <renderbuffer> should be set to the name of the renderbuffer object to be attached to the framebuffer. <renderbuffer> must be either zero or the name of an existing renderbuffer object of type <renderbuffertarget>, otherwise an INVALID_OPERATION error is generated. This patch changes the previous returned GL_INVALID_VALUE to GL_INVALID_OPERATION. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=76894 Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Samuel Iglesias Gonsalvez <siglesias@igalia.com>	2014-04-16 23:00:40 -07:00
Matt Turner	42a26cb5e4	i965: Don't make instructions with a null dest a barrier to scheduling. Now that we properly track accumulator dependencies, the scheduler is able to schedule instructions between the mach and mov in the common the integer multiplication pattern: mul acc0, x, y mach null, x, y mov dest, acc0 Since a null destination implies no dependency on the destination, we can also safely schedule instructions (that don't write the accumulator) between the mul and mach. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-16 22:46:45 -07:00
Juha-Pekka Heikkila	a6860100b8	i965/fs: Change fs_visitor::emit_lrp to use MAC for gen<6 This allows us to emit ADD/MUL/MAC instead of MUL/ADD/MUL/ADD, saving one instruction and two temporary registers. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Juha-Pekka Heikkila <juhapekka.heikkila@gmail.com>	2014-04-16 22:46:45 -07:00
Juha-Pekka Heikkila	da0c3b02e7	i965/fs: Add support for the MAC instruction. This allows us to generate the MAC (multiply-accumulate) instruction, which can be used to implement some expressions in fewer instructions than doing a series of MUL and ADDs. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Juha-Pekka Heikkila <juhapekka.heikkila@gmail.com>	2014-04-16 22:46:45 -07:00
Juha-Pekka Heikkila	2dfbbeca50	i965/vec4: Change vec4_visitor::emit_lrp to use MAC for gen<6 This allows us to emit ADD/MUL/MAC instead of MUL/ADD/MUL/ADD, saving one instruction and two temporary registers. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Juha-Pekka Heikkila <juhapekka.heikkila@gmail.com>	2014-04-16 22:46:45 -07:00

1 2 3 4 5 ...

62486 commits