fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-06 11:38:05 +02:00

Author	SHA1	Message	Date
Eric Anholt	477f8cd08b	glsl: Apply the transformation "(a ^^ a) -> false" in opt_algebraic. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2013-11-15 11:33:07 -08:00
Eric Anholt	58a98d32e4	glsl: Apply the transformation "(a && a) -> a" in opt_algebraic. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2013-11-15 11:33:07 -08:00
Eric Anholt	ee27048262	glsl: Apply the transformation "(a \|\| a) -> a" in opt_algebraic. total instructions in shared programs: 1732385 -> 1732373 (-0.00%) instructions in affected programs: 416 -> 404 (-2.88%) GAINED: 0 LOST: 0 (That's 4 already-short fragment shaders in dota2) Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2013-11-15 11:33:07 -08:00
Eric Anholt	8957c6b887	glsl: Move the CSE equality functions to the ir class. I want to reuse them in opt_algebraic. v2: Merge in Chris Forbes's break fix. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2013-11-15 11:33:07 -08:00
Matt Turner	fc51e7ac58	clover: Remove dead file from Makefile.sources. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com>	2013-11-15 11:10:32 -08:00
Kenneth Graunke	4ec982ad01	i965: Rework brw_new_batch to actually start a new batch. Previously, brw_new_batch was called just after execbuf, but before intel_batchbuffer_reset. Essentially, it prepared for the creation of a new batch, that wasn't yet available, and which it didn't create. This was a bit awkward. This patch makes brw_new_batch call intel_batchbuffer_reset as the very first operation. This means that brw_new_batch actually creates a new batchbuffer, and thus has it available. It brings the creation of the new batchbuffer and BRW_NEW_BATCH flagging together into one place. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2013-11-15 10:24:07 -08:00
Kenneth Graunke	720d935fff	i965: Move cache_used_by_gpu flag setting to brw_finish_batch. It really makes more sense here. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2013-11-15 10:24:07 -08:00
Ian Romanick	96a3527a63	i915: Actually enable __DRI2rendererQueryExtensionRec More rebase fail. This code was written long before i915 and i965 were split, so most of the code in i9[16]5/intel_screen.c only needed to exist in one place. It looks like I fixed n-1 of those places after rebasing on the split. I only found this from the defined-but-not-used warning for intelRendererQueryExtension. I noticed this while fixing the other, related warnings. (Note: During review, we decided to not pick this back to 10.0.) Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Cc: Daniel Vetter <daniel@ffwll.ch> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Paul Berry <stereotype441@gmail.com>	2013-11-15 10:10:29 -08:00
Aaron Watry	2be85e2492	radeon/llvm: Free elf_buffer after use Prevents a memory leak. v2: Remove null check CC: "10.0" <mesa-stable@lists.freedesktop.org>	2013-11-15 09:53:31 -08:00
Aaron Watry	01f3622c74	r600/llvm: Free binary.code/binary.config in r600_llvm_compile radeon_llvm_compile allocates memory for binary.code, binary.config, or neither depending on what's being done. We need to make sure to free that memory after it's no longer needed. v2: Don't bother checking for null before FREE() CC: "10.0" <mesa-stable@lists.freedesktop.org>	2013-11-15 09:53:31 -08:00
Aaron Watry	dd73b99420	r600/llvm: initialize radeon_llvm_binary use memset to initialize to 0's... otherwise code_size and config_size could be uninitialized when read later in this method. It's also hard to do NULL checks on uninitialized pointers. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> v2: Fix indentation CC: "10.0" <mesa-stable@lists.freedesktop.org>	2013-11-15 09:53:31 -08:00
Brian Paul	2bc1680665	svga: remove unused vars in svga_hwtnl_simple_draw_range_elements() And simplify the code. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-11-15 10:27:01 -07:00
Brian Paul	1a36dfb21e	svga: print warning for unsupported indirect dest reg indexing For DX9-level shaders, there's only limited support for indirect indexing of registers (with the loop counter register, not the general address register.) Reviewed-by: José Fonseca <jfonseca@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2013-11-15 10:23:49 -07:00
Brian Paul	3969330b47	svga: mark dest image as defined in svga_surface_copy() After we blit/copy to a dest texture image we need to mark it as being defined. This fixes broken mipmap generation for quite a few texture formats. Mipgen involves making texture views and svga_texture_view_surface() skips texture images that are undefined. Cc: "10.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: José Fonseca <jfonseca@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2013-11-15 10:23:48 -07:00
Brian Paul	79984b9928	svga: do primitive trimming in translate_indices() The index translation code expects the number of indexes to be consistent with the primitive type (ex: a multiple of 3 for PIPE_PRIM_TRIANGLES). If it's not, we can write out of bounds in the destination buffer. Fixes failed assertions in the pipebuffer debug code found with Piglit primitive-restart-draw-mode test. Cc: "10.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: José Fonseca <jfonseca@vmware.com>	2013-11-15 10:23:48 -07:00
Brian Paul	491d6397fc	indices: add comments, assertions in u_indices.c file Reviewed-by: José Fonseca <jfonseca@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2013-11-15 10:23:48 -07:00
Brian Paul	2253fed4a0	mesa: remove duplicated prototypes in varray.h	2013-11-15 10:23:48 -07:00
Aaron Watry	598f61ba28	gallium/pipe_loader: un-reference udev resources when we're done with them. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> CC: "10.0" <mesa-stable@lists.freedesktop.org>	2013-11-15 09:16:49 -08:00
Aaron Watry	4c6ac9e614	radeonsi/compute: Dispose of LLVM module after compiling kernels v2: Fix indentation Reviewed-by: Tom Stellard <thomas.stellard@amd.com> CC: "10.0" <mesa-stable@lists.freedesktop.org>	2013-11-15 09:16:49 -08:00
Aaron Watry	35dad4a1e2	radeonsi/compute: Free program and program.kernels on shutdown v2: Fix indentation Reviewed-by: Tom Stellard <thomas.stellard@amd.com> CC: "10.0" <mesa-stable@lists.freedesktop.org>	2013-11-15 09:16:49 -08:00
Aaron Watry	d41b10f811	radeon/llvm: Free created llvm memory buffer v2: Fix indentation Reviewed-by: Tom Stellard <thomas.stellard@amd.com> CC: "10.0" <mesa-stable@lists.freedesktop.org>	2013-11-15 09:16:49 -08:00
Aaron Watry	a2b93da84b	radeon/llvm: Free libelf resources v2: Fix indentation Reviewed-by: Tom Stellard <thomas.stellard@amd.com> CC: "10.0" <mesa-stable@lists.freedesktop.org>	2013-11-15 09:16:49 -08:00
Aaron Watry	df482fe02f	radeon/llvm: fix spelling error Reviewed-by: Tom Stellard <thomas.stellard@amd.com> CC: "10.0" <mesa-stable@lists.freedesktop.org>	2013-11-15 09:16:49 -08:00
Tom Stellard	17af4dd52b	clover: Support multiple devices in clCreateContextFromType() v2 v2: - Use clGetDeviceIDs to query devices. Reviewed-by: Francisco Jerez <currojerez@riseup.net> CC: "10.0" <mesa-stable@lists.freedesktop.org>	2013-11-15 09:16:48 -08:00
Paul Berry	f38ac41ed4	glsl: Rework interface block linking. Previously, when doing intrastage and interstage interface block linking, we only checked the interface type; this prevented us from catching some link errors. We now check the following additional constraints: - For intrastage linking, the presence/absence of interface names must match. - For shader ins/outs, the interface names themselves must match when doing intrastage linking (note: it's not clear from the spec whether this is necessary, but Mesa's implementation currently relies on it). - Array vs. nonarray must be consistent, taking into account the special rules for vertex-geometry linkage. - Array sizes must be consistent (exception: during intrastage linking, an unsized array matches a sized array). Note: validate_interstage_interface_blocks currently handles both uniforms and in/out variables. As a result, if all three shader types are present (VS, GS, and FS), and a uniform interface block is mentioned in the VS and FS but not the GS, it won't be validated. I plan to address this in later patches. Fixes the following piglit tests in spec/glsl-1.50/linker: - interface-blocks-vs-fs-array-size-mismatch - interface-vs-array-to-fs-unnamed - interface-vs-unnamed-to-fs-array - intrastage-interface-unnamed-array v2: Simplify logic in intrastage_match() for handling array sizes. Make extra_array_level const. Use an unnamed temporary interface_block_definition in validate_interstage_interface_blocks()'s first call to definitions->store(). Cc: "10.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2013-11-15 08:56:28 -08:00
Paul Berry	b4c3b833ec	i965: Fix vertical alignment for multisampled buffers. From the Sandy Bridge PRM, Vol 1 Part 1 7.18.3.4 (Alignment Unit Size): j [vertical alignment] = 4 for any render target surface is multisampled (4x) From the Ivy Bridge PRM, Vol 4 Part 1 2.12.2.1 (SURFACE_STATE for most messages), under the "Surface Vertical Alignment" heading: This field is intended to be set to VALIGN_4 if the surface was rendered as a depth buffer, for a multisampled (4x) render target, or for a multisampled (8x) render target, since these surfaces support only alignment of 4. Back in 2012 when we added multisampling support to the i965 driver, we forgot to update the logic for computing the vertical alignment, so we were often using a vertical alignment of 2 for multisampled buffers, leading to subtle rendering errors. Note that the specs also require a vertical alignment of 4 for all Y-tiled render target surfaces; I plan to address that in a separate patch. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=53077 Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2013-11-15 08:54:15 -08:00
Paul Berry	46e9f78efc	main: Fix MaxUniformComponents for geometry shaders. For both vertex and fragment shaders we default MaxUniformComponents to 4 * MAX_UNIFORMS. It makes sense to do this for geometry shaders too; if back-ends have different limits they can override them as necessary. Fixes piglit test: spec/glsl-1.50/built-in constants/gl_MaxGeometryUniformComponents Cc: "10.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2013-11-15 08:47:41 -08:00
José Fonseca	420ccf7b8f	tools/trace: Several bugfixes/improvements to dump_state.py - Don't crash with user memory pointers. - Support old bind__sampler_ methods. Useful when comparing dumps from old branches. - Misc.	2013-11-15 15:42:02 +00:00
José Fonseca	c5a05a6aef	trace: Dump user_buffer members.	2013-11-15 15:32:33 +00:00
Fredrik Höglund	ff353c218a	mesa: Fix derived vertex state not being updated in glCallList() AEcontext::NewState is not always set when the vertex array state is changed. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=71492 Cc: "10.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: José Fonseca <jfonseca@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2013-11-15 15:23:23 +00:00
Alex Deucher	469b42ee21	radeonsi: add Hawaii pci ids Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>	2013-11-15 08:51:20 -05:00
Alex Deucher	f5778f152b	radeonsi: add support for Hawaii asics (v2) Update additional register fields. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>	2013-11-15 08:51:09 -05:00
Vinson Lee	78fc159d68	i965: Initialize schedule_node::delay. Fixes "Uninitialized scalar field" defect reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Paul Berry <stereotype441@gmail.com>	2013-11-14 22:36:26 -08:00
Alexander von Gluck IV	f7ce1d772d	haiku/swrast: Inherit gl_config, fix flush * Inherit gl_context so we always have access to it * Thanks curro for the idea. * Last Haiku cannidate for 10.0.0 Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2013-11-14 12:33:03 -06:00
Roland Scheidegger	473cb3fe4a	llvmpipe: (trivial) fix more fallout from the setup cleanup. Oops... Should have done some more testing.	2013-11-14 15:49:42 +00:00
Roland Scheidegger	5190c16a04	llvmpipe: (trivial) fix misplaced bld context assignment. Should fix polygon offset crashes...	2013-11-14 14:44:15 +00:00
José Fonseca	a29e40a423	gallivm: Compile flag to debug TGSI execution through printfs. It is similar to tgsi_exec.c's DEBUG_EXECUTION compile flag. I had prototyped this for a while while debugging an issue, but finally cleaned this up and added a few more bells and whistles. v2: Use '$' as marker; better output. Thanks to Brian, Zack and Roland reviews. Here is a sample output. CONST[0].x = 0.00625000009 0.00625000009 0.00625000009 0.00625000009 CONST[0].y = -0.00714285718 -0.00714285718 -0.00714285718 -0.00714285718 CONST[0].z = -1 -1 -1 -1 CONST[0].w = 1 1 1 1 IN[0].x = 143.5 175.5 175.5 143.5 IN[0].y = 123.5 123.5 155.5 155.5 IN[0].z = 0 0 0 0 IN[0].w = 1 1 1 1 $ 1: RCP TEMP[0].w, IN[0].wwww TEMP[0].w = 1 1 1 1 $ 2: MAD TEMP[0].xy, IN[0], CONST[0], CONST[0].zwzw TEMP[0].x = -0.103124976 0.0968750715 0.0968750715 -0.103124976 TEMP[0].y = 0.117857158 0.117857158 -0.110714316 -0.110714316 $ 3: MUL OUT[0].xy, TEMP[0], TEMP[0].wwww OUT[0].x = -0.103124976 0.0968750715 0.0968750715 -0.103124976 OUT[0].y = 0.117857158 0.117857158 -0.110714316 -0.110714316 $ 4: MUL OUT[0].z, IN[0].zzzz, TEMP[0].wwww OUT[0].z = 0 0 0 0 $ 5: MOV OUT[0].w, TEMP[0] OUT[0].w = 1 1 1 1 $ 6: END OUT[0].x = -0.103124976 0.0968750715 0.0968750715 -0.103124976 OUT[0].y = 0.117857158 0.117857158 -0.110714316 -0.110714316 OUT[0].z = 0 0 0 0 OUT[0].w = 1 1 1 1	2013-11-14 14:04:28 +00:00
Roland Scheidegger	673d5391a2	softpipe: (trivial) fix debug code The debug printfs wouldn't actually compile when enabled, so kill them off and insert some new one in another place, and make sure it keeps compiling by enclosing it in a if-0 clause.	2013-11-14 12:24:55 +00:00
Roland Scheidegger	2dd693412a	llvmpipe: clean up state setup code a bit In particular get rid of home-grown vector helpers which didn't add much. And while here fix formatting a bit. No functional change. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-11-14 12:24:55 +00:00
Roland Scheidegger	754319490f	gallivm,llvmpipe: fix float->srgb conversion to handle NaNs d3d10 requires us to convert NaNs to zero for any float->int conversion. We don't really do that but mostly seems to work. In particular I suspect the very common float->unorm8 path only really passes because it relies on sse2 pack intrinsics which just happen to work by luck for NaNs (float->int conversion in hw gives integer indeterminate value, which just happens to be -0x80000000 hence gets converted to zero in the end after pack intrinsics). However, float->srgb didn't get so lucky, because we need to clamp before blending and clamping resulted in NaN behavior being undefined (and actually got converted to 1.0 by clamping with sse2). Fix this by using a zero/one clamp with defined nan behavior as we can handle the NaN for free this way. I suspect there's more bugs lurking in this area (e.g. converting floats to snorm) as we don't really use defined NaN behavior everywhere but this seems to be good enough. While here respecify nan behavior modes a bit, in particular the return_second mode didn't really do what we wanted. From the caller's perspective, we really wanted to say we need the non-nan result, but we already know the second arg isn't a NaN. So we use this now instead, which means that cpu architectures which actually implement min/max by always returning non-nan (that is adhering to ieee754-2008 rules) don't need to bend over backwards for nothing. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-11-14 12:24:55 +00:00
Ian Romanick	a15a19f0d1	dri: Change value param to unsigned This silences some compiler warnings in i915 and i965. See also `75982a5`. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: "10.0" <mesa-stable@lists.freedesktop.org>	2013-11-13 14:49:27 -08:00
Ian Romanick	cb6182bdfa	i965: Use drm_intel_get_aperture_sizes instead of hard-coded 2GiB Systems with little physical memory installed will report less than 2GiB, and some systems may (hypothetically?) have a larger address space for the GPU. My IVB still reports 1534. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch> Cc: "10.0" <mesa-stable@lists.freedesktop.org>	2013-11-13 14:49:27 -08:00
Ian Romanick	9fe108db09	i915: Use drm_intel_get_aperture_sizes instead of drmAgpSize Send the zombie back to the grave before it infects the townsfolk. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch> Cc: "10.0" <mesa-stable@lists.freedesktop.org>	2013-11-13 14:49:26 -08:00
Alexander Monakov	279e8d2641	i965: implement blit path for PBO glDrawPixels This patch implements accelerated path for glDrawPixels from a PBO in i965. The code follows what intel_pixel_read, intel_pixel_copy, intel_pixel_bitmap and intel_tex_image are doing. Piglit quick.tests show no regressions. In my testing on IVB, performance improvement is huge (about 30x, didn't measure exactly) since generic path goes via _mesa_unpack_color_span_float, memcpy, extract_float_rgba. Signed-off-by: Alexander Monakov <amonakov@ispras.ru> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2013-11-13 12:20:59 -08:00
Brian Paul	19c2f40649	docs: fill in md5 checksums for 9.2.3 release	2013-11-13 10:06:23 -07:00
Brian Paul	c093cd3984	docs: fix 9.2.2 -> 9.2.3 typos	2013-11-13 10:03:35 -07:00
Alexander von Gluck IV	df91144a6d	haiku: add swrast driver * This is pretty small and upkeep should be minimal. * Currently fully working. * Cannidate for 10.0.0 branch Acked-by: Brian Paul <brianp@vmware.com>	2013-11-13 10:41:10 -06:00
Carl Worth	9976a176ae	docs: Import 9.2.3 release notes, add news item.	2013-11-13 07:32:47 -08:00
Kristian Høgsberg	e048953145	dri: Remove redundant createNewContext function from __DRIimageDriverExtension createContextAttribs is a superset of what createNewContext provides. Also remove the function typedef, since createNewContext is deprecated and no longer used in multiple interfaces. Signed-off-by: Kristian Høgsberg <krh@bitplanet.net> Reviewed-by: Eric Anholt <eric@anholt.net> Cc: "10.0" <mesa-stable@lists.freedesktop.org>	2013-11-12 16:08:17 -08:00
Kristian Høgsberg	68bb26bead	wayland: Use __DRIimage based getBuffers implementation when available This lets us allocate color buffers as __DRIimages and pass them into the driver instead of having to create a __DRIbuffer with the flink that requires. Signed-off-by: Kristian Høgsberg <krh@bitplanet.net> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Cc: "10.0" <mesa-stable@lists.freedesktop.org>	2013-11-12 16:08:17 -08:00

1 2 3 4 5 ...

59712 commits