fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-06 18:08:40 +02:00

Author	SHA1	Message	Date
Roland Scheidegger	b5918d8f1d	gallivm: fix a trivial txq issue for 2d shadow and cube shadow samplers. Untested (couldn't get the piglit test to run even with version overrides), but it seemed blatantly wrong. In any case it would only affect an error case, and if that happened all hope would probably be lost anyway. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-11-29 15:31:46 +01:00
Roland Scheidegger	6d50148742	llvmpipe: support array textures This adds array (1d,2d) texture support to llvmpipe. Though probably should do something about 1d array textures requiring gobs of memory (this issue is not strictly limited to arrays but it is probably worse there). Initial code by Jakob Bornecrantz <jakob@vmware.com> Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-11-29 15:30:19 +01:00
Roland Scheidegger	95e03914d8	gallivm: support array textures Support 1d and 2d array textures (including shadow samplers), and (as a side effect mostly) also shadow cube samplers. Seems to pass the relevant piglit tests both for sampling and rendering to (though some require version overrides). Since we don't support render target indices rendering to array textures is still restricted to a single layer at a time. Also, the min/max layer in the sampler view (which is unnecessary for GL) is ignored (always use all layers). Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-11-29 15:28:25 +01:00
José Fonseca	88e92f5bcd	llvmpipe: Remove lp_build_blend_soa() No longer used/necessary, as we always blend in AoS now. Trivial.	2012-11-29 14:08:43 +00:00
José Fonseca	75da95c50a	llvmpipe: Eliminate color buffer swizzling. Now dead code. Also had to remove the show_tiles/show_subtiles because now the color buffers are always stored in their native format, so there is no longer an easy way to paint the tile sizes. Depth-stencil buffers are still swizzled. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2012-11-29 14:08:43 +00:00
José Fonseca	6916387e53	llvmpipe: Only advertise unswizzled formats. Update llvmpipe_is_format_supported and llvmpipe_is_format_unswizzled so that only the formats that we can render without swizzling are advertised. We can still render all D3D10 required formats except PIPE_FORMAT_R11G11B10_FLOAT, which needs to be implemented in a future opportunity. Removal of rendertarget swizzling will be done in a subsequent change. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2012-11-29 14:08:42 +00:00
José Fonseca	9f06061d50	util/u_format: Kill util_format_is_array(). It is buggy (it was giving wrong results for some of the formats with padding), and util_format_description::is_array already does precisely what's intended. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2012-11-29 14:08:42 +00:00
José Fonseca	a47674ee89	util/u_format: Tighten the meaning of is_array bit to exclude mixed type formats. This is what we want in practice. The only change is in PIPE_FORMAT_R8SG8SB8UX8U_NORM, which no longer is considered an array format. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2012-11-29 14:08:42 +00:00
Adhemerval Zanella	64e9ec634b	util/u_format: Fix format manipulation for big-endian This patch fixes various format manipulation for big-endian architectures. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-11-29 11:54:23 +00:00
Adhemerval Zanella	e25abacc18	gallivm: Fix format manipulation for big-endian This patch fixes various format manipulation for big-endian architectures. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-11-29 11:54:18 +00:00
Adhemerval Zanella	b772d784b2	gallivm: Add byte-swap construct calls This patch adds two more functions in type conversions header: * lp_build_bswap: construct a call to llvm.bswap intrinsic for an element * lp_build_bswap_vec: byte swap every element in a vector based on the input and output types. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-11-29 11:54:14 +00:00
Adhemerval Zanella	86902b5134	gallivm: Fix vector constant for shuffle This patch fixes the vector constant generation used for vector shuffle for big-endian machines. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-11-29 11:54:10 +00:00
Adhemerval Zanella	29ba79b2c9	gallivm: clear Altivec NJ bit This patch enforces clearing of the NJ bit in the VSCR Altivec register so denormal numbers are handled as expected by IEEE standards. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-11-29 11:52:05 +00:00
Adhemerval Zanella	43ce9efdbf	gallivm: Altivec floating-point rounding This patch adds Altivec intrinsics for float vector types. It changes the SSE-specific definitions to platform-neutral ones and adds the calls to the Altivec intrinsic builder. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-11-29 11:52:00 +00:00
Adhemerval Zanella	dd5c580816	gallivm: Altivec vector add/sub intrinsics This patch adds correct vector addition and subtraction intrinsics when using Altivec with PPC. Current code uses the default path and the LLVM backend ends up issuing carry-out arithmetic instructions where saturated ones are expected. It also includes a fix for PowerPC, where char is unsigned by default, resulting in bogus values for vector shifting. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-11-29 11:51:53 +00:00
Adhemerval Zanella	2ea7d3dabd	gallivm: Altivec vector max/min intrinsics This patch adds the PPC Altivec max/min intrinsics for supported Altivec vector types (16xi8, 8xi16, 4xi32, 4xf32). Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-11-29 11:51:46 +00:00
Adhemerval Zanella	31c63b058e	gallivm: Altivec pack/unpack intrinsics This patch adds PPC Altivec support for pack/unpack operations using Altivec supported vector types (8xi8, 16xi16, 4xi32, 4xf32). Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-11-29 11:51:41 +00:00
Michel Dänzer	8b6aec6533	radeonsi: Bitcast result of packf16 intrinsic to float for export intrinsic. Fixes 7 piglit tests, and prevents many more from crashing. Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-and-Tested-by: Christian König <christian.koenig@amd.com>	2012-11-29 10:08:53 +01:00
Kenneth Graunke	c102360800	i965/vs: Move struct brw_compile (p) entirely inside vec4_generator. The brw_compile structure contains the brw_instruction store and the brw_eu_emit.c state tracking fields. These are only useful for the final assembly generation pass; the earlier compilation stages don't need them. This also means that the code generator for future hardware won't have access to the brw_compile structure, which is extremely desirable because it prevents accidental generation of Gen4-7 code. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-11-28 18:16:01 -08:00
Kenneth Graunke	eda9726ef5	i965/vs: Split final assembly code generation out of vec4_visitor. Compiling shaders requires several main steps: 1. Generating VS IR from either GLSL IR or Mesa IR 2. Optimizing the IR 3. Register allocation 4. Generating assembly code This patch splits out step 4 into a separate class named "vec4_generator." There are several reasons for doing so: 1. Future hardware has a different instruction encoding. Splitting this out will allow us to replace vec4_generator (which relies heavily on the brw_eu_emit.c code and struct brw_instruction) with a new code generator that writes the new format. 2. It reduces the size of the vec4_visitor monolith. (Arguably, a lot more should be split out, but that's left for "future work.") 3. Separate namespaces allow us to make helper functions for generating instructions in both classes: ADD() can exist in vec4_visitor and create IR, while ADD() in vec4_generator() can create brw_instructions. (Patches for this upcoming.) Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-11-28 18:15:58 -08:00
Kenneth Graunke	db6231fece	i965/vs: Abort on unsupported opcodes rather than failing. Final code generation should never fail. This is a bug, and there should be no user-triggerable cases where this could occur. Also, we're not going to have a fail() method after the split. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-11-28 18:15:57 -08:00
Kenneth Graunke	8af8a26480	i965/vs: Move uses of brw_compile from do_vs_prog to brw_vs_emit. The brw_compile structure is closely tied to the Gen4-7 hardware encoding. However, do_vs_prog is very generic: it just calls out to get a compiled program and then uploads it. This isn't ultimately where we want it, but it's a step in the right direction: it's now closer to the code generator. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-11-28 18:15:55 -08:00
Kenneth Graunke	746fc346ea	i965/vs: Rework memory contexts for shader compilation data. During compilation, we allocate a bunch of things: the IR needs to last at least until code generation...and then the program store needs to last until after we upload the program. For simplicity's sake, just keep it all around until we upload the program. After that, it can all be freed. This will also save a lot of headaches during the upcoming refactoring. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-11-28 18:15:53 -08:00
Kenneth Graunke	031146736c	i965/vs: Pass the brw_context pointer into brw_compute_vue_map(). We used to steal it out of the brw_compile struct, but that won't be initialized in time soon (and is eventually going away). Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-11-28 18:15:51 -08:00
Kenneth Graunke	403bb1d306	i965/vs: Pass the brw_context pointer into vec4_visitor and do_vs_prog. We used to steal it out of the brw_compile struct...but vec4_visitor isn't going to have one of those in the future. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-11-28 18:15:50 -08:00
Kenneth Graunke	dd50c88386	i965/vs: Move some functions from brw_vec4_emit.cpp to brw_vec4.cpp. This leaves only the final code generation stage in brw_vec4_emit.cpp, moving the payload setup, run(), and brw_vs_emit functions to brw_vec4.cpp. The fragment shader backend puts these functions in brw_fs.cpp, so this patch also helps with consistency. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2012-11-28 18:15:26 -08:00
Kenneth Graunke	9947470655	meta: Don't try to glOrtho when the draw buffer isn't initialized. I ran across this while running a glGenerateMipmap() test. _meta_GenerateMipmap sets MESA_META_TRANSFORM, which causes _mesa_meta_begin to try and set a default orthographic projection. Unfortunately, if the drawbuffer isn't set up, ctx->DrawBuffer->Width and Height are 0, which just causes a GL_INVALID_VALUE error. Fixes oglconform's fbo/mipmap.automatic, mipmap.manual, and mipmap.manualIterateTexTargets. Reviewed-by: Brian Paul <brianp@vmware.com>	2012-11-28 18:12:07 -08:00
Jason Wood	8d1ee38a4c	docs: Mark some features in GL3.txt as done for r600 Signed-off-by: Marek Olšák <maraeo@gmail.com>	2012-11-29 01:07:26 +01:00
Marek Olšák	aa46cc2879	st/mesa: allow forward-compatible contexts and set Const.ContextFlags Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-11-29 01:07:26 +01:00
Marek Olšák	249f86e3f8	st/mesa: add support for GL core profiles The rest of the plumbing was in place already. I have tested this by turning on all GL 3.1 features. The drivers not supporting GL 3.1 will fail to create a core profile as they should. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2012-11-29 01:07:26 +01:00
Marek Olšák	f9429e30aa	configure.ac: remove -fomit-frame-pointer from LLVM flags Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2012-11-29 00:07:27 +01:00
Marek Olšák	3d59cde92e	configure.ac: look for whole words in LLVM flags, not prefixes Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2012-11-29 00:07:27 +01:00
Marek Olšák	9b67a347f6	configure.ac: consolidate stripping unwanted LLVM flags Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2012-11-29 00:07:27 +01:00
Marek Olšák	a84a8da4f8	configure.ac: print LLVM flags to see what we're mixing with ours Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2012-11-29 00:07:27 +01:00
Brian Paul	0904973e39	util: add more memory debugging features Add a DEBUG_FREED_MEMORY option to help catch use-after-free errors. Add debug_memory_check() function which can be periodically called to check that all known blocks are good. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2012-11-28 15:03:29 -07:00
José Fonseca	1cead8845b	llvmpipe: Implement logic ops for the AoS path. It was forgotten in the previous patch series, but it is trivial to implement, based on the SoA path. This fixes glean logicOp failures.	2012-11-28 20:45:18 +00:00
José Fonseca	547efc76df	llvmpipe: Don't use dynamically sized arrays. Unfortunately, for MSVC, arrays sized by a constant variable are still considered dynamically sized.	2012-11-28 19:58:47 +00:00
Eric Anholt	c8ed9f6262	i965/gen4-5: Fix segfaults with stencil-only depth/stencil setups. Fixes a ton of piglit regressions since the depthstencil fixes for gen6+. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=57309 Reviewed-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-11-28 11:26:41 -08:00
Eric Anholt	b9b033d8e4	i965/fs: Don't generate saturates over existing variable values. Fixes a crash in http://workshop.chromeexperiments.com/stars/ on i965, and the new piglit test glsl-fs-clamp-5. We were trying to emit a saturating move into a uniform, which the code generator appropriately choked on. This was broken in the change in `32ae8d3b32`. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=57166 NOTE: This is a candidate for the 9.0 branch. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-11-28 11:26:34 -08:00
Eric Anholt	154ef07aa7	i965/fs: Add some minimal backend-IR dumping. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-11-28 11:26:33 -08:00
James Benton	960ab06da0	llvmpipe: Update llvmpipe_is_format_unswizzled to reflect latest changes. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-11-28 19:14:36 +00:00
James Benton	66fdf626bb	llvmpipe: Enable vertex color clamping. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-11-28 19:14:36 +00:00
James Benton	fa1b481c09	llvmpipe: Unswizzled rendering. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-11-28 19:14:36 +00:00
James Benton	1d3789bccb	gallivm: Updated lp_build_const_mask_aos to input number of channels. Also updated lp_build_const_mask_aos_swizzled to reflect this. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-11-28 19:14:36 +00:00
James Benton	d03d29a044	util: Updated util_format_is_array to be more accurate. Will allow formats with padding, e.g. RGBX. Will now allow swizzled formats as long as the alpha is channel 3. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-11-28 19:14:36 +00:00
James Benton	e66ec7c46b	gallivm: Added support for float to half-float conversion in lp_build_conv. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-11-28 19:14:36 +00:00
James Benton	d7a8390a82	gallivm: Changed lp_build_pad_vector to correctly handle scalar argument. Removed the lp_type argument as it was unnecessary. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-11-28 19:14:36 +00:00
James Benton	71c6fe76c0	gallivm: Add a function to generate lp_type for a format. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-11-28 19:14:36 +00:00
James Benton	cd548836a1	gallivm: Add support for unorm16 in lp_build_mul. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2012-11-28 19:14:20 +00:00
Matt Turner	c3a465ae98	glcpp: Support #elif(expression) with no intervening space. And add test cases to ensure that this works - 110 verifies that glcpp rejects #elif<digits> which glcpp previously accepted. - 111 verifies that glcpp accepts #if followed immediately by (, +, -, !, or ~. - 112 does the same as 111 but for #elif. See `17f9beb6` for #if change. Reviewed-by: Carl Worth <cworth@cworth.org>	2012-11-28 10:27:02 -08:00

53866 commits