fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-06 13:48:06 +02:00

Author	SHA1	Message	Date
José Fonseca	eda21d2a30	llvmpipe: Fix the bottom_edge_rule adjustment for points. The adjustment needs to be applied to the y coordinates and not the x coordinates, just like the equivalent code for lines and triangles in lp_setup_line.c and lp_setup_tri.c. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Zack Rusin <zackr@vmware.com>	2014-01-08 12:18:17 +00:00
José Fonseca	37de6b0682	llvmpipe: Respect bottom_edge_rule when computing the rasterization bounding boxes. This was inadvertently forgotten when replacing gl_rasterization_rules with lower_left_origin and half_pixel_center (commit `2737abb44e`). This makes a difference when lower_left_origin != half_pixel_center, e.g, D3D10. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Zack Rusin <zackr@vmware.com>	2014-01-08 12:18:17 +00:00
Chia-I Wu	76edf44f9e	ilo: enable HiZ The support is still early. Fast depth buffer clear is not enabled yet. HiZ can be forced off with ILO_DEBUG=nohiz.	2014-01-08 18:11:36 +08:00
Chia-I Wu	e7b4219e22	ilo: resolve Z/HiZ correctly When the depth buffer is to be read, perform a Depth Buffer Resolve if it has been rendered. When the depth buffer is to be rendered, perform a HiZ Buffer Resolve when the depth buffer is modified externally.	2014-01-08 18:11:35 +08:00
Chia-I Wu	77e3db464f	ilo: add flags to texture slices The flags are used to mark who (CPU, BLT, or RENDER) has accessed the resource and how (READ or WRITE).	2014-01-08 18:11:35 +08:00
Chia-I Wu	846f70a6ef	ilo: rename and add an accessor for texture slices Rename ilo_texture::slice_offsets to ilo_texture::slices and add an accessor, ilo_texture_get_slice().	2014-01-08 18:11:35 +08:00
Chia-I Wu	127fbc086b	ilo: add HiZ op support to the pipelines Add blitter functions to perform Depth Buffer Clear, Depth Buffer Resolve, and Hierarchical Depth Buffer Resolve. Those functions set ilo_blitter up and pass it to the pipelines to emit the commands.	2014-01-08 18:11:35 +08:00
Chia-I Wu	546416d495	ilo: add support for HiZ allocation Add tex_create_hiz() to create HiZ bo. It is not really called yet.	2014-01-08 18:11:35 +08:00
Chia-I Wu	e372819589	ilo: refactor separate stencil allocation Move separate stencil allocation code to tex_create_separate_stencil to keep tex_create sane.	2014-01-08 18:11:35 +08:00
Chia-I Wu	82676f5d34	ilo: assorted GPE fixes for HiZ Allow HiZ op to be specified in 3DSTATE_WM. Pass depth format directly in gen7_emit_3DSTATE_SF. Use tex->hiz.bo to determine if HiZ exists. Fix 3DSTATE_SF for the case when there is no ilo_rasterizer_state. Fix 3DSTATE_PS for the case when there is no ilo_shader_state.	2014-01-08 18:11:35 +08:00
Chia-I Wu	6642381e75	ilo: no layer offsetting on GEN7+ Even though the Ivy Bridge PRM lists some restrictions that require layer offsetting as the Sandy Bridge PRM does, it seems they are actually lifted.	2014-01-08 18:11:34 +08:00
Chia-I Wu	011fde4bf2	ilo: offset to layers only when necessary GEN6 has several requirements regarding the LOD/Depth/Width/Height of the render targets and the depth buffer. We used to offset to the layers in question unconditionally to meet the requirements. With this commit, offseting is done only when the requirements are not met.	2014-01-08 18:11:34 +08:00
Chia-I Wu	0a2a221d01	ilo: allow ilo_zs_surface to skip layer offsetting Make offset to layer optional in ilo_gpe_init_zs_surface.	2014-01-08 18:11:34 +08:00
Chia-I Wu	8d9f5d57e2	ilo: allow ilo_view_surface to skip layer offsetting Make offset to layer optional in ilo_gpe_init_view_surface_for_texture. render_cache_rw is always the same as is_rt and is replaced.	2014-01-08 18:11:34 +08:00
Tapani Pälli	0978a6966a	i965/fs: do SEL optimization only when src type for MOV matches Fixes a bug where then branch operates with ivec4 while else uses vec4. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=72379 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-01-08 07:06:45 +02:00
Kenneth Graunke	847bc36a38	glsl: Optimize pow(2, x) --> exp2(x). On Haswell, POW takes 24 cycles, while EXP2 only takes 14. Plus, using POW requires putting 2.0 in a register, while EXP2 doesn't. I believe that EXP2 will be faster than POW on basically all GPUs, so it makes sense to optimize it. Looking at the savage2 subset of shader-db: total instructions in shared programs: 113225 -> 113179 (-0.04%) instructions in affected programs: 2139 -> 2093 (-2.15%) instances of 'math pow': 795 -> 749 (-6.14%) instances of 'math exp': 389 -> 435 (11.8%) Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-01-07 12:54:57 -08:00
Kenneth Graunke	5e3fd6a9db	glsl: Refactor is_zero/one/negative_one into an is_value() method. This patch creates a new generic is_value() method, which checks if an ir_constant has a particular value. (For vectors, it must have the single value repeated across all components.) It then rewrites the is_zero/is_one/is_negative_one methods to use this generic helper. All three were basically identical except for the value they checked for. The other difference is that is_negative_one rejects boolean types. The new is_value function maintains this behavior, only allowing boolean types when checking for 0 or 1. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-01-07 12:54:57 -08:00
Kenneth Graunke	d6c1d66d3a	glsl: Optimize pow(1.0, X) --> 1.0. Surprisingly, this helps one vertex shader in 3DMMES. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-01-07 12:54:57 -08:00
Kenneth Graunke	05fbb021a6	mesa: Use get_local_param_pointer in glProgramLocalParameters4fvEXT(). Using the get_local_param_pointer helper ensures that the LocalParams arrays have actually been allocated before attempting to use them. glProgramLocalParameters4fvEXT needs to do a bit of extra checking, but it can be simplified since the helper has already validated the target. Fixes crashes in programs that use Cg (for example, Awesomenauts, Rocketbirds: Hardboiled Chicken, and Tiny and Big: Grandpa's Leftovers) since commit `e5885c119d` (mesa: Dynamically allocate the storage for program local parameters.) Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=73136 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com> Tested-by: Laurent Carlier <lordheavym@gmail.com>	2014-01-07 12:50:23 -08:00
José Fonseca	2d368b982a	llvmpipe: Basic implementation of pipe_context::set_sample_mask. We don't support MSAA (ie, number of samples is always one) therefore sample_mask boils down to a synonym of the rasterizer_discard flag. Also, this change makes setup actually use the value received in lp_setup_set_rasterizer_discard instead of reaching out to llvmpipe upper layers to re-fetch it. Based on Si Chen's draft. With this patch `wgf11multisample Coverage passes 100%` on the UMD D3D10 state tracker. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Si Chen <sichen@vmware.com>	2014-01-07 16:04:42 +00:00
José Fonseca	95bf222603	cso_context: Fix cso_context::sample_mask initial value. The initial value of cso_context::sample_mask_saved is irrelevant as it will be overwritten with cso_context::sample_mask in cso_save_sample_mask. Therefore it is cso_context::sample_mask that needs to be properly initialized. This fixes regressions in blits and mipmap generation after adding support for sample_mask to llvmpipe. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2014-01-07 16:04:42 +00:00
Si Chen	72c6d0e506	llvmpipe: Implement alpha_to_coverage for non-MSAA framebuffers. Implement Alpha to Coverage by discarding a fragment alpha component is less than 0.5. This is a joint work of Jose and Si. Reviewed-by: José Fonseca <jfonseca@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2014-01-07 16:04:42 +00:00
Andreas Fänger	2a0fb946e1	swrast: fix delayed texel buffer allocation regression for OpenMP Commit `9119269ca1` moved the texel buffer allocation to _swrast_texture_span(), however, when compiled with OpenMP support this code already runs multi-threaded so a critical section is required to prevent multiple allocations and rendering errors. Cc: "10.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2014-01-07 08:03:49 -07:00
Dave Airlie	aa4e2243a2	gallium/draw: remove double semicolon code cleanup. Signed-off-by: Dave Airlie <airlied@redhat.com>	2014-01-07 18:52:46 +10:00
Brian Paul	8d1400fe12	glsl: rename min(), max() functions to fix MSVC build Evidently, there's some other definition of "min" and "max" that causes MSVC to choke on these function names. Renaming to min2() and max2() fixes things. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-01-06 16:57:49 -07:00
Kenneth Graunke	f6b10544cd	i965: Remove unused PIPE_CONTROL defines. Both brw_defines.h and intel_reg.h defined PIPE_CONTROL fields, which had similar names, but couldn't be used in the same way. (One had built-in shifts, and the other didn't...) Delete the unused set to preserve sanity. (Eric wrote an almost identical patch back in August, so I believe he approves.) Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2014-01-06 15:45:42 -08:00
Vinson Lee	f8432832a7	mesa: Remove GLXContextID typedef from glxext.h. This patch fixes this build error with gcc <= 4.5 and clang <= 3.1. CC clientattrib.lo In file included from ../../include/GL/glx.h:333:0, from glxclient.h:45, from clientattrib.c:32: ../../include/GL/glxext.h:275:13: error: redefinition of typedef 'GLXContextID' ../../include/GL/glx.h:171:13: note: previous declaration of 'GLXContextID' was here Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=70591 Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-01-06 14:57:23 -08:00
Maxence Le Doré	a44ca3595e	docs/relnotes/10.1.html: report AMD_shader_trinary_minmax support Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-01-06 14:28:11 -08:00
Maxence Le Doré	1a9e8c23eb	mesa: enable AMD_shader_trinary_minmax Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-01-06 14:28:10 -08:00
Maxence Le Doré	eb5dc75601	glsl: implement mid3 built-in function Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-01-06 14:28:09 -08:00
Maxence Le Doré	73c7451587	glsl: implement max3 built-in function Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-01-06 14:28:08 -08:00
Maxence Le Doré	ce46e14729	glsl: Implement min3 built-in function Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-01-06 14:28:08 -08:00
Maxence Le Doré	61c450fc81	glsl: add min() and max() functions to builder.cpp Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-01-06 14:28:07 -08:00
Maxence Le Doré	cf70d2a7c0	glsl: add a shader_trinary_minmax predicate Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-01-06 14:28:06 -08:00
Maxence Le Doré	ff50493bb3	glsl: Add extension tracking for AMD_shader_trinary_minmax Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-01-06 14:28:02 -08:00
Alexander von Gluck IV	61ef697afc	haiku libGL: Move from gallium target to src/hgl * The Haiku renderers need to link to libGL to function properly in all usage contexts. As mesa drivers build before gallium targets, we couldn't properly link the mesa swrast driver to the gallium libGL target for Haiku. * This is likely better as it mimics how glx is laid out ensuring the Haiku libGL is better understood. * All renderers properly link in libGL now. Acked-by: Brian Paul <brianp@vmware.com>	2014-01-06 15:50:21 -06:00
Alexander von Gluck IV	b236314a11	haiku: Fix missing HaikuGL header paths Acked-by: Brian Paul <brianp@vmware.com>	2014-01-06 15:50:15 -06:00
Brian Paul	3486f6f31b	mesa: implement missing glGet(GL_RGBA_SIGNED_COMPONENTS_EXT) query This is part of the GL_EXT_packed_float extension. Bugzilla: http://bugs.freedesktop.org/show_bug.cgi?id=73096 Cc: 10.0 <mesa-stable@lists.freedesktop.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>	2014-01-06 13:37:00 -07:00
Eric Anholt	7db56ddee0	i965: Warning fix Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-01-06 10:54:22 -08:00
Kenneth Graunke	242ca9acb4	i965: Delete unused INTEL_WRITE_{PART,FULL} and INTEL_READ #defines. These are just software flag values (not hardware specific values), and aren't used anywhere. Delete them to avoid confusion. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2014-01-06 10:52:43 -08:00
Marek Olšák	346b6abab9	radeonsi: calculate NUM_BANKS for DB correctly on CIK NUM_BANKS is not constant on CIK. Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2014-01-06 18:40:42 +01:00
Marek Olšák	bf3c361113	radeonsi: set correct pipe config for Hawaii in DB Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2014-01-06 18:40:42 +01:00
Marek Olšák	2748b7da7e	radeonsi: disable HTILE for 1D-tiled depth-stencil buffers Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2014-01-06 18:40:41 +01:00
Juha-Pekka Heikkila	d41f5396f3	glx: check memory allocations in __glXInitVertexArrayState() Signed-off-by: Juha-Pekka Heikkila <juhapekka.heikkila@gmail.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2014-01-06 10:23:26 -07:00
Juha-Pekka Heikkila	0c04cca0e1	glx: Add missing null check in __glXNewIndirectAPI() Add extra null check in auto generated indirect_init.c via src/mapi/glapi/gen/glX_proto_send.py Signed-off-by: Juha-Pekka Heikkila <juhapekka.heikkila@gmail.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2014-01-06 10:23:12 -07:00
Nathan Kidd	0691b37732	docs: fix misspellings Fixed what I noticed; no warranty for exhaustiveness. Signed-off-by: Nathan Kidd <nkidd@opentext.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2014-01-06 09:55:38 -07:00
Chris Forbes	a61ae2aa01	i965: set size of txf_mcs payload vgrf properly Previously we left the size of this vgrf as 1, which caused register allocation to be subtly broken. If we were lucky we would explode in the post-alloc instruction scheduler; if we were unlucky we'd just stomp on someone else and get broken rendering. Fixes crash when running `tesseract` with the following settings: msaa 4 glineardepth 0 Also fixes the piglit test: arb_sample_shading-builtin-gl-sample-id Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Cc: Anuj Phogat <anuj.phogat@gmail.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=72859 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-01-04 20:24:29 +13:00
Erik Faye-Lund	eb212c5a30	glcpp: error on multiple #else/#elif directives The preprocessor currently accepts multiple else/elif-groups per if-section. The GLSL-preprocessor is defined by the C++ specification, which defines the following parse-rule: if-section: if-group elif-groups(opt) else-group(opt) endif-line This clearly only allows a single else-group, that has to come after any elif-groups. So let's modify the code to follow the specification. Add test to prevent regressions. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Carl Worth <cworth@cworth.org> Cc: 10.0 <mesa-stable@lists.freedesktop.org>	2014-01-02 14:22:58 -08:00
Carl Worth	6005e9cb28	glcpp: Replace multi-line comment with a space (even as part of macro definition) The preprocessor has always replaced multi-line comments with a single space character, (as required by the specification), but as of commit `bd55ba568b` the lexer also emitted a NEWLINE token for each newline within the comment, (in order to preserve line numbers). The emitting of NEWLINE tokens within the comment broke the rule of "replace a multi-line comment with a single space" as could be exposed by code like the following: #define FOO a/* */b FOO Prior to commit `bd55ba568b`, this code defined the macro FOO as "a b" as desired. Since that commit, this code instead defines FOO as "a" and leaves a stray "b" in the output. In this commit, we fix this by not emitting the NEWLINE tokens while lexing the comment, but instead merely counting them in the commented_newlines variable. Then, when the lexer next encounters a non-commented newline it switches to a NEWLINE_CATCHUP state to emit as many NEWLINE tokens as necessary (so that subsequent parsing stages still generate correct line numbers). Of course, it would have been more clear if we could have written a loop to emit all the newlines, but flex conventions prevent that, (we must use "return" for each token we emit). It similarly would have been clear to have a new rule restricted to the <NEWLINE_CATCHUP> state with an action much like the body of this if condition. The problem with that is that this rule must not consume any characters. It might be possible to write a rule that matches a single lookahead of any character, but then we would also need an additional rule to ensure for the <EOF> case where there are no additional characters available for the lookahead to match. Given those considerations, and given that the SKIP-state manipulation already involves a code block at the top of the lexer function, before any rules, it seems best to me to go with the implementation here which adds a similar pre-rule code block for the NEWLINE_CATCHUP. Finally, this commit also changes the expected output of a few, existing glcpp tests. The change here is that the space character resulting from the multi-line comment is now emitted before the newlines corresponding to that comment. (Previously, the newlines were emitted first, and the space character afterward.) Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=72686 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-01-02 14:15:51 -08:00
Carl Worth	61cea49014	glcpp: Add a more descriptive comment for the SKIP state manipulation Two things make this code confusing: 1. The uncharacteristic manipulation of lexer start state outside of flex rules. 2. The confusing semantics of the skip_stack (including the "lexing_if" override and the SKIP_NO_SKIP state). This new comment is intended to bring a bit more clarity for any readers. There is no intended beahvioral change to the code here. The actual code changes include better indentation to avoid an excessively-long line, and using the more descriptive INITIAL rather than 0. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-01-02 14:15:24 -08:00

1 2 3 4 5 ...

60347 commits