fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-06 11:38:05 +02:00

Author	SHA1	Message	Date
Roland Scheidegger	4dbca8672b	gallivm: (trivial) fix bogus assertion for per-element lod with 1d resources The assertion was always broken but the code unused until enabling the per-element lod code. Fixes piglit texelFetch vs isampler1D and similar tests (only run with GL 3.0 version override).	2013-07-05 01:19:23 +02:00
Roland Scheidegger	f3bbf65929	gallivm: do per-pixel lod calculations for explicit lod d3d10 requires per-pixel lod calculations for explicit lod, lod bias and explicit derivatives, and we should probably do it for OpenGL too - at least if they are used from vertex or geometry shaders (so doesn't apply to lod bias) this doesn't just affect neighboring pixels. Some code was already there to handle this so fix it up and enable it. There will no doubt be a performance hit unfortunately, we could do better if we'd knew we had a real vector shift instruction (with variable shift count) but this requires AVX2 on x86 (or a AMD Bulldozer family cpu). Don't do anything for lod bias and explicit derivatives yet, though no special magic should be needed for them neither. Likewise, the size query is still broken just the same. v2: Use information if lod is a (broadcast) scalar or not. The idea would be to base this on the actual value, for now just pretend it's a scalar in fs and not a scalar otherwise (so, per-pixel lod is only used in gs/vs but same code is generated for fs as before). Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-07-04 19:42:04 +02:00
Zack Rusin	bbd1e60198	draw: fix overflows in the indexed rendering paths The semantics for overflow detection are a bit tricky with indexed rendering. If the base index in the elements array overflows, then the index of the first element should be used, if the index with bias overflows then it should be treated like a normal overflow. Also overflows need to be checked for in all paths that either the bias, or the starting index location. Signed-off-by: Zack Rusin <zackr@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2013-07-03 09:06:30 -04:00
Zack Rusin	09820902d7	draw/llvm: index overflows if it's greater than elt max The comparison, incorrectly, was greater-than-or-equal to elt max. Signed-off-by: Zack Rusin <zackr@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2013-07-03 09:06:24 -04:00
Kenneth Graunke	764afc48cf	i965: Move the rest of intel_tex_layout.c into brw_tex_layout.c. The texture alignment unit functions are called from brw_tex_layout.c, so it makes sense to put them there. Since the only caller of intel_get_texture_alignment_unit() is in brw_tex_layout.c, it could be made into a static function. However, this patch instead simply folds it into the caller, as it's only two lines anyway. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2013-07-03 10:48:15 -07:00
Kenneth Graunke	466aa712b6	i965: Push intel_get_texture_alignment_unit call into brw_miptree_layout intel_miptree_create_layout() calls intel_get_texture_alignment_unit() and then immediately calls brw_miptree_layout(). There are no other callers. intel_get_texture_alignment_unit() populates the miptree's alignment unit fields, which are used by brw_miptree_layout() to determine where to place each miplevel. Since brw_miptree_layout() needs those to be present, it makes sense to have it initialize them as the first step. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2013-07-03 10:48:15 -07:00
Kenneth Graunke	c4c3c0dc94	i965: Declare for-loop counters in the loop in brw_tex_layout.c. The driver is compiled in C99 mode, so this is not a problem. It's slighlty tidier. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2013-07-03 10:48:15 -07:00
Kenneth Graunke	ccf312fd12	i965: Remove use of GLuint/GLint in brw_tex_layout.c. Using GL types is silly; this isn't even remotely API-facing. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2013-07-03 10:48:15 -07:00
Kenneth Graunke	ed95e396f3	i965: Tidy the brw_tex_layout.c copyright and file header comments. This uses Doxygen style for the file comments, and generally makes it more consistent with the rest of the driver. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2013-07-03 10:48:15 -07:00
Kenneth Graunke	2ea87fde31	i965: Move i945_texture_layout_2d to brw_tex_layout.c This consolidates the miptree layout logic in a single file. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2013-07-03 10:48:15 -07:00
Kenneth Graunke	1920209970	i965: Remove fallthrough for Gen4 cube map layout. Now that both 2DArray and Cube layouts are taken care of by helper functions, it's easy to just call the right function for each generation. This is a little cleaner than falling through. This also reworks the comments. Referencing "Volume 1" of the BSpec isn't very helpful, since that's only available inside Intel, and it doesn't even use volume numbers. Also, "Ironlake...finally" sounds a bit strange considering that almost all hardware uses the 2D array approach. At this point, Gen4 is the only special case. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2013-07-03 10:48:14 -07:00
Kenneth Graunke	7e4007a1b3	i965: Combine GL_TEXTURE_CUBE_MAP_ARRAY case with the other array cases. These do the exact same thing; combining them is tidier. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2013-07-03 10:48:14 -07:00
Kenneth Graunke	bc51f15b32	i965: Pull 3D texture layout code out into a helper function. A bit cleaner than having it in one giant function. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2013-07-03 10:48:14 -07:00
Kenneth Graunke	abc2bdffd6	i965: Replace maxBatchSize variable with BATCH_SZ define. maxBatchSize was only ever initialized to BATCH_SZ, and a few places used BATCH_SZ directly anyway. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2013-07-03 10:48:14 -07:00
Kenneth Graunke	2c602d2adf	i965: Move annotate_aub out of the vtable. brw_annotate_aub() is the only implementation of this function, so it makes sense to just call it directly. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2013-07-03 10:48:14 -07:00
Kenneth Graunke	f05f8793c8	i965: Move debug_batch hook out of the vtable. brw_debug_batch() is the only implementation of this function, so it makes sense to just call it directly. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2013-07-03 10:48:14 -07:00
Kenneth Graunke	749160aab3	i965: Remove render_target_supported from the vtable. brw_render_target_supported() is the only implementation of this function, so it makes sense to just call it directly. Rather than adding an #include of brw_wm.h, this patch moves the prototype to brw_context.h. Prototypes seem to be in rather arbitrary places at the moment, and either place seems as good as the other. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2013-07-03 10:48:14 -07:00
Kenneth Graunke	7c5279e554	i965: Move is_hiz_depth_format out of the vtable. brw_is_hiz_depth_format() is the only implementation of this function, so it makes sense to just call it directly. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2013-07-03 10:48:14 -07:00
Kenneth Graunke	607338f1cb	i965: Remove the invalidate_state() vtable hook. The hook was a noop. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2013-07-03 10:48:14 -07:00
Kenneth Graunke	251cdcf059	i965: Replace fprintfs with assertions in GLenum comparison translators. These functions translate GLenum comparison operations into the hardware enumerations. They should never be passed something other than a GL comparison operator, or something is very broken. Assertions seem more appropriate than fprintf. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2013-07-03 10:48:14 -07:00
Kenneth Graunke	7ee616f1bf	i965: Replace intel_state.c enums with those from brw_defines.h. Both intel_context.h and brw_defines.h have #defines for comparison functions, stencil ops, blending logic ops, and blending factors. They're exactly the same values, so it makes sense to pick one. brw_defines.h is the logical place for this kind of stuff, so this patch converts intel_state.c to use the set defined there. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2013-07-03 10:48:13 -07:00
Kenneth Graunke	c9db037dc9	i965: Delete pre-DRI2.3 viewport hacks. The __DRI_USE_INVALIDATE extension was added in May 11th, 2010 by commit `4258e3a2e1`. At this point, it's unlikely that anyone's using the right mix of new and old components to hit this path. Deleting it removes an untested code path and cleans up the driver a bit. Cc: Kristian Høgsberg <krh@bitplanet.net> Cc: Keith Packard <keithp@keithp.com>	2013-07-03 10:48:13 -07:00
Kenneth Graunke	cbb37b7586	i965: Remove "There are probably better ways" comment. There are always better ways to do things. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2013-07-03 10:48:13 -07:00
Kenneth Graunke	7115bee993	i965: Delete brw_print_reg() function. This wasn't called from anywhere; presumably it was used to examine brw_regs when debugging shader assembly. However, it prints registers in a different notation than brw_disasm.c which everyone is used to...which means I doubt anyone will want to use it. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2013-07-03 10:48:13 -07:00
Kenneth Graunke	bc8b62e3a0	i965: Move contents of intel_clear.h to intel_context.h. Having a header file for a single prototype seems rather excessive. Plus, the actual function is in brw_clear.c, not intel_clear.c, so there isn't even the .c/.h filename symmetry one might expect. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2013-07-03 10:48:13 -07:00
Kenneth Graunke	7d8e70f301	i965: Move contents of intel_extensions.h to intel_context.h. Having an entire header file for a single prototype seems a bit excessive. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2013-07-03 10:48:13 -07:00
Kenneth Graunke	7d119880e8	i965: Remove some dead code. A random smattering of things that just aren't used anymore. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2013-07-03 10:48:13 -07:00
Kenneth Graunke	d245e795cf	i965: Delete dead intel_buffer_object::range_map_size field. Nothing uses this, apparently. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2013-07-03 10:48:13 -07:00
Kenneth Graunke	1f6ebdd43f	i965: Remove intel_buffer_object::source. This was only used for BOs backed by system memory on i915. With that gone, there's nothing that even sets source to non-zero, so this is purely dead code. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2013-07-03 10:48:13 -07:00
Kenneth Graunke	6e5b80ee5a	i965: Fix buffer object segfault since removal of system memory BOs. Commit `cf31a19300` removed support for BOs backed by system memory, as it was only useful for i915. However, it removed a little too much code: intel_bufferobj_buffer() used to call intel_bufferobj_alloc_buffer(), and after that commit, it didn't. This led to NULL pointer dereferences in several test cases, such as es3conform's transform_feedback_state_variables test. This commit restores the allocation, preserving the original behavior. It may not be the cleanest approach, but tidying should come later. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=66432 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2013-07-03 10:48:12 -07:00
Matthew McClure	012ba47076	postprocess: move second temporary assertion into isolated configuration With this patch we will only assert that the second temporary is allocated, when there are more than two active filters. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=66423 Signed-off-by: Brian Paul <brianp@vmware.com>	2013-07-03 09:19:04 -06:00
José Fonseca	9b6788eb15	glsl: Ensure snprintf is defined on MSVC builds. Should fix: src\glsl\opt_dead_builtin_varyings.cpp(244) : error C3861: 'snprintf': identifier not found ...	2013-07-03 08:26:08 +01:00
Ilia Mirkin	4bc8e3c3e4	targets/xvmc-nouveau: add in missing nv30 lib Currently libXvMCnouveau.so is missing nv30_screen_create. Add it in so that it may be dlopen'd. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2013-07-03 09:02:40 +02:00
Marek Olšák	30c3e8718d	mesa,glsl,gallium: remove GLSLSkipStrictMaxVaryingLimitCheck and dependencies Not needed with do_dead_builtin_varyings. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2013-07-02 17:02:14 +02:00
Marek Olšák	74edd56927	st/mesa: disable EXT_separate_shader_objects The extension disallows elimination of set-but-unused varyings. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2013-07-02 17:02:14 +02:00
Marek Olšák	b3d8b4c0b4	glsl/linker: eliminate unused and set-but-unused built-in varyings This eliminates built-in varyings such as gl_Color, gl_SecondaryColor, gl_TexCoord, and gl_FogFragCoord if they are unused by the next stage or not written at all (e.g. gl_TexCoord elements). The gl_TexCoord array is broken down into separate vec4s if needed. v2: - use a switch statement in varying_info_visitor::visit(ir_variable*) - use snprintf - disable the optimization for GLES2 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2013-07-02 17:02:14 +02:00
Marek Olšák	3c555827c3	glsl/linker: check against varying limit after unused varyings are eliminated We counted even the varyings which were later eliminated, which was suboptimal. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2013-07-02 17:02:14 +02:00
Marek Olšák	284d954912	glsl/linker: link shaders in the opposite order (from fragment to vertex) This ensures that inter-shader outputs and inputs are properly eliminated across 3 or more shader stages. The behavior is unchanged with 2 or less shader stages. For example, elimination of unused FS inputs causes elimination of matching GS outputs, which causes elimination of the GS inputs that were needed for evaluation of the eliminated GS outputs, which causes elimination of matching VS outputs. An unused FS input is all that's needed to trigger this chain reaction. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2013-07-02 17:02:14 +02:00
Marek Olšák	030ca230e2	mesa: renumber shader indices according to their placement in pipeline See my explanation in mtypes.h. v2: don't do this in gallium v3: also updated the comment at the gl_shader_type definition Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2013-07-02 17:02:14 +02:00
José Fonseca	84f367e69a	gallivm: Simplify intrinsic name construction. Just noticed this could be slightly shortened when fixing MSVC build. Trivial.	2013-07-02 13:12:31 +01:00
Kenneth Graunke	15ca0ca1b6	glsl/builtins: Fix ARB_texture_cube_map_array built-in availability. This patch adds texture() for isamplerCubeArray and usamplerCubeArray, which were entirely missing. It also makes texture() with a LOD bias fragment shader specific. The main GLSL specification explicitly says that texturing with LOD bias should not be allowed for vertex shaders. Affects Piglit's ARB_texture_cube_map_array/compiler/tex_bias-01.vert. which tries to use bias in a vertex shader. Currently, it expects this to pass (so this patch regresses the test), but I've sent a patch to reverse the expected behavior (so this patch would fix the updated test): http://lists.freedesktop.org/archives/piglit/2013-June/006123.html NOTE: This is a candidate for stable branches. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Paul Berry <stereotype441@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2013-07-02 01:01:30 -07:00
José Fonseca	4c859901ce	gallivm: Fix MSVC build.	2013-07-02 06:41:32 +01:00
José Fonseca	e621ec816d	gallivm: Fix indirect immediate registers. If reg->Register.Indirect is true then the immediate is not truly a constant LLVM expression. There is no performance regression in using LLVMBuildBitCast, as it will fallback to LLVMConstBitCast internally when the argument is a constant. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Zack Rusin <zackr@vmware.com>	2013-07-02 06:30:06 +01:00
Zack Rusin	70bc43acdb	gallium/tests: fix the translate test	2013-06-28 09:43:17 -04:00
Anuj Phogat	722721d718	i965: Enable ext_framebuffer_multisample_blit_scaled on intel h/w This patch enables ext_framebuffer_multisample_blit_scaled extension on intel h/w >= gen6. Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Acked-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Paul Berry <stereotype441@gmail.com>	2013-07-01 15:21:25 -07:00
Anuj Phogat	6fc3da2da0	i965/blorp: Add bilinear filtering of samples for multisample scaled blits Current implementation of ext_framebuffer_multisample_blit_scaled in i965/blorp uses nearest filtering for multisample scaled blits. Using nearest filtering produces blocky artifacts and negates the benefits of MSAA. That is the reason why extension was not enabled on i965. This patch implements the bilinear filtering of samples in blorp engine. Images generated with this patch are free from blocky artifacts and show big improvement in visual quality. Observed no piglit and gles3 regressions. V3: - Algorithm used for filtering assumes a rectangular grid of samples roughly corresponding to sample locations. - Test the boundary conditions on the edges of texture. V4: - Clip texcoords and use conditional MOVs. - Send texture dimensions as push constants. - Remove the optimization in case of scaled multisample blits. V5: - Move mcs_fetch() inside the 'for' loop after computing pixel coordinates. Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Acked-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Paul Berry <stereotype441@gmail.com>	2013-07-01 15:21:25 -07:00
Ian Romanick	27f2df2507	docs: Import 9.1.4 release notes, add news item. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2013-07-01 14:48:58 -07:00
Zack Rusin	1c2e5c223d	draw/translate: fix instancing We were incorrectly computing the buffer offset when using the instances. The buffer offset is always equal to: start_instance * stride + (instance_num / instance_divisor) * stride We were completely ignoring the start instance quite often producing instances that completely wrong, e.g. if start instance = 5, instance divisor = 2, then on the first iteration it should be: 5 * stride, not (5/2) * stride as we'd have currently, and if start instance = 1, instance divisor = 3, then on the first iteration it should be: 1 * stride, not 0 as we'd have. This fixes it and adjusts all the code to the changes. Signed-off-by: Zack Rusin <zackr@vmware.com>	2013-06-28 05:21:20 -04:00
Zack Rusin	df4ab7974a	draw: fix incorrect clipper invocation statistics clipper invocations are computed earlier (of course before the emittion) so this code was adding bogus numbers to already computed clipper invocations. Signed-off-by: Zack Rusin <zackr@vmware.com>	2013-06-28 04:24:29 -04:00
Zack Rusin	34546d61c1	draw/gallivm: export overflow arithmetic to its own file We'll be reusing this code so lets put it in a common file and use it in the draw module. Signed-off-by: Zack Rusin <zackr@vmware.com>	2013-06-28 04:24:24 -04:00

1 2 3 4 5 ...

57421 commits