Commit graph

37913 commits

Author SHA1 Message Date
Luca Barbieri
68e74f1b01 translate_sse: remove useless generated function wrappers
Currently translate_sse puts two trivial wrappers in the translate vtable.

These slow it down and enlarge the source code for no gain, except perhaps
the ability to set a breakpoint there, so remove them.

Breakpoints can be set on the caller of the translate functions, with no
loss of functionality.
2010-08-16 16:57:05 +02:00
Luca Barbieri
1cb92fb92e translate_generic: factor out common code between linear and indexed
This moves the common code into a separate ALWAYS_INLINE function.
2010-08-16 16:57:05 +02:00
Luca Barbieri
ddcf028aa0 translate_generic: use memcpy if possible (v3)
Changes in v3:
- If we can do a copy, don't try to get an emit func, as that can assert(0)

Changes in v2:
- Add comment regarding copy_size

When used in GPU drivers, translate can be used to simultaneously
perform a gather operation, and convert away from unsupported formats.

In this use case, input and output formats will often be identical: clearly
it would make sense to use a memcpy in this case.

Instead, translate will insist to convert to and from 32-bit floating point
numbers.

This is not only extremely expensive, but it also loses precision for
32/64-bit integers and 64-bit floating point numbers.

This patch changes translate_generic to just use memcpy if the formats are
identical, non-blocked, and with an integral number of bytes per pixel (note
that all sensible vertex formats are like this).
2010-08-16 16:57:05 +02:00
Chia-I Wu
9271059b36 drwa: Fix polygon edge flags.
Fix a copy-and-paste error introduced by
f141abdc8f.
2010-08-16 22:01:57 +08:00
Chia-I Wu
aaf51ed7c2 draw: No need to make max_vertices even.
Triangle strip alternates the front/back orientation of its triangles.
max_vertices was made even so that varray never splitted a triangle
strip at the wrong positions.

It did not work with triangle strips with adjacencies.  And it is no
longer relevant with vsplit.
2010-08-16 21:04:24 +08:00
Chia-I Wu
c3fee80f2b draw: Remove DRAW_PIPE_MAX_VERTICES and DRAW_PIPE_FLAG_MASK.
The higher bits of draw elements are no longer used for the stipple or
edge flags.
2010-08-16 20:57:08 +08:00
Chia-I Wu
a072f0e186 drwa: Add PRIMITIVE macro to vsplit.
PRIMITIVE is used by the indexed path to flush the entire primitive with
custom vertex count checks.  It replaces the existing fast path.
2010-08-16 20:46:29 +08:00
Chia-I Wu
7b3beb2240 draw: last_vertex_last is always true for GS and SO.
That is, OpenGL decomposition rule is assumed.  There should be a
pipe_context state to specify the rules.
2010-08-16 20:46:29 +08:00
Chia-I Wu
a97419a3ba draw: Remove varray and vcache.
They have been deprecated by vsplit.
2010-08-16 20:46:29 +08:00
Chia-I Wu
5a085c623f draw: Replace vcache by vsplit.
vcache decomposes primitives while vsplit splits primitives.  Splitting
is generally easier to do and is faster.  More importantly, vcache
depends on flatshade_first to decompose.  The outputs may have incorrect
vertex order which is significant to GS.
2010-08-16 20:46:28 +08:00
Chia-I Wu
5b6bf799e6 draw: Replace varray by vsplit.
vsplit is a superset of varray.  It sets the split flags comparing to
varray.
2010-08-16 20:46:28 +08:00
Chia-I Wu
04bc530dbd draw: Add vsplit frontend.
vsplit is based on varray.  It sets the split flags when a primitive is
splitted.  It also has support for indexed primitives.

For indexed primitives, unlike vcache, vsplit splits the primitives
instead of decomposes them.
2010-08-16 20:46:28 +08:00
Chia-I Wu
56213a64fe draw: Add new util function draw_pt_trim_count.
draw_pt_trim_count is renamed from trim in draw_pt.c.
2010-08-16 20:46:28 +08:00
Chia-I Wu
9d2be38fad draw: Simplify frontend interface a little.
The run method is simplified to take the start vertex and the vertex
count.
2010-08-16 20:46:27 +08:00
Chia-I Wu
f132498347 draw: Add prim flags to middle ends.
Update the middle end interface to pass the primitive flags from the
frontends to the pipeline.  No frontend sets the flags yet.
2010-08-16 20:46:27 +08:00
Chia-I Wu
f141abdc8f draw: Add flags to draw_prim_info.
A primitive may be splitted in frontends.  The splitted primitives
should convey certain flag bits so that the decomposer can correctly
decide the stipple or edge flags.

This commit adds flags to draw_prim_info and updates the decomposer to
honor the flags.  Frontends and middle ends will be updated later.
2010-08-16 20:46:27 +08:00
nobled
9d4a0d7d4d st/mesa: test for FEATURE defines
'struct dd_function_table' only conditionally contains
the function pointer NewFramebuffer and friends based on
FEATURE_EXT_framebuffer_* defines. (See src/mesa/main/dd.h)

Fixes the build when the features are disabled and the vfuncs
don't exist.
2010-08-16 20:29:19 +08:00
nobled
70f9f5f7d4 dri/nouveau: test for FEATURE defines
'struct dd_function_table' only conditionally contains
the function pointer NewFramebuffer and friends based on
FEATURE_EXT_framebuffer_* defines. (See src/mesa/main/dd.h)

Fixes the build when the features are disabled and the vfuncs
don't exist.
2010-08-16 20:29:16 +08:00
nobled
234a065171 dri/radeon: test for FEATURE defines
'struct dd_function_table' only conditionally contains
the function pointer NewFramebuffer and friends based on
FEATURE_EXT_framebuffer_* defines. (See src/mesa/main/dd.h)

Fixes the build when the features are disabled and the vfuncs
don't exist.
2010-08-16 20:29:12 +08:00
Vinson Lee
3261c6c41e x86: Remove unnecessary header from sse.h. 2010-08-16 00:26:10 -07:00
Vinson Lee
2144f8e95e x86: Include missing headers in mmx.h.
Include compiler.h for _ASMAPI symbol.
Include mtypes.h for GLcontext symbol.
2010-08-15 23:29:09 -07:00
Vinson Lee
51f979c347 x86: Remove unnecessary header from 3dnow.h. 2010-08-15 23:10:42 -07:00
Vinson Lee
15c7ce867c glsl: Clean up header file inclusion in slang_typeinfo.h.
Remove imports.h, mtypes.h, and slang_vartable.h.
Include glheader.h for GL symbols.
2010-08-15 22:44:29 -07:00
Vinson Lee
9965ee5571 glsl: Include missing header in slang_codegen.h.
Include slang_vartable.h for slang_var_table symbol.
2010-08-15 22:39:51 -07:00
Vinson Lee
640139f80c glsl: Include missing header in slang_compile_operation.h.
Include compiler.h for INLINE symbol.
2010-08-15 22:38:23 -07:00
Vinson Lee
35a27f6fce glsl: Clean up header file inclusion in slang_storage.h.
Remove slang_compile.h.
Include glheader.h for GL symbols.
Include slang_compile_function.h for slang_function_scope symbol.
Include slang_compile_struct.h for slang_struct_scope symbol.
Include slang_compile_variable.h for slang_variable_scope symbol.
Include slang_typeinfo.h for slang_type_specifier symbol.
Include slang_utility.h for slang_atom_pool symbol.
2010-08-15 22:26:27 -07:00
Vinson Lee
97590ebeb0 glsl: Clean up header file inclusion in slang_mem.h.
slang_mem.h
Remove imports.h.
Include glheader.h for GL symbols.

slang_label.c
Include imports.h now that slang_mem.c does not include it.
2010-08-15 22:07:04 -07:00
Vinson Lee
5d2d5bf0c7 glsl: Clean up header file inclusion in slang_ir.h.
Remove imports.h and slang_compile.h.
Include glheader.h for GL symbols.
Include slang_compile_variable.h for slang_variable symbol.
2010-08-15 21:17:50 -07:00
Vinson Lee
44ad729aa3 glsl: Include missing header in slang_link.c.
Include slang_compile.h for _slang_compile function.
2010-08-15 21:13:32 -07:00
Vinson Lee
43ba1f63c7 glsl: Include missing headers in slang_builtin.c.
Include slang_typeinfo.h for slang_type_specifier symbol.
Include slang_compiler_struct.h for slang_struct_ symbol.
2010-08-15 21:10:16 -07:00
Marek Olšák
ecae1fca6f r300g: fix an invalid pointer in free 2010-08-16 05:05:43 +02:00
nobled
e897bf524b r300g: Let hyperz init fail
Signed-off-by: Marek Olšák <maraeo@gmail.com>
2010-08-16 05:00:09 +02:00
nobled
1e2cd02eae r300g: Fix leaks in failed context creation
This changes r300_destroy_context() so it can be called
on a partially-initialized context, and uses it when
r300_create_context() hits a fatal error.

This makes sure r300_create_context() doesn't leak memory
or neglect to call r300_update_num_contexts() when it fails.

Signed-off-by: Marek Olšák <maraeo@gmail.com>
2010-08-16 04:59:58 +02:00
nobled
b0e1565b5f r300g: Fix macro
This fixes a potential bug if (has_hyperz) is false
(it would still init the atom as if has_hyperz were true).

Signed-off-by: Marek Olšák <maraeo@gmail.com>
2010-08-16 04:59:13 +02:00
Marek Olšák
83baa8a6c5 st/mesa: remove output register reads inside shaders
This is a GLSL2 regression fix.
2010-08-16 02:19:18 +02:00
Marek Olšák
27eb2e2755 r300/compiler: implement DP2 opcode 2010-08-16 00:08:00 +02:00
Marek Olšák
b217167056 r300/compiler: implement SSG opcode 2010-08-16 00:08:00 +02:00
Marek Olšák
b97ab20f29 r300/compiler: fix allocation of temporaries in radeonTransformTEX 2010-08-16 00:07:54 +02:00
Keith Whitwell
c9b7bece05 llvmpipe: special case triangles which fall in a single 16x16 block
Check for these and route them to a dedicated handler with one fewer
levels of recursive rasterization.
2010-08-15 23:02:10 +01:00
Vinson Lee
a0b0afc694 mesa: Check that _XOPEN_SOURCE is defined before using it. 2010-08-15 13:08:15 -07:00
Keith Whitwell
85d9bc236d llvmpipe: consolidate several loops in lp_rast_triangle 2010-08-15 17:40:39 +01:00
Keith Whitwell
ff26594a92 llvmpipe: remove all traces of step arrays, pos_tables
No need to calculate these values any longer, nor to store them in the
bin data.  Improves isosurf a bit more, 115->123 fps.
2010-08-15 17:31:13 +01:00
Keith Whitwell
4c0641454b llvmpipe: eliminate last usage of step array in rast_tmp.h
For 16 and 64 pixel levels, calculate a mask which is linear in x and
y (ie not in the swizzle layout).

When iterating over full and partial masks, figure out position by
manipulating the bit number set in the mask, rather than relying on
postion arrays.

Similarly, calculate the lower-level c values from dcdx, dcdy and the
position rather than relying on the step array.
2010-08-15 17:24:54 +01:00
Keith Whitwell
ee0d1c29ee llvmpipe: don't refer to plane->step when dcdx or dcdy would do 2010-08-15 17:22:25 +01:00
Keith Whitwell
4b322e71bb llvmpipe: also use build_mask at 16, 64 pixel levels 2010-08-15 16:49:26 +01:00
Keith Whitwell
515194968d llvmpipe: version of block4 which doesn't need the full step array
No noticable slowdown with isosurf.
2010-08-15 16:32:45 +01:00
Keith Whitwell
510b035394 llvmpipe: reorganize block4 loop, nice speedup
isosurf 95->115 fps just by exchanging the two inner loops in this
function...
2010-08-15 16:25:06 +01:00
Corbin Simpson
2d53dc873e gallium/docs: Cleanup debugging. Spelling, grammar, organization. 2010-08-15 03:43:54 -07:00
Corbin Simpson
bf357aedff gallium/docs: Add formatting for envvar role; change debugging.
Per Jakob's request. Not super-pretty, but it's a good point for modding
later.
2010-08-15 03:43:54 -07:00
Vinson Lee
d375cb869e glsl: Fix self inclusion in slang_compile_function.h.
Fix self inclusion introduced by commit
4fef77c7c5.
2010-08-15 01:30:02 -07:00