José Fonseca
00989d5bfc
llvmpipe: Relax the colormask constraint on opaque.
...
Also, include the color buffer in the key. Not having it there
causes a tight knots in the logic to determine when it is OK or not
to discard previous color buffer contents.
2010-09-05 10:17:51 +01:00
José Fonseca
f25b6e546e
util: Utility function to determine the channels that can be written in a color format.
2010-09-05 10:17:51 +01:00
José Fonseca
30c5d5d9d8
scons: Re-enable SSE on MinGW.
...
It seems to be working correctly with gcc 4.4, and enabling it allows to
test some of the llvmpipe instrinsics on Windows.
2010-09-05 10:17:51 +01:00
José Fonseca
6ed726b8fc
gallivm: Pass condition masks as an unsigned bitmask.
...
Much more convenient than boolean arrays.
2010-09-05 10:17:51 +01:00
José Fonseca
079763f746
gallivm: Cope with tgsi instruction reallocation failure.
2010-09-05 10:17:51 +01:00
Kenneth Graunke
9a3df46fbc
ir_reader: Only validate IR when a global 'debug' flag is set.
...
This extra validation is very useful when working on the built-ins, but
in general overkill - the results should stay the same unless the
built-ins or ir_validate have changed.
Also, validating all the built-in functions in every test case makes
piglit run unacceptably slow.
2010-09-05 01:57:37 -07:00
Marek Olšák
ece8490bb5
r300g,r300c: memset the compiler struct to zeros
...
This should fix bogus reports "Too many temporaries." and maybe some others.
2010-09-05 05:09:25 +02:00
Tom Stellard
7a212525a2
r300/compiler: Remove stray break statement
...
This fixes glsl-fs-loop-nested.
2010-09-04 17:27:55 -07:00
Luca Barbieri
c50b7e04eb
nvfx: support unlimited constants and immediates in fp
2010-09-05 02:01:02 +02:00
Luca Barbieri
9fc50968ad
nvfx: support using blitter to copy depth/stencil resources, fix Heaven
...
We might want to copy them as color ones though.
Also works around crash in Unigine Heaven due to failing to allocate
a 64 MB temporary in GART for a CPU copy.
Unigine Heaven now works on nv40, albeit with very heavy glitches (with
the floating branch with render_hdr 0).
2010-09-05 02:01:02 +02:00
Marek Olšák
6dc71d6aa5
r300/compiler: fix the instruction limit in vertex shaders
...
Broken with commit d774b0c710 .
Reported by Chris Rankin.
2010-09-05 00:44:52 +02:00
Luca Barbieri
20bf5037d6
nvfx: support rendering to more formats
2010-09-04 22:45:21 +02:00
Luca Barbieri
25ecc9521d
nvfx: move 2D format selection logic to 2D code
2010-09-04 22:45:21 +02:00
Luca Barbieri
5bd0e0adb1
nvfx: fix swizzling of high bpp surfaces
2010-09-04 22:45:21 +02:00
Luca Barbieri
2e6c65722a
nvfx: fix some subrectangle copies
...
Actually, we may want to get rid of the x/y coordinates for linear
surfaces, and realign the origin from scratch if necessary, instead
of doing this "on-demand realignment".
2010-09-04 22:45:21 +02:00
Luca Barbieri
90d3291595
nvfx: fix inlinining in nv04_2d.c
2010-09-04 22:45:21 +02:00
Luca Barbieri
11d29739e1
nvfx: fix the temporary copying logic and add asserts
2010-09-04 22:45:21 +02:00
Luca Barbieri
f7392eb23a
nvfx: prevent swizzled rendering into formats where it's not supported
2010-09-04 22:45:21 +02:00
Marek Olšák
280665be70
Revert "ir_to_mesa: Load all the STATE_VAR elements of a builtin uniform to a temp."
...
This reverts commit 5ad74779ce .
Sorry, but I had to revert this.
Any commit which needlessly increases the number of temporaries is wrong.
More temporaries mean less shader performance because of reduced parallelism
and therefore less efficient latency hiding. In this case, there is possible
performance degradation of every shader which uses GL state variables.
I cannot accept this.
2010-09-04 19:06:47 +02:00
Marek Olšák
cbf7e82786
Revert "r300g: refuse to create a texture with size 0"
...
This reverts commit 5cdedaaf29 .
https://bugs.freedesktop.org/show_bug.cgi?id=30002
Conflicts:
src/gallium/drivers/r300/r300_texture.c
2010-09-04 19:06:47 +02:00
Marek Olšák
6e64096190
r300g: remove unnecessary assignments
2010-09-04 19:06:47 +02:00
Marek Olšák
58471c801a
r300/compiler: indent printed instructions according to the branch depth
2010-09-04 18:56:22 +02:00
Marek Olšák
d3ca8a4edd
r300g: skip draw calls with no vertex elements, fixing hardlocks
2010-09-04 18:56:22 +02:00
Marek Olšák
d774b0c710
r300/compiler: use limits from the compiler input instead of inline constants
2010-09-04 18:56:22 +02:00
Marek Olšák
63943c8fcd
r300/compiler: improve register allocation with indexable temporaries for VS
...
Register allocation can now reallocate temporaries right after the last indexed
source operand, instead of being disabled for the whole shader.
2010-09-04 18:56:22 +02:00
Marek Olšák
33360a707e
r300/compiler: fix handling of indexed temporaries in peephole
2010-09-04 18:56:22 +02:00
Marek Olšák
aa554d508b
r300/compiler: disable deadcode elimination for indexed dst operands
2010-09-04 18:56:22 +02:00
Marek Olšák
f90c870304
r300/compiler: allocate at least FS inputs if register allocation is disabled
2010-09-04 18:56:22 +02:00
Marek Olšák
cfc461fca6
r300g: add a new debug option which disables compiler optimizations
...
Those are:
- dead-code elimination
- constant folding
- peephole (mainly copy propagation)
- register allocation
There are some bugs which I need to track down.
Also fix up the descriptions of all the debug options.
2010-09-04 18:56:22 +02:00
Marek Olšák
d2f4ceaa47
r300/compiler: compute the final number of temporaries during translation
...
And not during the register allocation, which may be skipped for debugging
purposes. Also the predicate register is now added to the number of temps.
2010-09-04 18:56:21 +02:00
Marek Olšák
f46fd04136
r300/compiler: make optimizations not use 0.5 swizzles in vertex shaders
2010-09-04 18:56:21 +02:00
Marek Olšák
63eafaa8ec
r300/compiler: use peephole and constant folding for vertex shaders too
2010-09-04 18:56:21 +02:00
Marek Olšák
f5609ff0d7
r300/compiler: remove unused enum OPCODE_REPL_ALPHA
...
We use RC_OPCODE_REPL_ALPHA instead.
2010-09-04 18:56:21 +02:00
Marek Olšák
fba5f6bda7
r300/compiler: refactor fragment shader compilation
...
This cleans up the mess in r3xx_compile_fragment_program.
2010-09-04 18:56:21 +02:00
Marek Olšák
3ba562e62a
r300/compiler: add new compiler parameter max_constants
2010-09-04 18:56:21 +02:00
Marek Olšák
0b9f836147
r300/compiler: refactor vertex shader compilation
...
First list compiler passes in an array, then run the new function rc_run_compiler.
Every backend may need a different set of passes.
This cleans up the mess in r3xx_compile_vertex_program.
2010-09-04 18:56:21 +02:00
Marek Olšák
9a9aa7daa8
r300/compiler: remove a redundant parameter in rc_pair_regalloc
2010-09-04 18:56:21 +02:00
Marek Olšák
f0ae99252a
r300/compiler: remove a redundant parameter in rc_dataflow_deadcode
...
&c->Base == c.
2010-09-04 18:56:21 +02:00
Marek Olšák
cc8f2d5599
r300/compiler: use null-terminated array of transformation functions
...
I need to reduce the number of parameters of each compiler pass function.
This is part of a larger cleanup.
2010-09-04 18:56:21 +02:00
Marek Olšák
56de72c78a
r300g: only check for an empty shader if there are no compile errors
2010-09-04 18:56:20 +02:00
Marek Olšák
313e95f0c7
r300/compiler: add new compiler parameter max_alu_insts
2010-09-04 18:56:20 +02:00
Marek Olšák
a0fb406d9f
r300/compiler: put emulate_loop_state in radeon_compiler
2010-09-04 18:56:20 +02:00
Kenneth Graunke
79088746a2
ir_reader: Run ir_validate on the generated IR.
...
It's just too easy to get something wrong in hand-written IR.
2010-09-04 02:19:38 -07:00
Kenneth Graunke
2809d70723
ir_reader: Emit global variables at the top of the instruction list.
...
Since functions are emitted when scanning for prototypes, functions
always come first, even if the original IR listed the variable
declarations first.
Fixes an ir_validate error (to be turned on in the next commit).
2010-09-04 02:19:38 -07:00
Kenneth Graunke
b758de16e3
ir_reader: Drop support for reading the old assignment format.
2010-09-04 02:19:38 -07:00
Kenneth Graunke
a878107d6c
glsl: Regenerate autogenerated file builtin_function.cpp.
2010-09-04 02:19:37 -07:00
Kenneth Graunke
bacbf941d2
glsl/builtins: Convert assignments to new format (with write mask).
2010-09-04 02:19:37 -07:00
Kenneth Graunke
03a6276477
ir_reader: Read the new assignment format (with write mask).
...
This preserves the ability to read the old format, for momentary
compatibility with all the existing IR implementations of built-ins.
2010-09-04 02:19:37 -07:00
Kenneth Graunke
a71b46a8ad
ir_reader: Track the current function and report it in error messages.
2010-09-04 02:19:37 -07:00
Kenneth Graunke
a7dc8081b6
glsl/builtins: Actually print the info log if reading a builtin failed.
2010-09-04 02:19:37 -07:00