Keith Whitwell
caf5d52d48
tgsi: simplify and fix sse KIL implementation
...
Use sse_movmskps to extract the correct bits of the comparison result
for use in updating the killmask. Simplify some logic around
identifying the set of necessary comparisons to make.
2009-07-16 12:49:27 +01:00
Keith Whitwell
e1ad38ad8e
rtasm: export sse_movmskps
2009-07-16 12:47:11 +01:00
Keith Whitwell
3858f47994
tgsi: initial texturing support on sse path
...
Most obvious problem is drawpixels comes out blocky, but this may be
an existing issue of KIL on the sse path.
2009-07-16 11:37:52 +01:00
Keith Whitwell
a01383ff6a
tgsi: make sse function callout mechanism more generic
...
Take a list of arguments rather than hardcoding TEMP_R0.
2009-07-16 10:11:46 +01:00
Keith Whitwell
ebc4a9bf2e
tgsi: reduce x86 reg usage in tgsi_sse generated programs
...
Pass the tgsi_exec_machine struct in directly and just hold a single
pointer to this struct, rather than keeping one for each of its
internal members.
2009-07-16 09:53:08 +01:00
Keith Whitwell
4e3002b50f
tgsi: no need to separately malloc input and output arrays
...
Can now guarantee alignment in the initial allocation of the tgsi exec machine.
2009-07-16 09:53:08 +01:00
Keith Whitwell
c202fe187c
gallium: reduce recursive include of tgsi_exec.h
...
A lot of draw code no longer needs to see this header.
2009-07-16 09:53:07 +01:00
Keith Whitwell
7fb702705a
tgsi: make function call code in tgsi_sse.c less opaque
...
Explictly pass src and dst arguments (previously dst argument was also
being used as a src). Separate argument handling from the rest of
the function call emit.
2009-07-16 09:53:07 +01:00
Keith Whitwell
6175653d0b
gallium: proper constructor and destructor for tgsi_exec_machine
...
Centralize the creation, initialization and destruction of this struct.
Use align_malloc instead of home-brew alternatives.
2009-07-16 09:53:07 +01:00
Dave Airlie
3a3b83e511
radeon: update span reading micro tile code
...
tested on r500 with zreaddraw with Z buffer in all 3 modes
2009-07-15 14:23:14 +10:00
Dave Airlie
bd4ed25222
r300: emit z depth pitch reloc in preparation for tiling
2009-07-15 14:23:14 +10:00
Dave Airlie
94d2a809f9
radeon: r300 fix span reading for macro tiled buffers.
...
this uses the correct formula for macro tiled buffers for readback
2009-07-15 14:23:13 +10:00
Dave Airlie
9b17f041d1
radeon: for tiling you really need to use GET/PUT VALUE not PTR.
...
since the surfaces aren't linear you can't just use GET_PTR
2009-07-15 14:23:13 +10:00
Xiang, Haihao
868aa16074
i965: the offset of any branch/jump instruction is in unit of 64bits on IGDNG
2009-07-15 10:48:13 +08:00
Dave Airlie
f6f0e117a4
intel/radeon: add common metaops code.
...
Move all the metaops to a dri_metaops file and port radeon/intel
to use the new common meta ops code.
2009-07-15 10:30:23 +10:00
Dave Airlie
681ede8836
radeon: fix unsigned vs signed comparison in stencil code.
...
This function takes a GLint not a GLuint, passing in -1
breaks internally.
2009-07-15 08:19:34 +10:00
Eric Anholt
99d07d0f91
intel: Fix ClearDepth to not be affected by DepthRange.
...
Fixes new piglit depthrange-clear.c test.
2009-07-14 15:14:52 -07:00
Eric Anholt
b677c40abb
intel: Set DepthRange in the metaops using RasterPos[2].
...
RasterPos[2] is already sent through the window transform, which includes
DepthRange handling. So make DepthRange for the metaops a noop.
Fixes a failure in oglconform depthrange.c
2009-07-14 15:14:52 -07:00
Eric Anholt
e5f63c403b
Fix state flag dependencies for fixed function fragment program updates.
...
I started looking into why _NEW_ARRAY punishes us, and while annotating
dependencies noticed that a bunch of dependencies were missing.
2009-07-14 15:14:52 -07:00
Eric Anholt
374110bc98
intel: Flag _NEW_ARRAY state when doing array object binding in clears.
...
This is just following bind_vertex_array()'s behavior.
2009-07-14 15:14:51 -07:00
Brian Paul
e786ecbfb5
mesa: regenerated enums.c file
2009-07-14 07:47:19 -06:00
Corbin Simpson
b727150b14
dri-st: Unbreak GL_EXT_blend_equation_separate.
...
Since it has a dispatch table entry (for BlendEquationSeparateEXT,)
can't omit it from this list. It'll still get disabled if the cap
(PIPE_CAP_BLEND_EQUATION_SEPARATE) isn't set.
Somebody that doesn't suck at GL (read: not me) should probably add
this into progs/samples/blendeq or similar so we can test it.
2009-07-14 01:16:25 -07:00
Patrice Mandin
54cf63278f
gallium: Define PIPE_CAP_BLEND_EQUATION_SEPARATE, remove extension from default extension list
2009-07-14 09:44:49 +02:00
Michel Dänzer
33f56b4612
radeon: Differentiate 16 bpp destination formats.
...
Fixes those formats in fbo_firecube.
Only tested with r300, radeon and r200 compile tested only.
2009-07-14 08:25:27 +02:00
Michel Dänzer
60e60bb302
radeon: Invert front face winding when rendering to FBO.
...
Fixes fgl_glxgears and progs/demos/fbotexture after pressing 'c'.
Tested with r300, radeon and r200 compile tested only.
2009-07-14 08:00:49 +02:00
Nicolai Hähnle
59155f70e7
r300g: Small compile warning fixes.
...
Signed-off-by: Corbin Simpson <MostAwesomeDude@gmail.com>
2009-07-13 16:26:29 -07:00
Nicolai Hähnle
b3f1d370a2
r300g, radeon: Whitespace fixes.
...
Signed-off-by: Corbin Simpson <MostAwesomeDude@gmail.com>
2009-07-13 16:26:29 -07:00
Corbin Simpson
ca28e591f2
r300g: Use align() instead of inline maths.
2009-07-13 16:26:29 -07:00
Maciej Cencora
9226e3d6a6
Merge branch 'shaders_cleanup'
2009-07-13 20:29:11 +02:00
Eric Anholt
0dc700850a
intel: Partially fix fallback detection for glCopyTexSubImage.
...
Really, we should be checking that the MesaTexFormat for the read buffer
and the texture match, but the previous code wasn't even doing that, so
matching the cpp should be an improvement (and potentially fix some hangs!).
The previous code also rejected GL_RGB even though blitting the alpha
channel to the ignored channel of an x8r8g8b8 texture should be fine, which
tripped up compiz's blur plugin.
2009-07-13 10:57:12 -07:00
Maciej Cencora
582bd34665
r300: always assume all components are read by fragment program
...
Components of input attributes that are used by fragment program aren't part of vertex program key, and that may lead to situations when vertex program writes only TEX1.xy and fragment program reads TEX1.xyz, resulting in rendering errors.
Reported-by: Nicolai Hähnle <nhaehnle@gmail.com>
2009-07-13 19:28:15 +02:00
Nicolai Hähnle
f06910f6c3
r300: Fix fogcoord rewriting
...
We only care about the actual fogcoord itself now, reducing the rewriting
done for the vertex program.
The rewriting of source operand swizzles in the fragment program takes
care that fogcoord.yzw = 001.
This should fix fogcoord rewriting entirely, which had been horribly
broken in the face of dot-product instructions, and just broken (though
not horribly so) in the face of almost every other instruction (the W
component would be incorrect for most arithmetic instructions).
Signed-off-by: Nicolai Hähnle <nhaehnle@gmail.com>
2009-07-13 19:28:15 +02:00
Maciej Cencora
3f5382819e
r300: fix swizzle masking in getUsedComponents
2009-07-13 19:28:15 +02:00
Maciej Cencora
acd3360041
r300: minor fix
...
Split initializations becase the vars are of different type.
Reported-by: Nicolai Hähnle <nhaehnle@gmail.com>
2009-07-13 19:28:15 +02:00
Maciej Cencora
1a5520fcd3
r300: move variables declarations
2009-07-13 19:28:14 +02:00
Maciej Cencora
b3716eeb61
r300: document r300_vertex_program_cont structure
...
Reported-by: Nicolai Hähnle <nhaehnle@gmail.com>
2009-07-13 19:28:14 +02:00
Maciej Cencora
a0204ce456
r300: document r300_fragment_program_cont struct
2009-07-13 19:28:14 +02:00
Maciej Cencora
ec854729d1
r300: fix indentation
...
Reported-by: Nicolai Hähnle <nhaehnle@gmail.com>
2009-07-13 19:28:13 +02:00
Maciej Cencora
48cc352a71
r300: fix StrideB == 0 case when converting data format
...
Reported-by: Nicolai Hähnle <nhaehnle@gmail.com>
2009-07-13 19:28:13 +02:00
Maciej Cencora
4eff323731
r300: hw can handle per component negations in vertex shaders
...
Reported-by: Nicolai Haehnle <nhaehnle@gmail.com>
2009-07-13 19:28:13 +02:00
Maciej Cencora
4efb9f053c
r300: fix WPOS for SWTCL
2009-07-13 19:28:13 +02:00
Maciej Cencora
bdc8a95fc9
r300: removed unnecessary params
...
We don't have check which attributes are used by fragment program - it's already done by NQSSADCE.
2009-07-13 19:28:12 +02:00
Maciej Cencora
65d9f23c7c
r300: use NQSSADCE for vertex programs
2009-07-13 19:28:12 +02:00
Maciej Cencora
70448b9f95
r300: operate on copy of a program when pairing instructions
...
We need to keep unpaired program for vertex program NQSSADCE.
2009-07-13 19:28:12 +02:00
Maciej Cencora
12a6d73c75
r300: handle relative addressing in NQSSADCE
2009-07-13 19:28:11 +02:00
Maciej Cencora
96b2eb18c5
r300: handle ARB_vertex_program specific instructions in NQSSADCE
2009-07-13 19:28:07 +02:00
Maciej Cencora
e43cc28c1b
r300: move depth output rewrite out of NQSSADCE
2009-07-13 19:25:59 +02:00
Maciej Cencora
f79ef95df4
r300: rewrite FOGC and HPOS attribs handling
...
Rewrite vertex and fragment programs so that we don't have to do any hacks on lower level.
2009-07-13 19:25:59 +02:00
Maciej Cencora
df5fe747fa
r300: bind vertex program to fragment program
2009-07-13 19:25:58 +02:00
Maciej Cencora
d1e4caa6e2
r300: recalculate used inputs and outputs after dead code removal
2009-07-13 19:25:58 +02:00