fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-22 07:00:12 +01:00

Author	SHA1	Message	Date
Ilia Mirkin	6152ba0894	nv50: make sure to clear _all_ layers of all attachments Unfortunately there's only one RT_ARRAY_MODE setting for all attachments, so clears were previously truncated to the minimum number of layers any attachment had. Instead set the RT_ARRAY_MODE to 512 (the max number of layers) before doing the clear. This fixes gl-3.2-layered-rendering-clear-color-mismatched-layer-count. Also fix clears of individual layered rt/zeta, in case it ever happens. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Christoph Bumiller <e0425955@student.tuwien.ac.at> Cc: 10.1 <mesa-stable@lists.freedesktop.org>	2014-02-22 18:42:31 -05:00
Chia-I Wu	d5cbd73d21	ilo: fix and enable fast depth clear Use tex->bo_format instead of zs->format in ilo_blitter_rectlist_clear_zs() because the latter may be combined depth/stencil format. hiz_can_clear_zs() is no-op for GEN7+, but move the GEN check so that the assertions are tested. Finally, call the fast depth clear function from ilo_clear().	2014-02-22 22:45:13 +08:00
Chia-I Wu	f57bddc7e4	ilo: add slice clear value It is needed for 3DSTATE_CLEAR_PARAMS, and can also be used to track what value the slice has been cleared to.	2014-02-22 22:45:13 +08:00
Chia-I Wu	4afb8a7fb5	ilo: better readability and doc for texture flags Improve comments for the flags, and explicitly separate their uses in slice flags and resolve flags.	2014-02-22 22:45:13 +08:00
Chia-I Wu	cb8a0d2be1	ilo: fix for stencil only rectlist ops 3DSTATE_STENCIL_BUFFER inherits some states from 3DSTATE_DEPTH_BUFFER. We need to emit both even the surface is stencil only.	2014-02-22 22:45:13 +08:00
Chia-I Wu	409add30b3	ilo: fix a false assertion failure on GEN6 Layer offsetting is possible when it is level 0, layer 0.	2014-02-22 22:45:12 +08:00
Chia-I Wu	e7307fe708	ilo: pipe_texture::usage is not a bitfield It happens to work because PIPE_USAGE_STAGING is 0x100.	2014-02-22 22:45:12 +08:00
Chia-I Wu	f8d19a58dc	ilo: set ILO_TEXTURE_CPU_WRITE for imported textures Assume the bo has been written by another process, which will trigger a HiZ resolve.	2014-02-22 22:45:12 +08:00
Christoph Bumiller	1f4bfb8797	nv50/ir/ra: fix SpillCodeInserter::offsetSlot usage We were turning non-memory spill slots into NULL. Cc: 10.1 <mesa-stable@lists.freedesktop.org>	2014-02-22 13:17:23 +01:00
Rob Clark	9186cd39d4	freedreno: tweak ringbuffer sizes/count Since we are now consuming two ringbuffers at a time, we probably want a pool larger than 4.. but we don't need each individual ringbuffer to be so large, so offset the pool size increase by reducing rb size. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-02-19 12:02:57 -05:00
Rob Clark	5993723471	freedreno/a3xx/compiler: scheduling/legalize fixes It seems the write-after-read hazard that applies to texture fetch instructions, also applies to sfu instructions. Also, cat5/cat6 instructions do not have a (ss) bit, so in these cases we need to insert a dummy nop instruction with (ss) bit set. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-02-19 12:01:26 -05:00
Michel Dänzer	cf0172d46a	r600g,radeonsi: Consolidate logic for short-circuiting flushes Fixes radeonsi emitting command streams to the kernel even when there have been no draw calls before a flush, potentially powering up the GPU needlessly. Incidentally, this also cuts the runtime of piglit gpu.py in about half on my Kaveri system, probably because an X11 client going away no longer always results in a command stream being submitted to the kernel via glamor. Bugzilla: https://bugzilla.kernel.org/show_bug.cgi?id=65761 Cc: "10.1" mesa-stable@lists.freedesktop.org Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2014-02-18 10:46:23 +09:00
Rob Clark	d73b2c0517	freedreno/a3xx/compiler: use (ss) for WAR hazards Seems texture sample instructions don't immediately consume there src(s). In fact, some shaders from blob compiler seem to indiciate that it does not even count the texture sample instructions when calculating number of delay slots to fill for non-sample instructions. (Although so far it seems inconclusive as to whether this is required.) In particular, when a src register of a previous texture sample instruction is clobbered, the (ss) bit is needed to synchronize with the tex pipeline to ensure it has picked up the previous values before they are overwritten. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-02-16 08:17:23 -05:00
Rob Clark	e8cca57a3f	freedreno/a3xx/compiler: fix RA typo Was supposed to be a '+', otherwise we end up with a negative offset and choosing registers below the assigned range. This seems to fix the scheduling mystery "solved" by adding in extra delay slots. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-02-16 08:17:23 -05:00
Rob Clark	579473f8f8	freedreno/a3xx/compiler: handle kill properly (new compiler) Since 'kill' does not produce a result, the new compiler was happily optimizing them out. We need to instead track 'kill's similar to outputs. But since there is no non-predicated kill instruction, (and for flattend if/else we do want them to be predicated), we need to track the topmost branch condition on the stack and use that as src arg to the kill. For a kill at the topmost level, we have to generate an immediate 1.0 to feed into the cmps.f for setting the predicate register. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-02-16 08:17:23 -05:00
Rob Clark	e35747b882	freedreno/a3xx/compiler: trans_cmp() sanity Thanks to figuring out 32bit float render target, and adding regdump test in fdre-a3xx, I can more easily play around with instructions to figure out range of inputs/outputs/etc. And from this I can conclude that cmps.f works more like expected and I can do something much more simple in trans_cmp() (compared to before which was more closely emulating the instruction sequence of the blob compiler). And using sel.b32 (binary 0/1) often makes more sense than sel.f32 (+/- float) or sel.u32 (+/- uint) as it can use the output directly from cmps.f without needing the 'add.s tmp0, tmp0, -1'. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-02-16 08:17:23 -05:00
Rob Clark	89dc282581	freedreno: fix problems if no color buf bound Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-02-16 08:17:23 -05:00
Thomas Hellstrom	141e39a893	svga/winsys: Propagate surface shared information to the winsys The linux winsys needs to know whether a surface is shared. For guest-backed surfaces we need this information to avoid allocating a mob out of the mob cache for shared surfaces, but instead allocate a shared mob, that is never put in the mob cache, from the kernel. Also previously, all surfaces were given the "shareable" attribute when allocated from the kernel. This is too permissive for client-local surfaces. Now that we have the needed info, only set the "shareable" attribute if the client indicates that it needs to share the surface. Signed-off-by: Thomas Hellstrom <thellstrom@vmware.com> Reviewed-by: Jakob Bornecrantz <jakob@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com> Cc: "10.1" <mesa-stable@lists.freedesktop.org>	2014-02-14 08:21:44 -07:00
Brian Paul	3d1fd6df53	svga: update texture code for GBS Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com> Cc: "10.1" <mesa-stable@lists.freedesktop.org>	2014-02-14 08:21:44 -07:00
Brian Paul	72b0e959fc	svga: update buffer code for GBS Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com> Cc: "10.1" <mesa-stable@lists.freedesktop.org>	2014-02-14 08:21:44 -07:00
Brian Paul	e0a6fb09bd	svga: add new helper functions for GBS buffers Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com> Cc: "10.1" <mesa-stable@lists.freedesktop.org>	2014-02-14 08:21:44 -07:00
Brian Paul	6476bcbc50	svga: remove a couple unneeded assertions Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com> Cc: "10.1" <mesa-stable@lists.freedesktop.org>	2014-02-14 08:21:44 -07:00
Brian Paul	f8bbd8261d	svga: adjust adjustment for point coordinates Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com> Cc: "10.1" <mesa-stable@lists.freedesktop.org>	2014-02-14 08:21:44 -07:00
Brian Paul	d0c22a6d53	svga: track which textures are rendered to Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com> Cc: "10.1" <mesa-stable@lists.freedesktop.org>	2014-02-14 08:21:44 -07:00
Brian Paul	c1e60a61e8	svga: add helpers for tracking rendering to textures Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com> Cc: "10.1" <mesa-stable@lists.freedesktop.org>	2014-02-14 08:21:44 -07:00
Brian Paul	f84c830b14	svga: update shader code for GBS Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com> Cc: "10.1" <mesa-stable@lists.freedesktop.org>	2014-02-14 08:21:44 -07:00
Brian Paul	2f1fc8db10	svga: update constant buffer code for GBS Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com> Cc: "10.1" <mesa-stable@lists.freedesktop.org>	2014-02-14 08:21:44 -07:00
Brian Paul	31dfefc47f	svga: add svga_have_gb_objects/dma() functions Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com> Cc: "10.1" <mesa-stable@lists.freedesktop.org>	2014-02-14 08:21:44 -07:00
Brian Paul	823fbfdca7	svga: add new GBS commands And update some existing commands. Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com> Cc: "10.1" <mesa-stable@lists.freedesktop.org>	2014-02-14 08:21:44 -07:00
Brian Paul	d993ada50c	svga: update svga_winsys interface for GBS This adds new interface functions for guest-backed surfaces and adds a mobid parameter to the surface_relocation() function. Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com> Cc: "10.1" <mesa-stable@lists.freedesktop.org>	2014-02-14 08:21:44 -07:00
Brian Paul	024711385e	svga: update dumping code with new GBS commands, etc Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com> Cc: "10.1" <mesa-stable@lists.freedesktop.org>	2014-02-14 08:21:44 -07:00
Brian Paul	2e0c90847f	svga: split / update svga3d header files The old svga3d_reg.h file is split into separate header files and we add new items for guest-backed surfaces. Plus some minor code fixes because of renamed symbols. Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com> Cc: "10.1" <mesa-stable@lists.freedesktop.org>	2014-02-14 08:21:43 -07:00
Alex Deucher	01e6371149	radeon: reverse DBG_NO_HYPERZ logic Change the flag to DBG_HYPERZ and reverse the logic so setting the flag enabled the feature. This disables hyperz on r600g and radeonsi by default. It can be enabled by setting the env var. There are just too many issues with certain apps so leave it disabled for now until we sort out the issues with the problematic apps. Bugs: https://bugs.freedesktop.org/show_bug.cgi?id=58660 https://bugs.freedesktop.org/show_bug.cgi?id=64471 https://bugs.freedesktop.org/show_bug.cgi?id=66352 https://bugs.freedesktop.org/show_bug.cgi?id=68799 https://bugs.freedesktop.org/show_bug.cgi?id=72685 https://bugs.freedesktop.org/show_bug.cgi?id=73088 https://bugs.freedesktop.org/show_bug.cgi?id=74428 https://bugs.freedesktop.org/show_bug.cgi?id=74803 https://bugs.freedesktop.org/show_bug.cgi?id=74863 https://bugs.freedesktop.org/show_bug.cgi?id=74892 https://bugzilla.kernel.org/show_bug.cgi?id=70411 Signed-off-by: Alex Deucher <alexander.deucher@amd.com> Cc: "10.1" "10.0" <mesa-stable@lists.freedesktop.org> Acked-by: Marek Olšák <marek.olsak@amd.com>	2014-02-13 20:55:54 -05:00
Christian König	9ff0cf903d	radeon/vce: initial VCE support v8 v2 (chk): revert feedback buffer hack v3 (slava): fixed bitstream size calculation v4 (chk): always create buffers in the right domain v5 (chk): flush async v6 (chk): rework fw interface add version check v7 (leo): implement cropping support v8 (chk): add hw checks Signed-off-by: Christian König <christian.koenig@amd.com> Signed-off-by: Leo Liu <leo.liu@amd.com> Signed-off-by: Slava Grigorev <slava.grigorev@amd.com>	2014-02-13 11:11:24 +01:00
Ilia Mirkin	ef9a6ded10	nv50: mark scissors/viewports dirty on context switch Commit `246ca4b001` ("nv50: implement multiple viewports/scissors, enable ARB_viewport_array") added dirty tracking to scissors/viewports. However it neglected to mark them all as dirty on a context switch. This fixes an apparent regression in webgl in chrome, but probably in any application that switches contexts. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2014-02-13 10:08:29 +01:00
Christian König	1ef7b9de06	gallium/vl: remove remaining softpipe video functions Unused and unmaintained for quite a while. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Maarten Lankhorst <maarten.lankhorst@canonical.com>	2014-02-13 09:46:54 +01:00
Ilia Mirkin	246ca4b001	nv50: implement multiple viewports/scissors, enable ARB_viewport_array Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Christoph Bumiller <e0425955@student.tuwien.ac.at>	2014-02-12 21:47:36 -05:00
Brian Paul	23d4ff53d4	svga: replace out-of-temps assertion with debug warning Signed-off-by: Brian Paul <brianp@vmware.com>	2014-02-12 11:21:46 -07:00
Maarten Lankhorst	fee0686c21	nouveau: create only 1 shared screen between vdpau and opengl This fixes bug 73200 "vdpau-GL interop fails due to different screen objects" in the same way radeon does. Signed-off-by: Maarten Lankhorst <maarten.lankhorst@canonical.com> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com>	2014-02-12 14:57:25 +01:00
Ilia Mirkin	908a711313	nv30,nvc0: only claim a single viewport It should be possible to make this be 16 on nvc0. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com>	2014-02-11 22:08:01 +00:00
Christian König	ee978aee94	vl: add H264 encoding interface Signed-off-by: Christian König <christian.koenig@amd.com> Signed-off-by: Leo Liu <leo.liu@amd.com>	2014-02-11 13:26:13 +01:00
Dave Airlie	6d434252e2	r600g: add support for multiple viewports. tested on rv635 and barts. Signed-off-by: Dave Airlie <airlied@redhat.com>	2014-02-11 14:14:50 +10:00
Ilia Mirkin	40dd777b33	nouveau/video: make sure that firmware is present when checking caps Apparently some players are ill-prepared for us claiming that a decoder exists only to have creating it fail, and express this poor preparation with crashes (e.g. flash). Check that firmware is there to increase the chances of there being a high correlation between reported capabilities and ability to create a decoder. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: 10.0 10.1 <mesa-stable@lists.freedesktop.org> Tested-by: Emil Velikov <emil.l.velikov@gmail.com>	2014-02-10 14:00:17 +01:00
Grigori Goronzy	d34d5fddf8	gallium: add geometry shader output limits v2: adjust limits for radeonsi and llvmpipe v3: add documentation Cc: "10.1" <mesa-stable@lists.freedesktop.org> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2014-02-09 23:31:38 +01:00
Ilia Mirkin	356aff3a5c	nv30: report 8 maximum inputs nvfx_fragprog_assign_generic only allows for up to 10/8 texcoords for nv40/nv30. This fixes compilation of the varying-packing tests. Furthermore it appears that the last 2 inputs on nv4x don't seem to work in those tests, so just report 8 everywhere for now. Tested on NV42, NV44. NV4B appears to have additional problems. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: 9.1 9.2 10.0 10.1 <mesa-stable@lists.freedesktop.org>	2014-02-08 19:06:51 -05:00
Christoph Bumiller	2e9ee44797	nv50/ir/ra: some register spilling fixes Cc: 10.1 <mesa-stable@lists.freedesktop.org>	2014-02-09 00:04:13 +01:00
Christoph Bumiller	882e98e5e6	nvc0: handle TGSI_SEMANTIC_LAYER Cc: 10.1 <mesa-stable@lists.freedesktop.org>	2014-02-07 23:14:00 +01:00
Christoph Bumiller	dd2229d4c6	nvc0: create the SW object It's required for being able to use software methods now.	2014-02-07 22:53:37 +01:00
Christoph Bumiller	b7233acf78	nvc0/ir/emit: hardcode vertex output stream to 0 for now	2014-02-07 22:53:36 +01:00
Ilia Mirkin	0befbafb4b	nouveau/codegen: allow tex offsets on non-TXF instructions (e.g. TXL) Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Christoph Bumiller <e0425955@student.tuwien.ac.at>	2014-02-06 18:50:19 -05:00

1 2 3 4 5 ...

11465 commits