fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-05 05:18:08 +02:00

Author	SHA1	Message	Date
Alexandre Courbot	f22406837f	nouveau: support for custom VRAM domains Some GPUs (e.g. GK20A, GM20B) do not embed VRAM of their own and use the system memory as a backend instead. For such systems, allocating objects in VRAM results in errors since the kernel will not allow VRAM objects allocations. This patch adds a vram_domain member to struct nouveau_screen that can optionally be initialized to an alternative domain to use for VRAM allocations. If left untouched, NOUVEAU_BO_VRAM will be used for systems that embed VRAM, and NOUVEAU_BO_GART will be used for VRAM-less systems. Code that uses GPU objects is then expected to use the NV_VRAM_DOMAIN() macro in place of NOUVEAU_BO_VRAM to ensure correct behavior on VRAM-less chips. Signed-off-by: Alexandre Courbot <acourbot@nvidia.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Martin Peres <martin.peres@free.fr>	2015-06-22 01:00:02 -04:00
Chia-I Wu	57bdcae9e0	ilo: add ilo_state_compute Replace gen6_idrt_data with ilo_state_compute, which has a bunch of validations and is now preferred.	2015-06-22 12:56:55 +08:00
Dave Airlie	2bf5a4211e	r600g: ignore sampler views for now. This fixes a regression in that r600 stopped working when sampler views were pushed. Signed-off-by: Dave Airlie <airlied@redhat.com>	2015-06-22 14:02:49 +10:00
Rob Clark	66a93a0ff9	freedreno/ir3: pass sz to split_dest() For query_levels, we generate a getinfo with writemask of (z), which RA will consider as size==3. But we were still generating four fanouts. Which meant that RA would see it as two different register classes, depending on the path to definer. Ie. on the getinfo instruction itself it would see size==3, but when chasing back through the fanouts it would see size==4. Easiest way to solve that is to just generate the chain of neighboring fanouts to have the correct size in the first place. Note: we may eventually want split_dest() to take start/end or wrmask instead, since really we only need size==1. But RA is not clever enough for that, query_levels is not that common, and the other two registers that get allocated are never used so those register slots can be immediately re-used. So bunch of work for probably no real gain. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-06-21 08:01:12 -04:00
Rob Clark	1ee4d51e7a	freedreno/ir3/nir: add more opcodes Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-06-21 08:01:06 -04:00
Rob Clark	43048c7093	freedreno/ir3: only unminify txf coords on a3xx Seems like a4xx gets this right. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-06-21 08:01:05 -04:00
Rob Clark	0f008082b1	freedreno: remove int sampler shader variants We get this information from NIR (which gets it from sview decl in tgsi when translating from tgsi), so no need to maintain shader variants for this. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-06-21 08:00:58 -04:00
Rob Clark	457f7c2a2a	freedreno/ir3: block reshuffling and loops! This shuffles things around to allow the shader to have multiple basic blocks. We drop the entire CFG structure from nir and just preserve the blocks. At scheduling we know whether to schedule conditional branches or unconditional jumps at the end of the block based on the # of block successors. (Dropping jumps to the following instruction, etc.) One slight complication is that variables (load_var/store_var, ie. arrays) are not in SSA form, so we have to figure out where to put the phi's ourself. For this, we use the predecessor set information from nir_block. (We could perhaps use NIR's dominance frontier information to help with this?) Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-06-21 07:54:38 -04:00
Rob Clark	660d5c1646	freedreno/ir3: a4xx encodes larger immed offset Without this, negative branch/jump offsets look like very large positive offsets. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-06-21 07:54:31 -04:00
Rob Clark	d646d3ae9d	freedreno/ir3: simplify find_neighbors stop condition Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-06-21 07:54:16 -04:00
Rob Clark	c8fb5f8a01	freedreno/ir3: move inputs/outputs to shader These belong in the shader, rather than the block. Mostly a lot of churn and nothing too interesting. But splitting this out from the rest of ir3_block reshuffling to cut down the noise in the later patch. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-06-21 07:54:04 -04:00
Rob Clark	d52fb2f5ad	freedreno/ir3/ra: use register_allocate Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-06-21 07:53:58 -04:00
Rob Clark	694beb8b83	freedreno/ir3: introduce ir3_compiler object Right now, just provides a cleaner way to get at the gpu-id, given the separation between compiler and context. But we will need this also to hold the reg-set for new register allocation. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-06-21 07:53:50 -04:00
Rob Clark	5c1e153467	freedreno/ir3: dump nocp option No longer used, or even possible, with NIR frontend. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-06-21 07:53:43 -04:00
Rob Clark	7674ab12e8	freedreno/ir3: silence warnings Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-06-21 07:53:35 -04:00
Rob Clark	0f6faa8ff3	freedreno/ir3: remove tgsi f/e Also remove ir3_flatten which was only used by tgsi f/e. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-06-21 07:53:25 -04:00
Rob Clark	7273cb4e93	freedreno/ir3/sched: convert to priority queue Use a more standard priority-queue based scheduling algo. It is simpler and will make things easier once we have multiple basic blocks and flow control. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-06-21 07:53:17 -04:00
Rob Clark	adf1659ff5	freedreno/ir3: use standard list implementation Use standard list_head double-linked list and related iterators, helpers, etc, rather than weird combo of instruction array and next pointers depending on stage. Now block has an instrs_list. In certain stages where we want to remove and re-add to the blocks list we just use list_replace() to copy the list to a new list_head. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-06-21 07:53:09 -04:00
Rob Clark	67d994c676	freedreno/ir3: drop dot graph dumping At least for now.. right now the instruction and instruction list printing should suffice, and the re-working of ir3_block would require a lot of changes in that code. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-06-21 07:52:58 -04:00
Rob Clark	5c8c2e2f97	freedreno/ir3: more builder helpers Use ir3_MOV() builder in a couple of spots, rather than open-coding the instruction construction. Also add ir3_NOP() builder and use that instead of open coding. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-06-21 07:52:41 -04:00
Rob Clark	b33015f889	gallium/ttn: add missing SNE Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-06-21 07:52:36 -04:00
Rob Clark	c79b2e626c	util/list: add list_first/last_entry I need an easier way to get at head/tail in ir3. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-06-21 07:52:36 -04:00
Rob Clark	b3d2e36716	gallium/ttn: add texture-type support v2: rebased on using SVIEW to hold type information Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2015-06-21 07:52:29 -04:00
Rob Clark	cb258c1dec	glsl_to_tgsi: add SVIEW decl support Freedreno needs sampler type information to deal with int/uint textures. To accomplish this, start creating sampler-view declarations, as suggested here: http://lists.freedesktop.org/archives/mesa-dev/2014-November/071583.html create a sampler-view with index matching the sampler, to encode the texture type (ie. SINT/UINT/FLOAT). Ie: `DCL SVIEW[n], 2D, UINT`; `DCL SAMP[n]`; `TEX OUT[1], IN[1], SAMP[n]`. For tgsi texture instructions which do not take an explicit SVIEW argument, the SVIEW index is implied by the SAMP index. Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2015-06-21 07:52:22 -04:00
Rob Clark	93379748f7	util/blitter (and friends): generate appropriate SVIEW decls Some hardware needs to know the sampler type. Update the blit related shaders to include SVIEW decl. Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2015-06-21 07:52:16 -04:00
Rob Clark	e536992986	util/pstipple: updates for SVIEW decls To allow for shaders which use SVIEW decls for TEX* instructions, we need to preserve the constraint that the shader either has no SVIEW's or it has one matching SVIEW for each SAMP. Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2015-06-21 07:52:12 -04:00
Rob Clark	b516e68afb	draw: updates to support SVIEW decls To allow for shaders which use SVIEW decls for TEX* instructions, we need to preserve the constraint that the shader either has no SVIEW's or it has one matching SVIEW for each SAMP. Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2015-06-21 07:52:07 -04:00
Rob Clark	f481af110e	tgsi/transform: add support for SVIEW decls TODO single return_type (use enum) v2: single return_type arg, and use enum Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2015-06-21 07:52:02 -04:00
Rob Clark	b13135e066	tgsi: update docs for SVIEW usage with TEX* instructions Based on mailing list discussion here: http://lists.freedesktop.org/archives/mesa-dev/2014-November/071583.html Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2015-06-21 07:51:53 -04:00
Eric Anholt	717376155d	mesa: Back out an accidental change I had in a VC4 commit. This was a hack as part of debugging some glamor-on-GLES2 behavior that ended up being an xserver bug. I suspect we can just flip this extension on for GLES2, but the spec says it requires 3.1.	2015-06-20 15:04:17 -07:00
Emil Velikov	104bff0376	docs: add news item and link release notes for mesa 10.5.8 Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com>	2015-06-20 16:42:21 +01:00
Emil Velikov	aa28423bcc	docs: Add sha256sums for the 10.5.8 release Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> (cherry picked from commit `a81b1d5512`)	2015-06-20 16:42:21 +01:00
Emil Velikov	97caf2054f	Add release notes for the 10.5.8 release Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> (cherry picked from commit `24b043aab7`)	2015-06-20 16:42:21 +01:00
Eric Anholt	c009038674	vc4: Use a defined t value for 1D textures. This doesn't fix the broken 1D cases of texsubimage, but it does prevent segfaulting when dumping the QIR code generated in fbo-1d.	2015-06-20 00:16:32 -07:00
Eric Anholt	bb107110a4	vc4: Fix write-only texsubimage when we had to align. We need to make sure that when we store the aligned box, we've got initialized contents in the border. We could potentially just load the border area, but for now let's get text rendering working in X (and fix the GL_TEXTURE_2D errors in piglit's texsubimage test and gl-2.1-pbo/test_tex_image)	2015-06-20 00:16:32 -07:00
Chia-I Wu	028590cbc7	ilo: clean up header includes Core is more self-contained now.	2015-06-20 11:20:12 +08:00
Chia-I Wu	244caba250	ilo: avoid ilo_ib_state in genX_3DPRIMITIVE() ilo_ib_state is not in core.	2015-06-20 11:18:30 +08:00
Chia-I Wu	dcb5bad3a3	ilo: move gen6_so_SURFACE_STATE() out of core It does not belong to core.	2015-06-20 11:18:10 +08:00
Chia-I Wu	e3372c4bfb	ilo: add ilo_state_sol_buffer It serves the same purpose as ilo_state_vertex_buffer does.	2015-06-20 11:18:09 +08:00
Chia-I Wu	9904e647cc	ilo: add ilo_state_index_buffer It serves the same purpose as ilo_state_vertex_buffer does.	2015-06-20 11:18:07 +08:00
Chia-I Wu	da4878cb80	ilo: add ilo_state_vertex_buffer Being a parameter-like state, we may want to get rid of ilo_state_vertex_buffer_info or ilo_state_vertex_buffer eventually. But we want them now as they are how we do cross-validation right now.	2015-06-20 11:14:14 +08:00
Chia-I Wu	4555211028	ilo: add 3DSTATE_VF_INSTANCING to ilo_state_vf 3DSTATE_VF_INSTANCING specifies instancing enable and step rate. They are specified along with 3DSTATE_VERTEX_BUFFERS instead prior to Gen8. Both commands are added.	2015-06-20 11:14:14 +08:00
Chia-I Wu	e8d297b7a1	ilo: add 3DSTATE_VF to ilo_state_vf 3DSTATE_VF specifies cut index enable and cut index. Cut index enable is specified in 3DSTATE_INDEX_BUFFER instead prior to Gen7.5. Both commands are added.	2015-06-20 11:14:14 +08:00
Chia-I Wu	7b3432b62d	ilo: embed pipe_index_buffer in ilo_ib_state Make it obvious that we save a copy of pipe_index_buffer.	2015-06-20 11:14:10 +08:00
Chia-I Wu	73f0d6d22d	ilo: fix a buffer overrun Add missing parentheses in SURFTYPE_NULL initialization.	2015-06-20 11:13:20 +08:00
Chia-I Wu	aa3ec8bc46	ilo: fix a -Wmaybe-uninitialized warning ilo_shader.c: In function ‘ilo_shader_select_kernel_sbe’: ilo_shader.c:1140:27: warning: ‘src_skip’ may be used uninitialized in this function [-Wmaybe-uninitialized]	2015-06-20 11:13:20 +08:00
Brian Paul	a1f84453a2	glsl: fix formatting glitch in _mesa_print_ir() Print the closing ) before the newline. Trivial.	2015-06-19 16:46:29 -06:00
Ben Widawsky	7c3da3592e	i965/gen8: Use HALIGN_16 for single sample mcs buffers The original code meant to do this, but was only checking num_samples == 1 to figure out if a surface was fast clear capable. However, we can allocate single sample miptrees with num_samples == 0 (when it's an internally created buffer). This fixes a bunch of the piglit tests on gen8. Other gens should have been fine. Here is the order of events that allowed this to slip through: t0: I wrote halign patches and tested them. These alignment assertions are for gen8 fast clear surfaces, basically. t1: I pushed bogus perf patch which made fast clears never happen t2: Reworked halign patches based on Chad's feedback and introduced the bug this patch fixes. t2.5: I tested reworked patches, but assertion wasn't hit because of t1. t3. Matt fixed issue in t1 which made fast clears happen here: commit `22af95af83` Author: Matt Turner <mattst88@gmail.com> Date: Thu Jun 18 16:14:50 2015 -0700 i965: Add missing braces around if-statement. This logic should match that of the v1 of my halign patch series. Cc: Kenneth Graunke <kenneth@whitecape.org> Cc: Matt Turner <mattst88@gmail.com> Reported-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Ben Widawsky <ben@bwidawsk.net> Tested-by: Mark Janes <mark.a.janes@intel.com>	2015-06-19 11:25:00 -07:00
Ilia Mirkin	539cb2b76e	mesa: move ARB_gs5 enums to core, EXT_polygon_offset_clamp to desktop When adding EXT_polygon_offset_clamp, I first made it core-only, and never moved the enum getter back to the GL/GL_CORE section. Similarly, ARB_gs5 is a core-only extension, so move its getters to the GL_CORE section. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Matt Turner <mattst88@gmail.com>	2015-06-19 14:11:27 -04:00
Brian Paul	6ec4e9c28d	u_vbuf: fix src_offset alignment in u_vbuf_create_vertex_elements() If the driver says PIPE_CAP_VERTEX_ELEMENT_SRC_OFFSET_4BYTE_ALIGNED_ONLY=1, the driver should never receive a pipe_vertex_element::src_offset value that's not a multiple of four. But the vbuf code wasn't actually adjusting the src_offset value when creating the vertex element state object. We just need to align the src_offset values put in the driver_attribs[] array. See the piglit gl-1.5-vertex-buffer-offsets test. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2015-06-19 10:54:24 -06:00
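
Note on 6ec4e9c28d (u_vbuf src_offset alignment): the commit message says the src_offset values placed in driver_attribs[] need to be aligned to a multiple of four when the driver reports PIPE_CAP_VERTEX_ELEMENT_SRC_OFFSET_4BYTE_ALIGNED_ONLY. The following is a minimal sketch of that idea, not the code from the commit. The struct `vertex_element_sketch` and the helpers `align_up()` and `align_src_offsets()` are hypothetical stand-ins, and rounding up (rather than down) is an assumption made for illustration.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical stand-in for pipe_vertex_element; only the field that the
 * commit message mentions is modelled here. */
struct vertex_element_sketch {
   uint32_t src_offset;
};

/* Round `value` up to the next multiple of `alignment`.
 * `alignment` must be a non-zero power of two. */
static inline uint32_t
align_up(uint32_t value, uint32_t alignment)
{
   assert(alignment != 0 && (alignment & (alignment - 1)) == 0);
   return (value + alignment - 1) & ~(alignment - 1);
}

/* If the (hypothetical) `aligned_only` flag is set, make every src_offset
 * handed to the driver a multiple of four. Otherwise leave them untouched. */
static inline void
align_src_offsets(struct vertex_element_sketch *attribs, unsigned count,
                  int aligned_only)
{
   if (!aligned_only)
      return;

   for (unsigned i = 0; i < count; i++)
      attribs[i].src_offset = align_up(attribs[i].src_offset, 4);
}
```

With the flag set, an offset of 2 would become 4 and an offset of 8 would stay at 8. With the flag clear, offsets pass through unchanged.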


70757 commits
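
Note on 660d5c1646 (a4xx encodes larger immed offset): the commit message says that without the fix, negative branch/jump offsets look like very large positive offsets. The sketch below illustrates why that can happen when a signed value is truncated to a narrower field than the one the consumer decodes. It is not taken from the ir3 source: `encode_field()` and `decode_signed_field()` are hypothetical helpers, and the widths 16 and 20 used in the usage note are illustrative assumptions only.

```c
#include <assert.h>
#include <stdint.h>

/* Keep only the low `bits` bits of a signed value, as an encoder that
 * writes a `bits`-wide immediate field would. Requires 1 <= bits < 32. */
static inline uint32_t
encode_field(int32_t value, unsigned bits)
{
   assert(bits >= 1 && bits < 32);
   return (uint32_t)value & ((1u << bits) - 1);
}

/* Interpret the low `bits` bits of `raw` as a two's-complement number.
 * Requires 1 <= bits < 32. */
static inline int32_t
decode_signed_field(uint32_t raw, unsigned bits)
{
   assert(bits >= 1 && bits < 32);
   uint32_t mask = (1u << bits) - 1;
   uint32_t sign = 1u << (bits - 1);

   raw &= mask;
   /* Flipping the sign bit and subtracting it back sign-extends the field. */
   return (int32_t)(raw ^ sign) - (int32_t)sign;
}
```

For example, encoding -2 into an assumed 16-bit field and decoding it as an assumed 20-bit field yields 65534, a large positive offset. Encoding and decoding with the same width returns -2.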