fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-06 13:48:06 +02:00

Author	SHA1	Message	Date
Ilia Mirkin	0c38006b55	mesa: force height of 1D textures to be 1 in texture views Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>	2014-09-01 18:38:02 -04:00
Ilia Mirkin	2c44043313	nv50: attach the buffer bo to the miptree structures The current code... makes no sense. Use nouveau_bo_ref to attach the bo to the exposed resource so as to have the proper lifetime guarantees. Tested-by: Emil Velikov <emil.l.velikov@gmail.com> Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "10.2 10.3" <mesa-stable@lists.freedesktop.org>	2014-09-01 18:38:02 -04:00
Ilia Mirkin	9d52e551a5	nv50: mt address may not be the underlying bo's start address With VP2, nv50_miptree is faked because the underlying bo's have to be laid out in a certain way. This is done by adjusting the address. Make sure that blits (and everything else for consistency) use the mt address rather than the bo address as a base. This fixes retrieving chroma plane with VDPAU. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=82255 Tested-by: Emil Velikov <emil.l.velikov@gmail.com> Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "10.2 10.3" <mesa-stable@lists.freedesktop.org>	2014-09-01 18:38:02 -04:00
Ilia Mirkin	2528d402b9	nv50: set the miptree address when clearing bo's in vp2 init The mt address is about to be used more, make sure it's set appropriately. Reported-by: Emil Velikov <emil.l.velikov@gmail.com> Tested-by: Emil Velikov <emil.l.velikov@gmail.com> Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "10.2 10.3" <mesa-stable@lists.freedesktop.org>	2014-09-01 18:38:02 -04:00
Ilia Mirkin	6c2b079231	nv50/ir: avoid creating instructions that can't be emitted When constant folding a MAD operation, we first fold the multiply and generate an ADD. However we do so without making sure that the immediate can be handled in the saturate case. If it can't, load the immediate in a separate instruction. Reported-by: Tiziano Bacocco <tizbac2@gmail.com> Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "10.2 10.3" <mesa-stable@lists.freedesktop.org>	2014-09-01 18:38:02 -04:00
Ilia Mirkin	115d9a5525	nvc0: don't make 1d staging textures linear Experimentally, the sampler doesn't appear to like these, neither as buffer nor as rect textures. So remove 1D from the list of texture types to make linear when used for staging. This fixes the OSD in mplayer for VDPAU. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "10.2 10.3" <mesa-stable@lists.freedesktop.org>	2014-09-01 18:38:02 -04:00
Ilia Mirkin	362cd26960	nv50: zero out unbound samplers Samplers are only defined up to num_samplers, so set all samplers above nr to NULL so that we don't try to read them again later. Tested-by: Christian Ruppert <idl0r@qasl.de> Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "10.2 10.3" <mesa-stable@lists.freedesktop.org>	2014-09-01 18:38:02 -04:00
Ilia Mirkin	c4bb436f76	nvc0/ir: avoid infinite recursion when finding first uses of tex In certain circumstances, findFirstUses could end up doubling back on instructions it had already processed, resulting in an infinite recursion. Avoid this by keeping track of already-visited instructions. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=83079 Tested-by: Tobias Klausmann <tobias.johannes.klausmann@mni.thm.de> Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "10.2 10.3" <mesa-stable@lists.freedesktop.org>	2014-09-01 18:38:02 -04:00
Rob Clark	ef858ac770	freedreno/ir3: add DDX/DDY Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-09-01 18:08:21 -04:00
Rob Clark	5e5604cc28	freedreno/ir3: don't keep IR around Once we've assembled the shader, no need to keep the intermediate around. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-09-01 18:08:21 -04:00
Jason Ekstrand	e8f83538dd	i965/fs: Don't segfault when debug-logging a null program Signed-off-by: Jason Ekstrand <jason.ekstrand@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-09-01 12:33:13 -07:00
Jason Ekstrand	1c573c9adb	i965/vec4: Don't segfault when debug-logging a null program Signed-off-by: Jason Ekstrand <jason.ekstrand@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-09-01 12:31:56 -07:00
Marek Olšák	a10c8db715	radeonsi: implement EXPCLEAR optimization for depth Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2014-09-01 21:18:52 +02:00
Marek Olšák	f05fe294e7	r600g,radeonsi: initialize HTILE to fully-expanded state Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2014-09-01 21:18:52 +02:00
Marek Olšák	573313c94e	radeonsi: implement fast depth clear Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2014-09-01 21:18:51 +02:00
Marek Olšák	63cb4077e6	radeonsi: move DB_RENDER_CONTROL into draw_vbo So that I can add fast depth clear. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2014-09-01 21:18:51 +02:00
Marek Olšák	78aa717601	radeonsi: disable occlusion queries if they are not needed We always left them enabled, which turned off HiZ in some cases. This should improve performace with Hyper-Z. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2014-09-01 21:18:51 +02:00
Marek Olšák	ab9ad91779	r600g,radeonsi: force fast stencil and HTILE stencil off, fixing a Hyper-Z hang This should be as fast as no HTILE for stencil. I think we can still get full performance with depth-only rendering even if stencil is present in the buffer but not used, but I'm not 100% sure. This may be revisited when HiS and fast stencil clear are implemented. This fixes a hang in Brutal Legend. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=64471 Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2014-09-01 21:18:51 +02:00
Marek Olšák	ba14d4910c	r600g: set VGT_ENHANCE=4 on R7xx This is a golden setting on RV740, but there is a hw bug which recommends setting it on all R7xx chipsets. Acked-by: Michel Dänzer <michel.daenzer@amd.com>	2014-09-01 21:18:49 +02:00
Marek Olšák	13b93596da	r600g: expose AMD_vertex_shader_layer and *_viewport_index on R600-R700 already implemented Acked-by: Michel Dänzer <michel.daenzer@amd.com>	2014-09-01 21:18:45 +02:00
Marek Olšák	d159c5e3e0	r600g: fix layered clear Cc: mesa-stable@lists.freedesktop.org Acked-by: Michel Dänzer <michel.daenzer@amd.com>	2014-09-01 21:18:42 +02:00
Marek Olšák	e6d191bb6f	r600g: some DB bug workarounds for R6xx DB flushing Acked-by: Michel Dänzer <michel.daenzer@amd.com>	2014-09-01 21:18:40 +02:00
Marek Olšák	0ccc653c70	r600g: enable fast depth clear for array textures and cubemaps I have a piglit test that hits this. Acked-by: Michel Dänzer <michel.daenzer@amd.com>	2014-09-01 21:18:37 +02:00
Marek Olšák	6d751065cc	r600g: use HTILE allocator from SI It's almost the same. This enables tiling for HTILE. It also enables Hyper-Z for other texture targets (1D, 1D_ARRAY, 2D_ARRAY, CUBE, CUBE_ARRAY, 3D, RECT). 2D array depth textures are tested by Unigine Sanctuary and my new piglit test. Acked-by: Michel Dänzer <michel.daenzer@amd.com>	2014-09-01 21:18:33 +02:00
Marek Olšák	ee1b30eaff	r600g: set DB_DEPTH_SIZE.HEIGHT_TILE_MAX for EG/CM, inline other fields This fixes rendering to non-zero layer/face/slice with HTILE. v2: added the assertion Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2014-09-01 21:17:40 +02:00
Marek Olšák	91050ff215	radeonsi: set DB_DEPTH_SIZE.HEIGHT_TILE_MAX, inline other fields This fixes rendering to a non-zero layer/face/slice with HTILE. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=72685 v2: added the assertion Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2014-09-01 21:15:36 +02:00
Glenn Kennard	8d0f6ff810	r600g: Implement sm5 geometry shader instancing Requires Evergreen or later hardware. Signed-off-by: Glenn Kennard <glenn.kennard@gmail.com>	2014-09-01 21:12:03 +02:00
Marek Olšák	482def592f	glsl_to_tgsi: allocate and enlarge arrays for temporaries on demand This fixes crashes if the number of temporaries is greater than 4096. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=66184 v2: added fail paths for realloc failures Cc: 10.2 10.3 mesa-stable@lists.freedesktop.org Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2014-09-01 21:03:58 +02:00
Marek Olšák	b419c651fb	gallium/pb_bufmgr_cache: limit the size of cache This should make a machine which is running piglit more responsive at times. e.g. streaming-texture-leak can easily eat 600 MB because of how fast it creates new textures.	2014-09-01 20:17:48 +02:00
Marek Olšák	bba7d29a86	pipe-loader: use the correct screen index	2014-09-01 20:09:19 +02:00
Marek Olšák	0b56e23e7f	egl/dri2: use the correct screen index Required for multi-GPU configuration where each GPU has its own X screen.	2014-09-01 20:09:19 +02:00
Jordan Justen	1a428a5256	docs: Mark ARB_compute_shader as work in progress Signed-off-by: Jordan Justen <jordan.l.justen@intel.com>	2014-09-01 10:45:37 -07:00
Connor Abbott	d571f2b15d	i965/fs: don't use ir->shadow_comparitor in emit_texture_* Signed-off-by: Connor Abbott <connor.abbott@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-09-01 00:55:14 -07:00
Connor Abbott	cbfcb1b069	i965/fs: don't pass ir_variable * to emit_samplepos_setup() We were only using it to get at its type, which we already know because it's a builtin variable. Signed-off-by: Connor Abbott <connor.abbott@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-09-01 00:12:15 -07:00
Connor Abbott	ec3d06f591	i965/fs: don't pass ir_variable * to emit_frontfacing_interpolation() We were only using it to get at its type, which we already know because it's a builtin variable. v2 (Ken): Rebase on Matt's optimized gl_FrontFacing calculations. Signed-off-by: Connor Abbott <connor.abbott@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-09-01 00:11:16 -07:00
Kenneth Graunke	70691f0c28	i965: Fix GPU hangs when INTEL_DEBUG=no16 is set. The replicated data clear shader needs to be SIMD16, or else the GPU will hang. So, compile it even if INTEL_DEBUG=no16 is set. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-08-31 17:03:31 -07:00
Emil Velikov	88cbe3908f	mesa: fix make tarballs Current method of generating distribution tar-balls involves manually invoking make + target name in the appropriate places. This temporary solution is used until we get 'make dist' working. Currently it does not work, as in order to have the target (which is also a filename) available in the final Makefile we need to add a PHONY target + use the correct target name. Cc: "10.2 10.3" <mesa-stable@lists.freedesktop.org> Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com>	2014-09-01 00:22:20 +01:00
Abdiel Janulgue	5598458e69	i965/vec4: Remove try_emit_saturate Now that saturate is implemented natively as an instruction, we can cut down on unneeded functionality. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>	2014-08-31 21:04:09 +03:00
Abdiel Janulgue	cbd225057a	i965/fs: Refactor try_emit_saturate v3: Since the fs backend can emit saturate as a separate instruction, there is no need to detect for min/max instructions and to rewrite the instruction tree accordingly. On the other hand, we don't need to emit a separate saturated mov either when the expression generating src can do saturate directly. v4: Add can_do_saturate() check before enabling saturate modifer (Ken) Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>	2014-08-31 21:04:09 +03:00
Abdiel Janulgue	b2c0c35907	ir_to_mesa, glsl_to_tgsi: Remove try_emit_saturate Now that saturate is implemented natively as instruction, we can cut down on unneeded functionality. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>	2014-08-31 21:04:09 +03:00
Abdiel Janulgue	7841a246b9	i965/vec4: Allow propagation of instructions with saturate flag to sel When sel conditon is bounded within 0 and 1.0. This allows code as: mov.sat a b sel.ge dst a 0.25F To be propagated as: sel.ge.sat dst b 0.25F v3: - Syntax clarifications in inst->saturate assignment - Remove extra parenthesis when assigning src_reg value from copy_entry (Matt Turner) v4: - Take channels into consideration when propagating saturated instructions. Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>	2014-08-31 21:04:09 +03:00
Abdiel Janulgue	40aeb558ce	i965/fs: Allow propagation of instructions with saturate flag to sel When sel conditon is bounded within 0 and 1.0. This allows code as: mov.sat a b sel.ge dst a 0.25F To be propagated as: sel.ge.sat dst b 0.25F v3: Syntax clarifications in inst->saturate assignment (Matt Turner) Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>	2014-08-31 21:04:09 +03:00
Abdiel Janulgue	0e2ba3ee82	glsl: Optimize clamp(x, b, 1.0), where b > 0.0 as max(saturate(x),b) v2: - Output max(saturate(x),b) instead of saturate(max(x,b)) - Make sure we do component-wise comparison for vectors (Ian Romanick) v3: - Add missing condition where the outer constant value is > 0.0 and inner constant is 1.0. - Fix comments to show that the optimization is a commutative operation (Matt Turner) Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>	2014-08-31 21:04:08 +03:00
Abdiel Janulgue	d92394c5d8	glsl: Optimize clamp(x, 0.0, b), where b < 1.0 as min(saturate(x),b) v2: - Output min(saturate(x),b) instead of saturate(min(x,b)) suggested by Ilia Mirkin - Make sure we do component-wise comparison for vectors (Ian Romanick) v3: - Add missing condition where the outer constant value is zero and inner constant is < 1 - Fix comments to reflect we are doing a commutative operation (Matt Turner) Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>	2014-08-31 21:04:08 +03:00
Abdiel Janulgue	8f890b119e	glsl: Optimize clamp(x, 0, 1) as saturate(x) v2: - Check that the base type is float (Ian Romanick) v3: - Make sure comments reflect that we are doing a commutative operation - Add missing condition where the inner constant is 1.0 and outer constant is 0.0 - Make indexing of operands easier to read (Matt Turner) Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>	2014-08-31 21:04:08 +03:00
Abdiel Janulgue	cbd0d643a3	glsl: Implement saturate as ir_unop_saturate Now that we have the ir_unop_saturate implemented as a single instruction, generate the correct simplified expression. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>	2014-08-31 21:04:08 +03:00
Abdiel Janulgue	cb621166dc	yi965/vec4: Add support for ir_unop_saturate Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>	2014-08-31 21:04:08 +03:00
Abdiel Janulgue	4bfe8a1e61	i965/fs: Add support for ir_unop_saturate Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>	2014-08-31 21:04:08 +03:00
Abdiel Janulgue	909fa50f5b	ir_to_mesa, glsl_to_tgsi: Add support for ir_unop_saturate Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>	2014-08-31 21:04:08 +03:00
Abdiel Janulgue	cfa8c1cb39	ir_to_mesa, glsl_to_tgsi: lower ir_unop_saturate Needed when vertex programs doesn't allow saturate Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>	2014-08-31 21:04:08 +03:00

1 2 3 4 5 ...

65060 commits