fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-07 11:28:05 +02:00

Author	SHA1	Message	Date
Michel Dänzer	51131c423c	r600g,radeonsi: Inform the kernel if a BO will likely be accessed by the CPU This allows the kernel to prevent such BOs from ever being stored in the CPU inaccessible part of VRAM. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2014-09-02 15:24:07 +09:00
Dave Airlie	2d5d1f5598	glsl: free uniform_map on failure path. If we fail in reserve_explicit_locations, we leak uniform_map. Reported-by: coverity scanner. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	2014-09-02 16:05:52 +10:00
Paul Berry	9f20503658	main/cs: Add gl_context::ComputeProgram Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2014-09-01 19:38:27 -07:00
Jordan Justen	d035d50e05	mesa: Convert NewDriverState to 64-bits i965 will have more than 32 bits when BRW_STATE_COMPUTE_PROGRAM is added. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2014-09-01 19:38:27 -07:00
Paul Berry	8e27a4d2b3	i965: Modify state upload to allow 2 different sets of state atoms. The set of state atoms for compute shaders is currently empty; it will be filled in by future patches. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2014-09-01 19:38:27 -07:00
Paul Berry	373143ed91	i965: Modify dirty bit handling to support 2 pipelines. The hardware state for compute shaders is almost entirely orthogonal to the hardware state for 3D rendering. To avoid sending unnecessary state to the hardware, we'll need to have a separate set of state atoms for the compute pipeline and the 3D pipeline. That means we need to maintain two separate sets of dirty bits to determine which state atoms need to be run. But the dirty bits are not completely independent; for example, if BRW_NEW_SURFACES is flagged while doing 3D rendering, then not only do we need to re-run 3D state atoms that depend on BRW_NEW_SURFACES, but we also need to re-run compute state atoms that depend on BRW_NEW_SURFACES. But we'll also need to re-run those state atoms the next time the compute pipeline is run. To accomplish this, we record two sets of dirty bits, one for each pipeline. When bits are dirtied (via SET_DIRTY_BIT() or SET_DIRTY_ALL()) we set them to the dirty state in both pipelines. When brw_state_upload() is run, we clear the dirty bits just for the pipeline that was run. Note that since the number of pipelines is known at compile time to be 2, the compiler should unroll the loops in SET_DIRTY_BIT() and SET_DIRTY_ALL(). Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2014-09-01 19:38:27 -07:00
Paul Berry	c5bdf9be1e	i965: Create a macro for checking a dirty bit. This will make it easier to extend dirty bit handling to support compute shaders. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2014-09-01 19:38:27 -07:00
Paul Berry	6f56e1424d	i965: Create a macro for setting all dirty bits. This will make it easier to extend dirty bit handling to support compute shaders. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2014-09-01 19:38:27 -07:00
Paul Berry	88e3d404da	i965: Create a macro for setting a dirty bit. This will make it easier to extend dirty bit handling to support compute shaders. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2014-09-01 19:38:27 -07:00
Dave Airlie	94a909ec2d	i965: add missing parens in vec4 visitor Coverity reported this; Matt said it looks like missing parens, not bad indenting, so let's try that. Cc: "10.2 10.3" <mesa-stable@lists.freedesktop.org> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz> Signed-off-by: Dave Airlie <airlied@redhat.com>	2014-09-02 11:07:11 +10:00
Dave Airlie	19f6e80a1e	nouveau: don't leak dec struct on error This one path doesn't goto fail, so it seems to leak dec. Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Dave Airlie <airlied@redhat.com>	2014-09-02 10:08:58 +10:00
Dave Airlie	32a8b2cf54	xvmc/tests: %C isn't a valid printf specifier. Reported-by: Coverity scanner. Signed-off-by: Dave Airlie <airlied@redhat.com>	2014-09-02 10:07:54 +10:00
Dave Airlie	ea88b1de2f	nouveau/nv40: quieten coverity warning in unused vertex texture code. This fixes the code, but we never run it anyway, so silence coverity. Signed-off-by: Dave Airlie <airlied@redhat.com>	2014-09-02 10:04:29 +10:00
Ilia Mirkin	d0cd86686d	nv50: remove unused variables Recent code changes have caused these to no longer be used. Remove them. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2014-09-01 18:47:42 -04:00
Ilia Mirkin	0c38006b55	mesa: force height of 1D textures to be 1 in texture views Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>	2014-09-01 18:38:02 -04:00
Ilia Mirkin	2c44043313	nv50: attach the buffer bo to the miptree structures The current code... makes no sense. Use nouveau_bo_ref to attach the bo to the exposed resource so as to have the proper lifetime guarantees. Tested-by: Emil Velikov <emil.l.velikov@gmail.com> Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "10.2 10.3" <mesa-stable@lists.freedesktop.org>	2014-09-01 18:38:02 -04:00
Ilia Mirkin	9d52e551a5	nv50: mt address may not be the underlying bo's start address With VP2, nv50_miptree is faked because the underlying bo's have to be laid out in a certain way. This is done by adjusting the address. Make sure that blits (and everything else for consistency) use the mt address rather than the bo address as a base. This fixes retrieving chroma plane with VDPAU. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=82255 Tested-by: Emil Velikov <emil.l.velikov@gmail.com> Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "10.2 10.3" <mesa-stable@lists.freedesktop.org>	2014-09-01 18:38:02 -04:00
Ilia Mirkin	2528d402b9	nv50: set the miptree address when clearing bo's in vp2 init The mt address is about to be used more, make sure it's set appropriately. Reported-by: Emil Velikov <emil.l.velikov@gmail.com> Tested-by: Emil Velikov <emil.l.velikov@gmail.com> Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "10.2 10.3" <mesa-stable@lists.freedesktop.org>	2014-09-01 18:38:02 -04:00
Ilia Mirkin	6c2b079231	nv50/ir: avoid creating instructions that can't be emitted When constant folding a MAD operation, we first fold the multiply and generate an ADD. However we do so without making sure that the immediate can be handled in the saturate case. If it can't, load the immediate in a separate instruction. Reported-by: Tiziano Bacocco <tizbac2@gmail.com> Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "10.2 10.3" <mesa-stable@lists.freedesktop.org>	2014-09-01 18:38:02 -04:00
Ilia Mirkin	115d9a5525	nvc0: don't make 1d staging textures linear Experimentally, the sampler doesn't appear to like these, neither as buffer nor as rect textures. So remove 1D from the list of texture types to make linear when used for staging. This fixes the OSD in mplayer for VDPAU. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "10.2 10.3" <mesa-stable@lists.freedesktop.org>	2014-09-01 18:38:02 -04:00
Ilia Mirkin	362cd26960	nv50: zero out unbound samplers Samplers are only defined up to num_samplers, so set all samplers above nr to NULL so that we don't try to read them again later. Tested-by: Christian Ruppert <idl0r@qasl.de> Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "10.2 10.3" <mesa-stable@lists.freedesktop.org>	2014-09-01 18:38:02 -04:00
Ilia Mirkin	c4bb436f76	nvc0/ir: avoid infinite recursion when finding first uses of tex In certain circumstances, findFirstUses could end up doubling back on instructions it had already processed, resulting in an infinite recursion. Avoid this by keeping track of already-visited instructions. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=83079 Tested-by: Tobias Klausmann <tobias.johannes.klausmann@mni.thm.de> Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "10.2 10.3" <mesa-stable@lists.freedesktop.org>	2014-09-01 18:38:02 -04:00
Rob Clark	ef858ac770	freedreno/ir3: add DDX/DDY Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-09-01 18:08:21 -04:00
Rob Clark	5e5604cc28	freedreno/ir3: don't keep IR around Once we've assembled the shader, no need to keep the intermediate around. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-09-01 18:08:21 -04:00
Jason Ekstrand	e8f83538dd	i965/fs: Don't segfault when debug-logging a null program Signed-off-by: Jason Ekstrand <jason.ekstrand@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-09-01 12:33:13 -07:00
Jason Ekstrand	1c573c9adb	i965/vec4: Don't segfault when debug-logging a null program Signed-off-by: Jason Ekstrand <jason.ekstrand@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-09-01 12:31:56 -07:00
Marek Olšák	a10c8db715	radeonsi: implement EXPCLEAR optimization for depth Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2014-09-01 21:18:52 +02:00
Marek Olšák	f05fe294e7	r600g,radeonsi: initialize HTILE to fully-expanded state Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2014-09-01 21:18:52 +02:00
Marek Olšák	573313c94e	radeonsi: implement fast depth clear Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2014-09-01 21:18:51 +02:00
Marek Olšák	63cb4077e6	radeonsi: move DB_RENDER_CONTROL into draw_vbo So that I can add fast depth clear. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2014-09-01 21:18:51 +02:00
Marek Olšák	78aa717601	radeonsi: disable occlusion queries if they are not needed We always left them enabled, which turned off HiZ in some cases. This should improve performance with Hyper-Z. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2014-09-01 21:18:51 +02:00
Marek Olšák	ab9ad91779	r600g,radeonsi: force fast stencil and HTILE stencil off, fixing a Hyper-Z hang This should be as fast as no HTILE for stencil. I think we can still get full performance with depth-only rendering even if stencil is present in the buffer but not used, but I'm not 100% sure. This may be revisited when HiS and fast stencil clear are implemented. This fixes a hang in Brutal Legend. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=64471 Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2014-09-01 21:18:51 +02:00
Marek Olšák	ba14d4910c	r600g: set VGT_ENHANCE=4 on R7xx This is a golden setting on RV740, but there is a hw bug which recommends setting it on all R7xx chipsets. Acked-by: Michel Dänzer <michel.daenzer@amd.com>	2014-09-01 21:18:49 +02:00
Marek Olšák	13b93596da	r600g: expose AMD_vertex_shader_layer and *_viewport_index on R600-R700 already implemented Acked-by: Michel Dänzer <michel.daenzer@amd.com>	2014-09-01 21:18:45 +02:00
Marek Olšák	d159c5e3e0	r600g: fix layered clear Cc: mesa-stable@lists.freedesktop.org Acked-by: Michel Dänzer <michel.daenzer@amd.com>	2014-09-01 21:18:42 +02:00
Marek Olšák	e6d191bb6f	r600g: some DB bug workarounds for R6xx DB flushing Acked-by: Michel Dänzer <michel.daenzer@amd.com>	2014-09-01 21:18:40 +02:00
Marek Olšák	0ccc653c70	r600g: enable fast depth clear for array textures and cubemaps I have a piglit test that hits this. Acked-by: Michel Dänzer <michel.daenzer@amd.com>	2014-09-01 21:18:37 +02:00
Marek Olšák	6d751065cc	r600g: use HTILE allocator from SI It's almost the same. This enables tiling for HTILE. It also enables Hyper-Z for other texture targets (1D, 1D_ARRAY, 2D_ARRAY, CUBE, CUBE_ARRAY, 3D, RECT). 2D array depth textures are tested by Unigine Sanctuary and my new piglit test. Acked-by: Michel Dänzer <michel.daenzer@amd.com>	2014-09-01 21:18:33 +02:00
Marek Olšák	ee1b30eaff	r600g: set DB_DEPTH_SIZE.HEIGHT_TILE_MAX for EG/CM, inline other fields This fixes rendering to non-zero layer/face/slice with HTILE. v2: added the assertion Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2014-09-01 21:17:40 +02:00
Marek Olšák	91050ff215	radeonsi: set DB_DEPTH_SIZE.HEIGHT_TILE_MAX, inline other fields This fixes rendering to a non-zero layer/face/slice with HTILE. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=72685 v2: added the assertion Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2014-09-01 21:15:36 +02:00
Glenn Kennard	8d0f6ff810	r600g: Implement sm5 geometry shader instancing Requires Evergreen or later hardware. Signed-off-by: Glenn Kennard <glenn.kennard@gmail.com>	2014-09-01 21:12:03 +02:00
Marek Olšák	482def592f	glsl_to_tgsi: allocate and enlarge arrays for temporaries on demand This fixes crashes if the number of temporaries is greater than 4096. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=66184 v2: added fail paths for realloc failures Cc: 10.2 10.3 mesa-stable@lists.freedesktop.org Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2014-09-01 21:03:58 +02:00
Marek Olšák	b419c651fb	gallium/pb_bufmgr_cache: limit the size of cache This should make a machine which is running piglit more responsive at times. e.g. streaming-texture-leak can easily eat 600 MB because of how fast it creates new textures.	2014-09-01 20:17:48 +02:00
Marek Olšák	bba7d29a86	pipe-loader: use the correct screen index	2014-09-01 20:09:19 +02:00
Marek Olšák	0b56e23e7f	egl/dri2: use the correct screen index Required for multi-GPU configuration where each GPU has its own X screen.	2014-09-01 20:09:19 +02:00
Jordan Justen	1a428a5256	docs: Mark ARB_compute_shader as work in progress Signed-off-by: Jordan Justen <jordan.l.justen@intel.com>	2014-09-01 10:45:37 -07:00
Connor Abbott	d571f2b15d	i965/fs: don't use ir->shadow_comparitor in emit_texture_* Signed-off-by: Connor Abbott <connor.abbott@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-09-01 00:55:14 -07:00
Connor Abbott	cbfcb1b069	i965/fs: don't pass ir_variable * to emit_samplepos_setup() We were only using it to get at its type, which we already know because it's a builtin variable. Signed-off-by: Connor Abbott <connor.abbott@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-09-01 00:12:15 -07:00
Connor Abbott	ec3d06f591	i965/fs: don't pass ir_variable * to emit_frontfacing_interpolation() We were only using it to get at its type, which we already know because it's a builtin variable. v2 (Ken): Rebase on Matt's optimized gl_FrontFacing calculations. Signed-off-by: Connor Abbott <connor.abbott@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-09-01 00:11:16 -07:00
Kenneth Graunke	70691f0c28	i965: Fix GPU hangs when INTEL_DEBUG=no16 is set. The replicated data clear shader needs to be SIMD16, or else the GPU will hang. So, compile it even if INTEL_DEBUG=no16 is set. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-08-31 17:03:31 -07:00
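
The two-pipeline dirty-bit scheme described in commit 373143ed91 can be illustrated with a minimal sketch. It is not the i965 driver's code: every name below (`struct dirty_state`, `set_dirty_bit`, `set_dirty_all`, `check_dirty_bit`, `state_upload`, the `PIPELINE_*` enumerators) is hypothetical and only loosely modelled on the macros the commit messages mention. The 64-bit flag word follows the width mentioned in commit d035d50e05. Setting a bit dirties it in both pipelines; uploading state clears the bits only for the pipeline that ran.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical names throughout; an illustration, not Mesa's implementation. */
enum pipeline {
    PIPELINE_RENDER = 0,
    PIPELINE_COMPUTE = 1,
    NUM_PIPELINES = 2   /* known at compile time, so the loops below can unroll */
};

struct dirty_state {
    uint64_t dirty[NUM_PIPELINES];  /* one set of dirty bits per pipeline */
};

/* Flag a bit as dirty in every pipeline. */
static inline void set_dirty_bit(struct dirty_state *s, uint64_t bit)
{
    for (int i = 0; i < NUM_PIPELINES; i++)
        s->dirty[i] |= bit;
}

/* Flag all bits as dirty in every pipeline. */
static inline void set_dirty_all(struct dirty_state *s)
{
    for (int i = 0; i < NUM_PIPELINES; i++)
        s->dirty[i] = ~(uint64_t)0;
}

/* Test whether a bit is dirty for one specific pipeline. */
static inline bool check_dirty_bit(const struct dirty_state *s,
                                   enum pipeline p, uint64_t bit)
{
    assert((int)p >= 0 && (int)p < NUM_PIPELINES);
    return (s->dirty[p] & bit) != 0;
}

/* Stand-in for a state upload: a real driver would first run the state
 * atoms whose dirty bits are set. Here we only clear the bits, and only
 * for the pipeline that was run. */
static inline void state_upload(struct dirty_state *s, enum pipeline p)
{
    assert((int)p >= 0 && (int)p < NUM_PIPELINES);
    s->dirty[p] = 0;
}
```

With this layout, a flag set while doing 3D rendering stays pending for the compute pipeline after the 3D upload clears its own copy, which is the behaviour the commit message describes for BRW_NEW_SURFACES.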


65224 commits