fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-24 04:08:10 +02:00

Author	SHA1	Message	Date
Marek Olšák	21ba8a204e	radeonsi: use gallivm less Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-06-25 18:33:58 -04:00
Marek Olšák	965904eebd	radeonsi: stop using lp_bld_intr.h Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-06-25 18:33:58 -04:00
Marek Olšák	6ab54d25a6	radeonsi: remove last uses of lp_build_context::undef Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-06-25 18:33:58 -04:00
Marek Olšák	30f3e2200a	radeonsi: stop using lp_bld_arit.h Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-06-25 18:33:58 -04:00
Marek Olšák	5f54fc3ad1	radeonsi: stop using lp_build_gather_values Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-06-25 18:33:58 -04:00
Marek Olšák	7bd40dc2f2	radeonsi: clean up some #includes Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-06-25 18:33:58 -04:00
Marek Olšák	f154555733	radeonsi: clean up passing the is_monolithic flag for compilation Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-06-25 18:33:58 -04:00
Robert Foss	5a34aba07d	gallium/util: Fix build error due to cast to different size Signed-off-by: Robert Foss <robert.foss@collabora.com> Reviewed-by: Tomasz Figa <tfiga@chromium.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-06-25 18:54:09 +02:00
Gert Wollny	eebb65258d	r600/sb: give the scheduler more margin to find valid instructions groups For instruction sequences that change the address register with every load the current limit to bail out of the scheduler and reject the optimisation was too tight, i.e. it was expected that at least one pending instruction would be scheduled each time. Give the scheduler more margin to sort out these load sequences by allowing a number of rounds where no instruction is scheduled. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106163 Signed-off-by: Gert Wollny <gw.fossdev@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-06-25 05:40:19 +01:00
Gert Wollny	cd7db0ab0a	r600/sb: fix rotated register in while loop This patch is based on https://lists.freedesktop.org/archives/mesa-dev/2018-February/185805.html Dave Airlie: "A bunch of CTS tests led me to write tests/shaders/ssa/fs-while-loop-rotate-value.shader_test which r600/sb always fell over on. GCM seems to move some of the copies into other basic blocks, if we don't allow this to happen then it doesn't seem to schedule them badly. Everything I've read on SSA/phi copies say they have to happen in parallel, so keeping them in the same basic block seems like a good way to keep some of that property." This patch differs from the one proposed by Dave in that it only adds the NF_DONT_MOVE flag to copy_move instructions that are created by split_phi* and that are located in loops. Fixes piglit: tests/shaders/ssa/fs-while-loop-rotate-value.shader_test (no regressions in the shader set). It also fixes all failing tests from dEQP-GLES3.functional.shaders.loops.* Signed-off-by: Gert Wollny <gw.fossdev@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-06-25 05:39:41 +01:00
Rob Clark	1977e92ee3	freedreno/ir3: fix deref conversion fallout Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-06-23 18:23:11 -04:00
Rob Clark	445871de94	freedreno/ir3: fix unused variable warning Fixes: `cf0c7258ee` freedreno/a5xx: MSAA Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-06-23 18:23:11 -04:00
Rob Clark	868ca81cbe	freedreno: fix HW_ATOMIC_COUNTERS cap This was mistakenly exposed, even though we want atomic counters to be lowered to atomic ops on an SSBO like nearly every other GPU. Which somehow recently started getting segfaults due to calling a null pipe->set_hw_atomic_buffers(). Fixes a crash in stk, and probably other things. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-06-23 18:23:11 -04:00
Jason Ekstrand	a331d7d1cd	nir: Remove old-school deref chain support Acked-by: Rob Clark <robdclark@gmail.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Acked-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-06-22 21:23:06 -07:00
Rob Clark	2db8784167	freedreno/ir3: convert to deref instructions Signed-off-by: Rob Clark <robdclark@gmail.com> Acked-by: Rob Clark <robdclark@gmail.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Acked-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-06-22 21:23:05 -07:00
Jason Ekstrand	5a02ffb733	nir: Rework lower_locals_to_regs to use deref instructions This completely reworks the pass to support deref instructions and delete support for old deref chains Acked-by: Rob Clark <robdclark@gmail.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Acked-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-06-22 20:54:00 -07:00
Jason Ekstrand	2fa7a4a541	intel,ir3: Re-enable nir_opt_copy_prop_vars Now that it's rewritten for deref instructions, we can turn it back on. Acked-by: Rob Clark <robdclark@gmail.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Acked-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-06-22 20:54:00 -07:00
Bas Nieuwenhuizen	67df3739c5	radeonsi: Remove deref chain support in nir scan pass. Acked-by: Rob Clark <robdclark@gmail.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Acked-by: Dave Airlie <airlied@redhat.com>	2018-06-22 20:54:00 -07:00
Bas Nieuwenhuizen	9bfd81b217	radeonsi: Add deref support to the nir scan pass. Acked-by: Rob Clark <robdclark@gmail.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Acked-by: Dave Airlie <airlied@redhat.com>	2018-06-22 20:54:00 -07:00
Jason Ekstrand	6bdd867968	nir: Delete lower_io_types It's only used by the ir3 stand-alone compiler and Rob said we could delete it. Acked-by: Rob Clark <robdclark@gmail.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Acked-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-06-22 20:54:00 -07:00
Eric Anholt	e1f0a1b029	broadcom/vc4: Remove deref chain support from nir_lower_txf_ms. Acked-by: Rob Clark <robdclark@gmail.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Acked-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-06-22 20:54:00 -07:00
Rob Clark	3d19f116ad	st,ir3,radeonsi: push lower_deref_instrs back into driver vc4+vc5 is not really effected by the deref chain to deref instr conversion, so it no longer needs this pass. For others, now that all the passes mesa/st uses are using deref instructions, push the lowering to deref chains back into driver. Signed-off-by: Rob Clark <robdclark@gmail.com> Acked-by: Rob Clark <robdclark@gmail.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Acked-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-06-22 20:54:00 -07:00
Rob Clark	3e8879be5c	nir/lower_samplers: remove legacy version Signed-off-by: Rob Clark <robdclark@gmail.com> Acked-by: Rob Clark <robdclark@gmail.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Acked-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-06-22 20:54:00 -07:00
Rob Clark	a6ebbbc594	nir/lower_samplers: split out _legacy version for deref chains To simplify the transition, and make things bisectable, split out a legacy copy or lower_samplers. This way the i965 and gallium drivers can independently switch over to deref instructions. Since the lower_samplers_as_deref pass is only used by gallium drivers, it can be converted in lock-step with moving the lower_deref_instrs pass, and so does not need a corresponding _legacy clone. This legacy pass will be removed in a future commit. Signed-off-by: Rob Clark <robdclark@gmail.com> Acked-by: Rob Clark <robdclark@gmail.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Acked-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-06-22 20:54:00 -07:00
Jason Ekstrand	d7d5aab45b	intel,ir3: Disable nir_opt_copy_prop_vars This pass doesn't handle deref instructions yet. Making it handle both legacy derefs and deref instructions would be painful. Since it's not important for correctness, just disable it for now. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Acked-by: Rob Clark <robdclark@gmail.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Acked-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-06-22 20:15:56 -07:00
Rob Clark	f03a33a19a	ttn: convert to deref instructions Signed-off-by: Rob Clark <robdclark@gmail.com> Acked-by: Rob Clark <robdclark@gmail.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Acked-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-06-22 20:15:54 -07:00
Jason Ekstrand	74212c2414	anv,i965,radv,st,ir3: Call nir_lower_deref_instrs This inserts a call to nir_lower_deref_instrs at every call site of glsl_to_nir, spirv_to_nir, and prog_to_nir. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Acked-by: Rob Clark <robdclark@gmail.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Acked-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-06-22 20:15:54 -07:00
Karol Herbst	133e8bf4de	nv50/ir: only avoid spilling constrained def if a mov is added fix spilling regression introduced by `5428066f5e` this is just a minor mistake done while moving the code out into a new function. The function contained a loop which might have been terminated earlier and skipped setting noSpill to 1. After the refactoring it was always set. Fixes: `5428066f5e` ("nv50/ir: make a copy of tex src if it's referenced multiple times") Signed-off-by: Karol Herbst <kherbst@redhat.com>	2018-06-23 03:00:24 +02:00
Jonathan Marek	94bc06b196	freedreno: a2xx: fix clear color the format of the CLEAR_COLOR register doesn't depend on the target format this fixes clear color when rendering to 32-bit RGBA and 16-bit targets Signed-off-by: Jonathan Marek <jonathan@marek.ca> Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-06-22 08:23:10 -04:00
Jonathan Marek	dd8553dd95	freedreno: a2xx: fix crash when freeing context Signed-off-by: Jonathan Marek <jonathan@marek.ca> Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-06-22 08:23:10 -04:00
Jonathan Marek	6eeac34cee	freedreno: a2xx: fix crash on first clear blend can be NULL, so check for that Signed-off-by: Jonathan Marek <jonathan@marek.ca> Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-06-22 08:23:10 -04:00
Jonathan Marek	17e16ba9db	freedreno: add a20x this patch adds support for a20x, which has some differences with a220: -no VGT_MAX_VTX_INDX register -no CLEAR_COLOR register -set RB_BC_CONTROL in restore (hangs without) -different CP_DRAW_INDX format tested with kmscube and glmark2 scenes, on par with a220 Signed-off-by: Jonathan Marek <jonathan@marek.ca> Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-06-22 08:23:10 -04:00
Jonathan Marek	d5ff36b97b	freedreno: a2xx: increase size of the offset field in instr_fetch_vtx_t The offset field is 22 bit large. 11 bits are necessary because MaxVertexAttribRelativeOffset = 2047 Signed-off-by: Jonathan Marek <jonathan@marek.ca> Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-06-22 08:23:10 -04:00
Eric Anholt	69ae42ca4c	v3d: Don't forget to initialize the buffer offset of a new winsys handle.	2018-06-21 15:56:18 -07:00
Marek Olšák	9410cd53c3	radeonsi: fix occlusion queries with 16x AA without FBO attachments on Stoney Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-06-21 14:42:14 -04:00
Marek Olšák	9c21002f6e	radeonsi: handle non-clearable DCC buffers as MSAA resolve dst This is reproducible on Stoney, but other chips may be affected too. Cc 18.1 <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-06-21 14:42:14 -04:00
Marek Olšák	587e712eda	radeonsi: disable DCC MSAA for 128bpp formats on Stoney Cc: 18.1 <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-06-21 14:42:14 -04:00
Rob Clark	cf0c7258ee	freedreno/a5xx: MSAA Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-06-21 08:54:47 -04:00
Rob Clark	b6e690ef80	freedreno: update generated headers Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-06-21 08:54:47 -04:00
Rob Clark	418b3fd184	freedreno/ir3: txf_ms support Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-06-21 08:54:47 -04:00
Rob Clark	d03bd103f8	freedreno/a5xx: fix gpu hangs with large compute shaders Similar to the combined limit for VS+FS, there is an upper limit for shader size to run from internel memory. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-06-21 08:54:47 -04:00
Rob Clark	e1e40935b4	freedreno/ir3: fix base_vertex Fixes: `c366f422f0` nir: Offset vertex_id by first_vertex instead of base_vertex Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-06-21 08:54:47 -04:00
Emil Velikov	1c1f70d12f	st/dri: constify dri_fill_st_visual's screen As the function says - only the visual is changed. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-06-21 12:09:39 +01:00
Juan A. Suarez Romero	d24839be70	swr: bump minimum supported LLVM version to 5.0 RADV now requires LLVM 5.0 or greater, and thus we can't build dist tarball because swr requires LLVM 4.0. Let's bump required LLVM to 5.0 in swr too. Fixes: `f9eb1ef870` ("amd: remove support for LLVM 4.0") Cc: Tim Rowley <timothy.o.rowley@intel.com> Cc: Emil Velikov <emil.velikov@collabora.com> Cc: Dylan Baker <dylan@pnwbakers.com> Cc: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Acked-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-06-21 12:16:46 +02:00
Grazvydas Ignotas	f966929805	radeonsi: add a debug flag to zero vram allocations This allows to avoid having to see garbage in Dying Light loading screen at least, which probably expects Windows/NV behavior of all allocations being zeroed by default. Analogous to radv flag with the same name. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-06-21 12:18:50 +03:00
Grazvydas Ignotas	4e0d93dc0e	radeonsi: use shifts for sign extension Avoids a branch and reduces code size a tiny bit: text data bss dec hex filename 10804563 398653 2070368 13273584 ca89f0 /tmp/radeonsi_dri.so.old 10804499 398653 2070368 13273520 ca89b0 /tmp/radeonsi_dri.so Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-06-21 12:17:34 +03:00
Roland Scheidegger	53959fcbd8	r600: fix copy/paste bug for sampleMaskIn workaround The sampleMaskIn workaround (`b936f4d1ca`) tries to figure out if the shader is running at per-sample frequency, but there's a typo bug so it will only recognize per-sample linar inputs, not per-sample perspective ones. Spotted by Eric Engestrom <eric.engestrom@intel.com> Fixes: b936f4d1ca0d2ab1e828a "r600: partly fix sampleMaskIn value"	2018-06-21 02:37:11 +02:00
Eric Anholt	edb7890750	v3d: Fix min vs mag determination when not doing mip filtering. Fixes all 128 failing tests in dEQP-GLES3.functional.texture.filtering.*.combinations	2018-06-20 12:31:54 -07:00
Eric Anholt	94f7c011d6	v3d: Track write reference to the separate stencil buffer. Otherwise, a blit from separate stencil may fail to flush the job that initialized it, or new drawing could fail to flush a blit reading from stencil. Fixes: dEQP-GLES3.functional.fbo.blit.depth_stencil.depth32f_stencil8_basic dEQP-GLES3.functional.fbo.blit.depth_stencil.depth32f_stencil8_scale dEQP-GLES3.functional.fbo.blit.depth_stencil.depth32f_stencil8_stencil_only dEQP-GLES3.functional.fbo.msaa.2_samples.depth32f_stencil8 dEQP-GLES3.functional.fbo.msaa.4_samples.depth32f_stencil8	2018-06-20 09:30:46 -07:00
Eric Anholt	a52c357a65	v3d: Add missing reference to the separate stencil buffer. Noticed while debugging a missing flush of rendering in the z32f_s8 case.	2018-06-20 09:30:46 -07:00

1 2 3 4 5 ...

34661 commits