fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-25 10:28:11 +02:00

Author	SHA1	Message	Date
Iago Toral Quiroga	9b96ae69bc	v3d: don't emit point coordinates varyings if the FS doesn't read them We still need to emit them in V3D 3.x since there there is no mechanism to disable them. Reviewed-by: Eric Anholt <eric@anholt.net>	2019-06-07 08:29:42 +02:00
Mark Janes	04dac69752	tests/graw: use C99 print conversion specifier for 32 bit builds Fixes formatting errors for 32 bit compilations, eg: error: format specifies type 'unsigned long' but the argument has type 'uint64_t' (aka 'unsigned long long') [-Werror,-Wformat] printf("result1 = %lu result2 = %lu\n", res1.u64, res2.u64); Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-06-06 14:39:41 -07:00
Alyssa Rosenzweig	30adeb7a53	panfrost/midgard: Fix crash with unused SSA values Crash introduced in "b38dab101ca7e0896255dccbd85fd510c47d84d1" but not adding a Fixes tag since it's our bug anyway. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-06 13:44:27 -07:00
Boris Brezillon	3d661a4ef9	panfrost: Report sRGB colorspace as not supported The driver does not support sRGB yet, so let's report it as unsupported. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-06 13:41:54 -07:00
Connor Abbott	1d55b0da59	radeonsi: Don't force dcc disable for loads When `e9d935ed0e` added force_dcc_off(), we forced it off for any preloaded image descriptor which had stores associated with them, since the same preloaded descriptors were used for loads and stores. However, when the preloading was removed in `16be87c904`, the existing logic was kept despite it not being necessary anymore. The comment above force_dcc_off() only mentions stores, so only force DCC off for stores. Cc: Nicolai Hähnle <nicolai.haehnle@amd.com> Cc: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-06-06 17:14:28 +02:00
Gert Wollny	8657257a6e	virgl: Enable CAP_CLIP_HALFZ if host supports it On according hosts this enables the piglits as "pass": arb_clip_control-* v2: sync flag with host Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> (v1) Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2019-06-06 12:24:53 +02:00
Charmaine Lee	f29b8fde91	svga: Remove unnecessary check for the pre flush bit for setting vertex buffers This fixes the missing rebind when the can_pre_flush bit is not set and the vertex buffers are the same as what have been sent. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Neha Bhende <bhenden@vmware.com> Signed-off-by: Charmaine Lee <charmainel@vmware.com> Signed-off-by: Thomas Hellstrom <thellstrom@vmware.com>	2019-06-06 10:27:10 +02:00
Deepak Rawat	72fc886826	winsys/svga/drm: Fix 32-bit RPCI send message Depending on whether compiled with frame-pointer or not, the temporary memory location used for the bp parameter in these macros are referenced relative to the stack pointer or the frame pointer. Hence we can never reference that parameter when we've modified either the stack pointer or the frame pointer, because then the compiler would generate an incorrect stack reference. Fix this by pushing the temporary memory parameter on a known location on the stack before modifying the stack- and frame pointers. Also in case of failuire RPCI channel is not closed which lead to vmx running out of channels. Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Deepak Rawat <drawat@vmware.com> Reviewed-by: Sinclair Yeh <syeh@vmware.com> Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com> Signed-off-by: Thomas Hellstrom <thellstrom@vmware.com>	2019-06-06 10:27:10 +02:00
Vasily Khoruzhick	b412e05751	lima/ppir: add missing handling of min/max ops for vec4 add slot Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com>	2019-06-06 04:30:36 +00:00
Vasily Khoruzhick	5980565a37	lima/ppir: fix crash when program uses no registers at all Program may need no regalloc at all, e.g. in case when program consists of single discard op. Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com>	2019-06-06 04:30:36 +00:00
Kenneth Graunke	f4d4c42608	radeonsi: Enable NIR's lower_fmod option. Currently, st/mesa is always calling the GLSL IR lower_instructions() pass with MOD_TO_FLOOR set, so mod operations will be lowered before ever reaching NIR. This enables the same lowering at the NIR level, which will let me shut off the GLSL IR path for NIR-based drivers. The AMD NIR backend also has code to handle fmod, so we could potentially skip this and still be fine. I don't have an opinion on that. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-06-05 16:45:12 -07:00
Kenneth Graunke	e0641e0728	vc4: Enable NIR's lower_fmod option. Currently, st/mesa is always calling the GLSL IR lower_instructions() pass with MOD_TO_FLOOR set, so mod operations will be lowered before ever reaching NIR. This enables the same lowering at the NIR level, which will let me shut off the GLSL IR path for NIR-based drivers. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Eric Anholt <eric@anholt.net>	2019-06-05 16:45:12 -07:00
Kenneth Graunke	c7d1b52a2c	nir: Combine lower_fmod16/32 back into a single lower_fmod. We originally had a single lower_fmod option. In commit `2ab2d2e5`, Sam split 32 and 64-bit lowering into separate flags, with the rationale that some drivers might want different options there. This left 16-bit unhandled, so Iago added a lower_fmod16 option in commit `ca31df6f`. Now that lower_fmod64 is gone (in favor of nir_lower_doubles and nir_lower_dmod), we re-combine lower_fmod16 and lower_fmod32 into a single lower_fmod flag again. I'm not aware of any hardware which need lowering for one bitsize and not the other. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-06-05 16:45:12 -07:00
Kenneth Graunke	dfb18f0a28	panfrost: Switch to nir_lower_doubles instead of lower_fmod64. I don't think panfrost actually does doubles yet, but it at least claims to support PIPE_CAP_DOUBLES, so at least pretend to switch to the new lowering. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-05 16:45:12 -07:00
Kenneth Graunke	d13059f4d5	nouveau: Use nir_lower_doubles instead of lower_fmod64 on nvc0. We currently have two duplicate mechanisms for lowering fmod@64. One is a nir_opt_algebraic rule keyed off of options->lower_fmod64, and the other is nir_lower_doubles, which offers a full gamut of fp64 lowering. The latter works slightly better in some corner cases, so I'm trying to eliminate lower_fmod64 and drop the redundancy. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-06-05 16:45:12 -07:00
Kenneth Graunke	fa56a3795f	gallium: Drop lower_fmod64 from drivers that don't support doubles. Neither freedreno nor nv50 expose PIPE_CAP_DOUBLES, so there's no fmod64 to be lowered. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-06-05 16:45:12 -07:00
Alyssa Rosenzweig	905d914cb6	panfrost/midgard: Verify SSA claims when pipelining The pipeline register creation algorithm is only valid for SSA indices; NIR registers and such cannot be pipelined without more complex analysis. However, there are the ocassional class of "liars" -- indices that claim to be SSA but are not. This occurs in the blend shader prologue, for example. Detect this and just bail quietly for now. Eventually we need to rewrite the blend shader prologue to occur in NIR anyway (which would mitigate the issue), but that's more involved and depends on a better understanding of pixel formats in blend shaders (for non-RGBA8888/UNORM cases). Fixes some blend shader regressions. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-05 14:40:08 -07:00
Alyssa Rosenzweig	dcd12aad46	panfrost/midgard: Don't assign var locations ourselves This piece of code was cargo-culted from the ir3 standalone compiler and made sense when we were a standalone compiler ourselves. Unfortunately, for the online compiler, mesa/st already handles this for us and if we duplicate it here, we're duplicating it incorrectly. So just delete these lines and fix a heck of a lot of tests. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-05 14:40:08 -07:00
Tomeu Vizoso	de5c882973	panfrost: Reload framebuffer contents if there's no clear If by flush time the client hasn't submitted a clear, add jobs for reloading the framebuffer contents as the first draw in the frame. This is required by programs such as Weston which don't do clears and rely on the previous contents of the framebuffer being there. Reloading the whole framebuffer on every frame without regards to what is needed or what is going to be covered is very inefficient, but future work will introduce support for damage regions and partial updates so we know what needs to be actually reloaded. Fixes quite a few tests in dEQP-EGL.functional.buffer_age.*. [Alyssa: The context is that tilers do an implicit glClear() on every frame, whether you asked them to or not. If you want a clear, this is very efficient. But if you don't, you have to explicitly blit the backbuffer back into tile memory, accomplished by a dummy texturing draw. This patch generates that draw via u_blitter, although we could do a bit better ourselves by eliding the vertex job. This fixes "black rectangles in Weston/sway" as well as "video not displaying when UI visible in mpv"] Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-05 14:35:48 -07:00
Alyssa Rosenzweig	2adf35e4f5	panfrost: Don't flip scanout The mesa/st flips the viewport, so we respect that rather than trying to flip the framebuffer itself and ignoring the viewport and using a messy heuristic. However, this brings an underlying disagreement about the interpretation of winding order to light. The blob uses a different strategy than Mesa for handling viewport Y flipping, so the meanings of the winding order bit are flipped for it. To keep things clean on our end, we rename to explicitly use Gallium (rather than flipped OpenGL) conventions. Fixes upside-down Xwayland/egl windows. v2: Adjust lowering configuration to correctly flip gl_PointCoord.y and gl_FragCoord.y. v1 was R-b'd by Tomeu, but then retracted due to these regressions which are not fixed. Suggested-by: Rob Clark <robdclark@chromium.org> Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Sort-of-reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com>	2019-06-05 14:35:48 -07:00
Timur Kristóf	c94b70a178	st/nine: Use tgsi_to_nir when preferred IR is NIR. This patch allows nine to read the preferred IR from pipe caps and use NIR when that is preferred by the driver, by calling tgsi_to_nir. Also adds some debug options that allow overriding it. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Axel Davy <davyaxel0@gmail.com>	2019-06-05 23:32:13 +02:00
Jason Ekstrand	bb67a99a2d	intel/nir: Stop returning the shader from helpers Now that NIR_TEST_* doesn't swap the shader out from under us, it's sufficient to just modify the shader rather than having to return in case we're testing serialization or cloning. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-06-05 20:07:28 +00:00
Caio Marcelo de Oliveira Filho	747926ddfb	iris: Only recompile CS when needed Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com>	2019-06-05 12:57:54 -07:00
Kristian H. Kristensen	3da9a24f35	freedreno/a6xx: Use VALIDREG in next_regid() helper Reviewed-by: Rob Clark <robdclark@gmail.com>	2019-06-05 11:15:04 -07:00
Kristian H. Kristensen	6fffc091e2	freedreno/a6xx: Remove dead code from a5xx Reviewed-by: Rob Clark <robdclark@gmail.com>	2019-06-05 11:15:04 -07:00
Alyssa Rosenzweig	1ea987576d	panfrost/midgard: Always break up fragment writeout In a fragment shader, r0 is written out with a special branch sequence. r0 is not a real register here, but essentially a pipeline register -- as such, it needs to be written out in full and on time, with hanging dependencies in the bundle. Otherwise, we break up the bundle, which costs an extra ALU cycle and adds a move. When the scheduler ran last thing, we could do this analysis within the scheduler. Now that RA can run after scheduling, that's no longer valid, so we remove the analysis and always break it up (at a performance penalty). Future work can add a post-RA/post-schedule pass to merge writeout blocks if possible. It's a bit of a low-priority next to fixing conformance regressions, of course. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-05 18:06:49 +00:00
Alyssa Rosenzweig	3d11b075f0	panfrost/midgard: Fix cubemap regression Fixes: `2d9802233` ("panfrost/midgard: Extend RA to non-vec4 sources") Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-05 18:06:48 +00:00
Deepak Rawat	828e1b0b4c	winsys/drm: Fix out of scope variable usage In this particular instance, struct member were used outside of the block where it was defined. Fix this by moving the definition outside of block. Signed-off-by: Deepak Rawat <drawat@vmware.com> Fixes: `569f838987` ("winsys/svga: Add support for new surface ioctl, multisample pattern") Reviewed-by: Brian Paul <brianp@vmware.com>	2019-06-02 22:31:07 -07:00
Alyssa Rosenzweig	c51312bc94	panfrost/midgard: Lower integer division We use the shared nir_lower_idiv pass to lower integer division, fixing 144 dEQP tests. This pass was not applied in the past due to breakage from iabs fixed earlier in the series. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-By: Ryan Houdek <Sonicadvance1@gmail.com>	2019-06-05 17:59:27 +00:00
Alyssa Rosenzweig	88c59798fe	panfrost/midgard: Fix 1-arg ALU memory corruption Certain ops that only take one argument have an imaginary "zero" constant for their second argument. For instance, conversions: i2f [dest], [source], #0 Memory corruption meant that #0 was instead random noise. For some ops, that doesn't matter (manifested as abnormally large code size and poor scheduling due to extra constants in random places). But for others, where a 1-op is emulated by a 2-op with an implicit 0 second argument, that broke things. Fixes iabs (emulated by iabsdiff). Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-By: Ryan Houdek <Sonicadvance1@gmail.com>	2019-06-05 17:59:24 +00:00
Alyssa Rosenzweig	9f14e20fa1	panfrost/midgard: Add a bunch of new ALU ops These ops are used to accelerate various functions exposed in OpenCL. This commit only includes the routine additions to the table. They are not wired through the compiler; rather, they are just here to keep a reference for the disassembler. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-By: Ryan Houdek <Sonicadvance1@gmail.com>	2019-06-05 17:58:14 +00:00
Krzysztof Raszkowski	4ff02b3edd	swr: fix support for GL_ARB_copy_image extension This commit fix support and adjusts the capabilities returned by the SWR driver and the documentation to correctly report the GL_ARB_copy_image extension. Reviewed-by: Alok Hota <alok.hota@intel.com>	2019-06-05 15:26:47 +00:00
Guido Günther	b921df352d	build: Build etnaviv drm Signed-off-by: Guido Günther <guido.gunther@puri.sm> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com>	2019-06-05 08:58:05 +00:00
Guido Günther	3696235f82	etnaviv: gallium: Use internal etnaviv_drmif.h Signed-off-by: Guido Günther <guido.gunther@puri.sm> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com>	2019-06-05 08:58:05 +00:00
Guido Günther	3835e21369	etnaviv: untabify Two driver files had tabs mixed with spaces. Remove the tabs. Signed-off-by: Guido Günther <guido.gunther@puri.sm> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com>	2019-06-05 08:58:05 +00:00
Tomeu Vizoso	c7a6e07454	panfrost: bifrost: Fix format string in disassembler The compiler configuration was hardened to fail on format warnings and things stopped building. Fixes: `c9c1e26106` ("mesa: prevent common string formatting security issues") Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-By: Ryan Houdek <Sonicadvance1@gmail.com>	2019-06-05 10:40:19 +02:00
Kenneth Graunke	8d4f68ee20	iris: Free the buffer when reading from the disk cache.	2019-06-04 23:53:57 -07:00
Alyssa Rosenzweig	bfa9f56a2a	panfrost/midgard: Don't promote non-SSA to pipeline registers Fixes: `33800f4612` ("panfrost/midgard: Implement "pipeline register" prepass") Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-05 00:12:36 +00:00
Eric Anholt	36cb209787	freedreno: Drop invalid scissor optimization. We do support TF now, so it's no longer valid. Besides, if we want this optimization, we should probably have mesa/st doing it right for everyone. Reviewed-by: Rob Clark <robdclark@gmail.com>	2019-06-04 16:44:37 -07:00
Chia-I Wu	65439291a0	virgl: resolve to correct level during texture read When PIPE_TRANSFER_READ requires a resolve, we blit from the host storage to a temporary storage, and do a format conversion from the temporary storage to the guest storage. This change makes sure we convert to the correct level of the guest storage. Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Alexandros Frantzis <alexandros.frantzis@collabora.com>	2019-06-04 21:37:03 +00:00
Chia-I Wu	067018d4e7	virgl: fix texture resolving with compressed formats util_format_translate_3d expects the source box to be aligned to the block size. When resolving, make sure the size of the staging buffer is aligned to the block size. Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Alexandros Frantzis <alexandros.frantzis@collabora.com>	2019-06-04 21:37:03 +00:00
Bas Nieuwenhuizen	a6a5a6f67f	freedreno: Add printf pattern string. Some new flag setting disallows it due to being a security risk. Fixes: `c9c1e26106` "mesa: prevent common string formatting security issues" Reviewed-by: Rob Clark <robdclark@gmail.com>	2019-06-04 23:20:50 +02:00
Bas Nieuwenhuizen	6256925b11	Revert "vl: Enable DRM by default." Reason: meson.build:586:7: ERROR: Unknown variable "dep_libdrm". if building without x11 platform. This reverts commit `392c60928a`.	2019-06-04 23:14:56 +02:00
Alyssa Rosenzweig	4a03d37827	panfrost/midgard: .pos propagation A previous optimization converts fmax(x, 0.0) instructions to fmov.pos. This pass then propagates the .pos from the move up to the source instruction (when possible). From there, copy propagation will eliminate the move. In the future, we might prefer to do this in common NIR code like we do for saturate, as Bifrost can also benefit. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Ryan Houdek <Sonicadvance1@gmail.com>	2019-06-04 20:14:50 +00:00
Alyssa Rosenzweig	5da0a33fab	panfrost/midgard: Cleanup copy propagation Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Ryan Houdek <Sonicadvance1@gmail.com>	2019-06-04 20:14:50 +00:00
Alyssa Rosenzweig	33800f4612	panfrost/midgard: Implement "pipeline register" prepass This prepass, run after scheduling but before RA, specializes to pipeline registers where possible. It walks the IR, checking whether sources are ever used outside of the immediate bundle in which they are written. If they are not, they are rewritten to a pipeline register (r24 or r25), valid only within the bundle itself. This has theoretical benefits for power consumption and register pressure (and performance by extension). While this is tested to work, it's not clear how much of a win it really is, especially without an out-of-order scheduler (yet!). Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Ryan Houdek <Sonicadvance1@gmail.com>	2019-06-04 20:14:50 +00:00
Alyssa Rosenzweig	2a79afc5f0	panfrost/midgard: Helpers for pipeline Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Ryan Houdek <Sonicadvance1@gmail.com>	2019-06-04 20:14:50 +00:00
Alyssa Rosenzweig	3c7abbfbe8	panfrost/midgard: Refactor schedule/emit pipeline First, this moves the scheduler and emitter out of midgard_compile.c into their own dedicated files. More interestingly, this slims down midgard_bundle to be essentially an array of _pointers_ to midgard_instructions (plus some bundling metadata), rather than the instructions and packing themselves. The difference is critical, as it means that (within reason, i.e. as long as it doesn't affect the schedule) midgard_instrucitons can now be modified _after_ scheduling while having changes updated in the final binary. On a more philosophical level, this removes an IR. Previously, the IR before scheduling (MIR) was separate from the IR after scheduling (post-schedule MIR), requiring a separate set of utilities to traverse, using different idioms. There was no good reason for this, and it restricts our flexibility with the RA. So unify all the things! Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Ryan Houdek <Sonicadvance1@gmail.com>	2019-06-04 20:14:50 +00:00
Alyssa Rosenzweig	0524ab9c37	panfrost/midgard: Cleanup RA (stylistic changes) Trivial. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Ryan Houdek <Sonicadvance1@gmail.com>	2019-06-04 20:14:50 +00:00
Alyssa Rosenzweig	debc29b9ad	panfrost/midgard: Share MIR utilities These are more generally useful than the files they were constrained to. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Ryan Houdek <Sonicadvance1@gmail.com>	2019-06-04 20:14:50 +00:00

1 2 3 4 5 ...

38165 commits