fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-09 11:08:03 +02:00

Author	SHA1	Message	Date
Constantine Charlamov	abc7b110b6	r600g: fix crash when file in R600_TRACE doesn't exist …and print error in such case. Which probably is not a rare event btw because fopen doesn't expand ~ to $HOME. Also get rid of unused "bool ret" variable. Signed-off-by: Constantine Kharlamov <Hi-Angel@yandex.ru> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-26 17:39:54 +10:00
Constantine Charlamov	3d466f3e9f	r600g: take into account offset to system inputs at tgsi_interp_egcm() Fixes: https://bugs.freedesktop.org/show_bug.cgi?id=100785 v2: I was too much twiddling whether to initialize nsys_inputs at the beginning of shader initialization or for allocation of system values, and by the time I decided to go with the first one, I forgot to change it back. Signed-off-by: Constantine Kharlamov <Hi-Angel@yandex.ru> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-26 16:32:36 +10:00
Constantine Charlamov	469e2ed473	r600g: get rid of trailing whitespace Signed-off-by: Constantine Kharlamov <Hi-Angel@yandex.ru> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-26 16:30:10 +10:00
Dave Airlie	27380d6b3e	r600/asm: add support for other GDS operations. This adds support for the GDS operations needed to do atomic counters. Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-26 16:27:51 +10:00
Dave Airlie	ccab3f7e1b	r600: don't merge GDS into VTX We don't want vtx/tex instructions ending up in GDS sections. Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-26 16:23:21 +10:00
Dave Airlie	043f16eba1	r600: for memory instructions dump index gpr for read indirects also. This just makes sure we can see the index gpr in the asm dumps. Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-26 16:23:21 +10:00
Dave Airlie	ac8fb9800a	r600: add support for vertex fetches via texture cache On evergreen we can route vertex fetches via the texture cache, and this is required for some images support. So add support to the asm builder for it. Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-26 16:23:20 +10:00
Dave Airlie	b050b91e33	r600: route indirect address register correctly for vtx fetches. This was found during writing the images code, we need to make sure we route the correct index register. Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-26 16:23:20 +10:00
Dave Airlie	4a34f3244a	radv/meta: don't need vertex info for resolve shader. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-26 01:24:10 +01:00
Marek Olšák	0715b3c2ee	drirc: whitelist glthread for a few games Performance deltas: Alien Isolation: +17% (it varies depending on the location) Borderlands 2: +50% (it varies depending on the location) BioShock Infinite: +76% (benchmark) Civilization 6: +20% (benchmark) Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-06-26 02:17:03 +02:00
Marek Olšák	4f38b48e05	mesa/glthread: decrease the batch size for better perf scaling This is the key to better performance. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-06-26 02:17:03 +02:00
Marek Olšák	09f6915bf8	gallium/hud: add glthread counters Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-06-26 02:17:03 +02:00
Marek Olšák	8f4bc8a324	gallium/hud: add API-thread-busy for monitoring the thread load Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-06-26 02:17:03 +02:00
Marek Olšák	11cf079b67	gallium/hud: add hud_pane::hud pointer for later use Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-06-26 02:17:03 +02:00
Marek Olšák	5fa69be3c8	mesa/glthread: add glthread "perf" counters and pass them to gallium HUD for HUD integration in following commits. This valuable profiling data will allow us to see on the HUD how well glthread is able to utilize parallelism. This is better than benchmarking, because you can see exactly what's happening and you don't have to be CPU-bound. u_threaded_context has the same counters. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-06-26 02:17:03 +02:00
Marek Olšák	833f3c1c31	gallium/hud: move struct hud_context to hud_private.h Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-06-26 02:17:03 +02:00
Marek Olšák	7492201c4e	gallium/hud: rename API-thread-busy to main-thread-busy Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-06-26 02:17:03 +02:00
Marek Olšák	d1513edaa0	mesa/glthread: switch to u_queue and redesign the batch management This mirrors exactly how u_threaded_context works. If you understand this, you also understand u_threaded_context. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-06-26 02:17:03 +02:00
Marek Olšák	1e37a5054b	mesa/glthread: remove HAVE_PTHREAD guards we are switching to util_queue. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-06-26 02:17:03 +02:00
Marek Olšák	6884c95ab4	util: move pipe_thread_is_self from gallium to src/util Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-06-26 02:17:03 +02:00
Bas Nieuwenhuizen	78bef01da2	radv: Remove unused args of radv_image_view_init. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-06-26 01:24:50 +02:00
Bas Nieuwenhuizen	789f480029	radv: Use correct image layout for blit based copies. v2: Don't pass layout to image view usage mask. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Fixes: `0628580eff` "radv: Specify semantics of HTILE layout helpers."	2017-06-26 01:24:29 +02:00
Grigori Goronzy	95fb1c187a	mesa/marshal: add custom marshalling for glNamedBuffer(Sub)Data These entry points are used by Alien Isolation and caused synchronization with glthread. The async marshalling implementation is similar to glBuffer(Sub)Data. However unlike Buffer(Sub)Data we don't need to worry about EXTERNAL_VIRTUAL_MEMORY_BUFFER_AMD, as this isn't applicable to these DSA variants. Results in an approximately 6x drop in glthread synchronizations and a ~30% FPS jump in Alien Isolation (Medium preset, Athlon 860K, RX 480). Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-06-26 09:06:23 +10:00
Dave Airlie	6a68170c83	radv: handle primitive id input into fragment shader with no geom shader Fixes: dEQP-VK.pipeline.framebuffer_attachment.no_attachments dEQP-VK.pipeline.framebuffer_attachment.no_attachments_ms Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-26 08:45:30 +10:00
Dave Airlie	2a87ddbdcb	radv: compile fragment shader first. This reorders things as we need something from the fs for the vs key. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-26 08:45:26 +10:00
Dave Airlie	a563f611c3	radv: set prim_id for geometry shaders Noticed in passing. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-26 08:45:22 +10:00
Dave Airlie	4042892cee	radv: set use_prim_id for tess shaders correctly. Just noticed in passing. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-26 08:45:14 +10:00
Pierre Moreau	afb8f2d4a3	nv50/ir: Properly fold constants in SPLIT operation Fixes: `b7d9677d` ("nv50/ir: constant fold OP_SPLIT") Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Pierre Moreau <pierre.morrow@free.fr> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2017-06-25 15:23:46 +02:00
Marek Olšák	e25950808f	radeonsi/gfx9: don't overallocate shader binaries It's not needed. The hw doesn't fetch ahead over page boundaries. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-06-24 23:04:37 +02:00
Lucas Stach	d6b9ba36a4	st/dri2: implement image offset query This trivially adds support for the image offset query, which is needed for the zwp_linux_dmabuf based EGL platform wayland implementation. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Daniel Stone <daniels@collabora.com>	2017-06-24 16:57:55 +01:00
Samuel Pitoiset	cb577e379e	mesa: only flush vertices when the viewport is different This prevents glViewport() and friends to always flush and trigger _NEW_VIEWPORT. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-06-24 16:47:43 +02:00
Samuel Pitoiset	4178cea06d	mesa: remove useless comments in the viewport code path No need to explain why calling a driver callback is needed. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-06-24 16:47:38 +02:00
Roland Scheidegger	8bfe451ed3	llvmpipe: initialize default fb correctly in setup If lp_setup_bind_framebuffer() is never called, then setup fb x1/y1 was not correctly initialized. This can happen if there's never a fb set - both cso and llvmpipe would consider setting this with no cbufs and no zsbuf a redundant change and therefore it would never get set. We rely on this setup fb rect being initialized correctly for the tri intersect tests, throwing away tris which don't intersect. Not initializing it meant we'd then say it intersected, and we'd try to bin that despite that we have no actual tiles to bin it to, leading to assertion failures (pretty harmless since tile 0/0 always exists nevertheless as tiles are statically allocated, albeit that should change at some point). (Note probably not an issue with gl state tracker) Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2017-06-24 00:18:43 +02:00
Jason Ekstrand	f7f2fa8eb1	i965/miptree: Rework aux enabling This commit replaces the complex and confusing set of disable flags with two fairly straightforward fields which describe the intended auxiliary surface usage and whether or not the miptree supports fast clears. Right now, supports_fast_clear can be entirely derived from aux_usage but that will not always be the case. This commit makes functional changes. One of these changes is that it re-enables multisampled fast-clears which were accidentally disabled in `cec30a6669` around a year ago. Fixing this improves the SynMark v7 DeferredAA test by around ~3% on some gen9 hardware. This commit also gets us closer to enabling CCS_E for window-system buffers which are Y-tiled. Reviewed-by: Chad Versace <chadversary@chromium.org> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2017-06-23 12:30:24 -07:00
Jason Ekstrand	f1fa4be871	i965: Clamp clear colors to the representable range Starting with Sky Lake, we can clear to arbitrary floats or integers. Unfortunately, the hardware isn't particularly smart when it comes to sampling from that clear color. If the clear color is out of range for the surface format, it will happily return whatever we put in the surface state packet unmodified. In order to avoid returning bogus values for surfaces with a limited range, we need to do some clamping. Cc: "17.1" <mesa-stable@lists.freedesktop.org> Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-06-23 12:30:24 -07:00
Jason Ekstrand	793b312b4a	i965: Don't bother with HiZ in renderbuffer_move_to_temp This function is only used on gen4-5 which don't support HiZ. Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-06-23 12:30:24 -07:00
Jason Ekstrand	764cce442e	i965/miptree: Rename the non_msrt_mcs functions to _ccs While we're here, we also make the two support checks static since there are no users outside intel_mipmap_tree.c. Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-06-23 12:30:24 -07:00
Jason Ekstrand	a7059a764e	i965/miptree: Delete the layered rendering resolve We never fast-clear more than the base slice (LOD 0, layer 0) anyway, so layered rendering without a resolve is always perfectly safe. Should this ever change in the future, we'll have to put some sort of resolve back in but we can cross that bridge when we come to it. Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-06-23 12:30:24 -07:00
Anuj Phogat	7896dee349	anv/cnl: Don't write to Cache Mode Register 1 on gen10+ For PartialResolveDisableInVC field recommendation is to always set this to 0 and that's the default value of the bit. So, we have nothing left to write to CACHE_MODE_1. Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-06-23 11:16:00 -07:00
Anuj Phogat	b980553309	i965/cnl: Don't write to Cache Mode Register 1 on gen10+ With the optimizations below gone in gen10+, we have nothing left to write to CACHE_MODE_1: Float Blend Optimization Enable: This bit has been removed in gen10+. Partial Resolve Disable in VC: The recommendation is to always set this field to 0 in gen10+, and that's the default value of the bit. Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-06-23 11:16:00 -07:00
Marek Olšák	f6e98e99e3	radeonsi: unreference vertex buffers when destroying the context Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2017-06-23 19:53:54 +02:00
Edmondo Tommasina	2ea16f08f3	drirc: Add glsl_correct_derivatives_after_discard for The Witcher 2 This fixes the long-standing problem with black transitions in The Witcher 2. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=98238 Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2017-06-23 19:50:20 +02:00
Marek Olšák	ee16796d54	radeonsi: implement the workaround for Rocket League - postponed TGSI kill Do KILL at the end of shaders so as not to break WQM. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=100070 Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-06-23 19:50:20 +02:00
Marek Olšák	a98a04ec80	gallium/radeon: pass create_screen flags to r600_common_screen_init Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-06-23 19:50:20 +02:00
Marek Olšák	118b2008ba	st/dri: add a drirc workaround for Rocket League This needs to be passed to gallium drivers. No game fix is planned at this time. The addition of glsl_correct_derivatives_after_discard is generally a good thing for mesa compatibility with the broader GL driver ecosystem. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=100070 Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-06-23 19:50:20 +02:00
Marek Olšák	6b0f6e693b	st/dri: get drirc options before creating pipe_screen dri_init_options_get_screen_flags will return the flags for create_screen(). Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-06-23 19:50:20 +02:00
Marek Olšák	76f379330a	gallium: allow passing 'unsigned flags' to create_screen() for drirc options Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-06-23 19:50:20 +02:00
Marek Olšák	516488bb51	mesa: don't flush vertices in glClientActiveTexture Reviewed-by: Brian Paul <brianp@vmware.com>	2017-06-23 19:50:20 +02:00
Marek Olšák	522173aee4	mesa: don't flag _NEW_ARRAY for GL_PRIMITIVE_RESTART_NV Reviewed-by: Brian Paul <brianp@vmware.com>	2017-06-23 19:50:20 +02:00
Roland Scheidegger	c7688d2de5	llvmpipe:fix using 32bit rasterization mistakenly, causing overflows We use the bounding box (triangle extents) to figure out if 32bit rasterization could potentially overflow. However, we used the bounding box which already got rounded up to 0 for negative coords for this, which is incorrect, leading to overflows and hence bogus rendering in some of our private use. It might be possible to simplify this somehow (we're now using 3 different boxes for binning) but I don't quite see how. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2017-06-23 19:39:29 +02:00
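
The reasoning in c7688d2de5 above can be illustrated with a minimal sketch. This is not the llvmpipe code: the `bbox` struct, the helpers `clamp_to_origin()` and `fits_32bit()`, and the `MAX_EXTENT_32BIT` limit are all hypothetical names and values chosen for illustration. It only shows why an overflow check performed on a box whose negative coordinates were already rounded up to 0 can wrongly conclude that 32-bit rasterization is safe.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical limit: largest triangle extent (in pixels) assumed safe for
 * 32-bit rasterization math. The real llvmpipe threshold is not stated in
 * the commit message. */
#define MAX_EXTENT_32BIT 4096

/* Axis-aligned bounding box in integer pixel coordinates (illustrative). */
struct bbox {
    int32_t x0, y0, x1, y1;
};

/* Box as used for binning: negative minimum coordinates are rounded up to 0. */
static struct bbox clamp_to_origin(struct bbox b)
{
    if (b.x0 < 0) b.x0 = 0;
    if (b.y0 < 0) b.y0 = 0;
    return b;
}

/* True if both extents stay under the assumed 32-bit-safe limit.
 * Extents are computed in 64 bits so the check itself cannot overflow. */
static bool fits_32bit(struct bbox b)
{
    int64_t w = (int64_t)b.x1 - b.x0;
    int64_t h = (int64_t)b.y1 - b.y0;
    return w < MAX_EXTENT_32BIT && h < MAX_EXTENT_32BIT;
}
```

Under these assumptions, a triangle spanning x = -6000 to x = 100 passes `fits_32bit(clamp_to_origin(tri))` because the clamped width is only 100, while `fits_32bit(tri)` on the unclamped box correctly reports that the 6100-pixel extent is too large. That is the distinction the commit describes between the rounded box and the true triangle extents.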


93433 commits