fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-27 16:38:12 +02:00

Author	SHA1	Message	Date
Eric Anholt	971a13d805	Revert "v3d: Disable PIPE_CAP_BLIT_BASED_TEXTURE_TRANSFER." This reverts commit `ccce940947`, leaving a note as to why we had to (corruption in chromium, breaking some GLES3.1 tests).	2019-04-26 12:42:30 -07:00
Eric Anholt	49071b2e3f	v3d: Don't try to update the shadow texture for separate stencil. There are two cases where v3d's sampler view's resource doesn't match the base's: shadow textures for sampling from raster, and pointing at the separate depth texture for z32f_s8x24. We only want to update shadow for the first case. Fixes dEQP-GLES31.functional.stencil_texturing.render.depth32f_stencil8_draw when run after the previous testcase.	2019-04-26 12:42:30 -07:00
Eric Anholt	c74d0e7f62	vc4: Use _mesa_hash_table_remove_key() where appropriate.	2019-04-26 12:42:30 -07:00
Eric Anholt	d8486c2ad7	v3d: Use _mesa_hash_table_remove_key() where appropriate.	2019-04-26 12:42:30 -07:00
Eric Anholt	42210a4351	v3d: Apply the GFXH-930 workaround to the case where the VS loads attrs. We were emitting a dummy load for when the VS doesn't load any attributes, but we also need to emit a dummy load for when the render VS loads attributes but the binner VS doesn't. Fixes simulator assertion failures and GPU hangs on KHR-GLES31.core.texture_gather.\*	2019-04-26 12:42:30 -07:00
Eric Anholt	448fc3ea42	v3d: Fill in the ignored segment size fields to appease new simulator. We are assured that the input segment size field is ignored for !separate_segs mode, and now the simulator wants an in-range value set regardless of whether it's functionally ignored or not.	2019-04-26 12:40:31 -07:00
Alok Hota	8bfb34fd0a	swr/rast: enforce use of tile offsets Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2019-04-26 13:00:45 -05:00
Alok Hota	0e49963212	swr/rast: AVX512 support compiled in by default - Emulation of AVX512 built into SIMDLIB - Remove associated macros - Remove knobs controlling AVX512 and let emulation handle it - Refactor variable names for SIMD16 Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2019-04-26 13:00:38 -05:00
Alok Hota	0bf1df2bb6	swr/rast: Remove deprecated 4x2 backend code - Use 8x2 tiling by default - Remove associated macros - Use SIMDLIB emulation for SIMD16 on SIMD8 hardware - Remove code rot in Load/StoreTile Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2019-04-26 13:00:24 -05:00
Tomasz Figa	e8bf4efceb	llvmpipe: Always return some fence in flush (v2) If there is no last fence, due to no rendering happening yet, just create a new signaled fence and return it, to match the expectations of the EGL sync fence API. Fixes random "Could not create sync fence 0x3003" assertion failures from Skia on Android, coming from the following code: https://android.googlesource.com/platform/frameworks/base/+/master/libs/hwui/pipeline/skia/SkiaOpenGLPipeline.cpp#427 Reproducible especially with thread count >= 4. One could make the driver always keep the reference to the last fence, but: - the driver seems to explicitly destroy the fence whenever a rendering pass completes and changing that would require a significant functional change to the code. (Specifically, in lp_scene_end_rasterization().) - it still wouldn't solve the problem of an EGL sync fence being created and waited on without any rendering happening at all, which is also likely to happen with Android code pointed to in the commit. Therefore, the simple approach of always creating a fence is taken, similarly to other drivers, such as radeonsi. Tested with piglit llvmpipe suite with no regressions and following tests fixed: egl_khr_fence_sync conformance eglclientwaitsynckhr_flag_sync_flush eglclientwaitsynckhr_nonzero_timeout eglclientwaitsynckhr_zero_timeout eglcreatesynckhr_default_attributes eglgetsyncattribkhr_invalid_attrib eglgetsyncattribkhr_sync_status v2: - remove the useless lp_fence_reference() dance (Nicolai), - explain why creating the dummy fence is the right approach. Signed-off-by: Tomasz Figa <tfiga@chromium.org> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2019-04-26 11:26:33 +01:00
Emil Velikov	591955d82d	llvmpipe: correctly handle waiting in llvmpipe_fence_finish Currently if the timeout differs from 0, we'll end up with infinite wait... even if the user is perfectly clear they don't want that. Use the new lp_fence_timedwait() helper guarding both waits in an !lp_fence_signalled block like the rest of llvmpipe. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2019-04-26 11:26:33 +01:00
Emil Velikov	5b284fe6bc	llvmpipe: add lp_fence_timedwait() helper The function is analogous to lp_fence_wait() while taking at timeout (ns) parameter, as needed for EGL fence/sync. v2: - use absolute UTC time, as per spec (Gustaw) - bail out on cnd_timedwait() failure (Gustaw) v3: - check count/rank under mutex (Gustaw) Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> (v1) Reviewed-by: Gustaw Smolarczyk <wielkiegie@gmail.com>	2019-04-26 11:26:33 +01:00
Kenneth Graunke	529ace7887	iris: Silence unused function warning	2019-04-25 17:33:56 -07:00
Rob Clark	7a57cfbed6	freedreno/a6xx: sample-shading support Enables: OES_sample_shading OES_sample_variables OES_shader_multisample_interpolation Signed-off-by: Rob Clark <robdclark@chromium.org>	2019-04-25 14:13:31 -07:00
Rob Clark	85949c52b4	freedreno: wire up core sample-shading support Signed-off-by: Rob Clark <robdclark@chromium.org>	2019-04-25 14:13:31 -07:00
Rob Clark	49f922d96c	freedreno/a6xx: add VALIDREG/CONDREG helper macros There are a few places that we check if a shader stage input reg is used/valid (ie. not r63.x).. and there are about to be a bunch more. So add some helper macros for less open-coding. Signed-off-by: Rob Clark <robdclark@chromium.org>	2019-04-25 14:13:31 -07:00
Rob Clark	4e3ce224a7	freedreno: update generated headers Pull in updates for sample shading. Signed-off-by: Rob Clark <robdclark@chromium.org>	2019-04-25 14:13:31 -07:00
Rob Clark	4d08c1b595	compiler: rename SYSTEM_VALUE_VARYING_COORD And add corresponding enums for different sorts of varying interpolation. Signed-off-by: Rob Clark <robdclark@chromium.org>	2019-04-25 14:13:31 -07:00
Rob Clark	96d2e4ab8a	freedreno: add robustness support Signed-off-by: Rob Clark <robdclark@chromium.org>	2019-04-25 14:13:31 -07:00
Alyssa Rosenzweig	77d091d0c5	panfrost/midgard: Add new bitwise ops These fused NOT-ops could maybe help somehow...? Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-04-25 20:37:46 +00:00
Alyssa Rosenzweig	bcabcfe3ad	panfrost/midgard: Identify inand This was previously thought to be inot, but it's actually a bit more general than that! :) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-04-25 20:37:45 +00:00
Alyssa Rosenzweig	5f942db190	panfrost/midgard: Copy prop for texture registers We'll want to unify this with main copy prop (and extend to varyings), but that'll take more care to handle some special cases, so leave it as a stub pass for now. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-04-25 20:37:45 +00:00
Alyssa Rosenzweig	4d821a1101	panfrost/midgard: Optimize csel involving 0 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-04-25 20:37:45 +00:00
Alyssa Rosenzweig	b53b4573c3	panfrost/midgard: Extend copy propagation pass This extends copy propagation to respect output modifiers for ALU instructions, as well as potentially fixing some bugs related to looping (all dEQP loop tests pass). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-04-25 20:37:45 +00:00
Alyssa Rosenzweig	7bc91b487b	panfrost/midgard: Reduce fmax(a, 0.0) to fmov.pos This will allow us to copyprop away the move and eliminate the instruction entirely. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-04-25 20:37:45 +00:00
Bas Nieuwenhuizen	427024bf2e	ac/nir: Add support for planes. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-04-25 19:56:20 +00:00
Andrii Simiklit	4e9592c5fa	iris: make the TFB result visible to others OpenGL 4.6 Spec: "5.3.3 Rules ....... Note: “Updates” via rendering or transform feedback are treated consistently with updates via GL commands. Once EndTransformFeedback has been issued, any subsequent command in the same context that uses the results of the transform feedback operation will see the results." v2: removed a wrong comment ( Kenneth Graunke <kenneth@whitecape.org> ) v3: - flush+dirty depends on buffers usage history - removed an old hack ( Kenneth Graunke <kenneth@whitecape.org> ) Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=110404 Signed-off-by: Andrii Simiklit <andrii.simiklit@globallogic.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-04-25 11:48:04 -07:00
Kenneth Graunke	aa7306b4cf	iris: Some tidying for preemption support Just enable it during init_render_context on Gen10+, and move the Gen9 state tracking into iris_genx_state so it only exists on Gen9. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2019-04-25 11:26:24 -07:00
Marek Olšák	383f406591	radeonsi: remove dirty slot masks from scissor and viewport states All registers in the array need to be updated if any of them is changed. Only apps writing gl_ViewportIndex were affected by this bug.	2019-04-25 11:49:38 -04:00
Marek Olšák	440135e5a0	radeonsi/gfx9: rework the gfx9 scissor bug workaround (v2) Needed to track context rolls caused by streamout and ACQUIRE_MEM. ACQUIRE_MEM can occur outside of draw calls. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=110355 v2: squashed patches and done more rework Cc: 19.0 <mesa-stable@lists.freedesktop.org>	2019-04-25 11:49:38 -04:00
Marek Olšák	bc0d924507	radeonsi/gfx9: set that window_rectangles always roll the context Cc: 19.0 <mesa-stable@lists.freedesktop.org>	2019-04-25 11:49:38 -04:00
Jon Turney	5d310015c5	meson: Force '.so' extension for DRI drivers DRI driver loadable modules are always installed with install_megadriver.py with names ending with '.so', irrespective of platform. Force the name the loadable module is built with to match, so install_megadriver.py doesn't spin trying to remove non-existent symlinks. Fixes: `c77acc3c` "meson: remove meson-created megadrivers symlinks"	2019-04-25 12:40:16 +01:00
Nicolai Hähnle	9445a4ab43	radeonsi: add radeonsi_sync_compile option Force the driver thread to sync immediately with a compiler thread (but compilation still happens in a separate thread). This can be useful to simplify debugging compiler issues. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-04-25 12:35:29 +02:00
Nicolai Hähnle	ca95adf8ff	radeonsi: add radeonsi_aux_debug option for aux context debug dumps Enabling this option will create ddebug-style dumps for the aux context, except that instead of intercepting the pipe_context layer we just dump the IB contents on flush. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-04-25 12:35:27 +02:00
Nicolai Hähnle	fea3dcb844	ddebug: expose some helper functions as non-inline Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-04-25 12:35:24 +02:00
Nicolai Hähnle	ac0b60fa47	ddebug: dump driver state into a separate file Due to asynchronous execution, it's not clear which of the draws the state may refer to. This also works around an issue encountered with radeonsi where dumping the driver state itself caused a hang. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-04-25 12:35:21 +02:00
Nicolai Hähnle	b7fab7b02d	ddebug: log calls to pipe->flush This can be useful when internal draws lead to a hang. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-04-25 12:35:19 +02:00
Nicolai Hähnle	fe0d2b3d37	ddebug: set thread name For better debuggability. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-04-25 12:35:16 +02:00
Nicolai Hähnle	563faa3903	util/u_log: flush auto loggers before starting a new page Without this, command stream dumps of radeonsi may misleadingly end up in a later page. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-04-25 12:35:09 +02:00
Nicolai Hähnle	8bef4df196	radeonsi: add si_debug_options for convenient adding/removing of options Move the definition of radeonsi_clear_db_cache_before_clear there, as well as radeonsi_enable_nir. This removes the AMD_DEBUG=nir option. We currently still have two places for options: the driconf machinery and AMD_DEBUG/R600_DEBUG. If we are to have a single place for options, then the driconf machinery should be preferred since it's more flexible. The only downside of the driconf machinery was that adding new options was quite inconvenient. With this change, a simple boolean option can be added with a single line of code, same as for AMD_DEBUG. One technical limitation of this particular implementation is that while almost all driconf features are available, the translation machinery doesn't pick up the description strings for options added in si_debvug_options. In practice, translations haven't been provided anyway, and this is intended for developer options, so I'm not too worried. It could always be added later if anybody really cares. v2: - use bool instead of uint8_t for options - si_debug_options.inc -> si_debug_options.h Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-04-25 12:31:02 +02:00
Marek Olšák	36cfe5fd62	radeonsi: add BOs after need_cs_space need_cs_space may clear the buffer list. Fixes: `951d60f8cd` "radeonsi: delay adding BOs at the beginning of IBs until the first draw" Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-04-24 20:59:07 -04:00
Eric Anholt	d23b47fda5	v3d: Disable SSBOs and atomic counters on vertex shaders. The CTS fails on dEQP-GLES31.functional.shaders.opaque_type_indexing.atomic_counter.*vertex when they are enabled, due to the VS being run for both bin and render. I think this behavior is expected to be valid, but I can't find text in atomic counters or SSBO specs saying so (the closed I found was in shader_image_load_store). Just disable it for now, since the closed source driver doesn't expose vertex atomic counters/SSBOs either.	2019-04-24 17:24:11 -07:00
Kenneth Graunke	2812ef2a26	iris: Advertise EXT_texture_sRGB_R8 support Using the luminance format, like both brw and anv do.	2019-04-24 16:49:13 -07:00
Kenneth Graunke	59aa7c924d	iris: Enable GL_AMD_depth_clamp_separate We support this, we just forgot to turn it on.	2019-04-24 16:49:13 -07:00
Marek Olšák	131d56edfb	util: fix a compile failure in u_compute.c on windows	2019-04-24 19:04:20 -04:00
Mike Blumenkrantz	c7c59f75e5	iris: enable preemption support for gen10 this automatically enables preemption on gen10 where it is disabled by default but still available Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2019-04-24 14:47:47 -07:00
Mike Blumenkrantz	7315882023	iris: add preemption support on gen9 this is basically just porting the following two commits to gallium: `d8b50e152a` `5c454661c6` resolves kwg/mesa#49 Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2019-04-24 14:47:08 -07:00
Kenneth Graunke	21688a306b	iris: Split iris_flush_and_dirty_for_history into two helpers. We create two new helpers, iris_flush_bits_for_history, and iris_dirty_for_history, then use them in the existing function. The first accumulates flush bits based on res->bind_history, but doesn't actually perform a flush. This allows us to accumulate flush bits by looping over multiple resources, but ultimately emit a single flush for all of them. The latter flags dirty bits without flushing, which again allows us to handle multiple resources, but also is more convenient when writing from the CPU where we don't need a flush (as in commit `4d12236072`).	2019-04-24 13:31:32 -07:00
Dave Airlie	ce17e413de	virgl/drm: insert correct handles into the table. (v3) This inserts a handle for the flink name and a handle the correct gem handle for the bo. v2: fix handles/names confusion (Lepton Wu) v3: set flink name correctly (Lepton Wu) Reviewed-by: Chia-I Wu <olvaffe@gmail.com>	2019-04-25 06:05:43 +10:00
Dave Airlie	8a39f83fb2	virgl/drm: handle flink name better. This realigns this code with code from radeon. Reviewed-by: Chia-I Wu <olvaffe@gmail.com>	2019-04-25 06:05:43 +10:00

1 2 3 4 5 ...

37681 commits