fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-06 11:38:05 +02:00

Author	SHA1	Message	Date
Alok Hota	8608a747aa	swr/rast: Add initial SWTag proto definitions Update gen_archrast.py to properly generate event IDs Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2019-02-25 13:05:17 -06:00
Alok Hota	93cd9905c8	swr/rast: Cleanup and generalize gen_archrast Update meson.build to accomodate Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2019-02-25 13:05:07 -06:00
Daniel Schürmann	0bd45f96b9	nir: Use SM5 properties to optimize shift(a@32, iand(31, b)) This is a common pattern from HLSL->SPIRV translation and supported in HW by all current NIR backends. vkpipeline-db results anv (SKL): total instructions in shared programs: 6403130 -> 6402380 (-0.01%) instructions in affected programs: 204084 -> 203334 (-0.37%) helped: 208 HURT: 0 total cycles in shared programs: 1915629582 -> 1918198408 (0.13%) cycles in affected programs: 1158892682 -> 1161461508 (0.22%) helped: 107 HURT: 86 shader-db results on i965 (KBL): total instructions in shared programs: 15284592 -> 15284568 (<.01%) instructions in affected programs: 81683 -> 81659 (-0.03%) helped: 24 HURT: 0 total cycles in shared programs: 375013622 -> 375013932 (<.01%) cycles in affected programs: 40169618 -> 40169928 (<.01%) helped: 13 HURT: 9 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-02-25 12:59:44 -06:00
Daniel Schürmann	0525bdc225	nir: Define shifts according to SM5 specification. SPIR-V shifts are undefined for values >= bitsize, but SM5 shifts are defined to only use the least significant bits. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-02-25 12:59:43 -06:00
Jason Ekstrand	c4fb6b0c81	intel/eu: Add an EOT parameter to send_indirect_[split]_message For split indirect sends we have to put the EOT parameter in the extended descriptor as well as the instruction itself so just calling brw_inst_set_eot is insufficient. Moving the EOT handling handling into the send_indirect_[split]_message helper lets us handle it properly.	2019-02-25 11:35:12 -06:00
Sergii Romantsov	dcc4866419	d3d: meson: do not prefix user provided d3d-drivers-path The user can select the location where there d3d drivers are installed by the d3d-drivers-path meson option. By default path will be $prefix/$libdir/d3d. Currently we add $prefix to the user provided path. Resulting in an incorrect or even missing path. Based on logic of Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109698 CC: Kenneth Graunke <kenneth@whitecape.org> CC: Emil Velikov <emil.l.velikov@gmail.com> Signed-off-by: Sergii Romantsov <sergii.romantsov@globallogic.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2019-02-25 16:07:02 +00:00
Sergii Romantsov	f6556ec7d1	dri: meson: do not prefix user provided dri-drivers-path The user can select the location where there dri drivers are installed by the dri-drivers-path meson option. By default path will be $prefix/$libdir/dri. Currently we add $prefix to the user provided path. Resulting in an incorrect or even missing path. v2: fixed dri_search_path by default, rebased to master v3: new commit-message (Emil Velikov), cc mesa-stable Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109698 CC: Rafael Antognolli <rafael.antognolli@intel.com> CC: Dylan Baker <dylan@pnwbakers.com> Cc: 18.3 19.0 <mesa-stable@lists.freedesktop.org> Fixes: `306914db92` (meson: Add dridriverdir variable to dri.pc.) Signed-off-by: Sergii Romantsov <sergii.romantsov@globallogic.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2019-02-25 16:07:02 +00:00
Lionel Landwerlin	30828f4646	intel/aub_viewer: silence more compiler warnings format not a string literal and no format arguments. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-02-25 13:11:16 +00:00
Lionel Landwerlin	91df8b1780	intel/aub_viewer: silence compiler warning buffer_addr may be used uninitialized. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-02-25 13:11:13 +00:00
Lionel Landwerlin	f1da10e0c5	intel/aub_viewer: printout 48bits addresses Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-02-25 13:11:05 +00:00
Gert Wollny	875942c059	mesa/core: Enable EXT_depth_clamp for GLES >= 2.0 The extension NV_depth_clamp is written against OpenGL 1.2.1, and since GLES 2.0 is based on GL 2.0 there is no reason not to enable this extension also for GLES >= 2.0. v2: Use EXT_depth_clamp that has been proposed to Khronos v3: - Fix check for extension availability (Erik Faya-Lund) - Also fix the test in is_enabled v4: - Test both, ARB and EXT extension (Erik) v5: - Fix white space errors (Erik) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com>	2019-02-25 09:44:27 +00:00
Kenneth Graunke	b45186a6cd	iris: Properly allow rendering to RGBX formats. I was converting them at pipe_surface creation time, but not when answering queries about whether formats support rendering. This caused a lot of FBO incomplete errors for formats that ought to be supported. Fixes "Child of Light", which uses PIPE_FORMAT_R8G8B8X8_UNORM_SRGB. Also fixes Witcher 1 using wined3d (GL) according to Timur Kristóf. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109738	2019-02-25 01:11:27 -08:00
Kenneth Graunke	fce089c8a2	iris: Drop RGBX -> RGBA for storage image usages GLSL doesn't expose RGB/RGBX image formats, so this isn't needed.	2019-02-25 00:57:50 -08:00
Kenneth Graunke	6921588d54	mesa: Fix RGBBuffers for renderbuffers with sized internal formats For texture attachments, 'f' is texImg->_BaseFormat, but for renderbuffer attachments, 'f' is att->Renderbuffer->InternalFormat. InternalFormat may be something like GL_RGB8, which causes our (f == GL_RGB) check to fail. Switch to using a proper _BaseFormat, which drops the size. Fixes dEQP-GLES31.functional.draw_buffers_indexed.random.max_required_draw_buffers.15 on iris when combined with a driver fix. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com>	2019-02-25 00:57:42 -08:00
Oscar Blumberg	da9c030763	glsl: Fix function return typechecking apply_implicit_conversion only converts and check base types but we need actual type equality for function returns, otherwise you can return a vec2 from a function declared as returning a float. Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2019-02-25 08:49:06 +02:00
Jordan Justen	bd0ad651e0	iris: Always use in-tree i915_drm.h Ref: `f1374805a8` "drm-uapi: use local files, not system libdrm" Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-02-24 21:06:40 -08:00
Alyssa Rosenzweig	f943047e48	panfrost: Decode render target swizzle/channels On MRT-capable systems, the framebuffer format is encoded as a 64-bit word in the render target descriptor. Previously, the two 32-bit words were exposed as opaque hex values. This commit identifies a 12-bit Mali swizzle and a 2-bit channel counter, removing some of the magic. It also adds decoding support for the AFBC and MSAA enable bits, which were already known but otherwise ignored in pandecode. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-02-25 04:49:50 +00:00
Alyssa Rosenzweig	c6be9969d2	panfrost/midgard: Add fround(_even), ftrunc, ffma These ops were discovered by invoking the correspondingly names GLSL functions. The rounding ops here behave exact as expected and are mapped to their corresponding NIR ops where applicable. The ffma behaves as a LUT instruction and requires some special argument packing (since Midgard normally only allows for 2 arguments); this quirk will be addressed in the future, but for now FMA is still lowered. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-02-25 02:36:26 +00:00
Alyssa Rosenzweig	4a4726af3c	panfrost/nondrm: Split out dump_counters Previously, this function was implied a part of the job submit. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-02-25 02:34:16 +00:00
Alyssa Rosenzweig	cdca103d43	panfrost/nondrm: Make COHERENT_LOCAL explicit This flag corresponds to what was MEM_COHERENT_LOCAL in the vendor driver, which seems to influence the cache policy, necessary for the varying temporary storage but nothing else. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-02-25 02:32:45 +00:00
Alyssa Rosenzweig	f44d4653a9	panfrost/nondrm: Flag CPU-invisible regions Potentially, the kernel could optimize these allocations, or perhaps we can save on mapping costs. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-02-25 02:31:09 +00:00
Alyssa Rosenzweig	10cc251842	panfrost/meson: Remove subdir for nondrm This change fixes cross builds with the (temporary) non-DRM overlay. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-02-25 02:27:26 +00:00
Alyssa Rosenzweig	77fea552f6	panfrost: Use tiler fast path (performance boost) For reasons that are still unclear (speculation included in the comment added in this patch), the tiler? metadata has a fast path that we were not enabling; there looks to be a possible time/memory tradeoff, but the details remain unclear. Regardless, this patch improves performance dramatically. Particular wins are for geometry-heavy scenes. For instance, glmark2-es2's Phong-shaded bunny, rendering at fullscreen (2400x1600) via GBM, jumped from ~20fps to hitting vsync cap at 60fps. Gains are even more obvious when vsync is disabled, as in glmark2-es2-wayland. With this patch, on GLES 2.0 samples not involving FBOs, it appears performance is converging with (and sometimes surpassing) the blob. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-02-25 02:25:50 +00:00
Jason Ekstrand	743700be1f	nir/builder: Don't emit no-op swizzles The nir_swizzle helper is used some on it's own but it's also called by nir_channel and nir_channels which are used everywhere. It's pretty quick to check while we're walking the swizzle anyway whether or not it's an identity swizzle. If it is, we now don't bother emitting the instruction. Sure, copy-prop will clean it up for us but there's no sense making more work for the optimizer than we have to. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2019-02-24 20:01:27 -06:00
Jason Ekstrand	724371c6b9	nir/split_vars: Don't compact vectors unnecessarily Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>	2019-02-24 20:01:18 -06:00
Erik Faye-Lund	7a6a5d4bfa	st/mesa: remove unused header-file This header has been unused since `f8f2520e88` ("st/mesa: Remove unnecessary headers"). And in the more than 8 years since, this hasn't been useful. So let's just get rid of it. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-02-24 20:53:37 +01:00
Maya Rashish	021c496135	configure: fix test portability From the bash manual: string1 == string2 string1 = string2 True if the strings are equal. = should be used with the test command for POSIX conformance.	2019-02-24 19:26:15 +00:00
David Shao	6fa923a65d	meson: ensure that xmlpool_options.h is generated for gallium targets that need it Fixes: `68076b8747` "meson: build gallium vdpau state tracker" Fixes: `22a817af8a` "meson: build gallium xvmc state tracker" Fixes: `5a785d51a6` "meson: build gallium va state tracker" Fixes: `0ba909f0f1` "meson: build gallium xa state tracker" Fixes: `1d36dc674d` "meson: build gallium omx state tracker" Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-02-24 09:00:39 +00:00
Matthias Lorenz	f91654120b	vulkan/overlay: Add fps counter Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109747	2019-02-24 01:07:26 +00:00
Lionel Landwerlin	239b0d8570	Revert "anv: add support for INTEL_DEBUG=bat" This reverts commit `e4d88396d2`. Apologies, I pushed the wrong commit.	2019-02-24 01:06:39 +00:00
Lionel Landwerlin	e4d88396d2	anv: add support for INTEL_DEBUG=bat As requested by Ken ;) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-02-23 23:29:04 +00:00
Christian Gmeiner	c56e734496	etnaviv: blt: mark used src resource as read from Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com>	2019-02-23 16:00:50 +01:00
Christian Gmeiner	7244e76804	etnaviv: rs: mark used src resource as read from Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com>	2019-02-23 16:00:25 +01:00
Vinson Lee	2bd08b8b9d	gallium/auxiliary/vl: Fix duplicate symbol build errors. CXXLD gallium_dri.la duplicate symbol _compute_shader_video_buffer in: ../../../../src/gallium/auxiliary/.libs/libgalliumvl.a(libgalliumvl_la-vl_compositor.o) ../../../../src/gallium/auxiliary/.libs/libgalliumvl.a(libgalliumvl_la-vl_compositor_cs.o) duplicate symbol _compute_shader_weave in: ../../../../src/gallium/auxiliary/.libs/libgalliumvl.a(libgalliumvl_la-vl_compositor.o) ../../../../src/gallium/auxiliary/.libs/libgalliumvl.a(libgalliumvl_la-vl_compositor_cs.o) duplicate symbol _compute_shader_rgba in: ../../../../src/gallium/auxiliary/.libs/libgalliumvl.a(libgalliumvl_la-vl_compositor.o) ../../../../src/gallium/auxiliary/.libs/libgalliumvl.a(libgalliumvl_la-vl_compositor_cs.o) Fixes: `9364d66cb7` ("gallium/auxiliary/vl: Add video compositor compute shader render") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: James Zhu <James.Zhu@amd.com>	2019-02-22 23:07:26 -08:00
Caio Marcelo de Oliveira Filho	4c160b6bd8	nir: fix MSVC build Zero initialize struct with {0} instead of {}.	2019-02-22 22:38:05 -08:00
Caio Marcelo de Oliveira Filho	eb13211997	nir/copy_prop_vars: add tests for load/store elements of vectors Test using array deref on vectors in loads and stores. These are marked DISABLED_ as this optimization is currently not done. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-02-22 21:00:50 -08:00
Caio Marcelo de Oliveira Filho	4f3809d389	nir: nir_build_deref_follower accept array derefs of vectors Code itself already supports it, just make sure we can use it for those cases. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-02-22 21:00:50 -08:00
Caio Marcelo de Oliveira Filho	c4beadd28e	nir/copy_prop_vars: change test helper to get intrinsics Replace find_next_intrinsic(intrinsic, after) with get_intrinsic(intrinsic, index). This makes slightly more convenient to check the resulting loads/stores/copies, since in most tests we know which one we care about. The cost is to perform more traversals, but for such tests this is not a problem. Added the ASSERT_EQ() on count to some tests missing it, so the indices queried are always expected to find something. Also, drop two nir_print_shader leftover calls in a test. v2: Remove redundant assertions. nir_src_comp_as_uint already assert what we need. (Jason) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-02-22 21:00:50 -08:00
Caio Marcelo de Oliveira Filho	fdcb9779d9	nir/copy_prop_vars: keep track of components in copy_entry When a copy_entry is SSA, store not only the nir_ssa_def* for each component, but also the source component they come from. At the moment this is always a match (i.e. 'component[i] == i'), because all the operations for a copy_entry happen using definitions with the same size. This prepares the code for array_derefs of vectors, in which 'component[i] != i'. Also, extract setting all SSA components into a function of its own. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-02-22 21:00:50 -08:00
Caio Marcelo de Oliveira Filho	6624decbb5	nir/copy_prop_vars: add debug helpers Disabled by default, to be used during development. Adding those so I don't rewrite some ad-hoc version of them everytime I'm working with this pass. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-02-22 21:00:50 -08:00
Caio Marcelo de Oliveira Filho	60d9bb9ff5	nir/copy_prop_vars: don't get confused by array_deref of vectors For now these derefs are not handled, so don't let these get into the copies list -- which would cause wrong propagations. For load_derefs, do nothing. For store_derefs, invalidate whatever the store is writing to. For copy_derefs, invalidate whatever the copy is writing to. These cases will happen once derefs to SSBOs/UBOs are kept around long enough to get optimized by copy_prop_vars. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-02-22 21:00:50 -08:00
Timothy Arceri	f48527e51a	nir: allow nir_lower_phis_to_scalar() on more src types Rather than only lowering if all srcs are scalarizable we instead check that at least one src is scalarizable. We change undef type to return false otherwise it will cause regressions when it is the only scalarizable src. total instructions in shared programs: 13219105 -> 13024547 (-1.47%) instructions in affected programs: 1153797 -> 959239 (-16.86%) helped: 581 HURT: 74 total cycles in shared programs: 333968972 -> 324807922 (-2.74%) cycles in affected programs: 129809402 -> 120648352 (-7.06%) helped: 571 HURT: 131 total spills in shared programs: 57947 -> 29130 (-49.73%) spills in affected programs: 53364 -> 24547 (-54.00%) helped: 351 HURT: 0 total fills in shared programs: 51310 -> 25468 (-50.36%) fills in affected programs: 44882 -> 19040 (-57.58%) helped: 351 HURT: 0 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-02-23 11:11:51 +11:00
Alok Hota	6053499f2e	swr/rast: bypass size limit for non-sampled textures This fixes a bug where SWR will fail to render in cases with large buffer allocations, e.g. very large meshes whose vertex buffers exceed 2GB CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2019-02-22 23:35:11 +00:00
Marek Olšák	b326a15eda	tgsi: don't set tgsi_info::uses_bindless_images for constbufs and hw atomics This might have decreased performance for radeonsi/tgsi, because most most shaders claimed they used bindless. Cc: 18.3 19.0 <mesa-stable@lists.freedesktop.org> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2019-02-22 18:00:54 -05:00
Jordan Justen	cf652205cf	iris: Add gitlab-ci build testing Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-02-22 14:08:21 -08:00
Rob Clark	fd360c82f0	freedreno/a6xx: cube image fix Note that emit_intrinsic_load_image() already swaps a .3d flag with an .a flag. I tried doing things the other way around (going back to .3d) but that didn't work. And treating cube images as 2d array is also what blob does, so let's just go with that. Fixes dEQP-GLES31.functional.image_load_store.cube.load_store.* Signed-off-by: Rob Clark <robdclark@gmail.com>	2019-02-22 14:05:32 -05:00
Rob Clark	f90c3b4485	freedreno/a6xx: fix border-color offset Fixes nearly all of dEQP-GLES31.functional.texture.border_clamp.* when run after a test that binds textures used in vertex shader. Signed-off-by: Rob Clark <robdclark@gmail.com>	2019-02-22 14:05:32 -05:00
Rob Clark	bdedb8277a	freedreno/ir3: don't hardcode wrmask Fixes dEQP-GLES31.functional.shaders.opaque_type_indexing.sampler.const_literal.vertex.samplercubeshadow and few other similar tests that do multiple texture fetches into individual components of a packet output. Mostly works around the issue mentioned in ra_block_find_definers(). Signed-off-by: Rob Clark <robdclark@gmail.com>	2019-02-22 14:05:32 -05:00
Rob Clark	5d4fa194b8	freedreno: fix race condition rsc->write_batch can be cleared behind our back, so we need to acquire the lock before deref'ing. Signed-off-by: Rob Clark <robdclark@gmail.com>	2019-02-22 14:05:32 -05:00
Kenneth Graunke	3090c6b9e9	vulkan: Fix 32-bit build for the new overlay layer vulkan_core.h defines non-dispatchable handles as (struct object *) on 64-bit systems, but uint64_t on 32-bit systems. The former can be implicitly cast to void *, but the latter requires an explicit cast. While here, %lu is the wrong format specifier for uint64_t on 32-bit systems, so use PRIu64, fixing a warning. Reported-by: Mike Lothian <mike@fireburn.co.uk> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-02-22 08:56:54 -08:00
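
Commit 3090c6b9e9 in the list above turns on two portability points: a handle that is a 64-bit integer on 32-bit builds needs an explicit cast before it can be used as a pointer, and `%lu` is not a safe format for `uint64_t`. The following is a minimal C sketch of both points, not the code from that commit. The names `example_handle`, `format_handle`, and `handle_to_key` are hypothetical and exist only for illustration.

```c
#include <assert.h>
#include <inttypes.h>
#include <stdint.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical stand-in for a non-dispatchable handle as it appears on a
 * 32-bit build, where it is a plain 64-bit integer rather than a pointer. */
typedef uint64_t example_handle;

/* Format a handle portably. PRIu64 expands to the right conversion for
 * uint64_t on every platform, whereas "%lu" only matches when long is 64-bit. */
static inline int format_handle(char *buf, size_t len, example_handle h)
{
    return snprintf(buf, len, "%" PRIu64, h);
}

/* An integer handle does not convert to void * implicitly, so the cast is
 * spelled out, going through uintptr_t to avoid a size-mismatch warning. */
static inline void *handle_to_key(example_handle h)
{
    return (void *)(uintptr_t)h;
}

/* Small self-check exercising both helpers. */
static inline void handle_selfcheck(void)
{
    char buf[32];
    int n = format_handle(buf, sizeof buf, UINT64_MAX);
    assert(n == 20);
    assert(strcmp(buf, "18446744073709551615") == 0);
    assert((uintptr_t)handle_to_key(42) == 42);
}
```

On a 32-bit target the cast through `uintptr_t` drops the upper 32 bits of the handle, so this sketch only suits cases where the value serves as an opaque lookup key.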
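
Commits 0525bdc225 and 0bd45f96b9 in the list above rely on the SM5 rule that a 32-bit shift uses only the least significant bits of its count, which makes an explicit `iand(31, b)` in front of the shift redundant. The C sketch below models that rule to show why the two forms agree. It is an illustration under that assumption, not NIR code, and the helper names (`shl32_sm5`, `ushr32_sm5`, `shl32_masked`) are hypothetical.

```c
#include <assert.h>
#include <stdint.h>

/* SM5-style 32-bit shifts: only the low 5 bits of the count are used.
 * The mask also keeps the C shift well defined for counts >= 32. */
static inline uint32_t shl32_sm5(uint32_t a, uint32_t b)
{
    return a << (b & 31u);
}

static inline uint32_t ushr32_sm5(uint32_t a, uint32_t b)
{
    return a >> (b & 31u);
}

/* The pattern a translator might emit: mask the count first, then shift. */
static inline uint32_t shl32_masked(uint32_t a, uint32_t b)
{
    uint32_t masked = b & 31u;
    return shl32_sm5(a, masked);
}

/* Masking twice gives the same count as masking once, so the pre-masked
 * form and the plain SM5 shift agree for every count. */
static inline void shift_selfcheck(void)
{
    for (uint32_t b = 0; b < 100; b++)
        assert(shl32_masked(0x12345678u, b) == shl32_sm5(0x12345678u, b));
    assert(shl32_sm5(1u, 33u) == 2u);
    assert(ushr32_sm5(0x80000000u, 32u) == 0x80000000u);
}
```

Because `(b & 31) & 31` equals `b & 31`, an optimizer that knows the shift already masks its count can drop the extra AND, which is where the instruction-count savings reported in 0bd45f96b9 come from.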

108558 commits