fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-01-03 20:10:17 +01:00

Author	SHA1	Message	Date
Karol Herbst	2a36699ed3	radv: use nir constant helpers Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-04-14 22:25:56 +02:00
Karol Herbst	adb2263014	amd/nir: some cleanups Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-04-14 22:25:56 +02:00
Alyssa Rosenzweig	1e2cb3e964	panfrost/midgard: Use shared nir_lower_viewport_transform v2: Run before lowering I/O. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Qiang Yu <yuq825@gmail.com>	2019-04-14 19:16:29 +00:00
Alyssa Rosenzweig	2ce4adefa5	nir: Add nir_lower_viewport_transform On Mali hardware (supported by Panfrost and Lima), the fixed-function transformation from world-space to screen-space coordinates is done in the vertex shader prior to writing out the gl_Position varying, rather than in dedicated hardware. This commit adds a shared NIR pass for implementing coordinate transformation and lowering gl_Position writes into screen-space gl_Position writes. v2: Run directly on derefs before io/vars are lowered to cleanup the code substantially. Thank you to Qiang for this suggestion! v3: Bikeshed continues. v4: Add to Makefile.sources (per Jason's comment). Bikeshed comment. Ian and Qiang's reviews are from v3, but no real functional changes from v4. Rob's review is from v4. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Suggested-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Rob Clark <robdclark@gmail.com>	2019-04-14 19:15:13 +00:00
Alyssa Rosenzweig	89b02bffcb	panfrost: Cleanup indexed draw handling As part of this cleanup, we use the newly-exposed u_vbuf_get_minmax_index, deduplicating quite a bit of bookkeeping. We also centralize the draw_flags tracking to make this code cleaner / futureproofed; we have already had bugs regarding this field so we might as well get it right now. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-04-14 15:25:46 +00:00
Alyssa Rosenzweig	74b17b9a9f	panfrost/midgard: Drop dependence on mesa/st This was used as a workaround for uniform sizing which was fixed in `771adffe` ("st: Lower uniforms in st in the...") Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-04-14 15:25:46 +00:00
Mauro Rossi	1af7701666	draw: fix building error in draw_gs_init() Fixes the following building error happening with Android build system: external/mesa/src/gallium/auxiliary/draw/draw_gs.c:740:79: error: address of array 'draw->gs.tgsi.machine->PrimitiveOffsets' will always evaluate to 'true' [-Werror,-Wpointer-bool-conversion] if (!draw->gs.tgsi.machine->Primitives[i] \|\| !draw->gs.tgsi.machine->PrimitiveOffsets) ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~ 1 error generated. Fixes: `7720ce3` ("draw: add support to tgsi paths for geometry streams. (v2)") Signed-off-by: Mauro Rossi <issor.oruam@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2019-04-14 18:07:02 +10:00
Qiang Yu	b46b661f53	lima/gpir: fix alu check miss last store slot Fixes: `92d7ca4b1c` "gallium: add lima driver" Signed-off-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-04-14 12:10:23 +08:00
Qiang Yu	8d91cd64aa	lima/gpir: fix compile fail when two slot node Come from glmark2-es2 jellyfish test. Fixes: `92d7ca4b1c` "gallium: add lima driver" Signed-off-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-04-14 12:10:23 +08:00
Vasily Khoruzhick	fef2f10cc2	lima: add support for depth/stencil fbo attachments and textures Hardware supports writing back Z/S buffers and sampling from them, so add support for that. Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Tested-by: Icenowy Zheng <icenowy@aosc.io>	2019-04-14 01:16:00 +00:00
Vasily Khoruzhick	a817f0fec6	lima: use individual tile heap for each GP job. Looks like it's somehow used by subsequent PP job, so we have to preserve its contents until PP job is done. Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Tested-by: Icenowy Zheng <icenowy@aosc.io>	2019-04-14 01:16:00 +00:00
Christian Gmeiner	b6bed115a5	nir: add lower_ftrunc Port TGSI TRUNC lowering to nir Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-04-13 17:54:48 +00:00
Mauro Rossi	e538dd67de	android: fix LLVM version string related building errors Adding \ prior to " in llvm version string fixes the following building errors: external/mesa/src/gallium/drivers/r600/r600_pipe_common.c:1290:14: error: expected ')' ", LLVM " MESA_LLVM_VERSION_STRING ^ <command line>:8:34: note: expanded from here ^ external/mesa/src/gallium/drivers/r600/r600_pipe_common.c:1287:10: note: to match this '(' snprintf(rscreen->renderer_string, sizeof(rscreen->renderer_string), ^ 1 error generated. Fixes: 05b114e ("simplify LLVM version string printing") Signed-off-by: Mauro Rossi <issor.oruam@gmail.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch>	2019-04-13 18:56:14 +02:00
Lionel Landwerlin	9e7b0988d6	anv: leave the top 4Gb of the high heap VMA unused In `628c9ca908` I forgot to apply the same -4Gb of the high address of the high heap VMA. This was previously computed in the HIGH_HEAP_MAX_ADDRESS. Many thanks to James for pointing this out. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reported-by: Xiong, James <james.xiong@intel.com> Fixes: `628c9ca908` ("anv: store heap address bounds when initializing physical device") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-04-13 12:08:23 +00:00
Eric Anholt	dc402be73e	v3d: Use the new lower_to_scratch implementation for indirects on temps. We can use the same register spilling infrastructure for our loads/stores of indirect access of temp variables, instead of doing an if ladder. Cuts 50% of instructions and max-temps from 2 KSP shaders in shader-db. Also causes several other KSP shaders with large bodies and large loop counts to not be force-unrolled. The change was originally motivated by NOLTIS slightly modifying register pressure in piglit temp mat4 array read/write tests, triggering register allocation failures.	2019-04-12 16:16:58 -07:00
Jason Ekstrand	18ed82b084	nir: Add a pass for selectively lowering variables to scratch space This commit adds new nir_load/store_scratch opcodes which read and write a virtual scratch space. It's up to the back-end to figure out what to do with it and where to put the actual scratch data. v2: Drop const_index comments (by anholt) Reviewed-by: Eric Anholt <eric@anholt.net>	2019-04-12 15:59:31 -07:00
Eric Anholt	8a2d91e124	v3d: Detect the correct number of QPUs and use it to fix the spill size. We were missing a * 4 even if the particular hardware matched our assumption.	2019-04-12 15:59:31 -07:00
Eric Anholt	11ba8a46e4	v3d: Add missing dumping for the spill offset/size uniforms.	2019-04-12 15:59:31 -07:00
Eric Anholt	42cf57f186	v3d: Add missing base offset to CS shared memory accesses. This code is so touchy, trying to emit the minimum amount of address math. Some day we'll move it all to NIR, I hope.	2019-04-12 15:59:31 -07:00
Eric Anholt	6b1c659825	v3d: Add Compute Shader compilation support. While waiting for the CSD UABI to get reviewed, I keep having to rebase the CS patch. Just land the compiler side for now to keep it from diverging. For now this covers just GLES 3.1 compute shaders, not CL kernels.	2019-04-12 15:59:31 -07:00
Eric Anholt	1e0a72ce09	v3d: Replace the old shader-db env var output with the ARB_debug_output. We're using ARB_debug_output for the main shader-db, but I had this env var left around from the shader-db-2 support (vc4 apitrace-based). Keep the env var around since it's nice sometimes to get the stats on a shader you're optimizing without having to do a shader-db run, but drop the old formatting that's not useful and keeps tricking me when I go to add another measurement to the shader-db output.	2019-04-12 15:59:31 -07:00
Eric Anholt	b02dbaa8ce	v3d: Include the number of max temps used in the shader-db output. This gives us finer-grained feedback on how we're doing on register pressure than "did we trigger a new shader to spill or drop thread count?"	2019-04-12 15:59:24 -07:00
Eric Anholt	276ec879fd	v3d: Drop a note for the future about PIPE_CAP_PACKED_UNIFORMS.	2019-04-12 15:58:28 -07:00
Eric Anholt	89b7df552b	v3d: Add and use a define for the number of channels in a QPU invocation. A shader invocation always executes 16 channels together, so we often end up multiplying things by this magic 16 number. Give it a name.	2019-04-12 15:58:28 -07:00
Eric Anholt	b88ef3bd76	nir: Add a comment about how intrinsic definitions work. I was thinking about a refactor, and needed to read this first. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-04-12 15:56:12 -07:00
Eric Anholt	35355b4860	nir: Drop remaining references to const_index in favor of the call to use. Please don't make me read a const_index[] expression ever again. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-04-12 15:56:04 -07:00
Eric Anholt	6e4d3d0a2f	nir: Drop comments about the constant_index slots for load/stores. The constant_index slots are named right there in the intrinsic definition, and the comment is just a chance to get out of sync. Noticed while reviewing the lower_to_scratch changes that copy-and-pasted wrong comments, and load_ubo and load_per_vertex_output had incorrect comments currently. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-04-12 15:55:55 -07:00
Sagar Ghuge	066d2aebc0	intel/fs: Remove unused condition from opt_algebraic case We will never hit a condition where we have src1 and src2 as immediate operands. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-04-12 13:47:57 -07:00
Kenneth Graunke	9e0c744f07	glsl: Set location on structure-split sampler uniform variables gl_nir_lower_samplers_as_deref splits structure uniform variables, creating new variables for individual fields. As part of that, it calculates a new location. It then never set this on the new variables. Thanks to Michael Fiano for finding this bug. Fixes crashes on i965 with Piglit's new tests/spec/glsl-1.10/execution/samplers/uniform-struct test, which was reduced from the failing case in Michael's app. Fixes: `f003859f97` nir: Make gl_nir_lower_samplers use gl_nir_lower_samplers_as_deref Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2019-04-12 10:35:08 -07:00
Mateusz Krzak	f4fc2ece57	panfrost: use os_mmap and os_munmap 32-bit needs mmap64 for 64-bit offsets. We get 64-bit offsets from kernel. Signed-off-by: Mateusz Krzak <kszaquitto@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-04-12 16:33:00 +00:00
Mateusz Krzak	411da8b80d	panfrost: cast bo_handles pointer to uintptr_t first Required for 64-bit kernel to interpret the pointer from 32-bit userspace. Signed-off-by: Mateusz Krzak <kszaquitto@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-04-12 16:33:00 +00:00
Jason Ekstrand	7eaaff18cb	anv/pipeline: Fix MEDIA_VFE_STATE::PerThreadScratchSpace on gen7 We were always programming it with the Broadwell convention which is too large by a factor of two on Haswell and just plain wrong on IVB and BYT. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable@lists.freedesktop.org	2019-04-12 16:08:35 +00:00
Eric Engestrom	da1a5a19bd	gitlab-ci: add lima to the build Suggested-by: Karol Herbst <karolherbst@gmail.com> Signed-off-by: Eric Engestrom <eric.engestrom@intel.com>	2019-04-12 15:43:19 +00:00
Marek Olšák	f4ae188d50	ac: use the common helper ac_apply_fmask_to_sample Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-04-12 11:35:31 -04:00
Marek Olšák	971bc10177	radeonsi: set AC_FUNC_ATTR_READNONE for image opcodes where it was missing Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-04-12 11:34:39 -04:00
Marek Olšák	467ff6ebfe	mesa: don't overwrite existing shader files with MESA_SHADER_CAPTURE_PATH Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2019-04-12 11:34:39 -04:00
Marek Olšák	bd2995c8b7	glsl: allow the #extension directive within code blocks for the dri option for Viewperf 13 Acked-by: Timothy Arceri <tarceri@itsqueeze.com>	2019-04-12 11:34:39 -04:00
Samuel Pitoiset	6718bb57ac	ac/nir: remove some useless integer casts for ALU operations Sources are always casted to integers. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-04-12 17:30:55 +02:00
Samuel Pitoiset	8a6442075f	ac/nir: remove useless integer cast in visit_image_load() ac_build_image_opcode() casts if necessary and buffer images are casted too. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-04-12 17:30:55 +02:00
Samuel Pitoiset	ffbb62f808	ac/nir: remove useless integer cast in adjust_sample_index_using_fmask() It's already casted if necessary in ac_build_image_opcode(). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-04-12 17:30:55 +02:00
Samuel Pitoiset	7b5b27a685	ac/nir: remove useles LLVMGetUndef for nir_op_pack_64_2x32_split Trivial. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-04-12 17:30:55 +02:00
Samuel Pitoiset	fd4041987b	ac: add ac_build_load_helper_invocation() helper Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-04-12 17:30:55 +02:00
Samuel Pitoiset	590a4c8981	ac: add ac_build_ddxy_interp() helper Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-04-12 17:30:55 +02:00
Samuel Pitoiset	4cb13e9462	ac: add ac_build_umax() and use it where possible This changes the predicate from LessThan to Equal. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-04-12 17:30:55 +02:00
Samuel Pitoiset	cf88bfa75a	ac/nir: make use of ac_build_umin() where possible This changes the predicate from LessThan to Equal. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-04-12 17:30:54 +02:00
Samuel Pitoiset	15dd81913f	ac/nir: make use of ac_build_imin() where possible This changes the predicate from LessThan to Equal. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-04-12 17:30:54 +02:00
Samuel Pitoiset	d7a0c0d53b	ac/nir: make use of ac_build_imax() where possible Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-04-12 17:30:54 +02:00
Karol Herbst	a55c7352d6	lima: add bool parameter to type_size function Fixes: `035759b61b` ("nir/i965/freedreno/vc4: add a bindless bool to type size functions") Signed-off-by: Karol Herbst <kherbst@redhat.com> Tested-by: Icenowy Zheng <icenowy@aosc.io> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-04-12 17:08:53 +02:00
Karol Herbst	98934e6aa1	nvc0/nir: enable bindless texture Signed-off-by: Karol Herbst <kherbst@redhat.com>	2019-04-12 09:02:59 +02:00
Karol Herbst	89a81fbd98	nv50/ir/nir: add support for bindless images Signed-off-by: Karol Herbst <kherbst@redhat.com>	2019-04-12 09:02:59 +02:00

1 2 3 4 5 ...

109940 commits