fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-06 11:38:05 +02:00

Author	SHA1	Message	Date
Gert Wollny	e7dd1a84a0	mesa/st: draw_vbo: initialize restart_index too restart_index is later always used in a comparison, so it should be initialized properly. Fixes valgrind warning: Conditional jump or move depends on uninitialised value(s) at 0xB8D682F: r600_draw_vbo (r600_state_common.c:2153) by 0xB71F743: u_vbuf_draw_vbo (u_vbuf.c:1156) by 0xB3B92DB: st_draw_vbo (st_draw.c:235) by 0xB36B1AE: vbo_draw_arrays (vbo_exec_array.c:391) by 0xB36BB0D: vbo_exec_DrawArrays (vbo_exec_array.c:550) by 0x10A989: piglit_display (textureSize.c:157) by 0x4F8F174: run_test (piglit_fbo_framework.c:52) by 0x4F7BA12: piglit_gl_test_run (piglit-framework-gl.c:229) by 0x10A60A: main (textureSize.c:71) Uninitialised value was created by a stack allocation at 0xB3B90B0: st_draw_vbo (st_draw.c:143) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Mathias Fröhlich <mathias.froehlich@web.de> Reviewed-by: Brian Paul <brianp@vmware.com>	2018-07-05 07:57:16 +02:00
Timothy Arceri	0cb6537dee	mesa: enable ARB_direct_state_access in OpenGL 4.5 compat profile Its unlikely anyone will add proper ARB_direct_state_access compat support before we branch 18.2. Enabling the extension in 4.5 at least allows users to make use of MESA_GL_VERSION_OVERRIDE=4.5COMPAT for games like No Mans Sky. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-07-05 13:15:34 +10:00
Timothy Arceri	39063334d3	util/drirc: turn on force_glsl_extensions_warn for No Mans Sky The game forgets to enable multiple extensions in its shaders, one of those extesions is EXT_texture_array. But enabling this config entry fixes at least one other rendering issue that enabling EXT_texture_array on its own doesn't fix. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-07-05 13:05:47 +10:00
Marek Olšák	9b4c4fe334	util/queue: remove leftover debug code	2018-07-04 22:19:47 -04:00
Marek Olšák	7fab8a4b37	Shorten u_queue names There is a 15-character limit for thread names shared by the queue name and process name. Shorten the thread name to make space for the process name. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-07-04 22:03:35 -04:00
Marek Olšák	b238e33bc9	kutil/queue: add a process name into a thread name v2: simplifications Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> (v1) Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> (v1)	2018-07-04 21:54:39 -04:00
Marek Olšák	7149bffe66	gallium/os: use util_get_process_name when possible Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-07-04 21:16:57 -04:00
Marek Olšák	2e1e6511f7	util: extract get_process_name from xmlconfig.c Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-07-04 21:16:03 -04:00
Marek Olšák	4695984dbc	ac: fold LLVMContext creation into ac_llvm_context_init Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-07-04 15:48:18 -04:00
Marek Olšák	f5cb4194c9	radeonsi: reorder code in si_llvm_context_init Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-07-04 15:48:18 -04:00
Marek Olšák	ff330055e9	radeonsi: use ac_compile_module_to_binary to reduce compile times Compile times of simple shaders are reduced by ~20%. Compile times of prologs and epilogs are reduced by up to 40%. Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-07-04 15:48:18 -04:00
Marek Olšák	0075e5fed8	ac: add reusable helpers for direct LLVM compilation This is basically LLVMTargetMachineEmitToMemoryBuffer inlined and reworked. struct ac_compiler_passes (opaque type) contains the main pass manager. ac_create_llvm_passes -- the result can go to thread local storage ac_destroy_llvm_passes -- can be called by a destructor in TLS ac_compile_module_to_binary -- from LLVMModuleRef to ac_shader_binary The motivation is to do the expensive call addPassesToEmitFile once per context or thread. Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-07-04 15:48:18 -04:00
Rhys Perry	c2ae9b4052	nvc0: implement multisampled images on Maxwell+ Changes in v2: - make loadSuInfo32() protected without making the rest protected - move NVC0_SU_INFO_* into nv50_ir_lowering_nvc0.h instead of duplicating NVC0_SU_INFO_MS Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Karol Herbst <kherbst@redhat.com>	2018-07-04 16:04:23 +02:00
Neil Roberts	2d5ddbe960	i965: Fix output register sizes when variable ranges are interleaved In `6f5abf3146` this code was fixed to calculate the maximum size of an attribute in a seperate pass and then allocate the registers to that size. However this wasn’t taking into account ranges that overlap but don’t have the same starting location. For example: layout(location = 0, component = 0) out float a[4]; layout(location = 2, component = 1) out float b[4]; Previously, if ‘a’ was processed first then it would allocate a register of size 4 for location 0 and it wouldn’t allocate another register for location 2 because it would already be covered by the range of 0. Then if something tries to write to b[2] it would try to write past the end of the register allocated for ‘a’ and it would hit an assert. This patch changes it to scan for any overlapping ranges that start within each range to calculate the maximum extent and allocate that instead. Fixed Piglit’s arb_enhanced_layouts/execution/component-layout/ vs-fs-array-interleave-range.shader_test Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Fixes: `6f5abf3146` "i965: Fix output register sizes when multiple variables share a slot."	2018-07-04 10:57:51 +02:00
Dave Airlie	8c51caab24	r600/sb: cleanup if_conversion iterator to be legal C++ The current code causes: /usr/include/c++/8/debug/safe_iterator.h:207: Error: attempt to copy from a singular iterator. This is due to the iterators getting invalidated, fix the reverse iterator to use the return value from erase, and cast it properly. (used Mathias suggestion) Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Mathias Fröhlich <mathias.froehlich@web.de>	2018-07-04 07:42:22 +01:00
Marek Olšák	45f9d58668	radeonsi: fix compiler breakage Broken by `d853d3a59b`.	2018-07-04 00:13:38 -04:00
Dave Airlie	5b32b246cf	ac: make some fns static Some of the compiler functions are no longer called outside the util file. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-07-04 10:29:26 +10:00
Dave Airlie	7398913a62	ac/radv: move llvm compiler info to struct and init in one place This ports radv to the shared code, however due to a bug in LLVM version prior to 7, radv cannot add target info at this stage, as it would leak one for every shader compile, however I'd prefer to keep this llvm damage in the shared code, since it isn't the driver at fault here. We just add a flag to denote if the driver can support leaking the target info or not, and the common code does the right thing depending on the llvm version. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-07-04 10:29:16 +10:00
Dave Airlie	d853d3a59b	ac/radeonsi: port compiler init/destroy out of radeonsi. We want to share this code with radv in the future, so port it out of radeonsi. Add a return value as radv will want that to know if this succeeds Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-07-04 10:29:03 +10:00
Dave Airlie	35c82af539	radv/radeonsi: add a check ir tm options This doesn't do much yet, but it makes it easier to move the code to a common shared code base. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-07-04 05:32:35 +10:00
Dave Airlie	0eb65b4944	radeonsi: rename si_compiler -> ac_llvm_compiler As precursor to moving init to common code, just rename the struct and move it. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-07-04 05:31:32 +10:00
Dave Airlie	887ba45c93	ac: add target library info helpers Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-07-04 05:31:29 +10:00
Dave Airlie	e1387eaf12	radv: create/destroy passmgr at the higher level. This is prep work for moving this to a per-thread struct Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-07-04 05:31:05 +10:00
Dave Airlie	97d9b88447	radv: port to use common passmgr code. This adds a inline always pass, but otherwise should work the same. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-07-04 05:30:34 +10:00
Dave Airlie	584ad1eda9	ac/radeonsi: refactor out pass manager init to common code. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-07-04 05:18:01 +10:00
Dave Airlie	f2b3e96e75	radv: drop copy of ac_create_target_machine. Once we split the init once stuff out, this can be shared again. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-07-04 05:15:35 +10:00
Dave Airlie	473be16c74	ac/radv: split the non-common init_once code from the common target code. (v2) This just splits out the non-shared code and reuses ac_get_llvm_target in radv. v2: rebase on Marek's patch - fixup brace position/whitespace Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-07-04 05:15:23 +10:00
Neil Roberts	590cc7c8f6	i965: Use the new nir atomic counter linker for SPIR-V shaders Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-07-03 12:41:46 +02:00
Alejandro Piñeiro	c13f8ea8ac	i965: enable AtomicStorage capability for gen7+ That is the same gen requirement for ARB_shader_atomic_counters. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-07-03 12:41:46 +02:00
Antia Puentes	7600678216	mesa/glspirv: lower workgroup access to offsets This will perform the CS shared lowering. See `8761a04d0d` Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-07-03 12:41:46 +02:00
Antia Puentes	fbcebfc5bf	nir: Fix OpAtomicCounterIDecrement for uniform atomic counters From the SPIR-V 1.0 specification, section 3.32.18, "Atomic Instructions": "OpAtomicIDecrement: <skip> The instruction's result is the Original Value." However, we were implementing it, for uniform atomic counters, as a pre-decrement operation, as was the one available from GLSL. Renamed the former nir intrinsic 'atomic_counter_dec' to 'atomic_counter_pre_dec' for clarification purposes, as it implements a pre-decrement operation as specified for GLSL. From GLSL 4.50 spec, section 8.10, "Atomic Counter Functions": "uint atomicCounterDecrement (atomic_uint c) Atomically 1. decrements the counter for c, and 2. returns the value resulting from the decrement operation. These two steps are done atomically with respect to the atomic counter functions in this table." Added a new nir intrinsic 'atomic_counter_post_dec' which implements a post-decrement operation as required by SPIR-V. v2: (Timothy Arceri) Add extra spec quotes on commit message * Use "post" instead "pos" to avoid confusion with "position" Signed-off-by: Antia Puentes <apuentes@igalia.com> Signed-off-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-07-03 12:41:46 +02:00
Neil Roberts	6677e131b8	nir/linker: Add a pure NIR implementation of the atomic counter linker This is mostly just a straight-forward conversion of link_assign_atomic_counter_resources to C directly using nir variables instead of GLSL IR variables. It is based on the version of link_assign_atomic_counter_resources in `6b8909f2d1`. I’m noting this here to make it easier to track changes and keep the NIR version up-to-date. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-07-03 12:41:46 +02:00
Neil Roberts	1fb9984d7e	nir/types: Add wrappers for a couple of atomic counter methods Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-07-03 12:41:46 +02:00
Alejandro Piñeiro	54d7fca077	spirv/nir: add capability check for SpvCapabilityAtomicStorage Capability that informs if atomic counters are supported. From SPIR-V 1.0 spec, section 3.7, "Storage Class", item 10 from table: (Column "Storage Class"): "AtomicCounter For holding atomic counters. Visible across all functions of the current invocation. Atomic counter-specific memory." (Column "Required Capability"): "AtomicStorage" Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-07-03 12:41:46 +02:00
Alejandro Piñeiro	12301766de	spirv/nir: add atomic counter support on vtn_handle_ssbo_or_shared_atomic So renamed to a more general vtn_handle_atomics Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-07-03 12:41:46 +02:00
Alejandro Piñeiro	c3eb0ba0ff	spirv/nir: initialize offset on the nir var at vtn_create_variable This is convenient when dealing with atomic counter uniforms. The alternative would be doing that at vtn_handle_atomics. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-07-03 12:41:46 +02:00
Antia Puentes	4110bc4c17	nir/spirv: Fix atomic counter (multidimensional-)arrays When constructing NIR if we have a SPIR-V uint variable and the storage class is SpvStorageClassAtomicCounter, we store as NIR's glsl_type an atomic_uint to reflect the fact that the variable is an atomic counter. However, we were tweaking the type only for atomic_uint scalars, we have to do it as well for atomic_uint arrays and atomic_uint arrays of arrays of any depth. Signed-off-by: Antia Puentes <apuentes@igalia.com> Signed-off-by: Alejandro Piñeiro <apinheiro@igalia.com> v2: update after deref patches got pushed (Alejandro Piñeiro) v3: simplify repair_atomic_type (suggested by Timothy Arceri, included on the patch by Alejandro) Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-07-03 12:41:46 +02:00
Alejandro Piñeiro	480d2c56b3	spirv/nir: tweak nir type when storage class is SpvStorageClassAtomicCounter GLSL types differentiates uint from atomic uint. On SPIR-V the type is uint, and the variable has a specific storage class. So we need to tweak the type based on the storage class. Ideally we would like to get the proper type at vtn_handle_type, but we don't have the storage class at that moment. We tweak only the nir type, as is the one that really requires it. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-07-03 12:41:46 +02:00
Alejandro Piñeiro	88d3325a44	nir_types: add glsl_atomic_uint_type() helper Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-07-03 12:41:46 +02:00
Alejandro Piñeiro	c6230b9358	spirv/nir: add offset at vtn_variable Also initialize it on var_decoration_cb This is equivalent to nir_variable.offset, used to store the location an atomic counter is stored at. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-07-03 12:37:32 +02:00
Alejandro Piñeiro	768c275deb	spirv/nir: SpvStorageClassAtomicCounter support on vtn_storage_class_to_mode Atomic Counters are uniforms per spec. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-07-03 12:37:32 +02:00
Alejandro Piñeiro	a9e6298727	nir/linker: handle uniforms without explicit location ARB_gl_spirv points that uniforms in general need explicit location. But there are still some cases of uniforms without location, like for example uniform atomic counters. Those doesn't have a location from the OpenGL point of view (they are identified with a binding and offset), but Mesa internally assigns it a location. Signed-off-by: Eduardo Lima <elima@igalia.com> Signed-off-by: Alejandro Piñeiro <apinheiro@igalia.com> Signed-off-by: Neil Roberts <nroberts@igalia.com> v2: squash with another patch, minor variable name tweak (Timothy Arceri) Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-07-03 12:37:32 +02:00
Alejandro Piñeiro	b0712df6cf	compiler/glsl: refactor empty_uniform_block utilities to linker_util This includes: * Move the defition of empty_uniform_block to linker_util.h * Move find_empty_block (with a rename) to linker_util.h * Refactor some code at linker.cpp to a new method at linker_util.h (link_util_update_empty_uniform_locations) So all that code could be used by the GLSL linker and the NIR linker used for ARB_gl_spirv. v2: include just "ir_uniform.h" (Timothy Arceri) Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-07-03 12:37:32 +02:00
Ian Romanick	995d993710	i965/vec4: Don't cmod propagate from CMP to ADD if the writemask isn't compatible Otherwise we can incorrectly cmod propagate in situations like add(8) g10<1>.xD g2<0>.xD -16D ... cmp.ge.f0(8) null<1>D g2<0>.xD 16D ... (+f0) sel(8) g21<1>.xyUD g14<4>.xyyyUD g18<4>.xyyyUD Sadly, this change hurts quite a few shaders. v2: Refactor writemask compatibility check into a separate function. Suggested by Caio. Ivy Bridge and Haswell had similar results. (Haswell shown) total instructions in shared programs: 12968489 -> 12968738 (<.01%) instructions in affected programs: 60679 -> 60928 (0.41%) helped: 0 HURT: 249 HURT stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 HURT stats (rel) min: 0.22% max: 0.81% x̄: 0.46% x̃: 0.44% 95% mean confidence interval for instructions value: 1.00 1.00 95% mean confidence interval for instructions %-change: 0.44% 0.48% Instructions are HURT. total cycles in shared programs: 409171965 -> 409172317 (<.01%) cycles in affected programs: 260056 -> 260408 (0.14%) helped: 0 HURT: 176 HURT stats (abs) min: 2 max: 2 x̄: 2.00 x̃: 2 HURT stats (rel) min: 0.04% max: 0.34% x̄: 0.17% x̃: 0.17% 95% mean confidence interval for cycles value: 2.00 2.00 95% mean confidence interval for cycles %-change: 0.16% 0.18% Cycles are HURT. Sandy Bridge total instructions in shared programs: 10423577 -> 10423753 (<.01%) instructions in affected programs: 40667 -> 40843 (0.43%) helped: 0 HURT: 176 HURT stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 HURT stats (rel) min: 0.29% max: 0.79% x̄: 0.48% x̃: 0.42% 95% mean confidence interval for instructions value: 1.00 1.00 95% mean confidence interval for instructions %-change: 0.46% 0.51% Instructions are HURT. total cycles in shared programs: 146097503 -> 146097855 (<.01%) cycles in affected programs: 503990 -> 504342 (0.07%) helped: 0 HURT: 176 HURT stats (abs) min: 2 max: 2 x̄: 2.00 x̃: 2 HURT stats (rel) min: 0.02% max: 0.36% x̄: 0.12% x̃: 0.11% 95% mean confidence interval for cycles value: 2.00 2.00 95% mean confidence interval for cycles %-change: 0.11% 0.13% Cycles are HURT. No changes on any other platforms. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Fixes: `cd635d149b` i965/vec4: Propagate conditional modifiers from compares to adds Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2018-07-02 19:19:16 -07:00
Ian Romanick	fb6dc8e894	intel/compiler: Silence unused parameter warnings brw_nir.c src/intel/compiler/brw_nir.c: In function ‘brw_nir_lower_vue_outputs’: src/intel/compiler/brw_nir.c:464:32: warning: unused parameter ‘is_scalar’ [-Wunused-parameter] bool is_scalar) ^~~~~~~~~ src/intel/compiler/brw_nir.c: In function ‘lower_bit_size_callback’: src/intel/compiler/brw_nir.c:610:57: warning: unused parameter ‘data’ [-Wunused-parameter] lower_bit_size_callback(const nir_alu_instr alu, void data) ^~~~ Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2018-07-02 16:17:19 -07:00
Kenneth Graunke	8e38947f6c	i965: Fix BRW_NEW_NUM_SAMPLES to be in .brw, not .mesa This is the wrong kind of dirty bit. Caught by GCC warnings, due to 64-bit values being truncated to 32 bits. Fixes: `b95b0e2918` (intel/anv,blorp,i965: Implement the SKL 16x MSAA SIMD32 workaround) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-07-02 15:30:21 -07:00
Jason Ekstrand	afa8f58921	anv: Add support for the on-disk shader cache The Vulkan API provides a mechanism for applications to cache their own shaders and manage on-disk pipeline caching themselves. Generally, this is what I would recommend to application developers and I've resisted implementing driver-side transparent caching in the Vulkan driver for a long time. However, not all applications do this and, for some use-cases, it's just not practical. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-07-02 14:52:05 -07:00
Jason Ekstrand	e0f7a3aa5b	anv/pipeline_cache: Add a _locked suffix to a function Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-07-02 13:07:06 -07:00
Jason Ekstrand	f5c38f4a30	anv: Add device-level helpers for searching for and uploading kernels Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-07-02 13:07:06 -07:00
Jason Ekstrand	eae192bf5f	anv/pipeline: Stop optimizing for not having a cache Before, we were only hashing the shader if we had a shader cache to cache things in. This means that if we ever get it wrong, we could end up trying to cache a shader with an undefined hash. Since not having a shader cache is an extremely uncommon case, let's optimize for code clarity and obvious correctness over avoiding a hash operation. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-07-02 13:07:06 -07:00

1 2 3 4 5 ...

103291 commits