Removes the public symbol _glapi_create_table_from_handle from
libGL.so.1.2.0 on all platforms except Darwin.
Since the symbol is not used on other platforms, it makes sense to
build glapi_gentable.c only on Darwin.
As a side effect, this speeds up the build a bit and reduces the size
of libGL.so.1.2.0 as follows:
size lib/libGL.so.1.2.0 on my system shows:

      text   data   bss     dec    hex  filename
    469211  21848  2720  493779  788d3  lib/libGL.so.1.2.0 before
    420988  11240  2720  434948  6a304  lib/libGL.so.1.2.0 after
A little bit of history:
_glapi_create_table_from_handle was introduced in
commit 85937f4c0d
Author: Jeremy Huddleston <jeremyhu@apple.com>
Date: Thu Jun 9 16:59:49 2011 -0700
glapi: Add API that can create a _glapi_table from a dlfcn handle
Example usage:

    void *handle = dlopen(opengl_library_path, RTLD_LOCAL);
    struct _glapi_table *disp =
        _glapi_create_table_from_handle(handle, "gl");
Signed-off-by: Jeremy Huddleston <jeremyhu@apple.com>
and the only user in mesa was added in
commit f35913b96e
Author: Jeremy Huddleston <jeremyhu@apple.com>
Date: Thu Jun 9 17:29:51 2011 -0700
apple: Use _glapi_create_table_from_handle to initialize our
dispatch table
Signed-off-by: Jeremy Huddleston <jeremyhu@apple.com>
gl_gentable.py was also used for XQuartz in xserver 1.11 - 1.14.
v2: Fix typos in commit message
Add missing XORG_GLAPI_OUTPUTS += \ into src/mapi/glapi/gen/Makefile.am
Add glapi_gentable.c to EXTRA_DIST for inclusion in the release
tarball
v3: Fix commit message: s/gl_gentable.c/glapi_gentable.c/
Reported-by: Arlie Davis <arlied@google.com>
Cc: Jeremy Huddleston <jeremyhu@apple.com>
Signed-off-by: Andreas Boll <andreas.boll.dev@gmail.com>
Reviewed-by: Matt Turner <mattst88@gmail.com>
This patch significantly reduces the size of the libGL.so binary. It does
not change the (externally visible) behavior of libGL.so at all.
gl_gentable.py generates a function, _glapi_create_table_from_handle.
This function allocates a large dispatch table, consisting of 1300 or so
function pointers, and fills this dispatch table by doing symbol lookups
on a given shared library. Previously, gl_gentable.py would generate a
single, very large _glapi_create_table_from_handle function, with a short
cluster of lines for each entry point (function). The idiom it
generated was a NULL check, a call to snprintf, a call to
dlsym / GetProcAddress, and then a store into the dispatch table. Since
this function processes some 1300 entry points, that code was
duplicated many times over.
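Each entry point's cluster looked roughly like this (a hedged
reconstruction of the generated C; the exact identifiers in
glapi_gentable.c may differ):

    if (!disp->ClearColor) {
        void **procp = (void **) &disp->ClearColor;
        snprintf(symboln, sizeof(symboln), "%sClearColor", symbol_prefix);
        *procp = dlsym(handle, symboln);
    }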
We can encode the same information much more compactly by using a lookup
table. The previous total size of _glapi_create_table_from_handle on x64
was 125848 bytes. By using a lookup table, the size of
_glapi_create_table_from_handle (and the related lookup tables) is reduced
to 10840 bytes. In other words, this enormous function is reduced by 91%.
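A hedged sketch of the lookup-table encoding (identifiers are
illustrative, not the exact ones emitted into glapi_gentable.c): the
entry-point names are packed into a single string table, and one short
loop performs all the symbol lookups:

    #include <stdint.h>
    #include <stdio.h>
    #include <dlfcn.h>

    /* all entry-point names, concatenated and NUL-separated */
    static const char _entrypoint_names[] =
        "ClearColor\0"
        "ClearDepth\0";   /* ... ~1300 more in the generated file ... */

    /* byte offset of each name within _entrypoint_names */
    static const uint16_t _entrypoint_offsets[] = { 0, 11 };

    #define NUM_ENTRYPOINTS \
        (sizeof(_entrypoint_offsets) / sizeof(_entrypoint_offsets[0]))

    static void
    fill_table(void *handle, const char *symbol_prefix, void **disp)
    {
        char symboln[128];
        for (unsigned i = 0; i < NUM_ENTRYPOINTS; i++) {
            snprintf(symboln, sizeof(symboln), "%s%s", symbol_prefix,
                     _entrypoint_names + _entrypoint_offsets[i]);
            disp[i] = dlsym(handle, symboln);
        }
    }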
The size of the entire libGL.so binary (measured when stripped) itself drops
by 15%.
So the purpose of this change is to reduce the binary size, which frees up
disk space, memory, etc.
size lib/libGL.so.1.2.0 on my system shows (Andreas):

      text   data   bss     dec    hex  filename
    565947  11256  2720  579923  8d953  lib/libGL.so.1.2.0 before
    469211  21848  2720  493779  788d3  lib/libGL.so.1.2.0 after
v2: Incorporate Matt's feedback.
Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>
Reviewed-by: Jeremy Huddleston Sequoia <jeremyhu@apple.com>
Tested-by: Jeremy Huddleston Sequoia <jeremyhu@apple.com>
Signed-off-by: Andreas Boll <andreas.boll.dev@gmail.com>
This allows us to first generate atomic operations for shared
variables using these opcodes, and then lower them to the shared
atomics intrinsics later with nir_lower_io.
Signed-off-by: Jordan Justen <jordan.l.justen@intel.com>
Previously we received shared variable accesses via a lowered
intrinsic function from GLSL. This change allows us to send in
variables instead, for example when converting from SPIR-V.
Signed-off-by: Jordan Justen <jordan.l.justen@intel.com>
Take reading shader outputs into account, and use setFlagsDef for the
carry, since we rely on i->flagsDef being set.
Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>
Doing that is clearly a bug. We can't quite assert, as st/mesa may hit
this, but at least increase its visibility a bit.
(For the non-refcounted objects it would be illegal too, but we can't
detect that unless we stored the context ourselves. Plus, those don't
tend to cause random crashes at context or object destruction time...
So just sampler views, surfaces and stream-output targets for now.)
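A minimal sketch of the kind of check this enables, assuming it is
compiled inside gallium where debug_printf and the pipe types are
available (the helper name is hypothetical); sampler views record
their creating context, so a mismatch at destruction time can be
flagged:

    /* Hypothetical helper: warn when a sampler view is destroyed on a
     * context other than the one that created it. */
    static void
    warn_on_context_mismatch(struct pipe_context *ctx,
                             struct pipe_sampler_view *view)
    {
        if (view->context != ctx)
            debug_printf("destroying sampler view with a context "
                         "other than the one it was created with\n");
    }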
Reviewed-by: Jose Fonseca <jfonseca@vmware.com>
I removed this mistakenly in 2dbc20e456. I actually thought it should
not be necessary, and a piglit run didn't show any differences, but
this change shouldn't have been in there.
draw_prepare_shader_outputs() is in fact dependent on NEW_RASTERIZER.
The new polygon-mode-facing test indeed shows why this is necessary:
there are lots of invalid reads and writes under valgrind (and crashes
without valgrind), because the pre-pipeline vertex size doesn't match
the post-pipeline vertex size. (Note this won't help much with stages
which don't have the prepare hook but can grow the vertex size, in
particular the wide point stage, but that one isn't used by llvmpipe.)
The test still won't pass, of course, but now it only uses
uninitialized values, which is much less dangerous...
(Albeit I'm pretty sure for i915 it really is not needed anymore as it
doesn't care about the extra outputs and doesn't call
draw_prepare_shader_outputs().)
Reviewed-by: Jose Fonseca <jfonseca@vmware.com>
This patch moves uniform location calculation to happen during
link_uniforms; this is possible with the help of the UniformRemapTable,
which holds all the reserved locations.
Location assignment for implicit locations is changed so that we also
utilize the 'holes' that explicit uniform location assignment might
have left in the UniformRemapTable. This makes it possible to fit more
uniforms; previously we were lazy here and wasted space.
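A hedged sketch of the hole scan (names and types are illustrative,
not the exact ones in the linker):

    /* Find `size` consecutive unused slots in the remap table, i.e. a
     * hole left behind by explicit location assignment. Returns the
     * first slot of the hole, or -1 if no hole is big enough. */
    static int
    find_empty_block(struct gl_uniform_storage **remap,
                     unsigned num_entries, unsigned size)
    {
       unsigned run = 0;
       for (unsigned i = 0; i < num_entries; i++) {
          run = remap[i] ? 0 : run + 1;
          if (run == size)
             return i - size + 1;
       }
       return -1; /* no hole found; append past the end instead */
    }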
Fixes the following CTS tests:
ES31-CTS.explicit_uniform_location.uniform-loc-mix-with-implicit-max
ES31-CTS.explicit_uniform_location.uniform-loc-mix-with-implicit-max-array
v2: code cleanups, increment NumUniformRemapTable correctly, fix
find_empty_block to work properly and add some more comments.
Signed-off-by: Tapani Pälli <tapani.palli@intel.com>
Reviewed-by: Marta Lofstedt <marta.lofstedt@intel.com>
Fixes a piglit regression after the fixes to the duplicate layout
rules. Previously, catching multiple layouts relied on the code meant
to catch duplicates within a single layout(...); this change triggers
the rules for multiple layouts.
Cc: Mark Janes <mark.a.janes@intel.com>
Reviewed-by: Francisco Jerez <currojerez@riseup.net>
Phi handling is somewhat intrinsically tied to the CFG. Moving it here
makes that a bit easier to handle. In particular, we can now do SSA
repair after the phi-node second pass. This fixes 6 CTS tests.
This is a port of Matt's GLSL IR lowering pass to NIR. It's required
because we translate SPIR-V directly to NIR, bypassing GLSL IR.
I haven't introduced a lower_ldexp flag, as I believe all current NIR
consumers would set the flag. i965 wants this, vc4 doesn't implement
this feature, and st_glsl_to_tgsi currently lowers ldexp
unconditionally anyway.
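For reference, the core of the lowering is ldexp(x, exp) = x * 2^exp,
computed by adjusting the exponent field of x directly instead of
calling a libm routine. A minimal sketch of the idea in C (the real
pass also clamps the exponent and handles zero, infinity, NaN and
denormals, all omitted here):

    #include <stdint.h>
    #include <string.h>

    static float
    ldexp_sketch(float x, int exp)
    {
        uint32_t bits;
        memcpy(&bits, &x, sizeof(bits));

        unsigned biased = (bits >> 23) & 0xff;
        if (biased == 0 || biased == 0xff)
            return x; /* zero/denormal/inf/NaN: the real pass handles these */

        /* Add exp to the biased exponent field; assumes the result
         * stays within the normal float range. */
        bits += (uint32_t) exp << 23;
        memcpy(&x, &bits, sizeof(bits));
        return x;
    }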
Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>
struct.pack('i', val) interprets `val` as a signed 32-bit integer, and
raises struct.error if `val` > INT_MAX. For larger constants, we need
to use 'I', which interprets the value as unsigned.
This patch makes us use 'I' for all values >= 0, and 'i' for negative
values. This should work in all cases.
Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>