fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-06 13:48:06 +02:00

Author	SHA1	Message	Date
Nanley Chery	97f4efd573	mesa/macros: add power-of-two assertions for alignment macros ALIGN and ROUND_DOWN_TO both require that the alignment value passed into the macro be a power of two in the comments. Using software assertions verifies this to be the case. v2: use static inline functions instead of gcc-specific statement expressions (Brian). v3: fix indendation (Brian). v4: add greater than zero requirement (Anuj). Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Signed-off-by: Nanley Chery <nanley.g.chery@intel.com>	2015-08-26 14:36:43 -07:00
Nanley Chery	8b1f008e9a	i965/surface_formats: add support for 2D ASTC surface formats Define two-thirds of the 2D Intel ASTC surface formats (LDR-only). This allows a 1-to-1 mapping from the mesa format to the Intel format. ASTC textures will default to being processed in LDR mode. If there is hardware support for HDR/Full mode and the texture is not sRGB, add the format bit necessary to process it in HDR/Full mode. v2: remove extra newlines. v3: follow existing coding style in translate_tex_format(). v4: expound on the GEN9_SURFACE_ASTC_HDR_FORMAT_BIT comment. update SF table - ASTC is actually supported in Gen8. v5: conform the ASTC MESA_FORMAT enums to the existing naming convention. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Signed-off-by: Nanley Chery <nanley.g.chery@intel.com>	2015-08-26 14:36:43 -07:00
Nanley Chery	cd49b97a8a	mesa/teximage: return the base internal format of the ASTC formats This is necesary to initialize the gl_texture_image struct. From the KHR_texture_compression_astc_ldr spec: "Added to Section 3.8.6, Compressed Texture Images Add the tokens specified above to Table 3.16, Compressed Internal Formats. In all cases, the base internal format will be RGBA. The encoding allows images to be encoded with fewer channels, but this is always presented as RGBA to the sampler." v2. use _mesa_is_astc_format(). Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Signed-off-by: Nanley Chery <nanley.g.chery@intel.com>	2015-08-26 14:36:43 -07:00
Nanley Chery	12b519b457	mesa/teximage: accept ASTC formats for 3D texture specification The ASTC spec was revised as follows: Revision 2, April 28, 2015 - added CompressedTex{Sub,}Image3D to commands accepting ASTC format tokens in the New Tokens section [...]. Support only exists in the HDR submode: Add a second new column "3D Tex." which is empty for all non-ASTC formats. If only the LDR profile is supported by the implementation, this column is also empty for all ASTC formats. If both the LDR and HDR profiles are supported only, this column is checked for all ASTC formats. LDR-only systems should generate an INVALID_OPERATION error when attempting to call CompressedTexImage3D with the TEXTURE_3D target. v2. return the proper error for LDR-only systems. v3. update is_astc_format(). v4. use _mesa_is_astc_format(). v5. place logic in _mesa_target_can_be_compressed. v6. fix issues handling ASTC formats. Reviewed-by: Chad Versace <chad.versace@intel.com> Signed-off-by: Nanley Chery <nanley.g.chery@intel.com>	2015-08-26 14:36:43 -07:00
Nanley Chery	23c9cd5a96	mesa/texcompress: enable translation between MESA and GL ASTC formats v3. conform the ASTC MESA_FORMAT enums to the existing naming convention. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Signed-off-by: Nanley Chery <nanley.g.chery@intel.com>	2015-08-26 14:36:43 -07:00
Nanley Chery	692578ed13	mesa/glformats: recognize ASTC formats as compressed Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Signed-off-by: Nanley Chery <nanley.g.chery@intel.com>	2015-08-26 14:36:42 -07:00
Nanley Chery	4143511b15	mesa: add ASTC extensions to the extensions table v2: alphabetize the extensions. remove OES ASTC extension. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Signed-off-by: Nanley Chery <nanley.g.chery@intel.com>	2015-08-26 14:36:42 -07:00
Nanley Chery	582ce1ea97	mesa: don't enable online compression for ASTC formats In agreement with the ASTC spec, this makes calls to TexImageD unsuccessful. Implied by the spec, Generate[Texture]Mipmap and [Copy]Tex[Sub]ImageD calls must be unsuccessful as well. v2. actually force attempts to compress online to fail. v3. indentation (Matt). v4. update copytexture_error_check to account for CopyTexImage*D (Chad). Reviewed-by: Chad Versace <chad.versace@intel.com> Signed-off-by: Nanley Chery <nanley.g.chery@intel.com>	2015-08-26 14:36:42 -07:00
Nanley Chery	e9fd8e154f	glapi: add support for KHR_texture_compression_astc_ldr v2: correct the spelling of the sRGB variants. remove spaces around "=" when setting the enum value. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Signed-off-by: Nanley Chery <nanley.g.chery@intel.com>	2015-08-26 14:36:42 -07:00
Nanley Chery	8ae37365f3	mesa/formats: define the 2D ASTC formats Define the mesa formats and make changes necessary for compilation without errors. Also add support for _mesa_get_srgb_format_linear(). v2. conform the ASTC MESA_FORMAT enums to the existing naming convention. v3. remove ASTC cases for _mesa_get_uncompressed_format(). This function is only used for generating mipmaps - something ASTC formats do not support due to lack of online compression. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Signed-off-by: Nanley Chery <nanley.g.chery@intel.com>	2015-08-26 14:36:42 -07:00
Ilia Mirkin	c4cbaca327	nouveau: avoid build failures since `0fc21ecf` Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-08-26 14:04:41 -04:00
Marek Olšák	6924ecac77	gallium/radeon: read_registers should return bool meaning success or failure Acked-by: Christian König <christian.koenig@amd.com> Acked-by: Alex Deucher <alexander.deucher@amd.com>	2015-08-26 19:25:20 +02:00
Marek Olšák	16e5d8ad38	radeonsi: add IB parser support for CP DMA packets If the packet encoding is defined in the same format as register definitions, the python script can process them automatically and the parser support becomes trivial. Acked-by: Christian König <christian.koenig@amd.com> Acked-by: Alex Deucher <alexander.deucher@amd.com>	2015-08-26 19:25:19 +02:00
Marek Olšák	2c14a6d3b1	radeonsi: add IB tracing support for debug contexts This adds trace points to all IBs and the parser prints them and also prints which trace points were reached (executed) by the CP. This can help pinpoint a problematic packet, draw call, etc. Acked-by: Christian König <christian.koenig@amd.com> Acked-by: Alex Deucher <alexander.deucher@amd.com>	2015-08-26 19:25:19 +02:00
Marek Olšák	189953ee13	radeonsi: remove old CS tracing code Some of it is left there and it will be re-used in the next commit. Acked-by: Christian König <christian.koenig@amd.com> Acked-by: Alex Deucher <alexander.deucher@amd.com>	2015-08-26 19:25:19 +02:00
Marek Olšák	df6a5666b6	radeonsi: parse and dump status registers on GPU hang GPU hang detection must be enabled by setting: GALLIUM_DDEBUG=[timeout in ms] This may print too much information that we might not understand yet, but some of the bits are very useful. Acked-by: Christian König <christian.koenig@amd.com> Acked-by: Alex Deucher <alexander.deucher@amd.com>	2015-08-26 19:25:19 +02:00
Marek Olšák	61df4f0cd3	radeonsi: add an IB parser Acked-by: Christian König <christian.koenig@amd.com> Acked-by: Alex Deucher <alexander.deucher@amd.com>	2015-08-26 19:25:19 +02:00
Marek Olšák	be6dc87776	radeonsi: save the contents of indirect buffers for debug contexts This will be used by the IB parser. Acked-by: Christian König <christian.koenig@amd.com> Acked-by: Alex Deucher <alexander.deucher@amd.com>	2015-08-26 19:25:19 +02:00
Marek Olšák	a6a6c68955	radeonsi: generate register and packet tables for an IB parser from sid.h This makes writing a good IB parser a lot easier. It generates 2 tables: - packet3 table - register table with all registers, fields, and named values Acked-by: Christian König <christian.koenig@amd.com> Acked-by: Alex Deucher <alexander.deucher@amd.com>	2015-08-26 19:25:19 +02:00
Marek Olšák	d15b71b4bd	radeonsi: remove duplicated register definitions and instruction definitions Instruction encoding isn't needed in Mesa. The border color address registers were duplicated. Acked-by: Christian König <christian.koenig@amd.com> Acked-by: Alex Deucher <alexander.deucher@amd.com>	2015-08-26 19:25:19 +02:00
Marek Olšák	c59ad265df	r600g,radeonsi: remove unused ill-formed register field definitions Acked-by: Christian König <christian.koenig@amd.com> Acked-by: Alex Deucher <alexander.deucher@amd.com>	2015-08-26 19:25:18 +02:00
Marek Olšák	110873ed11	radeonsi: add an initial dump_debug_state implementation dumping shaders This is usually called after a draw call. Acked-by: Christian König <christian.koenig@amd.com> Acked-by: Alex Deucher <alexander.deucher@amd.com>	2015-08-26 19:25:18 +02:00
Marek Olšák	93d97db349	radeonsi: allow si_dump_key to write to a file Acked-by: Christian König <christian.koenig@amd.com> Acked-by: Alex Deucher <alexander.deucher@amd.com>	2015-08-26 19:25:18 +02:00
Marek Olšák	525921ed51	gallium/ddebug: new pipe for hang detection and driver state dumping (v2) v2: lots of improvements This is like identity or trace, but simpler. It doesn't wrap most states. Run with: GALLIUM_DDEBUG=1000 [executable] where "executable" is the app and "1000" is in miliseconds, meaning that the context will be considered hung if a fence fails to signal in 1000 ms. If that happens, all shaders, context states, bound resources, draw parameters, and driver debug information (if any) will be dumped into: /home/$username/dd_dumps/$processname_$pid_$index. Note that the context is flushed after every draw/clear/copy/blit operation and then waited for to find the exact call that hangs. You can also do: GALLIUM_DDEBUG=always to do the dumping after every draw/clear/copy/blit operation without flushing and waiting. Examples of driver states that can be dumped are: - Hardware status registers saying which hw block is busy (hung). - Disassembled shaders in a human-readable form. - The last submitted command buffer in a human-readable form. v2: drop pipe-loader changes, drop SConscript rename dd.h -> dd_pipe.h Acked-by: Christian König <christian.koenig@amd.com> Acked-by: Alex Deucher <alexander.deucher@amd.com>	2015-08-26 19:25:18 +02:00
Marek Olšák	0fc21ecfc0	gallium: add flags parameter to pipe_screen::context_create This allows creating compute-only and debug contexts. Reviewed-by: Brian Paul <brianp@vmware.com> Acked-by: Christian König <christian.koenig@amd.com> Acked-by: Alex Deucher <alexander.deucher@amd.com>	2015-08-26 19:25:18 +02:00
Marek Olšák	7b5c92391f	gallium: add an interface for dumping debug driver state Reviewed-by: Brian Paul <brianp@vmware.com> Acked-by: Christian König <christian.koenig@amd.com> Acked-by: Alex Deucher <alexander.deucher@amd.com>	2015-08-26 19:25:18 +02:00
Ilia Mirkin	a3b617a258	mesa: remove pointless es31 checks, fix indirect to only be in es31 Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2015-08-26 12:37:38 -04:00
Ilia Mirkin	332fb341dd	mesa: uncomment checks in es31 computation, add texture_ms Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Martin Peres <martin.peres@linux.intel.com>	2015-08-26 12:37:17 -04:00
Marek Olšák	f432ae899f	mesa: create multisample fallback textures like normal textures This works if drivers upsample on upload (like all radeon ones do). The alternative is an unexpected GL error from anything calling _mesa_update_state and possibly other issues. Cc: 10.6 11.0 <mesa-stable@lists.freedesktop.org> Reviewed-by: Dave Airlie <airlied@redhat.com>	2015-08-26 15:42:26 +02:00
Grazvydas Ignotas	f8b01ae47c	radeonsi: mark unreachable paths to avoid warnings Otherwise we get: warning: 'num_user_sgprs' may be used uninitialized in this function ... Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2015-08-26 15:42:26 +02:00
Tapani Pälli	e0c2ea0337	mesa: GetTexLevelParameter{if}v changes for OpenGL ES 3.1 Patch refactors existing parameters check to first check common enums between desktop GL and GLES 3.1 and modifies get_tex_level_parameter_image to be compatible with enums specified in 3.1. v2: remove extra is_gles31() checks (suggested by Ilia) Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> (v1) Reviewed-by: Marta Lofstedt <marta.lofstedt@intel.com> (v1) Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-08-26 08:38:25 +03:00
Marta Lofstedt	ae8d0e7abe	mesa/es3.1: Allow GL_COMPUTE_WORK_GROUP_SIZE for OpenGL ES 3.1 According to OpenGL ES specification section 7.12, GL_COMPUTE_WORK_GROUP_SIZE, is supported by the glGetProgramiv function. Signed-off-by: Marta Lofstedt <marta.lofstedt@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2015-08-26 08:25:07 +03:00
Marta Lofstedt	c2a766880d	mesa/es3.1: Enable getting MAX_COMPUTE_WORK_GROUP_ values for OpenGL ES 3.1 According to the OpenGL ES 3.1 specification chapter 17, the MAX_COMPUTE_WORK_GROUP_COUNT and MAX_COMPUTE_WORK_GROUP_SIZE is available for glGetIntegeri_v. Signed-off-by: Marta Lofstedt <marta.lofstedt@linux.intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2015-08-26 08:25:07 +03:00
Dave Airlie	73e5adc4b2	mesa/formats: pass correct parameter to _mesa_is_format_compressed commit `26c549e69d` Author: Nanley Chery <nanley.g.chery@intel.com> Date: Fri Jul 31 10:26:36 2015 -0700 mesa/formats: remove compressed formats from matching function caused a regression in my CTS testing, this looks like a clear thinko. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> sSigned-off-by: Dave Airlie <airlied@redhat.com>	2015-08-26 14:13:27 +10:00
Roland Scheidegger	48e6404c04	gallium/auxiliary: optimize rgb9e5 helper some more I used this as some testing ground for investigating some compiler bits initially (e.g. lrint calls etc.), figured I could do much better in the end just for fun... This is mathematically equivalent, but uses some tricks to avoid doubles and also replaces some float math with ints. Good for another performance doubling or so. As a side note, some quick tests show that llvm's loop vectorizer would be able to properly vectorize this version (which it failed to do earlier due to doubles, producing a mess), giving another 3 times performance increase with sse2 (more with sse4.1), but this may not apply to mesa. No piglit change. Acked-by: Marek Olšák <marek.olsak@amd.com>	2015-08-26 02:57:38 +02:00
Roland Scheidegger	941346a803	gallium/auxiliary: optimize rgb9e5 helper a bit This code (lifted straight from the extension) was doing things the most inefficient way you could think of. This drops some of the more expensive float operations, in particular - int-cast floors (pointless, values always positive) - 2 raised to (signed) integers (replace with simple exponent manipulation), getting rid of a misguided comment in the process (implement with table...) - float division (replace with mul of reverse of those exponents) This is like 3 times faster (measured for float3_to_rgb9e5), though it depends (e.g. llvm is clever enough to replace exp2 with ldexp whereas gcc is not, division is not too bad on cpus with early-exit divs). Note that keeping the double math for now (float x + 0.5), as the results may otherwise differ. Acked-by: Marek Olšák <marek.olsak@amd.com>	2015-08-26 02:57:37 +02:00
Dave Airlie	c1452983b4	mesa/texgetimage: fix missing stencil check GetTexImage can read to stencil8 but only from a stencil or depthstencil textures. This fixes a bunch of failures in CTS GL33-CTS.gtf32.GL3Tests.packed_pixels Reviewed-by: Marek Olšák <marek.olsak@amd.com> Cc: "11.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2015-08-26 10:22:09 +10:00
Nanley Chery	1d2a844e7d	mesa/teximage: Add GL error parameter to _mesa_target_can_be_compressed Enables _mesa_target_can_be_compressed to return the appropriate GL error depending on it's inputs. Use the parameter to return the appropriate GL error for ETC2 formats on GLES3. Suggested-by: Chad Versace <chad.versace@intel.com> Reviewed-by: Chad Versace <chad.versace@intel.com> Signed-off-by: Nanley Chery <nanley.g.chery@intel.com>	2015-08-25 15:53:46 -07:00
Nanley Chery	26c549e69d	mesa/formats: remove compressed formats from matching function All compressed formats return GL_FALSE and there isn't any evidence to support that this behaviour would change. Remove all switch cases for compressed formats. v2. Since the exhaustive switch is removed, add a gtest to ensure all formats are handled. v3. Ensure that GL_NO_ERROR is set before returning. v4. Fix an arg to _mesa_uncompressed_format_to_type_and_comps(); fix formatting and misc improvements (Chad). Reviewed-by: Chad Versace <chad.versace@intel.com> Signed-off-by: Nanley Chery <nanley.g.chery@intel.com>	2015-08-25 15:45:17 -07:00
Nanley Chery	8e581747d2	mesa/formats: make format testing a gtest We currently check that our format info table is sane during context initialization in debug builds. Perform this check during `make check` instead. This enables format testing in release builds and removes the requirement of an exhuastive switch for _mesa_uncompressed_format_to_type_and_comps(). v2. indentation and conditional inclusion fixes (Chad). allow tests to continue running if any format fails and display the failing format name. Reviewed-by: Chad Versace <chad.versace@intel.com> Signed-off-by: Nanley Chery <nanley.g.chery@intel.com>	2015-08-25 15:45:13 -07:00
Kenneth Graunke	1bec29d04d	gallium/ttn: Use nir_builder_insert() rather than poking at cf_list. I intend to remove nir_builder::cf_node_list, so I can't have this code poking at it directly. The proper way is to set the insertion point and then simply insert things there. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2015-08-25 11:12:35 -07:00
Kenneth Graunke	78856194c1	prog_to_nir: Use nir_builder_insert() rather than poking at cf_list. I intend to remove nir_builder::cf_node_list, so I can't have this code poking at it directly. The proper way is to set the insertion point and then simply insert things there. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2015-08-25 11:12:35 -07:00
Kenneth Graunke	5f14c417c8	nir: Use nir_shader::stage rather than passing it around. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2015-08-25 11:12:35 -07:00
Kenneth Graunke	d4d5b430a5	nir: Store gl_shader_stage in nir_shader. This makes it easy for NIR passes to inspect what kind of shader they're operating on. Thanks to Michel Dänzer for helping me figure out where TGSI stores the shader stage information. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2015-08-25 11:12:35 -07:00
Jason Ekstrand	dfacae3a56	i965/fs: Combine assign_constant_locations and move_uniform_array_access_to_pull_constants The comment above move_uniform_array_access_to_pull_constants was completely bogus because it has nothing to do with lowering instructions. Instead, it's assiging locations of pull constants. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-08-25 10:18:27 -07:00
Jason Ekstrand	c999a58f50	nir/lower_io: Remove assign_var_locations_direct_first This is no longer used so we might as well get rid of it. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-08-25 10:18:27 -07:00
Jason Ekstrand	259f7291de	i965/fs: Rework uniform handling Previously, we treated the entire UNIFORM file as if it had two elements: One for direct things and one for indirect. This is substantially different from how the old visitor code handled it where each element was effectively its own uniform. This commit makes the NIR path more like the old ir_visitor path where each uniform is separate. This should allow us to more easily make decisions about what to push. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-08-25 10:18:27 -07:00
Jason Ekstrand	cfa056c6a5	i965/vec4_nir: Get rid of the uniform_driver_location tracking Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-08-25 10:18:27 -07:00
Jason Ekstrand	ce5e9139aa	nir/lower_io: Separate driver_location and base offset for uniforms Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-08-25 10:18:27 -07:00
Jason Ekstrand	0db8e87b4a	nir/intrinsics: Add a second const index to load_uniform In the i965 backend, we want to be able to "pull apart" the uniforms and push some of them into the shader through a different path. In order to do this effectively, we need to know which variable is actually being referred to by a given uniform load. Previously, it was completely flattened by nir_lower_io which made things difficult. This adds more information to the intrinsic to make this easier for us. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-08-25 10:18:27 -07:00

1 2 3 4 5 ...

72353 commits