fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-16 09:48:16 +02:00

Author	SHA1	Message	Date
Kenneth Graunke	1aa28f3509	i965: Make opt_vector_float() only handle non-type-conversion MOVs. We don't handle this properly - we'd have to perform the type conversion before trying to convert the value to a VF. While we could do that, it doesn't seem particularly useful - most vector loads should be consistently typed (all float or all integer). As a special case, we do allow type-converting MOVs of integer 0, as it's represented the same regardless of the type. I believe this case does actually come up. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2016-04-20 15:05:13 -07:00
Kenneth Graunke	2a25a5142b	i965: Fold vectorize_mov() back into the one caller. After the previous patch, this helper is only called in one place. So, just fold it back in - there are a lot of parameters here and not much code. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2016-04-20 15:05:13 -07:00
Kenneth Graunke	9967561158	i965: Rework opt_vector_float() control flow. This reworks opt_vector_float() so that there's only one place that flushes out any accumulated state and emits a VF. v2: Don't break the sequence for non-representable numbers - just skip recording their values. Only break it for non-MOVs or register changes. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2016-04-20 15:05:13 -07:00
Jason Ekstrand	50018522d2	anv: s/anv_batch_emit_blk/anv_batch_emit/ Acked-by: Kristian Høgsberg <krh@bitplanet.net>	2016-04-20 14:54:09 -07:00
Jason Ekstrand	0a45395902	anv: Remove the old emit macro Acked-by: Kristian Høgsberg <krh@bitplanet.net>	2016-04-20 14:54:09 -07:00
Jason Ekstrand	86c52bc757	anv/gen7_pipeline: Use the new emit macro Acked-by: Kristian Høgsberg <krh@bitplanet.net>	2016-04-20 14:54:09 -07:00
Jason Ekstrand	744e133431	anv/gen7_cmd_buffer: Use the new emit macro Acked-by: Kristian Høgsberg <krh@bitplanet.net>	2016-04-20 14:54:09 -07:00
Jason Ekstrand	cae2f14947	anv/device: Use the new emit macro Acked-by: Kristian Høgsberg <krh@bitplanet.net>	2016-04-20 14:54:09 -07:00
Jason Ekstrand	932c353592	anv/state: Use the new emit macro Acked-by: Kristian Høgsberg <krh@bitplanet.net>	2016-04-20 14:54:09 -07:00
Jason Ekstrand	9e9f3f4e71	anv/gen8_pipeline: Use the new emit macro Acked-by: Kristian Høgsberg <krh@bitplanet.net>	2016-04-20 14:54:09 -07:00
Jason Ekstrand	dba3727bea	anv/genX_pipeline: Use the new emit macro Acked-by: Kristian Høgsberg <krh@bitplanet.net>	2016-04-20 14:54:09 -07:00
Jason Ekstrand	a48f8340d9	anv/gen8_cmd_buffer: Use the new emit macro Acked-by: Kristian Høgsberg <krh@bitplanet.net>	2016-04-20 14:54:09 -07:00
Jason Ekstrand	8a6ced83e9	anv/cmd_buffer: Use the new emit macro for quaries Acked-by: Kristian Høgsberg <krh@bitplanet.net>	2016-04-20 14:54:09 -07:00
Jason Ekstrand	db25e1eec5	anv/cmd_buffer: Use the new emit macro for DRAWING_RECTANGLE Acked-by: Kristian Høgsberg <krh@bitplanet.net>	2016-04-20 14:54:09 -07:00
Jason Ekstrand	deb13870d8	anv/cmd_buffer: Use the new emit macro for compute shader dispatch Acked-by: Kristian Høgsberg <krh@bitplanet.net>	2016-04-20 14:54:09 -07:00
Jason Ekstrand	06fc7fa684	anv/cmd_buffer: Use the new emit macro for 3DSTATE_CONSTANT Acked-by: Kristian Høgsberg <krh@bitplanet.net>	2016-04-20 14:54:09 -07:00
Jason Ekstrand	a71ded0e18	anv/cmd_buffer: Use the new emit macro for DEPTH/STENCIL_BUFFER Acked-by: Kristian Høgsberg <krh@bitplanet.net>	2016-04-20 14:54:09 -07:00
Jason Ekstrand	56453eeaff	anv/cmd_buffer: Use the new emit macro for PIPE_CONTROL and STATE_BASE_ADDRESS Acked-by: Kristian Høgsberg <krh@bitplanet.net>	2016-04-20 14:54:09 -07:00
Jason Ekstrand	1d4d6852b4	anv/cmd_buffer: Use the new emit macro for 3DPRIMITIVE commands Acked-by: Kristian Høgsberg <krh@bitplanet.net>	2016-04-20 14:54:09 -07:00
Jason Ekstrand	64ad2d3bcd	anv: Add a new block-based batch emit macro This new macro uses a for loop to create an actual code block in which to place the macro setup code. One advantage of this is that you syntatically use braces instead of parentheses. Another is that the code in the block doesn't even get executed if anv_batch_emit_dwords fails. Acked-by: Kristian Høgsberg <krh@bitplanet.net>	2016-04-20 14:54:09 -07:00
Samuel Pitoiset	d30768025a	gk110/ir: make use of IMUL32I for all immediates Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "11.1 11.2" <mesa-stable@lists.freedesktop.org>	2016-04-20 22:55:36 +02:00
Samuel Pitoiset	17a37c78fc	gk110/ir: do not overwrite def value with zero for EXCH ops This is only valid for other atomic operations (including CAS). This fixes an invalid opcode error from dmesg. While we are it, make sure to initialize global addr to 0 for other atomic operations. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "11.1 11.2" <mesa-stable@lists.freedesktop.org>	2016-04-20 22:55:33 +02:00
Marcin Ślusarz	3caf2e89aa	anv: fix build without Wayland platform Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-20 11:12:10 -07:00
Laurent Carlier	6c952d8ac7	anv: fix building on i686 with -mcpu=generic mcpu=generic doesn't enable sse2, and anvil definitly needs it Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-20 10:48:11 -07:00
Jason Ekstrand	2ef7aef322	spirv: Trivially handle the NonWriteable decoration Signed-off-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-20 10:33:23 -07:00
Connor Abbott	b6dc940ec2	nir: rename nir_foreach_block() to nir_foreach_block_call() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-20 09:47:05 -07:00
Samuel Pitoiset	7143068296	nvc0: avoid tex read fault from compute shaders on GK110 After some investigation, it seems like that disabling the UNK02C4 command avoid a read fault with texelFetch() from a compute shader. I have no clue on what this method actually does, but this avoid the GPU to hang with basic-texelFetch.shader_test without introducing any compute-related regressions. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Ilia Mirkin <imirkin@alum.mit.edu>	2016-04-20 18:28:47 +02:00
Jason Ekstrand	87a4fb516e	i965/vec4: Always split uniforms in array_access_to_pull_constants Normally, we split uniforms at the end but in Vulkan, we bail because we don't want pull constants. However, we still need them split because pack_uniforms relies on it. I really don't like this patch not because it doesn't work (it does) but because now that we're using MOV_INDIRECT, uniform numbers and sizes don't really matter anymore. In the FS backend, uniform splitting and packing is handled all at once (actual re-assignment of locations happens later) and we really should do it that way in vec4 eventually as well. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=94998 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=95001	2016-04-20 09:15:01 -07:00
Jason Ekstrand	b3f43822c7	i965/vec4: Use the correct offset for the swizzle shift in push constants This was actually caught by Ken in review the first time around but somehow didn't get fixed before the patches were pushed. :-( Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=94998 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=95001	2016-04-20 09:15:01 -07:00
Jason Ekstrand	9f16e170fe	i965/vec4: Use nir_intrinsic_base in the load_uniform implementation We shouldn't be reading the const_index directly Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=94998 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=95001	2016-04-20 09:15:01 -07:00
Jason Ekstrand	f63a95080f	anv/apply_dynamic_offsets: Provide a range on the load_uniform Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=94998 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=95001	2016-04-20 09:14:58 -07:00
Jason Ekstrand	35b758c378	anv/lower_push_constants: Stop treating scalar specially All of the code that did something special based on vec4 vs. scalar is bogus. In the backend, everything is now in units of bytes and the vec4 backend can handle full std140 packing so we don't need to do anything special anymore. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=94998	2016-04-20 09:14:47 -07:00
Tim Rowley	3bbe8a09ea	swr: fix resource backed constant buffers Code was using an incorrect address for the base pointer. v2: use swr_resource_data() utility function. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=94979 Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com> Tested-by: Markus Wick <markus@selfnet.de>	2016-04-20 09:57:55 -05:00
Hans de Goede	2ac2ecdd6c	nouveau: codegen: Add support for OpenCL global memory buffers Add support for OpenCL global memory buffers, note this has only been tested with regular load and stores and likely needs more work for e.g. atomic ops. Tested with piglet on a gf119 and a gk107: ./piglit run -o shader -t '.arb_shader_storage_buffer_object.' results/shader [9/9] pass: 9 / ./piglit run -o shader -t '.arb_compute_shader.' results/shader [20/20] skip: 4, pass: 16 \| Signed-off-by: Hans de Goede <hdegoede@redhat.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2016-04-20 13:46:03 +02:00
Hans de Goede	61d52a5fb9	nouveau: codegen: Use FILE_MEMORY_BUFFER for buffers Some of the lowering steps we currently do for FILE_MEMORY_GLOBAL only apply to buffers, making it impossible to use FILE_MEMORY_GLOBAL for OpenCL global buffers. This commits changes the buffer code to use FILE_MEMORY_BUFFER at the ir_from_tgsi and lowering steps, freeing use of FILE_MEMORY_GLOBAL for use with OpenCL global buffers. Note that after lowering buffer accesses use the FILE_MEMORY_GLOBAL register file. Tested with piglet on a gf119 and a gk107: ./piglit run -o shader -t '.arb_shader_storage_buffer_object.' results/shader [9/9] pass: 9 / ./piglit run -o shader -t '.arb_compute_shader.' results/shader [20/20] skip: 4, pass: 16 \| Signed-off-by: Hans de Goede <hdegoede@redhat.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2016-04-20 13:46:03 +02:00
Jose Fonseca	f02f4d09ce	scons: Build dri_common_interop.c.	2016-04-20 12:41:24 +01:00
Marek Olšák	4fa3d35cc5	st/dri: implement the GL interop DRI extension (v2.2) v2: - set interop_version - simplify the offset_after macro v2.1: - use version numbers, remove offset_after - set "out_driver_data_written" v2.2: - set buf_offset & buf_size for GL_ARRAY_BUFFER too - add whandle.offset to buf_offset - disable the minmax cache for GL_TEXTURE_BUFFER	2016-04-20 12:18:47 +02:00
Marek Olšák	37d3a26bd6	glx: implement GLX part of interop interface (v2) v2: - use const	2016-04-20 12:18:47 +02:00
Marek Olšák	b6eda70843	egl: implement EGL part of interop interface (v2) v2: - use const	2016-04-20 12:18:47 +02:00
Marek Olšák	5e9ed261ed	dri_interface: add interface for GL interop with other APIs (v2) v2: - use const	2016-04-20 12:18:47 +02:00
Marek Olšák	6eeb729490	include/GL: add mesa_glinterop.h for OpenGL-OpenCL interop (v4.2) v2: - use "enum" to define stuff v3: - more comments, define MESA_GLINTEROP_UNSUPPORTED v4: - add mesa_glinterop_device_info::interop_version - more comments - remove #define MESA_GLINTEROP_VERSION - use const for "in" v4.1: - use version numbers for structures - add "out_driver_data_written" v4.2: - buf_offset & buf_size affect GL_ARRAY_BUFFER too, this is required for sharing suballocations within a larger buffer	2016-04-20 12:15:41 +02:00
Nicolas Dufresne	8093990ef4	st/dri: Fix RGB565 EGLImage creation When creating egl images we do a bytes to pixel conversion by deviding by 4 regardless of the pixel format. This does not work for RGB565. In this patch, we avoid useless conversion and use proper API when the conversion cannot be avoided. Signed-off-by: Nicolas Dufresne <nicolas.dufresne@collabora.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2016-04-20 17:55:30 +09:00
Nicolas Dufresne	4463f38766	st/dri: Factor out DRI2 to PIPE_FORMAT conversion This code is already duplicated twice and will be useful again. This will also help when adding formats. Signed-off-by: Nicolas Dufresne <nicolas.dufresne@collabora.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2016-04-20 17:34:03 +09:00
Rob Clark	899bd63ace	freedreno/a4xx: lower srgb in shader for astc textures This seems like a hw bug, and maybe only applies to certain a4xx variants/revisions. But setting the SRGB bit in sampler view state (texconst0) causes invalid alpha for ASTC textures. Work around this by doing the srgb->linear conversion in the shader instead. This fixes 392 dEQP tests: dEQP-GLES3.functional.texture.astcsrgb* (The remaining fails seem to be a bug w/ ASTC + linear filtering, also possibly a420.0 specific.) Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-04-19 17:14:04 -04:00
Rob Clark	eddfc97709	nir/lower-tex: add srgb->linear lowering Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-19 17:13:50 -04:00
Rob Clark	eb00a0fc58	nir/builder: const'ify swiz param No need for it not to be const, and lets caller declare it const if desired. Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2016-04-19 17:13:36 -04:00
Rob Clark	52ccc6349f	nir/lower-tex: make options a local var Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-04-19 16:12:49 -04:00
Rob Clark	d4ff42bd0a	freedreno: cleanup fd_set_sampler_views The separate FS/VS entrypoints are no longer used since `a3ed98f`. So just inline them. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-04-19 16:11:47 -04:00
Russell King	fadfaa82c6	tgsi/lowering: improved lowering for LRP Provide an improved lowering for LRP, which can be implemented in two MAD instructions with a bit of rearranging of the equation, rather than the literal implementation of two multiplies, an add and a subtract. Signed-off-by: Russell King <rmk@arm.linux.org.uk> Reviewed-by: Rob Clark <robdclark@gmail.com> Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-04-19 16:04:44 -04:00
Russell King	67da7dd98a	tgsi/lowering: improved lowering for XPD Improve XPD lowering to consume less instructions by using the MAD instruction to perform the multiply and subtraction together. Signed-off-by: Russell King <rmk@arm.linux.org.uk> Reviewed-by: Rob Clark <robdclark@gmail.com> Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-04-19 16:04:44 -04:00

... 39 40 41 42 43 ...

82384 commits