fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-07 17:58:26 +02:00

Author	SHA1	Message	Date
Antia Puentes	79f1a7ae28	i965/vec4: Fix saturation errors when coalescing registers If the register types do not match and the instruction that contains the final destination is saturated, register coalescing generated non-equivalent code. This did not happen when using IR because types usually matched, but it is visible in nir-vec4. For example, mov vgrf7:D vgrf2:D mov.sat m4:F vgrf7:F is coalesced to: mov.sat m4:D vgrf2:D The patch prevents coalescing in such scenario, unless the instruction we want to coalesce into is a MOV (without type conversion implied). In that case, the patch sets the register types to the type of the final destination. Shader-db results in HSW (only vec4 instructions shown): total instructions in shared programs: 1754415 -> 1754416 (0.00%) instructions in affected programs: 74 -> 75 (1.35%) helped: 0 HURT: 1 GAINED: 0 LOST: 0 Only one extra instruction in one of the shaders, that comes from eliminating a saturation error by preventing register coalesce. Cc: "10.6 11.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-09-14 12:11:46 +02:00
Tapani Pälli	d1bce52e13	docs: cleanups + mark some work as done Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-09-14 09:29:30 +03:00
Ilia Mirkin	f0b9d53262	docs: only astc ldr required for ES3.2, not hdr Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2015-09-14 02:08:42 -04:00
Ilia Mirkin	67d2d3ba43	st/mesa: emit TXQS, support ARB_shader_texture_image_samples The image component of the ext is a no-op since there is no image support in gallium (yet). Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2015-09-13 18:24:45 -04:00
Ilia Mirkin	ec3fe42b3a	r600g: add support for TXQS tgsi opcode Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Glenn Kennard <glenn.kennard@gmail.com>	2015-09-13 18:24:44 -04:00
Ilia Mirkin	4294db90b1	nv50/ir: add support for TXQS tgsi opcode Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-09-13 18:24:44 -04:00
Ilia Mirkin	f46a53ffa5	gallium: add PIPE_CAP_TGSI_TXQS to let st know if TXQS is supported Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Glenn Kennard <glenn.kennard@gmail.com>	2015-09-13 18:24:37 -04:00
Ilia Mirkin	d173c5e77d	tgsi: add a TXQS opcode to retrieve the number of texture samples Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com>	2015-09-13 18:24:01 -04:00
Jordan Justen	c4cf824658	glsl/cs: Initialize gl_LocalInvocationIndex in main() We initialize gl_LocalInvocationIndex based on the extension spec formula: gl_LocalInvocationIndex = gl_LocalInvocationID.z * gl_WorkGroupSize.x * gl_WorkGroupSize.y + gl_LocalInvocationID.y * gl_WorkGroupSize.x + gl_LocalInvocationID.x; https://www.opengl.org/registry/specs/ARB/compute_shader.txt Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2015-09-13 09:53:17 -07:00
Jordan Justen	6823e12d5a	glsl/cs: Exclude gl_LocalInvocationIndex from builtin variable stripping We lower gl_LocalInvocationIndex based on the extension spec formula: gl_LocalInvocationIndex = gl_LocalInvocationID.z * gl_WorkGroupSize.x * gl_WorkGroupSize.y + gl_LocalInvocationID.y * gl_WorkGroupSize.x + gl_LocalInvocationID.x; https://www.opengl.org/registry/specs/ARB/compute_shader.txt We need to set this variable in main(), even if gl_LocalInvocationIndex is not referenced by the shader. (It may be used by a linked shader.) Therefore, we can't eliminate it as a dead variable. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2015-09-13 09:53:16 -07:00
Jordan Justen	2b6cc0395b	glsl/cs: Initialize gl_GlobalInvocationID in main() We initialize gl_GlobalInvocationID based on the extension spec formula: gl_GlobalInvocationID = gl_WorkGroupID * gl_WorkGroupSize + gl_LocalInvocationID https://www.opengl.org/registry/specs/ARB/compute_shader.txt Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Cc: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2015-09-13 09:53:16 -07:00
Jordan Justen	c4d049f646	glsl: Move link_get_main_function_signature to a common location Also rename to _mesa_get_main_function_signature. We will call it near the end of compilation to insert some code into main for initializing some compute shader global variables. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2015-09-13 09:53:16 -07:00
Jordan Justen	34e187ec38	glsl/cs: Don't strip gl_GlobalInvocationID and dependencies We lower gl_GlobalInvocationID based on the extension spec formula: gl_GlobalInvocationID = gl_WorkGroupID * gl_WorkGroupSize + gl_LocalInvocationID https://www.opengl.org/registry/specs/ARB/compute_shader.txt We need to set this variable in main(), even if gl_GlobalInvocationID is not referenced by the shader. (It may be used by a linked shader.) Therefore, we can't eliminate these as dead variables. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2015-09-13 09:53:16 -07:00
Jordan Justen	c5743a5d7f	i965/nir: Support gl_WorkGroupID variable Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-09-13 09:53:16 -07:00
Jordan Justen	4e454cb7c6	i965/cs: Initialize gl_WorkGroupID variable from payload Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-09-13 09:53:16 -07:00
Jordan Justen	4f178f0d8b	nir: Add gl_WorkGroupID system variable Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-09-13 09:53:16 -07:00
Jordan Justen	f5bb5a1bf1	glsl/cs: Add gl_WorkGroupID variable Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-09-13 09:53:16 -07:00
Jordan Justen	49f999b9cb	i965/nir: Support gl_LocalInvocationID variable Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-09-13 09:53:16 -07:00
Jordan Justen	43624361df	i965/cs: Initialize gl_LocalInvocationID from payload Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-09-13 09:53:16 -07:00
Jordan Justen	b94b57f7c5	i965/cs: Initialize gl_LocalInvocationID in push constant data Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-09-13 09:53:16 -07:00
Jordan Justen	c7161a3c35	i965/cs: Reserve local invocation id in payload regs Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2015-09-13 09:53:16 -07:00
Jordan Justen	62e011d593	nir: Add gl_LocalInvocationID variable Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-09-13 09:53:16 -07:00
Jordan Justen	bf8d6e501c	glsl/cs: Add gl_LocalInvocationID variable Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-09-13 09:53:16 -07:00
Krzesimir Nowak	08ceb5e076	softpipe: Change faces type to uint This is to avoid needless float<->int conversions, since all face-related computations are made on integers. Spotted by Emil Velikov. Reviewed-by: Brian Paul <brianp@vmware.com>	2015-09-13 09:50:21 -06:00
Rob Clark	59519c2283	freedreno/ir3: fix compile warn after `1807a08e` New enum to add to switch so compiler doesn't complain. commit `1807a08e4f` Author: Ilia Mirkin <imirkin@alum.mit.edu> AuthorDate: Thu Aug 27 23:05:03 2015 -0400 Commit: Ilia Mirkin <imirkin@alum.mit.edu> CommitDate: Thu Sep 10 17:38:33 2015 -0400 nir: add nir_texop_texture_samples and convert from glsl Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-09-13 11:31:45 -04:00
Rob Clark	bf45a7d28e	freedreno/ir3: fix compile break after `a4aa25be` Following commit dropped the unused memctx arg: commit `a4aa25be1e` Author: Jason Ekstrand <jason.ekstrand@intel.com> AuthorDate: Wed Sep 9 13:24:35 2015 -0700 Commit: Jason Ekstrand <jason.ekstrand@intel.com> CommitDate: Fri Sep 11 09:21:20 2015 -0700 nir: Remove the mem_ctx parameter from ssa_def_rewrite_uses Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-09-13 11:31:30 -04:00
Rob Clark	b88aeff4f5	nir: add nir_channel() to get at single components of vec's Rather than make yet another copy of channel(), let's move it into nir. Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-09-13 11:08:27 -04:00
Rob Clark	86358e949e	tgsi/scan: add support to figure out max nesting depth Sometimes a useful thing for compilers (or, for example, tgsi_to_nir) to know. And pretty trivial for scan to figure this out for us. Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2015-09-13 11:08:27 -04:00
Kai Wasserbäch	d6fbcf6ee2	r600: Fix llvm build since const buffer changes In commit `f9caabe8f1`: One place in r600_llvm.c was forgotten when replacing R600_UCP_CONST_BUFFER with R600_BUFFER_INFO_CONST_BUFFER. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=91985 Signed-off-by: Kai Wasserbäch <kai@dev.carbon-project.org> Signed-off-by: Dave Airlie <airlied@gmail.com>	2015-09-13 07:09:08 +10:00
Jason Ekstrand	1037e0a84f	i965/vec4: Don't reswizzle hardware registers Cc: "11.0 10.6" <mesa-stable@lists.freedesktop.org> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=91719 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-09-12 10:46:26 -07:00
Jason Ekstrand	dd7290cf59	i965/emit: Add assertions for accumulator restrictions Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-09-12 10:46:26 -07:00
Emil Velikov	7852a44e3c	docs: add news item and link release notes for 11.0.0 Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com>	2015-09-12 13:50:33 +01:00
Emil Velikov	c34ed46217	docs: add sha256 checksums for 11.0.0 Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> (cherry picked from commit `c4bae5792b`)	2015-09-12 13:48:15 +01:00
Emil Velikov	09223bfa9b	docs: Update 11.0.0 release notes Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> (cherry picked from commit `4f1e500150`)	2015-09-12 13:48:14 +01:00
Glenn Kennard	ce34048b57	r600: Enable fp64 on chips with native support Cypress/Cayman/Aruba, earlier r6xx/r7xx chips only support a subset of the needed fp64 ops, and don't do GL4 anyway. Signed-off-by: Glenn Kennard <glenn.kennard@gmail.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	2015-09-12 07:32:08 +01:00
Glenn Kennard	d2ca9afd5d	r600g: Support I2D/U2D/D2I/D2U Only for Cypress/Cayman/Aruba, older chips have only partial fp64 support. Uses float intermediate values so only accurate for int24 range, which matches what the blob does. Signed-off-by: Glenn Kennard <glenn.kennard@gmail.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	2015-09-12 07:30:10 +01:00
Dave Airlie	f9caabe8f1	r600g: lower number of driver const buffers I'm going to want a driver constant buffer for tess to coordinate LDS storage, so before I go tackling that I decided to merge the clip/samplepos and texture info buffers into one. So I can steal the spare one. This creates a single constant buffer between the two, with clip/samplepos taking up a reserved 128 bytes at the start. Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> Reviewed-by: Glenn Kennard <glenn.kennard@gmail.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	2015-09-12 06:56:58 +01:00
Dave Airlie	0337a9b2af	r600: define some values for the fetch constant offsets. This just puts these in one place and #defines them. Reviewed-by: Glenn Kennard <glenn.kennard@gmail.com> Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	2015-09-12 06:56:51 +01:00
Thomas Helland	2e7e3fe55f	docs: Update with GLES3.2 entries and status V2: -Change to "not started" for most entries -Add status for multisample_2d_array -Change shader_multisample_interpolation to "not started" V3 (idr): Move the GLES 3.2 section after the "Additional functions" section from GLES 3.1. Note that GL_KHR_texture_compression_astc_hdr is done for i965 on gen9+ hardware. Note that GL_OES_shader_io_blocks is based on some features from GLSL 1.50. Signed-off-by: Thomas Helland <thomashelland90@gmail.com> Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> [v2] Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2015-09-11 18:46:43 -07:00
Krzesimir Nowak	2135aba8d9	softpipe: Constify variables This commit makes a lot of variables constant - this is basically done by moving the computation to variable definition. Some of them are moved into lower scopes (like in img_filter_2d_ewa). Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2015-09-11 15:37:00 -06:00
Krzesimir Nowak	231687c19b	softpipe: Constify sp_tgsi_sampler Add a small inline function doing the casting - this is to make sure we don't do a cast from some completely unrelated type. This commit does not make tgsi_sampler parameters const in vfuncs themselves for now - probably llvmpipe would need looking at before making such a change. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2015-09-11 15:36:54 -06:00
Krzesimir Nowak	ac23116de5	softpipe: Constify sampler and view parameters in mip filters Those functions actually could always take them as constants. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2015-09-11 15:36:47 -06:00
Krzesimir Nowak	ea764baa61	softpipe: Constify sampler and view parameters in img filters Those functions actually could always take them as constants. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2015-09-11 15:36:43 -06:00
Krzesimir Nowak	ba72e6cfb8	tgsi, softpipe: Constify tgsi_sampler in query_lod vfunc A followup from previous commit - since all functions called by query_lod take pointers to const sp_sampler_view and const sp_sampler, which are taken from tgsi_sampler subclass, we can take the tgsi_sampler as const itself now. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2015-09-11 15:36:38 -06:00
Krzesimir Nowak	ea0fecd1a3	softpipe: Constify some sampler and view parameters This is to prepare for making tgsi_sampler parameter in query_lod a const too. These functions do not modify anything in either sampler or view anymore. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2015-09-11 15:36:32 -06:00
Krzesimir Nowak	4ca2896e8e	softpipe: Move the faces array from view to filter_args With that, sp_sampler_view instances are not abused anymore as a local storage, so we can later make them constant. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2015-09-11 15:36:23 -06:00
Jason Ekstrand	ca11c3c0a4	nir/from_ssa: Use instr_rewrite_dest Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2015-09-11 09:21:20 -07:00
Jason Ekstrand	cee29220e3	nir: Add a function for rewriting instruction destinations Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2015-09-11 09:21:20 -07:00
Jason Ekstrand	106a3b2cc3	nir: Only unlink sources that are actually valid Reviewed-by: Thomas Helland <thomashelland90@gmail.com>	2015-09-11 09:21:20 -07:00
Jason Ekstrand	a4aa25be1e	nir: Remove the mem_ctx parameter from ssa_def_rewrite_uses Reviewed-by: Thomas Helland <thomashelland90@gmail.com>	2015-09-11 09:21:20 -07:00
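
Note on 79f1a7ae28 (i965/vec4 saturation fix): the sketch below illustrates why the coalesced form in that commit message is not equivalent to the original pair of instructions. It is a simplified model, not Mesa or hardware code. It assumes that mov.sat with an F destination clamps the value to [0.0, 1.0], and that mov.sat from a D source to a D destination leaves the 32 bits unchanged because the value is already within the signed 32-bit range. All function names are hypothetical.

```python
import struct


def bits_to_float(bits):
    """Reinterpret a 32-bit pattern as an IEEE-754 single-precision float."""
    return struct.unpack("<f", struct.pack("<I", bits & 0xFFFFFFFF))[0]


def float_to_bits(value):
    """Reinterpret a single-precision float as its 32-bit pattern."""
    return struct.unpack("<I", struct.pack("<f", value))[0]


def mov_sat_f(bits):
    """Assumed model of mov.sat dst:F src:F: clamp the float to [0.0, 1.0]."""
    value = bits_to_float(bits)
    return float_to_bits(min(max(value, 0.0), 1.0))


def mov_sat_d(bits):
    """Assumed model of mov.sat dst:D src:D: a D source already fits in D,
    so the clamp changes nothing and the bits pass through."""
    return bits & 0xFFFFFFFF


def original_sequence(vgrf2):
    """mov vgrf7:D vgrf2:D, then mov.sat m4:F vgrf7:F."""
    vgrf7 = vgrf2 & 0xFFFFFFFF  # plain D-to-D copy keeps the bit pattern
    return mov_sat_f(vgrf7)


def coalesced_sequence(vgrf2):
    """mov.sat m4:D vgrf2:D, the incorrect result of coalescing."""
    return mov_sat_d(vgrf2)


if __name__ == "__main__":
    two = float_to_bits(2.0)
    print(hex(original_sequence(two)))   # bit pattern of the clamped float
    print(hex(coalesced_sequence(two)))  # bit pattern left untouched
```

With the bit pattern of 2.0 as input, the original sequence yields the bits of 1.0 while the coalesced one passes the bits of 2.0 through, which is the mismatch the patch avoids by refusing to coalesce in that case.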


73282 commits
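
Note on the glsl/cs commits (c4cf824658, 6823e12d5a, 2b6cc0395b, 34e187ec38): the two formulas quoted from the ARB_compute_shader spec can be checked with a small sketch. This is plain Python arithmetic on 3-tuples, not the GLSL IR that Mesa inserts into main(); the function names are hypothetical and chosen only to mirror the GLSL built-ins.

```python
def global_invocation_id(work_group_id, work_group_size, local_invocation_id):
    """gl_GlobalInvocationID =
    gl_WorkGroupID * gl_WorkGroupSize + gl_LocalInvocationID (component-wise)."""
    return tuple(
        group * size + local
        for group, size, local in zip(
            work_group_id, work_group_size, local_invocation_id
        )
    )


def local_invocation_index(local_invocation_id, work_group_size):
    """gl_LocalInvocationIndex =
    ID.z * Size.x * Size.y + ID.y * Size.x + ID.x."""
    x, y, z = local_invocation_id
    size_x, size_y, _size_z = work_group_size
    return z * size_x * size_y + y * size_x + x


if __name__ == "__main__":
    size = (4, 5, 6)
    local = (1, 2, 3)
    print(global_invocation_id((2, 0, 1), size, local))
    print(local_invocation_index(local, size))
```

The local index flattens the 3D local ID in x-major order, so every invocation in a work group gets a distinct value in the range 0 to Size.x * Size.y * Size.z - 1.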