fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-07 07:08:04 +02:00

Author	SHA1	Message	Date
Boyuan Zhang	231605d14d	vl: add RefPicList defines for VAAPI HEVC decode Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com>	2015-10-27 19:09:55 -04:00
Marta Lofstedt	16c49da63a	mesa: Draw indirect is not allowed if the default VAO is bound. From OpenGL ES 3.1 specification, section 10.5: "DrawArraysIndirect requires that all data sourced for the command, including the DrawArraysIndirectCommand structure, be in buffer objects, and may not be called when the default vertex array object is bound." Signed-off-by: Marta Lofstedt <marta.lofstedt@linux.intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2015-10-27 12:16:23 +01:00
Marek Olšák	93eb4f9287	winsys/amdgpu: remove the dcc_enable surface flag dcc_size is sufficient and doesn't need a further comment in my opinion. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-10-27 10:49:24 +01:00
Marek Olšák	3aebc596b3	radeonsi: add debug flags that disable DCC and DCC fast clear For debugging, bug reports, etc. This is not in the radeonsi directory, but it is about radeonsi. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-10-27 10:49:24 +01:00
Marek Olšák	235d38584c	radeonsi: properly check if DCC is enabled and allocated Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-10-27 10:49:24 +01:00
Marek Olšák	5bc5dca0cb	radeonsi: simplify DCC handling in si_initialize_color_surface Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-10-27 10:49:24 +01:00
Marta Lofstedt	3daa7e5147	mesa: Draw indirect is not allowed when xfb is active and unpaused OpenGL ES 3.1 specification, section 10.5: "An INVALID_OPERATION error is generated if transform feedback is active and not paused." Signed-off-by: Marta Lofstedt <marta.lofstedt@linux.intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2015-10-27 08:49:21 +01:00
Marta Lofstedt	2c91e08656	mesa: Draw Indirect return wrong error code on unalinged From OpenGL 4.4 specification, section 10.4 and Open GL Es 3.1 section 10.5: "An INVALID_VALUE error is generated if indirect is not a multiple of the size, in basic machine units, of uint." However, the current code follow the ARB_draw_indirect: https://www.opengl.org/registry/specs/ARB/draw_indirect.txt "INVALID_OPERATION is generated by DrawArraysIndirect and DrawElementsIndirect if commands source data beyond the end of a buffer object or if <indirect> is not word aligned." V2: After discussions on the list, it was suggested to only keep the INVALID_VALUE error. Signed-off-by: Marta Lofstedt <marta.lofstedt@linux.intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-10-27 08:49:21 +01:00
Samuel Iglesias Gonsalvez	4565b6f4fb	main: Remove interface block array index for doing the name comparison From ARB_program_query_interface spec: "uint GetProgramResourceIndex(uint program, enum programInterface, const char name); [...] If <name> exactly matches the name string of one of the active resources for <programInterface>, the index of the matched resource is returned. Additionally, if <name> would exactly match the name string of an active resource if "[0]" were appended to <name>, the index of the matched resource is returned. [...]" "A string provided to GetProgramResourceLocation or GetProgramResourceLocationIndex is considered to match an active variable if: [...] if the string identifies the base name of an active array, where the string would exactly match the name of the variable if the suffix "[0]" were appended to the string; [...] " Fixes the following two dEQP-GLES31 tests: dEQP-GLES31.functional.program_interface_query.shader_storage_block.resource_list.block_array dEQP-GLES31.functional.program_interface_query.shader_storage_block.resource_list.block_array_single_element v2: - Add AoA support (Timothy) - Apply it too for GetUniformLocation(), GetUniformName() and others because ARB_program_interface_query says that they are equivalent to GetProgramResourceLocation() and GetProgramResourceName() (Tapani) Signed-off-by: Samuel Iglesias Gonsalvez <siglesias@igalia.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2015-10-27 08:10:04 +01:00
Eric Anholt	3359ad6cda	vc4: Add support for copy propagation with unpack flags present. total instructions in shared programs: 89251 -> 87862 (-1.56%) instructions in affected programs: 52971 -> 51582 (-2.62%)	2015-10-26 16:48:34 -07:00
Eric Anholt	01ca4f207e	vc4: Rewrite the pack instructions as a MOV with a dst pack flag Another step in reducing the special-casing of instructions.	2015-10-26 16:48:34 -07:00
Eric Anholt	72fa2ae20b	vc4: Move dst pack setup out to a helper function with more asserts.	2015-10-26 16:48:34 -07:00
Eric Anholt	99a9a5a345	vc4: Switch the unpack ops to being unpack flags on a mov. This paves the way for copy propagating our unpacks. We end up with a small change on shader-db: total instructions in shared programs: 89390 -> 89251 (-0.16%) instructions in affected programs: 19041 -> 18902 (-0.73%) which appears to be because we no longer convert MOVs for an FMAX dst, r4.unpack, r4.unpack (instead of the previous MOV dst, r4.unpack), and this ends up with a slightly better schedule.	2015-10-26 16:48:34 -07:00
Eric Anholt	548b05d53f	vc4: Drop some confused code about pack/unpack handling. At one point I thought packs and unpacks were in the same field of the instruction. They aren't. These instructions therefore never cause a pack. total instructions in shared programs: 89472 -> 89390 (-0.09%) instructions in affected programs: 15261 -> 15179 (-0.54%)	2015-10-26 16:48:34 -07:00
Eric Anholt	a7b424e835	vc4: Reduce MOV special-casing in QIR-to-QPU. I'm going to introduce some more types of MOV, which also want the elision of raw MOVs.	2015-10-26 16:48:34 -07:00
Eric Anholt	652a864b25	vc4: Fix up the test for whether the unpack can be from r4. We can do 16a/16b from float as well. No difference on shader-db.	2015-10-26 16:48:34 -07:00
Eric Anholt	3d7a088608	vc4: Don't try to follow MOVs across a pack.	2015-10-26 16:48:34 -07:00
Eric Anholt	6eb0760f48	vc4: Only copy propagate raw MOVs. No problems being fixed, but needed for the new unpack changes.	2015-10-26 16:48:34 -07:00
Eric Anholt	0ccacfa017	vc4: If a QIR source has an unpack set, print it. Not used yet, but will be.	2015-10-26 16:48:34 -07:00
Kenneth Graunke	8034e7d6f1	glsl: Convert TES gl_PatchVerticesIn into a constant when using a TCS. When a TCS is present, the TES input gl_PatchVerticesIn is actually a constant - it's simply the # of output vertices specified by the TCS layout qualifiers. So, we can replace the system value with a constant, which may allow further optimization, and will likely be more efficient. If the TCS is absent, we can't do this optimization. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2015-10-26 16:37:07 -07:00
Ian Romanick	8f84a8e257	i965: Add missing close-parenthesis in error messages Trivial. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2015-10-26 16:15:55 -07:00
Ian Romanick	7070c8879a	i965: Fix is-renderable check in intel_image_target_renderbuffer_storage Previously we could create a renderbuffer with format MESA_FORMAT_R8G8B8A8_UNORM, convert that renderbuffer to an EGLImage, then FAIL to convert the EGLImage back to a renderbuffer because reasons. Just use the same check in intel_image_target_renderbuffer_storage that brw_render_target_supported uses. There are more checks in brw_render_target_supported, but I don't think they are necessary here. A different approach would be to refactor brw_render_target_supported to take rb->Format and rb->NumSamples as parameters (instead of a gl_renderbuffer) and use the new function here. Fixes: ES2-CTS.gtf.GL2ExtensionTests.egl_image.egl_image Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Tested-by: Tapani Pälli <tapani.palli@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=92476 Cc: "10.3 10.4 10.5 10.6 11.0" <mesa-stable@lists.freedesktop.org>	2015-10-26 16:15:55 -07:00
Timothy Arceri	a3d0359aff	glsl: keep track of intra-stage indices for atomics This is more optimal as it means we no longer have to upload the same set of ABO surfaces to all stages in the program. This also fixes a bug where since commit c0cd5b var->data.binding was being used as a replacement for atomic buffer index, but they don't have to be the same value they just happened to end up the same when binding is 0. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Cc: Ilia Mirkin <imirkin@alum.mit.edu> Cc: Alejandro Piñeiro <apinheiro@igalia.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=90175	2015-10-27 07:03:05 +11:00
Roland Scheidegger	711489648b	gallivm: disable f16c when not using AVX f16c intrinsic can only be emitted when AVX is used. So when we disable AVX due to forcing 128bit vectors we must not use this intrinsic (depending on llvm version, this worked previously because llvm used AVX even when we didn't tell it to, however I've seen this fail with llvm 3.3 since `718249843b` which seems to have the side effect of disabling avx in llvm albeit it only touches sse flags really, but with `ea421e919a` it's now really disabled). Albeit being able to use AVX with 128bit vectors also would have its uses, the code as is really was meant to emulate jit code creation for less capable cpus. v2: add some (ifdefed out) missing de-featuring options for simulating less capable cpus. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2015-10-26 16:45:49 +01:00
Julien Isorce	a61be1a798	st/va: pass picture desc to begin and decode At least vl_mpeg12_decoder uses the picture desc in begin_frame and decode_bitstream. https://bugs.freedesktop.org/show_bug.cgi?id=92634 Signed-off-by: Julien Isorce <j.isorce@samsung.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2015-10-26 13:53:10 +01:00
Tapani Pälli	8ae4317c36	mesa: add additional checks for uniform location query Patch adds additional check to make sure we don't return locations for structures or arrays of structures. From page 79 of the OpenGL 4.2 spec: "A valid name cannot be a structure, an array of structures, or any portion of a single vector or a matrix." v2: use without-array() to simplify code (Timothy) No Piglit or CTS regressions observed. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2015-10-26 12:52:17 +02:00
Emil Velikov	a305d59baa	docs: add news item and link release notes for 11.0.4 Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2015-10-25 10:17:14 +00:00
Emil Velikov	47dd80a35d	docs: add sha256 checksums for 11.0.4 Signed-off-by: Emil Velikov <emil.velikov@collabora.com> (cherry picked from commit `ec14e6f8fd`)	2015-10-25 10:14:04 +00:00
Emil Velikov	bddb7a51c3	docs: add release notes for 11.0.4 Signed-off-by: Emil Velikov <emil.velikov@collabora.com> (cherry picked from commit `31bf247031`)	2015-10-25 10:14:03 +00:00
Kenneth Graunke	fcb39f5b6a	i965: Make brw_varying_to_offset take a const pointer to the VUE map. It doesn't modify it. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2015-10-24 20:30:14 -07:00
Eric Anholt	a2eba3362f	vc4: Fix names of the 16-bit unpacks They're only f16-to-f32 on a float operation, otherwise they're i16-to-i32.	2015-10-24 17:55:55 -07:00
Eric Anholt	a238ad372d	vc4: Don't try to register coalesce into the VPM across non-raw MOVs. No known bugs, just something I noticed while updating optimization code for other changes.	2015-10-24 17:55:38 -07:00
Eric Anholt	ae1d3322cc	vc4: Take advantage of the 8888 pack function in pack_unorm_4x8. One instruction instead of four, and it turns out you do this a lot for the Over operator. total uniforms in shared programs: 32168 -> 32087 (-0.25%) uniforms in affected programs: 318 -> 237 (-25.47%) total instructions in shared programs: 89830 -> 89472 (-0.40%) instructions in affected programs: 6434 -> 6076 (-5.56%)	2015-10-24 17:55:22 -07:00
Eric Anholt	f09ed63f43	vc4: Fix the test for skipping raw MOVs. I don't know what previous test was trying to do, but it dates back to the first add of vc4_qpu_emit.c. No change to shader-db.	2015-10-24 17:55:22 -07:00
Ben Widawsky	9ecfc6baf1	i965: Remove unused devinfo revision I left the function to obtain the revision because it is, and will continue to be useful in the future. I'd rather not have to dig it up every time we need it. Comments left at the implementation to say as much. This was accidentally left here when I moved the early platform support: commit `28ed1e08e8` Author: Ben Widawsky <benjamin.widawsky@intel.com> Date: Fri Aug 7 13:58:37 2015 -0700 i965/skl: Remove early platform support Signed-off-by: Ben Widawsky <benjamin.widawsky@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-10-24 12:48:01 -07:00
Fabio Pedretti	b0342f48d0	docs/index.html: fix typo Reviewed-by: Boyan Ding <boyan.j.ding@gmail.com> Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com>	2015-10-24 19:27:24 +01:00
Rob Clark	1e8d0cc628	freedreno: remove unnecessary null checks According to piglit/xonotic/neverball/stc, blend/rasterize/zsa state will always be bound (never null). And the null checks were in- consistent anyways, so remove them. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-10-24 12:38:33 -04:00
Bas Nieuwenhuizen	6529daca39	radeonsi: Implement DCC fast clear. Uses the DCC buffer instead of the CMASK buffer. The ELIMINATE_FAST_CLEAR still works. Furthermore, with DCC compression we can directly clear to a limited set of colors such that we do not need a postprocessing step. v2 Marek: check dcc_buffer && dirty_level_mask in set_sampler_view Signed-off-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2015-10-24 17:46:08 +02:00
Roland Scheidegger	205a3ce5c1	gallivm: fix tex offsets with mirror repeat linear Can't see why anyone would ever want to use this, but it was clearly broken. This fixes the piglit texwrap offset test using this combination. Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2015-10-24 03:00:33 +02:00
Roland Scheidegger	71ff5af5dd	gallivm: fix sampling with texture offsets in SoA path When using nearest filtering and clamp / clamp to edge wrapping results could be wrong for negative offsets. Fix this by adding the offset before doing the conversion to int coords (could also use floor instead of trunc int conversion but probably more complex on "typical" cpu). This fixes the piglit texwrap offset failures with this filter/wrap combo (which only leaves the linear/mirror repeat combination broken). Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2015-10-24 03:00:33 +02:00
Roland Scheidegger	fb586e1edb	softpipe: fix using non-zero layer in non-array view from array resource For vertex/geometry shader sampling, this is the same as for llvmpipe - just use the original resource target. For fragment shader sampling though (which does not use first-layer based mip offsets) adjust the sampling code to use first_layer in the non-array cases. While here also fix up some code which looked wrong wrt buffer texel fetch (no piglit change). Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2015-10-24 03:00:33 +02:00
Roland Scheidegger	fe707c0373	llvmpipe: fix using non-zero layer in non-array view from array resource Just need to use resource target not view target when calculating first-layer based mip offsets. (This is a gl specific problem since d3d10 does not distinguish between non-array and array resources neither at the resource nor view level, only at the shader level.) Fixes new piglit arb_texture_view sampling-2d-array-as-2d-layer test. Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2015-10-24 03:00:33 +02:00
Alex Deucher	830e57b82d	radeonsi: add Stoney to si_init_gs_info() This patch was originally written before stoney support was merged. Add stoney. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>	2015-10-23 18:56:45 -04:00
Bas Nieuwenhuizen	48b5f104ac	radeonsi: Enable DCC. Signed-off-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2015-10-24 00:42:30 +02:00
Bas Nieuwenhuizen	81ebd6a882	radeonsi: Add FLUSH_AND_INV_CB_DATA_TS for DCC. Signed-off-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2015-10-24 00:42:28 +02:00
Bas Nieuwenhuizen	bb77467df9	radeonsi: Disable operations that do not work with DCC. Signed-off-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2015-10-24 00:42:24 +02:00
Bas Nieuwenhuizen	afa357c3b0	radeonsi: Allocate buffers for DCC. As the alignment requirements can be 32 KiB or more, also adding an aligned buffer creation function. DCC is disabled for textures that can be shared as sharing the DCC buffers has not been implemented yet. Signed-off-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2015-10-24 00:42:01 +02:00
Marek Olšák	edf6a4537c	radeonsi: only apply the SNORM blit workaround to *8_SNORM Like the comment says. This fixes DCC, which doesn't like blitting RG16 as RGBA8. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-10-24 00:01:20 +02:00
Marek Olšák	e1c098f238	util/format: add helper util_format_is_snorm8 Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-10-24 00:01:20 +02:00
Marek Olšák	06083046a4	radeonsi: add another requirement for PARTIAL_ES_WAVE Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-10-24 00:01:20 +02:00

1 2 3 4 5 ...

73869 commits