fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-06-09 03:38:18 +02:00

Author	SHA1	Message	Date
Julien Isorce	3bbb8715ac	nvc0: fix crash when nv50_miptree_from_handle fails Signed-off-by: Julien Isorce <j.isorce@samsung.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2015-10-28 18:26:20 +01:00
Marek Olšák	ce9db16e1c	gallium: add PIPE_CAP_COPY_BETWEEN_COMPRESSED_AND_PLAIN_FORMATS For ARB_copy_image. Reviewed-by: Brian Paul <brianp@vmware.com>	2015-10-28 11:52:17 +01:00
Marek Olšák	e82c527f1f	radeonsi: allow copying between compatible compressed and uncompressed formats which is where a block in src maps to a pixel in dst and vice versa. e.g. DXT1 <-> R32G32_UINT DXT5 <-> R32G32B32A32_UINT Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-10-28 11:52:17 +01:00
Boyuan Zhang	03c92ffbf6	st/vdpau: disable RefPicList for Vdpau HEVC Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com>	2015-10-27 19:09:55 -04:00
Boyuan Zhang	ad2752e94b	st/va: add VAAPI HEVC decode support Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com>	2015-10-27 19:09:55 -04:00
Boyuan Zhang	38c3d7cfc4	radeon/uvd: implement and add flag for VAAPI HEVC decode Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com>	2015-10-27 19:09:55 -04:00
Boyuan Zhang	231605d14d	vl: add RefPicList defines for VAAPI HEVC decode Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com>	2015-10-27 19:09:55 -04:00
Marek Olšák	93eb4f9287	winsys/amdgpu: remove the dcc_enable surface flag dcc_size is sufficient and doesn't need a further comment in my opinion. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-10-27 10:49:24 +01:00
Marek Olšák	3aebc596b3	radeonsi: add debug flags that disable DCC and DCC fast clear For debugging, bug reports, etc. This is not in the radeonsi directory, but it is about radeonsi. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-10-27 10:49:24 +01:00
Marek Olšák	235d38584c	radeonsi: properly check if DCC is enabled and allocated Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-10-27 10:49:24 +01:00
Marek Olšák	5bc5dca0cb	radeonsi: simplify DCC handling in si_initialize_color_surface Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-10-27 10:49:24 +01:00
Eric Anholt	3359ad6cda	vc4: Add support for copy propagation with unpack flags present. total instructions in shared programs: 89251 -> 87862 (-1.56%) instructions in affected programs: 52971 -> 51582 (-2.62%)	2015-10-26 16:48:34 -07:00
Eric Anholt	01ca4f207e	vc4: Rewrite the pack instructions as a MOV with a dst pack flag Another step in reducing the special-casing of instructions.	2015-10-26 16:48:34 -07:00
Eric Anholt	72fa2ae20b	vc4: Move dst pack setup out to a helper function with more asserts.	2015-10-26 16:48:34 -07:00
Eric Anholt	99a9a5a345	vc4: Switch the unpack ops to being unpack flags on a mov. This paves the way for copy propagating our unpacks. We end up with a small change on shader-db: total instructions in shared programs: 89390 -> 89251 (-0.16%) instructions in affected programs: 19041 -> 18902 (-0.73%) which appears to be because we no longer convert MOVs for an FMAX dst, r4.unpack, r4.unpack (instead of the previous MOV dst, r4.unpack), and this ends up with a slightly better schedule.	2015-10-26 16:48:34 -07:00
Eric Anholt	548b05d53f	vc4: Drop some confused code about pack/unpack handling. At one point I thought packs and unpacks were in the same field of the instruction. They aren't. These instructions therefore never cause a pack. total instructions in shared programs: 89472 -> 89390 (-0.09%) instructions in affected programs: 15261 -> 15179 (-0.54%)	2015-10-26 16:48:34 -07:00
Eric Anholt	a7b424e835	vc4: Reduce MOV special-casing in QIR-to-QPU. I'm going to introduce some more types of MOV, which also want the elision of raw MOVs.	2015-10-26 16:48:34 -07:00
Eric Anholt	652a864b25	vc4: Fix up the test for whether the unpack can be from r4. We can do 16a/16b from float as well. No difference on shader-db.	2015-10-26 16:48:34 -07:00
Eric Anholt	3d7a088608	vc4: Don't try to follow MOVs across a pack.	2015-10-26 16:48:34 -07:00
Eric Anholt	6eb0760f48	vc4: Only copy propagate raw MOVs. No problems being fixed, but needed for the new unpack changes.	2015-10-26 16:48:34 -07:00
Eric Anholt	0ccacfa017	vc4: If a QIR source has an unpack set, print it. Not used yet, but will be.	2015-10-26 16:48:34 -07:00
Roland Scheidegger	711489648b	gallivm: disable f16c when not using AVX f16c intrinsic can only be emitted when AVX is used. So when we disable AVX due to forcing 128bit vectors we must not use this intrinsic (depending on llvm version, this worked previously because llvm used AVX even when we didn't tell it to, however I've seen this fail with llvm 3.3 since `718249843b` which seems to have the side effect of disabling avx in llvm albeit it only touches sse flags really, but with `ea421e919a` it's now really disabled). Albeit being able to use AVX with 128bit vectors also would have its uses, the code as is really was meant to emulate jit code creation for less capable cpus. v2: add some (ifdefed out) missing de-featuring options for simulating less capable cpus. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2015-10-26 16:45:49 +01:00
Julien Isorce	a61be1a798	st/va: pass picture desc to begin and decode At least vl_mpeg12_decoder uses the picture desc in begin_frame and decode_bitstream. https://bugs.freedesktop.org/show_bug.cgi?id=92634 Signed-off-by: Julien Isorce <j.isorce@samsung.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2015-10-26 13:53:10 +01:00
Eric Anholt	a2eba3362f	vc4: Fix names of the 16-bit unpacks They're only f16-to-f32 on a float operation, otherwise they're i16-to-i32.	2015-10-24 17:55:55 -07:00
Eric Anholt	a238ad372d	vc4: Don't try to register coalesce into the VPM across non-raw MOVs. No known bugs, just something I noticed while updating optimization code for other changes.	2015-10-24 17:55:38 -07:00
Eric Anholt	ae1d3322cc	vc4: Take advantage of the 8888 pack function in pack_unorm_4x8. One instruction instead of four, and it turns out you do this a lot for the Over operator. total uniforms in shared programs: 32168 -> 32087 (-0.25%) uniforms in affected programs: 318 -> 237 (-25.47%) total instructions in shared programs: 89830 -> 89472 (-0.40%) instructions in affected programs: 6434 -> 6076 (-5.56%)	2015-10-24 17:55:22 -07:00
Eric Anholt	f09ed63f43	vc4: Fix the test for skipping raw MOVs. I don't know what previous test was trying to do, but it dates back to the first add of vc4_qpu_emit.c. No change to shader-db.	2015-10-24 17:55:22 -07:00
Rob Clark	1e8d0cc628	freedreno: remove unnecessary null checks According to piglit/xonotic/neverball/stc, blend/rasterize/zsa state will always be bound (never null). And the null checks were in- consistent anyways, so remove them. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-10-24 12:38:33 -04:00
Bas Nieuwenhuizen	6529daca39	radeonsi: Implement DCC fast clear. Uses the DCC buffer instead of the CMASK buffer. The ELIMINATE_FAST_CLEAR still works. Furthermore, with DCC compression we can directly clear to a limited set of colors such that we do not need a postprocessing step. v2 Marek: check dcc_buffer && dirty_level_mask in set_sampler_view Signed-off-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2015-10-24 17:46:08 +02:00
Roland Scheidegger	205a3ce5c1	gallivm: fix tex offsets with mirror repeat linear Can't see why anyone would ever want to use this, but it was clearly broken. This fixes the piglit texwrap offset test using this combination. Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2015-10-24 03:00:33 +02:00
Roland Scheidegger	71ff5af5dd	gallivm: fix sampling with texture offsets in SoA path When using nearest filtering and clamp / clamp to edge wrapping results could be wrong for negative offsets. Fix this by adding the offset before doing the conversion to int coords (could also use floor instead of trunc int conversion but probably more complex on "typical" cpu). This fixes the piglit texwrap offset failures with this filter/wrap combo (which only leaves the linear/mirror repeat combination broken). Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2015-10-24 03:00:33 +02:00
Roland Scheidegger	fb586e1edb	softpipe: fix using non-zero layer in non-array view from array resource For vertex/geometry shader sampling, this is the same as for llvmpipe - just use the original resource target. For fragment shader sampling though (which does not use first-layer based mip offsets) adjust the sampling code to use first_layer in the non-array cases. While here also fix up some code which looked wrong wrt buffer texel fetch (no piglit change). Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2015-10-24 03:00:33 +02:00
Roland Scheidegger	fe707c0373	llvmpipe: fix using non-zero layer in non-array view from array resource Just need to use resource target not view target when calculating first-layer based mip offsets. (This is a gl specific problem since d3d10 does not distinguish between non-array and array resources neither at the resource nor view level, only at the shader level.) Fixes new piglit arb_texture_view sampling-2d-array-as-2d-layer test. Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2015-10-24 03:00:33 +02:00
Alex Deucher	830e57b82d	radeonsi: add Stoney to si_init_gs_info() This patch was originally written before stoney support was merged. Add stoney. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>	2015-10-23 18:56:45 -04:00
Bas Nieuwenhuizen	48b5f104ac	radeonsi: Enable DCC. Signed-off-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2015-10-24 00:42:30 +02:00
Bas Nieuwenhuizen	81ebd6a882	radeonsi: Add FLUSH_AND_INV_CB_DATA_TS for DCC. Signed-off-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2015-10-24 00:42:28 +02:00
Bas Nieuwenhuizen	bb77467df9	radeonsi: Disable operations that do not work with DCC. Signed-off-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2015-10-24 00:42:24 +02:00
Bas Nieuwenhuizen	afa357c3b0	radeonsi: Allocate buffers for DCC. As the alignment requirements can be 32 KiB or more, also adding an aligned buffer creation function. DCC is disabled for textures that can be shared as sharing the DCC buffers has not been implemented yet. Signed-off-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2015-10-24 00:42:01 +02:00
Marek Olšák	edf6a4537c	radeonsi: only apply the SNORM blit workaround to *8_SNORM Like the comment says. This fixes DCC, which doesn't like blitting RG16 as RGBA8. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-10-24 00:01:20 +02:00
Marek Olšák	e1c098f238	util/format: add helper util_format_is_snorm8 Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-10-24 00:01:20 +02:00
Marek Olšák	06083046a4	radeonsi: add another requirement for PARTIAL_ES_WAVE Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-10-24 00:01:20 +02:00
Marek Olšák	0d2cb35f68	radeonsi: merge two ifs setting WD_SWITCH_ON_EOP Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-10-24 00:01:20 +02:00
Marek Olšák	ca18f12dbb	radeonsi: make PARTIAL_ES_WAVE globally dependent on SWITCH_ON_EOI This catches the other cases that enable SWITCH_ON_EOI. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-10-24 00:01:20 +02:00
Marek Olšák	2070af2fb1	radeonsi: add one more SWITCH_ON_EOI requirement for Hawaii and VI The VI condition depends on geometry shaders and MAX_PRIMGRP_IN_WAVE. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-10-24 00:01:20 +02:00
Marek Olšák	a6b5684e99	radeonsi: only apply the instancing bug workaround to Bonaire Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-10-24 00:01:20 +02:00
Marek Olšák	96d5879d38	radeonsi: add SWITCH_ON_EOI requirement for 4 SE parts Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-10-24 00:01:20 +02:00
Marek Olšák	7e056f872f	radeonsi: remove unnecessary PARTIAL_VS_WAVE setting for streamout hardware does this automatically Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-10-24 00:01:20 +02:00
Marek Olšák	3a157e6e68	radeonsi: allow unbinding vertex shaders Draw calls without a vertex shader are skipped. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-10-24 00:01:20 +02:00
Marek Olšák	07b3cc6ecf	radeonsi: allow unbinding pixel shaders and remove the dummy shader Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-10-24 00:01:20 +02:00
Marek Olšák	50bb2decf7	radeonsi: add draw_vbo check for a NULL pixel shader Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-10-24 00:01:20 +02:00

... 53 54 55 56 57 ...

27608 commits