fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-02-03 17:20:26 +01:00

Author	SHA1	Message	Date
Jason Ekstrand	862da6a891	anv/device: Add a newline to the end of a comment	2015-11-09 16:04:06 -08:00
Nanley Chery	9c2b37a9c3	anv/formats: Define ETC2 formats Reviewed-by: Chad Versace <chad.versace@intel.com>	2015-11-09 15:41:41 -08:00
Nanley Chery	41cf35d1d8	anv/image: Determine the alignment units for compressed formats Alignment units, i and j, match the compressed format block width and height respectively. v2: Don't assert against HALIGN* and VALIGN* enums (Chad) Reviewed-by: Chad Versace <chad.versace@intel.com>	2015-11-09 15:41:41 -08:00
Nanley Chery	381f602c6b	anv/image: Handle compressed format qpitch and padding Reviewed-by: Chad Versace <chad.versace@intel.com>	2015-11-09 15:41:41 -08:00
Nanley Chery	300f7c2be3	anv/image: Handle compressed format stride and size These formulas did not take compressed formats into account. Reviewed-by: Chad Versace <chad.versace@intel.com>	2015-11-09 15:41:41 -08:00
Nanley Chery	7b4244dea0	anv/formats: Add fields for block dimensions A non-compressed texture is a 1x1x1 block. Compressed textures could have values which vary in different dimensions WxHxD. Reviewed-by: Chad Versace <chad.versace@intel.com>	2015-11-09 15:41:41 -08:00
Nanley Chery	a6c7d1e016	anv/formats: Add surface_format initializer v2: Rename __brw_fmt to __hw_fmt (Chad) Suggested-by: Jason Ekstrand <jason.ekstrand@intel.com> Reviewed-by: Chad Versace chad.versace@intel.com	2015-11-09 15:41:41 -08:00
Nanley Chery	3ee923f1c2	anv: Rename cpp variable to "bs" cpp (chars-per-pixel) is an integer that fails to give useful data about most compressed formats. Instead, rename it to "bs" which stands for block size (in bytes). v2: Rename vk_format_for_bs to vk_format_for_size (Chad) Use "block size" instead of "bs" in error message (Chad) Reviewed-by: Chad Versace <chad.versace@intel.com>	2015-11-09 15:41:41 -08:00
Ilia Mirkin	3ea3727998	docs: note that ARB_copy_image was added to nv50, nvc0 in this release Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-09 07:14:07 -05:00
Brian Paul	28f6faca51	st/wgl: add null pointer check for HUD texture Fixes crash when using HUD with Nobel Clinician Viewer. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2015-11-09 11:25:59 +00:00
Brian Paul	75d1e363ff	st/wgl: fix double-present on swapbuffers bug The stw_st_framebuffer_present_locked() function was getting called twice per SwapBuffers. First, when st_context_iface::flush() was called from DrvSwapBuffers() because the ST_FLUSH_FRONT flag was given. Second, by stw_st_swap_framebuffer_locked() which does the actual SwapBuffers. Two code changes: 1. Pass ST_FLUSH_END_OF_FRAME, instead of ST_FLUSH_FRONT. 2. Move the implementation of stw_flush_current_locked() into DrvSwapBuffers() since it's not called anywhere else. Not much change in perf for benchmarks like Lightsmark, but some simple Mesa demos are measurably faster. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2015-11-09 11:25:59 +00:00
Brian Paul	8083943e2e	st/wgl: reorder pixel formats to put MSAA formats last And put 8-bit/channel formats before 5/6/5 formats. The ChoosePixelFormat() function seems to be finicky about format selection. Putting the MSAA formats after the non-MSAA formats means most apps get a low-numbered format. Now we generally get the same pixel format regardless of whether using vgpu9 or 10. VMware bug 1455030 Reviewed-by: José Fonseca <jfonseca@vmware.com>	2015-11-09 11:25:59 +00:00
José Fonseca	e524df5ef3	st/wgl: Don't rely on GDI to bookkeep pixelformat for us. This allows to use apitrace's retracediff script on Windows to retrace and compare two builds of a Mesa based opengl32.dll/ICD side-by-side. See also `e4a4f15f5b`	2015-11-09 11:08:27 +00:00
Michel Dänzer	24abbaff9a	winsys/radeon: Use CPU page size instead of hardcoding 4096 bytes v3 Fixes GPUVM conflicts with non-4K page size. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=92738 v2: Replace sanitization of VM base address alignment with comment why that's not necessary. v3: Use unsigned instead of long as the type for the size_align member. (Marek) Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Christian König <christian.koenig@amd.com> (v1) Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2015-11-09 17:24:32 +09:00
Christian König	df4f9b0236	radeon/uvd: add H.265/HEVC to legal notes Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2015-11-08 18:16:01 -05:00
Leo Liu	519502d08f	st/omx: add headless support This will allow dec/enc/transcode without X v2: use env override even with X, use loader_open_device instead of open v3: clean up Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2015-11-08 18:15:57 -05:00
Leo Liu	25526d77b1	st/va: use vl screen drm support from vl_wys_drm v2: move the dup to vl_wys_drm for pipe loader Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2015-11-08 18:15:57 -05:00
Leo Liu	7da86e0ec0	vl: add drm support for vl_screen This will allow the state trackers to use render nodes with screen creation v2: dup fd for pipe loader Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2015-11-08 18:15:57 -05:00
Leo Liu	d115e47099	st/va: fix build fails with pipe loader There is no dev in drv, and dev should be from vl_screen here Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2015-11-08 18:15:57 -05:00
Samuel Pitoiset	ffb60e7788	nvc0: enable compute support on Fermi Altough the compute support is still not complete because textures and surfaces need to be implemented, it allows to launch very simple compute kernel like one which reads reading MP performance counters. This turns on PIPE_CAP_COMPUTE and PIPE_SHADER_COMPUTE. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-08 16:47:59 +01:00
Ilia Mirkin	e06238cb9e	nv50/ir: fix emission of s[] args in certain situations There might only be a single arg (e.g. cvt), so use mode rather than looking at the source directly. Also we don't want to rely on the type of the value, which can be unreliable, but instead use the instruction's. This works out well since mkSplit doesn't adjust the type. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-07 18:58:58 -05:00
Ilia Mirkin	af218217d7	nv50/ir: only take abs value when computing high result Not reachable from TGSI since it only has UMUL, no IMUL. However it's surprising that setting argument types to s32 will cause sign to get lost. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-07 18:58:58 -05:00
Ilia Mirkin	53cbb11707	nouveau: avoid queueing too much work onto a single fence Force the fence to get kicked off, which won't actually wait for its completion, but any additional work will be put onto a fresh list. This fixes crashes in teximage-colors --benchmark with too many active maps. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-07 18:58:58 -05:00
Dave Airlie	0f5b1409fd	llvmpipe: disable front updates for now As pointed out by Emil, this sometimes hangs, appears to be due to threading need to rethink how this stuff works for llvmpipe. Signed-off-by: Dave Airlie <airlied@redhat.com>	2015-11-08 07:55:17 +10:00
Dave Airlie	87711183ac	virgl: wrap ret assignment with braces to do correct thing Coverity reported that ret could only be 0 or 1, since it was setting ret = fn() > 0, instead of doing (ret = fn()) > 0. Signed-off-by: Dave Airlie <airlied@redhat.com>	2015-11-08 06:27:02 +10:00
Jason Ekstrand	6c731d8566	nir: Add a nir_deref_tail helper Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-11-07 12:09:44 -08:00
Jason Ekstrand	7d90e570f3	nir/types: Add an is_vector_or_scalar helper Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-11-07 12:09:38 -08:00
Jason Ekstrand	d43e16b163	i965/fs: Use regs_read/written for post-RA scheduling in calculate_deps Previously, we were assuming that everything read/wrote exactly 1 logical GRF (1 in SIMD8 and 2 in SIMD16). This isn't actually true. In particular, the PLN instruction reads 2 logical registers in one of the components. This commit changes post-RA scheduling to use regs_read and regs_written instead so that we add enough dependencies. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=92770 Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-11-07 08:41:48 -08:00
Jason Ekstrand	c839174d55	nir/validate: Add better validation of load/store types Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-11-07 08:41:35 -08:00
Jason Ekstrand	17fa3d3572	nir/spirv: Give both block and buffer_block types an interface type	2015-11-07 08:03:25 -08:00
Marek Olšák	d57ede92b7	radeonsi: add register definitions for Stoney There are a few non-stoney changes too. Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2015-11-07 10:22:13 +01:00
Marek Olšák	2658777f46	radeonsi: add workarounds for CP DMA to stay on the fast path v2: set emit_scratch_reloc, add a NULL check Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-11-07 10:22:13 +01:00
Marek Olšák	fc0416ef5d	radeonsi: unify CP DMA preparation logic Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-11-07 10:22:13 +01:00
Marek Olšák	89da3b4458	radeonsi: unify CP DMA code determining various flags v2: don't call get_flush_flags twice per function Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-11-07 10:22:12 +01:00
Marek Olšák	c3e527f93d	radeonsi: only enable write confirmation on the last CP DMA packet This should improve performance for big copies that need to be split. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-11-07 10:22:12 +01:00
Ilia Mirkin	8e9ade7eb3	nv50/ir: allow emission of immediates in imul/imad ops Nothing actually uses this yet (due to complications), but the emission logic is right. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-07 00:42:15 -05:00
Jason Ekstrand	a10d59c09a	nir/spirv: Increment num_ubos/ssbos when creating variables	2015-11-06 16:53:27 -08:00
Jason Ekstrand	046563167c	anv/apply_dynamic_offsets: Use the right sized immediate zero	2015-11-06 16:49:24 -08:00
Ilia Mirkin	393d0c336b	nv50/ir: properly set the type of the constant folding result This removes the hack used for merge, which only covers a fraction of the cases. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-06 19:39:32 -05:00
Ilia Mirkin	2f9aaed749	nv50/ir: add support for const-folding OP_CVT with F64 source/dest Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-06 19:39:32 -05:00
Jason Ekstrand	104525c33b	anv/pipeline: Set the right SSBO binding table start index for FS	2015-11-06 15:57:51 -08:00
Ilia Mirkin	76957389fc	nv50/ir: add fp64 opcode emission support for G200 (NVA0) Need to emulate rcp/rsq before providing full fp64 support Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-06 18:36:25 -05:00
Jason Ekstrand	399d5314f6	anv/cmd_buffer: Rework the way we emit UBO surface state The new mechanism should be able to handle SSBOs as well as properly handle emitting surface state on gen7 where we need different strides depending on shader stage.	2015-11-06 15:14:12 -08:00
Hans de Goede	f979d3cfec	nv50/ir: Add support for 64bit immediates to checkSwapSrc01 Now that we support 64 bit immediates in insnCanLoad, we need to swap 64 bit immediate sources too for optimal effect. Signed-off-by: Hans de Goede <hdegoede@redhat.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-06 18:13:31 -05:00
Hans de Goede	9f2f8bda6e	nvc0/ir: Teach insnCanLoad about double immediates Teach insnCanLoad about double immediates, together with the "Add support for merge-s to the ConstantFolding pass" This turns the following (nvc0) code: 1: mov u32 $r2 0x00000000 (8) 2: mov u32 $r3 0x3fe00000 (8) 3: add f64 $r0d $r0d $r2d (8) Into: 1: add f64 $r0d $r0d 0.500000 (8) Signed-off-by: Hans de Goede <hdegoede@redhat.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-06 18:13:31 -05:00
Hans de Goede	428506ece2	nv50/ir: Add support for merge-s to the ConstantFolding pass This allows later passes like LoadPropagation to properly deal with 64 bit immediates. If the new 64 bit load this introduces does not get optimized away then split64BitOpPostRA() will split this into 2 instructions again. Signed-off-by: Hans de Goede <hdegoede@redhat.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-06 18:13:31 -05:00
Ilia Mirkin	2437f00853	nv50/ir: disallow 64-bit immediates on nv50 targets No instructions are able to load short immediates like nvc0 can. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-06 18:13:31 -05:00
Ilia Mirkin	11e3dac36e	nv50/ir: allow movs with TYPE_F64 destinations to be split Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-06 18:13:31 -05:00
Jason Ekstrand	1b5c7e7ecd	anv/pipeline: Expose is_scalar_shader_stage	2015-11-06 15:12:33 -08:00
Jason Ekstrand	5ba281e794	nir/spirv: Add a helper for determining if a block is externally visable	2015-11-06 15:09:57 -08:00

... 141 142 143 144 145 ...

82384 commits