The new name is less confusing. I also changed the gallivm limit because it
looked wrong.
v2: use sizeof(float[4])
Reviewed-by: Brian Paul <brianp@vmware.com>
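As a generic illustration of the v2 note (the constant actually touched is not
shown here, so the names below are made up): writing the limit as
sizeof(float[4]) instead of a bare 16 documents that the unit is one vec4
worth of bytes.

    /* Hypothetical names, purely to illustrate the sizeof(float[4]) change. */
    #define MAX_VEC4_SLOTS 32

    /* before: magic number */
    #define BUFFER_BYTES_OLD (MAX_VEC4_SLOTS * 16)

    /* after: the size of one vec4 is explicit */
    #define BUFFER_BYTES     (MAX_VEC4_SLOTS * sizeof(float[4]))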
Most image functions are required to return a CL_INVALID_OPERATION
error when used on devices without image support.
v2:
- Simplified the code
Reviewed-by: Francisco Jerez <currojerez@riseup.net>
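A minimal sketch of the rule from the API consumer's side (clover's internal
check is not reproduced here): an application can query
CL_DEVICE_IMAGE_SUPPORT up front, since image calls on such a device must
fail with CL_INVALID_OPERATION.

    #include <CL/cl.h>

    /* Returns nonzero when the device advertises image support; on devices
     * where this is CL_FALSE, the image entry points are required to return
     * CL_INVALID_OPERATION. */
    static int device_has_images(cl_device_id dev)
    {
       cl_bool supported = CL_FALSE;

       if (clGetDeviceInfo(dev, CL_DEVICE_IMAGE_SUPPORT,
                           sizeof(supported), &supported, NULL) != CL_SUCCESS)
          return 0;
       return supported == CL_TRUE;
    }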
When mapping a busy resource with PIPE_TRANSFER_DISCARD_RANGE or
PIPE_TRANSFER_FLUSH_EXPLICIT, we can avoid blocking by allocating and mapping
a staging bo and emitting pipelined copies at the proper places. Since the
staging bo is never bound to the GPU, we give it a packed layout to save space.
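A minimal sketch of the decision, assuming gallium's PIPE_TRANSFER_* flags
from p_defines.h; the helper below is hypothetical, not the ilo driver's
actual code.

    #include "pipe/p_defines.h"   /* PIPE_TRANSFER_* usage flags */
    #include <stdbool.h>

    /* A busy resource can be mapped without stalling when the caller either
     * discards the mapped range or promises explicit flushes: writes go to a
     * CPU-side staging bo (packed layout, never bound to the GPU) and a
     * pipelined staging->resource copy is emitted at flush/unmap time. */
    static bool
    can_map_busy_resource_via_staging(unsigned usage)
    {
       return (usage & (PIPE_TRANSFER_DISCARD_RANGE |
                        PIPE_TRANSFER_FLUSH_EXPLICIT)) != 0;
    }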
Add xfer_map() to replace map_bo_for_transfer(). Add xfer_unmap() and
xfer_alloc_staging_sys() to simplify texture and buffer mapping/unmapping, and
enable more code sharing between them.
Add a bunch of helper functions and a big comment for
choose_transfer_method(). This also fixes the handling of
PIPE_TRANSFER_MAP_DIRECTLY so that tiling is no longer ignored.
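A hedged sketch of the PIPE_TRANSFER_MAP_DIRECTLY fix (the layout enum and
helper are illustrative): a direct map can only be honored for a linear
layout, since mapping a tiled bo directly would hand the caller swizzled data.

    #include "pipe/p_defines.h"
    #include <stdbool.h>

    enum img_layout { IMG_LAYOUT_LINEAR, IMG_LAYOUT_TILED };   /* illustrative */

    static bool
    can_map_directly(enum img_layout layout, unsigned usage)
    {
       if (!(usage & PIPE_TRANSFER_MAP_DIRECTLY))
          return false;
       /* honoring MAP_DIRECTLY on a tiled resource would ignore tiling */
       return layout == IMG_LAYOUT_LINEAR;
    }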
With MESA_EXTENSION_OVERRIDE=GL_ARB_compute_shader, this fixes piglit:
built-in-constants tests/spec/arb_compute_shader/minimum-maximums.txt
Signed-off-by: Jordan Justen <jordan.l.justen@intel.com>
Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>
With MESA_EXTENSION_OVERRIDE=GL_ARB_compute_shader, this fixes piglit:
* arb_compute_shader-minmax
Signed-off-by: Jordan Justen <jordan.l.justen@intel.com>
Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>
Instead of falling back to just the block name (which we won't find),
look for the first element of the block array. We'll deal with the rest
in the backend by arranging for the blocks to be laid out contiguously.
V2: Squashed together patches 3, 5 of V1, plus a naming tweak.
Signed-off-by: Chris Forbes <chrisf@ijw.co.nz>
Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
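A hedged sketch of the name fallback described above (the helper is
illustrative, not the linker's actual code): when the block name itself is
not found, look up the first array element instead.

    #include <stdio.h>

    /* e.g. "Material" -> "Material[0]", the name under which the first
     * element of an interface block array is recorded. */
    static void
    block_first_element_name(char *buf, size_t len, const char *block_name)
    {
       snprintf(buf, len, "%s[0]", block_name);
    }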
Previously this was a block index with special semantics for -1.
With ARB_gpu_shader5, this need not be a compile-time constant, so
allow any rvalue here and convert the -1 to a NULL pointer.
Signed-off-by: Chris Forbes <chrisf@ijw.co.nz>
Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
Without doing a lot more work, we have no idea which indices may
be used at runtime, so just mark them all.
Signed-off-by: Chris Forbes <chrisf@ijw.co.nz>
Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
This buys us two things: we need fewer item copies when we have to
defrag+grow the pool (just one copy per item), and even in the case where we
don't need to defrag the pool, we reduce the data copied to just the useful
data that the items use.
Note: The fallback path is a bit ugly now, but hopefully we won't need
it much.
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
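A host-memory sketch of the "one copy per item" idea (the real pool moves
items with GPU resource copies; the names here are made up): walk the item
list once and move only each item's used bytes straight to its compacted
offset in the new pool.

    #include <stddef.h>
    #include <string.h>

    struct pool_item {
       size_t offset;            /* offset in the old pool, in bytes */
       size_t size;              /* bytes the item actually uses     */
       struct pool_item *next;
    };

    /* Defrag+grow in a single pass: each item is copied exactly once, and
     * only its useful data is copied. */
    static void
    compact_items(char *new_pool, const char *old_pool, struct pool_item *items)
    {
       size_t dst = 0;

       for (struct pool_item *it = items; it; it = it->next) {
          memcpy(new_pool + dst, old_pool + it->offset, it->size);
          it->offset = dst;
          dst += it->size;
       }
    }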
Now, before moving everything to host memory, we try to create a new
resource to use as the pool. If we succeed, we just use this resource and
delete the previous one. If we fail, we fall back to using the shadow.
This should make growing the pool faster, and we also save the 64KB of
memory that was allocated for the 'shadow' even when it wasn't used.
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
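Hypothetical control flow for the growth path just described (none of these
are the compute pool's real function names): prefer allocating a fresh pool
resource and migrating items on the GPU, and only fall back to the host
'shadow' when that allocation fails.

    struct compute_pool;
    struct pool_resource;

    /* Illustrative prototypes only. */
    struct pool_resource *try_create_pool_resource(unsigned size_in_dw);
    void migrate_items_gpu(struct compute_pool *pool, struct pool_resource *dst);
    void release_pool_resource(struct pool_resource *res);
    int  grow_via_shadow(struct compute_pool *pool, unsigned size_in_dw);

    static int
    grow_pool(struct compute_pool *pool, struct pool_resource **bo,
              unsigned size_in_dw)
    {
       struct pool_resource *fresh = try_create_pool_resource(size_in_dw);

       if (fresh) {
          migrate_items_gpu(pool, fresh);   /* one copy per item */
          release_pool_resource(*bo);
          *bo = fresh;
          return 0;
       }
       /* Allocation failed: take the slower shadow-based path. */
       return grow_via_shadow(pool, size_in_dw);
    }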
Move the bits we want to share between generations from fd3_program to
ir3_shader. So the overall structure is:
  fdN_shader_stateobj -> ir3_shader -> ir3_shader_variant -> ir3
                                    |- ...
                                    \- ir3_shader_variant -> ir3
So the ir3_shader becomes the topmost generation-neutral object, which
manages the set of variants, each of which generates, compiles, and
assembles its own ir.
There is a bit of additional renaming to s/fd3_compiler/ir3_compiler/,
etc.
Keep the split between the gallium-level stateobj and the shader helper
object because it might be a good idea to pre-compute some generation-specific
register values (i.e. anything that is independent of linking).
Signed-off-by: Rob Clark <robclark@freedesktop.org>
First step of reorganization: split out the compiler (so it can be shared
between a3xx and a4xx). Rename ir3_shader -> ir3 (since we'll want the name
ir3_shader for a higher-level object).
Signed-off-by: Rob Clark <robclark@freedesktop.org>
The scheduler also needs to be aware of predicate register (p0) in
addition to address register (a0).
Signed-off-by: Rob Clark <robclark@freedesktop.org>
It seems like, for the most part, different behaviors, workarounds, etc.
should be conditional on the GPU patch revision (i.e. a320.0 vs a320.2)
rather than the GPU id (a320 vs a330).
Signed-off-by: Rob Clark <robclark@freedesktop.org>
The fixed-size heap is a remnant of the fdre-a3xx assembler. Still, it is
convenient to be able to free the entire data structure in one shot without
worrying about leaking nodes.
Change it to dynamically grow the heap (adding chunks) as needed, so we
don't have an artificial upper limit on shader size (other than hw limits)
and don't always have to allocate the worst-case size.
Signed-off-by: Rob Clark <robclark@freedesktop.org>
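A minimal sketch of the growable heap described above, under made-up names
(not ir3's actual allocator): allocations come from the current chunk, a new
chunk is chained on when it fills up, and the whole thing is freed in one
shot by walking the chunk list.

    #include <stdlib.h>

    #define CHUNK_SZ (64 * 1024)

    struct heap_chunk {
       struct heap_chunk *next;
       size_t used;
       char data[CHUNK_SZ];
    };

    struct heap {
       struct heap_chunk *chunks;            /* most recently added chunk first */
    };

    static void *
    heap_alloc(struct heap *heap, size_t sz)
    {
       sz = (sz + 15) & ~(size_t)15;          /* keep returned pointers aligned */
       if (sz > CHUNK_SZ)
          return NULL;                        /* sketch: no oversized allocations */

       if (!heap->chunks || heap->chunks->used + sz > CHUNK_SZ) {
          struct heap_chunk *c = calloc(1, sizeof(*c));
          if (!c)
             return NULL;
          c->next = heap->chunks;             /* grow by adding another chunk */
          heap->chunks = c;
       }

       void *ptr = heap->chunks->data + heap->chunks->used;
       heap->chunks->used += sz;
       return ptr;
    }

    static void
    heap_destroy(struct heap *heap)
    {
       while (heap->chunks) {                 /* free everything in one shot */
          struct heap_chunk *c = heap->chunks;
          heap->chunks = c->next;
          free(c);
       }
    }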
The GBM_DRIVERS_PATH environment variable is not documented and is only
used to set the location of gbm drivers, while LIBGL_DRIVERS_PATH is used
for everything else and is documented.
Generally this split leads to confusion as to why gbm doesn't work.
This patch reads LIBGL_DRIVERS_PATH as a fallback if GBM_DRIVERS_PATH is
not set.
The comments clearly indicate that using LIBGL_DRIVERS_PATH is preferred
over GBM_DRIVERS_PATH.
v2: - Use GBM_DRIVERS_PATH as a fallback
v3: [jordan.l.justen@intel.com] - Make LIBGL_DRIVERS_PATH the fallback
Signed-off-by: Dylan Baker <baker.dylan.c@gmail.com>
Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>
Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>
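A sketch of the lookup order being described, under assumed names (the real
gbm loader has extra guards, such as ignoring the environment for setuid
binaries, which this sketch omits):

    #include <stdlib.h>

    #ifndef DEFAULT_DRIVER_DIR
    #define DEFAULT_DRIVER_DIR "/usr/lib/dri"   /* stand-in for the build-time default */
    #endif

    static const char *
    gbm_driver_search_path(void)
    {
       const char *path = getenv("GBM_DRIVERS_PATH");

       if (path == NULL || path[0] == '\0')
          path = getenv("LIBGL_DRIVERS_PATH");   /* documented, preferred variable */
       if (path == NULL || path[0] == '\0')
          path = DEFAULT_DRIVER_DIR;
       return path;
    }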
According to a quick micro-benchmark, this new version is 20% faster on my
Haswell laptop.
v2: Removed the XXX note about x86_64 from the comment
v3: Use an intrinsic instead of an __asm__ block. This should give us MSVC
support for free.
v4: Enable it for all x86_64 builds, not just with USE_X86_64_ASM
Signed-off-by: Jason Ekstrand <jason.ekstrand@intel.com>
Reviewed-by: Matt Turner <mattst88@gmail.com>
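Illustrative of the pattern only, not necessarily the function this patch
touched: an SSE intrinsic replaces the __asm__ block, so MSVC builds (and
x86_64 builds without USE_X86_64_ASM) get the fast path too.

    #include <xmmintrin.h>

    /* Float-to-int conversion via cvtss2si, which rounds using the current
     * rounding mode (round-to-nearest-even by default); the intrinsic
     * compiles with gcc, clang and MSVC alike, unlike inline asm. */
    static inline int
    round_to_int(float f)
    {
       return _mm_cvtss_si32(_mm_set_ss(f));
    }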
... to eliminate an ELSE instruction followed immediately by an ENDIF.
instructions in affected programs: 704 -> 700 (-0.57%)
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
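A hedged sketch of the peephole on a generic singly-linked instruction list
(not the i965 backend's real IR): an ELSE immediately followed by ENDIF does
nothing, so it can be removed.

    enum opcode { OP_IF, OP_ELSE, OP_ENDIF, OP_OTHER };

    struct inst {
       enum opcode op;
       struct inst *next;
    };

    /* Unlinks every ELSE whose immediate successor is an ENDIF; freeing the
     * removed instruction is left to the caller in this sketch. */
    static void
    remove_useless_else(struct inst **list)
    {
       for (struct inst **p = list; *p; p = &(*p)->next) {
          struct inst *in = *p;
          if (in->op == OP_ELSE && in->next && in->next->op == OP_ENDIF)
             *p = in->next;   /* drop the ELSE; *p is now the ENDIF */
       }
    }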