fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-19 07:08:05 +02:00

Author	SHA1	Message	Date
Jason Ekstrand	51376cd749	Revert "intel/compiler: Properly consider UBO loads that cross 32B boundaries." This reverts commit `b8fa847c2e`. This broke about 30k Vulkan CTS tests.	2018-06-13 09:23:55 -07:00
Kenneth Graunke	b8fa847c2e	intel/compiler: Properly consider UBO loads that cross 32B boundaries. The UBO push analysis pass incorrectly assumed that all values would fit within a 32B chunk, and only recorded a bit for the 32B chunk containing the starting offset. For example, if a UBO contained the following, tightly packed: vec4 a; // [0, 16) float b; // [16, 20) vec4 c; // [20, 36) then, c would start at offset 20 / 32 = 0 and end at 36 / 32 = 1, which means that we ought to record two 32B chunks in the bitfield. Similarly, dvec4s would suffer from the same problem. Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2018-06-13 02:07:58 -07:00
Mauro Rossi	fb9ab2fbd3	anv/android: Use an address for each anv_image plane Fixes to avoid building error after change in image->planes[] structure, {bo,bo_offset} has to be replaced by address.{bo,offset} and update is needed also in the assert() for debug builds. external/mesa/src/intel/vulkan/anv_android.c:188:21: error: no member named 'bo' in 'struct anv_image::(anonymous at external/mesa/src/intel/vulkan/anv_private.h:2647:4)' image->planes[0].bo = bo; ~~~~~~~~~~~~~~~~ ^ 1 error generated. Fixes: `bf34ef16ac` ("anv: Use an address for each anv_image plane") Signed-off-by: Mauro Rossi <issor.oruam@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-06-12 11:17:43 +03:00
Mauro Rossi	a1220e7311	anv/android: Set the BO flags in bo_cache_import (v2) Changes to avoid building error: external/mesa/src/intel/vulkan/anv_android.c:131:72: error: too few arguments to function call, expected 5, have 4 result = anv_bo_cache_import(device, &device->bo_cache, dma_buf, &bo); ~~~~~~~~~~~~~~~~~~~ ^ 1 error generated. (v2) Set the correct bo_flags based on support of 48bit addresses and soft-pin Fixes: `b0d50247a7` ("anv/allocator: Set the BO flags in bo_cache_alloc/import") Fixes: `e7d0378bd9` ("anv: Soft-pin client-allocated memory") Signed-off-by: Mauro Rossi <issor.oruam@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-06-12 11:16:39 +03:00
Kenneth Graunke	0d5329d626	anv: Disable __gen_validate_value if NDEBUG is set. We were enabling undefined memory checking for genxml values based on Valgrind being installed at build time, even for release builds. This generates piles and piles of assembly whenever you touch genxml. With gcc 7.3.1 and -O3 and -march=native on a Kabylake with Valgrind installed at build time: text data bss dec hex filename 5978385 262884 13488 6254757 5f70a5 libvulkan_intel.so 3799377 262884 13488 4075749 3e30e5 libvulkan_intel.so That's a 36% reduction in text size. Fixes: `047ed02723` (vk/emit: Use valgrind to validate every packed field) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-06-11 14:55:32 -07:00
Gustavo Lima Chaves	7dfaf025c5	anv: enable VK_EXT_shader_stencil_export Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-06-08 11:16:01 -07:00
Jason Ekstrand	3b54dd87f7	intel/isl: Add bounds-checking assertions for the format_info table We follow the same convention as isl_format_get_layout in having two assertions to ensure that only valid formats are passed in. We also check against the array size of the table because some valid formats such as CCS formats will may be past the end of the table. This fixes some potential out-of-bounds array access even in valid cases. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-06-07 11:23:34 -07:00
Jason Ekstrand	778e2881a0	intel/isl: Add bounds-checking assertions in isl_format_get_layout We add two assertions instead of one because the first assertion that format != ISL_FORMAT_UNSUPPORTED is more descriptive and checks for a real but unsupported enumerant while the second ensures that they don't pass in garbage values. We also update some other helpers to use isl_format_get_layout instead of using the table directly so that they get bounds checking too. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-06-07 11:23:34 -07:00
Jason Ekstrand	237c5ac4f9	anv: Set fence/semaphore types to NONE in impl_cleanup There were some places that were calling anv_semaphore_impl_cleanup and neither deleting the semaphore nor setting the type back to NONE. Just set it to NONE in impl_cleanup to avoid these issues. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106643 Fixes: `031f57eba` "anv: Add a basic implementation of VK_KHX_external..." Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-06-07 09:46:45 -07:00
Kenneth Graunke	74259b98aa	intel/blorp: Emit VF cache invalidates for 48-bit bugs with softpin. commit `92f01fc5f9` made i965 start emitting VF cache invalidates when the high bits of vertex buffers change. But we were not tracking vertex buffers emitted by BLORP. This was papered over by a mistake where I emitted VF cache invalidates all the time, which Chris fixed in commit `3ac5fbadfd`. This patch adds a new hook which allows the driver to track addresses and request a VF cache invalidate as appropriate. v2: Make the driver do the PIPE_CONTROL so it can apply workarounds (caught by Jason Ekstrand). Rebase on anv bug fix. v3: Don't screw up the boolean (caught by Jason Ekstrand). Fixes: `92f01fc5f9` ("i965: Emit VF cache invalidates for 48-bit addressing bugs with softpin.") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-06-06 19:45:09 -07:00
Jason Ekstrand	44c614843c	intel/blorp: Don't vertex fetch directly from clear values On gen8+, we have to VF cache flush whenever a vertex binding aliases a previous binding at the same index modulo 4GiB. We deal with this in Vulkan by ensuring that vertex buffers and the dynamic state (from which BLORP pulls its vertex buffers) are in the same 4GiB region of the address space. That doesn't work if we're reading clear colors with the VF unit. In order to work around this we switch to using MI commands to copy the clear value into the vertex buffer we allocate for the normal constant data. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-06-06 16:32:38 -07:00
Jason Ekstrand	01ad2067bb	intel/eu: Use a struct copy instead of a memcpy The memcpy had the wrong size and this was causing crashes on 32-bit builds of the driver. Fixes: `6a9525bf67` "intel/eu: Switch to a logical state stack" Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106830 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-06-05 15:51:01 -07:00
Scott D Phillips	6fb22114a0	intel/tools: add intel_sanitize_gpu to EXTRA_DIST Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106778 Fixes: `cc41603d6d` ("intel/tools: new intel_sanitize_gpu tool") Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-06-05 10:32:35 -07:00
Lionel Landwerlin	9aedee64ac	anv: intel: add softpin flag on imported BOs Looks like we forgot to update this bit of the driver for softpin. Fixes: `4affeba1e9` ("anv: Soft-pin everything else") Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-06-05 14:18:35 +01:00
Jason Ekstrand	6a9525bf67	intel/eu: Switch to a logical state stack Instead of the state stack that's based on copying a dummy instruction around, we start using a logical stack of brw_insn_states. This uses a bit less memory and is way less conceptually bogus. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-06-04 14:03:03 -07:00
Jason Ekstrand	db9675f5a4	intel/eu: Set flag [sub]register number differently for 3src Prior to gen8, the flag [sub]register number is in a different spot on 3src instructions than on other instructions. Starting with Broadwell, they made it consistent. This commit fixes bugs that occur when a conditional modifier gets propagated into a 3src instruction such as a MAD. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-06-04 14:03:03 -07:00
Jason Ekstrand	2d20303e18	intel/eu: Copy fields manually in brw_next_insn Instead of doing a memcpy, this moves us to start with a blank instruction (memset to zero) and copy the fields over one at a time. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-06-04 14:03:03 -07:00
Jason Ekstrand	381fac2740	intel/eu: Add some brw_get_default_ helpers This is much cleaner than everything that wants a default value poking at the bits of p->current directly. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-06-04 14:03:03 -07:00
Jason Ekstrand	64e619674e	anv: Don't even bother processing relocs if we have softpin Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2018-06-01 16:34:26 -07:00
Jason Ekstrand	c7be17c8d3	anv: Refactor reloc handling in execbuf_add_bo This just separates the reloc list vs. BO set cases and lets us avoid an allocation if relocs->deps->entries == 0. Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2018-06-01 16:34:25 -07:00
Jason Ekstrand	7105b7890a	anv: Assert that the kernel leaves pinned BO addresses alone Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2018-06-01 16:33:07 -07:00
Scott D Phillips	4affeba1e9	anv: Soft-pin everything else v2 (Jason Ekstrand): - Break up Scott's mega-patch Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2018-06-01 14:27:13 -07:00
Scott D Phillips	f3dbe0419d	anv: Soft-pin batch buffers Co-authored-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2018-06-01 14:27:12 -07:00
Jason Ekstrand	a0b133286a	anv/batch_chain: Simplify secondary batch return chaining Previously, we did this weird thing where we left space and an empty relocation for use in a hypothetical MI_BATCH_BUFFER_START that would be added to the secondary later. Then, when it came time to chain it into the primary, we would back that out and emit an MI_BATCH_BUFFER_START. This worked well but it was always a bit hacky, fragile and ugly. This commit instead adds a helper for rewriting the MI_BATCH_BUFFER_START at the end of an anv_batch_bo and we use that helper for both batch bo list cloning and handling returns from secondaries. The new helper doesn't actually modify the batch in any way but instead just adjusts the relocation as needed. Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2018-06-01 14:27:12 -07:00
Jason Ekstrand	4f20c665b4	anv/batch_chain: Call batch_bo_finish at the end of end_batch_buffer The only reason we were calling it in the middle was that one of the cases for figuring out the secondary command buffer execution type wanted batch_bo->length which gets set by batch_bo_finish. It's easy enough to recalculate and now batch_bo_finish is called in a sensible location. Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2018-06-01 14:27:11 -07:00
Jason Ekstrand	e7d0378bd9	anv: Soft-pin client-allocated memory Now that we've done all that refactoring, addresses are now being directly written into surface states by ISL and BLORP whenever a BO is pinned so there's really nothing to do besides enable it. Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2018-06-01 14:27:11 -07:00
Jason Ekstrand	caf41c78ca	anv/allocator: Support softpin in the BO cache Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2018-06-01 14:27:11 -07:00
Jason Ekstrand	b0d50247a7	anv/allocator: Set the BO flags in bo_cache_alloc/import It's safer to set them there because we have the opportunity to properly handle combining flags if a BO is imported more than once. Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2018-06-01 14:27:10 -07:00
Scott D Phillips	27cc68d9e9	anv: For pinned BOs, skip relocations, but track bo usage References to pinned BOs won't need to be relocated at a later point, so just write the final value of the reference into the bo directly. Add a `set` to the relocation lists for tracking dependencies that were previously tracked by relocations. When a batch is executed, we add the referenced pinned BOs to the exec list. v2: - visit bos from the dependency set in a deterministic order (Jason) v3: - compar => compare, drat (Jason) - Reworded commit message, provided by (Jordan) Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-06-01 14:27:10 -07:00
Scott D Phillips	c7db0ed4e9	anv: Use a separate pool for binding tables when soft pinning Soft pinning lets us satisfy the binding table address requirements without using both sides of a growing state_pool. If you do use both sides of a state pool, then you need to read the state pool's center_bo_offset (with the device mutex held) to know the final offset of relocations that target the state pool bo. By having a separate pool for binding tables that only grows in the forward direction, the center_bo_offset is always 0 and relocations don't need an update pass to adjust relocations with the mutex held. v2: - don't introduce a separate state flag for separate binding tables (Jason) - replace bo and map accessors with a single binding_table_pool accessor (Jason) v3: - assert bt_block->offset >= 0 for the separate binding table (Jason) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2018-06-01 14:27:10 -07:00
Scott D Phillips	e662bdb820	anv: Soft-pin state pools The state_pools reserve virtual address space of the full BLOCK_POOL_MEMFD_SIZE, but maintain the current behavior of growing from the middle. v2: - rename block_pool::offset to block_pool::start_address (Jason) - assign state pool start_address statically (Jason) v3: - remove unnecessary bo_flags tampering for the dynamic pool (Jason) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2018-06-01 13:49:22 -07:00
Plamena Manolova	939312702e	i965: Add ARB_fragment_shader_interlock support. Adds suppport for ARB_fragment_shader_interlock. We achieve the interlock and fragment ordering by issuing a memory fence via sendc. Signed-off-by: Plamena Manolova <plamena.manolova@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2018-06-01 16:36:39 +01:00
Scott D Phillips	29a139b308	anv/blorp: Write relocated values into surface states v2 (Jason Ekstrand): - Split the blorp bit into it's own patch and re-order a bit - Use anv_address helpers Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-05-31 16:51:47 -07:00
Jason Ekstrand	bf34ef16ac	anv: Use an address for each anv_image plane This is better than having BO and offset fields. Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2018-05-31 16:51:46 -07:00
Jason Ekstrand	1f2328c3b7	anv/cmd_buffer: Rework surface relocation helpers This commit renames add_surface_state_reloc to add_surface_reloc and makes it takes an address. We also rename add_image_view_relocs to add_surface_state_relocs because it takes an anv_surface_state and doesn't really care about the image view anymore. Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2018-05-31 16:51:46 -07:00
Jason Ekstrand	f270a09737	anv: Use an anv_address in anv_buffer Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2018-05-31 16:51:46 -07:00
Jason Ekstrand	8a8bd39d5e	anv/cmd_buffer: Use anv_address for handling indirect parameters Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2018-05-31 16:51:46 -07:00
Jason Ekstrand	1029458ee3	anv: Use an anv_address in anv_buffer_view Instead of storing a BO and offset separately, use an anv_address. This changes anv_fill_buffer_surface_state to use anv_address and we now call anv_address_physical and pass that into ISL. Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2018-05-31 16:51:46 -07:00
Jason Ekstrand	de1c5c1b50	anv: Use full anv_addresses in anv_surface_state This refactors surface state filling to work entirely in terms of anv_addresses instead of offsets. This should make things simpler for when we go to soft-pin image buffers. Among other things, add_image_view_relocs now only cares about the addresses in the surface state and doesn't really need the image view anymore. Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2018-05-31 16:51:46 -07:00
Jason Ekstrand	94081ffc80	anv: Add some anv_address helpers Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2018-05-31 16:51:46 -07:00
Scott D Phillips	aaea46242d	anv: Add vma_heap allocators in anv_device These will be used to assign virtual addresses to soft pinned buffers in a later patch. Two allocators are added for separate 'low' and 'high' virtual memory areas. Another alternative would have been to add a double-sided allocator, which wasn't done here just because it didn't appear to give any code complexity advantages. v2 (Scott Phillips): - rename has_exec_softpin to use_softpin (Jason) - Only remove bottom one page and top 4 GiB from virt (Jason) - refer to comment in anv_allocator about state address + size overflowing 48 bits (Jason) - Mention hi/lo allocators vs double-sided allocator in commit message (Chris) - assign state pool memory ranges statically (Jason) v3 (Jason Ekstrand): - Use (LOW\|HIGH)_HEAP_(MIN\|MAX)_ADDRESS rather than (1 << 31) for determining which heap to use in anv_vma_free - Only return de-canonicalized addresses to the heap Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2018-05-31 16:51:46 -07:00
Jason Ekstrand	6e4672f881	intel/common: Add an address de-canonicalization helper Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2018-05-31 16:51:45 -07:00
Francisco Jerez	4bd2047dee	intel/fs: Add explicit last_rt flag to fb writes orthogonal to eot. When using multiple RT write messages to the same RT such as for dual-source blending or all RT writes in SIMD32, we have to set the "Last Render Target Select" bit on all write messages that target the last RT but only set EOT on the last RT write in the shader. Special-casing for dual-source blend works today because that is the only case which requires multiple RT write messages per RT. When we start doing SIMD32, this will become much more common so we add a dedicated bit for it. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Matt Turner <mattst88@gmail.com>	2018-05-29 15:44:50 -07:00
Francisco Jerez	d3cd6b7215	intel/fs: Replace the CINTERP opcode with a simple MOV The only reason it was it's own opcode was so that we could detect it and adjust the source register based on the payload setup. Now that we're using the ATTR file for FS inputs, there's no point in having a magic opcode for this. v2 (Jason Ekstrand): - Break the bit which removes the CINTERP opcode into its own patch Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Matt Turner <mattst88@gmail.com>	2018-05-29 15:44:50 -07:00
Francisco Jerez	39de901a96	intel/fs: Use the ATTR file for FS inputs This replaces the special magic opcodes which implicitly read inputs with explicit use of the ATTR file. v2 (Jason Ekstrand): - Break into multiple patches - Change the units of the FS ATTR to be in logical scalars Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Matt Turner <mattst88@gmail.com>	2018-05-29 15:44:50 -07:00
Francisco Jerez	4bfa2ac2ea	intel/fs: Rename a local variable so it doesn't shadow component() v2 (Jason Ekstrand): - Break the refactor into its own patch Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2018-05-29 15:44:50 -07:00
Francisco Jerez	11c71f0e75	intel/eu: Remove brw_codegen::compressed_stack. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2018-05-29 15:44:50 -07:00
Jason Ekstrand	71a86d1fc6	intel/fs: Use groups for SIMD16 LINTERP on gen11+ This is better than compression control because it naturally extends to SIMD32. v2: - Push/pop instruction state around adjusted codegen (Ken) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2018-05-29 15:44:50 -07:00
Jason Ekstrand	a1a850cd34	intel/fs: Assert that the gen4-6 plane restrictions are followed The fall-back does not work correctly in SIMD16 mode and the register allocator should ensure that we never hit this case anyway. Reviewed-by: Matt Turner <mattst88@gmail.com>	2018-05-29 15:44:50 -07:00
Scott D Phillips	4714784dae	anv: move canonical_address calculation into a separate function A later patch will make use of this in other places. Also, remove dependency on undefined behavior of left-shifting a signed value. v2: - move function into a separate header (Chris) v3: (by Ken) Add new header to the various build systems. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-05-27 19:24:33 -07:00

1 2 3 4 5 ...

3061 commits