fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-23 15:08:09 +02:00

Author	SHA1	Message	Date
Matt Turner	d37d9f84ac	i965: Mark functions static Cuts 300 bytes of .text Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2017-08-21 14:45:44 -07:00
Matt Turner	f30902629c	i965/vec4: Use 'class' src_reg, rather than 'struct' src_reg Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2017-08-21 14:45:44 -07:00
Matt Turner	a77d5b28ac	i965/vec4: Return float from spill_cost_for_type() Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2017-08-21 14:45:44 -07:00
Matt Turner	76f36607b0	anv: Move clamp_int64() inside the IVB check It's only used in the gen7_cmd_buffer_emit_scissor() function. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2017-08-21 14:45:44 -07:00
Matt Turner	a98b1a8922	i965: Optimize reading the destination type brw_hw_type_to_reg_type() needs to know only whether the file is BRW_IMMEDIATE_VALUE or not, which is not a valid file for the destination. gcc and clang will evaluate __builtin_strcmp() at compile time, so we can use it to pass a constant file for the destination. text data bss dec hex filename 7816214 346248 420496 8582958 82f72e i965_dri.so before 7816070 346248 420496 8582814 82f69e i965_dri.so after Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2017-08-21 14:05:23 -07:00
Matt Turner	91ef949054	i965: Mark brw_hw_type_to_reg_type() as a pure function text data bss dec hex filename 7816886 346248 420496 8583630 82f9ce i965_dri.so before 7816214 346248 420496 8582958 82f72e i965_dri.so after Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2017-08-21 14:05:23 -07:00
Matt Turner	e07fe89035	i965: Hide the register type hardware encodings So we stop mixing them with the logical enum. Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2017-08-21 14:05:23 -07:00
Matt Turner	4fab67a441	i965: Stop using hardware register types directly Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2017-08-21 14:05:23 -07:00
Matt Turner	c746f1c888	i965: Add brw_hw_reg_type_to_letters() and use it in brw_disasm.c Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2017-08-21 14:05:23 -07:00
Matt Turner	6a2471b501	i965: Move brw_reg_type_letters() as well And add "to_" to the name for consistency with the other functions in this file. Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2017-08-21 14:05:23 -07:00
Matt Turner	1cb0a7941b	i965: Switch to using the logical register types Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2017-08-21 14:05:23 -07:00
Matt Turner	cb2cd462b1	i965: Add functions to abstract access to register types Previously the brw_inst{,_set}_{dst,src0,src1}_reg_type() functions provided access to the hardware encodings for the register types. We often mixed these with the logical BRW_REGISTER_TYPE_* enums (which themselves used to be the hardware format!) with bad results. With that functionality now available with the hw_ versions (see previous commit), we now add functions that take the logical BRW_REGISTER_TYPE_* enums and convert into the hardware format and vice versa. To do the conversion we also have to provide the file. Note the asymmetry between the two functions: the new getter reads the file from the instruction word, and to ensure that is always set the setter writes both the file and the type. Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2017-08-21 14:05:23 -07:00
Matt Turner	9fb8323328	i965: Rename brw_inst's functions that access the register type Put hw_ in the name so that it's clear these are the hardware encodings. Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2017-08-21 14:05:23 -07:00
Matt Turner	3e379af492	i965: Index brw_hw_reg_type_to_size()'s table by logical type I'll be transitioning everything to use the logical types. Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2017-08-21 14:05:23 -07:00
Matt Turner	c1ac1a3d25	i965: Add a brw_hw_type_to_reg_type() function Will be used in later commits. Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2017-08-21 14:05:23 -07:00
Matt Turner	dbe7dd13dd	i965: Use a common table to translate logical to hardware types Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2017-08-21 14:05:23 -07:00
Matt Turner	bfcc9aa829	i965: Extract functions dealing with register types to separate file I'm going to encapsulate all of the logic dealing with register types in this file. Rename the parameters for the hardware encodings from type -> hw_type at the same time. Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2017-08-21 14:05:23 -07:00
Matt Turner	890f863da0	i965: Reverse file/type arguments to register type functions I think of the initial arguments as "state" and the last as the actual subject. Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2017-08-21 14:05:23 -07:00
Matt Turner	92f787ff86	i965: Add support for disassembling 64-bit integer immediates After the last patch converted things into enums, I helpfully got a compiler warning about these missing from the switch statement. Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2017-08-21 14:05:23 -07:00
Matt Turner	deae25ce37	i965: Use separate enums for register vs immediate types The hardware encodings often mean different things depending on whether the source is an immediate. Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2017-08-21 14:05:23 -07:00
Matt Turner	8815b9677f	i965: Reorder brw_reg_type enum values These vaguely corresponded to the hardware encodings, but that is purely historical at this point. Reorder them so we stop making things "almost work" when mixing enums. The ordering has been chosen so that no enum value is the same as a compatible hardware encoding. Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2017-08-21 14:05:23 -07:00
Matt Turner	ce6b8627d8	i965: Validate destination restrictions with vector immediates Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2017-08-21 14:05:23 -07:00
Matt Turner	1d79c828d8	i965: Don't let raw-move check be tricked by immediate vector types UB and B type encodings are the same as UV and VF. Noticed when writing the following patch. Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2017-08-21 14:05:23 -07:00
Matt Turner	48aa6ecb87	i965: Only change type of 0.0f to VF if destination stride == 1 The destination stride must be equivalent to a dword if VF is used. Also, since the only compaction table entries with "i:vf" have the destination as "r:f", specifically check that the destination is of type float. Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2017-08-21 14:05:23 -07:00
Matt Turner	56a676eed2	i965: Remove CONT/BREAK from instruction compaction test These cannot be compacted. A similar mistake was fixed in commit `90eaf01616` Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2017-08-21 14:05:23 -07:00
Matt Turner	3d661e6062	i965: Test instruction compaction on all supported Gens Note that there's no point in testing on G45, since its compaction is the same as Gen5. Same logic applies to Gen7 variants and low-power parts. Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2017-08-21 14:05:23 -07:00
Matt Turner	9ff7d9b853	i965: Silence signed/unsigned comparison warning Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2017-08-21 14:05:23 -07:00
Matt Turner	eac89911e5	i965: Move compaction "prepass" into brw_eu_compact.c Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2017-08-21 14:05:23 -07:00
Matt Turner	17641f6388	i965: Mark src inst pointer const in compaction code Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com>	2017-08-21 14:05:23 -07:00
Topi Pohjolainen	393ec1a507	intel/blorp: Adjust intra-tile x when faking rgb with red-only v2 (Jason): Adjust directly in surf_fake_rgb_with_red() Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=101910 CC: mesa-stable@lists.freedesktop.org Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2017-08-21 09:55:08 +03:00
Kenneth Graunke	6f8a577ed2	anv: Use ISL for emitting null surface states. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-08-19 00:46:48 -07:00
Kenneth Graunke	5db9757bd7	isl: Add a null surface fill function. ISL already offers functions to fill out most kinds of SURFACE_STATE, so why not handle null surfaces too? Null surfaces are simple, so we can just take the dimensions, rather than an entire fill structure. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-08-19 00:46:36 -07:00
Eric Anholt	9caba0f16f	anv: Move a comment that got left behind in the u_vector refactor.	2017-08-18 11:56:58 -07:00
Jason Ekstrand	1af8342b0c	intel/isl: Replace switch statements of doom with a macro Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-08-17 18:09:05 -07:00
Jason Ekstrand	2d68d27071	intel/isl: Reduce header file duplication Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-08-17 18:09:05 -07:00
Jason Ekstrand	bf1d2e84f3	anv/gem: Add a stub for sync_file_merge This fixes make check Fixes: `5c4e4932e0`	2017-08-16 18:44:26 -07:00
Jason Ekstrand	98983503cb	anv: Advertise VK_KHR_external_semaphore Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-08-15 19:08:26 -07:00
Jason Ekstrand	55bce22d8d	anv: Use DRM sync objects for external semaphores when available Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-08-15 19:08:26 -07:00
Jason Ekstrand	f41a0e4b0d	anv/gem: Add drm syncobj support Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-08-15 19:08:26 -07:00
Jason Ekstrand	5c4e4932e0	anv: Implement support for exporting semaphores as FENCE_FD Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-08-15 19:08:26 -07:00
Jason Ekstrand	e4054ab77b	anv/gem: Use EXECBUFFER2_WR when the FENCE_OUT flag is set Reviewed-by: Chad Versace <chadversary@chromium.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-08-15 19:08:26 -07:00
Jason Ekstrand	017cdb10cf	anv: Submit a dummy batch when only semaphores are provided. Vulkan allows you to do a submit whose only job is to wait on and trigger semaphores. The easiest way for us to support that right now is to insert a dummy execbuf. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-08-15 19:08:26 -07:00
Jason Ekstrand	031f57eba3	anv: Add a basic implementation of VK_KHX_external_semaphore This patch adds an implementation based on DRM BOs. We don't actually advertise the extension yet because we want to add a couple more paths first. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-08-15 19:08:26 -07:00
Scott D Phillips	d6539608a4	intel/genxml: Fix gen10 BLEND_STATE variable length packing BLEND_STATE packing was modified to be variable-length in: `9670124e31` genxml: Make BLEND_STATE command support variable length array. The initial gen10.xml still had the old, fixed-length style definition for BLEND_STATE. So gen10_upload_blend_state would overwrite the packed BLEND_STATE_ENTRYs with its own fixed array of all-zero entries when packing BLEND_STATE. This caused BLEND_STATE upload to not work at all. Fixes: `aa416f515a` ("i965/genxml: Add gen10.xml") Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2017-08-15 09:06:29 -07:00
Ben Widawsky	8f6e54c929	i965: Pretend that CCS modified images are two planes v2: move is_aux into if block. (Jason) Use else block instead of goto (Jason) v3: Fix up logic for is_aux (Ben) Fix up size calculations and add FIXME (Ben) v4 (Jason Ekstrand): Use the aux_pitch in the image instead of calculating it Signed-off-by: Ben Widawsky <ben@bwidawsk.net> Acked-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-08-14 10:43:30 -07:00
Jason Ekstrand	cf2e92262b	intel/isl: Add support for I915_FORMAT_MOD_Y_TILED_CCS Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-08-14 10:43:30 -07:00
Iago Toral Quiroga	81615ad444	intel/compiler: properly size attribute wa_flags array for Vulkan Mesa will map user defined vertex input attributes to slots starting at VERT_ATTRIB_GENERIC0 which gives us room for only 16 slots (up to GL_VERT_ATTRIB_MAX). This is sufficient for GL, where we expose exactly 16 vertex attributes for user defined inputs, but in Vulkan we can expose up to 28 (which are also mapped from VERT_ATTRIB_GENERIC0 onwards) so we need to account for this when we scope the size of the array of attribute workaround flags that is used during the brw_vertex_workarounds NIR pass. This prevents out-of-bounds accesses in that array for NIR shaders that use more than 16 vertex input attributes. Fixes: dEQP-VK.pipeline.vertex_input.max_attributes.* Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-08-11 10:41:44 +02:00
Kenneth Graunke	5563872dbf	isl: Validate row pitch of stencil surfaces. Also, silence an obnoxious finishme that started occurring for all GL applications which use stencil after the i965 ISL conversion. v2: Check against 3DSTATE_STENCIL_BUFFER's pitch bits when using separate stencil, and 3DSTATE_DEPTH_BUFFER's bits when using combined depth-stencil. Cc: "17.2" <mesa-stable@lists.freedesktop.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-08-10 15:18:58 -07:00
Jason Ekstrand	4d27c6095e	intel/isl: Don't align the height of the last array slice We were calculating the total height of 2D surfaces by multiplying the row pitch by the number of slices. This means that we actually request slightly more space than actually needed since the padding on the last slice is unnecessary. For tiled surfaces this is not likely to make a difference. For linear surfaces, on the other hand, this means we may require additional memory. In particular, this makes the i965 driver reject EGL imports of buffers which do not have this extra padding. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Cc: "17.2" <mesa-stable@lists.freedesktop.org>	2017-08-07 09:31:11 -07:00
Jason Ekstrand	c15b92ce11	intel/isl: Stop padding surfaces The docs contain a bunch of commentary about the need to pad various surfaces out to multiples of something or other. However, all of those requirements are about avoiding GTT errors due to missing pages when the data port or sampler accesses slightly out-of-bounds. However, because the kernel already fills all the empty space in our GTT with the scratch page, we never have to worry about faulting due to OOB reads. There are two caveats to this: 1) There is some potential for issues with caches here if extra data ends up in a cache we don't expect due to OOB reads. However, because we always trash the entire cache whenever we need to move anything between cache domains, this shouldn't be an issue. 2) There is a potential issue if a surface gets placed at the very top of the GTT by the kernel. In this case, the hardware could potentially end up trying to read past the top of the GTT. If it nicely wraps around at the 48-bit (or 32-bit) boundary, then this shouldn't be an issue thanks to the scratch page. If it doesn't, then we need to come up with something to handle it. Up until some of the GL move to ISL, having the padding code in there just caused us to harmlessly use a bit more memory in Vulkan. However, now that we're using ISL sizes to validate external dma-buf images, these padding requirements are causing us to reject otherwise valid images due to the size of the BO being too small. Acked-by: Kenneth Graunke <kenneth@whitecape.org> Tested-by: Tapani Pälli <tapani.palli@intel.com> Tested-by: Tomasz Figa <tfiga@chromium.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Cc: "17.2" <mesa-stable@lists.freedesktop.org>	2017-08-07 09:31:11 -07:00
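
The 4d27c6095e entry above ("Don't align the height of the last array slice") describes a sizing change that is easier to follow with numbers. The following is only a rough sketch of the idea, not the ISL code: the helper names, the parameters, and the use of a per-slice row alignment are all assumptions made for illustration.

```c
#include <stdint.h>

/* Hypothetical helper (not an ISL function): round v up to a multiple of a.
 * Assumes a > 0. */
static uint32_t
align_up(uint32_t v, uint32_t a)
{
   return (v + a - 1) / a * a;
}

/* Older behaviour as described in the commit message: every slice,
 * including the last one, is counted at its padded height. */
static uint32_t
total_rows_padded(uint32_t slice_rows, uint32_t slice_align, uint32_t slices)
{
   return align_up(slice_rows, slice_align) * slices;
}

/* Newer behaviour as described: only the gaps between slices need padding,
 * so the last slice contributes just its own rows. */
static uint32_t
total_rows_tight(uint32_t slice_rows, uint32_t slice_align, uint32_t slices)
{
   if (slices == 0)
      return 0;
   return align_up(slice_rows, slice_align) * (slices - 1) + slice_rows;
}
```

With 4 slices of 30 rows each and an assumed alignment of 8 rows, the padded total is 128 rows and the tight total is 126. For a linear buffer, those two extra rows times the row pitch are the kind of difference that could make an imported buffer look too small.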


2077 commits