fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-19 17:58:09 +02:00

Author	SHA1	Message	Date
Alyssa Rosenzweig	1dd513727d	agx: Handle centroid and sample interpolation Works great now that all the infrastructure is wired up. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23480>	2023-06-07 03:21:49 +00:00
Alyssa Rosenzweig	b7f130fbbc	agx: Model interpolation for iter instructions Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23480>	2023-06-07 03:21:49 +00:00
Alyssa Rosenzweig	2548293e8b	agx: Split iter and iterproj instructions These are different (though related) instructions. I've split them in applegpu, let's mirror that here. This simplifies the IR a bit. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23480>	2023-06-07 03:21:49 +00:00
Alyssa Rosenzweig	b9b71bcae6	asahi,agx: Call lower_discard_zs_emit in the driver The driver needs to lower MSAA (because only it knows the sample count). MSAA lowering depends on discards getting lowered (in order to get sample masks on the discards for sample shading to work properly). Discard lowering depends on all discards emitted. But the driver needs to lower clip planes which generates discards. To break the circular dependency, we have the driver call the discard lowering pass itself (in between lowering clip planes and lowering MSAA). Technically, this is probably a layering violation but it's the least gross solution I see. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23480>	2023-06-07 03:21:49 +00:00
Alyssa Rosenzweig	398851ca53	agx: Lower discard in NIR We already lower discard in NIR when depth/stencil writes are used in the shader. In this patch, we extend that lowering for when depth/stencil writes are not used, in which case the discard is lowered to a sample_mask instruction. This is a step towards multisampling, since the old lowering assumed single-sample and there's no way to express a sample mask with a standard NIR discard instructions so we need to lower in NIR anyway for sample shading (i.e. if a discard_if diverges between samples in a pixel). This changes the lowering for discard_if to be free of control flow (instead executing a sample mask instruction unconditionally). This seems to be slightly faster in SuperTuxKart and slightly slower in Dolphin, but I'm not too worried right now. To make this work, we do need some extra lowering to ensure we always execute a sample_mask instruction, in case a discard_if is buried in other control flow (as occurs with Dolphin's ubershaders). So that's added too. We need that for MSAA anyway, so pardon the line count. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23480>	2023-06-07 03:21:49 +00:00
Alyssa Rosenzweig	989d6fd378	agx: Enable tag writes when sample mask written Including indirectly via discard/demote. Fixes graphical artefacts in Chromium when API sample masks are hooked up, which will result in fragment programs that do not write colour/depth but do a lone sample mask write. These need tag writes enabled (according to a trace from Metal for a case constructed to test this scenario). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23480>	2023-06-07 03:21:49 +00:00
Alyssa Rosenzweig	f514d49ae2	agx: Handle sample_mask_agx 1:1 translation. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23480>	2023-06-07 03:21:49 +00:00
Alyssa Rosenzweig	73bbf43bc0	agx: Plumb in nir_intrinsic_load_sample_mask_in We have a special register for this, although this will need some lowering for glSampleMask. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23480>	2023-06-07 03:21:49 +00:00
Alyssa Rosenzweig	6fd16dd7c9	agx: Model both sources of sample_mask We need to control both sources to implement multisampling properly. The semantic is something like: foreach sample in the first mask { if correspond bit in second bit set { make sample live } else { make sample dead } } But I'm reticent to document more formally until the details are really understood and properly tested. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23480>	2023-06-07 03:21:49 +00:00
Alyssa Rosenzweig	bffbe099df	asahi: Set uses_sample_shading for background program If we read gl_SampleID we need the lowering, even though we don't call into gather_info to set the bit for us. So set the bit manually. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23480>	2023-06-07 03:21:49 +00:00
Alyssa Rosenzweig	0b95d81150	agx: Assert that sample shading is lowered Lest someone mess this up later and then try to "implement" these intrinsics in the backend. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23480>	2023-06-07 03:21:48 +00:00
Alyssa Rosenzweig	46a5a99d24	asahi: Add alpha-to-coverage (and alpha-to-one) lowering This should probably be shared code but meh. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23480>	2023-06-07 03:21:48 +00:00
Alyssa Rosenzweig	51e868f3a2	asahi: Add passes to lower sample intrinsics Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23480>	2023-06-07 03:21:48 +00:00
Alyssa Rosenzweig	f28962e29a	asahi: Add passes to lower MSAA Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23480>	2023-06-07 03:21:48 +00:00
Alyssa Rosenzweig	70b8babe3c	agx: Use textures_used, not num_textures The latter doesn't account for holes. Fixes regression in Neverball on Asahi. Fixes: `e607a89f` ("mesa/main: ff-fragshader to nir") Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23480>	2023-06-07 03:21:48 +00:00
Alyssa Rosenzweig	f1c2ea99e2	agx: Constant fold when optimizing int64 Otherwise we can get bcsel(false, ...) in the final optimized code, which isn't great. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23480>	2023-06-07 03:21:48 +00:00
Alyssa Rosenzweig	9641fba9ba	agx: Set support_16bit_alu Allows some more optimizations. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23480>	2023-06-07 03:21:48 +00:00
Alyssa Rosenzweig	99a00e2247	treewide: Use nir_trim_vector more Via Coccinelle patches @@ expression a, b, c; @@ -nir_channels(b, a, (1 << c) - 1) +nir_trim_vector(b, a, c) @@ expression a, b, c; @@ -nir_channels(b, a, BITFIELD_MASK(c)) +nir_trim_vector(b, a, c) @@ expression a, b; @@ -nir_channels(b, a, 3) +nir_trim_vector(b, a, 2) @@ expression a, b; @@ -nir_channels(b, a, 7) +nir_trim_vector(b, a, 3) Plus a fixup for pointless trimming an immediate in RADV and radeonsi. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23352>	2023-06-06 18:52:25 +00:00
Alyssa Rosenzweig	68eda9456f	treewide: Use nir_tex_src_for_ssa Via Coccinelle patch: @@ expression a, b, c; @@ -a.src = nir_src_for_ssa(b); -a.src_type = c; +a = nir_tex_src_for_ssa(c, b); @@ expression a, b, c; @@ -a.src_type = c; -a.src = nir_src_for_ssa(b); +a = nir_tex_src_for_ssa(c, b); Plus manual fixups, including... * a few identity swizzles changed to nir_trim_vector in TTN and prog-to-nir to fix the Coccinelle-botched formatting, and similarly a pointless nir_channels * collapsing a now-pointless temp in vtn Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23352>	2023-06-06 18:52:25 +00:00
Konstantin Seurer	13c9b490a7	asahi: Reformat using the new style Now, that the foreach macro list is complete (I hope), let's reformat drivers that enforce correct formatting in CI. Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23275>	2023-05-29 21:06:12 +00:00
Konstantin Seurer	e3773c4395	asahi: Use the Mesa base style Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23275>	2023-05-29 21:06:12 +00:00
Alyssa Rosenzweig	d6b8acbee9	agx: Use common combine_all_barriers callback This contains a bugfix: execution scopes are now respected when combining barriers. Otherwise control barriers can disappear during combining, which is wrong. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23181>	2023-05-24 17:30:03 +00:00
Alyssa Rosenzweig	ecd295bb8b	treewide: Avoid nir_lower_regs_to_ssa calls nir_registers are only supposed to be used temporarily. They may be created by a producer, but then must be immediately lowered prior to optimizing the produced shader. They may be created internally by an optimization pass that doesn't want to deal with phis, but that pass needs to lower them back to phis immediately. Finally they may be created when going out-of-SSA if a backend chooses, but that has to happen late. Regardless, there should be no case where a backend sees a shader that comes in with nir_registers needing to be lowered. The two frontend producers of registers (tgsi_to_nir and mesa/st) both call nir_lower_regs_to_ssa to clean up as they should. Some backend (like intel) already depend on this behaviour. There's no need for other backends to call nir_lower_regs_to_ssa too. Drop the pointless calls as a baby step towards replacing nir_register. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23181>	2023-05-24 17:30:03 +00:00
Alyssa Rosenzweig	e5867b0dca	asahi: Use common hexdump utility We just moved it into common. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23088>	2023-05-19 16:30:44 +00:00
Alyssa Rosenzweig	01e9ee79f7	nir: Drop unused name from nir_ssa_dest_init Since `624e799cc3` ("nir: Drop nir_ssa_def::name and nir_register::name"), SSA defs don't have names, making the name argument unused. Drop it from the signature and fix the call sites. This was done with the help of the following Coccinelle semantic patch: @@ expression A, B, C, D, E; @@ -nir_ssa_dest_init(A, B, C, D, E); +nir_ssa_dest_init(A, B, C, D); Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23078>	2023-05-17 23:46:16 +00:00
Alyssa Rosenzweig	c323762f9f	treewide: Stop lowering legacy atomics There are no more producers of legacy atomics so these calls are inert. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23036>	2023-05-16 22:36:21 +00:00
Alyssa Rosenzweig	65469d6b23	agx: Lower legacy atomics sooner Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23036>	2023-05-16 22:36:21 +00:00
Alyssa Rosenzweig	f5d73a9989	agx: Use unified atomics Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22914>	2023-05-12 20:39:46 +00:00
Asahi Lina	0a398b0ef9	ail: Add MSAA tests This tests the following matrix: - Format: RGBA8Unorm, RGBA16Unorm, RGBA32Float - Samples: 2 or 4 - Layers: 1 or 2 - Width: Interesting values 1..4097 - Height: Interesting values 1..4097 Compression is based on the dimensions (that is, everything that can be compressed is). This test compares both the total texture size and the compression metadata offset. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22971>	2023-05-11 23:24:48 +00:00
Alyssa Rosenzweig	e918509284	ail: Handle larger block sizes We need to support up to 16 bytes/sample * 4 samples/pixel = 64 bytes/pixel for multisampling to work with formats like RGBA32F. Fixes dEQP-GLES3.functional.fbo.msaa.4_samples.rgba32f Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22971>	2023-05-11 23:24:48 +00:00
Asahi Lina	59a6c5b357	ail: Implement multisampling for compression meta calculation For multisampled textures, the decision about whether to compress or not is based on the effective width and height in samples, not pixels. Introduce ail_can_compress() to encode this logic in ail, so the driver can use it to decide whether to compress or not before the full layout is determined. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22971>	2023-05-11 23:24:48 +00:00
Asahi Lina	94c9115aa0	asahi: Make bo->writer_syncobj atomic BOs can be written from several contexts, so writing to this member is racy. We only care about this for the purposes of exporting BOs after a submission (and if the app is racing writers/submissions at that point all bets are off), so just keeping track of the last written value is sufficient. Switch to atomic operations to eliminate the race, and drop the assert in the batch cleanup path that no longer holds when the BO might have been written to from another context. Fixes: asahi/mesa#20 Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22971>	2023-05-11 23:24:48 +00:00
Asahi Lina	f8b055eb96	asahi: Partially identify some missing index list stuff Still unclear what the extra 2 blocks do, but at least we know the size/order now. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22971>	2023-05-11 23:24:48 +00:00
Asahi Lina	64a595291e	asahi: Add some more system registers Core and opfifo stuff from the compute helper blob, vm_slot because it was the only one changing when I poked around yesterday and it hit me what it was ^^ Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22971>	2023-05-11 23:24:48 +00:00
Asahi Lina	e92ff4f809	asahi: Add missing stdbool include to lib/hexdump.h Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22971>	2023-05-11 23:24:47 +00:00
Alyssa Rosenzweig	5a80bf2eb0	agx: Optimize multiplies We have an imad instruction and our iadd has a small immediate shift on the second source. Together, these allow expressing lots of integer multiplies more efficiently. Add some rules to optimize these now that the backend compiler can ingest the optimized forms. Half-register changes are from load_const scheduling changing in some vertex shaders. total instructions in shared programs: 1539092 -> 1537949 (-0.07%) instructions in affected programs: 167896 -> 166753 (-0.68%) total bytes in shared programs: 10543012 -> 10533866 (-0.09%) bytes in affected programs: 1218068 -> 1208922 (-0.75%) total halfregs in shared programs: 483180 -> 483448 (0.06%) halfregs in affected programs: 1942 -> 2210 (13.80%) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22695>	2023-05-11 09:23:23 -04:00
Alyssa Rosenzweig	c2793a304d	agx: Fix packing of imsub instructions The negate for imad is on the third source (a * b - c), not the second source. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22695>	2023-05-11 09:23:23 -04:00
Alyssa Rosenzweig	8289fa253b	agx: Handle imadshl_agx, imsubshl_agx Same hardware instructions as iadd/isub/imad/imsub, just with the extra input represented in NIR as required. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22695>	2023-05-11 09:23:23 -04:00
Alyssa Rosenzweig	3df4ae3334	agx: Use nir_alu_src_as_uint Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22695>	2023-05-11 09:23:04 -04:00
Asahi Lina	3f55eff0e5	asahi: Assert that freed BOs have no pending writers This is just a sanity check, I haven't actually hit this case but if we ever do something is very broken (e.g. BO refcounting bug). Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22891>	2023-05-07 09:10:37 -04:00
Alyssa Rosenzweig	e79e743674	agx: Lower I/O to scalar later This lets us preserve vectorized stores for transform feedback shaders. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22891>	2023-05-07 09:10:36 -04:00
Alyssa Rosenzweig	a561a6c468	agx: Validate that collect sources are the same size RA asserts this, but by then if you've messed it up, the failure is inscrutable. Let's check it in the validator for more pleasant debugging. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22891>	2023-05-07 09:10:36 -04:00
Alyssa Rosenzweig	9337f6a865	agx: Rework z/s emit We were being sloppy with the sizes before. It mostly worked out, but there were some corner cases where we would end up with mixed sized collects and that won't end well for us. Let's rework the logic to make all the sizes explicit in NIR -- 32-bit for depth and 16-bit stencil -- and then do the needed promotions to make it happen in the AGX IR side. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22891>	2023-05-07 09:10:36 -04:00
Alyssa Rosenzweig	f4f9269b66	agx: Ensure load_frag_coord has the right sizes In case .x isn't read, it'll be null which has the wrong size and will fail the validation added later in this series. We fix this by padding with sized undefs (something that exists of defined size but undefined value) rather than nothingness (of undefined size). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22891>	2023-05-07 09:10:36 -04:00
Alyssa Rosenzweig	7f71e1bc2d	agx/lower_address: Match multiplies, not only shifts Sometimes a shader might index with a non-power-of-two stride. For example, if it's indexing into an array of structures where the structure size is not a power of two, we'll get a multiply with a constant as opposed to a shift. We want to handle these cases, too. To do so, we generalize our pattern matching to look for any kind of multiply (with our new helper), rather than hardcoding logic for ishl. This eliminates right-shifts in a pile of compute shaders, which makes me happy from a "I read lots of shader assembly when debugging" perspective. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22891>	2023-05-07 09:10:36 -04:00
Alyssa Rosenzweig	032d7bd302	agx/lower_address: Add helper to match multiplies Currently, we hardcode logic in the addressing chasing code to look for ishl instructions that shift by constants. We can generalize this to looking for integer multiplies by constants to optimize more addressing patterns. Add a helper to do so. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22891>	2023-05-07 09:10:36 -04:00
Alyssa Rosenzweig	31c805d0aa	agx: Don't wait at the end of the shader This is totally pointless. This saves some waits at the ends of compute kernels (waiting for stores to complete before terminating the thread). I don't know how much this would matter for performance, since the hardware may have to do these waits internally, but it makes the generated code less silly which is always nice. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22891>	2023-05-07 09:05:39 -04:00
Alyssa Rosenzweig	87e57eae09	asahi: Rename no colour output to tag write disable Comparison with PowerVR's XML shows that this is the actual name... And it needs to be set a bit more carefully than "no colour output" in order to get correct behaviour for depth-only passes that use sample mask / discard. Fix the name first, the extra conditions will come when they're needed for multisampling. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22891>	2023-05-07 09:05:39 -04:00
Alyssa Rosenzweig	e13f9caa25	agx: Fix packing for iadd with shift Wrong bit pattern was packed, oops. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22891>	2023-05-07 09:05:39 -04:00
Alyssa Rosenzweig	cd7e016961	asahi: Use device_load shift for VBO loads When possible. Only occassionally possible because the loads are pretty limited in the addressing arithmetic. This probably doesn't matter for performance but it saves some noise in dEQP tests which makes for nicer debugging, plenty of optimizations end up worth it for that alone. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22891>	2023-05-07 09:05:39 -04:00

842 commits