third_party_mesa3d

Author	SHA1	Message	Date
Alyssa Rosenzweig	74e92274af	asahi,agx: Use new tilebuffer infrastructure Flag day change to replace the previous hardcoded background/end-of-tile shaders and the API-style load/store_output in fragment shaders with the generated shaders and lowered *_agx intrinsics. This gets us working non-UNORM8 render targets and working MRT. It's also a step in the direction of working MSAA but that needs a lot more work, since the multisampling programming model on AGX is quite different from any of the APIs (including Metal). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	c5c0ea39f6	asahi: Add new clear/reload/store infrastructure With multiple render targets, it's not practical to generate all variants of the background and end-of-tile programs at start up. Rather than trying, add a hash table of meta program keys to background programs, and compile variants as they're needed. With the new infrastructure, it's sensible to handle clears with the same code path as reloads. In addition to getting us closer to multiple render target support, this gets us support for non-RGBA8 render targets, as the u8norm tilebuffer format was baked into the hardcoded clear shader and store shaders used before. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	4f96651f1e	asahi: Use correct tib settings for USC Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	555447769d	asahi: Extend texture descriptor packing for MSAA Multisampling uses different values of the dimension enum in tandem with a new samples field. Handle this in agx_pack_texture (split off here) so we can use the new functionality for texture descriptors in reloads too. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	cc555e0c04	asahi: Remove some bogus asserts Hitting in dEQP-GLES31 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	bbe7d8e4f5	asahi: Implement texture_barrier trivially For the advanced blending tests. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	03dc4bc3e8	asahi: Calculate tilebuffer layout per batch It won't be fixed soon. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	b1f5004ee7	asahi: Add agx_usc_shared_none helper Convenience for vertex USC programs. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	c713197c25	asahi: Add R16 SNORM formats For completeness, since we do have hardware for this. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	d637189d36	asahi: Add more XML via PowerVR These bits are the same as RGX. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	a3907e92da	asahi: Add note to XML about 16-bit varyings Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	94a8fe51d5	asahi: Identify more depth-related fields in XML Needed for gl_FragDepth writes. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	6ce615d852	asahi: Add XML for layered rendering We don't need to support this for a while but it's good to know the mechanism. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	74de571402	asahi: Add NIR pass to lower tilebuffer access The compiler can't handle load/store_output directly for nontrivial tilebuffer layouts. Add a NIR pass to lower these intrinsics, applying a given layout. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	66a680a043	asahi: Add tilebuffer layout helpers Laying out the tilebuffer is nontrivial and a task shared between GL and VK, so add unit-tested helpers. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	5d3243ea2d	asahi: Add some notes about unknowns to the XML Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	363ffa779d	asahi: Identify multisampling fields of shared layout Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	5a20c90508	asahi: Add _with_bo pool uploads Will be useful for managing our meta shaders. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	4a166acc93	agx: Add block_image_store instruction This hw instruction writes out an entire block from the tilebuffer to an attached render target (PBE descriptor). It is used (only?) in end-of-tile shaders to implement write out. We need to handle it in the compiler as a prerequisite to compiling end-of-tile shaders ourselves, instead of hardcoding. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	0e106681e0	agx: Add helper to map pipe formats to agx_formats Or a restricted subset thereof anyway. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	db0461a8d0	agx: Implement nir_texop_txf_ms Mutlisampled texture fetch (txf_ms) is encoded like regular txf. However, we now need to pack the multisample index in the right place, which we do by extending our existing NIR texture source lowering pass. 2D MS arrays use a new value of dim which requires tweaking the encoding slightly. Otherwise, everything is bog standard. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	53d013a605	ail: Handle multisampling It appears that multisampled textures on AGX have all samples of the same pixel contiguous in memory, effectively using the layout of a single-sampled texture with a larger block size. Handle in ail. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	8781aef6b4	asahi: Make libasahi_lib depend on libasahi_decode The track_alloc and track_free symbols are used, we need to link them in. Depending on build flags / environment / etc, fixes the potential build error hit by a CI job: mold: error: undefined symbol: agxdecode_track_alloc >>> referenced by agx_device.c >>> src/asahi/lib/libasahi_lib.a(src/asahi/lib/libasahi_lib.a.p/agx_device.c.o):(agx_shmem_alloc)>>> referenced by agx_device.c >>> src/asahi/lib/libasahi_lib.a(src/asahi/lib/libasahi_lib.a.p/agx_device.c.o):(agx_bo_create) mold: error: undefined symbol: agxdecode_track_free >>> referenced by agx_device.c >>> src/asahi/lib/libasahi_lib.a(src/asahi/lib/libasahi_lib.a.p/agx_device.c.o):(agx_bo_unreference) ...when trying to link with libasahi_lib without libasahi_decode for unit tests. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	6ee6cfec41	asahi: Use PIPE_FORMATs for driver-compiler ABI This avoids exposing the ISA-internal agx_format to the driver, instead hiding it behind a real PIPE_FORMAT. This lets us use real pipe formats in formatted intrinsics in NIR, which is convenient; it will allow us to simplify the compiler/driver ABI; and it lets us use common format helpers (e.g. util_format_get_blocksize) for the internal formats in driver lowering. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	940b871dba	nir: Define AGX intrinsics for local pixel access Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Emma Anholt	7befecf500	turnip: Apply the RB_DBG_ECO_CNTL_blit workaround. On blob v512.490 on a615, using WRAP_GPU_ID to fake GPU versions, I see 0x41 used everywhere, except for BLIT_OP_SCALE on a630. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19794>	2022-11-19 18:28:27 +00:00
Emma Anholt	9076b38610	freedreno: Don't WFI and set RB_DBG_ECO_CNTL if it's not changing. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19794>	2022-11-19 18:28:27 +00:00
Emma Anholt	4ab489a0b7	freedreno: Update RB_DBG_ECO_CNTL/RB_DBG_ECO_CNTL_blit. On blob v512.490, using WRAP_GPU_ID to fake GPU versions, I see 0x41 used everywhere, except for BLIT_OP_SCALE on a630. Define the magic number in dev info so it can be reused in the two places that set the non-BLIT_OP_SCALE value. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19794>	2022-11-19 18:28:27 +00:00
Gert Wollny	be570cd322	r600/sfn: sort FS color outputs before all other outputs The color outputs must be checked against the number of available color buffers, therefore it is best to sort the color outputs to be on the driver locations before the other FS outputs. Fixes: `79ca456b48` r600/sfn: rewrite NIR backend Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7530 Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19804>	2022-11-19 16:59:26 +00:00
Gert Wollny	85e140aa5c	r600: Print RAT instruction names in disassembly Also print the swizzle of the address to indicate what values may be used. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19804>	2022-11-19 16:59:26 +00:00
Gert Wollny	684e90b15c	r600: Update scratch buffer late For some reason the setup that comes after the scratch buffer setup calls clobber the PS output configuration. Emitting the scratch buffer setup as last action before the actual draw commands seems to fix this. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19804>	2022-11-19 16:59:26 +00:00
Rob Clark	394d8e4122	freedreno/drm/virtio: Defer flush on BO free Freeing BOs tends to be bursty (ie. when a submit is retired, or expiring entries from BO cache). Sending lots of small SET_IOVA messages to the host can quickly eat up the available virtqueue slots, resulting in (eventually) starving the guest waiting for free virtqueue space. By batching, we can avoid this and handle things more efficiently on the host (ie. in a single wakeup rather than many). Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19832>	2022-11-19 16:32:25 +00:00
Rob Clark	b4a54824e5	freedreno/drm: Support for batched frees Batch up handles before closing them to give the drm backend a chance to batch up any extra handling needed (ie. virtio batching up messages to host to release IOVA). Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19832>	2022-11-19 16:32:25 +00:00
Rob Clark	e5a60e1df2	freedreno/drm: Add optimized path for freeing many BOs Submits tend to hold references to a lot of BOs, which get unref'd when the submit is destroyed/retired. For now, all this does is reduce lock aquire/release, but the next commit will build on it. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19832>	2022-11-19 16:32:25 +00:00
Alyssa Rosenzweig	d7511ad784	asahi: Add batch tracking logic We already have the notion of an agx_batch, which encapsulates a render pass. Extend the logic to allow multiple in-flight batches per context, avoiding a flush in set_framebuffer_state and improving performance for certain applications designed for IMRs that ping-pong unnecessarily between FBOs. I don't have such an application immediately in mind, but I wanted to get this flag-day out of the way while the driver is still small and flexible. The driver was written from day 1 with batch tracking in mind, so this is a relatively small change to actually wire it up, but there are lots of little details to get right. The code itself is mostly a copy/paste of panfrost, which in turn draws inspiration from freedreno and v3d. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19865>	2022-11-19 15:33:16 +00:00
Alyssa Rosenzweig	de1eb9400f	asahi: Use the batch for submission So we can submit background batches. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19865>	2022-11-19 15:33:16 +00:00
Alyssa Rosenzweig	0d3b4ff2aa	asahi: Use batch_reads for sysvals Required for proper resource tracking. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19865>	2022-11-19 15:33:16 +00:00
Alyssa Rosenzweig	84f623ae7b	asahi: Use a pipe_framebuffer_state batch key More convenient for batch tracking. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19865>	2022-11-19 15:33:16 +00:00
Alyssa Rosenzweig	d36c911b7b	asahi: Use batch instead of ctx for pipelines So we can support multiple batches later. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19865>	2022-11-19 15:33:16 +00:00
Alyssa Rosenzweig	fb7257af4e	asahi: Hide ctx->batch This will make it easier to support multiple batches. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19865>	2022-11-19 15:33:16 +00:00
Alyssa Rosenzweig	3104b1aaaf	asahi: Factor out prepare_for_map This will be expanded, let's expand in the direction of less spaghetti. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19865>	2022-11-19 15:33:16 +00:00
Lionel Landwerlin	9c1c1888d9	intel/fs: put scratch surface in the surface state heap In `4ceaed7839` we made scratch surface state allocations part of the internal heap (mapped to STATE_BASE_ADDRESS::SurfaceStateBaseAddress) so that it doesn't uses slots in the application's expected 1M descriptors (especially with vkd3d-proton). But all our compiler code relies on BSS (STATE_BASE_ADDRESS::BindlessSurfaceStateBaseAddress). The additional issue is that there is only 26bits of surface offset available in CS instruction (CFE_STATE, 3DSTATE_VS, etc...) for scratch surfaces. So we need the drivers to put the scratch surfaces in the first chunk of STATE_BASE_ADDRESS::SurfaceStateBaseAddress (hence all the driver changes). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `4ceaed7839` ("anv: split internal surface states from descriptors") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7687 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19727>	2022-11-19 14:58:58 +00:00
Lionel Landwerlin	daab161535	iris: move bindless surface state heap inside the surface state heap We're about to make scratch surface states part of the surface state heap. Because those are required to be in the low 26bits parts surface state heap (we're limited in bits handed in the CFE_STATE, 3DSTATE_VS, etc... instructions), this change splits the 32bit surface state heap as follow: - 8Mb of surface states for scratch - 1Gb - 8Mb of binding tables - 3Gb of surface states That way all of the surfaces are located within a 4Gb region visible from STATE_BASE_ADDRESS::SurfaceStateBaseAddress Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19727>	2022-11-19 14:58:57 +00:00
Lionel Landwerlin	64f1ae4bc5	iris: prevent crash in decoder Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19727>	2022-11-19 14:58:57 +00:00
Bas Nieuwenhuizen	1b5dc33caa	radv: Convert instance bvh address to node in bvh build. So we don't have to do it in the traversal loop. Should 2 and instructions and a 64-bit shift, so 4/8 cycles per instance node visit. Totals from 7 (0.01% of 134913) affected shaders: CodeSize: 208460 -> 208292 (-0.08%) Instrs: 38276 -> 38248 (-0.07%) Latency: 803181 -> 803142 (-0.00%) InvThroughput: 165384 -> 165376 (-0.00%) Copies: 4912 -> 4905 (-0.14%) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19706>	2022-11-19 14:24:36 +00:00
Bas Nieuwenhuizen	d09ed23b9a	radv: Fiddle with opaque flag positions to reduce instructions. Totals from 7 (0.01% of 134913) affected shaders: CodeSize: 209076 -> 208460 (-0.29%) Instrs: 38374 -> 38276 (-0.26%) Latency: 803899 -> 803181 (-0.09%) InvThroughput: 165530 -> 165384 (-0.09%) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19706>	2022-11-19 14:24:36 +00:00
Bas Nieuwenhuizen	3884210902	radv: Skip and for node_to_addr with bvh_base. Cause the bvh base is always 64 byte aligned. Totals from 7 (0.01% of 134913) affected shaders: CodeSize: 209216 -> 209076 (-0.07%) Instrs: 38402 -> 38374 (-0.07%) Latency: 804537 -> 803899 (-0.08%) InvThroughput: 165663 -> 165530 (-0.08%) Copies: 4919 -> 4912 (-0.14%) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19706>	2022-11-19 14:24:36 +00:00
Bas Nieuwenhuizen	0a26975840	radv: Move ray flag compares out of the loop. To save on and+cmp combos with VALU instructions. Totals from 7 (0.01% of 134913) affected shaders: CodeSize: 208476 -> 209216 (+0.35%) Instrs: 38384 -> 38402 (+0.05%) Latency: 805725 -> 804537 (-0.15%) InvThroughput: 165906 -> 165663 (-0.15%) Copies: 4936 -> 4919 (-0.34%) PreSGPRs: 393 -> 430 (+9.41%) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19706>	2022-11-19 14:24:36 +00:00
Lionel Landwerlin	e2dadda35f	Revert "nir/lower_shader_calls: put inserted instructions into a dummy block" This reverts commit `35d82ecf1e`. Cc: mesa-stable Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19820>	2022-11-19 10:53:18 +00:00
Lionel Landwerlin	3686d5a312	nir/lower_shader_calls: wrap only jumps rather than entire code blocks Moving entire chunks of code into a dummy if block is causing issues in some situations. To work around the issue that we tried to fix in `35d82ecf1e` ("nir/lower_shader_calls: put inserted instructions into a dummy block") which is that we cannot cut and past a block of instruction that ends with a jump if there are more instruction behind where we're going to past. We can instead just wraps the jumps into dummy if blocks. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19820>	2022-11-19 10:53:18 +00:00

1 2 3 4 5 ...

163207 Commits