third_party_mesa3d

Author	SHA1	Message	Date
Jesse Natalie	f05b7225a3	microsoft/clc: Split struct copies before vars_to_ssa in pre-inline optimizations Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29896>	2024-07-22 21:16:58 +00:00
Lionel Landwerlin	1908d2c171	anv: split image view from anv_image.c Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30285>	2024-07-22 18:46:05 +00:00
Lionel Landwerlin	eff01c46d8	anv: split buffer view from anv_image.c Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30285>	2024-07-22 18:46:05 +00:00
Lionel Landwerlin	f5af56528b	anv: split sampler from anv_device.c Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30285>	2024-07-22 18:46:05 +00:00
Lionel Landwerlin	543c726781	anv: split buffer from anv_device.c Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30285>	2024-07-22 18:46:05 +00:00
Lionel Landwerlin	c59e8e814a	anv: split events from anv_device.c Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30285>	2024-07-22 18:46:05 +00:00
Lionel Landwerlin	ca51a02e7b	anv: split physical_device from anv_device.c Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30285>	2024-07-22 18:46:05 +00:00
Lionel Landwerlin	c7ecf10c20	anv: split instance from anv_device.c Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30285>	2024-07-22 18:46:05 +00:00
José Roberto de Souza	69ee1c4b46	anv: Drop useless 'if (total_scratch > 0) {' block in cmd_buffer_ensure_cfe_state() cmd_buffer_ensure_cfe_state() returns ealier if total_scratch == 0 here: if (total_scratch <= comp_state->scratch_size) return; Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30271>	2024-07-22 18:17:38 +00:00
José Roberto de Souza	de5d767f9a	intel/brw: Add a maximum scratch size restriction Gfx 12.5 moved scratch to a surface and SURFTYPE_SCRATCH has this pitch restriction: RENDER_SURFACE_STATE::Surface Pitch For surfaces of type SURFTYPE_SCRATCH, valid range of pitch is: [63,262143] -> [64B, 256KB] The pitch of the surface is the scratch size per thread and the surface should be large enough to accommodate every physical thread. So here adding a new field to intel_device_info, setting it in intel_device_info_init_common() so even offline tools can have it set. And finally adding a check to fail shader compilation if needed scratch is larger than supported. This issue can be reproduced in debug builds when running dEQP-VK.protected_memory.stack.stacksize_1024 on Gfx 12.5 or newer platforms. Ref: BSpec 43862 (r52666) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30271>	2024-07-22 18:17:38 +00:00
Paulo Zanoni	c65a76db85	anv/trtt: don't just crash when we can't find device->trtt.queue Please refer to the big comment this patch introduces. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30252>	2024-07-22 10:04:34 -07:00
Paulo Zanoni	3ab8ff99fa	anv/trtt: fix the process of picking device->trtt.queue We want to use actual sparse-capable queues as the default trtt->queue, not copy queues that may have a companion_rcs_batch. Before this patch, if we expose more than one queue and the application creates a copy queue first, we'll end up setting trtt->queue as the copy queue, which will GPU hang when we submit the TR-TT batches as they don't support the pipe_control commands we issue. The trtt->queue queue is used for binding/unbinding buffers in code paths where there's no specific queue coming from user space, such as when we're creating or destroying a sparse resource. This is not a problem yet on i915.ko since we are exposing only a single queue, and it is not a problem for xe.ko since TR-TT is not the default there. This is also not a problem in applications that create the render or compute queue first. We plan to expose more queues when using TR-TT, so this would become a problem without this patch. None of VK-GL-CTS seems to exercise that, and none of the Steam games I tested exercise that as well. I was able to reproduce this issue using our internal tracing tool. v2: New implementation that doesn't break when we only have a compute queue (Lionel). Fixes: `04bfe828db` ("anv/sparse: allow sparse resouces to use TR-TT as its backend") Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30252>	2024-07-22 10:04:34 -07:00
Paulo Zanoni	5ca224aa0c	anv/trtt: make all contexts have the same TR-TT programming On Gen12 (the oldest we support on Mesa right now for TR-TT) we started having per-engine TR-TT registers and we are supposed to make all contexts share the same TR-TT programming. On LNL+, this is documented in the BSpec page for the TRTT_CNTRL register (68417), with more details in HSDs 14020454786 and 16022013154. On Gen12 platforms this information is a little harder to find and there's a whole trail of HSDs leading up to 1209977595, which links to the documents that describe the programming. BSpec for TR-TT on Gen12 is very confusing as it still contains registers and other information from Gen11 that were not removed. Regarding the additional BLT and COMP registers, please notice that on the BSpec pages for the TR-TT registers, the "Register Instance" section only lists the GFX registers as non-privileged. However, the "User Mode Privileged Commands" lists the other instances of the TR-TT Regsiters as non-privileged, which matches what we see: there's no need to put these addresses in the FORCE_TO_NONPRIV registers. Notice that for now, when TR-TT is being used we only expose a single queue, so this change effectively does nothing until we start exposing extra queues. I left that part for later to help bisectability. v2: - s/trtt_init_context_state/trtt_init_queues_state/ (José) - pass device as the argument to init_queues_state (José) v3: - use async_submit_end (José) Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30252>	2024-07-22 10:04:34 -07:00
Paulo Zanoni	6415027d85	anv/trtt: submit a separate batch in anv_trtt_init_context_state() Having this as a separate batch was the normal behavior until `7da5b1caef` ("anv: move trtt submissions over to the anv_async_submit"). While it certainly sounds better to do everything related to TR-TT initialization in one batch, we need to revert it back to be a separate batch (but now using the new anv_async_submit infrastructure) because we'll want to run this batch on every engine. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30252>	2024-07-22 10:04:34 -07:00
Paulo Zanoni	abbb4b20f3	anv/trtt: check the return value of anv_trtt_init_context_state() I haven't seen this happening anywhere, but let's have it for correctness. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30252>	2024-07-22 10:04:34 -07:00
Paulo Zanoni	fb9d94f4ed	anv/trtt: make genX(init_trtt_context_state) a little more compact In this series we're going to further change this function, adding a lot more lines, so this patch should make the next diffs a little easier to comprehend and review. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30252>	2024-07-22 10:04:34 -07:00
Paulo Zanoni	6bc9a57173	intel/genxml: add the BLT and COMP_CTX0 versions of the TR-TT registers Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30252>	2024-07-22 10:04:33 -07:00
Valentine Burley	0a6cbb3a97	tu/kgsl: Remove unused variable The offset variable declaration at the beginning of the function was left over after the variable was moved inside the if statement. Fixes: `17c12a9924` ("turnip/kgsl: Support external memory via ION/DMABUF buffers") Signed-off-by: Valentine Burley <valentine.burley@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30300>	2024-07-22 14:53:14 +00:00
Valentine Burley	47de27cde1	tu: Implement VK_EXT_device_address_binding_report This extension should be helpful for debugging. Passes dEQP-VK.memory.address_binding_report.* Signed-off-by: Valentine Burley <valentine.burley@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30021>	2024-07-22 14:23:53 +00:00
Valentine Burley	dd10d2ef2f	tu/rmv: Fix missing image bind logging for WSI images Signed-off-by: Valentine Burley <valentine.burley@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30021>	2024-07-22 14:23:53 +00:00
Rohan Garg	fe387e14b5	anv: use the WA infrastructure when emitting WA 16013994831 Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30295>	2024-07-22 13:43:39 +00:00
Rhys Perry	3aa29c47b9	nir/instr_set: hash tex sources commutatively I'm not sure if two otherwise equal texture instructions ever have sources in different orders, but they should be considered equal. ministat of nir_opt_cse: N Min Max Median Avg Stddev x 9 6.586801 6.718673 6.682875 6.6621411 0.047817119 + 9 6.519098 6.609235 6.552997 6.5605604 0.028879587 Difference at 95.0% confidence -0.101581 +/- 0.0394755 -1.52475% +/- 0.585928% (Student's t, pooled s = 0.0395) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Tatsuyuki Ishi <ishitatsuyuki@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30145>	2024-07-22 11:04:01 +00:00
Rhys Perry	b7ceb9d327	nir/instr_set: stop sorting phi sources This is faster. ministat of nir_opt_cse: N Min Max Median Avg Stddev x 9 6.724212 6.84511 6.788336 6.7873378 0.034363882 + 9 6.586801 6.718673 6.682875 6.6621411 0.047817119 Difference at 95.0% confidence -0.125197 +/- 0.0416115 -1.84456% +/- 0.609248% (Student's t, pooled s = 0.0416374) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Tatsuyuki Ishi <ishitatsuyuki@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30145>	2024-07-22 11:04:01 +00:00
Rhys Perry	8b328443e3	nir/instr_set: combine XXH32 calls ministat of nir_opt_cse: N Min Max Median Avg Stddev x 9 7.393408 7.490593 7.434056 7.4338972 0.028150325 + 9 6.724212 6.84511 6.788336 6.7873378 0.034363882 Difference at 95.0% confidence -0.646559 +/- 0.0313916 -8.69745% +/- 0.407925% (Student's t, pooled s = 0.0314111) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Tatsuyuki Ishi <ishitatsuyuki@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30145>	2024-07-22 11:04:01 +00:00
Pierre-Eric Pelloux-Prayer	159a3edd80	egl,gbm,glx: fix log message spam Based on the other similar logs we only want to log when extensions is NULL. Use this opportunity to indicate the source of the log and remove the extra ')' at the end of each line. Fixes: `50fc7cc290` ("glx: directly link to gallium") Reviewed-by: Eric Engestrom <eric@igalia.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30257>	2024-07-22 10:09:34 +00:00
Pierre-Eric Pelloux-Prayer	a55b9c0c60	radeonsi: consider DBG(NO_TILING) when filtering modifiers Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30257>	2024-07-22 10:09:34 +00:00
Pierre-Eric Pelloux-Prayer	94f2b3f7bc	radeonsi: consider PIPE_BIND_LINEAR when filtering modifiers If PIPE_BIND_LINEAR is set the only valid modifier we can use is DRM_FORMAT_MOD_LINEAR. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30257>	2024-07-22 10:09:34 +00:00
Pierre-Eric Pelloux-Prayer	f12ccea6c7	radeonsi: reject modifiers with DCC when NO_EXPORTED_DCC is used Otherwise AMD_DEBUG=noexporteddcc will be ignored when modifier are used. Similarly to AMD_DEBUG=nodcc handling, this makes the application unable to import buffers with DCC as well - the alternative would be to implement the filtering only in the texture creation path, so in the si_modifier_supports_resource function. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30257>	2024-07-22 10:09:34 +00:00
Pierre-Eric Pelloux-Prayer	0c868aa94a	amd: use a valid size for ac_pm4_state allocation If max_dw is smaller than the pm4 array the allocation size would be smaller than sizeof(ac_pm4_state). Fixes: `428601095c` ("ac,radeonsi import PM4 state from RadeonSI") Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30257>	2024-07-22 10:09:34 +00:00
Eric Engestrom	547de1e928	v3d/ci: mark spec@amd_performance_monitor@vc4 tests as flaky Turns out it was not fixed, it just happened to pass a bunch of times in a row, but it actually fails randomly, so mark it as such. Fixes: `4696e9c49b` ("v3d/ci: mark spec@amd_performance_monitor@vc4 tests as fixed") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30290>	2024-07-22 10:56:09 +02:00
Juan A. Suarez Romero	4215d50384	v3d: add new clear blitter op A specific clear_surface blitter operation is required, because in this case we need to save framebuffer information, but not in standard clear, as we are currently doing. This fixes a leak in depthstencil surface, which happens because we were storing saving it as part for the framebuffer information, but the blitter clear wasn't restoring it because it wasn't required (only it is required in clearing a surface). Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30240>	2024-07-22 08:11:57 +00:00
Juan A. Suarez Romero	7158950a6f	v3d: use operations to specify what to save in blitter So far, in order to know what we need to save before using the blitter utility, we use a boolean to know if we are going to do a blit or not, and if we need to take in account conditional rendering or not. In other to allow to specify more operations than blit or not, use an enum to define what operation we would like to do, and based on that what information we need to store. This also merge the conditional rendering as part of these operations. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30240>	2024-07-22 08:11:57 +00:00
Christian Gmeiner	c83330cde0	etnaviv: isa: left shift is 3 bit long Blob generates such a shift for piglit's generated_tests/cl/builtin/int/builtin-int-abs-1.0.generated.cl Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30232>	2024-07-22 07:12:28 +00:00
Icenowy Zheng	5f22e152ad	gallivm: orcjit: use atexit to release LPJit singleton at exit Valgrind will report some memory possibly lost because of this singleton (it's dynamically allocated when it is first accessed). Use atexit() to register a handler that releases this singleton. Signed-off-by: Icenowy Zheng <uwu@icenowy.me> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30216>	2024-07-21 23:17:17 +00:00
Icenowy Zheng	3423e73cec	gallivm: orcjit: keep the ownership of tm for LPJit The ownership of the TargetMachine object is released when LPJit singleton is constructed, leads to a slight memory loss detectable. Keep the ownership by saving the unique pointer as another class member named tm_unique. Signed-off-by: Icenowy Zheng <uwu@icenowy.me> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30216>	2024-07-21 23:17:17 +00:00
David Heidelberg	dc6e6d7a2b	freedreno: Enable Adreno 306A Enable the Adreno 306A that is found on the QM215 SoC (Qualcomm 215). The GPU marketing name is Adreno 308. Kernel patch: https://lore.kernel.org/linux-arm-msm/20240528-a306a-v1-1-03a66dacd8c7@gmail.com/ Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29401>	2024-07-21 21:31:06 +00:00
Eric Engestrom	5c5df9376f	venus: initialize bitset in CreateDescriptorPool() Fixes: `de5879447b` ("Track bitset when create descriptor pool") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30286>	2024-07-21 17:44:05 +00:00
Eric Engestrom	324ccd7430	nak: fix meson typo Fixes: `95bff5ca5b` ("nak: Add minimum bindgen requirement") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30288>	2024-07-21 17:58:53 +02:00
Faith Ekstrand	0cc23b6524	nak: Move creation of nak_shader_info to ShaderBin::new() This makes ShaderBin a bit more self-contained since it can now be created from just a compiled shader and info. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30283>	2024-07-20 14:40:26 -05:00
Faith Ekstrand	618dfc73e6	nak: Move a few more things to VtgIoInfo This moves clip/cull and XFB and uses the recorded attributes to figure out writes_layer and writes_point_size. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30283>	2024-07-20 14:40:26 -05:00
Faith Ekstrand	f39b645c66	nak: Rework fragment shader stage info This adds a FragmentShaderStageInfo, moves uses_kill and does_interlock there, and adds fields for API depth test bits. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30283>	2024-07-20 14:40:26 -05:00
Faith Ekstrand	d96fe18547	nak: Plumb tessellation parameters through ShaderStageInfo Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30283>	2024-07-20 14:40:26 -05:00
Faith Ekstrand	c4c9bfdebd	nak: Drop the nvfuzz binary A much better version now exists as part of https://gitlab.freedesktop.org/gfxstrand/nv-shader-tools Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30283>	2024-07-20 14:40:23 -05:00
Yiwei Zhang	bb7632128b	venus: only request ring thread prio for TLS ring Mainly to leave main ring thread prio to default. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30279>	2024-07-20 17:27:24 +00:00
Faith Ekstrand	cc82f80dcb	nak/nir: Split 64-bit conversions pre-Volta Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30281>	2024-07-20 16:53:38 +00:00
Faith Ekstrand	574239c948	nak/sm50: Implement OpCCtl Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30281>	2024-07-20 16:53:38 +00:00
Faith Ekstrand	6fd427bf6e	nak: Lower fsqrt in NIR on Maxwell A and earlier Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30281>	2024-07-20 16:53:38 +00:00
Faith Ekstrand	9b4a005bf8	nak/sm50: Implement OpPixLd Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30281>	2024-07-20 16:53:38 +00:00
Faith Ekstrand	69be07b191	nak/sm50: Various encoding fixes Most of this is code clean-ups and unifications. A few things were missing proper legalization. Some of it was just plain wrong. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30281>	2024-07-20 16:53:38 +00:00
Faith Ekstrand	549fada593	nak/sm50: Improve encoding of OpFFma OpFFma has another form that allows a cbuf in src2 which we weren't taking into account. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30281>	2024-07-20 16:53:38 +00:00

1 2 3 4 5 ...

192354 Commits