c3732ac0d0ffe487fbf72a2e61ee201f8d5c356f

We had an optimization in place to skip a unifa write if the address happens to be right after the last ldunifa read address, but we can take this further and update the unifa address by emitting ldunifa instructions if needed to skip a unifa write that is close enough. This is because a unifa write involves 4 cycles: 1 for the write and 3 delay slots before we can emit the first ldunifa. So if we have code like this: unifa addr + 0 ldunifa.r0 unifa addr + 12 ldunifa.r1 In practice we end up with QPU like this: unifa addr + 0 nop nop nop ldunifa.r0 unifa addr + 12 nop nop nop ldunifa.r1 And with this patch we get: unifa addr + 0 nop nop nop ldunifa.r0 <--- reads offset 0 ldunifa.- <--- reads offset 4 ldunifa.- <--- reads offset 8 ldunifa.r1 <--- reads offset 12 Of course, QPU scheduling might find ways to fill the NOPs to some extent and remove some of the gains, but generally speaking, this is still usually a win. Going by shader-db results, allowing the next unifa address to be up to 12 bytes after the address resulting from the last ldunifa read shows the best results: total instructions in shared programs: 13817048 -> 13812202 (-0.04%) instructions in affected programs: 602701 -> 597855 (-0.80%) helped: 1750 HURT: 760 Instructions are helped. total uniforms in shared programs: 3795485 -> 3793200 (-0.06%) uniforms in affected programs: 43930 -> 41645 (-5.20%) helped: 898 HURT: 0 Uniforms are helped. total max-temps in shared programs: 2326612 -> 2326621 (<.01%) max-temps in affected programs: 651 -> 660 (1.38%) helped: 10 HURT: 21 Inconclusive result (value mean confidence interval includes 0). total sfu-stalls in shared programs: 30942 -> 30906 (-0.12%) sfu-stalls in affected programs: 627 -> 591 (-5.74%) helped: 186 HURT: 158 Inconclusive result (value mean confidence interval includes 0). total inst-and-stalls in shared programs: 13847990 -> 13843108 (-0.04%) inst-and-stalls in affected programs: 601404 -> 596522 (-0.81%) helped: 1747 HURT: 757 Inst-and-stalls are helped. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9384>
`Mesa <https://mesa3d.org>`_ - The 3D Graphics Library ====================================================== Source ------ This repository lives at https://gitlab.freedesktop.org/mesa/mesa. Other repositories are likely forks, and code found there is not supported. Build & install --------------- You can find more information in our documentation (`docs/install.rst <https://mesa3d.org/install.html>`_), but the recommended way is to use Meson (`docs/meson.rst <https://mesa3d.org/meson.html>`_): .. code-block:: sh $ mkdir build $ cd build $ meson .. $ sudo ninja install Support ------- Many Mesa devs hang on IRC; if you're not sure which channel is appropriate, you should ask your question on `Freenode's #dri-devel <irc://chat.freenode.net#dri-devel>`_, someone will redirect you if necessary. Remember that not everyone is in the same timezone as you, so it might take a while before someone qualified sees your question. To figure out who you're talking to, or which nick to ping for your question, check out `Who's Who on IRC <https://dri.freedesktop.org/wiki/WhosWho/>`_. The next best option is to ask your question in an email to the mailing lists: `mesa-dev\@lists.freedesktop.org <https://lists.freedesktop.org/mailman/listinfo/mesa-dev>`_ Bug reports ----------- If you think something isn't working properly, please file a bug report (`docs/bugs.rst <https://mesa3d.org/bugs.html>`_). Contributing ------------ Contributions are welcome, and step-by-step instructions can be found in our documentation (`docs/submittingpatches.rst <https://mesa3d.org/submittingpatches.html>`_). Note that Mesa uses gitlab for patches submission, review and discussions.
Description
Languages
C
75.3%
C++
18.2%
Python
2.7%
Assembly
1.5%
Rust
1.2%
Other
0.9%