Yonggang Luo
2deee5ddb3
ci/vs2019: Clear CI_COMMIT_MESSAGE and CI_COMMIT_DESCRIPTION for please meson
...
Fixes https://gitlab.freedesktop.org/lygstate/mesa/-/jobs/23444155/raw
https://gist.github.com/lygstate/201dd9d2db3d411668f7bbdc17daf74a
Signed-off-by: Yonggang Luo <luoyonggang@gmail.com >
Acked-by: Jesse Natalie <jenatali@microsoft.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16825 >
2022-06-02 18:54:50 +00:00
Renato Pereyra
c642ebf8ab
venus: Add driconf option to enable implicit fencing
...
Signed-off-by: Renato Pereyra <renatopereyra@google.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16735 >
2022-06-02 18:05:50 +00:00
Mike Blumenkrantz
87a53127e9
mesa: throw a log warning any time a fallback texture is used
...
this generally means an app is broken in some way, so throw a log
message to be helpful
Reviewed-by: Marek Olšák <marek.olsak@amd.com >
Reviewed-by: Adam Jackson <ajax@redhat.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16821 >
2022-06-02 17:31:16 +00:00
Alyssa Rosenzweig
ad5c84999b
pan/bi: Rework Valhall register alignment
...
Because we lower SPLIT and COLLECT before RA, we need to consider offsets when
determining the dimensions of vectors, in order to align properly. Lowering
COLLECT post-RA would avoid this special case.
Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16780 >
2022-06-02 17:13:16 +00:00
Alyssa Rosenzweig
0770e7a90c
pan/bi: Align 64-bit register sources
...
Similar idea to aligning staging register sources.
Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16780 >
2022-06-02 17:13:16 +00:00
Alyssa Rosenzweig
8553dd97ad
pan/bi: Allow vec6 for collects
...
Hit for some Valhall texturing instructions.
Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16780 >
2022-06-02 17:13:16 +00:00
Icecream95
1bfff407b9
pan/bi: Use nodearrays for linear constraints
...
Speeds up compiling shaders/skia/781.shader_test in shader-db by 8x
(Icecream95).
...At least it did before I extended to support register allocation of vec8. On
Valhall, texture instructions require up to 8 consecutive registers. To handle
this, provide for vec8 register allocation. Liveness was already (accidentally?)
vec8. The increased memory requirement is acceptable given that the interference
matrix is now stored sparsely (Alyssa).
Icecream95 reports the vec8 changes hurt RA performance by about 1% on average.
I consider this acceptable for now.
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16780 >
2022-06-02 17:13:16 +00:00
Icecream95
c70daa74f0
pan/bi: Add nodearray datastructure
...
This is an array which can either be sparse or dense, and was designed
to be used to track liveness and interference information.
Either a sparse array with sorted indices or dense array is used.
Other data structures were tried, such as red-black trees or hash
tables, but they were slower. When used for storing constraints, the
indices do not have to be sorted as duplicating elements is okay, but
the speedup from that was not enough to justify the extra complexity.
v2: Add a comment about how to potentially speed it up. But it seems
fast enough even without this change.
v3: Use a custom struct rather than relying on util_dynarray.
v4: Split out functions only used for liveness analysis, rather than the simpler
data structure needed for the register interference matrix. If we need to
optimize liveness, that can follow on after. Also make it for vec8 (Alyssa).
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16780 >
2022-06-02 17:13:16 +00:00
Icecream95
c24b78cceb
pan/bi: Reverse linear constraint bits
...
This will make it simpler to implement parallel RA where multiple
possible registers for a node are tested at once.
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16780 >
2022-06-02 17:13:16 +00:00
Sil Vilerino
65b3decd5b
d3d12: Align output buffer offset access to D3D12_FEATURE_DATA_VIDEO_ENCODER_RESOURCE_REQUIREMENTS.CompressedBitstreamBufferAccessAlignment
...
Reviewed-by: Jesse Natalie <jenatali@microsoft.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16644 >
2022-06-02 17:00:44 +00:00
Connor Abbott
bd68559d36
tu: Enable shaderTessellationAndGeometryPointSize
...
This already works and AFAICT there was no reason to disable it other
than that the blob does.
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16830 >
2022-06-02 16:32:57 +00:00
Danylo Piliaiev
eba8671e40
tu: Fix D32S8 resolves into D32 or S8 in sysmem mode
...
Fixes CTS tests in sysmem mode:
dEQP-VK.renderpass2.depth_stencil_resolve.image_2d_32_32.samples_2.d24_unorm_s8_uint.compatibility_depth_zero_stencil_zero_testing_depth
dEQP-VK.renderpass2.depth_stencil_resolve.image_2d_32_32.samples_2.d24_unorm_s8_uint_separate_layouts.compatibility_depth_zero_stencil_zero_testing_depth
dEQP-VK.renderpass2.depth_stencil_resolve.image_2d_32_32.samples_2.d24_unorm_s8_uint_separate_layouts.compatibility_depth_zero_stencil_zero_testing_stencil
dEQP-VK.renderpass2.depth_stencil_resolve.image_2d_32_32.samples_2.d32_sfloat_s8_uint.compatibility_depth_zero_stencil_zero_testing_depth
dEQP-VK.renderpass2.depth_stencil_resolve.image_2d_32_32.samples_2.d32_sfloat_s8_uint_separate_layouts.compatibility_depth_zero_stencil_zero_testing_depth
dEQP-VK.renderpass2.depth_stencil_resolve.image_2d_32_32.samples_2.d32_sfloat_s8_uint_separate_layouts.compatibility_depth_zero_stencil_zero_testing_stencil
Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16834 >
2022-06-02 16:01:46 +00:00
Samuel Pitoiset
e680fe224a
radv/ci: update list of failures for Pitcairn
...
Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com >
Acked-by: Martin Roukala (né Peres) <martin.roukala@mupuf.org >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16836 >
2022-06-02 17:03:59 +02:00
Rhys Perry
be4c1a5f52
radv,aco: disable shader cache when ACO debug options are used
...
Signed-off-by: Rhys Perry <pendingchaos02@gmail.com >
Reviewed-by: Timur Kristóf <timur.kristof@gmail.com >
Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14594 >
2022-06-02 14:45:55 +00:00
Jason Ekstrand
d8df87056c
nir: xfb_buffer_info::stride is in bytes
...
For the NIR XFB gathering as well as all the Vulkan drivers, buffer
strides in nir_xfb_info are in bytes. When Marek started using
nir_xfb_info for GLSL on radeonsi, he copied directly from the GLSL
struct which has strides in dwords. This inconsistency didn't show up
until I went through and started us using the NIR passes for GL drivers
directly without going through the GLSL structs. We could change the
nir_xfb_buffer_info field to be in dwords to be consistent with
shader_info but that would mean changing all the Vulkan drivers but, for
now, it's easier to always use bytes in nir_xfb_info.
Fixes: 2a22885a45
("st,nir: Use nir_shader::xfb_info in nir_lower_io_passes")
Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com >
Reviewed-by: Marek Olšák <marek.olsak@amd.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16819 >
2022-06-02 14:06:31 +00:00
Jason Ekstrand
7c876a6b2f
nir/glsl: Use rzalloc for nir_xfb_info
...
A lot of the fields get fully overwritten but outputs/buffers_written
are both bitfields that we set one bit at a time.
Fixes: 7c5dc0b11a
("glsl/nir: Populate nir_shader::xfb_info after linking varyings")
Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com >
Reviewed-by: Marek Olšák <marek.olsak@amd.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16819 >
2022-06-02 14:06:31 +00:00
Mike Blumenkrantz
fc5a3e1e37
lavapipe: fix dynamic patchControlPoints
...
the dynamic state value is independent of whether tesselation is used
cc: mesa-stable
Reviewed-by: Dave Airlie <airlied@redhat.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16824 >
2022-06-02 13:51:17 +00:00
Erik Faye-Lund
df2dd474c7
include: drop c11_compat.h
...
We now require C11, and C++ supports static_assert just fine, which is
the only thing this header ever added support for. So let's get rid of
this needless header.
Reviewed-by: Jesse Natalie <jenatali@microsoft.com >
Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com >
Reviewed-by: Eric Engestrom <eric@engestrom.ch >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16812 >
2022-06-02 13:09:16 +00:00
Erik Faye-Lund
18246ed06a
include: drop c99_math.h
...
Since we now depend on C11, we know that we have support for the C99
math functionality. So let's drop the c99_math.h compatibility wrapper,
and just include <math.h> directly.
Reviewed-by: Jesse Natalie <jenatali@microsoft.com >
Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com >
Reviewed-by: Eric Engestrom <eric@engestrom.ch >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16812 >
2022-06-02 13:09:16 +00:00
Erik Faye-Lund
6ed10a99bd
vulkan/device_select: remove needless c99_compat.h includes
...
Reviewed-by: Jesse Natalie <jenatali@microsoft.com >
Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com >
Reviewed-by: Eric Engestrom <eric@engestrom.ch >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16812 >
2022-06-02 13:09:16 +00:00
Erik Faye-Lund
df4fe7c4a2
intel/isl: remove needless c99_compat.h includes
...
Reviewed-by: Jesse Natalie <jenatali@microsoft.com >
Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com >
Reviewed-by: Eric Engestrom <eric@engestrom.ch >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16812 >
2022-06-02 13:09:16 +00:00
Erik Faye-Lund
9633b42672
include: remove needless c99_compat.h includes
...
Reviewed-by: Jesse Natalie <jenatali@microsoft.com >
Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com >
Reviewed-by: Eric Engestrom <eric@engestrom.ch >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16812 >
2022-06-02 13:09:16 +00:00
Erik Faye-Lund
beaa26bd75
gallium: remove needless c99_compat.h includes
...
Reviewed-by: Jesse Natalie <jenatali@microsoft.com >
Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com >
Reviewed-by: Eric Engestrom <eric@engestrom.ch >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16812 >
2022-06-02 13:09:16 +00:00
Erik Faye-Lund
75f3373b34
mapi: remove needless c99_compat.h includes
...
Reviewed-by: Jesse Natalie <jenatali@microsoft.com >
Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com >
Reviewed-by: Eric Engestrom <eric@engestrom.ch >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16812 >
2022-06-02 13:09:16 +00:00
Erik Faye-Lund
afce7ddacb
gallium/util: remove needless c99_compat.h includes
...
Reviewed-by: Jesse Natalie <jenatali@microsoft.com >
Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com >
Reviewed-by: Eric Engestrom <eric@engestrom.ch >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16812 >
2022-06-02 13:09:16 +00:00
Erik Faye-Lund
1613aa7341
egl: remove needless c99_compat.h includes
...
Reviewed-by: Jesse Natalie <jenatali@microsoft.com >
Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com >
Reviewed-by: Eric Engestrom <eric@engestrom.ch >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16812 >
2022-06-02 13:09:16 +00:00
Erik Faye-Lund
9f717b5f23
util: remove needless c99_compat.h includes
...
Reviewed-by: Jesse Natalie <jenatali@microsoft.com >
Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com >
Reviewed-by: Eric Engestrom <eric@engestrom.ch >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16812 >
2022-06-02 13:09:16 +00:00
Erik Faye-Lund
9ec514ded7
util: add missing c99_compat.h includes
...
These headers use the "restrict" keyword, so they need to include
c99_compat.h in case they get included from C++.
Right now, we include c99_compat.h in many needless places, which saves
us. But we're about to stop doing that.
Reviewed-by: Jesse Natalie <jenatali@microsoft.com >
Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com >
Reviewed-by: Eric Engestrom <eric@engestrom.ch >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16812 >
2022-06-02 13:09:15 +00:00
Erik Faye-Lund
806272ec97
c99_compat.h: remove inline and __func__ shims
...
We're requiring C11 now, and both "inline" and __func__ is supported
in C++ code as well. So no need for these compatibility-shims anymore.
Reviewed-by: Jesse Natalie <jenatali@microsoft.com >
Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com >
Reviewed-by: Eric Engestrom <eric@engestrom.ch >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16812 >
2022-06-02 13:09:15 +00:00
Erik Faye-Lund
8575d0e84b
meson: modernize win_flex stdint.h logic
...
Since this logic was written, we've moved to require C11, so this can
now be simplified. First of all, we no longer need to set
__STDC_VERSION__ for C code at all, because the issue that MSVC doesn't
set __STDC_VERSION__ for C99 is longer a concern. Second, we can make
the C++ check unconditional.
Reviewed-by: Jesse Natalie <jenatali@microsoft.com >
Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com >
Reviewed-by: Eric Engestrom <eric@engestrom.ch >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16812 >
2022-06-02 13:09:15 +00:00
Erik Faye-Lund
a8605db504
intel: remove stale makefile
...
When this landed, the Autotools build system was already removed. Why
was this file added in the first place? Probably a rebase-mistake...
Fixes: 134e750e16
("i965: extract performance query metrics")
Reviewed-by: Adam Jackson <ajax@redhat.com >
Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16790 >
2022-06-02 09:13:23 +00:00
Erik Faye-Lund
9bcd538643
freedreno: remove stale makefile
...
We don't use this makefile, so there's no point in keeping it in the
repo. It's also a generated one, so it contains a lot of... mess.
Fixes: 536f43cb96
("freedreno: slurp in afuc")
Reviewed-by: Adam Jackson <ajax@redhat.com >
Reviewed-by: Rob Clark <robdclark@chromium.org >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16790 >
2022-06-02 09:13:23 +00:00
Erik Faye-Lund
69d55f42b6
radeonsi: port amdgcn_glslc build to meson
...
Seems nice to reduce the number of old-fashioned build systems we have
in-tree.
Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16789 >
2022-06-02 08:54:08 +00:00
Martin Roukala (né Peres)
72cf18f54a
radv/ci: try to guess failure expectations for Fiji/Raven/Navi14/Navi22
...
The guesses are based on other gpus from the same gfx version. We will
need actual runs on the GPUs to confirm this.
Signed-off-by: Martin Roukala (né Peres) <martin.roukala@mupuf.org >
Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16810 >
2022-06-02 08:29:31 +00:00
Martin Roukala (né Peres)
205744b42d
radv/ci: remove an expected fail from all families
...
dEQP-VK.glsl.builtin.precision_double.determinant.compute.mat3 was
failing because of a CTS bug, which got fixed in the latest update for
all our CI machines.
This commit assumes this got fixed for all families, even the ones we
did not try to run on.
Fixes: 836ce97f5e
(ci: bump VK-GL-CTS to 1.3.2.0)
Signed-off-by: Martin Roukala (né Peres) <martin.roukala@mupuf.org >
Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16810 >
2022-06-02 08:29:31 +00:00
Martin Roukala (né Peres)
91f93c97dd
radv/ci: update the vkcts expectations for the CI machines
...
Fixes: 836ce97f5e
(ci: bump VK-GL-CTS to 1.3.2.0)
Signed-off-by: Martin Roukala (né Peres) <martin.roukala@mupuf.org >
Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16810 >
2022-06-02 08:29:31 +00:00
Martin Roukala (né Peres)
e58c455ca9
radv/ci: bump the timeout of vangogh/renoir
...
The execution time of vkcts has drastically changed with the latest
update, especially for APUs. This commit bumps the timeout to give
them a chance to finish.
Fixes: 836ce97f5e
(ci: bump VK-GL-CTS to 1.3.2.0)
Signed-off-by: Martin Roukala (né Peres) <martin.roukala@mupuf.org >
Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16810 >
2022-06-02 08:29:31 +00:00
Samuel Pitoiset
5af1409c1b
radv: fix lowering GS intrinsics if NGG is disabled per pipeline
...
If NGG is disabled per pipeline for extreme (or suboptimal) geometry
or with transform feedback, make sure to not lower GS intrinsics that
are only needed for NGG GS. This means we have to lower GS intrinsics
later in the compilation process to effectively know if the pipeline
uses NGG.
fossils-db (Navi21):
Totals from 8 (0.01% of 134913) affected shaders:
VGPRs: 512 -> 520 (+1.56%)
CodeSize: 58180 -> 65080 (+11.86%); split: -0.04%, +11.90%
MaxWaves: 128 -> 126 (-1.56%)
Instrs: 10525 -> 11779 (+11.91%); split: -0.05%, +11.96%
Latency: 62941 -> 49428 (-21.47%); split: -21.59%, +0.12%
InvThroughput: 16121 -> 12950 (-19.67%); split: -19.96%, +0.29%
VClause: 122 -> 123 (+0.82%); split: -0.82%, +1.64%
SClause: 130 -> 143 (+10.00%)
Copies: 566 -> 596 (+5.30%); split: -0.35%, +5.65%
Branches: 200 -> 208 (+4.00%)
PreSGPRs: 424 -> 422 (-0.47%)
PreVGPRs: 430 -> 424 (-1.40%); split: -1.63%, +0.23%
Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com >
Reviewed-by: Timur Kristóf <timur.kristof@gmail.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15885 >
2022-06-02 09:42:54 +02:00
Dylan Baker
a5549ebe5b
docs: update calendar and link releases notes for 22.0.5
...
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16828 >
2022-06-01 16:55:58 -07:00
Dylan Baker
7bd8efb74f
docs: update sha256 for 22.0.5
...
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16828 >
2022-06-01 16:55:22 -07:00
Dylan Baker
a6e546ab51
docs: add release notes for 22.0.5
...
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16828 >
2022-06-01 16:55:21 -07:00
Dylan Baker
67280a4019
docs: update calendar and link releases notes for 22.1.1
...
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16828 >
2022-06-01 16:55:17 -07:00
Dylan Baker
27a1c477c9
docs: Add sh256sums for 22.1.1
...
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16828 >
2022-06-01 16:55:11 -07:00
Dylan Baker
3e4de308a5
docs: add release notes for 22.1.1
...
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16828 >
2022-06-01 16:55:09 -07:00
Jesse Natalie
1ef329b825
wgl, d3d12: Handle front buffer writes for double-buffered MSAA surfaces backed by swapchains
...
We don't need to go as far as the fake front thing when MSAA is being used, because the
swapchain (single-sampled) is already decoupled from the app render buffers. But we do
need to direct the frontbuffer flush to the single-sampled back buffer, and then present
the back buffer. We also need to swap the buffers when we do this, so the next blit
targets the former front buffer.
Reviewed-by: Louis-Francis Ratté-Boulianne <lfrb@collabora.com >
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16227 >
2022-06-01 16:18:12 -07:00
Emma Anholt
49dc60efa1
freedreno/ir3: Fold 16-bit conversions into image load/store src/dsts.
...
Shaves 5 instructions off of one manhattan31 shader.
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16616 >
2022-06-01 22:19:44 +00:00
Emma Anholt
f005a2d786
freedreno/ir3: Fix validation of half-precision image store values.
...
ce1a381e57
("turnip: enable VK_KHR_16bit_storage on A650") determined
that the type of the instr decided the type of the value being stored in
the ".b" case. But it would be surprising if image stores had the type
determine the coordinates' precision instead of the value's, and once we
turned on image instruction precision lowering we ran into asserts.
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16616 >
2022-06-01 22:19:44 +00:00
Matt Turner
003327dd95
freedreno/ir3: Pass 16-bit sampler coordinates when possible.
...
shader-db highlights from Rob's android shaders:
total instructions in shared programs: 769641 -> 767536 (-0.27%)
instructions in affected programs: 151139 -> 149034 (-1.39%)
total last-baryf in shared programs: 55908 -> 55607 (-0.54%)
last-baryf in affected programs: 35219 -> 34918 (-0.85%)
total sstall in shared programs: 67074 -> 65767 (-1.95%)
total full in shared programs: 36115 -> 36080 (-0.10%)
full in affected programs: 203 -> 168 (-17.24%)
sstall in affected programs: 9510 -> 8203 (-13.74%)
total (ss) in shared programs: 14380 -> 14239 (-0.98%)
(ss) in affected programs: 2965 -> 2824 (-4.76%)
total systall in shared programs: 92425 -> 91522 (-0.98%)
systall in affected programs: 13146 -> 12243 (-6.87%)
total (sy) in shared programs: 4330 -> 4314 (-0.37%)
(sy) in affected programs: 167 -> 151 (-9.58%)
total waves in shared programs: 71580 -> 71584 (<.01%)
waves in affected programs: 12 -> 16 (33.33%)
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16616 >
2022-06-01 22:19:44 +00:00
Emma Anholt
ac24c49c37
freedreno/ir3: Make the types of tex coord padding match.
...
Required for lowering of coords to be 16-bit.
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16616 >
2022-06-01 22:19:44 +00:00
Matt Turner
edb0904775
freedreno/ir3: Move the texture array coord fixup to nir
...
We're going to optimize sampler coordinates to FP16, so we'll need to
add the appropriately typed 0.5. Move this to NIR where that information
is readily available.
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16616 >
2022-06-01 22:19:44 +00:00