third_party_mesa3d

Author	SHA1	Message	Date
Ian Romanick	461a5c899c	glsl: Don't copy propagate from SSBO or shared variables either Since SSBOs can be written by other GPU threads, copy propagating a read can cause the value to magically change. SSBO reads are also very expensive, so doing it twice will be slower. Haswell, Broadwell, and Skylake had similar results. (Skylake shown) total instructions in shared programs: 14399120 -> 14399119 (<.01%) instructions in affected programs: 684 -> 683 (-0.15%) helped: 1 HURT: 0 total cycles in shared programs: 532978931 -> 532973113 (<.01%) cycles in affected programs: 530484 -> 524666 (-1.10%) helped: 1 HURT: 0 Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Cc: mesa-stable@lists.freedesktop.org Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106774	2018-06-14 11:26:33 -07:00
Lukas Rusak	1d92d6486a	meson: only build vl_winsys_dri.c when x11 platform is used This seems to have been missed in the move from autotools This fixes the following build issue: ../src/gallium/auxiliary/vl/vl_winsys_dri.c:34:10: fatal error: X11/Xlib-xcb.h: No such file or directory #include <X11/Xlib-xcb.h> ^~~~~~~~~~~~~~~~ Fixes: `b1b65397d0` ("meson: Build gallium auxiliary") Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-06-14 10:34:51 -07:00
Brian Paul	b9e6438adf	st/mesa: add missing switch cases in glsl_to_tgsi_visitor::visit() To silence compiler warning about unhandled switch cases. Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-06-14 11:29:51 -06:00
Bas Nieuwenhuizen	41dabdc475	radv: Fix output for sparse MRTs. We need to init the cb_shader_format correctly with the changed col_format, so this moves the col_format adjustment to before the cb_shader_mask gets generated. Fixes: `06d3c65098` "radv: fix a GPU hang when MRTs are sparse" Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106903 CC: 18.1 <mesa-stable@lists.freedesktop.org> Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-06-14 11:48:24 +02:00
Samuel Pitoiset	68dead112e	radv: update the ZRANGE_PRECISION value for the TC-compat bug On GFX8+, there is a bug that affects TC-compatible depth surfaces when the ZRange is not reset after LateZ kills pixels. The workaround is to always set DB_Z_INFO.ZRANGE_PRECISION to match the last fast clear value. Because the value is set to 1 by default, we only need to update it when clearing Z to 0.0. We also need to set the depth clear regs and to update ZRANGE_PRECISION when initializing a TC-compat depth image to 0. Original patch from James Legg. This fixes random CTS fails with dEQP-VK.renderpass.suballocation.formats.d32_sfloat_s8_uint.input.* Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=105396 CC: <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-06-14 11:38:29 +02:00
Samuel Iglesias Gonsálvez	183adc51f8	anv: reduce maxFragmentInputComponents If the application asks for the maximum number of fragment input components (128), use all of them plus some builtins that are passed in the VUE, then we exceed the maximum number of used VUE slots (32) and we break one assert that checks this limit. Also, with separate shader objects, we add CLIP_DIST0, CLIP_DIST1 builtins in brw_compute_vue_map() because we don't know if gl_ClipDistance is going to be read/write by an adjacent stage. Fixes VK-GL-CTS CL#2569. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-06-14 09:54:28 +02:00
Marek Olšák	6d671078a8	radeonsi/gfx9: fix si_get_buffer_from_descriptors for 48-bit pointers This fixes: GL45-CTS.pipeline_statistics_query_tests_ARB.functional_compute_shader_invocations Cc: 18.0 18.1 <mesa-stable@lists.freedesktop.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-06-13 22:00:12 -04:00
Marek Olšák	a4312742a5	radeonsi/gfx9: update & clean up a DPBB heuristic Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-06-13 22:00:43 -04:00
Marek Olšák	47b780be21	radeonsi/gfx9: set POPS_DRAIN_PS_ON_OVERLAP due to a hw bug This may not be needed yet, but let's set it now. Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-06-13 22:00:42 -04:00
Marek Olšák	a152ca70f2	radeonsi/gfx9: remove UINT_MAX array terminators in bin size tables Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-06-13 22:00:40 -04:00
Marek Olšák	cd0be6cdc8	radeonsi/gfx9: update bin sizes This is based on our docs (recently updated), not amdvlk. Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-06-13 22:00:39 -04:00
Marek Olšák	2f51081a93	radeonsi/gfx9: update primitive binning code for EQAA Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-06-13 22:00:37 -04:00
Marek Olšák	22e994bb75	radeonsi: assume that rasterizer state is non-NULL in draw_vbo Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-06-13 22:00:36 -04:00
Marek Olšák	f3b3ee6974	radeonsi: micro-optimize prim checking and fix guardband with lines+adjacency Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-06-13 22:00:34 -04:00
Marek Olšák	d6974feb90	radeonsi: move the guardband registers into a separate state atom They have a different frequency of updates and don't change when scissors change. I think this even fixes something in si_update_vs_viewport_state. Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-06-13 22:00:31 -04:00
Marek Olšák	68b1c669e7	radeonsi/gfx9: implement the scissor bug workaround without performance drop This might improve performance on Vega10 and Raven. Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-06-13 22:00:27 -04:00
Marek Olšák	73b0d10152	radeonsi: don't set VGT_LS_HS_CONFIG if it doesn't change Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-06-13 22:00:25 -04:00
Marek Olšák	28ee825e19	radeonsi: move VGT_GS_OUT_PRIM_TYPE into si_shader_gs same as amdvlk. Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-06-13 22:00:23 -04:00
Marek Olšák	99e0ba6868	radeonsi: record CLIPVERTEX output usage properly for compatibility profiles This was missed when adding CLIPVERTEX support into GS & tess. Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-06-13 22:00:20 -04:00
Marek Olšák	47a57a709d	radeonsi: fix FBFETCH with 2D MSAA arrays Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-06-13 22:00:17 -04:00
Marek Olšák	e5e57c3a5e	ac: handle undefined EQAA samples in ac_apply_fmask_to_sample RADV might wanna use this helper too. Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-06-13 22:00:12 -04:00
Marek Olšák	a2d4c8ff6d	radeonsi: return real memory usage instead of per-process usage Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-06-13 21:47:36 -04:00
Marek Olšák	95ecde42eb	ac/gpu_info: report real total memory sizes The change from MIN2 to MAX2 is intentional. Cc: 18.1 <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-06-13 21:47:36 -04:00
Dave Airlie	f11b664f48	docs: mark virgl GL 4.0 features as complete. virgl should now expose GL4.1 where it can.	2018-06-14 10:38:11 +10:00
Dave Airlie	7b6f2704eb	virgl: add ARB_tessellation_shader support. (v2) This should add all the pieces to enable tess shaders on virgl. v2: fixup transform to handle tess and strip out precise. set default for max patch varyings to work around issue when tess gets enabled from v1 caps but v2 caps aren't in place. (Elie) Reviewed-by: Elie Tournier <elie.tournier@collabora.com>	2018-06-14 10:36:31 +10:00
Dave Airlie	babd1d526b	glsl: allow standalone semicolons outside main() GLSL 4.60 officially added this but games and older CTS suites actually had shaders that did this, we may as well enable it everywhere. Adding stable because it appears apps in the wild do this. Acked-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Cc: <mesa-stable@lists.freedesktop.org>	2018-06-14 10:21:51 +10:00
Samuel Pitoiset	51e23d3419	radv: don't fast clear HTILE for 16-bit depth surfaces on GFX8 This causes rendering issues in Shadow Warrior 2 with DXVK. Cc: mesa-stable@lists.freedesktop.org Fixes: `ccc64f3133` ("radv: enable TC-compat HTILE for 16-bit depth surfaces on GFX8") Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106912 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-06-13 20:30:04 +02:00
Andrew Galante	baf16b2ea3	configure.ac: Test for __atomic_add_fetch in atomic checks Some platforms have 64-bit __atomic_load_n but not 64-bit __atomic_add_fetch, so test for both of them. Bug: https://bugs.gentoo.org/655616 Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-06-13 10:09:46 -07:00
Andrew Galante	9d547a7617	meson: Test for __atomic_add_fetch in atomic checks Some platforms have 64-bit __atomic_load_n but not 64-bit __atomic_add_fetch, so test for both of them. Bug: https://bugs.gentoo.org/655616 Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-06-13 10:09:46 -07:00
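
The two commits above make the build probe for both builtins. A minimal sketch of what such a probe might exercise is shown below; it is not the check from either commit. The names `probe_counter` and `probe_64bit_atomics` are hypothetical, and the only assumption is a GCC- or Clang-compatible compiler that provides the `__atomic_*` builtins. On a platform that lacks a native 64-bit `__atomic_add_fetch`, code like this would fail to link unless `-latomic` is added.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical 64-bit counter used only by this probe. */
static uint64_t probe_counter;

/* Exercise both builtins on a 64-bit operand. A build-system check would
 * compile and link something like this; here it also returns the observed
 * increment so the result can be inspected. */
static uint64_t
probe_64bit_atomics(void)
{
   uint64_t before = __atomic_load_n(&probe_counter, __ATOMIC_ACQUIRE);
   uint64_t after = __atomic_add_fetch(&probe_counter, (uint64_t)1,
                                       __ATOMIC_ACQ_REL);

   /* Only valid here because nothing else touches probe_counter. */
   assert(after == before + 1);
   return after - before;
}
```

Testing both builtins matters because, as the commit messages note, the presence of one does not imply the presence of the other at 64-bit width.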
Matt Turner	b29b5a82a1	meson: Fix -latomic check Commit `54ba73ef10` (configure.ac/meson.build: Fix -latomic test) fixed some checks for -latomic, and then commit `54bbe600ec` (configure.ac: rework -latomic check) further extended the fixes in configure.ac but not in Meson. This commit extends those fixes to the Meson tests. Fixes: `54bbe600ec` (configure.ac: rework -latomic check) Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-06-13 10:09:46 -07:00
Dylan Baker	9cc577761f	meson: Remove various completed todos v3: - Remove "won't do" todos, so only completed todo's are now removed. Signed-off-by: Dylan Baker <dylan.c.baker@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> (v2)	2018-06-13 10:07:03 -07:00
Dylan Baker	0ce3f3538b	meson: Make use of optional modules meson 0.43 gained support for optional modules, which clover would like to use. Since we require 0.44.1 now we can rely on them being available for clover. compile tested only. Signed-off-by: Dylan Baker <dylan.c.baker@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2018-06-13 10:06:58 -07:00
Dylan Baker	34bbb24ce7	meson: Add support for ppc assembly/optimizations v2: - Use -mpower8-vector in compiler test for altivec - rename altivec option to power8 - reword power8 option description to be more clear, originally I had made it a boolean, but replaced it with an auto option. Signed-off-by: Dylan Baker <dylan.c.baker@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2018-06-13 10:06:54 -07:00
Dylan Baker	e26af22143	meson: Add support for SPARC assembly This was blindly copied from autotools and tested by a helpful gentoo user. Signed-off-by: Dylan Baker <dylan.c.baker@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2018-06-13 10:06:25 -07:00
Dylan Baker	6eaa013685	meson: Set include dirs for asm v2: - split this from the next patch - Only include x86-64 and not x86 when building x86_64 Signed-off-by: Dylan Baker <dylan.c.baker@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2018-06-13 10:06:23 -07:00
Dylan Baker	65e447c5df	meson: move cc and cpp definitions to top of main meson.build This just makes using cc and cpp easier. v2: - Add this patch to fix altivec Signed-off-by: Dylan Baker <dylan.c.baker@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2018-06-13 10:06:16 -07:00
Jason Ekstrand	51376cd749	Revert "intel/compiler: Properly consider UBO loads that cross 32B boundaries." This reverts commit `b8fa847c2e`. This broke about 30k Vulkan CTS tests.	2018-06-13 09:23:55 -07:00
Kenneth Graunke	b8fa847c2e	intel/compiler: Properly consider UBO loads that cross 32B boundaries. The UBO push analysis pass incorrectly assumed that all values would fit within a 32B chunk, and only recorded a bit for the 32B chunk containing the starting offset. For example, if a UBO contained the following, tightly packed: vec4 a; // [0, 16) float b; // [16, 20) vec4 c; // [20, 36) then, c would start at offset 20 / 32 = 0 and end at 36 / 32 = 1, which means that we ought to record two 32B chunks in the bitfield. Similarly, dvec4s would suffer from the same problem. Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2018-06-13 02:07:58 -07:00
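
The chunk arithmetic in the commit message above can be sketched as follows. This is an illustration only, not the code from the UBO push analysis pass: `chunks_touched` is a hypothetical helper, and the 64-chunk limit is an assumption made so the mask fits in a `uint64_t`.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: return a bitmask with one bit set for every 32-byte
 * chunk overlapped by the byte range [offset, offset + size). */
static uint64_t
chunks_touched(unsigned offset, unsigned size)
{
   assert(size > 0);

   unsigned first = offset / 32;             /* chunk holding the first byte */
   unsigned last = (offset + size - 1) / 32; /* chunk holding the last byte */
   assert(last < 64);                        /* assumed limit for this sketch */

   uint64_t mask = 0;
   for (unsigned i = first; i <= last; i++)
      mask |= UINT64_C(1) << i;
   return mask;
}
```

For the `vec4 c` in the example, `chunks_touched(20, 16)` sets bits 0 and 1, whereas recording only the chunk of the starting offset would set bit 0 alone.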
Ross Burton	3c288da5ee	drivers/dri/i965: add missing #include brw_bufmgr.h uses time_t without include time.h, so the build fails under musl. Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-06-12 12:08:30 +01:00
Mauro Rossi	fb9ab2fbd3	anv/android: Use an address for each anv_image plane Fixes to avoid building error after change in image->planes[] structure, {bo,bo_offset} has to be replaced by address.{bo,offset} and update is needed also in the assert() for debug builds. external/mesa/src/intel/vulkan/anv_android.c:188:21: error: no member named 'bo' in 'struct anv_image::(anonymous at external/mesa/src/intel/vulkan/anv_private.h:2647:4)' image->planes[0].bo = bo; ~~~~~~~~~~~~~~~~ ^ 1 error generated. Fixes: `bf34ef16ac` ("anv: Use an address for each anv_image plane") Signed-off-by: Mauro Rossi <issor.oruam@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-06-12 11:17:43 +03:00
Mauro Rossi	a1220e7311	anv/android: Set the BO flags in bo_cache_import (v2) Changes to avoid building error: external/mesa/src/intel/vulkan/anv_android.c:131:72: error: too few arguments to function call, expected 5, have 4 result = anv_bo_cache_import(device, &device->bo_cache, dma_buf, &bo); ~~~~~~~~~~~~~~~~~~~ ^ 1 error generated. (v2) Set the correct bo_flags based on support of 48bit addresses and soft-pin Fixes: `b0d50247a7` ("anv/allocator: Set the BO flags in bo_cache_alloc/import") Fixes: `e7d0378bd9` ("anv: Soft-pin client-allocated memory") Signed-off-by: Mauro Rossi <issor.oruam@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-06-12 11:16:39 +03:00
Kenneth Graunke	0d5329d626	anv: Disable __gen_validate_value if NDEBUG is set. We were enabling undefined memory checking for genxml values based on Valgrind being installed at build time, even for release builds. This generates piles and piles of assembly whenever you touch genxml. With gcc 7.3.1 and -O3 and -march=native on a Kabylake with Valgrind installed at build time: text data bss dec hex filename 5978385 262884 13488 6254757 5f70a5 libvulkan_intel.so 3799377 262884 13488 4075749 3e30e5 libvulkan_intel.so That's a 36% reduction in text size. Fixes: `047ed02723` (vk/emit: Use valgrind to validate every packed field) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-06-11 14:55:32 -07:00
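
The 36% figure in the commit above follows from the two `text` sizes it quotes. A small sketch of the arithmetic, using a hypothetical helper name:

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: percentage by which a section shrank. */
static double
size_reduction_percent(uint64_t before, uint64_t after)
{
   assert(before > 0 && after <= before);
   return 100.0 * (double)(before - after) / (double)before;
}
```

With the quoted values, `size_reduction_percent(5978385, 3799377)` comes to roughly 36.4, which matches the rounded figure in the message.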
Eric Engestrom	06e8771dec	README: wording fix for previous commit Signed-off-by: Eric Engestrom <eric.engestrom@intel.com>	2018-06-11 18:34:58 +01:00
Eric Engestrom	d9f54dceca	README: add link to WhosWho for IRC nicks Signed-off-by: Eric Engestrom <eric.engestrom@intel.com>	2018-06-11 18:33:12 +01:00
Eric Engestrom	eadc068406	add project README Now that we're using GitLab, let's take advantage of the "landing page" README feature with some minimal information, mostly to point people to the right resources. Acked-by: Dylan Baker <dylan@pnwbakers.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Signed-off-by: Eric Engestrom <eric.engestrom@intel.com>	2018-06-11 18:02:35 +01:00
Eric Engestrom	e43c012433	i965: fix resource leak v2: intel_miptree_release() already takes care of the planes, no need to hand-code the loop (Lionel) Coverity ID: 1436909 Fixes: `3352f2d746` "i965: Create multiple miptrees for planar YUV images" Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Eric Engestrom <eric@engestrom.ch>	2018-06-11 14:54:23 +01:00
Rob Clark	55d1a77c29	freedreno/ir3: use pipe_image_view's cpp At least for PIPE_BUFFER, we could get the resource used as (for example) R32F imageBuffer. So using cpp=1 from the rsc is wrong. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-06-11 09:06:03 -04:00
Rob Clark	9bb90a3255	freedreno/ir3: fix image dimensions offset copy-pasta fail from how SSBO sizes are handled. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-06-11 09:06:03 -04:00
Rob Clark	e9fc9c16c9	freedreno/a5xx: correct image/ssbo offset Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-06-11 09:06:03 -04:00
Rob Clark	132e5b0b34	freedreno/ir3: use saml always if we have lod In some cases we get plain tex opcodes (but w/ a lod argument).. in this case always use the saml instruction. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-06-11 09:06:03 -04:00


102790 Commits