third_party_mesa3d

Author	SHA1	Message	Date
Eric Engestrom	fa0dcaaae0	docs/install: drop autotools references 19.3 will be the 3rd release without autotools, people know it's gone by now. Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2019-09-30 19:45:15 +01:00
Maya Rashish	c0330461c9	meson: Test for -Wl,--build-id=sha1 instead of hard-coding OS list. Helps Solaris ld builds. Reviewed-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Signed-off-by: Maya Rashish <coypu@sdf.org>	2019-09-30 18:38:14 +00:00
Dylan Baker	4913ad9a37	docs: remove stray newline Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-09-30 18:27:52 +00:00
Dylan Baker	bc2d73c36b	docs: use https for mesonbuild.com Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-09-30 18:27:52 +00:00
Dylan Baker	5d11a828e1	docs: update install docs for meson Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-09-30 18:27:52 +00:00
Marek Olšák	a1545af079	ac/nir: fix GLSL imageSamples() Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-09-30 14:21:42 -04:00
Marek Olšák	0cc233e3dc	ac: add ac_build_image_get_sample_count from radeonsi Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-09-30 14:21:42 -04:00
Marek Olšák	39e638c14e	ac/surface: don't allocate FMASK if there is no graphics Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-09-30 14:21:42 -04:00
Marek Olšák	f704fb7f0b	tgsi_to_nir: handle PIPE_FORMAT_NONE in image opcodes radeonsi doesn't use the format and internal shaders don't set it. Reviewed-By: Timur Kristóf <timur.kristof@gmail.com>	2019-09-30 14:20:48 -04:00
Dylan Baker	3b265f61f5	meson: gallium media state trackers require libdrm with x11 v2: - update copyright year in all changed files - rebase on master Cc: 19.1 19.2 <mesa-stable@lists.freedesktop.org> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-09-30 18:06:56 +00:00
Kenneth Graunke	a0a93763fb	iris: Disable CCS_E for 32-bit floating point textures. A while back, Michael Larabel noticed that Paraview's Wavelet Volume case runs significantly slower on iris than i965. It turns out this is because we enable CCS_E for 32-bit floating point formats, while i965 disables it, with an oblique comment saying that we benchmarked it (on what exactly?) and determined that it was a loss. Paraview uses both R32_FLOAT and R32G32B32A32_FLOAT, and I observed large framerate drops when enabling CCS_E for either format. However, several other benchmarks (Aztec Ruins, many Synmark cases) use 16-bit floating point formats, with no apparent ill effects. So, disable compression for 32-bit float formats for now, but leave it enabled for 16-bit float formats as they seem to be working fine. Improves performance in Paraview's Wavelet Volume test by 62% on a Skylake GT4e. Fixes: `3cfc6a207b` ("iris: Fill out res->aux.possible_usages")	2019-09-30 10:44:52 -07:00
Marek Olšák	4a0d2e2880	ac: reorder and print all radeon_info fields Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-30 13:36:21 -04:00
Marek Olšák	e8b1538587	ac: set the number of SDPs same as the number of TCCs Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-30 13:36:21 -04:00
Marek Olšák	b7c2f7c5a6	ac: fix num_good_cu_per_sh for harvested chips Cc: 19.2 <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-30 13:36:20 -04:00
Marek Olšák	235ebe9163	radeonsi/gfx10: fix corruption for chips with harvested TCCs Cc: 19.2 <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-30 13:36:20 -04:00
Marek Olšák	8cbe83445b	ac: add radeon_info::tcc_harvested Cc: 19.2 <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-30 13:36:20 -04:00
Marek Olšák	7d97013294	ac: fix incorrect vram_size reported by the kernel Cc: 19.2 <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-30 13:36:20 -04:00
Marek Olšák	3c0938bece	radeonsi/gfx10: fix L2 cache rinse programming Cc: 19.2 <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-30 13:36:20 -04:00
Eric Engestrom	0efc253f02	etnaviv: fix bitmask typo Fixes: `d92689c46f` ("etnaviv: nir: add native integers (HALTI2+)") Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Jonathan Marek <jonathan@marek.ca>	2019-09-30 17:54:33 +01:00
Adam Jackson	855dc17fcf	glx: Log the filename of the drm device if we fail to open it Helps point the user to the specific device that's having issues, since you're increasingly likely to have more than one. Gitlab: https://gitlab.freedesktop.org/mesa/mesa/issues/107 Reviewed-by: Eric Anholt <eric@anholt.net>	2019-09-30 15:30:16 +00:00
pal1000	eebe091d29	scons/windows: Enable compute shaders when possible. Tests done with llvm-config indicate that there are only 2 libraries in irreader and not in engine, LLVMAsmParser and LLVMIRReader and both of them are part of coroutines so I replaced irreader with coroutines and added libraries unique to coroutines. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2019-09-30 15:49:46 +01:00
Alyssa Rosenzweig	7be00b2a06	pan/midgard: Allow scheduling conditions with constants Now that we have constant adjustment logic abstracted, we can do this safely. Along with the csel inversion patch, this allows many more common csel ops to inline their condition in the bundle. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	c20063aa4a	pan/midgard: Add csel invert optimization Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	f0f4b39548	pan/midgard: Add mir_flip helper Useful for various operations on both commutative and anticommutative ops. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	10037ce523	pan/midgard: Tightly pack 32-bit constants If we can reuse constant slots from other instructions, we would like to do so to include more instructions per bundle. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	a3ca283bc1	pan/midgard: Allow writeout to see into the future If an instruction could be scheduled to vmul to satisfy the writeout conditions, let's do that and save an instruction+cycle per fragment shader. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	12a70ccd9e	pan/midgard: Allow 6 instructions per bundle We never had a scheduler good enough to hit this case before! :) Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	34ff50cadd	pan/midgard: Only one conditional per bundle allowed There's no r32 to save ya after you use up r31 :) Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	2715bd02ee	pan/midgard: Schedule to smul/sadd Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	57bac68fff	pan/midgard: Extend choose_instruction for scalar units Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	e9edae3ecb	pan/midgard: Don't double check SCALAR units Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	d3b3daa9d3	pan/midgard: Use new scheduler We still emit in-order but we switch to using the bundles created from the new scheduler, which will allow greater flexibility and room for out-of-order optimization. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	1409af9fc7	pan/midgard: Add distance metric to choose_instruction We require chosen instructions to be "close", to avoid ballooning register pressure. This is a kludge that will go away once we have proper liveness tracking in the scheduler, but for now it prevents a lot of needless spilling. v2: Lower threshold to 6 (from 8). Schedule is hurt, but a few shaders that spilled excessively are fixed. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Derp	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	e9571b53e1	pan/midgard: Add mir_choose_alu helper Based on a given unit. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	8462e82467	pan/midgard: Implement load/store pairing We can bundle two load/store together. This eliminates the need for explicit load/store pairing in a prepass, as well. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	7cf4932410	pan/midgard: Extend csel_swizzle to branches Conditions for branches don't have a swizzle explicitly in the emitted binary, but they do implicitly get swizzled in whatever instruction wrote r31, so we need to handle that. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	c9ce5a92a0	pan/midgard: Add helpers for scheduling conditionals Conditional instructions (csel and conditional branches) require their condition to be written to a special condition pipeline register (r31.w for scalar, r31.xyzw for vector). However, pipeline registers are live only for the duration of a single bundle. As such, the logic to schedule conditionals correct is surprisingly complex. Essentially, we see if we could stuff the conditional within the same bundle as the csel/branch without breaking anything; if we can, we do that. If we can't, we add a dummy move to make room. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	6f92288e85	pan/midgard: Implement predicate->unit This allows ALUs to select for each unit of the bundle separately. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	5a9a48b81a	pan/midgard: Add predicate->exclude A bit of a kludge but allows setting an implicit dependency of synthetic conditional moves on the actual condition, fixing code generated like: vmul.feq r0, .. sadd.imov r31, .., r0 vadd.fcsel [...] The imov runs simultaneous with feq so it gets garbage results, but it's too late to add an actual dependency practically speaking, since the new synthetic imov doesn't have a node associated. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	6284f3ec25	pan/midgard: Add constant intersection filters In the future, we will want to keep track of which components of constants of various sizes correspond to which parts of the bundle constants, like in the old scheduler. For now, let's just stub it out for a simple rule of one instruction with embedded constants per bundle. We can eventually do better, of course. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	941bdd2088	pan/midgard: Remove csel constant unit force Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	da18525b6f	pan/midgard: Add mir_schedule_texture/ldst/alu helpers We don't actually do any scheduling here yet, but add per-tag helpers to consume an instruction, print it, pop it off the worklist. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	72a03bcafa	pan/midgard: Add mir_choose_bundle helper It's not always obvious what the optimal bundle type should be. Let's break out the logic to decide. Currently set for purely in-order operation. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	b5396369d2	pan/midgard: Add mir_update_worklist helper After we've chosen an instruction, popped it off, and processed it, it's time to update the worklist, removing that instruction from the dependency graph to allow its dependents to be put onto the worklist. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	826fd7308b	pan/midgard: Add mir_choose_instruction stub In the future, this routine will implement the core scheduling logic to decide which instruction out of the worklist will be scheduled next, in a way that minimizes cycle count and register pressure. In the present, we are more interested in replicating in-order scheduling with the much-more-powerful out-of-order model. So rather than discriminating by a register pressure estimate, we simply choose the latest possible instruction in the worklist. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	f48038b588	pan/midgard: Initialize worklist This flows naturally from the dependency graph Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	a3b46c0db6	pan/midgard: Calculate dependency graph Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	adda411263	pan/midgard: Add flatten_mir helper We would like to flatten a linked list of midgard_instructions into an array of midgard_instruction pointers on the heap. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	0ecfcbf462	pan/midgard: Squeeze indices before scheduling This allows node_count to be correct while scheduling. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	ad05e8a52c	pan/midgard: Fix component count handling for ldst It's not based on the writemask and it can't be inferred; it's just intrinsic to the op itself. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00

1 2 3 4 5 ...

115813 Commits