third_party_mesa3d

Author	SHA1	Message	Date
Brian Paul	c20b48c48e	gallivm: add a few const qualifiers Trivial.	2014-02-02 06:52:36 -07:00
Brian Paul	c6d94648cf	translate: reindent translate_sse.c Trivial.	2014-02-02 06:52:36 -07:00
Brian Paul	8689076925	mesa: make _mesa_get_proxy_target() static Wasn't used in any other file. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-02-02 06:47:32 -07:00
Brian Paul	9eaed3eb6e	mesa: remove unused _mesa_select_tex_object() function The _mesa_get_current_tex_object() function is now used everywhere that _mesa_select_tex_object() was formerly used. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-02-02 06:47:32 -07:00
Brian Paul	d5df28381e	swrast: use _mesa_get_current_tex_object() in swrastSetTexBuffer2() Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-02-02 06:47:32 -07:00
Brian Paul	ed72115891	st/mesa: use _mesa_get_current_tex_object() in st_context_teximage() Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-02-02 06:47:32 -07:00
Brian Paul	f09a1261ad	mesa: use _mesa_get_current_tex_object() in GetTexLevelParameteriv() And update a related comment. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-02-02 06:47:32 -07:00
Brian Paul	8b4f6fada2	radeon: use _mesa_get_current_tex_object() in radeonSetTexBuffer2() Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-02-02 06:47:32 -07:00
Brian Paul	76c33e383c	r200: use _mesa_get_current_tex_object() in r200SetTexBuffer2() Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-02-02 06:47:32 -07:00
Paul Seidler	1cdeeef6c4	build: move ARCH_LIBS definition outside of ASM definition _mesa_streaming_load_memcpy is also needed even if assembling is disabled Cc: "10.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-02-01 15:01:06 -08:00
Eric Anholt	c849ecc19a	dri: Add a useful error message if someone's packages missed libudev deps. Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-02-01 10:09:11 -08:00
Eric Anholt	63546b8e3d	dri: Also support the loader with libudev.so.0. As far as I know, this should be safe. If not, we have to decide whether to have variable lookup of the functions, or just drop support for .so.0 (which is a year and a half old it looks like) Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=74127 Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-02-01 10:08:36 -08:00
Rob Clark	dc00ec154b	freedreno: better manage our WFI's Updates to non-banked registers, CP_LOAD_STATE, etc, need a WFI if there is potentially pending rendering. Track this better, and add fd_wfi() calls everywhere that might potentially need CP_WAIT_FOR_IDLE. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-02-01 12:10:17 -05:00
Rob Clark	1fe9df8f29	freedreno/a3xx: add logicop Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-02-01 11:59:25 -05:00
Rob Clark	8d27be2633	freedreno/a3xx: handle frag z write Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-02-01 11:58:47 -05:00
Rob Clark	083b27a1b1	freedreno: resync generated headers Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-02-01 11:57:39 -05:00
Rob Clark	98c1111462	freedreno/a3xx: fix const confusion Gallium can leave const buffers bound above what is used by the current shader. Which can have a couple bad effects: 1) write beyond const space assigned, which can trigger HLSQ lockup 2) double emit of immed consts, first with bound const buffer vals followed by with actual immed vals. This seems to be a sort of undefined condition. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-02-01 11:57:09 -05:00
Rob Clark	5c6961efae	freedreno/a3xx/compiler: compiler cleanups Drop color/pos/psize_regid, plus a few compiler and IR cleanups. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-02-01 11:53:21 -05:00
Rob Clark	69eca28dd0	freedreno/compiler/a3xx: remove lowered instructions Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-02-01 11:52:27 -05:00
Rob Clark	0f2df4ff90	freedreno: add tgsi lowering pass Currently lowers the following instructions: DST, XPD, SCS, LRP, FRC, POW, LIT, EXP, LOG, DP4, DP3, DPH, DP2 translating these into equivalent simpler TGSI instructions. This probably should be moved to util so other drivers can use it, but just adding under freedreno for now so that I can clear out a lot of the lowering code in a3xx compiler before beginning to add new compiler. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-02-01 11:50:10 -05:00
Rob Clark	7524756199	freedreno/a3xx/compiler: add CLAMP Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-02-01 11:49:31 -05:00
Rob Clark	fafe16a8a0	freedreno/a3xx/compiler: various fixes Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-02-01 11:49:06 -05:00
Rob Clark	4971628bae	freedreno: ctx should hold ref to dev The ctx should hold ref to dev to avoid problems if screen is destroyed before ctx. Doesn't really fix the egl/glx issues, but at least it prevents things from getting much worse. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-02-01 11:47:08 -05:00
Rob Clark	303df12db8	freedreno: add prims-emitted driver query Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-02-01 11:45:19 -05:00
Kenneth Graunke	80bf1fbaf6	i965: Silence unused variable 'ctx' warning. Somehow I missed this before pushing the Broadwell PS state upload code. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2014-01-31 21:40:27 -08:00
Kenneth Graunke	e1cdafe6f7	i965: Fix math instruction hstride assertions on Broadwell. In the final revision of my gen8_generator patch, I updated the MATH instruction's assertion from (dst.hstride == 1) to check that source and destination hstride matched. Unfortunately, I didn't test this enough, and many Piglit tests fail this test. The documentation indicates that "scalar source is also supported", which we believe means <0,1,0> access mode (hstride == 0). If hstride is non-zero, then it must match the destination register. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-01-31 17:50:09 -08:00
Kenneth Graunke	d8878055f5	i965: Add (disabled) Broadwell PCI IDs. This puts the PCI IDs in place so it's easy to enable support. However, it doesn't actually enable support since it's very preliminary still, and a few crucial pieces (such as BLORP) are still missing. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Eric Anholt <eric@anholt.net>	2014-01-31 17:50:08 -08:00
Kenneth Graunke	3ade766684	i965: Disable 3DSTATE_WM_HZ_OP fields. Eric believes this to be wrong and unnecessary, as the command is supposed to emit an implicit rectangle primitive. However, empirically the pixel pipeline is completely unreliable without it. So for now, it stays until someone comes up with a better solution. We'll need to do better than this when we implement multisampling, HiZ, or fast clears...but for now, this will do. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Eric Anholt <eric@anholt.net>	2014-01-31 17:50:08 -08:00
Kenneth Graunke	4c4e0ed64b	i965: Update GS state for Broadwell. This is quite similar to the Gen7 code. The main changes: - 48-bit relocations - Thread count is specified as U/2-1 instead of U-1. - An extra DWord (DW9) with clip planes, URB entry output length/offsets - We need to program the "Expected Vertex Count" (VerticesIn) v2: Set the number of binding table entries so they can be prefetched (requested by Eric Anholt). v3: Add a WARN_ONCE for a missing workaround. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2014-01-31 17:50:08 -08:00
Kenneth Graunke	a0d4311072	i965: Update multisampling state for Broadwell. On previous platforms, 3DSTATE_MULTISAMPLE contained the number of samples, pixel location, and the positions of each sample within a pixel for each multisampling mode (4x and 8x). It was also a non-pipelined command, presumably since changing the sample positions is fairly drastic. Broadwell improves upon this by splitting the sample positions out into a separate non-pipelined state packet, 3DSTATE_SAMPLE_PATTERN. With that removed, 3DSTATE_MULTISAMPLE becomes a pipelined state packet. Broadwell also supports 2x and 16x multisampling, in addition to the 4x and 8x supported by Gen7. This patch, however, does not implement 2x and 16x. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Chad Versace <chad.versace@linux.intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2014-01-31 17:50:08 -08:00
Kenneth Graunke	9cd65e3289	i965: Update 3DSTATE_{DEPTH,STENCIL,...}_BUFFER and such for Broadwell. The amount of cut and paste from Gen7 is rather ugly, and should probably be cleaned up in the future. Even the Gen7 code is in need of some tidying though; many of the function parameters aren't used on platforms that use level/layer rather than tile offsets. Tidying both can be left to a future patch series. This at least gets things going. v2: Rebase on Paul's rename of NumLayers -> MaxNumLayers. v3: Shift QPitch by 2 when storing it in the packet. Bits 14:0 store bits 16:2 of the actual value. Fixes tests. v4: Add missing stencil buffer QPitch. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Eric Anholt <eric@anholt.net>	2014-01-31 17:50:08 -08:00
Kenneth Graunke	2fce1e3c69	i965: Update BLEND_STATE for Broadwell. v2: Allow logic ops on all surface types. The UNORM restriction was lifted with Haswell and I simply hadn't noticed. Also, add missing BRW_NEW_STATE_BASE_ADDRESS dirty bit. Both caught by Eric Anholt. v3: Fix swapped per-RT DWord pairs. Eliminates bizarre hacks. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2014-01-31 17:50:08 -08:00
Kenneth Graunke	460e0df330	i965: Update SF_CLIP_VIEWPORT for Broadwell. It has additional fields to support clipping to the viewport even if guardband clipping is enabled. v2: Update for viewport array changes. v3: No, seriously, update for viewport array changes. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net> [v1]	2014-01-31 17:50:08 -08:00
Kenneth Graunke	dcbf25969e	i965: Rework SURFACE_STATE entries for Broadwell. v2: Add missing SCS setting in gen8_emit_buffer_surface_state (caught by Eric Anholt). v3: Use stored QPitch rather than recomputing it. v4: Shift QPitch by 2 when setting it in the packet; bits 14:0 store bits 16:2 of the actual value (fixes myriads of cube and array texturing tests). Also, only enable cube face bits for cubemaps (matches Chris Forbes' commit on master). Port to use offset64. v5: s/gl_format/mesa_format/g v6: Fix DW5 of renderbuffer state, which neglected to subtract irb->mt->first_level. Use vertical_alignment() rather than hardcoding 4. Use ffs for multisample counts rather than a large switch statement (all caught/suggested by Eric). Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2014-01-31 17:50:07 -08:00
Kenneth Graunke	990aaf87c4	i965: Update SOL state for Broadwell. Unlike on Gen7, we can directly set the offset via the state packet. We also -have- to: the kernel SOL reset code won't work anymore. v2: Fix copy and paste mistake in buffer stride setup; drop stale comment (caught by Eric Anholt). Add a perf_debug for missing MOCS setup. v3: Rebase on Paul Berry's changes to CurrentVertexProgram. v4: Fix SO Write Offset handling. We need to set bits 20 and 21 so the hardware both loads and saves the offset. There's also a restriction that 3DSTATE_SO_BUFFER can only be programmed once per buffer between primitives, so the "reset to zero" code needed reworking. Fixes most of the transform feedback Piglit tests. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net> [v2]	2014-01-31 17:50:07 -08:00
Kenneth Graunke	fd91ab662d	i965: Update the code that disables unused shader stages for Broadwell. v2: Also disable 3DSTATE_WM_CHROMAKEY for safety. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net> [v1]	2014-01-31 17:50:07 -08:00
Kenneth Graunke	3d3c351cfb	i965: Update 3DSTATE_CLIP for Broadwell. Broadwell's winding order, polygon fill, and viewport Z test fields have moved to DWord 1 of 3DSTATE_RASTER. v2: Add a perf_debug for a future optimization and improve commit message (both suggested by Eric Anholt). Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2014-01-31 17:50:07 -08:00
Kenneth Graunke	5c0d7dbcb9	i965: Rework vertex uploads for Broadwell. v2: Emit a dummy 3DSTATE_VF_SGVS packet when not needed. v3: Add WARN_ONCE and perf_debugs requested by Eric Anholt. v4: Program 3DSTATE_SGVS even in the no-elements case so gl_VertexID continues working. Fix 3DSTATE_VF_INSTANCING to not use an element index to access the buffers array. Some ARB_draw_indirect prep work. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2014-01-31 17:50:07 -08:00
Kenneth Graunke	08a4714959	i965: Update STATE_BASE_ADDRESS for Broadwell. v2: Fix missing "change" bit on instruction state base address (caught by Haihao Xiang). v3: Add a perf_debug for missing MOCS setup, requested by Eric. v4: Fix buffer sizes. The value, specified at bit 12 and up, is actually measured in 4k pages. We need to round up to the next multiple of 4k. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net> [v3] Reviewed-by: Matt Turner <mattst88@gmail.com> [v4]	2014-01-31 17:50:07 -08:00
Kenneth Graunke	f3c6d6f1e1	i965: Update 3DSTATE_PS, 3DSTATE_WM, and add 3DSTATE_PS_EXTRA. v2: Fix setting of GEN8_PSX_ATTRIBUTE_ENABLE after rebases. v3: Add missing binding table entry counts. Don't worry about alpha testing or alpha to coverage when setting the "Kill Pixel" bit; those are specified in 3DSTATE_PS_BLEND (caught by Eric Anholt). Drop unused _NEW_BUFFERS. Tidy comments. v4: Rebase on Paul Berry's changes to CurrentFragmentProgram. v5: Re-enable line stippling. It doesn't crash or anything. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net> [v3]	2014-01-31 17:50:07 -08:00
Kenneth Graunke	20d9286f71	i965: Rework 3DSTATE_VS for Broadwell. v2: Remove incorrect MOCS shifts; rename urb_entry_write_offset to urb_entry_output_offset to closer match the documentation. v3: Only emit a non-zero constant buffer read length when active. v4: Add missing binding table counts (caught by Eric). v5: Rebase on Paul Berry's changes to CurrentVertexProgram. v6: Drop bogus SBE read length/offset field code. We were programming the wrong values, and our 3DSTATE_SBE code overrides any value we put here anyway with the correct one. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net> [v4]	2014-01-31 17:50:06 -08:00
Kenneth Graunke	c96686a6cc	i965: Add the new 3DSTATE_PS_BLEND state packet. v2: Only set GEN8_PS_BLEND_HAS_WRITEABLE_RT if color buffer writes are enabled (caught by Eric Anholt). v3: Set non-blending flags (writeable RT, alpha test, alpha to coverage) for integer formats too. +14 Piglits. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net> [v2]	2014-01-31 17:50:06 -08:00
Kenneth Graunke	17768bb7b4	i965: Replace DEPTH_STENCIL_STATE with Gen8's 3DSTATE_WM_DEPTH_STENCIL. v2: Use stencil->_WriteEnabled instead of setting GEN8_WM_DS_STENCIL_BUFFER_WRITE_ENABLE twice (suggested by Eric). v3: Mask stencil->WriteMask and stencil->ValueMask with 0xff. The field is only 8-bits, so we'd trip the new SET_FIELD assertion when core Mesa gave us a value like 0xFFFFFFFF. The Gen7 code uses structure field widths to implicitly do this truncation. Fixes Piglit tests. v4: Use uint32_t for dw1/dw2, not uint8_t. Worst. Typo. Ever. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net> [v2]	2014-01-31 17:50:06 -08:00
Kenneth Graunke	90fff1354b	i965: Update SF, SBE, and RASTER state for Broadwell. The attribute override portion of 3DSTATE_SBE was split out into 3DSTATE_SBE_SWIZ; various bits of 3DSTATE_SF were split out into 3DSTATE_RASTER. v2: Set Force URB Read Offset bit. Eventually the URB read offset should be set in 3DSTATE_VS, but that will require some refactoring. v3: Rebase on viewport array changes. v4: Improve comments about URB read length/offset overrides. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2014-01-31 17:50:06 -08:00
Kenneth Graunke	4552a22f04	i965: Bump generation assertions on workaround flushes. I haven't investigated whether these are necessary on Broadwell or not, but for paranoia's sake, we may as well continue doing them for now. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2014-01-31 17:50:06 -08:00
Kenneth Graunke	2184b519cd	i965: Duplicate gen7_atoms to gen8_atoms. It's going to diverge significantly. Starting out with a copy allows future patches to change atoms one by one. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2014-01-31 17:50:06 -08:00
Brian Paul	f51ca46f0c	radeon: move driContextSetFlags(ctx) call after ctx var is initialized CC: "10.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2014-01-31 17:09:44 -07:00
Brian Paul	2d6d69bab6	r200: move driContextSetFlags(ctx) call after ctx var is initialized Otherwise, ctx was a garbage value. CC: "10.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2014-01-31 17:09:44 -07:00
Roland Scheidegger	1d53603f1f	llvmpipe: fix denorm handling for r11g11b10_float format when blending The code re-enabling denorms for small float formats did not recognize this format due to format handling hacks (mainly, the lp_type doesn't have the floating bit set). Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2014-01-31 19:51:06 +01:00
Matt Turner	606544214e	glsl: Expand non-expr & non-swizzle scalar rvalues in vectorizing.	2014-01-31 10:21:50 -08:00

1 2 3 4 5 ...

60963 Commits