#zink on 2021-06-03 — irc logs at oftc.irclog.whitequark.org

00:31 adjtm has quit [Ping timeout: 480 seconds]

00:41 adjtm has joined #zink

07:24 adjtm has quit [Ping timeout: 480 seconds]

09:21 <kusma> Anyone willing to give this one a quick review? https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11081

12:05 <kusma> airlied: lp_build_insert_soa_chan doesn't seem to handle the cases where chan_desc.type == UTIL_FORMAT_TYPE_UNSIGNED / UTIL_FORMAT_TYPE_SIGNED and *both* chan_desc.pure_integer and type.floating is true...

12:06 <kusma> That is, writing float-values into formats like r8i / r16i etc...

12:07 <kusma> In that case, we only consider pure_integer, and bitcast the float to int, it seems.

12:08 <kusma> I'm a bit confused, because I don't see failures on LLVMpipe for this... but I do see it fail with Zink + Lavapipe...

12:08 <kusma> I mean: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10467/diffs#diff-content-3a964bec8d1767e05a777116116a81da9bc047cd

12:09 <kusma> I'm a bit confused why there's an incoming float here rather than an int...

12:21 <kusma> OK, that seems like it might be because st_pbo copies through a float... which seems like a bad idea to me.

12:21 <kusma> (nan-flushing, anyone?)

13:24 <zmike> llvmpipe wouldn't take that codepath because it does cpu texture population

13:24 <zmike> using the cpu map codepath

13:25 <kusma> zmike: Yeah, true.

13:25 <kusma> But I also don't see any other failures for r8i images etc

13:25 <zmike> in llvmpipe?

13:25 <kusma> Yeah

13:26 <zmike> probably just not hitting the exact case in any test that doesn't use the alternate pbo path

13:26 <kusma> Yeah, could be that the PBO-case does something that isn't really legal in GL or something

13:27 <zmike> mesa-internals aren't restricted to doing what GL can do, only what drivers can do

13:27 <kusma> exactly

13:34 <kusma> zmike: Yeah, you're right. This is not legal in GLSL, so I guess that's why it's not causing any issues there.

13:35 <kusma> imageStore() requires that the image type and value have the same type.

13:37 <kusma> But SPIR-V also have the same limitation, so I guess we have the same problem on the Zink-end...

13:37 <kusma> (only legal in SPIR-V for CL, not Vulkan)

13:38 <kusma> Or maybe I'm confusing things a bit now...

13:40 <kusma> Yeah, we're just doing a bitcast here. Which isn't going to do the right thing :/

13:43 <kusma> Hmpf, but we're doing the same on ANV, where things work...

14:14 hch12907_ has joined #zink

14:15 hch12907 is now known as Guest729

14:15 hch12907_ is now known as hch12907

14:15 Guest729 has quit [Ping timeout: 480 seconds]

14:17 hch12907 has quit [Remote host closed the connection]

14:18 hch12907 has joined #zink

14:19 hch12907_ has joined #zink

14:19 hch12907 is now known as Guest731

14:19 hch12907_ is now known as hch12907

14:20 Guest731 has quit []

14:29 <kusma> zmike: Hmmpf, seems there's a *lot* of completely valid validator errors in this test (KHR-GL32.packed_pixels.pbo_rectangle.r16i on ANV)... I've disabled EXT_extended_dynamic_state to avoid the obvious validator-derp stuff...

14:29 <kusma> Lots of these things "VkDescriptorSet 0x4c400000004c4[] encountered the following validation error at vkCmdDraw() time: Descriptor in binding #544 index 0 requires FLOAT component type, but bound descriptor format is VK_FORMAT_R16_SINT."

14:44 <zmike> that's the same thing you just complained about with copying int formats using float

14:44 <zmike> not my problem

14:45 <kusma> Yeah, seems you're right. I think I have a fix for that.

14:54 <kusma> And with that fixed... We also fail the test on ANV :P

14:54 <zmike> "fixed"

14:54 <kusma> What could possibly go wrong?!

14:55 <kusma> Look, the validator tells me I'm doing everything correct. Apart from barriers, it doesn't like the access bits there.

14:56 <kusma> Oh, and this: vkCreateRenderPass: value of pCreateInfo->pDependencies[0].srcStageMask must not be 0. The Vulkan spec states: srcStageMask must not be 0 (https://www.khronos.org/registry/vulkan/specs/1.2-extensions/html/vkspec.html#VUID-VkSubpassDependency-srcStageMask-requiredbitmask)

14:56 <zmike> already on my list

14:57 <kusma> Cool. Not trying to shovel things over on you here, just debugging out loud ;)

14:58 _whitelogger has joined #zink

15:00 <kusma> Nevermind, the test passes. I was just being stupid.

16:40 <hch12907> I noticed there's quite a lot of C99 VLA in nir_to_spirv... are there any particular reasons of using it? (e.g. avoiding heap allocation in hotpath)

16:40 <hch12907> I was looking at issue #4854, for context

16:41 <zmike> no particular reason that I'm aware of

16:49 jekstrand has joined #zink

17:08 <kusma> That's probably laziness on my end. I'm all for fixing it!

17:12 <hch12907> some of them are fairly trivial, I can simply replace num_components with NIR_MAX_VEC_COMPONENTS

17:13 <hch12907> but others like `SpvId src[nir_op_infos[alu->op].num_inputs]`... not so much

17:13 <hch12907> we don't have a NIR_ALU_MAX_INPUTS lying somewhere in the headers :P

17:14 <jekstrand> It's the same as MAX_VEC_COMPONENTS

17:15 <jekstrand> vecN is the biggest ALU op

17:15 <jekstrand> We should really have a #define for that too. I'd not be opposed to someone adding one.

17:15 <jekstrand> But MAX_VEC_COMPONENTS is sufficient, probably

17:15 <jekstrand> There is a MAX_INPUTS thing for intrinsics, I think.

17:15 <jekstrand> Maybe we should add one for ALU

17:16 <hch12907> I saw there is a MAX_INPUTS for intrinsics, but nothing for other instructions

17:17 <jekstrand> Only intrinsics and ALU really have a max and, for ALU, it's MAX_VEC_COMPONENTS

17:17 <jekstrand> THough, like I said, I wouldn't be opposed to a `#define NIR_ALU_MAX_INPUTS NIR_MAX_VEC_COMPONENTS` with a comment saying that vecN is the widest ALU op.