#asahi-gpu on 2023-01-15 — irc logs at oftc.irclog.whitequark.org

2022-12-21 00:46 ChanServ changed the topic of #asahi-gpu to: Asahi Linux GPU development (no user support, NO binary reversing) | Keep things on topic | GitHub: https://alx.sh/g | Wiki: https://alx.sh/w | Logs: https://alx.sh/l/asahi-gpu

00:07 <lina> alyssa: What did you see when you tried compute stuff without the magic shader?

00:07 <lina> I'm not using it right now and I *think* it's for context switching only, but I also have this cache issue...

01:34 faruk has quit [Quit: Textual IRC Client: www.textualapp.com]

01:47 systwi has joined #asahi-gpu

02:11 ah- has joined #asahi-gpu

02:58 possiblemeatball has joined #asahi-gpu

03:00 cylm has quit [Ping timeout: 480 seconds]

03:04 mini0n has quit [Quit: Leaving]

03:12 pthariensflame has joined #asahi-gpu

03:23 Stary has quit [Ping timeout: 480 seconds]

03:37 Stary has joined #asahi-gpu

04:00 pthariensflame has quit [Quit: Textual IRC Client: www.textualapp.com]

04:11 Stary has quit [Ping timeout: 480 seconds]

04:11 <lina> I'm looking at the magic shader code and it seems to be some kind of dispatcher based on a special register value...

04:12 <lina> It definitely interacts with the compute buffer struct I found

04:12 <lina> Some codepaths do save/restore stuff (and I think we might be missing some looping instruction decoding? I see unknown instructions right after the load/store groups...)

04:12 <lina> But at least one op, 0xf, just runs 5 unknown instructions. I wonder if that is cache maintenance...

04:21 <lina> And then there's a magic store to address 0x2d822acc in here... is this some kind of MMIO or doorbell?

04:54 Stary has joined #asahi-gpu

06:02 <lina> Aaaa, I just tried doing a faulting write in that shader and it locks up the whole GPU... the compute part runs fine, but a subsequent 3D job hangs. I can see the fault status but the firmware is just sitting there...

06:02 <lina> I think what happens is that since we're not flushing caches, the fault happens after the compute job is complete, in some state where the firmware doesn't implement timeouts properly...

06:26 possiblemeatball has quit [Quit: Leaving]

06:52 SSJ_GZ has joined #asahi-gpu

07:43 hertz has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

08:18 tim has joined #asahi-gpu

08:18 tim has quit [Remote host closed the connection]

09:07 le0n has quit [Quit: see you later, alligator]

09:08 le0n has joined #asahi-gpu

09:36 LinuxM1 has joined #asahi-gpu

09:52 SSJ_GZ has quit [Ping timeout: 480 seconds]

10:02 kit_ty_kate has quit [Quit: WeeChat 3.6]

10:56 LinuxM1 has quit [Ping timeout: 480 seconds]

11:06 bcrumb has joined #asahi-gpu

11:06 bcrumb has quit []

12:02 hertz has joined #asahi-gpu

12:16 mkurz has quit [Quit: Leaving]

13:28 bcrumb has joined #asahi-gpu

13:49 possiblemeatball has joined #asahi-gpu

14:05 bcrumb has quit [Quit: WeeChat 3.7.1]

14:11 SSJ_GZ has joined #asahi-gpu

14:19 cylm has joined #asahi-gpu

14:25 cr1901 has quit [Ping timeout: 480 seconds]

14:38 Tramtrist has quit [Remote host closed the connection]

14:42 Tramtrist has joined #asahi-gpu

15:03 amarioguy has joined #asahi-gpu

15:52 alyssa has joined #asahi-gpu

15:52 <alyssa> lina: https://gitlab.freedesktop.org/asahi/mesa/-/commit/fe3eed5f527cf51f8a45b2ad24ba5a79eb7e71c3#89b6e919adec3916dacf34377fe9e0036bd40140_42_47

15:52 <alyssa> this should be _writes, not _reads

15:54 <alyssa> https://gitlab.freedesktop.org/asahi/mesa/-/commit/e4a856e52ac8d88b3af6b5c4cad0534d01edac96#89b6e919adec3916dacf34377fe9e0036bd40140_89_89

15:54 <alyssa> this should be predicated on stage == COMPUTE, you're writing into a union

15:55 balor has quit [Quit: balor]

15:56 balor has joined #asahi-gpu

15:56 <ella-0> alyssa: With respect to the system values MR I am currently porting AGXV to it. Will let you know if anything doesn't work well with vulkan but so far so good :3

16:05 bcrumb has joined #asahi-gpu

16:07 <alyssa> ella-0: awesome!

16:07 <alyssa> Is it, like... better than what we had before?

16:07 <alyssa> I've never written a VK driver lol

16:12 <ella-0> yes

16:14 <ella-0> It simplifies the command buffer code somewhat and makes it possible to allocate a chunk of the register file for push constants.

16:15 <alyssa> Hmmm ok

16:15 <alyssa> I still don't really get how

16:15 <alyssa> but if you say it's right and Jason says it's right that is good for me

16:15 <alyssa> and obviously it solves a problem I had in GL lol

16:18 bcrumb has quit [Quit: WeeChat 3.7.1]

16:22 <alyssa> by the way.. at what point will we be bottlenecked on agxv specific stuff

16:22 <alyssa> i.e. when should I pull your tree and start hacking on deqp-vk, versus continuing to support more and better gl in the interest of agxv

16:28 mini0n has joined #asahi-gpu

16:28 <ella-0> uhh not sure

16:28 <alyssa> kie

16:29 <ella-0> I was hoping to get some basic compute running on agxv before that I think

16:29 <alyssa> makes sense

16:30 <alyssa> I was hoping the combination of you, Lina, Karol, and Dougall would be able to finish off compute support without me ^.^

16:33 <alyssa> make sure my baby can grow without me, yknow

16:41 faruk has joined #asahi-gpu

16:43 <alyssa>

16:50 <alyssa> o

16:57 cr1901 has joined #asahi-gpu

16:58 <ella-0> That makes sense. I'm happy to work on compute stuff :3 currently the main thing holding agxv back is not having vkCmdCopy* and other transfer operations implemented. I tried and failed to implement them using the tilebuffer load/store shaders

17:00 <alyssa> Right..

17:00 <alyssa> vkCmdCopy* sucks.

17:01 <alyssa> vkCmdCopyImageToBuffer should probably be a compute kernel

17:01 <alyssa> vkCmdCopyBufferToImage should be a fragment shader

17:01 <alyssa> probably

17:01 <alyssa> except maybe some little details about ASTC/BCn formats? but I think you can just munge the dimensions?

17:01 <alyssa> also I would expect vk_meta can do those because they're pretty general

17:02 <alyssa> vkCmdCopyBufferToBuffer is also a totally generic compute kernel
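
A rough sketch of one way to read "munge the dimensions" for ASTC/BCn in the buffer/image copies above: treat each compressed block as a single texel of an uncompressed image, so the copy sees the extent in blocks rather than pixels. This interpretation, the `BLOCK_INFO` table, and the `munged_extent` helper are illustrative assumptions, not something taken from the agxv or vk_meta code.

```python
# Hypothetical helper: view a block-compressed image as an image of
# "one texel per block" so a generic copy shader can move it.

def div_round_up(n, d):
    """Integer division rounding up, for partial blocks at the edges."""
    return (n + d - 1) // d

# (block width in px, block height in px, bytes per block)
BLOCK_INFO = {
    "BC1": (4, 4, 8),
    "BC3": (4, 4, 16),
    "ASTC_4x4": (4, 4, 16),
    "ASTC_8x8": (8, 8, 16),
}

def munged_extent(fmt, width_px, height_px):
    """Return (width in blocks, height in blocks, bytes per block)."""
    block_w, block_h, block_bytes = BLOCK_INFO[fmt]
    return (
        div_round_up(width_px, block_w),
        div_round_up(height_px, block_h),
        block_bytes,
    )
```

Under that assumption, a 130x67 BC1 image would be copied as a 33x17 image of 8-byte texels.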

17:02 faruk has quit [Quit: Textual IRC Client: www.textualapp.com]

17:02 <alyssa> the only spicy thing is vkCmdCopyImageToImage

17:02 <alyssa> implementing that efficiently requires being able to "cast" framebuffer compressed images to different formats

17:03 <ella-0> Yup it's pain :<

17:04 <alyssa> right, so, there are basically 2 classes of hardware

17:06 <alyssa> 1. hw that can do compressed texture views. in this case, img2img copies are just blits! you might need to munge the dest format into something renderable with the same size, maybe munge the dimensions in some evil cases, but by and large this is generic blitting on the fragment pipe, with a texture view (with view format != original format) on the sampled image doing the casting

17:08 <alyssa> 2. hw that can't. obviously this class of hw can still do uncompressed -- at worst, linear -- texture views. so the sane option here is to eat the bandwidth hit -- allocate an uncompressed staging resource and lower compressed->compressed copies to a pair of copies compressed->uncompressed + uncompressed->compressed because you can do views for each of those. and those simpler copies are easy and generic on the fragment pipe.

17:08 <alyssa> I am unsure which class of hardware AGX is.

17:08 <alyssa> Mali is #1 on Valhall, but #2 on anything older

17:09 <alyssa> panvk 1.0 has some extremely delicate logic to try to do compressed->compressed image copies with format conversion in one shot with format packing/unpacking in the copy shaders

17:10 <alyssa> while that's almost certainly faster, it isn't worth the complexity ... vkCmdCopyImageToImage with compressed<--->compressed and incompatible formats is an abomination that should never have been added to the spec and bad perf is to be expected

17:10 <alyssa> I don't know if any real apps would even hit that. I'm doubtful.

17:10 <alyssa> GL drivers won't even try and will just decompress your image in this case.

17:13 hightower2 has joined #asahi-gpu

17:15 <alyssa> oh, uh

17:15 <alyssa> 3 classes i guess

17:16 <alyssa> 3. hardware blitters that trivialize the problem

17:16 <alyssa> NVIDIA is #3 so NVK doesn't need meta shaders for this

17:16 <ella-0> interesting

17:17 <alyssa> but architecturally vk_meta should be able to support both #1 and #2 with some work
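
The three hardware classes described above could be sketched as a small planning function that decides how an image-to-image copy gets lowered. This is only an illustration of the reasoning in the discussion: `HwClass`, `plan_image_copy`, and the step strings are made-up names, not vk_meta or driver API, and "compressed" here refers to framebuffer-compressed layouts.

```python
from enum import Enum

class HwClass(Enum):
    COMPRESSED_VIEWS = 1         # class 1: can cast compressed images via views
    UNCOMPRESSED_VIEWS_ONLY = 2  # class 2: views only on uncompressed images
    HW_BLITTER = 3               # class 3: dedicated copy hardware

def plan_image_copy(hw_class, src_compressed, dst_compressed):
    """Return the list of steps (hypothetical labels) for one copy."""
    if hw_class is HwClass.HW_BLITTER:
        # The blitter handles the copy, no meta shaders needed.
        return ["hw_blit(src -> dst)"]
    both_compressed = src_compressed and dst_compressed
    if hw_class is HwClass.COMPRESSED_VIEWS or not both_compressed:
        # A single blit on the fragment pipe, casting through a view.
        return ["fragment_blit(view(src) -> dst)"]
    # Class 2 with compressed -> compressed: eat the bandwidth hit and
    # go through an uncompressed staging resource.
    return [
        "fragment_blit(src -> staging)",
        "fragment_blit(staging -> dst)",
    ]
```

For example, `plan_image_copy(HwClass.UNCOMPRESSED_VIEWS_ONLY, True, True)` yields the two-step staging path, while the same copy on class 1 hardware stays a single blit.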

17:17 <alyssa> if you don't get to it I probably will in, like, June

17:17 <alyssa> July maybe

17:17 <alyssa> for $DAY_JOB

17:21 <ella-0> okay :3

17:24 bluetail8 has joined #asahi-gpu

17:29 faruk has joined #asahi-gpu

17:30 landscape15 has joined #asahi-gpu

17:31 bluetail has quit [Ping timeout: 480 seconds]

17:31 bluetail8 is now known as bluetail

17:41 faruk has quit [Quit: Textual IRC Client: www.textualapp.com]

18:10 landscape15 has quit [Quit: WeeChat 3.8]

18:21 DarkShadow4444 has quit [Quit: ZNC - https://znc.in]

18:21 hotbbqsauce has joined #asahi-gpu

18:22 hotbbqsauce has quit []

18:25 DarkShadow4444 has joined #asahi-gpu

18:26 DarkShadow4444 has quit [Remote host closed the connection]

18:26 DarkShadow4444 has joined #asahi-gpu

18:33 dg_ has joined #asahi-gpu

18:34 <dg_> Error: PageSize configuation is wrong: configured with 4096, but got 16384

18:34 dg_ has quit []

18:57 m1nion has joined #asahi-gpu

19:03 mini0n has quit [Ping timeout: 480 seconds]

19:28 tim has joined #asahi-gpu

19:29 tim is now known as Guest1427

19:32 zkrx has quit [Remote host closed the connection]

19:33 zkrx has joined #asahi-gpu

19:35 possiblemeatball has quit [Ping timeout: 480 seconds]

19:39 possiblemeatball has joined #asahi-gpu

20:12 bcrumb has joined #asahi-gpu

20:12 bcrumb has quit []

20:15 bcrumb has joined #asahi-gpu

20:22 bcrumb has quit [Quit: WeeChat 3.7.1]

20:27 bcrumb has joined #asahi-gpu

20:43 bcrumb has quit [Quit: WeeChat 3.7.1]

20:47 SSJ_GZ has quit []

20:50 bluetail has quit [Quit: The Lounge - https://thelounge.chat]

20:53 bluetail has joined #asahi-gpu

21:06 bluetail has quit [Quit: The Lounge - https://thelounge.chat]

21:09 bluetail has joined #asahi-gpu

21:16 balor has quit [Quit: balor]

21:16 balor has joined #asahi-gpu

21:20 krbtgt has joined #asahi-gpu

21:26 possiblemeatball has quit [Quit: Leaving]

21:31 Guest1427 has quit [Quit: Guest1427]

21:54 Dementor has quit [Remote host closed the connection]

21:54 possiblemeatball has joined #asahi-gpu

21:55 Dementor has joined #asahi-gpu

23:00 yrlf has quit [Quit: The Lounge - https://thelounge.chat]

23:20 irth has joined #asahi-gpu

23:44 m1nion has quit []

23:53 ju has quit [Remote host closed the connection]

23:54 possiblemeatball has quit [Ping timeout: 480 seconds]

23:56 possiblemeatball has joined #asahi-gpu