#asahi-gpu on 2021-07-10 — irc logs at oftc.irclog.whitequark.org

2021-06-22 12:29 ChanServ changed the topic of #asahi-gpu to: Asahi Linux: porting Linux to Apple Silicon macs | GPU / 3D graphics stack black-box RE and development (NO binary reversing) | Keep things on topic | GitHub: https://alx.sh/g | Wiki: https://alx.sh/w | Logs: https://alx.sh/l/asahi-gpu

00:08 quarkyalice has quit [Ping timeout: 480 seconds]

00:50 quarkyalice has joined #asahi-gpu

00:58 quarkyalice has quit [Ping timeout: 483 seconds]

01:20 Emantor has quit [Quit: ZNC - http://znc.in]

01:20 Emantor has joined #asahi-gpu

01:32 quarkyalice has joined #asahi-gpu

01:40 quarkyalice has quit [Ping timeout: 480 seconds]

01:54 yuyichao has joined #asahi-gpu

02:01 chadmed has joined #asahi-gpu

02:23 PhilippvK has joined #asahi-gpu

02:26 phiologe has quit [Ping timeout: 480 seconds]

02:45 chadmed has quit [Remote host closed the connection]

02:52 chadmed has joined #asahi-gpu

03:00 al3xtjames has quit [Quit: The Lounge - https://thelounge.chat]

03:03 al3xtjames has joined #asahi-gpu

03:51 chadmed has quit [Ping timeout: 480 seconds]

04:27 nafod has quit [Read error: Connection reset by peer]

04:27 nafod has joined #asahi-gpu

04:42 quarkyalice has joined #asahi-gpu

04:43 quarkyalice_ has quit [Quit: Leaving]

04:47 quarkyalice_ has joined #asahi-gpu

04:47 quarkyalice has quit []

04:59 al3xtjames has quit [Quit: The Lounge - https://thelounge.chat]

05:00 al3xtjames has joined #asahi-gpu

05:23 al3xtjames has quit [Quit: The Lounge - https://thelounge.chat]

05:24 al3xtjames has joined #asahi-gpu

05:28 al3xtjames has quit []

05:30 chadmed has joined #asahi-gpu

05:33 al3xtjames has joined #asahi-gpu

05:33 al3xtjames has quit []

05:36 al3xtjames has joined #asahi-gpu

05:51 chadmed has quit [Remote host closed the connection]

05:57 chadmed has joined #asahi-gpu

06:36 chadmed has quit [Ping timeout: 480 seconds]

06:59 chadmed has joined #asahi-gpu

07:39 aleasto has joined #asahi-gpu

11:14 choozy has joined #asahi-gpu

13:32 alyssa has joined #asahi-gpu

13:32 alyssa has left #asahi-gpu [#asahi-gpu]

13:37 bloom has joined #asahi-gpu

13:38 * bloom thinking of doing some housekeeping on the mesa driver

13:39 <bloom> may or may not stream, thinking about it though

13:50 <jix> would probably watch if you stream... might be too distracted to actually follow what's going on today though

13:53 <chadmed> now im conflicted as to whether to fall asleep to such a stream or stay up and learn something

13:56 choozy has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

14:10 <bloom> I don't want to be the reason someone's not sleeping!

14:11 <bloom> Alright. Sure. Let me get a bite to eat first. 20 minutes, then? (Starting at ~10:30am?)

14:18 <jix> works for me, it's afternoon here, so me not sleeping is probably a good thing ^^

14:29 <chadmed> bloom: hence my dilemma, its almost 1am so i should sleep, but i also want to watch

14:30 <bloom> :F

14:30 <bloom> Just getting setu now

14:34 <bloom> on air

14:35 <jix> can hear you :)

14:40 <jix> I've heard that even recent-ish discrete desktop GPUs use tiling (need to dig out the blog post which had some code to actually demonstrate that)

14:44 <jix> found it again, it was https://www.realworldtech.com/tile-based-rasterization-nvidia-gpus/

15:11 <jix> bloom: do you need to exclude already freed entries in track_free?

15:16 <jix> bloom: yeah your change of zeroing everything was what I was thinking of when I wrote that

15:51 <jix> bloom: I think the stream freezed

15:51 <jix> audio is still fine but it only shows the broken demo

15:51 <jix> yeah can hear you

15:52 <bloom> should be back now

15:52 <jix> yeah

15:58 <jix> i wonder, are the handles at the end of the header and at the end of each entry, or are they actually at the start of each entry? (might be obvious with more context, but isn't to me)

16:07 <jix> no still fine

16:10 <jix> maybe that code got culled at some point ;)

16:33 choozy has joined #asahi-gpu

16:35 phiologe has joined #asahi-gpu

16:39 PhilippvK has quit [Ping timeout: 480 seconds]

16:55 <jix> I'm a bit confused by having DIRTY_ANY defined to be a specific bit, or do you set it whenever you set another bit? (might have missed that)

16:56 <jix> at least that's what I had expected

16:56 <jix> wouldn't this also change the thing you were testing last?

16:57 <jix> (I might be totaly confused though)

16:57 <jix> ah yeah that was what I was missing, thanks :)

17:11 * bloom wondering how dirty tracking of user uniforms was ever supposed to work...

17:17 <xerpi[m]> Regarding this random hang/HW issue: are the CPU and GPU cache coherent? If so, only LLC or also L1/L2? Maybe you need to flush some shared stuff/descriptors up to LLC

17:17 <jix> is the windowserver log msg just forwarding errors from kernel, or is it having an additional error when this happens?

17:22 <jix> (I was asking because I was wondering whether the windowserver error has anything to do with how you get stuff on screen in the end (not knowing how you do that/how that works on osx), but just everything starting to fault makes sense, especially after seeing OBS hang before)

17:27 <xerpi[m]> Let's hope this was not the issue and that the GPU also snoops the L1/L2, otherwise it will get hairy, you would also need to invalidate BOs when reading data the GPU has written..

17:29 <xerpi[m]> Would be nice if the GPU reported some kind of error code. Maybe macOS kernel doesn't expose it to userspace, but would mmio tracing help?

17:31 <xerpi[m]> Maybe the debug/developer macOS kernel printks a nice error message

17:32 <jix> is there any apple code talking to the GPU that would need to be synchronized with what your mesa driver is doing? if it's not some cache issue, it seems like a race condition to me, just from the random timing and how it is affected by debug printing etc..

17:34 quarkyalice has joined #asahi-gpu

17:37 <jix> thanks for streaming :)

17:37 <bloom> thanks for watching!

17:38 <xerpi[m]> :D

17:38 <xerpi[m]> Btw this is interesting: https://developer.apple.com/documentation/metal/mtlstoragemode/managed

17:38 <xerpi[m]> Probably one of those magic numbers is related to this

17:39 <bloom> nod

17:40 <bloom> managed mode doesnt make sense for unified memory

17:40 <bloom> but maybe they play cache games

17:40 <jix> "In iOS and tvOS, the managed storage mode is not available.

17:41 <xerpi[m]> Hmm but maybe you don't need to have CPU and GPU caches coherent for some buffers, so this "managed" mode can help reduce traffic

17:41 <jix> sounds to me like they might have forgotten to update that this to say that it also includes macos on apple silicon

17:42 <sven> fwiw, so far all dma transactions i've seen between the cpu and any peripheral have been cache coherent

17:43 <bloom> sven: to be fair, gpu is "special" as far as peripherals go..

17:43 <sven> fair enough :-)

17:56 <bloom> sven: but this is definitely one of those "impossible" issues :|

17:56 <bloom> it does seem to be worse at higher frame rates and with higher geometry load

18:04 <jix> does it get worse when other processes do more GPU stuff at the same time?

18:04 quarkyalice__ has joined #asahi-gpu

18:05 quarkyalice has quit [Remote host closed the connection]

18:17 choozy has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

19:23 quarkyalice__ has quit [Ping timeout: 480 seconds]

19:43 <bloom> jix: yes, absolutely

19:43 <bloom> (which made me think the problem is preemption, but I can't prove that)

20:04 <jix> I wonder if dtrace would help in figuring out when this happens

20:10 <bloom> maybe? I' not familiar with dtrace

20:12 <jix> me neither, but I think it should allow you to do selective system wide tracing of stuff... e.g. the IOKit calls and scheduling/preemption of threads/processes

20:14 <jix> at least that's what I remember from when it was introduced and I was still using mac os, never did more than trying a few example dtrace scripts though... and that was ~14 years ago, so I have no idea how well it is supported today

20:15 <bloom> nod

20:22 <jix> there's also the instruments GUI app that's based on dtrace, and looking at https://web.archive.org/web/20200620075030/https://developer.apple.com/documentation/metal/using_metal_system_trace_in_instruments_to_profile_your_app that seems useful, if you can add rules to also trigger on the calls the mesa driver makes

20:24 <jix> (or without web.archive.org ... it's still online, that was just the link from wikipedia)

21:19 quarkyalice_ has quit [Quit: Leaving]

21:20 quarkyalice has joined #asahi-gpu

22:50 aleasto has quit [Quit: Konversation terminated!]