#asahi-gpu on 2023-03-07 — irc logs at oftc.irclog.whitequark.org

2022-12-21 00:46 ChanServ changed the topic of #asahi-gpu to: Asahi Linux GPU development (no user support, NO binary reversing) | Keep things on topic | GitHub: https://alx.sh/g | Wiki: https://alx.sh/w | Logs: https://alx.sh/l/asahi-gpu

00:50 aratuk has quit [Remote host closed the connection]

01:37 aratuk has joined #asahi-gpu

01:47 alyssa has joined #asahi-gpu

01:47 <alyssa> -next is looking quite a bit less scary

01:50 nafod8 has joined #asahi-gpu

01:50 nafod has quit [Read error: Connection reset by peer]

01:50 nafod8 is now known as nafod

01:52 aratuk has quit []

02:10 <alyssa> still big, but

02:10 <alyssa> 40 commits now

02:11 akspecs_ has quit [Remote host closed the connection]

02:11 akspecs has joined #asahi-gpu

03:02 <i509vcb> Not sure if it is implemented right now, but I believe the hardware supports enough to implement timeline support for DRM sync objects?

03:03 <i509vcb> /s/believe/does

03:03 <alyssa> i think yes

03:20 kesslerd has quit [Remote host closed the connection]

03:36 <i509vcb> I'm wondering is the next release going to support GLES 3.0? I have seen a lot of the commit history showing compute related stuff.

03:37 <i509vcb> (Although I'd like to try my hand at some vulkan stuff for agx but don't want to cause 2 efforts for it)

03:39 <alyssa> we don't really have a release schedule

03:39 <alyssa> but yes, gles 3.0 comes after gles 2.0

03:39 <alyssa> and gles 3.1 after gles 3.0

03:39 <alyssa> and vulkan 1.0 after that

04:23 <lina> i509vcb: Timeline sync is implemented, though completely untested ^^

04:24 <lina> (I just copied what Xe did)

04:24 <i509vcb> Hmm neat, so assuming no bugs it should be effectively automatic to have VK_KHR_timeline_semaphore

04:25 <lina> Probably! (I don't know much about Vulkan yet, but the UAPI is designed based on modern Vulkan requirements)

04:26 <i509vcb> I personally don't want to bug ella right now until some of the buffer and image copy stuff is pushed in (probably some local changes). But I will stick around and try to help with Vulkan if possible

04:27 <lina> alyssa: Did you get a chance to test the new kernel branch? I want to know if it fixes all the brokenness you were running into ^^;;

04:27 <i509vcb> I get that it is not your priority right now, but nothing wrong with trying to make it work in parallel

04:28 <lina> Definitely, and I think both alyssa and I will be moving towards Vulkan as the GL stuff settles down too ^^

04:28 <i509vcb> (I partially want to see Vulkan so I can yeet the EGL code in my compositor)

04:28 <lina> I'm actually going to write a doc on how the queue synchronization works in our driver, since I need that to make the GL frontend understandable too, but it should be useful for Vulkan

04:28 wintp has quit [Quit: Connection closed for inactivity]

04:32 user982492 has joined #asahi-gpu

04:35 <alyssa> lina: what's the new kernel branch

04:35 <alyssa> I've been running b7b005953e96d9e443ea2edd5e3befd63e911138

04:35 <alyssa> works like a charm

04:36 <alyssa> (with next)

04:36 <alyssa> (except for all the shit piglit provokes but, i'm just playing with registers ;~P)

04:39 <lina> alyssa: That's before the fixes, gpu/explicit-sync should fix the piglit explosions and that splat that you saw

04:39 <lina> It's also a rebase (and might need a new m1n1 for you) but it's what will be released so I would appreciate it if you can test it ^^

04:43 * alyssa eyes clock

04:43 <alyssa> sure ok

04:47 <lina> alyssa: Is agx/release still the right branch for release (other than that dual source blending thing we need to drop)?

04:51 <alyssa> yes please

04:59 <alyssa> lina: what happened to my DCP?

05:00 <alyssa> reserved-memory node 'framebuffer' not found

05:00 <alyssa> adev bind failed: -19

05:00 c10l has quit [Quit: Bye o/]

05:00 <alyssa> I updated my mini

05:01 c10l has joined #asahi-gpu

05:01 <lina> Are you sure you updated m1n1?

05:01 <alyssa> yes

05:02 <lina> To upstream main?

05:02 <alyssa> to lina/gpu-wip

05:03 <lina> Oh sorry no, that's just my experiments and I hadn't pushed yet ^^;;;

05:03 <lina> Pushed now (or just use main)

05:03 <alyssa> switched to main

05:03 <alyssa> girl you gotta tell me these things :p

05:03 <lina> There are no m1n1 changes for agx, it's just DCP that changed

05:04 <alyssa> ok

05:04 <alyssa> what happened to my AGX?!

05:04 <lina> Whaa...

05:05 <alyssa> DRM_ASAHI=n and I don't see the option

05:05 <alyssa> Guessing Rust is broken again

05:05 <lina> "make rustavailable"

05:05 <alyssa> rust is available

05:05 <lina> Did I mess this up??

05:06 <lina> Aaaaa hold on

05:06 <lina> Sorry this was a thing I added last minute yesterday and kconfig confuses me

05:07 <alyssa> ;/

05:08 <lina> alyssa: Pushed, should be fixed, sorry T_T

05:12 <alyssa> GPU crashed immediately, try again

05:12 <lina> Umm...

05:12 <lina> Whaaa...

05:12 <lina> How??

05:13 <alyssa> I'm going to guess whatever code I just pulled is broken

05:13 <alyssa> Or extremely bad luck

05:13 <alyssa> But probably the first one

05:13 <alyssa> https://rosenzweig.io/a

05:13 <alyssa> can I downgrade now?

05:14 <lina> But it works for me...

05:14 <lina> Do you have more dmesg backlog?

05:14 <alyssa> no

05:14 <lina> The first line there is already broken...

05:15 <alyssa> that's just from running kmscube

05:16 <lina> Are you on 12.3?

05:16 <alyssa> yes

05:17 <lina> I don't get it... I really need more dmesg backlog to have a guess at what went wrong there, it's like something is horribly wrong with the fwctl ring...

05:17 <lina> And I dind't touch that code in forever...

05:18 <alyssa> there's no other dmesg backlog

05:18 <lina> OH

05:18 <lina> OMG I forgot repr(C) on that and your Rust compiler must've decided to use a different struct layout aaaa

05:18 <alyssa> santa are you ok

05:18 <lina> Please try again T_T

05:19 <lina> This has always been broken, you just got unlucky hitting it now T_T

05:19 <alyssa> Woo

05:19 <alyssa> luv the commit msg

05:20 <alyssa> and glad i could be a serviceable rubber duck

05:21 <lina> ;;

05:23 <alyssa> ok yes i have spinny cube again

05:29 <alyssa> lina: new kernel survived a piglit run. thanks for the fixes :)

05:29 <lina> ^^

05:30 <alyssa> and actually the piglit fail/crash number isn't all that different from the older release

05:30 <lina> Glad to hear that!

05:30 <alyssa> so unless I see something scary in the fails.csv list (haven't looked yet) I'd say agx/release is good to go

05:32 <lina> Yay~!

05:34 <alyssa> `yeah, nothing here scares me

05:34 <alyssa> good to go

05:34 <lina> Thank you for testing, and sorry for keeping you up late ^^;;

05:35 <alyssa> nw

05:35 <alyssa> ok but why is my system hanging trying to open gnome

05:35 <alyssa> ;;;;

05:35 <lina> Ummmmmmm

05:35 <lina> ;;;;;;

05:35 <lina> Works here...

05:36 <lina> I don't want to keep you up too late but if you can get me some backtraces of relevant stuff and dmesg if anything that might help ^^

05:36 <alyssa> sway is going ok

05:37 <alyssa> could it be because gnome is a piece of-

05:38 <alyssa> ooh, oof, this is your bug after all

05:38 <lina> ;;;;;;;;;;;

05:38 <lina> kernel or mesa?

05:38 <alyssa> kernel

05:38 <lina> Any messages?

05:39 <alyssa> https://rosenzweig.io/splat

05:39 <alyssa> more missing repr(C)?

05:40 <lina> Is there anything earlier? That just looks like the firmware died...

05:40 <alyssa> no

05:40 <lina> Can you upload your kernel binary somewhere and go to sleep? I can probably track it down running it in the hypervisor.

05:41 <alyssa> https://rosenzweig.io/Image.gz

05:41 <lina> Could be a repr(C) thing and your Rust compiler really wants to shuffle structs around for some reason

05:42 <alyssa> IDK why it's only happening with gnome

05:42 <alyssa> sway + stk is fine

05:45 <lina> Added more reprs though it was kinda trivial stuff, I don't know if that's it...

05:45 <alyssa> im kinda surprised rust doesn't catch this sort of bug

05:46 <lina> It's an FFI thing, you need to set the struct layout... Once I add the Castable thing it should catch some (but not all) of these issues

05:46 <lina> I think it's something that should eventually be a core feature

05:47 <alyssa> what I mean is, how is it ever sound to get the raw bytes of a repr(Rust) struct

05:47 <alyssa> or is this happening in an unsafe{} block with a botched SAFETY comment?

05:47 <lina> It isn't, it's all unsafe, yeah

05:47 <alyssa> still feels like this is a footgun API

05:47 <lina> Hence the Castable thing (that's supposed to mean "safe to get the raw bytes" but this isn't a core language feature yet)

05:47 <alyssa> because even in an unsafe{} block it is literally never legal to get raw bytes of repr(Rust)

05:48 possiblemeatball has quit [Quit: Quit]

05:48 <lina> It is, if you know there is no padding (which can be known)

05:48 <alyssa> is that portable?

05:48 <lina> You can't assume anything about the layout, but you could restore it into an identical struct

05:48 <lina> So that's not unsafe

05:49 <lina> Can you check if the last patch fixed it? It's all I've got...

05:49 <alyssa> (OK, I hit the issue with firefox in sway on the older kernel. good to know it's not just gnome)

05:54 <alyssa> lina: Nope, still no good

05:54 <alyssa> https://rosenzweig.io/I is with your patch

05:54 <lina> OK, go to sleep sis ^^

05:54 <lina> I'll test with your kernel

05:54 <alyssa> fiiiiine

05:54 <alyssa> nini

06:08 <i509vcb> bytemuck does have some safety notes that might be worth considering even though you seem to be rolling your own type -> bytes conversion: https://docs.rs/bytemuck/1.13.1/bytemuck/trait.Pod.html

06:16 user982492 has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

06:25 alyssa has left #asahi-gpu [#asahi-gpu]

07:21 c10l has quit [Read error: Connection reset by peer]

07:21 c10l has joined #asahi-gpu

07:35 cr1901_ has joined #asahi-gpu

07:42 cr1901 has quit [Ping timeout: 480 seconds]

08:06 MajorBiscuit has joined #asahi-gpu

08:15 Major_Biscuit has joined #asahi-gpu

08:16 MajorBiscuit has quit [Ping timeout: 480 seconds]

08:20 bisko has joined #asahi-gpu

09:05 nyilas has joined #asahi-gpu

09:07 nyilas has quit [Remote host closed the connection]

09:12 <lina> OK, alyssa was using the debug allocator and I think we're hitting a firmware bug/race, but I don't think we use that codepath at all in the regular mode so I'm just going to punt on it...

09:14 nyilas has joined #asahi-gpu

09:26 DarkShadow44 has quit [Quit: ZNC - https://znc.in]

09:26 DarkShadow44 has joined #asahi-gpu

10:17 DarkShadow44 has quit [Quit: ZNC - https://znc.in]

10:19 DarkShadow44 has joined #asahi-gpu

10:31 commandoline has joined #asahi-gpu

10:34 commandoline has quit []

10:35 commandoline has joined #asahi-gpu

11:12 <lina> Wrote this to attach to the DRM submission ^^ (cc ella-0 & alyssa): https://github.com/AsahiLinux/docs/wiki/SW:AGX-driver-notes

11:16 as400 has quit [Remote host closed the connection]

11:19 as400 has joined #asahi-gpu

11:41 nyilas has quit [Remote host closed the connection]

11:46 karolherbst_ is now known as karolherbst

11:59 hightower2 has joined #asahi-gpu

12:24 alyssa has joined #asahi-gpu

12:28 <alyssa> lina: nit, "the VDM level (in the GPU command stream)"

12:29 <alyssa> *GX actually has a "control stream" in powervr lingo, not a command stream

12:29 <alyssa> the distinction being, it isn't executed / interpeted in any meaningful way, it's not structured as commands or bytecode or anything

12:30 <alyssa> it's just a bunch of control packets one after the other, plus a link command to jump between control streams

12:30 <alyssa> this is very different than GPUs with real command streams,

12:31 <alyssa> e.g. Mali-G610 exposes an entire Turing-complete-by-design ISA where the state vector is passed in registers that the driver sets with MOV_IMM32 instructions

12:31 <alyssa> but that has general purpose "load/store registers from/to memory" instructions, conditional branching based on the values of registers, a simple integer ALU, etc

12:33 <alyssa> generally this means that AGX is easier to program -- and the AGX control stream is a lot more compact than the Mali command stream -- but there's a pretty low ceiling for what you can do with AGX in terms of indirection

12:33 <alyssa> there's hardware support for a few important types of indirects, where there's a special control word to do an indirect draw or whatever

12:33 <alyssa> but if you have an API requirement that the hw doesn't accommodate, you're kinda screwed

12:34 <alyssa> notably, multiDrawIndirect, which is like draw indirect but now draws an arbitrary number of draws indirectly with the draw count loaded from memory

12:34 <alyssa> on Mali-G610, the driver implements this with a loop in the command stream

12:35 <alyssa> on AGX, we have no way to express a loop, it's not turing complete by itself

12:35 <alyssa> so either we don't do multiDrawIndirect, or we implement it with a compute kernel that generates a control stream with all the draws at runtime and then stream_links to it

12:37 <alyssa> (which is kinda horrible, because you need to plumb the draw ID into the shaders as a special uniform, but changing the uniform state means re-emitting a whole pile of shader state. it can be done with a "clone and patch" technique, or with a sufficient horrible compute kernel. but it's not nice.)

12:37 <alyssa> (but Metal's "GPU indirect buffers" are exactly this, and if MoltenVK supports multiDrawIndirect - dont remember if they do - it's with this)

12:37 <alyssa> another notable one, conditional rendering, which lets you predicate draws on an occlusion query passing

12:38 <alyssa> on Mali, this is really easy!

12:38 <alyssa> LOAD the result of the query and branch over the draws based on the result

12:38 <alyssa> (*Mali-G610, older Malis this all suuuucks far worse tha AGX)

12:38 <alyssa> on AGX, this is a lot more tricky

12:39 <alyssa> unless there's a sufficiently general conditional stream link, we again need to dispatch a compute kernel for this

12:39 <alyssa> (feel free to replace "Compute kernel" with "internal vertex shader" if you want this to all happen on the VDM, btw)

12:39 <alyssa> this compute kernel though isn't too horrible, thankfully

12:40 <alyssa> in the control stream after emitting the compute, always emit a stream link and put all the draws to predicate in a new control stream

12:40 <alyssa> and then the compute kernel will just patch the link target based on the query result

12:42 <alyssa> none of this is *challenging*, mind you

12:42 <alyssa> and perf will be ok unless the app goes crazy with this stuff

12:42 <alyssa> but it's still a really steep cliff you fall off as soon as you leave the comfy confines of what the hw designers intended

12:42 <alyssa> (otoh, as far as I can tell, Mali-G610 was uniquely designed around supporting multiDrawIndirect and that's it ;P)

12:57 <alyssa> lina: "Blit command support will be added in a future driver"

12:57 <alyssa> ....uh

12:58 <alyssa> 'since it is not safe to do so'

12:58 <alyssa> and not stable

12:58 <alyssa> 'Upstreaming blockers (affect current draft UAPI):'

12:58 <alyssa> also, scratch memory

12:59 <alyssa> I don't want to block upstreaming on me figuring out a spiller so I'll probably just wire up load_scratch/store_scratch as a proof-of-concept

12:59 <alyssa> but this is absolutely required for a production driver and I don't want a UAPI stabilized without it

12:59 <alyssa> probably super easy in the kernel side, though

12:59 <alyssa> i hope

13:16 alyssa has quit [Quit: leaving]

13:17 cy8aer has quit [Remote host closed the connection]

13:17 cy8aer has joined #asahi-gpu

13:32 bisko has quit [Quit: My Mac has gone to sleep. ZZZzzz…]

14:01 bisko has joined #asahi-gpu

14:10 c10l has quit [Quit: Bye o/]

14:24 balrog has quit [Remote host closed the connection]

14:51 possiblemeatball has joined #asahi-gpu

14:56 kesslerd has joined #asahi-gpu

15:03 kesslerd has quit [Quit: Konversation terminated!]

15:16 kesslerd has joined #asahi-gpu

15:21 karolherbst_ has joined #asahi-gpu

15:23 iaguis has joined #asahi-gpu

15:25 karolherbst has quit [Ping timeout: 480 seconds]

15:26 balrog has joined #asahi-gpu

15:43 alyssa has joined #asahi-gpu

15:53 bisko has quit [Quit: My Mac has gone to sleep. ZZZzzz…]

16:04 <lina> alyssa: It's a wiki, you can just edit it~

16:10 <alyssa> woahh

16:15 <alyssa> lina: Hmm for glGenerateMipmap() i guess it shouldn't be too hard to do a single drm_asahi_submit with all the different batches that make it up

16:16 <alyssa> maybe less heavyhanded than hooking up the blit queue

16:16 <alyssa> and still get most of the benefit

16:17 <alyssa> I guess the "right" way to do it is with a background program and no draw calls

16:18 <alyssa> and if you do that, you don't need the VDM for anything which is where the blit queue comes in

16:18 <alyssa> yeah, ok, sure whatever

16:24 <alyssa> glGenerateMipmap is silly but a bunch of benchmarks have it in a hot path so v_v

16:24 <alyssa> and faster game load screens is good maybe

16:34 <mort_> Hey, I'm trying to build the kernel according to this pkgbuild: https://github.com/AsahiLinux/PKGBUILDs/blob/d66008ea33ce6547a806fd197688a9344c02f6ef/linux-asahi/PKGBUILD -- and it works, but the edge kernel it builds doesn't seem to have an asahi.ko.zst.. I don't understand why, but it seems like the config file for the edge build somehow gets

16:34 <mort_> overwritten?

16:35 <mort_> the src/linux-asahi-blah/build/edge/.config.old file ends up with the stuff from config.edge (including CONFIG_DRM_ASAHI=m), but src/linux-asahi-blah/build/edge/.config ends up without it

16:38 <jannau> mort_: no user support here, see /topic

16:38 <mort_> ah sorry, I should've read that before asking.

17:02 Bey0ndB1nary has joined #asahi-gpu

17:07 chipxxx has joined #asahi-gpu

17:07 chipxxx has quit [Remote host closed the connection]

17:08 chipxxx has joined #asahi-gpu

17:17 bluetail9 has quit [Ping timeout: 480 seconds]

17:23 maria has quit [Ping timeout: 480 seconds]

17:40 Bey0ndB1nary has quit []

18:00 Major_Biscuit has quit [Ping timeout: 480 seconds]

18:09 bluetail9 has joined #asahi-gpu

18:10 maria has joined #asahi-gpu

18:11 <alyssa> marcan: so when are you release this thing

18:18 * jannau looks on the clock and the magic eight ball answers tomorrow

18:19 <alyssa> jannau: no but i just asked so you need to double it

18:19 <alyssa> so two days from now, got it

18:21 zshrc has joined #asahi-gpu

18:35 karolherbst_ is now known as karolherbst

18:42 zshrc has quit [Ping timeout: 480 seconds]

18:46 zshrc has joined #asahi-gpu

18:52 zshrc is now known as Guest6969

18:52 zshrc has joined #asahi-gpu

18:58 Guest6969 has quit [Ping timeout: 480 seconds]

19:04 zshrc has quit [Quit: leaving]

19:31 <jannau> short test of agx/release shows no problems on a m1 ultra

19:36 bluetail94 has joined #asahi-gpu

19:43 <alyssa> Woop

19:43 bluetail9 has quit [Ping timeout: 480 seconds]

19:50 i509vcb has quit [Quit: Connection closed for inactivity]

20:00 i509vcb has joined #asahi-gpu

20:06 <jannau> pink/violet boxes/triangles during plasma/wayland login on m2, very briefly

20:13 <alyssa> Noooo

20:16 <jannau> reproducible, boxes, maybe 16x8, 1 or 2 frames

20:29 cr1901_ is now known as cr1901

20:30 <jannau> https://www.jannau.net/asahi/2023-03-07_gpu-explicit-sync-agx-release_m2.mov

20:33 <jannau> reproduces with ~50% in the first 3 login attempts after boot and seems to be very unlikely to reproduce after that (3 repeats)

20:41 <jannau> no, still reproducible later. so maybe reproducible in 10 to 15%

20:45 iaguis has quit [Quit: leaving]

20:51 <jannau> not reproducible on the m1 ultra in over 30 tries with the exact same m1n1/kernel/mesa

20:52 possiblemeatball has quit [Quit: Quit]

21:02 D-Spirits has joined #asahi-gpu

21:03 possiblemeatball has joined #asahi-gpu

22:40 bluetail94 has quit []

23:08 D-Spirits has quit [Quit: D-Spirits]

23:28 c10l has joined #asahi-gpu

23:30 amada95 has joined #asahi-gpu

23:52 kesslerd_ has joined #asahi-gpu