#asahi-dev on 2022-06-04 — irc logs at oftc.irclog.whitequark.org

2022-03-22 11:58 ChanServ changed the topic of #asahi-dev to: Asahi Linux: porting Linux to Apple Silicon macs | General development | GitHub: https://alx.sh/g | Wiki: https://alx.sh/w | Logs: https://alx.sh/l/asahi-dev

00:38 nuh^ has joined #asahi-dev

01:03 nuh^ has quit [Ping timeout: 480 seconds]

01:07 NathanielTan[m] has joined #asahi-dev

01:09 bps has joined #asahi-dev

01:20 Emantor has quit [Quit: ZNC - http://znc.in]

01:20 Emantor has joined #asahi-dev

01:49 al3xtjames has quit [Read error: Connection reset by peer]

01:49 al3xtjames has joined #asahi-dev

01:51 bps has quit [Read error: Connection reset by peer]

02:34 the_lanetly_052__ has joined #asahi-dev

02:41 the_lanetly_052___ has quit [Ping timeout: 480 seconds]

02:48 PhilippvK has joined #asahi-dev

02:51 phiologe has quit [Ping timeout: 480 seconds]

03:33 pyropeter1 has joined #asahi-dev

03:35 PyroPeter_ has quit [Ping timeout: 480 seconds]

03:55 kov has quit [Quit: Coyote finally caught me]

05:08 nicolas17 has quit [Ping timeout: 480 seconds]

05:10 jluthra has quit [Remote host closed the connection]

05:10 jluthra has joined #asahi-dev

05:10 tomf_ has joined #asahi-dev

05:21 bisko has joined #asahi-dev

05:37 bisko has quit [Read error: Connection reset by peer]

06:12 chadmed has quit [Read error: No route to host]

06:13 chadmed has joined #asahi-dev

06:46 roxfan has quit [Ping timeout: 480 seconds]

06:59 AoV has quit [Quit: The Lounge - https://thelounge.github.io]

07:13 <chadmed> is there something deeper going on in m1n1 that causes that little bench: piece of assembly to hang the system? i cant see why it would

07:13 <chadmed> but when i try to run too many iterations of it (trying to make a power virus) the system irrecoverably hangs and i have to hard reboot it

07:37 ramitgoolry[m] has joined #asahi-dev

08:07 WindowPain_ has joined #asahi-dev

08:08 <jannau> chadmed: proxy timeout? how long is the estimated run time?

08:10 <chadmed> its not actually the run time in this case, i have it set to few enough loops that it comes back successfully after running

08:10 <jannau> you could run it on a secondary core with smp_call()

08:11 <chadmed> im doing that, im following along with what was done to measure the pstate latencies

08:11 WindowPain has quit [Read error: Connection reset by peer]

08:11 <chadmed> what ive done differently is put the bench routine under a while True, which is what causes the uart timeout

08:12 <chadmed> idk why it would do this since smp_call should always return in time and just run again

08:12 <jannau> not sure what smp_call does if the previous call is still running on the same core

08:14 <jannau> should just clobber the previous return value

08:16 <chadmed> thats what i expected but the shell just respectfully waits for it to finish doing whatever its doing, which seems to be never because the routine has hung somewhere

08:16 <jannau> but the second smp_call can cause a proxy timeout if the first is still running (on the same core)

08:18 <jannau> smp_call waits just before the secondary core enters the called function

08:39 roxfan has joined #asahi-dev

09:21 bisko has joined #asahi-dev

09:26 bisko has quit []

09:37 bisko has joined #asahi-dev

09:39 bisko has quit []

09:44 bisko has joined #asahi-dev

09:46 bisko has quit []

09:49 bisko has joined #asahi-dev

10:01 bisko has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

10:25 fetsorn has joined #asahi-dev

10:32 <chadmed> seems to have been a race between smp_call and the interpreter iterating over the loop. luckily i seem to be able to get reasonably sane data by just beating the race

10:32 <chadmed> amazing what stepping away for a couple of hours and coming back well fed can do :P

10:34 fetsorn has quit [Remote host closed the connection]

10:45 <jannau> do you have a smp_wait or any other kind of synchronization with the code you're running

10:45 <jannau> if not, why wouldn't it be racy

11:05 <chadmed> i did try using the smp_call_sync() instead but it gave me the same results *shrug*

11:05 <chadmed> i might even just have a dodgy usb cable who knows

11:10 <jannau> smp_call_sync waits for the result and will timeout if the bench code takes more than 3 seconds

11:12 <jannau> the same might happen for the second smp_call on the same core. it will wait until the first call has finished

11:18 norb has joined #asahi-dev

11:18 <chadmed> it was the thunderbolt port on my dock, i started getting uart checksum errors too :/

11:19 <chadmed> plugged into a thunderbolt port on the actual host machine and all is well now

11:20 <norb> Hi devs, first of all a BIG THANKS. That was an incredibly smooth installation on my M1 Pro. I would like to know if I can help with something. I have been a Debian Developer for ~20 years, now running Arch as my main desktop (M1 Pro and desktop), I normally build my own kernels, and I'm not scared of playing around with sysinternals and testing wild stuff. Any pointer is welcome!

11:22 <marcan> https://github.com/AsahiLinux/m1n1/pull/194 \o/

11:22 <marcan> hypervisor now has a gdbstub :)

11:22 <marcan> that should be fun for Linux kernel debugging

11:24 <marcan> chadmed: smp_call without waiting for completion is not supposed to work, no

11:24 <marcan> you need to use smp_wait or smp_call_sync

11:24 <marcan> but yeah, sounds like your underlying issue was something else too
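
The synchronization point made above (every asynchronous call needs a matching wait before the next call is issued on the same core) can be illustrated with a small self-contained sketch. This is a toy model built on Python threads, not m1n1 code: `ToyCore`, `bench`, and `run_bench_loop` are hypothetical names, and the toy raises an error on a second call to a busy core to make the misuse visible, whereas the log describes the real second call as waiting and possibly timing out. The 3 second default mirrors the timeout mentioned earlier in the log.

```python
import threading


class ToyCore:
    """Toy stand-in for a secondary core with one call slot (hypothetical, not m1n1 code)."""

    def __init__(self):
        self._thread = None
        self._result = None

    def busy(self):
        return self._thread is not None and self._thread.is_alive()

    def call(self, fn, *args):
        # Start fn on the "core" and return immediately, without waiting for it.
        if self.busy():
            raise RuntimeError("previous call still running on this core")

        def runner():
            self._result = fn(*args)

        self._thread = threading.Thread(target=runner)
        self._thread.start()

    def wait(self, timeout=None):
        # Block until the outstanding call has finished, then return its result.
        if self._thread is None:
            raise RuntimeError("no call has been issued")
        self._thread.join(timeout)
        if self._thread.is_alive():
            raise TimeoutError("call did not finish in time")
        return self._result

    def call_sync(self, fn, *args, timeout=3.0):
        # Call and wait in one step.
        self.call(fn, *args)
        return self.wait(timeout)


def bench(n):
    # Placeholder workload standing in for the assembly bench routine.
    total = 0
    for i in range(n):
        total += i
    return total


def run_bench_loop(core, iterations, n):
    # Each iteration waits for completion before issuing the next call.
    results = []
    for _ in range(iterations):
        core.call(bench, n)
        results.append(core.wait())
    return results
```

Dropping the `core.wait()` line from `run_bench_loop` reproduces the racy pattern: whether the next `call` finds the core busy then depends on timing.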

11:25 <chadmed> yeah i figured, as i said though turned out the weird errors were due to a dodgy port on the host end, all cleared up now

11:26 <chadmed> new fun challenge: a single core uses such little power in most of the pstates that its basically just noise to the SMC

11:26 <marcan> norb: if you're still a debian project member, I'd love to see official work on that end for integrating with these machines :) (even if just discussion at this stage)

11:27 <marcan> we have some folks doing unofficial debian images but nothing official that I'm aware of

11:27 <marcan> chadmed: unsurprising tbh :)

11:27 <norb> I'm not doing Debian stuff anymore, have moved completely to arch. I can help with anything Debian related, but don't use it myself anymore

11:27 <marcan> also reminds me someone needs to write hwmon, it's on my endless pile but that's the kind of thing anyone can pick up given the smc scaffolding already in place

11:28 <marcan> maybe there should be an official-ish list of "things for folks to pick up"

11:28 <chadmed> marcan: i cant say im shocked either, just trying to think of ways to get values for opp-microwatt without performing invasive surgery on a motherboard

11:29 <marcan> AIUI the SoC has internal power meters, which are probably readable through PMP if not directly

11:29 <marcan> those should give you better data than SMC

11:30 <marcan> and measure in accumulated joules I believe?

11:30 <marcan> actually hold on, CLPC keeps complaining and I don't *think* that hits PMP

11:30 <marcan> so maybe those are raw

11:30 <marcan> chadmed: want me to take a quick look at that and see if I can find it quickly?

11:32 <marcan> clpc node is under pmgr so yeah, must be raw

11:32 <chadmed> if you want or have time, if its going to be an effort though dont worry too much about it

11:32 <marcan> let me give it 30 mins or something

11:32 <chadmed> ack
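
If the SoC counters do report accumulated energy, an average power figure for a given pstate (the kind of number `opp-microwatt` wants) would come from the difference of two counter reads divided by the elapsed time. A minimal sketch, assuming a monotonically increasing counter with no wraparound between reads; `average_power_uw` and `joules_per_count` are hypothetical, and the log states that the real counter units are not known.

```python
def average_power_uw(count_start, count_end, t_start_s, t_end_s, joules_per_count):
    """Average power in microwatts between two reads of an accumulating energy counter.

    joules_per_count is a hypothetical scale factor; the real units are unknown.
    """
    elapsed_s = t_end_s - t_start_s
    if elapsed_s <= 0:
        raise ValueError("end time must be after start time")
    energy_j = (count_end - count_start) * joules_per_count
    return round(energy_j / elapsed_s * 1e6)
```

Running the workload for longer between the two reads averages out noise, which is the advantage of an accumulating counter over instantaneous power samples.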

12:17 al3xtjames3 has joined #asahi-dev

12:21 al3xtjames has quit [Ping timeout: 480 seconds]

12:21 al3xtjames3 is now known as al3xtjames

12:43 chadmed has quit [Ping timeout: 480 seconds]

12:52 chadmed has joined #asahi-dev

13:01 <marcan> chadmed: https://mrcn.st/p/vIb0vP7u

13:02 <marcan> not sure how much of that init is required, but some of it seems to be to get good numbers for the energy counter, I think?

13:02 <marcan> this is for t8103/mac mini

13:02 <marcan> not sure what the units are either

13:02 <marcan> also the pcore/ecore units seem to be different for some reason

13:06 <marcan> interestingly the energy counters seem to be the same for all the boost states of the pcores, in fact it peaks a bit higher at 2988 MHz

13:06 <marcan> which makes sense, since those boost states all use the same voltage AIUI

13:07 <marcan> so that means the dynamic power per clock is the same, and due to static power, higher clocks actually save (a tiny bit of) energy
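
The reasoning above can be checked with a simple first-order model: at a fixed voltage the dynamic energy per clock cycle is roughly C·V², independent of frequency, while static power is paid for as long as the work takes. A minimal sketch with made-up numbers; `c_eff`, `p_static_w`, the voltage, and the 3204 MHz comparison point are illustrative assumptions, not measurements, and only 2988 MHz comes from the log.

```python
def energy_for_work(cycles, freq_hz, volts, c_eff, p_static_w):
    """Energy in joules to run `cycles` clock cycles at a fixed frequency and voltage."""
    runtime_s = cycles / freq_hz
    dynamic_j = c_eff * volts ** 2 * cycles  # C*V^2 per cycle, independent of frequency
    static_j = p_static_w * runtime_s        # leakage is paid for the whole runtime
    return dynamic_j + static_j


# Same work, same voltage, two clock rates (all parameter values are illustrative).
WORK_CYCLES = 3e9
e_low_clock = energy_for_work(WORK_CYCLES, 2.988e9, 1.0, 1e-9, 0.1)
e_high_clock = energy_for_work(WORK_CYCLES, 3.204e9, 1.0, 1e-9, 0.1)
```

In this model `e_high_clock` comes out slightly smaller than `e_low_clock`: the dynamic term is identical, and the faster clock finishes sooner, so less static energy accumulates.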

13:07 <marcan> not sure if all those magic constants are the same for everyone or calibration stuff calculated off of the values read or what

13:11 <marcan> also some counters aren't working yet

13:13 <chadmed> the one thing that strikes me as odd is the seemingly different units between the pcores and ecores

13:17 <marcan> yeah

13:17 <marcan> I might have the init wrong

13:18 <marcan> let me look closer

13:20 <chadmed> arent the clusters at 0x210e00000/0x211e00000?

13:22 <marcan> cluster globals yes

13:23 ___nick___ has joined #asahi-dev

13:24 <chadmed> oh yeah didnt notice the change in set_pstate()

13:24 ___nick___ has quit []

13:25 ___nick___ has joined #asahi-dev

13:28 <marcan> chadmed: some of the init constants do seem different, also those regs are arrays of pstates (8 bits each) and they are preceded in the full HV log (which I did not paste verbatim) by reads of 8 regs that seem to contain pstate info for 8 pstates

13:28 <marcan> so they're reading those and computing transposed arrays of that info

13:28 <marcan> and there are significant differences between pcores and ecores so it seems plausible that the units differ

13:28 <marcan> will look in more detail later
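
The "transposed arrays" step described above can be sketched as follows. This assumes a layout the log does not confirm: eight 64-bit source registers, one per pstate, each holding eight 8-bit fields, rearranged into eight destination values, one per field, with one byte per pstate. All function names here are hypothetical.

```python
def unpack_bytes(reg):
    """Split a 64-bit value into eight 8-bit fields, least significant byte first."""
    return [(reg >> (8 * i)) & 0xFF for i in range(8)]


def pack_bytes(fields):
    """Pack up to eight 8-bit fields into one value, least significant byte first."""
    value = 0
    for i, field in enumerate(fields):
        value |= (field & 0xFF) << (8 * i)
    return value


def transpose_regs(per_pstate_regs):
    """Turn one register per pstate into one register per field.

    Byte p of output register f is field f of input register p.
    """
    rows = [unpack_bytes(reg) for reg in per_pstate_regs]
    return [pack_bytes(column) for column in zip(*rows)]
```

Applying `transpose_regs` twice to eight registers returns the original values, which is a quick sanity check on the byte ordering.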

13:35 <chadmed> cool, ill spend some time on it tomorrow too. i was just trying some ideas out while i had some free time before exams so dont feel obligated to go too deep, im sure there are bigger fish to fry :)

14:11 <jannau> was there a change in the hypervisor that could make linux boot or the vuart very slow?

14:12 <jannau> almost slow enough to see single character updates in the earlycon up to the linux mem init at least

14:21 the_lanetly_052___ has joined #asahi-dev

14:27 the_lanetly_052__ has quit [Ping timeout: 480 seconds]

14:34 roxfan has quit [Ping timeout: 480 seconds]

14:45 dbancroft[m] has joined #asahi-dev

15:49 povik has quit [Ping timeout: 480 seconds]

15:56 <marcan> jannau: if you have updated recently, the gdbstub was merged, which touched a bunch of stuff. there was also the spinlock thing and a fix for that.

15:56 <marcan> (prior)

15:56 <marcan> and related changes

15:56 <marcan> nothing specific to the vuart but those things touched the general exception entry stuff

15:59 <jannau> I think it was before I rebased onto the gdbserver merge

16:03 <jannau> I was previously on "display: Report time spent modesetting". I'll check if it's a "regression"

16:03 povik has joined #asahi-dev

16:26 roxfan has joined #asahi-dev

18:09 <jannau> any idea how the macOS selected timing mode could persist in dcp over reboot and poweroff (macbook pro 14")?

18:12 <sven> nvram setting + DCP firmware patching from iboot maybe

18:13 <jannau> I don't think patching is necessary, iboot probably just configures it and it persists after that

18:14 <jannau> the display is initialized

18:15 * jannau is just surprised that the timing mode is saved in nvram

18:15 <sven> oh, true

18:16 <jannau> and a little annoyed since modesets crash dcp on the macbook but not on other devices

20:07 ___nick___ has quit [Ping timeout: 480 seconds]

20:50 jeffmiw has joined #asahi-dev

20:54 fetsorn has joined #asahi-dev

20:55 <jeffmiw> 11:28 <marcan> maybe there should be an official-ish list of "things for folks to pick up" <= I very much like this idea, I think it can really help the project

20:57 fetsorn has quit [Remote host closed the connection]

20:58 kloenk has quit [Remote host closed the connection]

20:59 <jeffmiw> marcan: I will be happy to look at hwmon and get something going. I need to catch up on SMC. I suppose the best is to start from bits/110-smc? any suggestion/direction is more than welcome :)

21:00 <j`ey> jeffmiw: just put commits on top of the asahi branch

21:00 <j`ey> jeffmiw: also look at drivers/power/supply/macsmc_power.c, you can see how to write a smc driver from that

21:12 <jannau> sigh, something is broken. I now had a really glacial startup from disabling the MMU in m1n1. I recorded 2 min and 14 seconds

21:12 <jeffmiw> thanks j`ey !

21:12 kloenk has joined #asahi-dev

21:12 <jannau> and I only grabbed the phone and started recording after I saw it was slow

21:13 <j`ey> is this on ultra?

21:13 <povik> could you bisect it to some m1n1 change?

21:13 <jannau> it was not that slow earlier. no on the macbook pro 14"

21:13 <povik> ah

21:13 <marcan> jeffmiw: I have a big TXT of SMC key stuff, let me throw it somewhere

21:14 <jeffmiw> marcan: cool

21:14 <jeffmiw> how did you extract it? from tracing macos under hv I suppose?

21:14 <marcan> https://hub.marcan.st/t/smc-key-info.txt just scattered notes

21:15 <marcan> guesswork, apple schematic references, experimentation

21:15 <jannau> on the bright side: modesets (just changing the refresh rate) seem to work now on the macbook pro

21:15 <marcan> very little is from macos/fieldtest, they have few actual references in the software

21:16 <marcan> you can look at the old applesmc driver for stuff, fan/etc things should be the same I think?

21:17 <jannau> no, :( - it failed just earlier so simpledrm kept working

21:18 <jeffmiw> marcan: ok will do. Thanks.

21:21 <sven> if someone makes that list: “irq support for the i2c driver” is a nice and small task as well

21:21 <povik> ha was thinking about that too when the list was mentioned :)

21:22 <sven> :-)

21:22 <sven> I’m happy to help whoever wants to pick that up. it’s probably also a good task for anyone who wants to get started with kernel development

21:23 fetsorn has joined #asahi-dev

21:26 fetsorn has quit []

21:36 <jeffmiw> just created this https://github.com/AsahiLinux/docs/wiki/List-of-%22things-for-folks-to-pick-up%22, then I realized that maybe git issues could be more appropriate. Any opinions?

21:39 <jeffmiw> marcan, sven: I added contact names (yours, and mine, in this case respectively for hwmon & i2c irq) so people know who to reach, feel free to remove if you don't like it

21:40 <sven> works for me

21:40 nicolas17 has joined #asahi-dev

21:41 <jeffmiw> if and once everyone is ok with this proposal, I'll add it to the developer section of the table of contents

21:43 <povik> i think you should just go ahead, in worst case it will get corrected later

21:50 * povik expanded on the i2c task

21:53 <jeffmiw> added to the ToC/SideBar then

21:57 fetsorn has joined #asahi-dev

21:59 fetsorn has quit [Remote host closed the connection]

22:18 psykose has quit [Remote host closed the connection]

22:19 psykose has joined #asahi-dev

23:06 nicolas17 has quit [Quit: Konversation terminated!]

23:18 <amarioguy> sven: i can probably pick up the i2c stuff

23:19 <amarioguy> unless someone else is already working on it

23:25 yuyichao has joined #asahi-dev

23:31 ModR\M has joined #asahi-dev