#asahi-dev on 2022-02-15 — irc logs at oftc.irclog.whitequark.org

2021-07-26 22:57 ChanServ changed the topic of #asahi-dev to: Asahi Linux: porting Linux to Apple Silicon macs | General development | GitHub: https://alx.sh/g | Wiki: https://alx.sh/w | Logs: https://alx.sh/l/asahi-dev

00:41 user982492 has joined #asahi-dev

01:33 phire_ has joined #asahi-dev

01:33 phire is now known as Guest407

01:33 phire_ is now known as phire

01:36 Guest407 has quit [Ping timeout: 480 seconds]

02:55 yuyichao has quit [Ping timeout: 480 seconds]

03:00 skipwich has quit [Quit: DISCONNECT]

03:01 skipwich has joined #asahi-dev

03:31 PhilippvK has joined #asahi-dev

03:34 phiologe has quit [Ping timeout: 480 seconds]

03:43 kov has quit [Quit: Coyote finally caught me]

03:48 kov has joined #asahi-dev

05:00 c10l3 has quit []

05:00 c10l3 has joined #asahi-dev

05:49 the_lanetly_052___ has joined #asahi-dev

06:42 Bey0ndB1nary has joined #asahi-dev

07:45 Bey0ndB1nary has quit []

08:00 MajorBiscuit has joined #asahi-dev

08:18 Bey0ndB1nary has joined #asahi-dev

08:18 Bey0ndB1nary has quit []

08:47 user982492 has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

08:55 gladiac has quit [Quit: k thx bye]

08:56 gladiac has joined #asahi-dev

09:24 Major_Biscuit has joined #asahi-dev

09:26 MajorBiscuit has quit [Ping timeout: 480 seconds]

09:32 user982492 has joined #asahi-dev

09:32 user982492 has quit []

10:27 n1c has joined #asahi-dev

10:34 n1c has quit [Quit: ZNC 1.8.2+deb1+focal2 - https://znc.in]

10:43 n1c has joined #asahi-dev

11:34 the_lanetly_052___ has quit [Remote host closed the connection]

11:34 the_lanetly_052___ has joined #asahi-dev

11:48 the_lanetly_052___ has quit [Ping timeout: 480 seconds]

12:20 yuyichao has joined #asahi-dev

13:40 Major_Biscuit has quit [Ping timeout: 480 seconds]

13:41 Major_Biscuit has joined #asahi-dev

13:49 refi64 has quit [Remote host closed the connection]

13:50 alyssa has joined #asahi-dev

13:50 refi64 has joined #asahi-dev

14:11 riker77 has quit [Quit: Quitting IRC - gone for good...]

14:29 riker77 has joined #asahi-dev

14:34 riker77 has quit [Quit: Quitting IRC - gone for good...]

14:35 MajorBiscuit has joined #asahi-dev

14:37 yuyichao has quit [Ping timeout: 480 seconds]

14:37 Major_Biscuit has quit [Ping timeout: 480 seconds]

15:01 yuyichao has joined #asahi-dev

15:23 riker77 has joined #asahi-dev

15:38 <marcan> ok, too tired to do more work today, I'm going to sleep early for once. pushed things as is to spmi/work.

15:39 <marcan> t6000 should work; t8103 needs devicetree stuff (the nvmem/spmi bits)

15:39 <j`ey> marcan: yes get some rest!

15:39 <marcan> thanks :)

15:40 <j`ey> marcan: i guess the t8103 stuff is pretty simple, if copied mostly from the previous attempt..

15:41 <marcan> yeah, mostly

15:41 <marcan> just needs the nvmem offsets, which I think I pasted here previously

15:42 <marcan> also the RTC stuff is, of course, >12.x only

15:42 <j`ey> oh right.. that might be the push I need to get off 11.x :P

16:14 axboe has joined #asahi-dev

16:14 <axboe> marcan: tested the new branch, and fwiw I see the same issue as the old one

16:15 <axboe> it works for the E cores, and first cluster of the P cores

16:15 <axboe> second cluster is stuck at whatever the boot frequency is

16:15 <axboe> second cluster also has cpu_capacity == 0 rather than 1024

16:15 <axboe> I feel like this is something silly, but it eluded me yesterday

16:18 <j`ey> does it try to change it? ie does it even get to apple_soc_cpufreq_set_target for the 2nd cluster

16:21 <axboe> I poked a bit yesterday, and seems like it was using the wrong clk base for pcluster2, it was the pcluster1 address

16:21 <axboe> which again made little sense, I saw all three get registered

16:22 <axboe> and the fact that capacity isn't set either just made me assume that something is wrong with the setup of that second pcluster

16:23 <axboe> new one does seem to set the right clock if I just force performance as the governor

16:23 <axboe> so might be as simple as schedutil being borken with capacity == 0

16:23 <axboe> given that the other clusters have it set right

16:27 <jannau> the readq_poll_timeout in apple_soc_cpufreq_set_target needs to be atomic

16:29 <axboe> in the devicetree in sysfs, capacity looks correct for all CPUs

16:29 <jannau> otherwise it appears to work as expected on an m1 max with all cores

16:32 <axboe> jannau: curious on the m1 max, do you get the cpu capacity set for both p clusters?

16:34 <jannau> axboe: yes, 456 for the two e-cores and 1024 for all 8 p-cores

16:34 <axboe> jannau: hmm

16:35 <axboe> 456 here for E cluster, 1024 for first P cluster, 0 for second P cluster

16:38 <jannau> I suspect we need to remove the disabled core also in cpu-map in the devicetree

16:38 <axboe> did try that yesterday but didn't make a difference

16:39 <axboe> let me try on the new base, just in case I messed it up

16:41 <jannau> reproduced the missing capacity by skipping cpu 5 and 9

16:42 <axboe> booting now with those 2 removed

16:42 <axboe> I think m1n1 needs a fixup for that

16:43 <axboe> now it finds last cpu in p cluster 1 "not alive", and then hits a MPIDR mismatch on first cpu in cluster 2

16:46 <jannau> yes, I'm working on it

17:11 <jannau> axboe: https://github.com/jannau/m1n1/commit/55b48fd6f266815ed0d31c82cd1bce3cf14dd433 works for me with a patch to skip cores

17:12 <axboe> jannau: I can give it a whirl

17:12 <axboe> jannau: with or without editing the dtsi to remove the non-existent cores?

17:14 <jannau> axboe: should work with an unmodified dtsi

17:14 <axboe> jannau: ok, will try

17:16 <kettenis> to what extent does m1n1 trim the device tree?

17:18 <j`ey> marcan: spmi/work Kconfig has MFD_APPLE_SPMI_PMU, which seems to be leftover

17:20 <axboe> jannau: gives me a MPIDR mismatch

17:20 <axboe> DT CPU 1 MPIDR mismatch; 0x1 ~= 0x10100
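
[Editor's sketch, not part of the log: the two values in the mismatch message can be split into the architectural MPIDR_EL1 affinity fields (Aff0 in bits 7:0, Aff1 in bits 15:8, Aff2 in bits 23:16). The helper name `decode_mpidr` is hypothetical, and reading Aff1 as the cluster and Aff0 as the core within it is an assumption about this SoC rather than something stated in the log.]

```python
def decode_mpidr(mpidr: int) -> dict:
    """Split an MPIDR_EL1 value into its lower three affinity fields."""
    return {
        "aff0": mpidr & 0xFF,          # bits [7:0]
        "aff1": (mpidr >> 8) & 0xFF,   # bits [15:8]
        "aff2": (mpidr >> 16) & 0xFF,  # bits [23:16]
    }


# The two values from the mismatch message above.
expected = decode_mpidr(0x1)       # aff0=1, aff1=0, aff2=0
actual = decode_mpidr(0x10100)     # aff0=0, aff1=1, aff2=1

print(expected)
print(actual)
```

[Under that assumption, the devicetree entry describes core 1 of the first cluster, while the CPU that actually came up reports core 0 of a different cluster, which would fit an index that is off by one cluster boundary.]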

17:21 <jannau> kettenis: quite bit if nodes are missing from the ADT, required by the hypervisor

17:22 <jannau> I'm only aware of the missing cpu cores for direct boot

17:23 <kettenis> I mean the FDT not the ADT

17:25 <jannau> it removes nodes from the FDT based on the ADT

17:25 <kettenis> right, and it should do so for the cpu nodes as well

17:26 user982492 has joined #asahi-dev

17:29 <jannau> it does, it doesn't however remove disabled cores from the cpu-map added for cpufreq driver

17:32 <axboe> if I just hack smp.c to keep a separate cpu index rather than use i it works for me, on top of plain m1n1

17:32 <sven> jannau: it should’ve been clairvoyant and done that already ;)

17:33 <axboe> fixes capacity, and cpufreq then works on 2nd pcluster too

17:37 <jannau> axboe: which "cpu-id"s do you have in the apple device tree? 0-7 or 0-9 with gaps

17:37 <jannau> m1n1 prints "Starting CPU x ..."

17:38 <axboe> jannau: this is with the dtsi trimmed, didn't try without for that hack

17:38 <axboe> jannau: so e(0 1) p(0 1 2) p(0 1 2)

17:39 <axboe> jannau: I tried your patch with stock dtsi

17:44 MajorBiscuit has quit [Quit: WeeChat 3.4]

17:48 <jannau> axboe: we have to solve this in m1n1 as we have to use a single dtb for M1 Pro machines with 8 and 10 cpu cores

17:49 <axboe> jannau: yep

17:50 <axboe> https://pastebin.com/cT08t1PT

17:50 <axboe> for reference, this is the m1n1 hack

17:50 <axboe> haven't tried it with un-edited dtsi

17:51 <axboe> the sparseness ends up being an issue for the spin_table

17:51 <axboe> which the hack works around

17:51 <axboe> so might work with stock dtb

17:56 <jannau> I thought you tried with the unmodified dtb. I use https://paste.debian.net/1231019/ to simulate the sparseness and that works with my patch and the stock dtb

17:59 <axboe> jannau: for your patch I used unmodified

17:59 <axboe> jannau: like you asked :)

18:00 <axboe> jannau: see the debug output I badly typed

18:01 <jannau> ok. I saw that but I can't explain how the change could cause that

18:04 user982492 has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

18:04 Guest119 has left #asahi-dev [WeeChat 3.3]

18:07 <sven> axboe: do you want to submit that tipd interrupt mask fix or should I do it?

18:15 <axboe> sven: either way is fine with me, just let me know what you prefer

18:16 <sven> if you already have a patch that’s working it’s probably the easiest to just submit it

18:17 <axboe> ok, will do so right now

18:18 <axboe> sven: adding an actual changelog :)

18:19 <axboe> sven: do you want a suggested-by in there?

18:19 <sven> :D

18:19 <sven> uh, sure

18:19 <axboe> credit where credit is due

18:20 <axboe> Sven Peter <sven@svenpeter.dev>

18:20 <axboe> that your preferred email?

18:20 <sven> yup

18:22 <axboe> shipped

18:22 <sven> nice :)

18:28 user982492 has joined #asahi-dev

18:29 user982492 has quit []

18:33 <axboe> I'm all for flushing out patches, hate carrying them

18:33 <alyssa> hear hear

18:33 <j`ey> so with the tps issue, it was a DEFER_PROBE thing or?

18:33 <alyssa> oh that tps

18:33 <axboe> https://twitter.com/axboe/status/1493646629953589248

18:33 <axboe> I know you guys all know this, but I'm still blown away by the perf here. game changing imho

18:35 <j`ey> axboe: we're a bit late on the carrying patches part :P

18:37 <sven> yeah… we’re carrying way too many patches right now.

18:39 <axboe> j`ey: I know! but can always get done ;)

18:40 <j`ey> sven: 157 patches now

18:40 <sven> :(

18:40 <sven> At least I’m only responsible for maybe 10

18:40 <j`ey> sven: Im hopeful!

18:45 <axboe> with a big backlog like that, useful to just try and flush out some of the independent ones. at least that makes you feel like you made progress ;)

18:47 <jannau> sven: t6000-dart could use a ping for robin

18:47 <sven> I already pinged him off-list last week or so

18:48 <sven> I guess I could also re-submit that ugly usb hack

18:49 <axboe> sven: oh, reminds me, didn't find any time to fiddle with shared tags yet

18:49 <axboe> sven: as a temporary improvement, adjusting AQ to 2 would make sense imho

18:50 <axboe> I've only once seen higher AQ depth be useful, and that was under provisioning and with discard sizes being smaller where then having a higher admin queue depth made a difference

18:51 <axboe> sven: speaking of patches that could be pushed out... ;)

18:51 <axboe> sven: upstream would look kindly at the apple-nvme driver, as it can be combined with a "remove various weird apple workarounds or quirks" in the generic one

18:52 <sven> oh, I agree. But nvme requires this rtkit protocol that’s also required for smc. I’m just waiting for marcan to finish that part

18:52 <axboe> ah ok, dependencies

18:52 <sven> apple-nvme right now only works for platform devices fwiw

18:52 <axboe> ok, so we can't drop the generic quirks

18:53 <axboe> (not a huge issue imho, it's not very invasive)

18:54 <axboe> it's more of an hch'ism

18:54 <sven> :-)

18:54 <axboe> as long as they stay out of the fast path, then I don't really care

18:55 <sven> I considered adding a pcie backend to apple nvme but that driver is imho already big enough for the initial submission

18:55 <j`ey> .. did I just realise that the first 'h' stands for Herr?

18:56 <axboe> sven: I think the platform one is a good start

18:58 <axboe> j`ey: if we ever end up at the same conference, I'll tell you the story of the time hch took my kids to watch the lego movie

18:59 <j`ey> haha ok :D

19:09 <sven> yeah, it’s also been tested by a few people for at least weeks (and sometimes even months) now without any major issues

19:10 <jannau> marcan: spmi t8103 dts and a small fix for the cpufreq driver https://github.com/jannau/linux/commits/spmi/work

19:10 <sven> just waiting for marcan and smc

19:11 <j`ey> and smc is mostly about the bindings

19:11 <jannau> working poweroff is nice

19:12 <sven> we’ll have to figure out how to deal with the shared rtkit dependency but I guess in the worst case we just create an immutable branch somewhere so that both trees can merge it after review

19:12 <j`ey> jannau: for the atomic thing did you actually get the sleep-during-atomic BUG, or just from manual inspection?

19:13 <jannau> I hit a sleep during atomic BUG

19:16 <kettenis> I hope the bindings we came up with will be fairly uncontroversial since they largely just use generic stuff

19:29 <axboe> sven: two subsequent boots I've run into aq timeouts

19:29 <axboe> haven't seen that before

19:29 <sven> huh, don’t think I ever saw that one either

19:29 <axboe> oh sorry, io queue

19:30 <axboe> hmm weird

19:30 <jannau> the same I saw with the old cpufreq patches?

19:31 <axboe> times out tag 10, then 2, then 3 and resets controller

19:31 <axboe> 30 seconds between each

19:31 <axboe> boots fine, log into X and start a bunch of things

19:32 <sven> guess I jinxed it by claiming no one had any major issues ;)

19:33 <sven> so it doesn’t even completion poll the commands but they actually time out?

19:34 <axboe> it does

19:34 <axboe> hang on, took a pic

19:34 <axboe> completion polled for all 3, on last one it resets the controller, reset fails, IO errors ensue

19:34 <axboe> let me know if you want to see the pic

19:36 <sven> sure

19:36 <sven> it should only reset when completion polling didn’t find that command

19:37 <axboe> https://kernel.dk/nvme.jpg

19:38 <axboe> loading up the device with just reads doesn't lock it up

19:38 <sven> looks like the reset even fails

19:39 <axboe> didn't reproduce now on starting tbird + ff

19:39 <axboe> ok it did

19:39 <sven> -62 is timeout… so I guess the controller firmware doesn’t come up anymore? Weird

19:39 <axboe> same thing again

19:40 <axboe> ok so tag 2 this time, it reset the controller

19:40 <axboe> and it did come up fine

19:40 <axboe> (after reset)

19:42 <axboe> does look like it's correlated with some IO load and CPU usage

19:42 <axboe> eg loading firefox with bunch of tabs

19:46 <sven> looks like I can’t reproduce it on the M1. Let me try with the cpufreq driver

19:55 <axboe> sven: never saw it before adding that

19:55 <axboe> sven: question, what serializes writes to q->sq_db?

19:57 <sven> I don’t think I need to serialize writes there. it’s not really a queue pointer but just “please start command with tag x”

19:57 <axboe> sven: yeah I guess as long as the writel is atomic it should be fine

19:59 <sven> yeah. nvmmu_inval might need a lock though to serialize the write/read sequence

20:00 <jannau> still reproducible with setting the cpufreq governor to performance

20:01 <jannau> seeing following messages after nvme reset https://paste.debian.net/1231040/

20:02 <axboe> grabbing anv->lock around the issue does seem to fix it for me

20:03 <axboe> https://pastebin.com/PYTqnDrG

20:04 <sven> weird

20:04 <axboe> could just be timing and I'll still hit it, but seemed to be trivial to hit before and seems solid now

20:17 <sven> strange, I can reproduce it with the cpufreq driver as well

20:17 <sven> and adding that lock seems to fix it too

20:17 <sven> i just don’t understand why

20:18 timokrgr has quit [Quit: User left the chat]

20:19 <axboe> sven: trying to come up with a theory :)

20:19 <axboe> but having unserialized issue seems like asking for trouble

20:20 <axboe> though not quite sure why it needs to be serialized against irq

20:20 <axboe> how relaxed is the arm memory model?

20:20 <kettenis> very relaxed

20:20 <sven> very. that writel has a dma barrier though

20:20 <axboe> ok

20:22 <axboe> we _might_ just need an appropriate barrier so that it's visible on a different CPU

20:22 <axboe> but not sure how that's even easily doable without serialization anyway

20:22 <axboe> then again, not an ordering expert, I can always convince myself either way...

20:22 <axboe> I'd suggest just going with the lock + irq disable around the issue for now

20:22 <sven> yeah, same here.. time to watch will deacon’s io ordering talk again ;)

20:23 timokrgr has joined #asahi-dev

20:23 <axboe> sven: cuddle up with a cup of your favorite beverage and open Documentation/memory-barriers.txt ;)

20:24 <sven> I assumed that writel to MMIO should be enough. It’s dma_wmb which should order it against the previous writes to normal memory and then just a single STR

20:24 <sven> :D

20:25 <axboe> sven: my worry was between CPUs

20:25 <axboe> issue on N, irq right after on N+1 - seems far fetched though

20:27 <sven> hmmm… I wonder if there’s anything in the irq handler that could mess something up in that case

20:28 <axboe> the spin unlock is a barrier, and as irq grabs the lock too first, I think it'd be safe

20:28 <axboe> iirc you grab the lock around the whole irq handling

20:31 <sven> yeah, I’m pretty sure most of the irq handling is inside the lock

20:31 <axboe> another thing I just noticed, there seems to be excessive tcb clearing in there

20:32 <axboe> since we memset at alloc, do we really need to clear in both inval and submit?

20:32 <sven> ah, no

20:32 <sven> once should be enough

20:32 <axboe> seems we can kill both of those, and just set flags to zero

20:32 <sven> true, that’s even better

20:32 <axboe> I've got a patch for that too :)

20:33 <axboe> and now booted

20:33 <sven> i still wonder what exactly messes up the submission if it’s not inside a lock though

20:33 <axboe> sven: one idea

20:34 <axboe> you have two issues going on, one is half way through memcpy of command, other happens faster and submits the tag

20:34 <sven> the memcpys should be to two different tags though

20:34 <axboe> not sure how the nvme hw side works if first tag isn't written yet

20:34 <axboe> yep

20:35 <axboe> but only explanation I can find is that we trigger an interrupt for completion and don't find it, then it gets polled later

20:36 <sven> the hw first looks into the tcb at the specified tag, then reads the actual index into the “queue” (which is really more like an array here) and then reads the queue at that index

20:36 <sven> I just set index = tag so that I don’t have to deal with a tcb index and a queue index
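
[Editor's sketch, not part of the log: a toy model of the lookup sven describes, where the hardware is handed a tag, reads the TCB for that tag, takes an index from it, and then reads the command from the "queue" array at that index. All names here (`Tcb`, `submit`, `hw_fetch`, `DEPTH`) are hypothetical and the real TCB layout is not shown in the log; this only illustrates why setting index = tag collapses the two indices into one.]

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Tcb:
    index: int  # hypothetical field: which queue slot holds the command


DEPTH = 4
tcbs: List[Optional[Tcb]] = [None] * DEPTH
queue: List[Optional[str]] = [None] * DEPTH


def submit(tag: int, command: str) -> int:
    """Driver side: place the command and its TCB, both keyed by tag."""
    queue[tag] = command
    tcbs[tag] = Tcb(index=tag)  # index == tag, so only one number to track
    return tag                  # the value that would go to the doorbell


def hw_fetch(tag: int) -> str:
    """Hardware side: tag -> TCB -> queue index -> command."""
    tcb = tcbs[tag]
    assert tcb is not None, "doorbell rang for a tag with no TCB"
    command = queue[tcb.index]
    assert command is not None, "TCB points at an empty queue slot"
    return command


doorbell = submit(2, "read")
print(hw_fetch(doorbell))
```

[Because each tag owns its own slot in both arrays, two submissions with different tags never write the same entry in this model, which matches sven's point that the memcpys go to different tags.]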

21:01 <axboe> sven: https://pastebin.com/djdMXek9

21:01 <axboe> cleanup of irq handling

21:01 <axboe> unrelated

21:02 <axboe> no new ideas on the other front, yet

21:05 <axboe> sven: assuming you're keeping the ->enabled state for adding suspend, because I don't think it's needed right now

21:13 <sven> i think the enabled originally comes from NVMEQ_ENABLED in pci.c. i convinced myself at some point that i didn't need it but then went back a few days later because i convinced myself that i do need it and then decided to keep it for now just-in-case IIRC

21:14 <sven> and that irq handling indeed looks better :)

21:29 <axboe> yeah I don't think you need it right now, but for suspend you probably will. so I left it alone

21:29 <axboe> I'll toss you that tcb patch too in a bit

21:32 <axboe> you can just fold in or however you prefer, I don't need attribution since it's not in tree yet

21:36 <axboe> sven: https://git.kernel.dk/cgit/linux-block/log/?h=m1-test

21:36 <axboe> top 3 commits there

21:37 <axboe> (and what a mess of patches...)

21:44 <j`ey> axboe: 'force' in apple_nvme_handle_cq is unused now

21:44 <axboe> j`ey: heh, it's funny I even thought of that while doing it

21:44 <axboe> then forgot

21:45 <axboe> will fix

21:46 <j`ey> the code after /* last chance to complete any requests before nvme_cancel_request */ won't be useful if it's removed

21:47 c10l39 has joined #asahi-dev

21:48 <axboe> yeah I know, plan was to refactor around it but then I forgot the last bit

21:54 c10l3 has quit [Ping timeout: 480 seconds]

21:57 jeffmiw has quit [Ping timeout: 480 seconds]

21:58 jeffmiw has joined #asahi-dev

21:59 axboe has quit [Ping timeout: 480 seconds]

23:00 <marcan> jannau, axboe: I expected the cpu-map issue; also that fix is probably incorrect

23:00 <marcan> we actually need to renumber the nodes in cpu-map, since the code expects them to be consecutive

23:01 <marcan> the fix will only work if only core #3 is missing from each cluster
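
[Editor's sketch, not part of the log: one way to picture the renumbering marcan describes. Each cluster keeps only the CPUs that are actually present, and the surviving entries are renamed core0, core1, ... with no gaps. The function name, the dict layout, and the CPU numbering are all hypothetical; this is not m1n1's code, only an illustration of "consecutive" numbering.]

```python
from typing import Dict, List, Set


def renumber_cpu_map(cpu_map: Dict[str, List[int]],
                     present: Set[int]) -> Dict[str, Dict[str, int]]:
    """Drop absent CPUs and number the remaining cores consecutively."""
    result = {}
    for cluster, cpus in cpu_map.items():
        alive = [cpu for cpu in cpus if cpu in present]
        result[cluster] = {f"core{n}": cpu for n, cpu in enumerate(alive)}
    return result


# Hypothetical 10-CPU layout: one 2-core cluster and two 4-core clusters.
full_map = {
    "cluster0": [0, 1],
    "cluster1": [2, 3, 4, 5],
    "cluster2": [6, 7, 8, 9],
}

# Last core of each 4-core cluster missing: plain removal already
# leaves core0..core2 consecutive.
last_missing = renumber_cpu_map(full_map, set(range(10)) - {5, 9})
print(last_missing["cluster1"])

# First core of a cluster missing: without renumbering the node names
# would start at core1, so the survivors are shifted down.
first_missing = renumber_cpu_map(full_map, set(range(10)) - {2})
print(first_missing["cluster1"])
```

[The second case is the one a removal-only fix would get wrong, which is consistent with marcan's remark that it only works when the last core of each cluster is the one missing.]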