#asahi on 2021-11-18 — irc logs at oftc.irclog.whitequark.org

2021-08-23 06:15 marcan changed the topic of #asahi to: Asahi Linux: porting Linux to Apple Silicon macs | General project discussion | GitHub: https://alx.sh/g | Wiki: https://alx.sh/w | Topics: #asahi-dev #asahi-re #asahi-gpu #asahi-stream #asahi-offtopic | Keep things on topic | Logs: https://alx.sh/l/asahi

00:21 aidenfoxivey has joined #asahi

00:21 aidenfoxivey has left #asahi [#asahi]

00:22 <marcan> mps: did you follow the instructions to run step2.sh?

00:23 <marcan> radnic: it does not remove m1n1 if you boot into macos

00:23 <marcan> assuming you're using two partitions

00:23 <radnic> Right, i thought it does because I did not see it anymore in the location I curled it to originally.

00:23 <marcan> the location does not matter

00:24 <marcan> kmutil will copy m1n1 to the Preboot volume and format it a certain way

00:24 <marcan> the original .macho file is only used when you run kmutil

00:24 <radnic> hmm, well, the original .macho is not there after a reboot

00:24 <marcan> where was it originally?

00:25 <radnic> Uhhhh.. home folder of the console?

00:25 <marcan> in 1TR? that's a ramdisk

00:25 <radnic> Ah, that explains it.

00:25 <radnic> So, I am poking around, out of curiousity, as to why I get that hpm0 issue, and the funny part is that it seems intial communication is OK with the chip, it only fails after it sends the SSPS command.

00:26 <marcan> hm, that still kind of sounds like the old i2c problem :/

00:26 <radnic> I barbarically added a bunch of extra printfs.

00:26 <marcan> I mean that's how I debug this too :p

00:27 <radnic> Hah, I was spoiled and used with pretty debuggers.

00:27 <marcan> well, we have the hypervisor for that

00:27 <marcan> though the debug features are relatively bare-bones, it's more of a tracer

00:28 <marcan> you don't have the problem with hpm1, right?

00:28 <radnic> No, it's hpm0

00:28 <radnic> But they seem to be the same chip.

00:29 <marcan> and you're connected to USB port 0? does it matter if you switch to 1?

00:29 <marcan> they are separate chips

00:29 <radnic> Right, smae chip model, but different chips physically.

00:29 <marcan> yes

00:29 <radnic> It doesn't seem to matter.

00:29 <marcan> interesting

00:31 <marcan> maybe see if you can narrow down exactly where it fails, and also try adding some delays (delays aren't the solution but they might help reveal the nature of the problem)

00:32 <marcan> I assume the error still happens if you chainload.py m1n1 again?

00:32 <marcan> < radnic> Is there a simple way to update the m1n1 without thw whone tiersome boot cycle/curl/update? <- on disk no, for testing live, chainload.py

00:33 <radnic> It goes: ps6598x_command(dev, "SSPS", &data, 1, NULL, 0); and inside there it goes to read back the command and that is where it fails, it seems.

00:34 <radnic> Why would delays fix it?

00:34 <radnic> I mean... we have 1 task.

00:34 <marcan> last time we had this problem it was an issue with not waiting for the read transaction to finish properly, someone added some delays and that "fixed" it (then I found the real cause)

00:34 <marcan> if delays fix it it means we aren't waiting for something we need to wait for

00:34 <radnic> Never chainloaded, currently I can't get it to start the USB connection.

00:34 <marcan> but hpm1 works?

00:35 <marcan> so port 1 should work?

00:35 <radnic> Yes.

00:35 <radnic> Hmmm.. maybe? I understand only the Port 0 is the onew I can use for chainloading?

00:35 <marcan> no, you can use any port

00:35 <marcan> m1n1 doesn't care

00:36 <marcan> port 0 only matters if you want serial (from another mac or a custom interface) or to DFU

00:36 <radnic> Ah, stupid me, yes, that works. :)) Did not try it so far.

00:37 <radnic> Where did you get the register names/locations for I2C? Reversed?

00:37 <marcan> I'll be back later, but hopefully this helps your debugging :)

00:37 <marcan> I2C is pasemi

00:38 <marcan> same thing as in pasemi PowerPC chips

00:38 <marcan> I actually have the hardware spec for it but it's not publicly available and I can't share it, unfortunately

00:38 <marcan> but it's the same thing in i2c-pasemi in linux (which we also use)

00:38 <marcan> apple did make some changes though, I found a few new registers by poking around

00:39 <radnic> Cool, thanks! Will look into it.

00:53 <radnic> Hah, after chainload it looks like it's OK!

00:57 <radnic> It's interesting also that the I2C seems to answer with unexpected length. that is funny.

00:57 <radnic> It says wnat to read 4 bytes but the device wants to send 31

01:04 Dcow has quit [Quit: My Mac Mini has gone to sleep. ZZZzzz…]

01:10 yuyichao has quit [Ping timeout: 480 seconds]

01:59 c10l has joined #asahi

02:03 c10l has quit []

02:04 c10l has joined #asahi

02:49 yuyichao has joined #asahi

03:14 bgb has joined #asahi

03:19 bgb_ has joined #asahi

03:25 bgb has quit [Ping timeout: 480 seconds]

03:34 bgb_ has quit [Quit: WeeChat 3.3]

03:43 phiologe has joined #asahi

03:47 PhilippvK has quit [Ping timeout: 480 seconds]

04:48 marvin24 has joined #asahi

04:51 marvin24_ has quit [Ping timeout: 480 seconds]

05:09 phiologe has quit [Ping timeout: 480 seconds]

06:11 sailorek1234 has joined #asahi

06:40 chamomile has quit [Ping timeout: 480 seconds]

07:54 chamomile has joined #asahi

07:57 <sven> for which command?

07:57 <sven> (or rather for which register)

07:58 <sven> if it's for that DATA register that's expected

07:58 <sven> but i don't think we actually read from that one

08:08 chamomile has quit [Ping timeout: 480 seconds]

08:13 <mps> marcan: no, I didn't run step2.sh script. didn't know it exists and need to be run. I found it after boot to macos when looked at Linux partition and thought it was run automatically

08:14 <mps> I will try again when I finish my $day_job

08:16 easontek has joined #asahi

08:18 easontek has quit []

09:39 bps has joined #asahi

09:40 c10l has quit [Remote host closed the connection]

09:47 <marcan> mps: when the installer tells you to choose the boot volume in the picker, after you do so it tells you about step2 and the steps needed to complete the installation

09:47 <marcan> did that not show up somehow? or did the machine just reboot or shut down?

09:50 <mps> marcan: I'm not sure that I saw this, maybe I didn't read all text carefully. at the end script told that everything is finished and I rebooted it

09:50 Glanzmann has joined #asahi

09:50 <mps> in 1TR I see asahi logo with disk to select to boot from

09:50 <Glanzmann> marcan: I had the very same problem. I did only saw the notice for step2.sh after it rebooted, because the boot picker covered it up and rebooted afterwards.

09:50 <radnic> @sven. I think it was the TPS_REG_CMD1 but it was like... just the once. most of the times, it does not say that. It seems to be inconsistent. This is why I suspect the device state is not always the same when we enter m1n1.

09:51 <mps> Glanzmann: ah, this also happened to me

09:52 <sven> radnic: huh, TPS_REG_CMD1 should always only be 4 bytes

09:52 <FireFox317> radnic, i had the same problem yesterday, and i'm trying to reproduce the issue but it is indeed really inconsistent. The m1n1 installed on my macbook is kinda old, so that could be a problem too.

09:52 <marcan> mps: so when you clicked "reboot" the machine actually rebooted?

09:52 <mps> marcan: yes

09:53 <marcan> it's not supposed to do that, we do an (ugly) race to kill the boot picker before it reboots, assuming you didn't kill the installer

09:53 <sven> firefox317: marcan did fix some bug that resulted in the same issue a while ago so an old m1n1 would explain that

09:53 <marcan> radnic: where exatly does it fail?

09:54 <marcan> like with what error code/message?

09:54 <mps> marcan: I didn't 'killed' it, just selected asahi and clicked reboot

09:54 <marcan> the installer was still running in the terminal at that point?

09:55 <mps> hmm, I think so, but not 100% sure now

09:55 <marcan> if you closed it then of course it won't work

09:55 <marcan> it's possible that sometimes it loses the race, but I've never seen it myself

09:55 <marcan> it would be nice to find a way to abort the reboot more reliably...

09:56 <mps> marcan: ok, I will try again later today when I finish with my $day_job and look more carefully at the process

09:57 <marcan> sven: I wonder if the two I2C ports are not properly interlocked

09:57 <marcan> hypothesis: right now we do start-write-stop start-read-stop

09:57 <sven> ugh. that would be annoying

09:58 <FireFox317> okay, i managed to get the error message again. however its super inconsistent

09:58 <marcan> normally read txns would be more properly pipelined as start-write-repeatstart-read-stop

09:58 <sven> at least in theory the second port is supposed to use CMD2 instead of CMD2

09:58 <sven> *CMD2 instead of CMD1

09:58 <mps> (my free time is 'eaten' by preparing some fixes and upgrades for next alpine linux release which will happen in a few days)

09:58 <marcan> yes but if there is a shared register pointer, SMC could sneak in between our two transactions and change the register pointer

09:58 <radnic> I can't reproduce it at the moment to confirm it 100% but basically, it fails when it reads back the TPS_REG_CMD1. I added extra prints, so the messages are not what you would expect.

09:58 <sven> hrm, true

09:58 <FireFox317> its also only ever hpm0 that fails

09:59 <marcan> I can try changing that to do a repeatstart and see if that fixes it

09:59 <radnic> Someone else reported hpm1 failing as well.

10:00 <radnic> Not sure if insisting on this issue is clever on my part though. I mean, most people don't have the issue and I can still connect on the other port, and maybe I'm stealing useful resources from somewhere else.

10:00 <radnic> I jut think it's a fun puzzle.

10:00 <FireFox317> radnic, it fails consistently for you?

10:00 <radnic> Yes.

10:00 <radnic> Sometimes it's ok.

10:01 <radnic> chainloading leads to it being OK consistently.

10:01 <marcan> radnic: try this in i2c_smbus_read:

10:01 <marcan> - if (i2c_xfer_write(dev, addr, 1, 1, &reg, 1))

10:01 <marcan> + if (i2c_xfer_write(dev, addr, 1, 0, &reg, 1))

10:04 <marcan> (I guess you'll have to rebuild and install it...)

10:05 c10l has joined #asahi

10:19 <FireFox317> sven, with an updated m1n1 is failing much more consistently. it looks like the same issue as radnic, i'm trying marcan s fix now

10:20 palmer_ has joined #asahi

10:22 palmer_ has quit [Remote host closed the connection]

10:23 <FireFox317> marcan, with your change both hpm0 and hpm1 fail to init

10:26 <FireFox317> oh interesting, the connection with my linux box works fine, also when hpm0/1 init fails

10:37 kov has joined #asahi

10:48 Glanzmann has quit [Quit: EOF]

10:51 Dcow has joined #asahi

10:51 Dcow has quit []

11:04 palmer_ has joined #asahi

11:05 palmer_ has quit [Remote host closed the connection]

11:10 bpalmer4[m] has joined #asahi

11:11 palmer_ has joined #asahi

11:11 palmer_ has quit [Remote host closed the connection]

11:11 palmer_ has joined #asahi

11:12 palmer_ has quit [Remote host closed the connection]

11:14 <FireFox317> kode54, you can install m1n1 with u-boot or linux as a payload, and in that case m1n1 just acts as a bootloader and immediately loads the next stage.

11:18 <radnic> so I just tried it and, it seems that the error is not happening anyymore? Interesting.

11:18 <radnic> I did 3x tries.

11:18 <radnic> let me give it a couple more spins.

11:19 <radnic> Uhh, is there a way I can persuade it to enumareate consistenly as a ACM device with the same index?

11:19 <radnic> Updating the variable is annoying.

11:19 <FireFox317> radnic, latest m1n1 with the patch that marcan send? and are you on a macbook air m1 (j313)?

11:19 <radnic> Updating the variable is annoying.

11:19 <radnic> yes

11:20 <radnic> J313 is a what now?\

11:20 <FireFox317> some codename that apples uses, see https://github.com/AsahiLinux/docs/wiki/Devices

11:21 <radnic> Ah, no, I saw it again.

11:21 <radnic> 5th try, there it was.

11:22 <radnic> It's an Air3. Can I make it show anywhere the codename?

11:23 <kov> radnic, if it's an Air M1 it's j313

11:23 <FireFox317> m1n1 also prints it iirc

11:24 <radnic> Yes it does, saw it in the console. I see it as a j313 so, yep, there it is.

11:28 <radnic> Ok, so... totally stupid question, but chainloading does NOT change the default .macho -

11:28 <radnic> Scratch my previous claim, I need to update it manually again.

11:28 <FireFox317> radnic, correct.

11:30 Dcow has joined #asahi

11:37 <radnic> Sooo, interestin. now I updated the m1n1 fo' real with the fix from marcan... and... I don't see it anymore.

11:37 <radnic> about 15 tried.

11:38 <radnic> tries.

11:38 <radnic> Maybe I got really lucky.

11:38 <radnic> yesterday it was much easyer to reproduce it TBH, than now. Even without the change.

11:38 <radnic> But, well, did not see it.

11:45 Dcow has quit [Quit: My Mac Mini has gone to sleep. ZZZzzz…]

11:46 bgb has joined #asahi

11:50 <radnic> But, I am confused _why_ I checked the datasheet for the chip and it says that it kind of expects the stop byte. The one that was removed. So... how come it works then?

11:50 <sven> hm?

11:51 <radnic> I mean the change from marcan.

11:51 <radnic> <marcan> - if (i2c_xfer_write(dev, addr, 1, 1, &reg, 1))

11:51 <radnic> <marcan> + if (i2c_xfer_write(dev, addr, 1, 0, &reg, 1))

11:51 Dcow has joined #asahi

11:51 <sven> oh

11:51 <marcan> radnic: it does not.

11:51 <radnic> That removes the stop bit sending when telling it where you want to read data off the I2C device.

11:52 <radnic> No

11:52 <marcan> TPS65983B

11:52 <radnic> ?

11:52 <marcan> page 79

11:52 <marcan> Sr -> repeated start

11:52 <marcan> S Unique Address Wr A Register Number A Sr Unique Address Rd A ...

11:52 <marcan> no stop bit

11:52 <radnic> Ah, I was looking at TPS65982

11:52 <marcan> it's the same

11:52 <marcan> page 79

11:53 <radnic> Hmmm, let me find that.

11:54 <marcan> repeat start is the more common way of doing it

11:55 <radnic> Ah, you are correct. I read the documentation backward.

11:55 <radnic> Ok, then... if that is the case, why did it _ever_ work?

11:56 <kettenis_> chip caches the address from the last transaction

11:57 <radnic> But yeah, I understood the datasheet wrong, sorry marcan.

11:58 <marcan> the way this works is the chip has a pointer register

11:58 <marcan> the write sets the pointer, the read reads from the pointer

11:58 <marcan> it doesn't care if you issue a stop or not

11:59 <marcan> my hypothesis was that the chip handles i2c transactions on both i2c ports with the *same* pointer register (and of course they are interlocked, so it will clock stretch transactions on one port while another one is busy)

11:59 <marcan> a repeat start will not release the lock, but a stop will

11:59 <marcan> allowing the SMC interface to change the read pointer

11:59 <marcan> what tipped me off was that you ended up reading more bytes than expected

11:59 <marcan> which suggests you read the wrong register

11:59 <marcan> I'm about to test that hypothesis now :p

12:01 <kettenis_> even if that isn't what's happening, the fix is right ;)

12:03 kettenis_ is now known as kettenis

12:04 <FireFox317> i can just go into 1tr and use kmutil configure-boot to update m1n1 right?

12:04 <kettenis> yes

12:16 <FireFox317> With the change from marcan applied I get the following output on a Macbook Air (j313): https://paste.debian.net/1219923

12:16 <FireFox317> With only one usb c-a cable connected to my linux box

12:16 <radnic> Huh, that's _with_ the change? Straaaange.

12:18 <radnic> Even stranger you get it after chainloading.

12:18 <radnic> Never saw it after

12:18 <radnic> Consistenly or ocassionally?

12:18 <marcan> firefox317: are you sure you made the right change?

12:18 <FireFox317> marcan, i2c.c:131 should be 'if (i2c_xfer_write(dev, addr, 1, 0, &reg, 1))' right?

12:19 <marcan> correct

12:19 <marcan> well, that's weird because it definitely doesn't break for me on chainload...

12:19 <radnic> Yep, that is what I have.

12:19 <radnic> It seems it broke _both_ devices now.

12:20 <FireFox317> marcan, did you install the latest m1n1? or are you just chainloading?

12:20 <FireFox317> because it seems to matter

12:20 <marcan> chainloading

12:20 <marcan> ok

12:20 <radnic> I can confirm that.

12:21 <marcan> ok, I think I see what you mean

12:21 <radnic> I ahve extra cnahges as in... extra prints.

12:21 <radnic> *have

12:21 <radnic> But that's it.

12:23 <FireFox317> radnic, extra prints in the i2c code might add some delays and change the behavior maybe?

12:23 <radnic> Possible.

12:27 <radnic> Ok, got in the same state as you.

12:27 <radnic> I had an extra delay put in the system;. Removing that, I get the same result as you do.

12:28 <radnic> After i2c_smbus_write(dev->i2c, dev->addr, TPS_REG_CMD1, (const u8 *)cmd, 4) I had a

12:28 <radnic> mdelay(100);

12:28 <radnic> removing that, I get the same as you reported.

12:28 <radnic> leaving that, I never got it in 20x boots.

12:29 <FireFox317> with 20x boots do you mean chainloading 20 times, or rebooting manually?

12:29 <radnic> manually rebooting.

12:29 <radnic> Not chainloading.

12:29 <radnic> you can try it if it's not too big of a pain.

12:31 <marcan> I just added some debug prints and that makes it fail again

12:31 <marcan> yaaay timing issues

12:31 <marcan> er sorry, work again

12:32 <FireFox317> yeah timing issues are super annoying, and this is definitely a timing issue :(

12:33 <FireFox317> marcan, fortunately you can repro the issue now right?

12:34 <bgb> seems mini rarely fails

12:35 <marcan> ehhh I bet I know what the problem is

12:35 <marcan> while (val & PASEMI_RX_FLAG_EMPTY && --timeout)

12:35 <marcan> val = read32(dev->base + PASEMI_FIFO_RX);

12:35 <marcan> this is completely broken

12:35 <marcan> 10000 iteration timeout with no delay

12:35 <marcan> guess what, it doesn't take the CPU long to poll that register 10000 times...

12:35 <marcan> it's just timing out early

12:38 <marcan> well, not just that, there's more than one problem here :p

12:41 <daniels> if only there was some convenient kernel helper you could use to poll a value with a timeout!

12:42 <marcan> daniels: this isn't the kernel, it's m1n1

12:43 <FireFox317> that helper is even in m1n1 :p

12:43 <marcan> it isn't

12:43 <marcan> it's different

12:43 <marcan> the kernel helper is a macro, the m1n1 helper is a function that does *not* return the register value

12:43 <marcan> that's a bit of a problem when you're reading from a FIFO

12:43 <FireFox317> ahh i see, my bad

12:44 <daniels> marcan: ah, I take it back

12:46 X-Scale` has joined #asahi

12:46 <marcan> radnic: okay, that delay makes it work and I don't know why. time to stick a scope on the i2c pins

12:46 <marcan> (thankfully there is a magic USB-PD command to put those on the type C pins...)

12:51 X-Scale has quit [Ping timeout: 480 seconds]

12:52 <FireFox317> yeah the mdelay(100) also works on my machine

13:01 <radnic> Hah, accidental discoveries, got to love them.

13:13 <radnic> (thankfully there is a magic USB-PD command to put those on the type C pins...) that is soo cool, so you can see the i2c off a split cable? :))

13:14 <marcan> oh god I think I figured it out

13:14 <marcan> we're DoSing the damn chip

13:14 <marcan> the whole thing is firmware, right?

13:14 <marcan> we poll for command completion so fast it never actually completes

13:14 <marcan> it needs a delay in the command poll loop

13:15 <radnic> Is there firmware in the I2C peripherial?? That sounds... wierd.

13:16 <marcan> of course there is

13:16 <marcan> it's a USB-PD controller

13:16 <marcan> it has a huge blob of firmware

13:16 <marcan> both a ROM and a pile of patches/app code loaded from SPI

13:16 <marcan> the datasheet tells you all about that too

13:16 <radnic> No, wait, I thouthg we're DoS'ing it hereval & PASEMI_RX_FLAG_EMPTY && --timeout

13:16 <marcan> no, I mean the TPS

13:17 <marcan> the timeout thing is also broken but unrelated

13:17 <radnic> Ah, that do while.

13:18 <radnic> Ok, funnily enough, I saw this happen in another life on another chip. Makes sense.

13:18 <radnic> That's why the delay there let it pass. Managed to finish its stuff.

13:19 <marcan> yup

13:19 <marcan> just pushed a bunch of changes

13:19 <marcan> let me know if that finally fixes things :)

13:19 slicey has joined #asahi

13:20 <radnic> Isn't 10ms too much? Eh, never mind, probably doesn'

13:20 <radnic> doesn't matter*

13:21 <marcan> depends on whether firmware is involved in that path, which it probably is (at least time to first byte)

13:23 <sven> oh, ouch on that DoS

13:23 <marcan> I think the repeat-start thing made it happen more often (after I patched it to always send the command, which was my way of reproing with chainload) because that gets rid of the wait for the stop condition, so it makes the timing of the polling even tighter

13:24 <marcan> this thing has the curse of USB

13:24 <marcan> there is no way a stupid I2C device should've caused us so much grief

13:24 <marcan> but since it's USB-related, well, it makes sense

13:24 <marcan> the curse spreads

13:24 <sven> well, it's related

13:24 <sven> yup

13:25 <sven> hrm, looks like the linux driver also does the tight command completion polling loop even though there is an interrupt it could use

13:25 <marcan> heh, yeah

13:25 <marcan> so I think what actually happens is that when we DoS it, it never executes the commandc

13:25 <marcan> and then after ~1 second or so, a watchdog fires

13:25 <marcan> and the chip resets

13:26 <radnic> Uhhh.. I still get the problem. :))

13:26 <marcan> and *that* breaks the I2C bus/read/whatever

13:26 <marcan> radnic: dammit

13:26 <sven> :D

13:26 <sven> and the USB curse strikes again!

13:26 <radnic> both 0 and 1 case

13:26 <marcan> what machine is this?

13:26 <radnic> Let me make sure before I lead you down a chase

13:26 <radnic> air 13

13:26 <FireFox317> yep for me it also breaks

13:27 <FireFox317> I just reinstalled the new m1n1 on my m1 air (j313)

13:27 <marcan> wait hold on

13:27 <marcan> argh yeah it's still broken

13:27 <marcan> but maybe for a different reason?

13:28 <sven> lol

13:28 <radnic> well, yeah, because you put the delay at the end.

13:28 <radnic> It will fail after the first poll.

13:28 <radnic> And exit.

13:28 <radnic> Never retrying again?

13:29 <FireFox317> yeah it seems to timeout really fast

13:29 <marcan> okay yeah I'm an idiot

13:29 <marcan> logic's backwards

13:30 <marcan> fixed (force pushed :p)

13:30 <marcan> I made that rework last and I think I didn't test it lol

13:34 <FireFox317> It works :D

13:39 <marcan> radnic: please tell me it works :-)

13:40 <radnic> Yes, works :)

13:40 <marcan> \o/

13:40 <radnic> one question though, in here:

13:40 <radnic> do {

13:40 <radnic> return -1;

13:40 <radnic> if (cmd_status == TPS_CMD_INVALID)

13:40 <radnic> if (i2c_smbus_read32(dev->i2c, dev->addr, TPS_REG_CMD1, &cmd_status))

13:40 <radnic> return -1;

13:40 <radnic> udelay(100);

13:40 <radnic> } while (cmd_status != 0);

13:40 <radnic> if the commands fails to run ,you won't reitereate, you'll just exit.

13:40 <marcan> that's correct

13:41 <radnic> WHat is the point of the do while.

13:41 <marcan> the valid values of that read are 0 and the command code

13:41 <marcan> TPS_CMD_INVALID means the command was rejected

13:41 <marcan> so loop while it's the command code (which means it's neither zero nor INVALID)

13:42 <marcan> arguably that could be rewritten as while cmd_status == cmd or whatever instead, and then check for 0/INVALID/unknown at the end

13:45 <radnic> But I see no case when you woudl reaiterate. Either you try once and it's OK or you exit,

13:45 <radnic> *reiterate

13:46 <marcan> cmd_status will be neither INVALID nor 0 while the command is in progress

13:46 <marcan> therefore it will loop

13:46 <radnic> Ah... missed that.

13:46 dottedmag has left #asahi [#asahi]

13:47 <radnic> Makes sense. Silly me.

13:59 aleasto has joined #asahi

14:02 Dcow has quit [Quit: My Mac Mini has gone to sleep. ZZZzzz…]

14:13 sailorek1234 has quit []

14:24 Dcow has joined #asahi

14:53 slicey has quit [Quit: cya]

14:57 <FireFox317> do i have to enable a specific kernel option such that it recognizes the boot args when running under the hypervisor i.e. with run_guest.py?

14:57 <FireFox317> it doesnt seem to recocnize the boot args that i specifiy

15:09 <FireFox317> i guess i can just put them in the dts for now

15:46 phiologe has joined #asahi

15:47 sailorek1234 has joined #asahi

15:49 yuyichao has quit [Ping timeout: 480 seconds]

15:51 gladiac is now known as Guest6204

15:51 gladiac has joined #asahi

15:55 Dcow has quit [Quit: My Mac Mini has gone to sleep. ZZZzzz…]

15:56 Guest6204 has quit [Ping timeout: 480 seconds]

16:01 <j_ey> firefox317: are you using the new boot args payload?

16:03 sailorek1234 has quit []

16:03 <FireFox317> j_ey, with run_guest.py you can specify bootargs after the payload. I tried to use that. However I also looked at the m1n1 code and it looks like it only changes the bootargs in the adt, not in the fdt.

16:05 Dcow has joined #asahi

16:07 yuyichao has joined #asahi

16:11 <Dcow> how is the linux running on the m1max comparing to the m1pro?

16:12 <Dcow> is it same state except the second ANE or something else is there?

16:13 <mort_> it's literally the same CPU except for the memory bandwidth (which is already more than high enough for CPU-only stuff on the Pro)

16:13 <mort_> afaik you can't really use the GPU yet in linux

16:13 <mort_> so the main difference would be that you have 16 extra GPU cores sitting around unused

16:14 <_jannau_> the second ane doesn't exists. I doubt anyone has drivers or uses a M1 pro/max in way that the differences would be noticeable

16:15 <mini> Is the ANE the same between M1, M1 Pro and M1 Max?

16:15 <mini> it looks like it is

16:16 <mini> (I've been using an app that uses it on macOS, and there's no performance difference at all between a M1 mac mini and a M1 Max 14" MBP)

16:21 <marcan> firefox317: those are the xnu bootargs, not the linux bootargs. they are unrelated.

16:22 <marcan> _jannau_: the second ane definitely exists, and is definitely disabled for some reason

16:22 <FireFox317> marcan, yeah thanks for the clarification, i figured that out xd

16:23 Dcow has quit [Quit: My Mac Mini has gone to sleep. ZZZzzz…]

16:24 Dcow has joined #asahi

16:27 <_jannau_> that's what I ment. it is currently not useable under linux even ignoring the non-existing driver

16:37 neobrain has quit [Remote host closed the connection]

16:37 neobrain has joined #asahi

16:39 cptcobalt has quit [Read error: Connection reset by peer]

16:39 nkaretnikov has quit [Read error: Connection reset by peer]

16:39 brinly has quit [Read error: Connection reset by peer]

16:39 tom-w has quit [Remote host closed the connection]

16:39 rann has quit [Read error: Connection reset by peer]

16:39 jkkm has quit [Remote host closed the connection]

16:39 Chainsaw has quit [Remote host closed the connection]

16:39 sjg1 has quit [Read error: Connection reset by peer]

16:39 Vaughn has quit [Remote host closed the connection]

16:39 arnd_ has quit [Remote host closed the connection]

16:39 robher has quit [Remote host closed the connection]

16:39 ovf has quit [Read error: Connection reset by peer]

16:39 stblassitude has quit [Read error: Connection reset by peer]

16:39 eichin has quit [Read error: Connection reset by peer]

16:39 austriancoder_ has quit [Read error: Connection reset by peer]

16:39 steev has quit [Read error: Connection reset by peer]

16:43 Vaughn has joined #asahi

16:48 eichin has joined #asahi

16:48 sjg1 has joined #asahi

16:49 ovf has joined #asahi

16:50 stblassitude has joined #asahi

16:50 tom-w has joined #asahi

16:51 robher has joined #asahi

16:54 leah has quit [Quit: WeeChat 3.3]

16:57 yuyichao_ has joined #asahi

16:58 steev has joined #asahi

16:58 arnd_ has joined #asahi

16:58 cptcobalt has joined #asahi

16:58 austriancoder_ has joined #asahi

17:00 jkkm has joined #asahi

17:00 Chainsaw has joined #asahi

17:00 yuyichao has quit [Ping timeout: 480 seconds]

17:00 rann has joined #asahi

17:04 brinly has joined #asahi

17:07 <boardwalk> Tried to do an online resize2fs on my root partition on nvme (and on t6001) and it threw some errors: https://ersatsz.com/~boardwalk/resize2fs.jpeg

17:07 yuyichao has joined #asahi

17:07 <boardwalk> Partition was unwritable until reboot, but seems to be fine & was resized.

17:08 nkaretnikov has joined #asahi

17:10 <boardwalk> Also haven’t been able to get pcie to not report “link didn’t come up”. Increased the timeout from 1/10s to 1s from where that log is generated, and now USB comes up (wasn’t before), but I still get “link didn’t come up”. And lspci only shows the pci bridges.

17:11 yuyichao_ has quit [Ping timeout: 480 seconds]

17:24 <marcan> boardwalk: WiFi/SD require an SMC command to turn on the PCIe devices

17:28 <boardwalk> So no devices showing up is expected, gotcha.

17:30 Dcow has quit [Quit: My Mac Mini has gone to sleep. ZZZzzz…]

17:31 Dcow has joined #asahi

17:40 Dcow has quit [Quit: My Mac Mini has gone to sleep. ZZZzzz…]

17:42 Dcow has joined #asahi

18:08 <sven> Hmmm… those nvme errors look odd. I know that the normal error handling doesn’t work too well with ANS. We’re there any errors before that or is that timeout the first one?

18:09 <sven> The unwritable until reboot is because I need to fix the error recovery paths

18:10 <sven> AFAIK ANS just doesn’t support the nvme abort command but that would be opcode 8 and not 0

18:12 <sven> Wait.. that is a 8 in the “invalid opCode” message

18:12 <sven> so that makes sense.

18:12 <boardwalk> Yeah, sorry for the fuzzy shot, heh.

18:13 <boardwalk> I didn’t notice any other errors before doing the resize2fs. I’ll double check.

18:14 <sven> Makes more sense now. Some command timed out and then it dies because the normal error recovery just doesn’t work

18:18 <sven> Which tree is that? t6000-bringup?

18:19 PhilippvK has joined #asahi

18:21 <sven> hrm… it also looks like it tried to send the abort command to the io queue?!

18:23 <boardwalk> t6000-bringup-work. And I don’t see any other errors in the logs (I have everything up to ‘resizing…’ in the journal, which is in that pic).

18:23 phiologe has quit [Ping timeout: 480 seconds]

18:24 <sven> Ah, yes, so maybe ans does support the normal abort command after all. I just accidentally send them to the IO instead of the admin queue. Whoops.

18:25 <sven> Or so I… ugh… it’s been too long since I looked at that code

18:25 <sven> *or do I

18:28 <sven> Ah, nope, i was just confused.

18:29 <sven> so I think this fails so badly because some command times out and then the error recovery path just doesn’t work correctly for ANS

18:47 bgb_ has joined #asahi

18:53 bgb has quit [Ping timeout: 480 seconds]

19:07 X-Scale has joined #asahi

19:10 [X-Scale] has joined #asahi

19:12 X-Scale` has quit [Ping timeout: 480 seconds]

19:16 X-Scale has quit [Ping timeout: 480 seconds]

19:18 chamomile has joined #asahi

19:42 Dcow has quit [Quit: My Mac Mini has gone to sleep. ZZZzzz…]

20:11 ___nick___ has joined #asahi

20:13 ___nick___ has quit []

20:14 ___nick___ has joined #asahi

20:27 aleasto has quit [Remote host closed the connection]

20:50 helltraum has joined #asahi

20:51 helltraum has quit [Remote host closed the connection]

20:54 helltraum has joined #asahi

20:55 helltraum has quit [Remote host closed the connection]

20:58 helltraum has joined #asahi

20:59 helltraum has quit [Remote host closed the connection]

21:03 <boardwalk> sven: I can try and track down what times out (not sure how much of an exception circumstance it is, if you have any debugging tips would be helpful). Would implementing the recovery path difficult? (i.e. Would it be reasonable for a kernel newb to try or is it not done for a good reason?)

21:03 <sven> it's not done because i've been too lazy :D

21:04 <sven> that nvme code needs quite some cleanup anyway

21:05 <sven> so the task is more "clean up nvme and while doing that do error recovery correctly"

21:08 ___nick___ has quit [Ping timeout: 480 seconds]

21:11 PhilippvK has quit [Quit: No Ping reply in 180 seconds.]

21:12 phiologe has joined #asahi

21:41 torstenvl has joined #asahi

22:07 palmer_ has joined #asahi

22:07 palmer_ has quit [Remote host closed the connection]

22:12 torstenv_ has joined #asahi

22:17 torstenvl has quit [Read error: No route to host]

22:24 torstenv_ has quit [Ping timeout: 480 seconds]

22:49 radnic is now known as RealityVoid

23:40 <RealityVoid> So, I'm tryin gto build the kernel and when I try to run the build... config restarts for some reason.

23:40 <RealityVoid> Is that normal?

23:41 <RealityVoid> Doing something wrong?

23:44 <j_ey> did you run the config and build lines with the same parameters?

23:44 <j_ey> didnt accidentally leave off 'ARCH' for example

23:45 <RealityVoid> I just ran this:

23:45 <RealityVoid> make ARCH=arm64 CROSS_COMPILE=aarch64-linux-gnu- oldconfig

23:45 <RealityVoid> and the .config is the one from thewiki page, I placed it in the root of the linux folder

23:46 <j_ey> but it builds afterwards right?

23:47 <RealityVoid> Well, did not let it do that, because it asked me a bunch of configuration questions. Did not seem right.

23:47 <j_ey> should be fine

23:47 <RealityVoid> Just enter enter enter?

23:48 <RealityVoid> Naah, not building, besides, at the end it says that .config was overwritten.

23:49 <j_ey> are you running make ARCH=arm64 CROSS_COMPILE=aarch64-linux-gnu- Image.gz?

23:50 <j_ey> or just make ARCH=arm64 CROSS_COMPILE=aarch64-linux-gnu-, or only the one with 'oldconfig'?

23:50 <RealityVoid> I tried both of these:make ARCH=arm64 CROSS_COMPILE=aarch64-linux-gnu- oldconfig

23:50 <RealityVoid> make ARCH=arm64 CROSS_COMPILE=aarch64-linux-gnu- -j8 Image dtbs

23:50 <RealityVoid> but bpth overwrite the config.

23:50 <RealityVoid> *both

23:53 <RealityVoid> I get: https://gist.github.com/RealityVoid/1fa73486496d392a6eb55ad9745b0a47

23:55 <RealityVoid> When building.

23:56 <steev> fatal error: openssl/bio.h

23:56 <steev> you're missing ssl headers

23:57 <steev> if you're on a debian system, that's libssl-dev

23:57 <RealityVoid> Found it. Thanks.

23:58 <RealityVoid> Yes, stupid of me, sorry.

23:58 <steev> it's all good, not everyone is used to cross compiling :)

23:59 <steev> or kernel compiling in general