#aarch64-laptops on 2023-02-28 — irc logs at oftc.irclog.whitequark.org

2022-12-21 00:44 ChanServ changed the topic of #aarch64-laptops to: Linux support for AArch64 Laptops (Asus NovaGo TP370QL - HP Envy x2 - Lenovo Miix 630 - Lenovo Yoga C630)

00:18 <steev> maybe the picture is bad and my eyes just see what they want :P

00:26 echanude has joined #aarch64-laptops

01:06 <HdkR> bamse: Tricky, I'll give that steam guard path a retry and see if something broke

01:09 <HdkR> steev: Do you know how the grub from the boot partition understands to pull the grub.cfg from the next partition without configuration?

01:09 <steev> nope :)

01:09 <steev> probably some code somewhere

01:09 <HdkR> The magic bewilders and confuses me

01:16 <robclark> "probably some code somewhere" is an answer to a great many questions ;-)

01:26 <qzed> IIRC you can build a config into grub. That config can then just be something that tells grub to search for the drive containing the path /boot/grub/grub.cfg or something like that, which it then sources

01:29 <HdkR> Does it actually scan partitions to find it?

01:35 <qzed> found it: https://www.gnu.org/software/grub/manual/grub/html_node/Embedded-configuration.html

01:38 <HdkR> Ah yes, magic scanning
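
A minimal sketch of the "magic scanning" qzed describes: an embedded config can ask GRUB to search the available partitions for one that contains a known path (for example with GRUB's `search` command) and then source the config found there. The Python below only models that idea. The partition names, file sets, and the `find_partition_with` helper are hypothetical, and this is not GRUB's implementation.

```python
# Illustrative model only: partition names and file sets are made up,
# and this is not how GRUB itself is implemented.
def find_partition_with(partitions, path):
    """Return the first partition (in scan order) that contains `path`."""
    for name, files in partitions.items():
        if path in files:
            return name
    return None  # nothing matched; a real bootloader would fall back to a prompt


partitions = {
    "hd0,gpt1": {"/EFI/BOOT/BOOTAA64.EFI"},                # boot partition with the grub binary
    "hd0,gpt2": {"/boot/grub/grub.cfg", "/boot/vmlinuz"},  # next partition with the real config
}

# The embedded config would set its root to this partition and source grub.cfg from it.
root = find_partition_with(partitions, "/boot/grub/grub.cfg")
```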

02:42 <steev> robclark: but re: the vsync.. why would it only affect Xorg?

02:43 <robclark> because xorg is using legacy cursor ioctls which we have to emulate by setting a timer to fire shortly before start of vblank period to flush changes to hw

02:44 <robclark> so totally makes sense that it only affects xorg

02:44 <steev> ahh

02:44 <steev> i guess that's also what made the cursor occasionally jump instead of lag

02:45 <robclark> yeah, it's very sensitive to calculating the time of start of vblank period

02:47 <robclark> annoyingly CrOS compositor also uses legacy cursor api.. but things with that patch seemed to be working properly there.. I'll look again tomorrow (didn't have time today) but it is important for other reasons that the vblank helper can correctly calc the start of the vbl deadline
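
A rough sketch of the timing robclark describes, assuming a fixed refresh period and a known timestamp for the last vblank start. All function names, the nanosecond units, and the `margin_ns` parameter are assumptions made for illustration; this is not the msm driver or the DRM vblank helper code. It shows why a small error in the computed vblank start either flushes too early or misses the frame, which would look like a lagging or jumping cursor.

```python
def frame_period_ns(htotal, vtotal, pixel_clock_khz):
    """Duration of one full frame (active + blanking) in nanoseconds."""
    return htotal * vtotal * 1_000_000 // pixel_clock_khz


def next_vblank_start_ns(last_vblank_start_ns, period_ns, now_ns):
    """First vblank start strictly after `now_ns`, given one known vblank start."""
    if now_ns < last_vblank_start_ns:
        return last_vblank_start_ns
    frames_elapsed = (now_ns - last_vblank_start_ns) // period_ns
    return last_vblank_start_ns + (frames_elapsed + 1) * period_ns


def cursor_flush_deadline_ns(last_vblank_start_ns, period_ns, now_ns, margin_ns):
    """When the emulation timer should fire: `margin_ns` before the next vblank."""
    deadline = next_vblank_start_ns(last_vblank_start_ns, period_ns, now_ns) - margin_ns
    if deadline <= now_ns:
        # Too late for this frame; aim for the following vblank instead.
        deadline += period_ns
    return deadline
```

With a wrong `last_vblank_start_ns`, the deadline lands in the wrong place within the frame, so the update is flushed either a frame late or in the middle of scanout.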

03:38 <steev> if i knew what to even remotely look for, for the reason why mutter crashes when we try to use external, it would be a moot point

03:51 <steev> https://gitlab.gnome.org/GNOME/mutter/-/issues/2398 oh, looks like lumag looked into it a bit :)

04:10 <robclark> I thought there was a mutter/g-s thing that you already fixed

04:13 <robclark> oh.. I guess it is another case of msm being less crippled than intel surfacing bad userspace assumptions again ;-)

04:30 <steev> nope... i fixed other things...

04:32 hexdump01 has joined #aarch64-laptops

04:34 hexdump0815 has quit [Ping timeout: 480 seconds]

04:35 <robclark> if you are interested in downstream kernel hacks to work around broken userspace I can probably find you something that we have since reverted in CrOS.. since we encountered these same broken assumptions

04:44 <steev> i mean... i'm not *personally* against it... i'm down with any hack that makes a system usable

04:45 <steev> https://www.aliexpress.us/item/3256804714924643.html

04:45 <steev> oh dang it, that's a 2230

04:48 <robclark> not in front of CrOS git trees atm but tomorrow (it doesn't hurt to ping me to remind me) I'll point you at downstream kernel hacks we used to work around similar issues

04:50 <HdkR> steev: 2230->2242 adapters exist which /might/ work

04:50 <steev> i'm not in any rush :) that bug has been open 6 months...

04:53 <steev> and lumag brings up a good point, asking why mutter cares about primary plane... it would be nice if someone from upstream would respond

05:33 iivanov has joined #aarch64-laptops

05:41 iivanov has quit [Ping timeout: 480 seconds]

06:04 iivanov has joined #aarch64-laptops

06:44 iivanov_ has joined #aarch64-laptops

06:52 iivanov has quit [Ping timeout: 480 seconds]

08:12 iivanov_ has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

08:16 iivanov has joined #aarch64-laptops

10:07 <abelvesa> jhovold: with your wip/sc8280xp-v6.2: https://pastebin.com/raw/GRQmR3yx

10:08 <jhovold> abelvesa: thanks, i'm just debugging that. Was able to trigger it once and it is likely the issue ndec is hitting

10:08 <jhovold> the annoying part is that I reported this to bamse in october, but he couldn't seem to find the cause:

10:08 <jhovold> https://lore.kernel.org/all/Y1efJh11B5UQZ0Tz@hovoldconsulting.com/

10:09 <jhovold> the race is definitely still there in 6.2, but may have been fixed in 6.3

10:12 <jhovold> or rather, he did find a workaround, but then convinced himself it wasn't needed, possibly because bamse tends to run linux-next where the hotplug detect code has been reworked:

10:12 <jhovold> abelvesa: if you got a decent reproducer, can you try moving the initialising like bamse suggested here:

10:12 <jhovold> https://lore.kernel.org/lkml/Y86vaTQR7INWezyj@hovoldconsulting.com/

10:13 <abelvesa> jhovold: thing is I only managed to repro it once on latest boot

10:14 <abelvesa> jhovold: I'll reboot a few more times to see if it repros again

10:15 <jhovold> yeah, it seems to depend on when your userspace starts the pd-mapper. Was able to trigger it four times in a row with one build and got some logs, but now it's gone again

10:17 <jhovold> so the drm driver is definitely buggy as in 6.2 it enabled hotplug notifications before everything has been set up (hence the NULL deref)

10:17 <jhovold> and it seems that bamse's pmic-glink-altmode driver is buggy as it is sending hotplug notifications also when not having any external display connected

10:18 <jhovold> and/or it's the fw that's buggy
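
The ordering problem jhovold describes can be modelled as follows. This is a hedged illustration of the race, not the actual msm DRM or pmic-glink-altmode code: `Notifier`, `Display`, and `probe` are hypothetical names, and Python's `AttributeError` on `None` stands in for the kernel NULL dereference. If the hotplug handler is registered before the state it uses exists, an event that arrives in between (even a "disconnected" one) crashes. Initialising first, as in the workaround, closes that window.

```python
class Notifier:
    """Stands in for the component that sends hotplug events."""
    def __init__(self):
        self.handlers = []

    def register(self, handler):
        self.handlers.append(handler)

    def send(self, connected):
        for handler in list(self.handlers):
            handler(connected)


class Display:
    """Stands in for the display driver; `events` is state set up during probe."""
    def __init__(self):
        self.events = None  # not initialised yet, like a NULL pointer

    def init_state(self):
        self.events = []

    def on_hotplug(self, connected):
        self.events.append(connected)  # fails if init_state() has not run


def probe(display, notifier, init_first, between_steps):
    """Run the two probe steps in either order, with an event window in between."""
    steps = [display.init_state, lambda: notifier.register(display.on_hotplug)]
    if not init_first:
        steps.reverse()  # buggy order: notifications enabled before state exists
    steps[0]()
    between_steps()      # e.g. the notifier fires because pd-mapper just started
    steps[1]()
```

Whether the crash triggers depends purely on when the event arrives relative to the two steps, which matches the "depends on when userspace starts the pd-mapper" behaviour discussed above.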

10:21 <abelvesa> yep, reproduces 4 times out of 4

10:23 <jhovold> i can reproduce it on the crd where I got a serial console, which helps to say the least

10:24 <abelvesa> funny thing is, when I let the grub timeout expire, it reproduces every time

10:25 <abelvesa> when I select an entry before the timeout is reached, it never reproduces

10:25 <jhovold> heh, fscking heisenbug

10:25 <mothenjoyer69> <jenneron[m]> "by problem i mean embedded..." <- do you happen to have an ACPI dump? or know of anyone who may be able to provide one?

10:26 <abelvesa> jhovold: I'll start digging into it then

10:27 <mothenjoyer69> or if any Galaxy Book Go 5G owners (8cx variant) could get an ACPI dump for me, that would be helpful. i have a strong suspicion that it is going to have a lot in common with the galaxy book s but i can't seem to find anything online

10:28 <jhovold> abelvesa: that's ok, I'm on it. But if you can try bamse's workaround from above and have a decent repro then see if that alone is enough to address this

10:31 <abelvesa> jhovold: ok then, will try bamse's workaround

10:33 <jhovold> abelvesa: are you still able to ssh into the machine after you hit the NULL deref?

10:34 <jhovold> my USB ethernet never probes, and the serial getty is not started either when I've hit it

10:34 <jhovold> and the screen remains black as ndec reported...

10:36 <jhovold> abelvesa: and are you using an external display or not? I don't and still hit it as I mentioned above.

10:42 <jhovold> hmm. ok, so the pmic-glink altmode driver always sends notifications, in my case a disconnected event

10:42 <abelvesa> jhovold: nope, can't ssh into it

10:43 <jhovold> guess that was the last piece of the puzzle

10:43 <jhovold> how did you get the log out?

10:43 <abelvesa> journalctl

10:44 <abelvesa> I can't ssh into it because the eth is over typec

10:44 <jhovold> heh, guess ndec never checked that (as lumag suggested)

10:44 <jhovold> same here

10:45 <jhovold> abelvesa: I'll prepare a fix, and then we can see if we need to carry it out-of-tree or if we can get it into 6.3 and backported even with the reworked hotplug code in there

10:46 <abelvesa> jhovold: thanks, please cc me on it as well, I can test it

10:46 <jhovold> i will, thanks

11:00 falk689 has quit [Remote host closed the connection]

11:02 falk689 has joined #aarch64-laptops

11:13 <abelvesa> jhovold: BTW, with bamse's workaround, I can't repro it anymore

11:13 <jhovold> abelvesa: thanks for confirming

11:19 <jhovold> i have a reliable reproducer now too

12:07 <HdkR> oop, I noticed external display isn't working anymore on X13s. Regression or do I just need to reboot the device?

12:08 <danielt> Which kernel version? There was a display regression recently but it has been fixed.

12:08 <HdkR> Rocking steev's 6.2 branch

12:09 <HdkR> I believe the rc branch was the one that had it working but I guess something changed between those time periods

12:13 <danielt> I see... I'm using a rebased version of https://github.com/jhovold/linux/tree/wip/sc8280xp-v6.2 (where external display works).

13:19 <HdkR> hm

13:46 kettenis_ has quit [Ping timeout: 480 seconds]

14:52 hightower2 has joined #aarch64-laptops

16:00 falk689 has quit [Remote host closed the connection]

16:00 falk689 has joined #aarch64-laptops

16:56 hightower2 has quit [Ping timeout: 480 seconds]

16:56 <steev> sometimes i have to unplug and then plug the device back in HdkR

16:57 falk689 has quit [Remote host closed the connection]

17:01 <HdkR> steev: I did that even. Maybe it does just need a restart

17:03 <danielt> steev: Very good point... I typically have to hotplug twice to get the second display to fire up.

17:18 falk689 has joined #aarch64-laptops

17:41 hightower2 has joined #aarch64-laptops

18:06 <clover[m]> External display working here (hotplug) with steev 6.2

18:06 <steev> i typically have to unplug my nvme ssd adapter, and then unplug and plug the usb-c back in IF it doesn't crash...

18:07 <steev> debating leaving the mutter crash in so i know that the external display works ;)

18:07 <HdkR> When I get some time I'll restart and retest. Could be a quirk with longer uptime, and/or a mixture of sleep

18:09 <steev> HdkR: hm, don't forget to flip the usb-c as well, just to test that

18:10 <HdkR> That was the second thing I tried after replugging the cable :D

18:10 <steev> i only get external display with the pinebook usb-c hub when it's flipped so that the ports are facing the user

18:10 <steev> https://pine64.com/product/pinebook-pro-usb-c-docking-deck/ but hey, it was only 50 bucks so *shrug*

18:11 <steev> oh wait no, that's the new one

18:11 <steev> they don't seem to sell the version i got anymore

18:12 <steev> oh right - i got the phone edition - https://pine64.com/product/pinephone-pro-usb-c-docking-bar/ - because it was only 27 bucks :D

18:13 <amstan> that's typical behavior for something not hooked up right with the lanes

18:14 <amstan> the AP needs to be told which orientation stuff is from whomever talks PD
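
A simplified model of what amstan is pointing at, under stated assumptions: the four high-speed pairs on the connector are numbered 0 to 3, flipping the plug mirrors them, and the AP routes DisplayPort lanes according to the orientation it is told by whoever speaks PD. The numbering and the `route_lanes` / `link_works` helpers are hypothetical and do not reflect the real Type-C pin assignments. If the orientation is never reported, the routing only matches for one way of plugging in the cable.

```python
def route_lanes(orientation):
    """Map logical DP lane -> connector pair for a given plug orientation."""
    if orientation == "normal":
        return {0: 0, 1: 1, 2: 2, 3: 3}
    if orientation == "reversed":
        return {0: 3, 1: 2, 2: 1, 3: 0}  # mirrored when the plug is flipped
    raise ValueError(f"unknown orientation: {orientation}")


def link_works(actual_orientation, reported_orientation):
    """The link trains only if the AP routes lanes for the real orientation."""
    return route_lanes(actual_orientation) == route_lanes(reported_orientation)


# A driver that never receives the orientation behaves as if it were always "normal":
works_one_way = link_works("normal", "normal")
fails_flipped = not link_works("reversed", "normal")
```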

18:15 <steev> not to disparage pine64, because i love their stuff, but let's not pretend their hardware is wired up correctly either... could be something on their end

18:17 <amstan> no, i wouldn't blame the adaptor, probably your laptop/drivers

18:17 <amstan> i mean.... it's possible it's the adaptor, but given we're all in this channel for a reason ;)

18:18 <steev> lol

18:19 <steev> it's definitely the driver, i just dunno/don't have the skill to fix it; bamse said something about one of his other trees probably has the fix, i just haven't gone through the backlog to see which one it was to compare

18:21 <steev> like, sm8150 or something

18:22 <amstan> this is how gru does it: https://github.com/torvalds/linux/blob/master/arch/arm64/boot/dts/rockchip/rk3399-gru-chromebook.dtsi

18:22 <amstan> grep for usbc_extcon1

18:22 <amstan> extcon is the old way of doing it, there might be some typec apis now instead

18:24 <amstan> on your device the other end (non-dp ip block) is probably not an EC but something else that also speaks PD, maybe the tcpc directly, or knowing qcom maybe the pmic

18:34 kettenis has joined #aarch64-laptops

19:21 <steev> i couldn't say :(

20:06 hexdump01 has quit [Quit: WeeChat 1.9.1]

20:06 hexdump0815 has joined #aarch64-laptops

21:08 hexdump01 has joined #aarch64-laptops

21:09 hexdump0815 has quit [Ping timeout: 480 seconds]

21:26 <robclark> steev: one char typo in the "cleaned up" version of my patch was responsible for your cursor badness:

21:26 <robclark> https://www.irccloud.com/pastebin/XxwkmNa5/

22:14 <steev> ohh

22:16 <steev> will give that a spin!

22:18 <ajhalaney[m]> pesky exclamation points :P

22:33 iivanov has quit [Ping timeout: 480 seconds]