#aarch64-laptops on 2023-02-19 — irc logs at oftc.irclog.whitequark.org

2022-12-21 00:44 ChanServ changed the topic of #aarch64-laptops to: Linux support for AArch64 Laptops (Asus NovaGo TP370QL - HP Envy x2 - Lenovo Miix 630 - Lenovo Yoga C630)

00:21 laine has joined #aarch64-laptops

00:24 laine has quit [Remote host closed the connection]

00:24 laine has joined #aarch64-laptops

00:29 laine has quit [Remote host closed the connection]

00:29 laine has joined #aarch64-laptops

00:31 <HdkR> https://browser.geekbench.com/v5/cpu/compare/20676539?baseline=20676625 Hmmmm, that single threaded result on the Lenovo implies longer term tests are working with appropriate perf

00:33 <HdkR> But that might also imply that that massive 8MB L3 is working while the 6MB SLC isn't :P

02:04 laine has quit [Ping timeout: 480 seconds]

02:11 <HdkR> steev: Do you have any kernel debugging flags enabled in your config? I'm seeing high kernel load under perf in some instances

02:19 laine has joined #aarch64-laptops

02:22 <steev> i do believe there are, yeah

02:23 <steev> i was trying to get uh... what was it called

02:25 <steev> https://ingraind.org/ this working

02:27 <steev> and it required some kernel config options... but i never did get it

02:28 <HdkR> Toggled a bunch of random settings, hope it resolves this performance delta with short lived applications

02:29 <steev> let me know which ones, and i can look into disabling them too

02:29 <steev> actually... i could compare to mani's defconfig

02:31 <steev> https://paste.debian.net/1271294/

02:32 <HdkR> Does seem like a ton of differences

02:33 <steev> nah, the only real one i can think of is the fault injection

02:33 <steev> function*

02:33 <steev> the rest are mostly just modules becoming built-ins

02:33 laine has quit [Ping timeout: 480 seconds]

02:39 <HdkR> Actually, I think this has to be cache related. Profiling only code that does linear writes to memory, Lenovo takes 52% more time

02:40 <HdkR> 51ms -> 78ms in absolute time
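
The relative slowdown quoted above can be checked with a few lines of arithmetic. This is only a sketch; the function name is made up for illustration, and the two timings are the ones HdkR reports.

```python
def slowdown_percent(baseline_ms: float, measured_ms: float) -> float:
    """Return how much longer measured_ms is than baseline_ms, in percent."""
    return (measured_ms - baseline_ms) / baseline_ms * 100.0


# 51 ms on the faster device versus 78 ms on the Lenovo, as quoted above.
print(f"{slowdown_percent(51, 78):.1f}% more time")
```

With the rounded millisecond values this lands just under 53%, in line with the roughly 52% figure measured from the unrounded times.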

02:40 <steev> https://paste.debian.net/hidden/033f1ef6/ actually, that's the difference between what i run and mani's defconfig
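
A comparison like the one steev pastes can be approximated by parsing two kernel config files and listing the options whose values differ. The sketch below is not the tool used in the channel; the function names are hypothetical, and the two sample configs are invented for illustration (CONFIG_FUNCTION_ERROR_INJECTION is a guess at the "function" injection option mentioned above, and CONFIG_EXAMPLE_DRIVER is a placeholder).

```python
def parse_config(text: str) -> dict:
    """Map option names to values; '# CONFIG_X is not set' becomes 'n'."""
    suffix = " is not set"
    options = {}
    for line in text.splitlines():
        line = line.strip()
        if line.startswith("# CONFIG_") and line.endswith(suffix):
            options[line[2:-len(suffix)]] = "n"
        elif line.startswith("CONFIG_") and "=" in line:
            name, _, value = line.partition("=")
            options[name] = value
    return options


def diff_configs(old: dict, new: dict) -> dict:
    """Return {option: (old_value, new_value)} for every option that differs."""
    changes = {}
    for name in sorted(set(old) | set(new)):
        if old.get(name) != new.get(name):
            changes[name] = (old.get(name), new.get(name))
    return changes


# Invented sample data: one module turned built-in, one debug option enabled.
defconfig = "CONFIG_EXAMPLE_DRIVER=m\n# CONFIG_FUNCTION_ERROR_INJECTION is not set\n"
running = "CONFIG_EXAMPLE_DRIVER=y\nCONFIG_FUNCTION_ERROR_INJECTION=y\n"

for option, (before, after) in diff_configs(
    parse_config(defconfig), parse_config(running)
).items():
    print(f"{option}: {before} -> {after}")
```

Treating "is not set" as "n" keeps a newly enabled option from looking like a brand-new one, which matters when most of the differences are modules becoming built-ins.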

02:42 <HdkR> Quite small

02:43 <steev> indeed, most of the changes are just the new options

02:45 <HdkR> Is there any way to determine if L3 and SLC are running at max clocks?

02:46 <HdkR> Also no idea how to even see SLC in the dts

02:46 <HdkR> llcc_bwmon_opp_table?

02:48 <steev> maybe (and check the dtsi)

02:49 <HdkR> The opp tables seem weird. 1.5GB/s on the top end?

02:49 <steev> i blame bamse

02:50 <HdkR> oop, that's in kilobytes, not kilobits. 15.2GB/s is still significantly lower than what sc8280xp offers
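
The unit mix-up is easy to reproduce. The sketch below assumes the table stores bandwidth in kilobytes per second (as the `opp-peak-kBps` style of property does) and uses a hypothetical entry of 15,200,000 to match the 15.2GB/s figure quoted above; the real values in the dtsi are not reproduced here.

```python
def kbytes_per_s_to_gb_per_s(value_kbps: int) -> float:
    """Interpret the value as kilobytes per second (decimal units)."""
    return value_kbps * 1000 / 1e9


def kbits_per_s_to_gb_per_s(value_kbps: int) -> float:
    """Interpret the same value as kilobits per second instead."""
    return value_kbps * 1000 / 8 / 1e9


top_entry = 15_200_000  # hypothetical top OPP value, chosen to give 15.2 GB/s
print(f"as kilobytes/s: {kbytes_per_s_to_gb_per_s(top_entry):.1f} GB/s")
print(f"as kilobits/s:  {kbits_per_s_to_gb_per_s(top_entry):.1f} GB/s")
```

Reading the value as kilobits understates the bandwidth by a factor of eight.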

02:51 <HdkR> Unless this is operating modes that once you hit that threshold it hits whatever opp-6 level is

02:52 <HdkR> If I delete all but the largest, would I get the highest config?

02:54 <steev> maybe?

02:58 <HdkR> It's very tricky when I can't tell what frequency each of these levels of cache are running at. As far as I'm aware there is no debugfs available for it

02:58 <steev> maybe someone in -msm knows?

03:15 <HdkR> Deleting all the opp values except the top end one seemingly gave a few percent in geekbench, but measurable absolute times didn't seem to improve hmmm

03:24 <HdkR> I am seeing that epss_l3 is falling down the generic compatible path but sc7180/sc7280/sc8180x have a specific compatible

03:25 <bamse> HdkR: are you saying that i should turn those numbers to 11?

03:25 <HdkR> Not really, it doesn't solve my issue with short-term program execution and I'm just trying random ideas since I'm not a kernel dev

03:25 <bamse> HdkR: iirc i dumped the frequency table from epss, and then spread the values across the cpu frequencies haphazardly

03:26 <HdkR> Considering I have no idea what that means, sounds reasonable to me :P

03:27 <steev> i don't even know how to do things like dump the frequency tables

03:27 <HdkR> Same

03:28 <steev> still trying to figure out how to actually capture all of the bluetooth traffic so i can figure out the frame reassembly crap

03:28 <bamse> steev: for l3, you go into the osm-l3.c and you find a dev_dbg()...make that print and you'll get the table

03:28 <steev> oh

03:29 <HdkR> bamse: Is there any way to determine if SLC is running maxed out?

03:31 <bamse> HdkR: might be possible to measure the clock rates

03:31 <HdkR> Since moving from 51GB/s of sm8350 to 68GB/s of sc8280xp and faster CPU cores should be a clear win rather than a loss

03:31 <HdkR> So I can only assume cache

03:31 <bamse> HdkR: what are you measuring there?

03:31 <HdkR> Absolute time to write code into a linear buffer

03:32 <bamse> sized for hitting each cache?

03:32 <HdkR> tasksetting to the X1 of the sm8350 and the X1C of the sc8280xp

03:33 <HdkR> not sized since this is just measuring total time while running an application

03:33 <HdkR> It's probably only a few megabytes

03:33 <bamse> i've been running https://github.com/andersson/mybw/blob/main/mybw.c

03:34 <HdkR> So that's testing linear reads I see

03:35 <bamse> yes, because when doing write i noticed in some cases the cache performance was limited by ddr performance

03:35 <HdkR> That would be a good test as well in my case, since L3 cache size has double between the two platforms and DDR perf has increased

03:35 <HdkR> has doubled*

03:36 <HdkR> And basically testing what I'm measuring right now

03:36 <bamse> when using writes, i saw something like l3 performance being limited by ddr performance
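
A write-side counterpart to the linear-read test can be sketched by timing repeated fills of buffers sized to land in different cache levels. This is not mybw and not HdkR's workload; it is a rough illustration that leans on `ctypes.memset`, and the buffer sizes and the optional CPU pinning are assumptions. Pinning uses `os.sched_setaffinity`, which is Linux-only, and which core number corresponds to the X1/X1C cores depends on the device.

```python
import ctypes
import os
import time


def write_bandwidth_gb_s(size_bytes: int, repeats: int = 20, cpu=None) -> float:
    """Time linear fills of a size_bytes buffer and return GB/s (decimal)."""
    if cpu is not None:
        os.sched_setaffinity(0, {cpu})  # pin to one core, like taskset (Linux only)
    buf = ctypes.create_string_buffer(size_bytes)
    ctypes.memset(buf, 0xA5, size_bytes)  # warm-up pass so pages are faulted in
    start = time.perf_counter()
    for _ in range(repeats):
        ctypes.memset(buf, 0x5A, size_bytes)
    elapsed = max(time.perf_counter() - start, 1e-9)  # avoid division by zero
    return size_bytes * repeats / elapsed / 1e9


def main() -> None:
    # Sizes are guesses meant to straddle the cache sizes discussed in the log.
    for size_mib in (1, 2, 4, 8, 16, 64):
        rate = write_bandwidth_gb_s(size_mib * 1024 * 1024)
        print(f"{size_mib:3d} MiB: {rate:6.1f} GB/s")


if __name__ == "__main__":
    main()
```

The numbers depend on the libc memset implementation and on frequency scaling, so they are only useful for comparing two machines running the same script, not as absolute bandwidth figures.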

03:39 <HdkR> Oh, L3 and SLC size doubled going from sm8350 to sc8280xp

03:39 <HdkR> 4MB+3MB -> 8MB+6MB

03:39 <bamse> hmm, did i merge that slc-fix for 6.3?...

03:39 <HdkR> Everything just goes bigger. I can't see why perf would go down :(

03:42 <steev> which slc fix?

03:43 <bamse> the incorrect slice ids

03:44 <steev> i don't recall seeing anything about that... but seems like something johan would have added if it were in

03:44 <steev> oh

03:44 <steev> from abel?

03:44 <bamse> yes

03:45 <steev> i don't see it in actual -next yet

03:46 <steev> but it *should* be in what hdkr is using

03:46 <bamse> then i should pick that up for 6.3-rc

03:47 <bamse> doesn't seem to be able to boot my sm8350 device today...so i guess we have some fixes needed for that as well

03:50 <steev> it boots the x13s just fine

03:50 <steev> i've rebooted a bunch while playing around with bluetooth today :)

03:50 <steev> and tim's v2 is arguably worse than v1 - at least in terms of figuring out which patch to use...

03:51 <bamse> well... HdkR didn't test with upstream sm8350 at least...

03:51 <bamse> it definitely needs some tlc

03:52 <HdkR> I'm still on steev/lenovo-x13s-next-20230210

03:53 <steev> you should switch to the 6.2 stuff (6.2-rc8)

03:53 <HdkR> Would this bring in the mentioned slc slice id fix?

03:53 <HdkR> oh no, I see a commit about it

03:54 <steev> no, it's already in that one too

03:54 <bamse> applied it and re-ran mybw...there it didn't make a difference at least

03:54 <steev> but tbh... next is kinda.... iffy tbh

03:55 <bamse> steev: hoping we can get 6.3-rc1 up and running, and polish that iffyness out of it...

03:56 <HdkR> So if SLC got a fix, maybe L3 needs a fix as well? :)

03:57 <bamse> no, not that problem

03:59 <bamse> HdkR: but if you run "mybw" on your x13s, what do you get?

03:59 <HdkR> Let's see

04:01 <HdkR> Looks like it is peaking out at around 24GB/s atm

04:02 <bamse> do you have a functional sm8350 that you can run it on as well?

04:02 <HdkR> aye

04:03 <HdkR> https://gist.github.com/Sonicadvance1/b7e6d5e3e150917d32a4f3c2657cda75 There's some bad stuff in there

04:04 <bamse> ohh

04:04 <HdkR> tasksets just to ensure it stayed on X1/X1C cores

04:05 <bamse> https://hastebin.com/share/elagikeyux.bash

04:05 <bamse> that's what i get...

04:05 <HdkR> Well there's quite a difference there

04:05 <bamse> your number matches pretty much what i had before i fixed the l3 scaling

04:05 <bamse> s/fixed/enabled...

04:06 <HdkR> So L3 needs a fix you say? :P

04:06 <bamse> definitely!

04:06 <HdkR> Where do I get this fix?

04:06 <bamse> 9235253ec73d ("interconnect: qcom: osm-l3: Add per-core EPSS L3 support") and a few others...went into v6.2-rc1

04:07 <bamse> https://lore.kernel.org/all/20221111032515.3460-1-quic_bjorande@quicinc.com/

04:08 <bamse> before that we got ~15% lower geekbench score in linux compared to windows...

04:08 <bamse> with that we're ~5% faster

04:09 <HdkR> Which is more in line with what I'd expect from just the clock speed boost even

04:11 <HdkR> Grabbing steev's latest branch which has that commit

04:12 <bamse> steev: ahh, i didn't pick the llcc bug fix, because johan's ask is reasonable

04:34 hexdump01 has joined #aarch64-laptops

04:36 hexdump0815 has quit [Ping timeout: 480 seconds]

04:39 <steev> Ah, that makes sense

04:39 <steev> HdkR: yours should have it too

04:39 <steev> Well, the next should

04:40 <steev> Either way

04:40 <HdkR> Building now at least

04:40 <steev> I just prefer what’s in 6.2 because of the 6.3 iffyness

04:43 <HdkR> I'm on the 6.2.0-rc8 branch now at least

04:44 <HdkR> Which seemingly fixed the mybw program's results at least

04:44 <steev> interesting

04:45 <HdkR> Didn't solve the memory writing issue though sadly

04:49 <steev> i'm not sure why he dropped the GPU and other one as well

04:50 <steev> * dropped the LLCC_GPU and LLCC_WRCACHE max_cap changes

04:50 <steev> i'm assuming those numbers just aren't in the docs?

04:56 <bamse> HdkR: do you have a similarly simple test that manifests the problem you're seeing?

04:57 <bamse> HdkR: i don't have any more cards up my sleeve...but i'd be happy to take a look (and/or spread the word...)

05:03 <HdkR> bamse: Sadly nothing simple. I'm grabbing some xray stats of FEX running `/usr/bin/true`

05:04 <HdkR> I think this 6.2.0-rc8 branch solved the issue I was having with NFS hardlocking though

05:16 <HdkR> Two full builds without hang, so that's nice

05:50 alfredo has joined #aarch64-laptops

06:08 <clover[m]> steev: im on 6.2.0-rc8-0-x13s+

06:08 <clover[m]> seems like back to dummy output for audio

06:09 <clover[m]> gpu still works

06:10 <clover[m]> bt works

06:24 alfredo has quit [Ping timeout: 480 seconds]

06:25 <steev> dmesg output?

06:26 <steev> there are still issues here and there, and we're waiting on another patchset from srini

06:26 <steev> it might just be a reboot is needed if you got something like an underflow

06:26 <steev> i wish there was a way to unload the audio modules, but it seems like they rely on something being built in, and it can't be removed

06:34 alfredo has joined #aarch64-laptops

06:44 <HdkR> https://gist.github.com/Sonicadvance1/8a4017195f38de9762198950fedbc640 Scheduler seems a bit spicy but this game might just be abusive

07:05 <steev> maybe you should just get thunks working

07:05 <steev> disclaimer: i have no idea what thunks are

07:15 <HdkR> lol

07:16 <HdkR> We're working on it but 32-bit thunking is complex and upstream doesn't want our patches to make it easier

07:18 <steev> i hate when upstream doesn't want things to make it easier

07:18 <steev> not even sarcasm

07:21 <HdkR> It's indeed a pain

07:42 <amstan> do the thunks!

07:42 <amstan> (i also don't know what those are, but the word sounds fun)

07:48 <HdkR> hah, it's in progress!

07:49 <HdkR> I /think/ we have a way to fix the 32-bit allocator problem but it's going to take some effort

08:21 <steev> i should also look into fex... i wanna play darkest dungeon

08:25 <HdkR> It's marked as working on my game list. Don't think I ever perf tested it, probably fine

08:27 <steev> if that game had perf issues... that would be very sad. it's not exactly an intense game

08:27 <HdkR> That's what I was thinking but you never know. Sometimes games do terrible things

08:28 <steev> heh

08:28 <HdkR> Some games' audio loops right now can get really intense due to sleeping for too long, then trying to quickly do work

08:28 alfredo has quit [Quit: alfredo]

08:48 <HdkR> I think this is mostly from the virtual cycle counter not being fast enough. Really looking forward to those ARMv9.1 CPUs enforcing 1GHz

09:01 <clover[m]> steev: https://pastebin.com/VermQs3Z

09:02 <steev> [ 24.290021] platform sound: deferred probe pending

09:02 <steev> OH

09:03 <steev> you need

09:03 <steev> https://git.linaro.org/people/srinivas.kandagatla/linux.git/commit/?h=wip/sc8280xp-v6.2-rc6&id=fb129ee792f3c7d16eaba4e21f869bbdd7c5053e make sure those options are set in your config

09:03 <steev> you're probably missing +CONFIG_SC_LPASSCSR_8280XP=y
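
Checking whether an option is set in a config file can be done with a short parser. The sketch below takes the config as text; where that text comes from (a `.config` in the build tree, a file under `/boot`, or similar) is left to the reader, and the function name is hypothetical. The option name is the one steev mentions.

```python
def is_enabled(config_text: str, option: str) -> bool:
    """Return True if option is set to y or m in the given kernel config text."""
    for line in config_text.splitlines():
        line = line.strip()
        if line.startswith("#"):
            continue  # covers '# CONFIG_X is not set' and ordinary comments
        name, sep, value = line.partition("=")
        if sep and name == option:
            return value in ("y", "m")
    return False


sample = "CONFIG_SC_LPASSCSR_8280XP=y\n# CONFIG_SOMETHING_ELSE is not set\n"
print(is_enabled(sample, "CONFIG_SC_LPASSCSR_8280XP"))
print(is_enabled(sample, "CONFIG_SOMETHING_ELSE"))
```

An option that is absent from the file is reported the same way as one that is explicitly not set.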

09:06 <steev> hm, i thought i pulled in mani's patch that deals with the timeouts too, shouldn't be that many, but maybe that's just what we get now, unsuspend shows them moreso than at boot

09:06 <steev> either way, it's 3am here, and i should head to bed since the neighbor's kid isn't screaming anymore

09:40 Guest5298 has joined #aarch64-laptops

09:41 <Guest5298> any news about X13s firmware update to expose EL2?

09:45 <HdkR> That would be pretty sick

09:46 <HdkR> But is it even likely to occur?

09:51 Guest5298 is now known as Caterpillar2

09:51 krzk has joined #aarch64-laptops

09:52 <Caterpillar2> HdkR: perhaps javierm broonie know if it will be released or not. Everybody here is under f** NDA

09:52 <Caterpillar2> buying X13s has been a huge mistake

09:53 <Caterpillar2> maybe one day I will destroy it and put the video on youtube

10:00 <HdkR> Ah. I see.

10:19 alfredo has joined #aarch64-laptops

10:47 alfredo has quit [Quit: alfredo]

10:53 jhovold has joined #aarch64-laptops

12:02 <qzed> Keep in mind I "know" this stuff mostly from random internet posts/comments and from things I've heard:

12:02 <qzed> Qualcomm runs some firmware stuff in EL2

12:03 <qzed> or that's what people say at least

12:03 <qzed> so my guess is no one and not even an OEM if they're asking nicely is going to get access to it anytime soon

12:04 <qzed> at least on their WoA stack, the android stack seems to be different

12:04 <qzed> and don't ask me why they can't do it the android way...

12:05 <qzed> so our best bet for KVM is to somehow make use of the bypervisor thing they have set up there

12:05 <qzed> *hypervisor

12:10 <qzed> also: some android devices suffer from the same problem: https://github.com/msm8916-mainline/qhypstub

13:59 <ardb> qzed: iirc the chromeos arm64 laptops boot at el2

14:00 <qzed> ah right, was mixing up android with chromeos I think

16:04 Caterpillar2 has quit [Quit: Konversation terminated!]

16:21 krzk has quit [Quit: Lost terminal]

16:46 <javierm> ardb: correct, the chromebooks support KVM

17:20 falk689 has quit [Quit: No Ping reply in 180 seconds.]

17:22 falk689 has joined #aarch64-laptops

18:25 <steev> yeah but chromebooks aren't your bog standard oem... google can say "make this happen"... we can't

18:27 <clover[m]> steev: yep, that config value was my missing puzzle piece

18:27 <steev> clover[m]: oops, sorry

18:27 <clover[m]> rc8 working fine now

18:28 <steev> good to hear :)

18:32 <steev> there is one issue i occasionally see here, wifi drops and starts spamming [52013.164196] ath11k_pci 0006:01:00.0: HTC Rx: insufficient length, got 1480, expected 1488 (the got/expected change); simply `sudo modprobe -r ath11k_pci; sudo modprobe ath11k_pci` works
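
Since the got/expected values change between occurrences, it can help to pull them out of the log and look at the shortfall. The sketch below parses the exact message format quoted above with a regular expression; the function name is hypothetical, and feeding it lines from dmesg or a journal is left to the reader.

```python
import re

# Matches the ath11k message quoted above and captures both lengths.
HTC_RX_PATTERN = re.compile(
    r"HTC Rx: insufficient length, got (\d+), expected (\d+)"
)


def parse_htc_rx_error(line: str):
    """Return (got, expected) as integers, or None if the line does not match."""
    match = HTC_RX_PATTERN.search(line)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


line = (
    "[52013.164196] ath11k_pci 0006:01:00.0: HTC Rx: insufficient length, "
    "got 1480, expected 1488"
)
got, expected = parse_htc_rx_error(line)
print(f"got={got} expected={expected} short by {expected - got} bytes")
```

Collecting the shortfall across many lines would show whether it is constant, which is one way to test the guess that the failure is tied to packet size.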

18:34 <HdkR> That's suspiciously close to MTU. Wonder what happens if you reduce

18:37 <steev> i was just gonna mention it to mani and then forget about it til next time i hit it :P

18:37 <steev> i suppose i could try that too

18:45 <HdkR> Drop to 1400 and hope it goes away? :P

18:50 <HdkR> Does the kernel expose a way to query the CPU cores cache sizes these days? I need to double check the MSR list

18:50 <HdkR> `L2 L#0 (0KB) + L1d L#0 (0KB) + L1i L#0 (0KB) + Core L#0 + PU L#0 (P#0)` lstopo doesn't understand them at least :D
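
On Linux, per-CPU cache information is normally exposed under `/sys/devices/system/cpu/cpuN/cache/indexM/`, with files such as `level`, `type` and `size`. The sketch below reads those files and tolerates missing ones; the function name is hypothetical, and whether the entries are populated on this platform is not confirmed by the log (the 0KB values from lstopo suggest they may not be).

```python
from pathlib import Path


def read_cache_info(cpu: int = 0, sysfs_root: str = "/sys/devices/system/cpu"):
    """Return a list of dicts describing each cache index found for one CPU."""
    cache_dir = Path(sysfs_root) / f"cpu{cpu}" / "cache"
    if not cache_dir.is_dir():
        return []  # no cache directory exposed for this CPU

    def read_attr(index_dir: Path, name: str):
        path = index_dir / name
        return path.read_text().strip() if path.is_file() else None

    entries = []
    for index_dir in sorted(cache_dir.glob("index[0-9]*")):
        entries.append(
            {
                "level": read_attr(index_dir, "level"),
                "type": read_attr(index_dir, "type"),
                "size": read_attr(index_dir, "size"),
            }
        )
    return entries


for entry in read_cache_info(cpu=0):
    print(f"L{entry['level']} {entry['type']}: {entry['size']}")
```

An empty list, or entries whose size is None, means the kernel did not expose that information for the core in question.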

18:53 spikerguy has joined #aarch64-laptops

19:04 <clover[m]> Welcome spikerguy:

19:25 Caterpillar2 has joined #aarch64-laptops

19:32 <qzed> anyone here with decent PCI subsystem knowledge (especially pm / power-off related)? or alternatively does someone know dark magic and has a lamb to sacrifice? or can point me to someone who knows?

19:33 <qzed> turns out ResetSystem isn't just broken on some ARM/Qualcomm devices but also on the x86 Surface Pro 9 and Surface Laptop 5

19:33 <qzed> and it seems PCI shutdown stuff is to blame somehow

19:34 <qzed> which is a bit out of my depth... so I was wondering whether anyone here has some pointers

19:34 <qzed> (IRCs, people to ask... anything really)

19:50 <steev> manni might know, he's been touching the arm/qualcomm stuff

19:50 <steev> mani_s when he's on irc

19:54 <qzed> the quick summary of the problem is: EFI's ResetSystem returns (doesn't seem to crash / page-fault as far as I can tell) under certain conditions

19:54 <qzed> in particular, it returns after some PCI shutdown functions ran

19:55 <qzed> so it looks like firmware is expecting some devices to be left on I guess

19:56 <qzed> which means if we run those PCI shutdown callbacks for some devices, the system essentially stays on and goes to a busy halt loop

19:56 <qzed> now I can write a PCI quirk for that... but I'm not sure if that's the best approach, feels very janky

20:09 laine has joined #aarch64-laptops

20:23 alfredo has joined #aarch64-laptops

20:31 alfredo has quit [Ping timeout: 480 seconds]

20:36 <HdkR> oop, looks like the new kernel didn't solve the hanging problem after all. Just significantly less likely to occur

20:42 spikerguy has quit [Quit: Page closed]

21:27 <steev> that... sucks... but is also good to know?

22:28 <HdkR> Good to know so I can warn people yes

22:47 <steev> just buy them a 2TB ssd to put in them ;)

22:47 <steev> although looks like lenovo only even has 512GB on their site available :(

22:48 <steev> i was looking to get a 1T

22:48 <HdkR> I've actually got a 2TB on the way for replacing the 512GB one

22:49 <steev> https://www.amazon.com/Sabrent-DRAM-Less-Internal-Performance-SB-1342-2TB/dp/B07XVRTG1H i've been eyeing this one

22:49 <HdkR> That's exactly the one I bought

22:49 <HdkR> There aren't many 2230 or 2242 drives available. Shame it is DRAM-less but eh

22:50 <steev> yeah, i noticed that too

22:50 <HdkR> WD's drives in the smaller form factor have the same problem

22:51 <HdkR> I just think it physically doesn't have the space and they don't use PoP

22:51 <steev> i don't know enough about storage technologies :(

22:52 <steev> but once you get it... i definitely wanna know what you think of it

22:53 <HdkR> It will /probably/ be fine

22:53 <steev> i wanna know speed wise, because i do a lot of building as well

22:53 <steev> i build kali packages on it

22:53 <steev> in fact... right now i'm building rust 1.65... which required building 1.64 first *sigh*

22:54 <steev> because neither 1.64 nor 1.65 will be pushed to sid/testing before bookworm releases

22:57 <HdkR> I could probably run some fio benches before and after

23:00 falk689 has quit [Remote host closed the connection]

23:00 falk689 has joined #aarch64-laptops

23:20 Caterpillar2 has quit [Quit: Konversation terminated!]