#linux-msm on 2021-07-23 — irc logs at oftc.irclog.whitequark.org


01:27 <steev> bamse: should i send another mail to the list - ran into this just now - https://bpa.st/GXIQ

01:28 <steev> and it won't wake from suspend

03:06 <bamse> steev: isn't that the same issue?

03:07 <bamse> qcom_lmh_dcvs_handle_irq() ends up calling mutex_lock()...

03:28 <steev> i thought so, but i wasn't sure since this time it seemed to take out the ufs

03:42 <bamse> ufs is really bad at telling you when it's actually ufs that's out :/

03:42 <bamse> typically any mishap that causes ufs operations to time out is considered a ufs fault and you get that huge blargh

05:57 cmeerw has joined #linux-msm

06:48 sumits has quit [Remote host closed the connection]

06:48 bhsharma has quit [Remote host closed the connection]

06:48 bryanodonoghue has quit [Remote host closed the connection]

06:48 lumag has quit [Read error: Connection reset by peer]

06:48 alimon has quit [Read error: Connection reset by peer]

06:48 mani_s has quit [Write error: connection closed]

06:51 alimon has joined #linux-msm

06:53 lumag has joined #linux-msm

06:54 mani_s has joined #linux-msm

06:54 sumits has joined #linux-msm

06:55 bryanodonoghue has joined #linux-msm

06:56 bhsharma has joined #linux-msm

07:26 pevik__ has quit [Remote host closed the connection]

12:14 pg12 has joined #linux-msm

12:21 pg12_ has quit [Ping timeout: 480 seconds]

12:31 lounge-user has joined #linux-msm

15:07 flto has quit [Read error: No route to host]

15:07 flto has joined #linux-msm

15:32 <steev> do you happen to have any kind of guidance?

15:32 <steev> sboyd: would you mind taking a look at https://paste.ubuntu.com/p/bTq3dTMTDd/ ? running into an issue of disp_cc_mdss_pclk0_clk_src: rcg didn't update its configuration - booted with drm.debug=0x3, and i don't really know where to go from here, but the paste is trimmed down to just the area where the issue occurs. abhinav said that https://github.com/steev/linux/commit/170b763597d3a0a79f135e4d83a38462c3964fdf was done for a similar issue...

15:36 <bamse> steev: is that with or without {clk,pd}_ignore_unused?

15:36 <steev> with both of those enabled

15:38 <steev> kernel command line on that boot would have been "BOOTIMAGE=/boot/vmlinuz-5.13.4 root=UUID=<uuid> ro pd_ignore_unused cclk_ignore_unused verbose drm.debug=0x3"

15:39 <steev> clk was spelled correctly though

15:39 <bamse> it's the thought that counts ;)

15:41 <steev> fwiw

15:41 <steev> removing them

15:41 <steev> [ 4.510637] dpu_mdss_enable+0x30/0x130 [msm]->msm_dss_enable_clk: core en fail. rc=-16

15:41 <steev> [ 4.512910] disp_cc_mdss_ahb_clk status stuck at 'off'

15:41 <steev> [ 4.511148] [dpu error]clock enable failed, ret:-16

15:41 <steev> [ 4.512421] ------------[ cut here ]------------

15:42 <steev> https://paste.ubuntu.com/p/qfNCnrJvRH/
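
The rc=-16 and ret:-16 values in the lines steev pasted are negative errno codes. A minimal sketch for decoding them, assuming it runs on a Linux host where errno 16 is EBUSY; the helper name decode_rc is hypothetical and not part of any kernel tooling:

```python
import errno
import os


def decode_rc(rc):
    """Map a kernel-style negative return code to (symbolic name, message)."""
    code = abs(rc)
    # errno.errorcode maps numeric codes to names such as "EBUSY".
    name = errno.errorcode.get(code, "unknown")
    return name, os.strerror(code)


if __name__ == "__main__":
    print(decode_rc(-16))
```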

15:47 <bamse> hmm okay, that's different from what we get on the mtp...need to take a look at that as well then

15:48 <bamse> i posted a patch/rfc recently to fix another problem where mdp_src_clk gets stuck because we're turning off the parent and then we try to reparent it without turning on the old parent again

15:49 <steev> hm, lemme looksie

15:49 <bamse> but that's a different clock...

16:05 Marijn[m] has joined #linux-msm

16:36 <steev> bamse: the intf_config ?

16:51 cmeerw has quit [Ping timeout: 480 seconds]

18:08 cmeerw has joined #linux-msm

18:29 <bamse> steev: no, https://lore.kernel.org/linux-arm-msm/20210707043859.195870-1-bjorn.andersson@linaro.org/

18:31 <steev> ohh, i wasn't seeing it in patchwork for some reason

18:36 <bamse> steev: but, that problem only relates to clk_ignore_unused...

18:37 <bamse> steev: the typical thing with the rcgs is that when you change the mux both the old and the new parent must be ticking...so i wonder what them parents are doing in your case
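
The rule bamse states here can be illustrated with a toy model. This is only a sketch of the constraint, not the kernel's clk-rcg2 code: the Clock and ToyRcg classes and the clock names are hypothetical, and the error text simply echoes the message quoted earlier in the log.

```python
class Clock:
    """A parent clock that is either ticking (enabled) or not."""

    def __init__(self, name, enabled=False):
        self.name = name
        self.enabled = enabled


class ToyRcg:
    """Toy root clock generator: a mux that selects one parent clock."""

    def __init__(self, name, parent):
        self.name = name
        self.parent = parent

    def set_parent(self, new_parent):
        # The mux switch only completes if both the old and the new
        # parent are ticking; otherwise the update never latches.
        stopped = [c.name for c in (self.parent, new_parent) if not c.enabled]
        if stopped:
            raise RuntimeError(
                f"{self.name}: rcg didn't update its configuration "
                f"(not ticking: {', '.join(stopped)})"
            )
        self.parent = new_parent


if __name__ == "__main__":
    old = Clock("old_parent", enabled=False)  # e.g. gated as "unused"
    new = Clock("new_parent", enabled=True)
    rcg = ToyRcg("toy_clk_src", old)
    try:
        rcg.set_parent(new)
    except RuntimeError as err:
        print(err)
    old.enabled = True  # turn the old parent back on, then retry
    rcg.set_parent(new)
    print(rcg.parent.name)
```

In this model the reparent fails while the old parent is off and succeeds once it is re-enabled, which is why knowing the state of both parents matters more than which clock reports the error.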

18:38 <steev> not sure - but fwiw, it shows up on both c630s, so it's not specific to that one, at least :)

18:58 <steev> bamse: so stupid/silly question... if i went through there and found the one that is giving me problems and made the same change, should i give that a test or nah?

19:03 <bamse> steev: i don't think that's useful in itself, but figuring out what the old and new parents are would be

19:04 <steev> that is a bit outside my paygrade

19:05 <steev> bamse: oh... one other thing/question - any way we could get the ipa to retry to find the firmware? i've noticed if the module is in the initrd, and the firmware isn't, you have to modprobe -r ipa && modprobe ipa after boot to get it to work

19:07 <bamse> steev: that would be very useful...not sure how to deal with it though

19:07 <bamse> steev: last time i "discussed" this with torvalds i ended up on the front page of theregister.com

19:07 <steev> oh, rip

19:08 <steev> i was gonna say maybe the way uh, adreno does since it retries to find the firmware

19:08 <bamse> steev: that was before his break though, maybe he's up for a new round :)

19:08 <bamse> steev: hmm, does it? need to look at what robclark came up with there then

19:09 <bamse> steev: but, do you have IPA as =y or =m?

19:09 <steev> i have as M

19:09 <bamse> and you have qcom_ipa.ko in the ramdisk?

19:09 <steev> https://github.com/steev/linux/commit/5125b628632223c0c3eb6dc1d7426c7bc2fa59fd#diff-725dee1b5969e869d347d824338588038e684a18c72a5e7afd1fe76c296618ec

19:10 <bamse> because if you have the .ko on the disk the firmware should be on disk as well when the module is loaded

19:10 <steev> it's just ipa.ko

19:10 <steev> in my initrd

19:10 <steev> (i also put the firmware in after figuring that out with an initramfs-tools hook)

19:11 <steev> although... other modules seem to include their firmware in the initrd, is there a reason why that doesn't?

19:11 <robclark> bamse: drm/msm defers loading the gpu and retries on each open of device file (basically so we can get display up without needing gpu fw in initrd).. I'm not really sure I'd recommend it, it adds complexity..
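
The deferral robclark describes can be sketched as a toy model: probe succeeds without firmware, and each open of the device retries the load. This is not the drm/msm implementation; ToyGpuDevice, the dict standing in for the filesystem, and the firmware name are all hypothetical.

```python
class ToyGpuDevice:
    """Toy device that loads its firmware lazily, on open rather than probe."""

    def __init__(self, firmware_store, fw_name):
        self.firmware_store = firmware_store  # dict standing in for the filesystem
        self.fw_name = fw_name
        self.firmware = None

    def probe(self):
        # Nothing is loaded here, so the rest of the driver (e.g. display)
        # can come up even when the firmware is not in the initrd.
        return True

    def open(self):
        # Every open retries the load until it succeeds once.
        if self.firmware is None:
            blob = self.firmware_store.get(self.fw_name)
            if blob is None:
                raise FileNotFoundError(self.fw_name)
            self.firmware = blob
        return self.firmware


if __name__ == "__main__":
    rootfs = {}
    gpu = ToyGpuDevice(rootfs, "toy_gpu.fw")
    gpu.probe()
    try:
        gpu.open()
    except FileNotFoundError:
        print("firmware not available yet")
    rootfs["toy_gpu.fw"] = b"\x00"  # real root filesystem is mounted later
    print(len(gpu.open()))
```

The model depends on having an open call to hang the retry on, which is the anchor point the next messages say the ipa driver lacks.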

19:12 <bamse> robclark: ahh right

19:12 <bamse> robclark: yeah and i don't think we have such an anchor point in the ipa driver

19:12 <steev> well there's no device file until the firmware is loaded there

19:12 <steev> i don't think there is one even after the firmware is loaded?

19:13 <robclark> bamse: maybe we need a missing-fw equiv of -EPROBE_DEFER? Ie. "try this driver again once new fs is mounted"?
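
The idea robclark floats here, a deferred-probe list that is retried when a new filesystem appears, might look roughly like the toy model below. No such kernel mechanism is implied to exist; ToyDriverCore, filesystem_mounted and the driver and firmware names are hypothetical.

```python
class ToyDriverCore:
    """Toy driver core that re-probes firmware-less drivers after a mount."""

    def __init__(self):
        self.available_fw = set()  # firmware files visible so far
        self.deferred = []         # (driver, firmware) pairs waiting to retry
        self.bound = []            # drivers that probed successfully

    def probe(self, driver_name, fw_name):
        if fw_name in self.available_fw:
            self.bound.append(driver_name)
            return True
        # Missing firmware: park the driver instead of failing for good.
        self.deferred.append((driver_name, fw_name))
        return False

    def filesystem_mounted(self, fw_files):
        # A new filesystem may carry the missing firmware, so retry
        # everything that was parked; still-missing ones are parked again.
        self.available_fw.update(fw_files)
        pending, self.deferred = self.deferred, []
        for driver_name, fw_name in pending:
            self.probe(driver_name, fw_name)


if __name__ == "__main__":
    core = ToyDriverCore()
    core.probe("toy_ipa", "toy_ipa.fw")       # initrd has no firmware
    core.filesystem_mounted({"toy_ipa.fw"})   # rootfs shows up later
    print(core.bound)
```

The sketch assumes a single, well-defined "mounted" event, which is exactly the part the discussion goes on to question.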

19:13 <bamse> robclark: that was pretty much where our discussion was heading when torvalds objected

19:14 <bamse> robclark: but i think we need to revive that discussion anyway

19:14 <steev> hm

19:14 <robclark> actually, otoh, couldn't udev do something like that?

19:14 <steev> something i thought of - one thing i notice about these 5.13+ kernels... - before the display comes up "properly" - it's like there is something going on

19:14 <bamse> robclark: because i noticed that arm64 is the one platform with CONFIG_FW_LOADER_USER_HELPER_FALLBACK=y

19:15 <steev> rob, if you remember when i had that doubled up display - it's kinda like that

19:15 <robclark> something pitch related messed up in efifb?

19:15 <steev> i'll take a vid

19:16 <robclark> well, I mean, if it is before drm driver probes, it is efifb

19:16 <bamse> robclark: problem is that "the fs" isn't well defined...

19:16 <steev> https://usercontent.irccloud-cdn.com/file/FkP6SSAe/64876059532__C3A7FD69-D96C-4CE0-9753-9FFC0EEE30ED.mp4

19:17 <steev> Blergh. Of course I get a blue screen this boot

19:19 <steev> where the blue screen happens though, is where it normally boots up showing the kali dragon plymouth screen

19:19 <steev> fine in grub -> breaks -> unbreaks

19:22 <robclark> I guess maybe check if there had been some recent efifb patches.. I was under the impression that it changes rarely.. but if it is before the display driver probes, assuming the fw didn't change recently, it must be an efifb issue

19:24 <steev> https://patchwork.freedesktop.org/patch/428413/ is the last patch that touches efifb here

21:35 cmeerw has quit [Ping timeout: 480 seconds]