JohnnyonFlame has quit [Ping timeout: 246 seconds]
JohnnyonFlame has joined #etnaviv
lynxeye has joined #etnaviv
<marex>
the MX8MM GPCv2 can fail: if gpu2d exits imx_gpc_pu_pgc_sw_pxx_req() in one thread and is just before pm_runtime_put(), and gpu3d enters imx_gpc_pu_pgc_sw_pxx_req() in another thread and is just before pm_runtime_get_sync(), then depending on the order, the gpumix might just be enabled and disabled right away, followed by gpu3d PU enabling, which obviously fails
<marex>
so PD nesting might need some locking work
<marex>
lynxeye: ^
<lynxeye>
marex: I don't quite understand the failure case. If one GPU domain puts the mix domain before the other has reached get_sync() the mix domain will be powered down, but then get powered up again when the thread reaches the get_sync()
<lynxeye>
Is this a problem?
<lynxeye>
If one thread already passed the get_sync() the mix domain should not be powered down, even when the other GPU domain executes the put()
<marex>
lynxeye: the problem is if one thread powers down gpu2d and the other powers up gpu3d ; the first thread can power down the mix domain right after it was powered up by the gpu3d thread
<marex>
unless there is proper synchronization
<lynxeye>
Huh? How do you get the mix domain to power down after one GPU executed the get_sync()? The mix domain power state is reference counted through the genpd code, no?
<marex>
lynxeye: so there should be two references at that point ?
<marex>
lynxeye: are you sure that works if the parent device of all the gpcv2 PDs is the same parent device ?
<lynxeye>
Yes, there should be two references. But it's worth validating this assumption ;)
<marex>
lynxeye: btw did you ever use the SMC-based power management to start the GPU, did that work ?
<marex>
iirc it was flaky for me too
<lynxeye>
I haven't used that at all.
<marex>
lynxeye: ha, ok, well, I'll keep digging
<marex>
lynxeye: well, duh ... imx_gpc_pu_pgc_sw_pxx_req() calls pm_runtime_get_sync(domain->dev); and pm_runtime_put(domain->dev);
<marex>
lynxeye: so if gpu3d thread calls the former, gets rescheduled ... then gpu2d thread calls the latter, gets rescheduled ... then gpu3d tries to enable GPU3D, it must fail
<marex>
yes ?
<marex>
notice the domain->dev
<marex>
that's the same device for all of the PDs
<marex>
so nope, it's all bolted down with mutexes now, and it still fails
pcercuei has joined #etnaviv
berton has joined #etnaviv
T_UNIX has joined #etnaviv
lynxeye has quit [Quit: Leaving.]
hanzelpeter has joined #etnaviv
<marex>
so ok, the weirdness is some gpumix+gpu2d interaction
<marex>
yep, seems that SRC reset of the GPU is needed and fixes the gpu2d misbehavior
<marex>
that means the three gpu PDs on MX8MM are basically unusable because the GPUs share one reset
<marex>
sigh
T_UNIX has quit [Quit: Connection closed for inactivity]
<mntmn>
marex: do they need to be reset all the time or what?
<marex>
mntmn: dunno
<marex>
mntmn: TFA does it every time the PD is brought up
<marex>
I wonder whether, if I bring the PD up, reset the GPUs, and then suspend it again, that would be reliable
<marex>
but obv they should've connected reset to each GPU ... sigh
_daniel_ has joined #etnaviv
<marex>
so nope, toggling the GPU reset in TFA doesn't help
<marex>
that basically means that to bring up either GPU, you either bring up the whole cluster or nothing, sigh
<marex>
so three power domains are useless I guess
hanzelpeter has quit [Quit: leaving]
_daniel_ has quit [Quit: Leaving.]
berton has quit [Remote host closed the connection]