ChanServ changed the topic of #asahi-gpu to: Asahi Linux GPU development (no user support, NO binary reversing) | Keep things on topic | GitHub: https://alx.sh/g | Wiki: https://alx.sh/w | Logs: https://alx.sh/l/asahi-gpu
aratuk has quit []
jhan has quit [Ping timeout: 480 seconds]
jhan has joined #asahi-gpu
jhan has quit [Ping timeout: 480 seconds]
jhan has joined #asahi-gpu
jhan has quit [Ping timeout: 480 seconds]
jhan has joined #asahi-gpu
user982492 has joined #asahi-gpu
jhan has quit [Remote host closed the connection]
Hibyehello has quit [Ping timeout: 480 seconds]
jhan has joined #asahi-gpu
Hibyehello has joined #asahi-gpu
Hibyehello_ has joined #asahi-gpu
Hibyehello has quit [Ping timeout: 480 seconds]
kesslerd has quit [Remote host closed the connection]
Hibyehello_ has quit [Ping timeout: 480 seconds]
Hibyehello has joined #asahi-gpu
Hibyehello has quit [Ping timeout: 480 seconds]
jhan has quit [Remote host closed the connection]
jhan_ has joined #asahi-gpu
jhan has joined #asahi-gpu
jhan_ has quit [Read error: Connection reset by peer]
jhan has quit [Read error: Connection reset by peer]
Hibyehello has joined #asahi-gpu
DarkShadow44 has quit [Quit: ZNC - https://znc.in]
DarkShadow44 has joined #asahi-gpu
Hibyehello has quit [Ping timeout: 480 seconds]
possiblemeatball has quit [Quit: Quit]
jhan has joined #asahi-gpu
Hibyehello has joined #asahi-gpu
user982492 has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]
Hibyehello has quit [Ping timeout: 480 seconds]
Hibyehello has joined #asahi-gpu
Hibyehello has quit [Ping timeout: 480 seconds]
JTL has quit [Remote host closed the connection]
JTL has joined #asahi-gpu
JTL has quit [Remote host closed the connection]
JTL has joined #asahi-gpu
jhan has quit [Remote host closed the connection]
jhan has joined #asahi-gpu
jhan has quit [Ping timeout: 480 seconds]
bisko has joined #asahi-gpu
Hibyehello has joined #asahi-gpu
<lina> alyssa: I can't reproduce your splat but I think I know what happened. I thought there would be something in drm_sched to wait for in-progress jobs when a scheduler/entity gets destroyed, but it doesn't look like it... so you probably had a GPU job submitted, then the userspace process aborted, then the kernel freed all the scheduler stuff and when the GPU job completed it crashed because the scheduler
<lina> was gone.
<lina> To make things even more confusing, the job completion isn't via a job reference, it's via a fence...
<lina> I've added a reference from the job to the scheduler and I think that will fix it... since the job should never get destroyed until the scheduler cleans it up from its main loop, which can only happen when the fence gets signaled or fails, so that should mean the scheduler always outlives the job outlives the fence...
<lina> This ownership/lifetime stuff is so subtle and completely undocumented in C APIs, it's such a mess T_T
<lina> I'll look at the piglit stuff next though I'm less worried about that since I think we've always had corner case GPU crash bugs (as much as I've tried to eliminate them...)
<lina> And GPU crashes are better behaved now, at least it doesn't just hang your system
pjakobsson has joined #asahi-gpu
<lina> Okay, reproduced the sched thing with something deliberate (8Kx8K glmarks getting killed in a loop) ^^
<lina> Let's see if my fix worked...
<lina> Okay, I fixed the splat but I have another issue... I'm leaking slots somewhere with this workload, it runs out
<lina> ... and I can't reproduce it now? ;;
<lina> Why do I get the feeling this is drm_sched again... something like killing the entity stops jobs from being run, but doesn't cancel/free pending jobs...
<lina> Ohhh wait, I think I'm never calling the entity cleanup function. Okay, that one's on me then...
<lina> Wait no I do
jhan has joined #asahi-gpu
<lina> Ah, this could be a bad firmware interaction... I do know I invalidate context before waiting for jobs to complete, which is probably a bad idea. Maybe that just kills things and leaves the jobs dangling, never to complete.
jhan has quit [Ping timeout: 480 seconds]
DarkShadow44 has quit [Quit: ZNC - https://znc.in]
<lina> Nope, this is getting signaled... so why is the scheduler not cleaning this up?
DarkShadow44 has joined #asahi-gpu
stickytoffee has quit [Quit: brb]
DarkShadow44 has joined #asahi-gpu
<lina> And this time I crashed RTKit ^^;;
<lina> That could just be the context issue I mentioned though... but first I want to find out why I'm leaking jobs...
stickytoffee has joined #asahi-gpu
<lina> This is weeeird... the job cleanup callback gets called but the job doesn't get dropped sometimes?
nyilas has joined #asahi-gpu
<lina> Ohh... am I deadlocking by any chance?
<lina> Yeeeah...
<lina> Okay, I can't put a reference to the scheduler in the job, because if it is the last reference dropping the scheduler from the job cleanup callback deadlocks ;;
<lina> Maybe drm_sched_stop() before killing the scheduler will do what I want...?
<lina> No, but that only cleans up completed jobs and detaches the callbacks, it doesn't actually free pending jobs because it assumes you want to restart the queue later...
<lina> I think I need to modify the C side for this, this is just broken, I have no idea how to safely wrap this API without duplicating job tracking...
jhan has joined #asahi-gpu
<lina> OK, I think I finally fixed this one ^^
<lina> Now the piglig thing...
jhan has quit [Ping timeout: 480 seconds]
DarkShadow44 has quit [Quit: ZNC - https://znc.in]
DarkShadow44 has joined #asahi-gpu
jhan has joined #asahi-gpu
jhan has quit [Ping timeout: 480 seconds]
jhan has joined #asahi-gpu
<lina> Reproduced the RTKit crash... now did my GpuContext thing fix it?
<lina> [ 83.025087] asahi 206400000.gpu: Allocator: Corruption after object of type asahi::fw::fragment::RunFragmentG13V12_3 at 0xffffffa00009be00:0x928 + 0x0..0x5
<lina> Ooooo that's a new one
MajorBiscuit has joined #asahi-gpu
jhan has quit [Ping timeout: 480 seconds]
kode54 has quit [Quit: Ping timeout (120 seconds)]
kode54 has joined #asahi-gpu
hightower2 has joined #asahi-gpu
<lina> streaming-texture (or something) is OOMing for me...
<lina> Excluding that though, I got through a piglit run ^^
<lina> Trying again with higher GL...
<lina> Still works ^^
<lina> Let me run with a fix for that corruption warning and see how that goes...
<lina> *Wild guess* those fields might have something to do with preemption, that sounds like the kind of thing piglit would end up triggering...
<lina> I think it's fixed! ^^
<lina> alyssa: Please uprev your kernel, I think I fixed both issues ^^
jhan has joined #asahi-gpu
possiblemeatball has joined #asahi-gpu
<lina> alyssa: Also this is now rebased on 6.2 with DCP changes, so you might need a m1n1 update too (I did)
jhan has quit [Ping timeout: 480 seconds]
jhan has joined #asahi-gpu
kujeger has quit [Quit: ZNC 1.8.2 - https://znc.in]
kujeger has joined #asahi-gpu
kujeger has quit []
kujeger has joined #asahi-gpu
benoyelq has joined #asahi-gpu
jhan has quit [Ping timeout: 480 seconds]
benoyelq has quit []
jhan has joined #asahi-gpu
hightower2 has quit [Remote host closed the connection]
le0n has quit [Ping timeout: 480 seconds]
bluetail9 has quit [Ping timeout: 480 seconds]
le0n has joined #asahi-gpu
ChaosPrincess has quit [Quit: WeeChat 3.8]
ChaosPrincess has joined #asahi-gpu
bluetail9 has joined #asahi-gpu
possiblemeatball has quit [Remote host closed the connection]
jhan has quit [Remote host closed the connection]
jhan has joined #asahi-gpu
jhan has quit [Remote host closed the connection]
hightower2 has joined #asahi-gpu
jhan has joined #asahi-gpu
jhan has quit [Ping timeout: 480 seconds]
jhan has joined #asahi-gpu
jhan has quit []
kesslerd has joined #asahi-gpu
bluetail9 has quit [Ping timeout: 480 seconds]
MajorBiscuit has quit [Ping timeout: 480 seconds]
bluetail9 has joined #asahi-gpu
TheLink7 has joined #asahi-gpu
VinDuv_ has joined #asahi-gpu
thevar1able_ has joined #asahi-gpu
m42uko_ has joined #asahi-gpu
snuck has joined #asahi-gpu
m5zs7k_ has joined #asahi-gpu
Misthios1 has joined #asahi-gpu
merry_ has joined #asahi-gpu
TellowKrinkle_ has joined #asahi-gpu
grange_c68 has joined #asahi-gpu
karolherbst_ has joined #asahi-gpu
Mary6 has joined #asahi-gpu
jnn has joined #asahi-gpu
pbsds3 has joined #asahi-gpu
antonio__ has joined #asahi-gpu
V_ has joined #asahi-gpu
lawrence6 has joined #asahi-gpu
minecrell4 has joined #asahi-gpu
akemin-dayo has joined #asahi-gpu
vup2 has joined #asahi-gpu
Yamakaja has joined #asahi-gpu
merry has quit [Ping timeout: 480 seconds]
TellowKrinkle has quit [Ping timeout: 480 seconds]
ah- has quit [Remote host closed the connection]
karolherbst has quit [Read error: Connection reset by peer]
ah-_ has joined #asahi-gpu
JoshuaAs- has joined #asahi-gpu
stipa_ has joined #asahi-gpu
kit_ty_kate2 has joined #asahi-gpu
ChaosPrincess has quit [Remote host closed the connection]
sneak has quit [Write error: connection closed]
ligma_toad has quit [Read error: Connection reset by peer]
ChaosPrincess has joined #asahi-gpu
mairacanal0 has joined #asahi-gpu
JoshuaAshton has quit [Read error: Connection reset by peer]
grange_c6 has quit [Read error: Connection reset by peer]
m42uko has quit [Write error: connection closed]
grange_c68 is now known as grange_c6
ligma_toad has joined #asahi-gpu
thevar1able has quit [Ping timeout: 480 seconds]
kit_ty_kate1 has quit [Read error: Connection reset by peer]
kit_ty_kate2 has quit [reticulum.oftc.net helix.oftc.net]
nyilas has quit [reticulum.oftc.net helix.oftc.net]
bisko has quit [reticulum.oftc.net helix.oftc.net]
stipa_ has quit [reticulum.oftc.net helix.oftc.net]
Misthios has quit [reticulum.oftc.net helix.oftc.net]
Mary has quit [reticulum.oftc.net helix.oftc.net]
Yamakaja_ has quit [reticulum.oftc.net helix.oftc.net]
TheLink has quit [reticulum.oftc.net helix.oftc.net]
mairacanal has quit [reticulum.oftc.net helix.oftc.net]
V has quit [reticulum.oftc.net helix.oftc.net]
m5zs7k has quit [reticulum.oftc.net helix.oftc.net]
pbsds has quit [reticulum.oftc.net helix.oftc.net]
stipa has quit [reticulum.oftc.net helix.oftc.net]
akemin_dayo has quit [reticulum.oftc.net helix.oftc.net]
minecrell has quit [reticulum.oftc.net helix.oftc.net]
VinDuv has quit [reticulum.oftc.net helix.oftc.net]
vup has quit [reticulum.oftc.net helix.oftc.net]
jn has quit [reticulum.oftc.net helix.oftc.net]
lawrence has quit [reticulum.oftc.net helix.oftc.net]
manawyrm has quit [reticulum.oftc.net helix.oftc.net]
TheLink7 is now known as TheLink
VinDuv_ is now known as VinDuv
Misthios1 is now known as Misthios
lawrence6 is now known as lawrence
akemin_dayo has joined #asahi-gpu
mairacanal has joined #asahi-gpu
pbsds has joined #asahi-gpu
V has joined #asahi-gpu
vup has joined #asahi-gpu
Yamakaja_ has joined #asahi-gpu
minecrell has joined #asahi-gpu
jn has joined #asahi-gpu
manawyrm has joined #asahi-gpu
Mary has joined #asahi-gpu
nyilas has joined #asahi-gpu
kit_ty_kate2 has joined #asahi-gpu
stipa_ has joined #asahi-gpu
minecrell has quit [Max SendQ exceeded]
V has quit [Max SendQ exceeded]
nyilas has quit [Ping timeout: 484 seconds]
Mary has quit [Ping timeout: 484 seconds]
Yamakaja_ has quit [Ping timeout: 484 seconds]
akemin_dayo has quit [Ping timeout: 484 seconds]
pbsds has quit [Ping timeout: 484 seconds]
jn has quit [Ping timeout: 484 seconds]
mairacanal has quit [Ping timeout: 484 seconds]
vup has quit [Ping timeout: 484 seconds]
TellowKrinkle_ is now known as TellowKrinkle
possiblemeatball has joined #asahi-gpu
mairacanal0 is now known as mairacanal
stipa has joined #asahi-gpu
stipa_ has quit [Read error: Connection reset by peer]
kit_ty_kate3 has joined #asahi-gpu
kit_ty_kate2 has quit [Read error: Connection reset by peer]
manawyrm has quit [reticulum.oftc.net helix.oftc.net]
manawyrm has joined #asahi-gpu
minecrell4 has quit []
minecrell has joined #asahi-gpu
aratuk has joined #asahi-gpu
aratuk has quit []
antonio__ has quit [Remote host closed the connection]
aratuk has joined #asahi-gpu
hightower2 has quit [Remote host closed the connection]
minecrell9 has joined #asahi-gpu
minecrell has quit [reticulum.oftc.net helix.oftc.net]
manawyrm has quit [reticulum.oftc.net helix.oftc.net]
manawyrm has joined #asahi-gpu
aratuk has quit [Remote host closed the connection]
aratuk has joined #asahi-gpu
aratuk has quit [Remote host closed the connection]
aratuk has joined #asahi-gpu
minecrell9 has quit []
minecrell9 has joined #asahi-gpu
aratuk has quit [Remote host closed the connection]
Mary6 has quit []
Mary6 has joined #asahi-gpu
aratuk has joined #asahi-gpu
aratuk has quit [Ping timeout: 480 seconds]
jole_ has quit [Remote host closed the connection]
jole has joined #asahi-gpu
aratuk has joined #asahi-gpu