#dri-devel on 2022-05-11 — irc logs at oftc.irclog.whitequark.org

2022-03-22 11:57 ChanServ changed the topic of #dri-devel to: <ajax> nothing involved with X should ever be unable to find a bar

00:01 stuart has quit []

00:05 <zmike> dcbaker: I'm waiting on https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16252 to merge into main and then I'll backport this along with the rest

00:10 mnadrian has joined #dri-devel

00:11 oneforall2 has quit [Remote host closed the connection]

00:13 oneforall2 has joined #dri-devel

00:17 nadrian has quit [Ping timeout: 480 seconds]

00:28 mbrost has quit [Ping timeout: 480 seconds]

00:30 icecream95 has quit [Ping timeout: 480 seconds]

00:39 alanc has quit [Remote host closed the connection]

00:56 co1umbarius has joined #dri-devel

00:57 columbarius has quit [Ping timeout: 480 seconds]

00:59 ManMower has quit [Read error: Connection reset by peer]

00:59 ManMower has joined #dri-devel

01:02 sdutt has quit [Ping timeout: 480 seconds]

01:23 <zmike> dcbaker: okay, ci isn't cooperating so I just pushed now so you can get on with things

01:24 <zmike> I think

01:24 <zmike> lemme double check that I got everything...

01:25 <zmike> okay now now I've pushed everything

01:25 nchery has quit [Ping timeout: 480 seconds]

01:27 <mareko> zmike: no further commnets on 15504

01:27 <zmike> mareko: can I consider that an ab for the gallium commits?

01:27 <mareko> yes

01:27 <zmike> cool thx

01:29 oneforall2 has quit [Remote host closed the connection]

01:32 oneforall2 has joined #dri-devel

01:34 RSpliet has quit [Ping timeout: 480 seconds]

01:38 Company has quit [Quit: Leaving]

01:39 icecream95 has joined #dri-devel

01:48 nchery has joined #dri-devel

01:58 nchery has quit [Ping timeout: 480 seconds]

02:08 lemonzest has quit [Quit: WeeChat 3.4]

02:14 kurufu has quit [Remote host closed the connection]

02:14 aravind has joined #dri-devel

02:14 kurufu has joined #dri-devel

02:21 frankbinns1 has joined #dri-devel

02:24 heat_ has joined #dri-devel

02:24 heat has quit [Read error: Connection reset by peer]

02:27 frankbinns has quit [Ping timeout: 480 seconds]

02:28 kurufu has quit [Remote host closed the connection]

02:29 kurufu has joined #dri-devel

02:30 heat_ has quit [Remote host closed the connection]

02:31 heat has joined #dri-devel

02:38 mbrost has joined #dri-devel

02:49 heat has quit [Remote host closed the connection]

02:50 kurufu has quit [Remote host closed the connection]

02:50 kurufu has joined #dri-devel

02:50 heat has joined #dri-devel

03:00 haasn has quit [Quit: ZNC 1.7.5+deb4 - https://znc.in]

03:01 haasn has joined #dri-devel

03:02 icecream95 has quit [Ping timeout: 480 seconds]

03:06 <dcbaker> zmike: All good for staging/22.1 then?

03:07 <zmike> dcbaker: should be!

03:07 <zmike> had to eat some ci regressions, but I don't think they're going to show up anywhere else

03:12 <dcbaker> zmike: most of those are also queued for 22.0, should I just denominate them and move on?

03:13 <zmike> dcbaker: yeah I'd say don't worry about it

03:13 <zmike> mostly focusing on 22.1 at this point

03:14 <dcbaker> awesome, thanks

03:18 heat has quit [Remote host closed the connection]

03:18 heat has joined #dri-devel

03:26 <dcbaker> @zmike, only 92 patches since rc5, do you think you could come up with 8 more :D

03:27 heat has quit [Remote host closed the connection]

03:27 heat has joined #dri-devel

03:29 <zmike> dcbaker: hm

03:29 <zmike> you could probably pick some of the llvmpipe ones if I missed those

03:30 <zmike> also I hacked in some changes to one patch randomly that I could split out into another patch if you're really set on rounding off to an even 100 :P

03:31 sdutt has joined #dri-devel

03:31 <dcbaker> I picked a couple, and I even tried harder by adding a couple of patches from jekstrand and dj-death

03:31 <zmike> hmm

03:31 <zmike> you could grab ajax's dri cleanup series

03:32 <zmike> that's a solid no-op 6

03:32 <dcbaker> lol

03:33 <zmike> oh is my blitter patch in?

03:33 <zmike> I forgot about that one

03:33 <zmike> 38ab178c4ad

03:35 <dcbaker> auto nominated

03:36 <zmike> alright, lemme actually look at everything you pulled in

03:37 <zmike> yea grab that dri series I think

03:37 <zmike> should be Very Safe

03:37 <zmike> that gets you to 98

03:37 <zmike> the version bump is 99

03:42 <dcbaker> well first it looks like I've got a regression, so I might get to remove or pull a few things in

03:46 heat has quit [Remote host closed the connection]

03:47 heat has joined #dri-devel

03:52 LexSfX has quit []

03:57 frankbinns2 has joined #dri-devel

03:58 frankbinns1 has quit [Read error: Connection reset by peer]

03:59 LexSfX has joined #dri-devel

04:05 stuart has joined #dri-devel

04:12 <dcbaker> zmike: I've got a patch of yours that fails to build on the 22.0 branch, "f1d1371e51 gallivm/draw: fix oob ubo reads"

04:12 <dcbaker> it applies fine, but there's some missing groundwork I didn't find on a quick check

04:13 <dcbaker> not sure what you want to do about it

04:14 <zmike> uhhhhh

04:14 <zmike> I think that one is on top of another patch that's only in 22.1?

04:14 <zmike> so maybe just drop that

04:14 <dcbaker> btw, if you only want to send patches for 22.1 you can do `cc: 22.1 mesa-stable`

04:14 <dcbaker> and the script will ignore them on the 22.0 branch

04:15 <zmike> oh is that syntax live now?

04:15 <zmike> 🤔

04:15 <dcbaker> or, maybe it's `cc: "22.1" mesa-stable`

04:15 <dcbaker> nope, no quotes needed

04:15 <zmike> I thought I did fixes tags on those ones

04:16 <dcbaker> that's always worked with the CC code. I've been meaning to do a `backport: 22.1` and do away with the `cc: mesa-stable` stuff but it's never happened

04:16 <zmike> ah

04:18 flto has quit [Remote host closed the connection]

04:19 flto has joined #dri-devel

04:26 Duke`` has joined #dri-devel

04:33 TMM has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

04:33 TMM has joined #dri-devel

04:43 mbrost has quit []

04:49 dviola has quit [Quit: WeeChat 3.5]

04:59 dviola has joined #dri-devel

05:09 heat has quit [Ping timeout: 480 seconds]

05:23 itoral has joined #dri-devel

05:38 Duke`` has quit [Ping timeout: 480 seconds]

06:10 mszyprow has joined #dri-devel

06:10 itoral has quit [Remote host closed the connection]

06:11 itoral has joined #dri-devel

06:12 itoral has quit [Remote host closed the connection]

06:13 itoral has joined #dri-devel

06:15 tzimmermann has joined #dri-devel

06:20 ahajda has joined #dri-devel

06:22 itoral has quit [Remote host closed the connection]

06:22 itoral has joined #dri-devel

06:26 JohnnyonF has joined #dri-devel

06:28 JohnnyonFlame has quit [Ping timeout: 480 seconds]

06:29 itoral has quit [Remote host closed the connection]

06:30 itoral has joined #dri-devel

06:36 kurufu has quit [Remote host closed the connection]

06:36 kurufu has joined #dri-devel

06:40 itoral has quit [Remote host closed the connection]

06:41 itoral has joined #dri-devel

06:41 sdutt_ has joined #dri-devel

06:43 frieder has joined #dri-devel

06:43 JohnnyonF has quit [Read error: Connection reset by peer]

06:47 JohnnyonFlame has joined #dri-devel

06:47 sdutt has quit [Ping timeout: 480 seconds]

06:56 Daanct12 has joined #dri-devel

07:02 stuart has quit []

07:05 lemonzest has joined #dri-devel

07:06 tursulin has joined #dri-devel

07:10 icecream95 has joined #dri-devel

07:11 JohnnyonF has joined #dri-devel

07:15 Johnny has joined #dri-devel

07:17 danvet has joined #dri-devel

07:17 JohnnyonFlame has quit [Ping timeout: 480 seconds]

07:19 JohnnyonF has quit [Ping timeout: 480 seconds]

07:22 Johnny has quit [Read error: Connection reset by peer]

07:24 gouchi has joined #dri-devel

07:24 gouchi has quit [Remote host closed the connection]

07:26 JohnnyonFlame has joined #dri-devel

07:26 thellstrom has joined #dri-devel

07:52 anujp has quit [Ping timeout: 480 seconds]

07:56 apinheiro has joined #dri-devel

08:00 thellstrom has quit [Ping timeout: 480 seconds]

08:01 lynxeye has joined #dri-devel

08:05 eukara has quit [Ping timeout: 480 seconds]

08:05 anujp has joined #dri-devel

08:06 cheako has quit [Quit: Connection closed for inactivity]

08:10 hansg has joined #dri-devel

08:11 JohnnyonFlame has quit [Read error: No route to host]

08:11 <hansg> mlankhorst, I just tried your latest patch for the BYT rendering issue caused by "Remove short-term pins from execbuf, v6.". Unfortunately it does not work.

08:12 <hansg> Unlike the last 2 days I'm available the entire day (CET) today to test things, so I thought I would jump on IRC and see if we can speedup the debug cycle a bit, by me being available for testing in realtime

08:26 jkrzyszt has joined #dri-devel

08:34 rpigott has quit [Remote host closed the connection]

08:34 rpigott has joined #dri-devel

08:34 RSpliet has joined #dri-devel

08:37 rpigott has quit [Remote host closed the connection]

08:37 rpigott has joined #dri-devel

08:41 itoral has quit [Remote host closed the connection]

08:42 itoral has joined #dri-devel

08:49 frankbinns2 has quit []

08:50 frankbinns has joined #dri-devel

08:53 RSpliet has quit [Quit: Bye bye man, bye bye]

08:54 RSpliet has joined #dri-devel

08:55 ella-0 has joined #dri-devel

08:55 pcercuei has joined #dri-devel

08:57 ella-0_ has quit [Read error: Connection reset by peer]

09:23 rasterman has joined #dri-devel

09:46 rkanwal has joined #dri-devel

10:07 eukara has joined #dri-devel

10:12 Daanct12 has quit [Quit: Leaving]

10:13 Daanct12 has joined #dri-devel

10:19 mvlad has joined #dri-devel

10:21 devilhorns has joined #dri-devel

10:28 icecream95 has quit [Ping timeout: 480 seconds]

10:28 <mlankhorst> hansg: if still there, new version. :)

10:40 Daanct12 has quit [Read error: Connection reset by peer]

10:48 <danvet> hwentlan____, agd5f_ for the psr-su damage tracking comments, do you want me to reply on amdgfx or ok with just the irc thoughts I dropped a few days ago?

10:49 sdutt_ has quit [Ping timeout: 480 seconds]

10:51 karolherbst has quit [Read error: Connection reset by peer]

10:52 karolherbst has joined #dri-devel

10:53 <zzag> emersion: while the compositor holds client buffer's dmabuf file descriptors, can the underlying buffer data be actually destroyed? I've noticed that chromium with native wayland support (chromium --enable-features=UseOzonePlatform --ozone-platform=wayland) is not animated as expected when it's closed, the animation abruptly stops...

10:53 <zzag> Same in weston, chromium doesn't fade out but immediately disappears.

10:53 <emersion> nope, it can't

10:54 <emersion> that's a weston bug

10:54 <emersion> iirc

10:54 <emersion> a dmabuf is ref'counted, and a FD opened in the compositor is a ref

10:55 <emersion> so is an EGLImage

10:55 <daniels> yeah and we do animate those out if you configure it to

10:55 <zzag> Huh, other closed windows that provide dmabuf buffers are animated as expected in weston

10:55 <daniels> you're not using wl_shm rather than dmabuf, are you?

10:55 <emersion> wasn't there an issue about weston_buffer being destroyed when the wl_buffer is?>

10:55 <zzag> No, chromium provides dmabuf client buffers

10:55 <daniels> we don't animate out wl_shm because wl_shm is hugely irritating, but that is fixable now

10:56 <daniels> emersion: was, not anymore

10:56 <emersion> ack

10:56 <zzag> FTR, I do not work on chromium just trying to debug why it doesn't work as expected in kwin

10:56 <daniels> (even then dmabuf/EGLImages got kept via elaborate behind-the-scenes hax)

10:57 <zzag> daniels: emersion: chromium uses drmPrimeHandleToFD() though. I've tried to port relevant code to gbm_bo_get_fd_for_plane(), it didn't help..

11:00 <daniels> zzag: that shouldn't be making any difference though - as emersion says, the compositor has a totally separate ref to the underlying storage

11:00 <emersion> bad drmPrimeHandleToFD usage would just result in corrupted GEM handles in the chromium process

11:01 <daniels> I'd probably just walk back to proto - is it doing something fun with subsurfaces such that no-one ever sees a single toplevel destruction to fade out? is it attaching a NULL buffer to the surface before destroying? etc

11:03 <zzag> daniels: hmm, right, didn't pay attention to subsurfaces

11:03 <zzag> it indeed has a subsurface

11:03 <zzag> and it's destroyed before the main surface

11:05 flacks has quit [Quit: Quitter]

11:05 <tzimmermann> daniels: hi! is anything blocking https://gitlab.freedesktop.org/freedesktop/freedesktop/-/issues/433 ?

11:07 heat has joined #dri-devel

11:07 flacks has joined #dri-devel

11:08 <daniels> tzimmermann: the linear and constrained nature of time

11:08 <daniels> (done it now)

11:08 <tzimmermann> daniels, thanks a lot

11:10 <daniels> np

11:28 Company has joined #dri-devel

11:52 jkrzyszt has quit [Remote host closed the connection]

11:59 <jfalempe> tzimmermann, daniels, thanks a lot ;)

12:00 <jfalempe> now, I need to learn the dim workflow.

12:01 mclasen has joined #dri-devel

12:01 heat_ has joined #dri-devel

12:02 <javierm> jfalempe: you can mostly follow https://drm.pages.freedesktop.org/maintainer-tools/getting-started.html verbatim

12:02 <javierm> jfalempe: feel free to ping me if you need any help with the workflow

12:02 heat has quit [Remote host closed the connection]

12:04 <jfalempe> yes, I was already on this page ;)

12:05 <javierm> jfalempe: great :)

12:10 ivyl has quit [Quit: end of flowers]

12:13 <tzimmermann> jfalempe, don't worry. if you really accidentaly commit garbage (that's hard with dim) danvet can roll-back drm-misc trees for you.

12:14 <graphitemaster> Shot in the dark, is anyone aware of glClearNamedBufferSubData crashes on AMD with a clear size of 1 (ubyte). I'm hitting a null-pointer dereference in the driver on that call with clear sizes of 1 only. Not actually mesa though, this is the Windows AMD drivers

12:14 <danvet> we've had to do that once thus far in years of drm-misc

12:14 <danvet> and I think once more in drm-intel <- jani or do I misremember?

12:15 <HdkR> graphitemaster: Sounds like you hit a driver bug. Good job

12:16 <graphitemaster> :(

12:23 FireBurn has joined #dri-devel

12:27 itoral has quit [Remote host closed the connection]

12:29 calebccff has quit [Remote host closed the connection]

12:29 calebccff_ has joined #dri-devel

12:30 <karolherbst> airlied: rusticl on crocus, how much work would you expect? And what would be something I need to fix within rusticl?

12:32 whald has quit [Remote host closed the connection]

12:39 lumag_ has joined #dri-devel

12:41 flto has quit [Quit: Leaving]

12:43 <tzimmermann> javierm, thanks for providing me with concrete examples.

12:44 <javierm> tzimmermann: you are welcome. Probably I should add something like that in the commit message

12:45 <javierm> tzimmermann: basically what we want to prevent is modprobe vc4 && modprobe simpledrm to create two /dev/dri/card{0,1}

12:46 <javierm> the latter whould be a no-op instead

12:48 <javierm> *should

12:49 <tzimmermann> javierm, i'll go through it

12:49 <javierm> tzimmermann: thanks a lot for looking at the patches

12:49 <javierm> tzimmermann: don't you prefer for me to re-send with the different split that you asked ?

12:51 ivyl has joined #dri-devel

12:55 <tzimmermann> javierm, maybe for the next iteration. with your explanation, i'll manage for now

12:58 <javierm> tzimmermann: Ok, perfect

13:16 rkanwal has quit [Ping timeout: 480 seconds]

13:19 <danvet> tzimmermann, javierm once more trying to catch up on mails, I've dropped some comments on the huge fbdev hotunplug discussion

13:19 <javierm> danvet: thanks. And sorry that this thread deralied that much

13:20 <danvet> oh from skimming at least I think it covers a lot of really tricky questions

13:21 <danvet> and it's probably good if we keep them in mind and make sure we have a rough agreement on what the code ideally should look like

13:24 <javierm> danvet: right. The tl;dr I think is that the issues are not easy to fix and would require a lot of work but probably makes sense to distill them and at least have them in Documentation/gpu/todo.rst

13:25 <jfalempe> tzimmermann, my v2 of gamma for mgag200 is almost ready.

13:25 <jfalempe> But if I remove the call to drm_mode_crtc_set_gamma_size(), then it doesn't work anymore.

13:27 sdutt has joined #dri-devel

13:29 <jfalempe> I didn't find why, if it's in the kernel, or mutter looking for legacy interface.

13:29 <javierm> danvet: I'll try to do that btw to make sure that are tracked. But are too big issues for me to tackle :)

13:30 <danvet> javierm, yeah sounds like a great idea, thanks a lot

13:30 <danvet> (documenting in todo.rst I mean)

13:42 heat has joined #dri-devel

13:42 heat_ has quit [Read error: Connection reset by peer]

13:45 ella-0 has quit [Remote host closed the connection]

13:47 <tzimmermann> jfalempe, if I may make an educated guess about gamma_size: i'd say that userspace reads it via ioctl from here: https://elixir.bootlin.com/linux/v5.17.6/source/drivers/gpu/drm/drm_crtc.c#L483

13:47 rkanwal has joined #dri-devel

13:49 <emersion> tzimmermann: that's legacy gamma

13:49 <emersion> atomic gamma goes through the GAMMA_LUT_SIZE prop

13:49 <emersion> which can be different

13:49 <ajax> tzimmermann: hey, you pinged me the other day about some horrid g200se patch. i think i do remember the context for that patch, what was your question about it?

13:50 <tzimmermann> emersion, indeed. but what do i know what gnome does ;)

13:50 Guest90 has quit []

13:51 <tzimmermann> jfalempe, emersion. and gnome presumably also sets gamma via legacy ioctl. which in turn actually updates the atomic property with the call here: https://elixir.bootlin.com/linux/v5.17.6/source/drivers/gpu/drm/drm_color_mgmt.c#L400

13:52 <emersion> eh, fun

13:53 <tzimmermann> jfalempe, emersion: i don't have the big picture here, but we might be able to fix this easily by updating gamma_size as part of enabling the gamma properties in https://elixir.bootlin.com/linux/v5.17.6/source/drivers/gpu/drm/drm_color_mgmt.c#L160

13:54 <jfalempe> tzimmermann, yes, but maybe it's also good to be backward compatible with the "legacy" gamma api ?

13:55 aravind has quit [Ping timeout: 480 seconds]

13:55 <ajax> Updating 742a8732095..ac0a61e17b8

13:55 <ajax> Fast-forward .gitlab-ci/valve/b2c.yml.jinja2.jinja2 | 2 +-

13:55 * ajax squints

13:55 <tzimmermann> jfalempe, it would finally make this compatible

13:56 <tzimmermann> jfalempe, you've discovered a bug :)

13:56 <tzimmermann> jfalempe, that's maybe worth a separate patchset to fix all drivers. for now, you're welcome to leave the drm_mode_crtc_set_gamma_size() in mgag200

13:56 <tzimmermann> maybe with an updated comment

13:56 <jfalempe> ok, I will do that.

13:57 <tzimmermann> ajax, my question is why this is for g200se only

13:57 <tzimmermann> ?

13:57 <tzimmermann> what about all the other devices with 2 MiB ?

13:58 <tzimmermann> and the test covers devices with 'less than 2MiB' . how little memory does the g200se actually have. even the old matrox cards from the mid-90s had 2 MiB at least

13:59 <ajax> there's not a "the" g200se, sadly

13:59 <ajax> there's a few models and the very very early ones had something like 1.75M of vram

14:00 <ajax> but then jumped to 8M or more later on

14:00 <karolherbst> jekstrand: what kind of intel hw do you have available?

14:00 <ajax> so that patch only addresses g200se because that's the only extant thing i cared about, because nobody has a 2M actual G200

14:01 <jekstrand> karolherbst: Tigerlake and Haswell

14:01 <ajax> if you wanted to generalize it to all low-memory devices, go for it, but probably that's better done inside xserver?

14:01 <jekstrand> And I suppose Skylake if I can get ksim to work. :joy:

14:01 hansg has quit [Quit: Leaving]

14:01 * jekstrand waits for krh to show up suddenly.

14:01 <karolherbst> jekstrand: mhhhh.. haswell is crocus, isn't it?

14:01 <jekstrand> karolherbst: yes

14:02 <karolherbst> do I have to ask or do you already know what I'd ask next? :P

14:02 <tzimmermann> 1.75 MiB. that's terrible

14:02 <karolherbst> jokes aside, I think iris should be good to go across all gens as long as we keep fp64 disabled

14:02 <ajax> (actually now i remember why i didn't do that as an xserver heuristic, i don't think the drivers tell us available vram early/consistently enough to let us make that kind of decision)

14:03 <karolherbst> should try it out here on CMT-H

14:03 <ajax> tzimmermann: rhel5 still had barely enough 8bpp support that it was "plenty", in their minds

14:03 <tzimmermann> ajax, that code has been ported into the kernel meanwhile. i was working on mgag200 and found that change, which i tracked back into the x11 mga driver.

14:03 <ajax> but then rhel6 dropped pseudocolor

14:03 <ajax> and the IHV still wanted the same machine to be certified with rhel6

14:03 <ajax> so

14:04 <tzimmermann> i see

14:04 rexbcchen_ has joined #dri-devel

14:04 <tzimmermann> thanks for giving me some context

14:04 <ajax> but yeah, 1.7M made me spit my coffee out

14:04 <karolherbst> jekstrand: but how do we file for conformance from a practical pov, everybody runs it on their machine with the hardware, or should one have all the hardware and test some gens? Not sure what we should be end up doing

14:04 <ajax> they had to go out of their way to be that dumb

14:04 <ajax> even then, you couldn't _source_ ram that small

14:05 <tzimmermann> :)

14:05 <tzimmermann> i'll keep all this in mind when working on the code

14:05 ella-0 has joined #dri-devel

14:09 <tzimmermann> 1.7 mib. i'm getting ptsd from that number. my first computer's graphics card was a trident 9000 with 512 kib of vram. it was total garbage, even back then

14:10 rxbcchen has quit [Ping timeout: 480 seconds]

14:13 <alyssa> karolherbst: running the full CTS (with the actual cts-runner) is the tricky part

14:13 <karolherbst> nah, that's fine

14:14 <alyssa> once you have a passing run with the current CTS version (I screwed that up), actually doing the submission is trivial

14:14 <alyssa> at least for GLES, I assume CL is similar

14:14 <karolherbst> sure, but I mean how do you run it against multiple harwdare?

14:14 <alyssa> so multiple people doing independent submissions for respective hardware would be fine

14:14 <karolherbst> you just collect the outputs and submit them in one go?

14:14 <alyssa> I mean

14:15 <alyssa> iris and llvmpipe are going to be separate submissions, first of all

14:15 <karolherbst> ahh

14:16 <alyssa> llvmpipe would be just one submission

14:16 <alyssa> iris... I guess you have some flexibility there

14:16 <alyssa> it looks like a single submission for hardware in the same family is acceptable

14:17 <alyssa> the realistic answer is that conformance submission is boring and you probably don't care to submit for every random Intel platform ever :-p

14:18 <karolherbst> yeah...

14:21 ella-0_ has joined #dri-devel

14:23 <krh> jekstrand: you rang?

14:24 ella-0 has quit [Read error: Connection reset by peer]

14:26 ella-0 has joined #dri-devel

14:28 <orbea> anyone mind reviewing this trivial one liner build fix? https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16429?commit_id=b5985b59162764cd0a60fa4b4b64a34539655358

14:29 <orbea> err, better link https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16429

14:29 ella-0_ has quit [Read error: Connection reset by peer]

14:36 <jekstrand> krh: You came in a whole page down from my ksim mention. I'm dissapointed. :P

14:36 <jekstrand> karolherbst: Ideally, one person has a giant pile of hardware and does all the runs.

14:38 <jekstrand> karolherbst: I could have done it but someone made me send all my hardware back.

14:38 * jekstrand glares at Ryback_ :P

14:44 <MrCooper> tzimmermann: my first computers had 64 kB RAM total :) then the Amiga 500 had 512 kB total stock

14:46 ppascher has quit [Ping timeout: 480 seconds]

14:48 rexbcchen has joined #dri-devel

14:48 ella-0_ has joined #dri-devel

14:49 <jani> danvet: yeah, someone accidentally pushed drm-tip to drm-intel-next I think (or drm-intel-next-queued back then)

14:49 <jani> danvet: I've added the "are you sure" question on pushing > 10 patches or any merges since then

14:51 rexbcchen_ has quit [Ping timeout: 480 seconds]

14:51 ella-0 has quit [Read error: Connection reset by peer]

14:53 <karolherbst> jekstrand: yeah... I have a bunch of laptops here now, but I suspect they are all form the last 2-3 gens :D

14:53 <karolherbst> anyway.. CMT seems to have some issues

14:54 <danvet> jani, yeah that one plus the "yes this is the dim script not raw git" check server side

14:54 <danvet> since then no accidents anymore iirc

14:55 <jekstrand> karolherbst: There are only really 4 gens you care about: BDW (maybe), Gen9 (SKL, KBL, CFL, etc.), ICL, and TGL.

14:55 TMM has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

14:55 TMM has joined #dri-devel

14:59 ppascher has joined #dri-devel

14:59 <karolherbst> ahh, okay

15:03 <danvet> jekstrand, small gen9 aka apl?

15:03 <danvet> or you just don't bother with that one

15:07 <jekstrand> Sure, maybe

15:14 flto has joined #dri-devel

15:15 <karolherbst> jekstrand: api min_max_read_image_args fails on gen9

15:15 <karolherbst> not sure if I messed up rebasing or not: test_api: ../src/gallium/drivers/iris/iris_program.c:946: iris_setup_binding_table: Assertion `bt->sizes[i] <= SURFACE_GROUP_MAX_ELEMENTS' failed.

15:17 <karolherbst> something is up with int64 lowering as well

15:18 <karolherbst> oh well..

15:18 <karolherbst> let's see when I'll have time for that 🙃

15:19 stuart has joined #dri-devel

15:24 <jekstrand> karolherbst: Let me see if I can find a SKL NUC cheap on EBay

15:24 neonking has quit [Remote host closed the connection]

15:26 <alyssa> jekstrand: I hit the retry button on https://gitlab.freedesktop.org/mesa/mesa/-/jobs/22530859 for you

15:26 <alyssa> looks like gl_simpleaaclip_aaclip flaked?

15:26 neonking has joined #dri-devel

15:29 ella-0_ has quit [Remote host closed the connection]

15:30 mszyprow has quit [Ping timeout: 480 seconds]

15:35 <jekstrand> alyssa: :(

15:40 sdutt has quit [Ping timeout: 480 seconds]

15:40 mbrost has joined #dri-devel

15:41 <MrCooper> "llvmpile", now there's a Freudian slip

15:42 <alyssa> blink

15:42 * alyssa scalarizes her IR

15:43 <karolherbst> alyssa: you are still using vectors?

15:43 * karolherbst has a bad plan on how to scalarize nir

15:44 <alyssa> Scalarize NIR how

15:44 <alyssa> I mean in the sense of ACO

15:44 <karolherbst> vectors are just big types, no?

15:44 <alyssa> Yeah... in effect, forbidding vec4 but allowing 128-bit scalars

15:44 <alyssa> with SPLIT and COLLECT pseudoinstructions

15:44 <karolherbst> or well.. 1024 for CL

15:44 <alyssa> Woof

15:45 <karolherbst> that would even help us, because our tex instructions just take multiple 128 bit sources :P

15:47 <karolherbst> although I slowly come to the conclusion that on nv hw, we literally only have 128 bit registers

15:47 <alyssa> heh, that's the case on Midgard

15:47 <alyssa> AGX only has 16 bit registers

15:47 <karolherbst> we still index them in a scalar way though

15:47 <karolherbst> but you can only allocate multiple of 4 regs (which are 32 bit)

15:48 <alyssa> I'm undecided about Bifrost. I think all 32-bit but I guess there's some debate.

15:48 <tzimmermann> MrCooper: c64?

15:48 <karolherbst> alyssa: my thinking is more like.. if we just go for "128 bit all the way" you wouldn't need split/collect

15:49 <karolherbst> mhhh

15:49 <karolherbst> I should stop thinking about terrible ideas

15:49 <MrCooper> tzimmermann: that was numbers 2 & 3, the first one was a Canon MSX

15:49 <alyssa> That's what we do for Midgard, it works ok because the machine is natively vec4 with a 128-bit data path

15:49 Duke`` has joined #dri-devel

15:49 <karolherbst> alyssa: ahh

15:49 <alyssa> It's a terrible idea for a scalar ISA though, since the top 96-bits are almost always wasted and your memory footprint bloats up

15:50 <karolherbst> we don't have 96 bit ops

15:50 <alyssa> (Depending on impl details)

15:50 <karolherbst> ehh wait

15:50 <karolherbst> that's not what I meant

15:50 eukara has quit [Remote host closed the connection]

15:50 <tzimmermann> i've not even heard the msx :)

15:50 <alyssa> what data structures would you use for liveness analysis? etc

15:50 <karolherbst> you still assign to "vector components"

15:50 <karolherbst> and use them as sources

15:50 eukara has joined #dri-devel

15:50 <karolherbst> and RA fills up vecs in a packed way

15:51 <karolherbst> alyssa: the bad thing is, at least for nvidia hardware the RA needs to be vector aware

15:51 <karolherbst> even if it's a scalar ISA by nature

15:51 <karolherbst> it is really annoying

15:52 <karolherbst> it's also just one register file, so 32/64/128 bit sources share the same index

15:56 <MrCooper> MrCooper: MSX was a home computer standard, many of the big Japanese manufacturers made machines conforming to it; MS stands for Microsoft BTW (the BIOS and BASIC interpreter in ROM was from Microsoft)

15:56 <MrCooper> err tzimmermann ^

15:57 tango_ has quit [Ping timeout: 480 seconds]

15:58 tango_ has joined #dri-devel

15:59 sdutt has joined #dri-devel

16:03 tango_ is now known as Guest397

16:03 tango_ has joined #dri-devel

16:04 eukara has quit [Remote host closed the connection]

16:04 gouchi has joined #dri-devel

16:04 gouchi has quit [Remote host closed the connection]

16:04 eukara has joined #dri-devel

16:06 <karolherbst> mhhh

16:06 <karolherbst> float rtn/rtp/rtz conversions to (u)long are failing

16:06 <karolherbst> the other way around

16:08 eukara has quit []

16:08 <karolherbst> mhh, same for clz and popcount

16:08 <karolherbst> I bet something is wrong with the int64 lowering :)

16:08 <karolherbst> " Failure for popcount( (cl_long) 0x8c7f0aac ) = *33 vs 16" mhh

16:09 Guest397 has quit [Ping timeout: 480 seconds]

16:13 maxzor has joined #dri-devel

16:19 <jekstrand> karolherbst: Other way around. SKL has "real" int64. :)

16:19 <karolherbst> ahhh

16:19 <karolherbst> well

16:19 <karolherbst> looks broken :P

16:20 <karolherbst> I bet the code assumes 32 bit for some opcodes though

16:20 <jekstrand> Sounds believable

16:29 frieder has quit [Remote host closed the connection]

16:30 lynxeye has quit [Quit: Leaving.]

16:38 ambasta[m] has joined #dri-devel

16:44 <jekstrand> tarceri: I'll try to get back to the loop unrolling MR today if I can but I didn't sleep well last night so my brain isn't running at 100% and that's definitely an at least 90% brain MR. :)

16:44 i-garrison has quit [Remote host closed the connection]

16:44 <danvet> robclark, airlied I still think at least the gitlab yaml should be put quite a bit higher under drivers/gpu/ci or drivers/gpu/drm/ci

16:44 i-garrison has joined #dri-devel

16:44 <danvet> and then filters to arm the hw tests per driver or something like that

16:44 <danvet> if every driver tree rolls their own for this we have an endless mess

16:45 <danvet> but that's maybe a bikeshed for us to figure out, not drag greg&linus in with

16:45 <danvet> it might also help with making a better case if it's sw/build/kunit testing for everyone + a set of drivers with hw ci integration

16:46 <danvet> maybe drivers/gpu/drm/ci so it's still drowned in the driver updates in the diffstat :-)

16:46 apinheiro has quit [Ping timeout: 480 seconds]

16:52 AndrewR has quit [Ping timeout: 480 seconds]

16:55 AndrewR has joined #dri-devel

17:03 gio has quit []

17:08 gio has joined #dri-devel

17:12 <danvet> tomeu, ^^ too I guess

17:34 FireBurn has quit [Ping timeout: 480 seconds]

17:37 jeeeun84 has quit []

17:37 mbrost has quit [Read error: Connection reset by peer]

17:37 maxzor has quit [Ping timeout: 480 seconds]

17:38 jeeeun84 has joined #dri-devel

17:45 nchery has joined #dri-devel

17:48 <robclark> danvet: perhaps.. although the current thing still allows and even encourages re-use.. the toplevel yml file is just a tiny shim

17:48 <robclark> (it encourages in the sense of if you re-use it you don't have to write piles of yml yourself.. which seems like a pretty big incentive)

17:48 <zackr> tzimmermann: is it ok if i cherry-pick those three vmwgfx patches to drm-misc-next-fixes from drm-misc-next or do you want to merge drm-misc-fixes into drm-misc-next-fixes first?

17:52 rkanwal has quit [Ping timeout: 480 seconds]

17:57 <tzimmermann> zackr, it's not my turn. either mripard or mlankhorst is on duty

17:57 <danvet> robclark, yeah but I kinda thing most of that should be pulled into the kernel anyway

17:57 <danvet> robclark, also I'm mildly worried that if we start littering the entire project with ci/ directories there's going to be a kneejerk reaction

17:58 <danvet> so one ci/ directory for everything that's hosted on fd.o sounds like a better approach

17:58 <tzimmermann> zackr, in principle -misc-fixes should be forwarded to -rc6 state and you could put your fixes into it

18:01 devilhorns has quit []

18:01 <robclark> danvet: but I assume you still end up w/ a drm/$driver/ci directory for expectation files.. possibly the .rst doc should move to something a bit less driver specific (but there isn't really too much driver specific in it already)

18:04 <danvet> robclark, I guess we could stuff them all into one

18:04 <danvet> or just into the driver dir directly if it's the only thing really

18:05 * airlied would like to keep ci files with the drivers if possible, though how to we deal with uprevs of the igt in ci? mega commits in next/fixes?

18:06 <robclark> hmm, I suppose I kinda defaulted to "put those in driver specific location" since that is how we do it for mesa and it works well there.. (the separate locations I think also helps with the rules about what CI jobs to run given which files changed)

18:07 <robclark> airlied: the toplevel yml file should be (and will be in next rev of RFC) pointing at an explicit revision of drm-ci .. which in turn points at explicit revision of igt

18:07 <robclark> I guess if we have same igt version for all drivers, then it would be a bit of a mega-commit

18:07 <robclark> if we keep igt version per-driver, that seems a bit more manageable..

18:09 <airlied> robclark: so when it pokes at the CI system, it will run the specific revision of the drm-ci per driver?

18:09 <airlied> like I push drm-next and it will build, but then run separate revisions on each hw platform?

18:09 <airlied> tomeu: ^?

18:11 <robclark> I *think* so.. or hmm, I guess it depends on which gitlab tree is running the CI.. because that is where you configure where to pull the toplevel gitlab-ci.yml

18:12 <robclark> we will anyways need separate builds for x86 vs arm.. it seems like separate builds per driver would make merging things thru per-driver next trees easier.. ie. I don't think we should try and use a single common version until all of drm-next is gitlab MR's in a single tree :-P

18:13 <zackr> tzimmermann: got ya. thank you

18:13 <airlied> robclark: ideally I'd like to be able to push drm-next without MRs and validate the in-tree CI results pass

18:14 <zackr> mripard, mlankhorst: are you ok with me fast-forwarding drm-misc-next-fixes to rc6 and adding three small vmwgfx fixes or do you want to fast-forward yourself?

18:14 <danvet> airlied, I'm also thinking of stuff like selftests and build tests

18:14 <airlied> with a single toplevel gitlab-ci.yml

18:14 <danvet> which would be really good to standardize across every driver

18:14 <danvet> hw tests per-driver is the way to go ofc

18:14 <danvet> but even there maybe we get to a point where we can run hw tests for core drm changes too

18:14 * danvet can dream

18:15 <danvet> airlied, yeah essentially top level gitlab-ci.yml for drm

18:15 <danvet> that's the key piece, the other bits we can bikeshed around

18:16 <robclark> airlied, tomeu: yeah maybe running the same thing in drm-next as in msm-next is a good argument for having drm/ci/gitlab-ci.yml instead.. but I think it is not going to be uncommon to need a scheme where we have a branch with a patch or two outside of drm which is required (since -next is usually branched off an early -rc)

18:16 * airlied just feels it would be a conflict nightmare in-tree

18:17 <danvet> meaning, as long as there's not too much copypaste between drivers doing the same stuff, I don't really care strongly one way or another

18:17 <danvet> robclark, yeah but -next should then also keep the same frozen igt sha1 and test list and all that?

18:17 <danvet> and if you then send in an msm update, you'd update the msm expectations

18:17 <robclark> hmm

18:18 <danvet> and it /should/ all keep fitting

18:18 <danvet> and the top-level ci has the usual filters to only run msm hw tests when you guys think it's relevant

18:18 <danvet> and since that hopefully matches the diffstat of the merge commit, it should all fire when the msm pull is integrated into drm-next

18:18 <robclark> airlied: yeah, maybe the thing to do is, if we are going to do an igt uprev, do it immediately after drm-next branches, and before other drivers branch off of drm-next.. and then keep it stable over the release cycle

18:18 <danvet> so if there's breakage in helpers/core it should show up

18:19 <danvet> robclark, drm-misc and drm-intel don't base upon drm-next, they just keep floating

18:19 <danvet> I think amd is similar-ish (at least the internal tree)

18:19 <robclark> backmerge?

18:19 <danvet> yeah, pretty much backmerges only

18:20 <danvet> but upreving igt to latest every time we open drm-next sounds like a good idea

18:20 <robclark> yeah, I think if we have single igt version for all drivers, we pretty much have to do it that way

18:20 <danvet> flip side of this is also that igt should probably test against the latest kernel release to make sure it keeps working well enough

18:20 <danvet> robclark, oh I was still thinking per-driver overwrite for at least the hw tests

18:21 <robclark> fwiw, tomeu does have some patches to add gitlab ci job for igt itself ;-)

18:21 <danvet> but stuff like vkms or anything else that's pure sw would upgrade in lockstep

18:21 <danvet> anything else = maybe llvmpipe + virgl or so in a pure vm setup

18:21 <danvet> as long as you only run kms tests that should be ok-ish

18:21 <danvet> or very basic igt

18:22 <robclark> hmm, I think vm or hw is kinda orthogonal to the question of per-driver igt version or not..

18:23 <danvet> robclark, yeah maybe external dependencies matter more

18:23 <danvet> but my reasoning here is that pure virtual you should be able to debug, so we could try to make sure it really always works

18:24 <danvet> vs hw testing can go boom in funny ways, so if you want to uprev igt it might need some hw access or similar

18:24 <robclark> oh, sure.. but I guess no one has ever debugged $other_hw via gitlab ci in mesa ;-)

18:24 <danvet> also maybe igt eventually becomes solid enough so that this is much less a problem

18:24 <danvet> robclark, sure you can do, I just mean it's getting more annoying

18:25 <danvet> plenty of people use intel-gfx-ci the same way to debug stuff on hw they don't have

18:25 * karolherbst wants to abuse intel-gfx-ci to run the CL CTS

18:25 <danvet> but eventually you should bite the bullet and stop wasting a few weeks of machine time every time you move your printk around :-)

18:25 <karolherbst> :P

18:25 <danvet> karolherbst, we run piglit

18:25 <karolherbst> meh

18:25 <danvet> it just almost never breaks, so the results aren't even listed if you don't go digging real hard

18:26 <karolherbst> I should probably fix the broken CL tests in piglit

18:26 <danvet> I think we also at least planned to run crucible

18:26 <robclark> danvet: yeah, after a couple tries you ping someone who works on the driver to beg for help ;-)

18:26 <danvet> the thing is, imo running cts for userspace for kernel ci is massively wasted machine time if you don't bother to at least focus it somewhat on hw testing

18:26 <danvet> like blowing through intel-gfx-ci time to run compiler unit tests is a bit silly

18:27 <danvet> robclark, :-)

18:27 <robclark> (I do want to run deqp.. but heavily cut down, not a full run)

18:28 <danvet> robclark, yeah there's definitely some value in that to make sure render engine state doesn't get thrashed or some w/a or clock gating bit fell off and now some oddball sampler mode is busted

18:29 <danvet> hence why we run piglit and like to add a few more of these

18:30 <robclark> yeah, it could be piglit instead of deqp, although I think deqp generally does better result validation.. I don't have a strong preference, just want some sanity testing

18:30 <robclark> and it is a bit much to construct cmdstream to get 3d pipe and shader core going in igt

18:31 LexSfX has quit []

18:31 tzimmermann has quit [Quit: Leaving]

18:31 <danvet> robclark, yeah we only have a very basic one to do copy operations on the render engine

18:32 <danvet> helps with some kernel validation, but very much _not_ mean to validate the render engine works correctly

18:32 <danvet> and yeah if deqp has a nice test list for hw testing maybe we could switch to that in intel-gfx-ci too

18:32 <danvet> or both

18:32 * danvet dunno

18:34 LexSfX has joined #dri-devel

18:34 <robclark> was kinda just thinking a fractional run (like 1 out of 10) of dEQP-GLES3 or something along those lines

18:39 eukara has joined #dri-devel

18:39 jhli has quit [Remote host closed the connection]

18:39 jhli has joined #dri-devel

18:44 <danvet> robclark, hm yeah maybe, as long as your fraction is stable

18:44 <danvet> I guess hash the testname

18:44 AndrewR has quit [Ping timeout: 480 seconds]

18:46 mbrost has joined #dri-devel

18:47 <alyssa> danvet: I don't have a horse in the race, but my kneejerk preference is for dEQP over Piglit, given our experiences with the stability of each in Mesa CI

18:52 <robclark> danvet: we could also manually generate a fractional test list.. ofc there is the same issue as with igt that the test names *could* change across deqp/piglit/whatever uprev.. but that could be handled the same way (ie. kernel commit that updates commit sha of drm-ci tree in sync w/ updating test list and expectations)

18:53 <alyssa> robclark: I also think the subset depends strongly on the hardware

18:53 <alyssa> for (current and old) Mali, the kernel interface is extremely thin. just pass a pointer from userspace and go.

18:54 <alyssa> There's really not much for the kernel to screw up. If kmscube works and a deqp test doesn't, 99.9% chance it's a mesa bug, not a kernel one

18:54 AndrewR has joined #dri-devel

18:54 <alyssa> compare with PowerVR (and Apple by extension), where the kernel knows all sorts of gross details about depth/stencil attachments, occlusion queries, etc

18:55 <alyssa> suddenly getting specific coverage for those areas is critical

18:55 <alyssa> [I would like to reaffirm my belief that having the kernel know about graphics state is batshit.]

18:56 <danvet> +1

18:56 <danvet> like wtf who designed that

18:56 <robclark> I could see what (and if) other drivers want to run deqp as being driver specific.. for msm we do have some igt tests that exercise the CP but that doesn't mean that (for ex) we actually managed to power up other parts of the gpu

18:56 <robclark> and doing enough cmdstream building in igt to validate that means *tons* of gen specific code

19:01 rkanwal has joined #dri-devel

19:01 * Lyude is somewhat thankful it's really easy to do cmdstreams for nouveau from igt

19:02 <Lyude> (we have to do this for clearing memory (which will likely be handled by the kernel eventually), and blitting between different image formats

19:23 rexbcchen has quit [Read error: Connection reset by peer]

19:24 rexbcchen has joined #dri-devel

19:25 <karolherbst> nice

19:25 <karolherbst> we have a fix for the half types in llvm :)

19:25 <karolherbst> so we really only need to figure out what to do with the defined extensions

19:32 <karolherbst> jenatali: there are plans to drop support for those opencl headers, and I think we should try to move over with llvm-15 (I have patches), but it would be good if you could verify that things will work for you as well. I'll submit an MR with all relevant clc changes for that soonish

19:32 <karolherbst> but I'll wait until the patch made it into the tree

19:32 <karolherbst> ehh.. the patch fixing vload/vstore for halfs

19:32 <jenatali> Sounds good

19:33 <karolherbst> tldr: llvm-17 will drop opencl-c.h (RFC: https://discourse.llvm.org/t/rfc-deprecation-timeline-for-opencl-c-h/61585)

19:33 <karolherbst> or maybe later, we'll see :)

19:33 <karolherbst> but the speed up in compilation time is massive

19:34 <karolherbst> it literally shortens testing time on hw hhere by over 50% :)

19:42 * emersion refreshes https://github.com/NVIDIA/open-gpu-kernel-modules

19:44 <ajax> alyssa, danvet: sgi

19:44 <karolherbst> emersion: yeah......

19:44 <alyssa> ajax: emersion: ?

19:44 <emersion> https://www.nvidia.com/download/driverResults.aspx/187834/en

19:44 <ajax> irix kernel was very much The Graphics Server

19:44 * emersion hopes this is what he thinks this is

19:45 <heat> emersion, WHAT

19:45 <karolherbst> emersion: 🙃🙃

19:45 <ajax> most of the weird decisions about glx (that egl inherited) make almost sense if you look at them through the Xsgi lens

19:45 <emersion> heat: that is the correct reaction indeed

19:45 <alyssa> emersion: nice, publish reclocking firmware next? :p

19:45 <ajax> like why the kernel would know about framebuffer attachments

19:45 <heat> is this legit? Am I dreaming?

19:45 * karolherbst stops working for the day and deal with discussion on IRC

19:45 <emersion> alyssa: one step at a time

19:45 <alyssa> ajax: hey AGX wants that too! :p

19:45 <alyssa> or at least macOS does, not sure

19:46 <karolherbst> heat: yep, it is

19:48 <karolherbst> (or at least I think so)

19:48 <airlied> emersion: where did you get that url? seems to 404 here

19:48 <emersion> airlied: in the nvidia.com link below

19:48 <emersion> yeah it 404s right now

19:48 <karolherbst> oh no :(

19:49 <emersion> maybe they haven't pushed the "make public" button just yet

19:49 <karolherbst> not here

19:49 <karolherbst> well

19:49 <karolherbst> it still works for me

19:49 <emersion> "still"?

19:49 <karolherbst> ohh you mean the github?

19:49 <emersion> yes

19:49 <karolherbst> I thought you meant the download

19:49 <emersion> oh i don't really care about binaries

19:49 <karolherbst> airlied: the URL is on the download page

19:50 <karolherbst> "Published the source code to a variant of the NVIDIA Linux kernel modules dual-licensed as MIT/GPLv2. The source is available here: https://github.com/NVIDIA/open-gpu-kernel-modules"

19:50 <heat> is it going to be useful?

19:50 <heat> I've noticed they're not publishing the userspace part

19:51 <karolherbst> yeah... not sure yet

19:51 <karolherbst> I downloaded the archive to check if there is anything useful in it

19:51 <emersion> if it has KMS in it, it'd be pretty cool

19:52 <danvet> hm phoronix doesn't have an article ready

19:52 <karolherbst> oh no

19:52 <karolherbst> ahh

19:52 <karolherbst> fyi: https://developer.nvidia.com/blog/nvidia-releases-open-source-gpu-kernel-modules/

19:52 <emersion> danvet: disappointment ensues

19:53 <karolherbst> github also works

19:53 <emersion> now live!

19:53 <danvet> emersion, well what did you expect

19:53 <ajax> aw hell yeah

19:53 <danvet> I mean they reinvent work queues, like any good vendor module

19:54 <karolherbst> danvet: they invent a lot more than just that :P

19:54 <ajax> oh, so you mean we get to delete code

19:54 <ajax> excellent

19:54 <airlied> no you dont

19:54 <danvet> karolherbst, well it's the first thing I looked at

19:54 <karolherbst> you'll hate the code

19:54 <airlied> because its not useful code

19:55 <ajax> that makes it easier to delete i'd think

19:55 <karolherbst> yeah... doesn't seem to contain any arch speciifc code :(

19:55 <ajax> (i am not being entirely serious)

19:55 <danvet> emersion, there's another modeset driver in there, maybe there is something

19:55 <airlied> you get to be inspired.to write new code

19:55 <emersion> danvet: disappointment about phoronix

19:55 <karolherbst> how slow

19:55 <danvet> emersion, ah ok

19:55 <danvet> yeah

19:55 <Mis012[m]> did anyone sane ask for their ugly code? just gib some docs

19:55 <emersion> i'm just now looking at the nvidia-modeset dir

19:55 <danvet> can't even hear the furious typing

19:56 <karolherbst> danvet: you live real close, don't you?

19:56 <karolherbst> relatively

19:56 <airlied> danvet: not sure you will find anything it doesnt reinvent :-)

19:57 <karolherbst> I am actually disappointed now

19:57 <karolherbst> airlied: did you find anything arch speciifc?

19:57 <karolherbst> looks like some glue code

19:57 <karolherbst> well.. there is generci stuff, but

19:57 <karolherbst> ohh

19:57 <karolherbst> src/ duh

19:58 <karolherbst> :3 https://github.com/NVIDIA/open-gpu-kernel-modules/tree/main/src/nvidia/src/kernel/gpu

19:58 mbrost_ has joined #dri-devel

19:58 <karolherbst> there it is

19:58 <emersion> hmmm

19:59 <danvet> karolherbst, readme even explains that

20:00 <karolherbst> as if I read readmes

20:00 <emersion> oh so the interesting stuff is in kernel-open/nvidia-drm it seems

20:01 <airlied> that is old stuff

20:01 <airlied> been open for ages

20:01 <airlied> interesting stuff is in src

20:01 mbrost has quit [Read error: Connection reset by peer]

20:01 <karolherbst> yeah

20:02 <karolherbst> ignore everything besides src/

20:02 <emersion> was it

20:02 <emersion> ah

20:02 <karolherbst> yeah

20:02 <karolherbst> they needed some glue code to compile :P

20:02 <emersion> damn

20:02 <heat> emersion, dkms already builds that if you're using it

20:03 <emersion> sorry, i haven't messed around with the proprietary driver too much

20:04 lemonzest has quit [Quit: WeeChat 3.4]

20:08 <danvet> I'm just realizing how much amd's DAL was actually written for linux, because I'm pretty much completely lost on this thing :-)

20:08 <karolherbst> :)

20:08 <karolherbst> I love how they read sysfs files to get around GPL restirctions

20:09 <danvet> orly?

20:09 <karolherbst> sure

20:09 <airlied> danvet: so they have nvidia-modeset which is their modesetting interface to their X driver

20:09 <karolherbst> danvet: for runtime pm and stuff

20:09 <danvet> I mean no longer needed, but hilarious if that stuff is still in there

20:09 <airlied> then they have the drm driver which in theory does drm modesetting

20:09 <airlied> but I think on top of the same core

20:09 <danvet> airlied, yeah I guess that

20:09 <danvet> but since its a 100% hal I'm lost at connecting with the backend code in src/

20:09 <danvet> I guess when you're familiar with nv concepts it's a lot easier to find stuff

20:10 <airlied> no it really isn't

20:10 <karolherbst> the code isn't great :)

20:10 <danvet> well it's still wading through vendor lasagna most likely

20:10 <danvet> but at least you know a bit what stuff is about

20:10 <danvet> I can't even tell what stuff is

20:11 <alyssa> src/ feels like a windows driver

20:11 <airlied> alyssa: ding ding :-P

20:11 <karolherbst> alyssa: I'll ask a question and you answer it yourself: do you think any of this code is _new_?

20:12 <karolherbst> :P

20:12 <alyssa> karolherbst: :D

20:12 <emersion> they say it in their README

20:13 <emersion> "Though the kernel modules in the two flavors are different, they are based on the same underlying source code"

20:13 <graphitemaster> Does anyone know of a *clever* way one can determine if a GPU is an iGPU or dGPU from just regular GL alone, like maybe some weird mapped buffer trick?

20:13 <graphitemaster> Like some .. side channel observable behavior

20:13 <alyssa> karolherbst: "NvU32"

20:15 <ajax> graphitemaster: probably there's a way to coerce MapBuffer to give you a pointer into definitely vram not host ram? and you could time how fast readback is from that and guess whether it's uncached or other side of pcie from that

20:15 <airlied> https://blogs.gnome.org/uraeus/2022/05/11/why-is-the-open-source-driver-release-from-nvidia-so-important-for-linux/ some more info

20:16 <alyssa> "Nouveau can leverage the same firmware used by the NVIDIA driver, exposing many GPU functionalities, such as clock management and thermal management, bringing new features to the in-tree Nouveau driver"

20:16 <ajax> however: by the time you're doing that i feel like it's easier to just build a pattern match for GL_RENDERER and/or whichever egl/glx extension

20:16 <alyssa> so.. reclocking *is* getting solved then?

20:16 <airlied> alyssa: turing+ reclocking should be solvable

20:16 <airlied> leaves a bit of a gap in the support matrix

20:17 <alyssa> airlied: ...Wait no i have 2 other drivers to finish first ;p

20:17 <ajax> alyssa: my understanding is they've wanted to move reclocking control to the device side anyway

20:17 <karolherbst> alyssa: yep

20:19 <danvet> ajax, except igpu also can give you wc

20:19 <danvet> like most actually do I think

20:19 <ajax> danvet: sure, it's more does this latency feel like a pcie bus or not

20:19 <ajax> if it's too fast then no

20:19 <danvet> ah yes, that might work

20:20 <danvet> either that or gen2 intel igpu :-P

20:20 <karolherbst> alyssa: well, the firmware loading part is taken care of already anyway

20:20 <graphitemaster> ajax, Anything more robust than that, I really only need this behavior for AMD cards. Maybe there's some inherent arch choices on mobile vs desktop that are observable as a side channel.

20:21 <danvet> graphitemaster, on intel igpu is on a special pci bus:dev.fn

20:21 alyssa has left #dri-devel [#dri-devel]

20:21 <danvet> and I think you can clean that from the mesa extension?

20:21 <danvet> not sure it's the same on amd

20:21 <ajax> graphitemaster: are you allowed to assume GLX_MESA_query_renderer because its "unified memory" bit seems like what you're after

20:22 <ajax> or do people do fglrx on igp still

20:22 agd5f_ has quit []

20:22 agd5f has joined #dri-devel

20:22 <graphitemaster> Need something that works on Windows too

20:22 <agd5f> PCI ids?

20:23 <graphitemaster> Can you fetch those from GL

20:23 <graphitemaster> I didn't know if there was an extension or not

20:23 <ajax> probably not on windows

20:24 konstantin has joined #dri-devel

20:24 <ajax> what decision are you making based on this knowledge?

20:24 <graphitemaster> None, it's just for telemetry purposes.

20:25 <karolherbst> we missed the phoronix article from 20 minutes ago

20:25 anonymus1234 has joined #dri-devel

20:25 <danvet> geez phoronix was asleep, apparently some embargo and it didn't go out right when nvidia published

20:25 * karolherbst wonders if he should start trolling or not

20:25 * danvet severely disappointed

20:25 <karolherbst> danvet: I wouldn't be surprised if he didn't know

20:25 <danvet> karolherbst, do it

20:25 <danvet> karolherbst, says in the article the embargo lifted

20:25 <karolherbst> ahhh

20:26 <ajax> graphitemaster: i think collecting GL_RENDERER strings will get you that info as a side effect, product ids tend to be different for igp vs discrete

20:26 <danvet> and it's a bit much text for usual furious breaking news typing

20:26 heat has quit [Read error: Connection reset by peer]

20:26 <karolherbst> right...

20:26 heat has joined #dri-devel

20:28 <danvet> I guess now it's time to start the lwn timer

20:30 <graphitemaster> ajax, not on AMD >_>

20:30 gawin has joined #dri-devel

20:30 <graphitemaster> AMD in their infinite fucking wisdom decided to make laptop GPUs with the exact names as their discrete GPUs

20:30 <graphitemaster> NV did the same but you can determine the difference easily

20:31 <graphitemaster> Anyways sorry for asking, the NV news is more exciting

20:32 <graphitemaster> Whoo OPEN SOURCE KMD LETS GO </hype>

20:32 maxzor has joined #dri-devel

20:32 <karolherbst> 🎉🎉🎉

20:33 ella-0 has joined #dri-devel

20:33 <agd5f> graphitemaster, I don't think any of the APU marketing names overlap with dGPU marketing names.

20:33 <karolherbst> marketing names are all random, there is no sense behind it :P

20:35 <graphitemaster> The naming scheme is horrible for all vendors, everyone should be fired.

20:35 <bnieuwenhuizen> doesn't one of the EGL extensions tell you if it is an iGPU?

20:36 <Venemo> graphitemaster: are you trying to determine laptop vs desktop or iGPU vs dGPU?

20:36 <bnieuwenhuizen> err, GLX_MESA_query_renderer

20:36 <graphitemaster> iGPU vs dGPU, I can already determine laptop vs desktop based on the existence of a battery

20:36 <graphitemaster> I cannot tell if an AMD GPU is some APU thing or a dGPU though even based on the model name

20:37 <graphitemaster> Since they report similarly in their Windows driver as just "Radeon Graphics"

20:37 <graphitemaster> No model name in sight

20:38 <Venemo> Can't you query the size of dedicated VRAM?

20:39 <graphitemaster> It's kind of absurd, their windows driver just reports "AMD Radeon(TM) Graphics" for GL_RENDERER for like 8 different dGPUs and iGPUs

20:39 <graphitemaster> No way to distinguish anything, not even model.

20:39 <graphitemaster> So that's a separate problem now

20:39 <karolherbst> graphitemaster: best bet is to check for d3cold runpm support :P

20:40 <karolherbst> like if you got the firmware stuff wire up and all

20:40 <karolherbst> because desktop gPUs don't have this, and I hope all laptops do

20:41 <karolherbst> ohh wait.. it's iGPU vs dGPU

20:41 <bnieuwenhuizen> karolherbst: iGPUs don't

20:41 <karolherbst> ehh.. firmware buffer

20:41 <karolherbst> but yeah.. I think all vendors suck on that

20:42 <Mis012[m]> well, ideally you would check for the thing that makes you need to distinguish between iGPU and dGPU? :P

20:42 <Venemo> karolherbst: why'd NV publish their own out of tree driver when they could have contributed to nouveau instead?

20:42 <Mis012[m]> Venemo: so naive...

20:42 <Venemo> Just curious

20:43 Haaninjo has joined #dri-devel

20:44 <agd5f> Venemo, they have a working driver that works with CUDA today. making that happen with nouveau would take years?

20:45 <graphitemaster> No no, nouveau will have reclocking for newer NV GPUs now in the next hour right, because that's how all this works, it's so easy, just press a button /s

20:45 sadlerap has joined #dri-devel

20:45 <Mis012[m]> graphitemaster: actually, in a sane design, there wouldn't be any PCI(e) glue for the iGPU, but nevermind :P

20:45 <Venemo> Hm

20:45 mvlad has quit [Remote host closed the connection]

20:46 <Venemo> So it's easier to write this new driver from scratch than to add what was missing from nouveau?

20:46 <karolherbst> Venemo: .... wellll....

20:46 <graphitemaster> In a sane design OpenGL would have core features for querying something as basic as if a GPU is integrated or not, in addition to pcie vendor ids and what not.

20:46 <karolherbst> now we have source code?

20:46 <agd5f> Venemo, I think they just snapshot their existing driver for the most part

20:46 <karolherbst> Venemo: I think it was easier this way

20:46 <emersion> Venemo: it's not from scratch

20:46 <karolherbst> they can just dump their code into the public

20:46 ella-0[m] has joined #dri-devel

20:47 <karolherbst> and people can use it and fix nouveau

20:47 <agd5f> graphitemaster, why do you need to know in the first place? doesn't seem relevant for OpenGL

20:47 <Venemo> Huh, I thought they couldn't do that due to patents and stuff

20:47 <karolherbst> we do get occasional contributions from nvidia people though

20:47 <karolherbst> like today :P

20:47 <graphitemaster> most of the code is generated too, if you look here: https://github.com/NVIDIA/open-gpu-kernel-modules/tree/main/src/nvidia/generated

20:47 <karolherbst> Venemo: well that stuff lives inside the firmware

20:47 <graphitemaster> so it's not just some snapshot, it's also some other stuff they strip out or generate from description files they're not sharing, or some other programming language even

20:47 <Mis012[m]> graphitemaster: how dare you impose pci vendor ids when sane iGPUs are not on a glue PCI bus :P

20:48 Duke`` has quit [Ping timeout: 480 seconds]

20:48 <Venemo> Reading that blog post from the above link.

20:49 <graphitemaster> The generated code just looks like someone whoo really badly wanted virtual methods in C but C did not have them so they wrote a tool to generate it instead

20:49 <Venemo> Does RH really think they are "the only Linux vendor with the capacity to do so"?

20:49 <karolherbst> Venemo: do you see others doing it?

20:49 <karolherbst> well.. google I guess would be the only other one capable

20:49 <Venemo> karolherbst: Valve?

20:49 <karolherbst> not on this scale

20:50 <Sachiel> they also talk about compute

20:50 <karolherbst> yeah

20:50 <karolherbst> compute > graphics

20:50 <karolherbst> we'll get there eventually :D

20:50 <karolherbst> (I hope)

20:50 <karolherbst> ohh shit, do I have to say that rusticl has nothing to do with it, or won't people believe me anyway?

20:50 <bnieuwenhuizen> also sounds like the requirement for Turing was due to that having an embedded processor they could move all the secret stuff to

20:51 <karolherbst> bnieuwenhuizen: correct

20:51 <karolherbst> well

20:51 <karolherbst> the processors existed before already

20:51 <karolherbst> but with turing it can live on the one big one

20:51 <Venemo> The article specifically says this in context of creating a Vulkan driver

20:52 <bnieuwenhuizen> yes, so who is bootstrappinjg the Vulkan driver? :P

20:52 <karolherbst> we do :D

20:52 <Venemo> All due respect to RH but they are not the only people capable of developing a Vulkan driver

20:52 <karolherbst> Venemo: they are not, that's correct

20:52 <bnieuwenhuizen> they didn't say that?

20:53 <Venemo> It does say that

20:53 <karolherbst> well.. if somebody wants to start writing a vulkan driver, they can just do so :P

20:53 <Venemo> We already do

20:53 <karolherbst> but they could have done it before any of this regardless

20:54 <karolherbst> good luck with that then

20:54 <Venemo> Well, I'd call RADV a successful driver, at leaat

20:54 <karolherbst> yeah, and that was started by RH

20:54 konstantin has quit [Remote host closed the connection]

20:54 <karolherbst> or not?

20:54 <bnieuwenhuizen> it was, Dave and me

20:54 <karolherbst> :)

20:55 <bnieuwenhuizen> + a whole lot of stuff from Intel ;)

20:55 <karolherbst> right

20:55 <karolherbst> I'll assume there will be some vulkan driver sooner or later :D

20:56 <Venemo> I think it's pretty arrogant to say that RADV was made by RH

20:56 <karolherbst> I didn't say that, I'd say it got started by it

20:57 <airlied> RH has started the most vulkan drivers :-P

20:57 <airlied> actually Collabora might be even

20:57 <airlied> or has taken the lead, I should recount

20:57 <karolherbst> yeah.. I think collabora caught up

20:57 <bnieuwenhuizen> airlied: which ones did RH do? RADV + ?

20:57 <karolherbst> in the end we as the community made those drivers

20:57 <airlied> bnieuwenhuizen: lavapipe :-)

20:57 <karolherbst> but it's always about who started it, because if nobody does, others might just rely on vendor drivers

20:58 <karolherbst> maybe amdvlk would have beend the one everybody uses if airlied wouldn't have started radv with others, who knows

20:58 <Venemo> RADV has one contributor from RH, and several from Valve so I think it's unfair to talk about it without acknowledging Valve. That's all I'm saying

20:58 <karolherbst> (I don't think so, but...)

20:58 <bnieuwenhuizen> really happy though that nouveau isn't this dreadful place of "we can't do anything interesting anyway" anymore

20:58 <karolherbst> bnieuwenhuizen: yeah....

20:58 <karolherbst> that's like the big news here

20:58 <graphitemaster> agd5f, for telemetry purposes, want a way to collate and organize data from every user of the software for automated crash and performance reports - the more markers and filters for that the better, think like this: https://crash-stats.mozilla.org/topcrashers/?product=Firefox&version=102.0a1&process_type=gpu

20:58 <karolherbst> the GL driver is still quite crappy though :'(

20:59 <karolherbst> every time I touch it, I just feel like writing a new one

20:59 <bnieuwenhuizen> always felt like nouveau was just bleeding devs due to that lack of hope

20:59 <Venemo> karolherbst: will this stuff allow you to do proper reclocking of the cards now?

20:59 <karolherbst> Venemo: yeah

20:59 <airlied> of the turing+ cards

20:59 <Venemo> Yeah

20:59 <karolherbst> bnieuwenhuizen: yep, well.. I knew that this was going to happen sooner or later, but....

20:59 <graphitemaster> better example actually here

20:59 <graphitemaster> https://crash-stats.mozilla.org/signature/?product=Firefox&signature=OOM%20%7C%20large%20%7C%20mozalloc_abort%20%7C%20core%3A%3Aintrinsics%3A%3Aconst_eval_select%3CT%3E&version=102.0a1&date=%3C2022-05-11T20%3A58%3A52%2B00%3A00&date=%3E%3D2022-05-04T20%3A58%3A52%2B00%3A00

20:59 <karolherbst> doesn't help if you are the only one and there are like 100 things falling apart left and right :)

20:59 <bnieuwenhuizen> karolherbst: you're the one that stuck around anyway :P

20:59 <graphitemaster> lovely url

21:00 <karolherbst> bnieuwenhuizen: yeah..........

21:00 <karolherbst> didn't do a great job though

21:00 <bnieuwenhuizen> anyway, can I recommend a Vulkan driver + zink instead of yet another gallium driver?

21:00 <jekstrand> Yeah, the GL driver leaves something to be desired but if we can run the GPUs at full clocks, that's going to motivate more people to make it work.

21:00 * karolherbst should fix multithreading

21:00 * karolherbst is saying this for years

21:00 <Venemo> Exactly

21:00 <karolherbst> bnieuwenhuizen: yes, that's my ideal solution

21:00 <bnieuwenhuizen> might as well if we consider nouveau in need of a significant rewrite

21:00 <karolherbst> we could even do vulkan on nv50 if we really wanted to

21:01 <graphitemaster> karolherbst, honestly think it's better to focus on the vulkan driver and just let zink do GL :P

21:01 <bnieuwenhuizen> and with imagination in there is some pressure to make that actually work well

21:01 <karolherbst> graphitemaster: I just focus on CL :D

21:01 <jekstrand> Regardless of whether there's a nouveau rewrite on the table, we should build Vulkan first.

21:01 <karolherbst> yes

21:01 <karolherbst> and fixing multithreading is mostly done

21:01 <karolherbst> so that's fine

21:01 <Venemo> do they also have any info about their ISA and other stuff?

21:01 <karolherbst> the driver does work somewhat

21:01 <jekstrand> Once we brought up ANV, Kayden was able to write iris by himself re-using all the pieces we built for ANV.

21:01 <karolherbst> it's just not very stable

21:01 <graphitemaster> Vulkan is the easiest path to getting GL and D3D12 on Linux with nouveau + NV kmd

21:01 <jekstrand> Vulkan encourages you to do things in a better way if you actually care to do so.

21:01 <karolherbst> Venemo: well.. I do

21:01 <jekstrand> Then you re-use for GL.

21:02 <Venemo> karolherbst: a reverse engineered one or do you actually have an official one?

21:02 <karolherbst> jekstrand: yes, and I think if we keep this in mind, it should make it easy to write a new driver

21:02 <karolherbst> Venemo: I have one I didn't reverse engineered :D

21:02 <Venemo> I assume under NDA

21:02 <karolherbst> correct

21:02 <bnieuwenhuizen> up to where do the reverse engineered ones go?

21:02 <karolherbst> it doesn't contain encoding though

21:02 <karolherbst> but who cares about encoding

21:02 <bnieuwenhuizen> did that ever go beyond maxwell?

21:03 <karolherbst> bnieuwenhuizen: we do support ampere, more or less

21:03 <karolherbst> it just misses firmware

21:03 <karolherbst> ampere has the same ISA as volta+

21:03 <karolherbst> we still need to figure out encodings, but the doc explain the instructions, which is nice

21:03 <bnieuwenhuizen> so the ingredients are actually there if someone from the community wanted to start

21:03 <karolherbst> ohh totall

21:03 <karolherbst> but it was already there 3 years ago

21:04 <karolherbst> (we do have a vulkan driver, it's just not finished and not public) :D

21:04 <Venemo> karolherbst: so you're allowed to create open source software based on that doc, but you're not allowed to share the doc?

21:04 <karolherbst> Venemo: correct

21:04 <Venemo> I see

21:04 <karolherbst> the source code drop really only solves the performance part

21:04 <bnieuwenhuizen> karolherbst: but without hope to be useful it is pretty hard to start one

21:04 <karolherbst> nothing else

21:04 <jekstrand> karolherbst: If you figure out how to get my 2060 working (or tell me what other GPU to buy), I can help. :P

21:04 <karolherbst> sure, performance matters the most

21:04 <karolherbst> but....

21:04 <karolherbst> what I am saying, we knew this was coming

21:05 <karolherbst> if I wouldn't have known it, maybe I would have left, dunno

21:05 <karolherbst> jekstrand: welll... the code is out there :P

21:05 <karolherbst> :D

21:05 <karolherbst> jekstrand: anyway.... I'd wait a little longer

21:05 <Venemo> Turing is the 20xx series right?

21:06 <jekstrand> karolherbst: Vulkan driver code?

21:06 <karolherbst> there will be code making use of the new firmware, and there are some patches from Ben rewriting the host bits

21:06 <karolherbst> jekstrand: everything

21:06 <jekstrand> karolherbst: links?

21:06 <karolherbst> jekstrand: that's the annoying part, Ben doesn't like to share shit if Ben thinks it's not ready :(

21:06 <karolherbst> tease him :P

21:06 <karolherbst> I am just failing at this

21:07 <karolherbst> I already told him, that others will write a vulkan driver if he doesn't push :D

21:07 <graphitemaster> Ah yes the infamous open-but-not-actually Vulkan driver that exists-but-doesnt that I keep hearing about

21:07 <karolherbst> yeah...................

21:07 <bnieuwenhuizen> so do we create a MR with skeleton just to provoke someone into pushing?

21:07 <karolherbst> sounds like a plan

21:08 <karolherbst> I have something alreayd :D

21:08 <karolherbst> let me find it

21:08 anarsoul|2 has quit []

21:08 anarsoul has joined #dri-devel

21:08 <daniels> karolherbst: you just can't trust Australians

21:08 <anarsoul> Venemo: and 1650

21:09 <anarsoul> karolherbst: does it mean that nouveau will eventually get reclocking support?

21:09 <karolherbst> it's super and isn't doing anything, but: https://github.com/karolherbst/mesa/commits/nouveau_vulkan

21:09 <karolherbst> anarsoul: correct

21:09 <Venemo> anarsoul: good to know

21:09 <graphitemaster> This is all exciting news for graphics on Linux but I think I understand NV's strategy this time. In the past the concern was that they didn't want to give people their software. Now their new strategy is to give the software but make it impossible to get their hardware. It's quite brilliant. We can have an open sores NV platform but no one on earth can get their hands on a RTX GPU XD

21:10 iive has joined #dri-devel

21:10 <Venemo> graphitemaster: wow, you figured it out

21:10 <zf> it's going to be called nouveaulkan, right?

21:10 <karolherbst> zf: of course

21:10 <zf> :-)

21:10 <zf> that's the only thing I care about

21:11 <karolherbst> or mayube nouveaolcan?

21:11 <karolherbst> it's closer to french

21:11 <bnieuwenhuizen> novk

21:12 <zf> nouveaulcain?

21:12 <karolherbst> nok

21:12 <jekstrand> nvk

21:12 <jekstrand> 3 letters are better than 4

21:12 <Venemo> is it also time for an NCO?

21:13 <jekstrand> Venemo: Thinking of starting on it. :)

21:13 <karolherbst> let's take a new approach to writing a vulkan driver, everybody writes each N line and we put it alltogether and submit an MR

21:13 <bnieuwenhuizen> yeas, the extra letter in radv resulted in so much overhead :(

21:13 <karolherbst> Venemo: yeah.... a new compiler would be nice :D

21:13 <Venemo> :)

21:13 <karolherbst> so if somebody doesn't have anything better to do :D

21:13 <jekstrand> karolherbst: I even typed the skeleton for you: https://www.collabora.com/news-and-blog/blog/2022/03/23/how-to-write-vulkan-driver-in-2022/

21:13 <karolherbst> awesome

21:13 <jekstrand> karolherbst: Just copy+paste from the blog and it should all build. :D

21:14 <bnieuwenhuizen> y'all know that ACO was ACO because AC was already taken right?

21:14 <karolherbst> still busy with random CL stuff though :P

21:14 <karolherbst> jekstrand: sounds awesome

21:14 <jekstrand> Clearly, it needs to be called Nouveau IR. There's even an easy 3-letter acronym for that. Oh, wait....

21:14 <graphitemaster> karolherbst, I'll write the memory manager: addr_t gpu_malloc(size_t) { return rand_u64(); } // statistically safe (2^64) address space

21:14 <graphitemaster> Very unlikely anything will trample, you're welcome

21:15 <graphitemaster> More likely a bug exists elsewhere

21:15 <karolherbst> jekstrand: it's supposed to be a secret...

21:15 <karolherbst> graphitemaster: need to take care of alignment though

21:16 <graphitemaster> Does NV care about alignment?

21:16 dj-death has quit [Ping timeout: 480 seconds]

21:16 <karolherbst> graphitemaster: yes

21:16 <graphitemaster> I thought robust buffer access saves our bacon here XD

21:16 <zf> if it's french, it should be rin, for "représentation intermédiaire nouvelle"

21:16 <karolherbst> the point is, that a lot of commands only take aligned addresses, because the lower bits are missing :)

21:16 <bnieuwenhuizen> AFAIU nvidia has manual bounds checks for most stuff

21:16 <karolherbst> at least for some random stuff

21:17 <karolherbst> sure

21:17 <karolherbst> but load/stores need to be aligned and random other things

21:17 <bnieuwenhuizen> please fix the 256 byte alignment in your vulkan driver, would be beneficial for vkd3d-proton

21:17 <karolherbst> like fully aligned

21:17 <bnieuwenhuizen> for UBOs*

21:18 <karolherbst> I think it already is with nir, isn't it?

21:18 <graphitemaster> okay return (rand_u64() + (256 - 1)) & ~(256 - 1)

21:18 <jekstrand> Still, rand_u64() & 0xfffffffff000 ought to be pretty unlikely to collide.

21:18 <graphitemaster> happy

21:18 <jekstrand> (assuming a 48-bit address space)

21:18 <karolherbst> graphitemaster: GPU is only 48 bits :P

21:19 <graphitemaster> I still think statistically less likely to collide than any other random bug in the driver even for 48 bit address space

21:19 <karolherbst> mhhh maybe

21:19 <jekstrand> Or, you know, we can use src/util/vma_heap.h

21:19 <graphitemaster> I bet vma_heap is slower than xorshift128

21:19 <karolherbst> jekstrand: please do

21:20 sdutt has quit [Ping timeout: 480 seconds]

21:20 <karolherbst> I am tired of having "nouveau only libs" for stupid reasons

21:20 <jekstrand> I briefly tried to move the AMD drivers to util/vma_heap but theirs was pretty embedded into places.

21:21 <jekstrand> But anything built today should use it unless there's a good reason to do otherwise.

21:21 <bnieuwenhuizen> ?

21:21 <karolherbst> yeah....

21:21 <bnieuwenhuizen> it isn't that embedded into radv no?

21:21 <karolherbst> that's always the shitty thing about doing proper reworks on nouveau

21:21 <karolherbst> you start looking

21:21 <jekstrand> bnieuwenhuizen: It's embedded in the winsys stuff

21:21 <jekstrand> And I didn't feel like ripping it out.

21:21 <karolherbst> after 2 hours you give up, because writing from scratch takes less time

21:21 <bnieuwenhuizen> though I guess I have to kill libdrm_amdgpu due to the shader drm fd

21:21 <karolherbst> and solves the other 50 issues at the same time

21:22 <jekstrand> karolherbst: I based it on the AMD one, I just didn't want to take the time in a driver I don't know well to make AMD use the shared one.

21:22 <karolherbst> I am sure it's not better with nouveau

21:23 <karolherbst> anyway...

21:23 <karolherbst> we should have a proper vulkan driver :)

21:23 <jekstrand> Looks like it's now being used in itnaviv, virgil, iris, pvr, and ANV.

21:24 <jekstrand> karolherbst: Yes, we should. :)

21:24 <graphitemaster> Just leak the one-that-exists-but-doesnt-actually-yet open-but-not-open exists-but-does-not-exist one smh

21:24 <karolherbst> and I am sure I won't be useful on that besides compiler stuff, because people will move _fast_ :D but I also relied to much on others doing it anyway, so, that's fine :P

21:24 AndrewR has quit [Ping timeout: 480 seconds]

21:25 <bnieuwenhuizen> jekstrand: did you mean vma.h?

21:26 <jekstrand> bnieuwenhuizen: yes

21:27 <haagch> is the time right for vulkan-only + zink or will it be worth it to write an opengl driver / rewrite nouveau?

21:27 <Venemo> haagch: I think it's never been a better time for that

21:30 <karolherbst> haagch: I'd start with zink honestly

21:31 <karolherbst> the gl driver is in a really bad shape overall

21:31 <karolherbst> I think rewriting it is the only sane solution

21:31 <karolherbst> I'd still fix multithreading in a non breaking way, because that's so important to get fixed, but besides that?

21:31 <bnieuwenhuizen> so is your plan rusticl on top of zink?

21:31 <karolherbst> all I can think of "ehhh.. we want a new one anyway"

21:31 <Venemo> esp. considering that the HW vendor isn't contributing to it, it's better for the community to focus all effort on the VK driver

21:31 <karolherbst> bnieuwenhuizen: I suspect, or I'll change code around to target vulkan directly

21:32 <karolherbst> thing is... what would zink offer here besides doing some vulkan abstraction?

21:32 <karolherbst> we have to pass the original spir-v in

21:32 <bnieuwenhuizen> I hear anv can already ingest CL SPIRV internally

21:32 <karolherbst> otherwise it's a huge mess

21:32 <karolherbst> yeah..

21:32 <karolherbst> so we'd come up witha vulkan ext to just read CL spir-v

21:32 <karolherbst> and I think that's it?

21:33 <karolherbst> I've heard there is an ext for real buffers

21:33 <bnieuwenhuizen> also have to set the kernel args

21:33 <Sachiel> but you'd miss on the fun of having extra layers to debug

21:33 <karolherbst> bnieuwenhuizen: that's boring stuff

21:34 <karolherbst> it's literally an ubo

21:34 <bnieuwenhuizen> VK_KHR_buffer_device_address yeah, though it isn't quite as flexible as CL wrt random casts AFAIU

21:34 <karolherbst> yeah.. no idea how much we need at runtime

21:34 <karolherbst> CL has image2d from buffer support e.g.

21:34 <karolherbst> and this can be painful

21:34 <karolherbst> and other random stuff

21:34 <karolherbst> but...

21:35 <karolherbst> I know that I don't really need much on top of GL gallium

21:35 <bnieuwenhuizen> can you require some alignment on the image2d from buffer stuff?

21:35 <karolherbst> yes

21:35 <karolherbst> strides are explicit

21:35 <karolherbst> and the API reports min alignment of the stride _and_ base addr

21:35 <bnieuwenhuizen> then I think you can just do stuff with vulkanlinear images there

21:35 <karolherbst> maybe

21:35 <karolherbst> I don't know vulkan :)

21:35 <karolherbst> I just now that on top of GL gallium I need 1. set_global_binding and... that's it :)

21:36 <karolherbst> (well besides compiler stuff, but I lower a lot of things to make it work)

21:36 <karolherbst> I'd need to change gallium APIS for 2D from buffer though

21:36 <karolherbst> but besides that? nope, nothing

21:37 <ajax> karolherbst: get a vulkan driver working and then just use zink

21:38 <bnieuwenhuizen> you ... are not the first offering that advice :P

21:38 <Venemo> You don't need to know vulkan to develop a vulkan driver

21:38 <ajax> oh, not for gl, here

21:38 * ajax hushes

21:38 <karolherbst> Venemo: yeah... I also don't know much about GL :)

21:38 <Venemo> Hehe

21:38 <karolherbst> but I do know a lot about CL now :O

21:39 <karolherbst> but if you start implementing on the API side you kind of have to

21:39 <Venemo> 🤣

21:39 <Venemo> Yeah, to some extent it doesn't hurt to know it

21:40 <karolherbst> bnieuwenhuizen: but yeah.. maybe it would be easier to use zink for now, now that things actually pass the CTS

21:41 <daniels> zmike's claiming it's the reference GL driver now

21:41 <karolherbst> but I kind of don't like the converting nir to spirv part

21:41 <karolherbst> and would rather just pass the spir-v through

21:41 <bnieuwenhuizen> karolherbst: just consider it a gallium driver and don't look under the hood :P

21:42 <karolherbst> that's what I am already doing! :D

21:42 <karolherbst> but honestly, you'll hate the nir you'd get

21:42 <karolherbst> the bigger issue is just the complexity of the nirs

21:42 <karolherbst> so.. we want to do real function calling at some point

21:43 <karolherbst> and you'll end up with nirs having a lot of values

21:43 <karolherbst> and I think it's easier to just require vulkan drivers to consume spir-v directly :)

21:43 <heat> given that we have license-compatible nvidia drivers now, isn't it a good idea to ditch nouveau? at least the kernel side?

21:43 <bnieuwenhuizen> yeah, we'll need to improve the compilers in the vulkan drivers too

21:43 <karolherbst> heat: nope

21:44 <heat> why not?

21:44 <karolherbst> nvidias code can't go upstream for multiple reasons

21:44 <bnieuwenhuizen> heat: sounds like a safe bet that the nvidia drivers are not upstreamable in their current form at least

21:44 <heat> yes, but do you need it upstream?

21:44 <karolherbst> yes

21:44 <Sachiel> also support for older platforms

21:44 <heat> you could do it like IIRC dri was done back then

21:44 <karolherbst> please no

21:45 <anholt_> there's a reason we stopped doing it like dri was done back in the bad old days

21:45 <karolherbst> why are we even arguing if having a non upstream driver is okay? of course it has to be upstream :P

21:45 <karolherbst> I am already annoyed by canonical saying "ahh yeah, we just ship this out of tree stuff"

21:46 <karolherbst> but well.. it's cannonical :P

21:46 <jekstrand> canonical has been shipping the blob for years

21:46 <karolherbst> I didn't expect anything else

21:46 <karolherbst> I know

21:46 <karolherbst> but they didn't disable nouveau yet, which I expect them to do in the future

21:46 mbrost_ has quit [Ping timeout: 480 seconds]

21:47 <jekstrand> :-/

21:47 <karolherbst> well.. do you expect them to not do this once the out of tree driver covers all relevant devices?

21:47 <karolherbst> well maybe they would just patch out support for newer devs in nouveau

21:47 <karolherbst> who knows

21:47 <karolherbst> maybe they wouldn't

21:47 <karolherbst> point is, I don't trust them to not do it

21:47 <heat> how's mesa going to do things then? no nvidia support and just nouveau?

21:48 <karolherbst> heat: why would mesa want to target out of tree UAPIs? :P

21:48 <karolherbst> that's a hell of a nightmare to support

21:48 <karolherbst> the biggest problem is though.. nvidia needs UAPI for CUDA, but CUDA stays closed

21:48 <heat> ideally someone brings it upstream (that may take a while, of course)

21:48 <karolherbst> and we won't accept UAPIs without open userspace

21:49 <karolherbst> so how would that stuff gets upstreamed?

21:49 <Venemo> Okay, maybe a stupid question, but if they didn't write this from scratch but just opened their driver, why does this new one lack some features and HW support?

21:49 <karolherbst> because they disabled it

21:49 <heat> Venemo, what's it lacking?

21:49 <Venemo> Why was it okay to open the Turing support but not the others

21:49 <karolherbst> heat: it's mostly compute only at this point

21:49 <heat> oh really?

21:49 <karolherbst> Venemo: because turing has this big risc-v CPU which can load this huge firmware blob

21:50 <Venemo> so?

21:50 <Venemo> I don't see the connection

21:50 <karolherbst> they obviously didn't want to open stuff which is in firmware :P

21:50 <Venemo> The old GPUs also had something that loaded the FW, didn't they?

21:50 <karolherbst> yeah, but not like this

21:50 <karolherbst> on turing+ you can use this one risc-v one instead of all the others

21:51 danvet has quit [Ping timeout: 480 seconds]

21:51 <Venemo> Sure, they don't wanna open the FW, but nobody expects them to do that anyway

21:52 <dcbaker> I mean, it works be great if they did

21:52 <Venemo> What is wrong with the old way of loading the FW that warrants keeping those closed?

21:53 anonymus1234 has quit []

21:53 <Venemo> Also, why do they care? Those GPUs are obsolete by now anyway

21:53 <karolherbst> why would it even matter?

21:54 <karolherbst> they made a decision, and that's what we got

21:56 <Venemo> sorry, my mistake for trying to make sense of it

21:57 <gawin> some people are putting older gpus into PCs for compatility with 98/xp

21:57 Haaninjo has quit [Quit: Ex-Chat]

21:57 <karolherbst> gawin: I think those are people you don't have to care about

21:58 maxzor has quit [Ping timeout: 480 seconds]

21:59 alanc has joined #dri-devel

22:05 * airlied called my vulkan driver nouv :-P

22:05 <graphitemaster> no.uv

22:05 <karolherbst> airlied: it was your name and I think I used it then or soemthing

22:05 apinheiro has joined #dri-devel

22:08 ahajda has quit [Quit: Going offline, see ya! (www.adiirc.com)]

22:15 <javierm> Venemo: but maybe the old GPUs didn't have a big enough co-processor to push all their IP to the firmware ?

22:16 dolphin has quit [Remote host closed the connection]

22:16 dolphin has joined #dri-devel

22:16 mmind00 has quit [Quit: No Ping reply in 180 seconds.]

22:16 unerlige has quit [Remote host closed the connection]

22:16 rsripada has quit [Remote host closed the connection]

22:16 jhli has quit [Remote host closed the connection]

22:16 jhli has joined #dri-devel

22:16 Ryback_ has quit [Remote host closed the connection]

22:16 nchery has quit [Remote host closed the connection]

22:16 unerlige has joined #dri-devel

22:16 Ryback_ has joined #dri-devel

22:16 mattrope has quit [Remote host closed the connection]

22:16 ramaling has quit [Remote host closed the connection]

22:16 pzanoni has quit [Remote host closed the connection]

22:16 tursulin has quit [Remote host closed the connection]

22:16 mdnavare has quit [Remote host closed the connection]

22:16 ramaling has joined #dri-devel

22:16 pzanoni has joined #dri-devel

22:16 mdnavare has joined #dri-devel

22:17 nchery has joined #dri-devel

22:17 <javierm> in other words, maybe for turing they could push IP that was in the driver to the firmware but that maybe wasn't possible for older GPUs

22:17 mattrope has joined #dri-devel

22:17 aswar002 has quit [Remote host closed the connection]

22:17 rsripada has joined #dri-devel

22:17 mmind00 has joined #dri-devel

22:18 aswar002 has joined #dri-devel

22:21 CATS has quit [Ping timeout: 480 seconds]

22:21 CATS has joined #dri-devel

22:24 <FLHerne> karolherbst: Why does CUDA need unique uapi that isn't required for CL and similar

22:24 <karolherbst> because CUDA is huge

22:24 <FLHerne> (?)

22:24 <karolherbst> the uapi is out there, just look at it

22:24 <FLHerne> yeah, but I thought most of the hugeness would end up being userspace stuff

22:24 <karolherbst> it's mostly nvidia-uvm, but there are a looot of uapis

22:24 <FLHerne> hm, ok

22:25 <karolherbst> the problem is, are they willing to open up CL if we require a compiler or is that already a show stopper

22:25 <karolherbst> so that's the second issue

22:26 tursulin has joined #dri-devel

22:32 OftenTimeConsuming is now known as Guest429

22:32 OftenTimeConsuming has joined #dri-devel

22:32 Guest429 has quit [Remote host closed the connection]

22:34 <jekstrand> karolherbst: Have you seen https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16450 ?

22:34 <jekstrand> karolherbst: Truly aweful but also probably what we need to do thanks to CL barrier semantics. :-/

22:34 <karolherbst> I didn't

22:35 <Venemo> javierm: I didn't think about that, but maybe

22:38 <karolherbst> jekstrand: yeah.. we need that :)

22:38 tursulin has quit [Ping timeout: 480 seconds]

22:38 <karolherbst> but yeah.. it sounds horrible :(

22:38 <karolherbst> is there no instruction to converge threads?

22:41 <jekstrand> karolherbst: No. It's pretty horrible

22:41 <karolherbst> :(

22:41 <jekstrand> karolherbst: The OpenCL spec says:

22:41 <jekstrand> If the barrier is inside a conditional statement, then all work-items in the work-group must enter the conditional if any work-item in the work-group enters the conditional statement and executes the barrier.

22:41 <jekstrand>

22:42 <jekstrand> This is very different from GL where it's required to be in uniform control-flow.

22:42 <karolherbst> yeah...

22:42 <karolherbst> compute doesn't care about trivial things like "is this even feasible in hw?" :P

22:42 <jekstrand> With CL, you have to allow for some of a wave to hit the barrier and another part to hit it on the next time around.

22:42 <jekstrand> It may be that we can use a different kind of barrier on DG2 that works like CL wants it to.

22:42 <jekstrand> I seem to recall there's something that might work but I no longer have access to those docs.

22:42 <karolherbst> potentially

22:42 <jekstrand> Hrm...

22:43 <jekstrand> No, never mind. Not possible.

22:43 <karolherbst> for what is this needed here anyway?

22:43 OftenTimeConsuming has quit [Ping timeout: 480 seconds]

22:43 <jekstrand> What do you mean?

22:43 <karolherbst> like why does vulkan need this?

22:43 <karolherbst> or well.. I suspect it's for vulkan, but why is that needed?

22:43 <jekstrand> Vulkan doesn't. OpenCL does and Vulkan ray-tracing uses some OpenCL kerne.s

22:43 <jekstrand> *kernels

22:43 <karolherbst> ahh, right, of course

22:43 apinheiro has quit [Quit: Leaving]

22:44 <karolherbst> let me read something

22:44 <jekstrand> So one of the cases where you can hit this with CL is if you do "while (true) { ... if (non-uniform) { barrier(); /* stuff */ break; } }"

22:44 <karolherbst> soooo

22:45 <karolherbst> I think for ifs it has to be uniform

22:45 <karolherbst> for loops it doesn't

22:45 <jekstrand> Yes

22:45 <jekstrand> I think that's right

22:45 tursulin has joined #dri-devel

22:45 <jekstrand> If the barrier is inside a loop, then all work-items in the work-group must execute the barrier on each iteration of the loop if any work-item executes the barrier on that iteration.

22:45 <karolherbst> for loops we can force all threads to go through a barrier, might not be pretty but we could reorganize any loop in such a way

22:46 <jekstrand> That seems to imply actual uniform

22:46 <karolherbst> yeah.. soo there are two parts

22:46 <karolherbst> uniform control flow

22:46 <karolherbst> and converged/diverged threads

22:46 <karolherbst> they all might reach the barrier, but not at the same time

22:46 <jekstrand> It could be that we can just make the structurizer more barrier-aware.

22:46 <jekstrand> I don't know how to do that, though. :-/

22:47 <karolherbst> do all threads have to execute the barrier in lock-step on DG2?

22:47 <jekstrand> all invocations in a wave? Yes.

22:47 <jekstrand> All threads? That's kind of the point of barrier()

22:47 <karolherbst> mhh, right

22:47 icecream95 has joined #dri-devel

22:47 <jekstrand> Hrm...

22:47 <jekstrand> Yeah, we may be able to just chalk this up to structurizer fail, actually.

22:48 <karolherbst> hopefully

22:48 <jekstrand> Not sure how to fix it, though. :-/

22:48 <jekstrand> Reading Lionel's pass may be easier than trying to page in how the structurizer works. :-/

22:48 <karolherbst> :D

22:49 <karolherbst> I know that on nvidia we have to synchronize threads for stuff like this, which is annoying

22:49 nchery has quit [Ping timeout: 480 seconds]

22:49 <jekstrand> We may also be able to get away with just a loop break hoisting pass.

22:50 <jekstrand> Yeah, that's what the Intel barrier instruction does.

22:50 <jekstrand> It does some sort of semaphore thing to sync threads

22:50 <jekstrand> It's all in HW so I don't know the details.

22:50 <karolherbst> right

22:51 <karolherbst> we have a warpsync instruction with a constant thread mask

22:51 <karolherbst> well on newer gens

22:56 gawin has quit [Ping timeout: 480 seconds]

22:57 <dschuermann> I guess that could be done using ballot(non-uniform cond) != 0? modulo side-effects modulo messiness

23:01 nchery has joined #dri-devel

23:07 OftenTimeConsuming has joined #dri-devel

23:07 tursulin has quit [Read error: Connection reset by peer]

23:15 <jekstrand> airlied: Reminder: !16435 would like your eyes on the 4 llvmpipe patches.

23:15 <jekstrand> I don't think any of them should be controvertial

23:21 <airlied> will look once I run out of meetings to go to

23:21 <jekstrand> airlied: So, August, maybe?

23:21 <jekstrand> Of 2035

23:22 * jekstrand gets some of his best code review done during meetings.

23:24 morphis has quit [Ping timeout: 480 seconds]

23:25 morphis has joined #dri-devel

23:25 <tarceri> jekstrand: no problem. emma has given it a rb so up to you if you want to go over it as well :)

23:26 <jekstrand> tarceri: I was going to. My brain might be working well enough now. I'll give it a go. Otherwise, I'll look tomorrow.

23:40 icecream95 has quit [Ping timeout: 480 seconds]

23:50 <tarceri> :)

23:52 jernej has quit [Quit: Free ZNC ~ Powered by LunarBNC: https://LunarBNC.net]

23:53 jernej has joined #dri-devel

23:53 <jekstrand> tarceri: Why is there nothing in the then side besides the break? Is that part of it being a terminator?

23:53 pcercuei has quit [Quit: dodo]

23:55 <anholt_> how would I detect a vtn frexp or modf destination needing to be referenced with ->elems[i] instead of ->def? All the other cases seem to be fine with glsl_type_is_vector_or_scalar(value->type) to detect that the result is in ->def.

23:55 rkanwal has quit [Quit: rkanwal]

23:56 <tarceri> jekstrand: yeah the continue side generally has all instructions moved outside the if via opt_if

23:57 alistairp has joined #dri-devel

23:57 <anholt_> mediump looks like it's worth 5% on gfxbench vk-5-normal¸ putting us within 4% of freedreno.

23:57 <jekstrand> tarceri: That's not the side I'm worried about. :-) Left you a comment on the MR.

23:57 <anholt_> (I suspect once we sort out a bit of gmem allocation stuff, we'll get the rest and maybe pull ahead)