#dri-devel on 2022-04-05 — irc logs at oftc.irclog.whitequark.org

2022-03-22 11:57 ChanServ changed the topic of #dri-devel to: <ajax> nothing involved with X should ever be unable to find a bar

00:01 nchery is now known as Guest1204

00:01 nchery has joined #dri-devel

00:08 Guest1204 has quit [Ping timeout: 480 seconds]

00:16 slattann has joined #dri-devel

00:18 mbrost_ has joined #dri-devel

00:20 mbrost has quit [Read error: Connection reset by peer]

00:27 apinheiro has quit [Ping timeout: 480 seconds]

00:31 eukara_ has joined #dri-devel

00:31 eukara has quit [Read error: Connection reset by peer]

00:38 slattann has quit []

00:38 heat has joined #dri-devel

01:03 co1umbarius has joined #dri-devel

01:04 ngcortes has quit [Remote host closed the connection]

01:04 columbarius has quit [Ping timeout: 480 seconds]

01:12 HankB_ has quit [Remote host closed the connection]

01:13 HankB_ has joined #dri-devel

01:15 LexSfX has quit [Ping timeout: 480 seconds]

01:15 LexSfX has joined #dri-devel

01:16 mhenning has quit [Quit: mhenning]

01:31 pribas has joined #dri-devel

01:31 pribas has quit []

02:10 icecream95 has quit [Quit: leaving]

02:15 tarceri has quit [Remote host closed the connection]

02:17 tarceri has joined #dri-devel

02:45 mbrost_ has quit [Ping timeout: 480 seconds]

03:03 JohnnyonFlame has joined #dri-devel

03:17 mbrost has joined #dri-devel

03:22 digetx is now known as Guest1220

03:22 Guest1220 has quit [Read error: Connection reset by peer]

03:22 digetx has joined #dri-devel

04:06 Duke`` has joined #dri-devel

04:10 JohnnyonF has joined #dri-devel

04:10 mclasen has quit [Ping timeout: 480 seconds]

04:13 mbrost has quit [Ping timeout: 480 seconds]

04:16 loki_val has joined #dri-devel

04:17 JohnnyonFlame has quit [Ping timeout: 480 seconds]

04:19 crabbedhaloablut has quit [Ping timeout: 480 seconds]

04:24 aravind has joined #dri-devel

04:32 slattann has joined #dri-devel

04:43 shankaru has joined #dri-devel

04:54 oakk has joined #dri-devel

04:54 aravind has quit [Read error: Connection reset by peer]

05:13 itoral has joined #dri-devel

05:14 jewins has quit [Ping timeout: 480 seconds]

05:17 heat has quit [Ping timeout: 480 seconds]

05:33 mbrost has joined #dri-devel

05:43 Duke`` has quit [Ping timeout: 480 seconds]

06:02 sdutt has quit [Read error: Connection reset by peer]

06:03 ahajda has joined #dri-devel

06:06 mszyprow has joined #dri-devel

06:14 danvet has joined #dri-devel

06:21 dviola has joined #dri-devel

06:25 soreau has quit [Read error: Connection reset by peer]

06:26 soreau has joined #dri-devel

06:30 macromorgan has quit [Read error: Connection reset by peer]

06:30 frieder has joined #dri-devel

06:42 itoral has quit [Remote host closed the connection]

06:42 itoral has joined #dri-devel

06:43 itoral has quit [Remote host closed the connection]

06:44 itoral has joined #dri-devel

06:46 yogesh_mohan has joined #dri-devel

06:47 mbrost has quit [Ping timeout: 480 seconds]

06:47 maxzor has joined #dri-devel

06:49 yogesh_m1 has quit [Ping timeout: 480 seconds]

06:50 itoral has quit [Remote host closed the connection]

06:50 itoral has joined #dri-devel

06:54 aravind has joined #dri-devel

06:59 jkrzyszt_ has joined #dri-devel

07:06 apinheiro has joined #dri-devel

07:16 tursulin has joined #dri-devel

07:16 naveenk2 has joined #dri-devel

07:28 MajorBiscuit has joined #dri-devel

07:33 icecream95 has joined #dri-devel

07:39 shashank_sharma has quit [Ping timeout: 480 seconds]

07:40 shashanks has joined #dri-devel

07:40 MajorBiscuit has quit [Ping timeout: 480 seconds]

07:44 shashank_sharma has joined #dri-devel

07:46 shashank_sharma has quit [Read error: Connection reset by peer]

07:47 shashank_sharma has joined #dri-devel

07:51 shashanks has quit [Ping timeout: 480 seconds]

07:51 MajorBiscuit has joined #dri-devel

07:55 lynxeye has joined #dri-devel

07:56 <marex> robertfoss: mripard[m]: do you think one of you can stamp this https://patchwork.freedesktop.org/patch/480504/ with your AB/RB , so I can go apply and grab my brown paper bag ?

08:03 <hakzsam> jekstrand: do you plan to update your vk_image RADV work or should I do?

08:06 ella-0 has joined #dri-devel

08:09 lemonzest has joined #dri-devel

08:10 ella-0_ has quit [Read error: Connection reset by peer]

08:11 OftenTimeConsuming has quit [Remote host closed the connection]

08:11 OftenTimeConsuming has joined #dri-devel

08:12 maxzor has quit [Remote host closed the connection]

08:13 maxzor has joined #dri-devel

08:18 elongbug has joined #dri-devel

08:22 Emantor has quit [Quit: ZNC - http://znc.in]

08:22 Emantor has joined #dri-devel

08:26 elongbug has quit [Ping timeout: 480 seconds]

08:27 lemonzest has quit [Quit: WeeChat 3.4]

08:44 <javierm> danvet: give me a few minutes to page all this in my head again. It been some time I wrote the email you answered and no longer remember the details :)

08:44 <javierm> *since I wrote

08:46 <danvet> yeah I'm doing the same right now

08:48 JohnnyonFlame has joined #dri-devel

08:50 pcercuei has joined #dri-devel

08:55 JohnnyonF has quit [Ping timeout: 480 seconds]

08:56 <javierm> danvet: Ok, I think that remembered the details. I'll answer in the list

09:02 <danvet> mripard[m], mlankhorst_ pls don't forget to roll branches forward so fixes don't get lost and drm-misc-next is in linux-next

09:08 Danct12 has quit [Remote host closed the connection]

09:10 guru_ has joined #dri-devel

09:11 rasterman has joined #dri-devel

09:13 oneforall2 has quit [Ping timeout: 480 seconds]

09:17 rasterman has quit [Quit: Gettin' stinky!]

09:19 <javierm> danvet: done, let me know what you think

09:23 rasterman has joined #dri-devel

09:24 <danvet> javierm, makes sense and I think a fix is fairly simple

09:24 <javierm> danvet: yup, I'll get another coffee and then try to write a patch

09:25 <danvet> javierm, ah if you want to get typing I'm fine with reviewing :-)

09:25 <javierm> danvet: great :)

09:25 <danvet> javierm, btw have you seen the patch from tzimmermann for offb?

09:25 <danvet> looks like that one is missing out on the sysfb fun

09:26 <javierm> danvet: yeah, had in my TODO but needed to refresh my memory in order to look at it. Now that I've all in cache I should do it

09:26 <danvet> javierm, oh for encapsulation probably needs a sysfb_try_unregister which takes the struct device

09:26 <danvet> and returns false if it's not the sysfb platform_dev

09:29 <javierm> danvet: but should be specific to sysfb ?

09:29 <javierm> danvet: for instance drivers/video/fbdev/vga16fb.c registers it's own pdev

09:30 <javierm> in it's module_init() handler

09:30 <javierm> but it's also info->flag |= FBINFO_MISC_FIRMWARE

09:31 <javierm> danvet: I tried to minimize that problem with 0499f419b76f ("video: vga16fb: Only probe for EGA and VGA 16 color graphic cards")

09:31 <javierm> but there may be other drivers that do the same

09:32 <javierm> that's why I think we need some global state in fbdev core that says "a DRM driver already probed, don't allow generic fbdev drivers to be registered anymore"

09:41 <javierm> danvet: actually, is more complicated than that... because you could for example probe a DRM driver for a small display but still want to allow FBINFO_MISC_FIRMWARE for your main display controller

09:43 <javierm> anyways, I'll answer in the list to avoid having this convo in two places

09:45 rasterman has quit [Read error: No route to host]

09:46 rasterman has joined #dri-devel

09:47 HankB_ has quit [Remote host closed the connection]

09:47 HankB_ has joined #dri-devel

09:53 HankB_ has quit [Remote host closed the connection]

09:53 oneforall2 has joined #dri-devel

09:54 HankB_ has joined #dri-devel

09:58 rkanwal has joined #dri-devel

09:58 guru_ has quit [Ping timeout: 480 seconds]

10:20 guru_ has joined #dri-devel

10:21 elongbug has joined #dri-devel

10:23 devilhorns has joined #dri-devel

10:24 oneforall2 has quit [Ping timeout: 480 seconds]

10:24 camus has quit [Ping timeout: 480 seconds]

10:27 <danvet> javierm, yeah we need the state to be in each platform sysfb

10:27 <danvet> so if vga16 has another one, then that also needs handling in there

10:33 maxzor has quit [Ping timeout: 480 seconds]

10:34 <danvet> javierm, anyway replied too, and added gregkh for more opinions

10:35 <danvet> javierm, I think I'll resend my series without those last two patches to get them unblocked

10:40 mclasen has joined #dri-devel

10:46 rkanwal has quit [Quit: rkanwal]

10:56 heat has joined #dri-devel

11:18 maxzor has joined #dri-devel

11:22 itoral has quit [Remote host closed the connection]

11:24 itoral has joined #dri-devel

11:26 itoral has quit [Remote host closed the connection]

11:27 itoral has joined #dri-devel

11:27 Thymo has quit [Quit: ZNC - http://znc.in]

11:30 itoral has quit [Remote host closed the connection]

11:30 itoral has joined #dri-devel

11:31 shashank_s has joined #dri-devel

11:38 shashank_sharma has quit [Ping timeout: 480 seconds]

11:48 slattann has quit []

11:54 adjtm has quit [Quit: Leaving]

11:58 thellstrom has joined #dri-devel

11:58 adjtm has joined #dri-devel

11:59 rkanwal has joined #dri-devel

12:05 maxzor has quit [Ping timeout: 480 seconds]

12:07 rkanwal has quit [Ping timeout: 480 seconds]

12:12 itoral has quit [Remote host closed the connection]

12:15 <javierm> danvet: sorry, got dragged into meetings. Yes, re-sending without those two makes sense to me and we can continue the discussion

12:18 Danct12 has joined #dri-devel

12:18 nchery has quit [Read error: Connection reset by peer]

12:23 shankaru has quit [Quit: Leaving.]

12:26 rkanwal has joined #dri-devel

12:30 rasterman- has joined #dri-devel

12:30 thellstrom has quit [Read error: Connection reset by peer]

12:32 thellstrom has joined #dri-devel

12:33 rasterman has quit [Ping timeout: 480 seconds]

12:41 crabbedhaloablut has joined #dri-devel

12:42 thellstrom has quit [Ping timeout: 480 seconds]

12:44 loki_val has quit [Ping timeout: 480 seconds]

12:46 icecream95 has quit [Ping timeout: 480 seconds]

12:51 maxzor has joined #dri-devel

12:53 loki_val has joined #dri-devel

12:54 crabbedhaloablut has quit [Ping timeout: 480 seconds]

12:54 lemonzest has joined #dri-devel

12:55 adjtm is now known as Guest1257

12:56 adjtm has joined #dri-devel

13:01 sdutt has joined #dri-devel

13:02 sdutt has quit []

13:02 sdutt has joined #dri-devel

13:03 Guest1257 has quit [Ping timeout: 480 seconds]

13:26 <javierm> danvet: added another option, this all feels as if the aperture and conflicting framebuffers removal is really a workaround and all this should be moved to the device model core

13:27 <danvet> javierm, yes

13:27 <javierm> that is expect that all DRM drivers need to call to request_mem_region() and have an option to force it and unregister any device that may already requested and overlapping aperture

13:27 <danvet> well request_mem_region is another thing

13:27 <danvet> in theory it should be used

13:28 <danvet> in practice the times of jumper controlled IO decoding is long past us, so there's no real incentive for drivers to use it

13:28 <javierm> danvet: yup, but if drivers used it could be a good indication of conflicting devices/drivers

13:28 <danvet> but yeah maybe we could integrate it a bit better with request_mem_region

13:29 <danvet> javierm, otherwise the removal flow is probably going to stay specific to graphics, due to the fw fb handover games

13:29 <danvet> there's some of that also going on for input devices, but that's all very platform specific

13:31 <javierm> danvet: yeah

13:31 alyssa has joined #dri-devel

13:32 <alyssa> building radeonsi for the first time

13:32 <alyssa> not a good feel

13:33 <javierm> danvet: I also had a question about offb, I don't think we can do it in sysfb since doesn't even use a pdev. It's not even a proper driver since doesn't use the device model

13:33 <javierm> danvet: I wonder if instead of Thomas' patch we should convert this to register a pdev driver, pdev and have a proper .probe

13:34 OftenTimeConsuming has quit [Remote host closed the connection]

13:34 <javierm> because his assumption that all fbdev are backed by a device sounds reasonable

13:34 OftenTimeConsuming has joined #dri-devel

13:36 <danvet> javierm, yeah longer-term, but as a quick regression fix the patch is more well contained

13:39 iive has joined #dri-devel

13:42 <javierm> danvet: yeah, agree

13:42 <javierm> danvet: every day I'm more happy that we are disabling all fbdev drivers since Fedora 36

13:45 <danvet> +1

13:48 shankaru has joined #dri-devel

13:58 bcheng has joined #dri-devel

13:58 jewins has joined #dri-devel

13:58 jernej has quit [Quit: Free ZNC ~ Powered by LunarBNC: https://LunarBNC.net]

13:59 jernej has joined #dri-devel

13:59 macromorgan has joined #dri-devel

14:10 <jekstrand> hakzsam: Oh, I can keep going. I din't have it fully passing yet. I didn't know if you thought I *should* keep going given that you seemed to already have it working.

14:11 <hakzsam> I still my old branch somewhere but rebasing will be annoying, I can do it if you want

14:12 rasterman- has quit []

14:12 rasterman has joined #dri-devel

14:13 kchibisov_ has quit []

14:14 kchibisov has joined #dri-devel

14:20 mbrost has joined #dri-devel

14:24 mlankhorst_ is now known as mlankhorst

14:24 <mlankhorst> danvet: can we find a new maintainer for drm-misc temporarily? I can do this cycle but expecting to be pretty busy next 6 months

14:24 bcheng has quit [Remote host closed the connection]

14:25 bcheng has joined #dri-devel

14:27 aravind has quit [Ping timeout: 480 seconds]

14:36 <Venemo> alyssa: I'd be interested in the "Explain why" part of your MR 15720

14:36 idr has joined #dri-devel

14:40 thellstrom has joined #dri-devel

14:43 <alyssa> Venemo: right, will write that now

14:43 <alyssa> It's not interesting to ACO fwiw

14:44 <Venemo> maybe so, but it still looks interesting

14:44 <alyssa> It doesn't work if you support geom/tess

14:44 <alyssa> which we don't

14:44 mszyprow has quit [Ping timeout: 480 seconds]

14:44 <Venemo> that's okay, it still seems like a neat idea

14:45 * alyssa types

14:45 <alyssa> TL;DR Mali doesn't have any hw support for transform feedback, I thought it did but it was totally wrong

14:46 <alyssa> Generically lowering to a compute shader works well enough and then the rest of the driver can forget about it

14:46 <alyssa> Possibly ditto for AGX? not sure what the geom/tess shader story is there

14:46 <Venemo> don't you still have to output stuff to the rasterizer too, though?

14:47 <alyssa> yes

14:47 <alyssa> so there's a perf cost, but.. sort of inevitable

14:47 <alyssa> well

14:47 <Venemo> so, you run both a CS and a VS?

14:47 <alyssa> under that MR, yes

14:47 <alyssa> I am not sure that's the right forward

14:47 <alyssa> but if we want a correct implementation for desktop GL, i don't think there's another choice

14:48 <Venemo> does the VS re-compute the output, or does it just read from the XFB buffer?

14:48 <alyssa> recompute on that MR, might be worth optimizing it

14:48 <Venemo> why not just keep the VS intact and store to the XFB buffer from the VS?

14:48 HankB_ has quit [Remote host closed the connection]

14:49 HankB_ has joined #dri-devel

14:52 shashanks has joined #dri-devel

14:54 bcheng has quit [Remote host closed the connection]

14:54 bcheng has joined #dri-devel

14:57 shashank_s has quit [Ping timeout: 480 seconds]

14:57 <idr> Venemo: If you don't have stream out hardware, getting things ordered correctly is probably somewhere between hard and impossible.

14:58 <idr> We thought about doing that on some older hardware that didn't have stream out, and we gave up on the idea pretty quickly.

14:58 <idr> That was years ago, and I don't remember all the details.

14:59 <Venemo> idr: well, don't you have to still solve the same problem for the CS, though?

14:59 <idr> Yes, but getting ordering between CS workers is much easier.

14:59 <Venemo> why?

15:00 <idr> In CS, each worker knows who it is. In VS, you don't know who you are.

15:00 <idr> At least, VS invocations couldn't know that when we were trying to do it.

15:00 shankaru has quit [Quit: Leaving.]

15:00 <Venemo> idr: I guess that depends on the HW capabilities

15:01 <idr> From a CS, I suspect it would also be easier to structure it so that each worker processes an entire primitive. I think that could make some things a little easier.

15:01 <idr> But I haven't looked at the MR.

15:01 <Venemo> for sure, API VS doesn't do this, but if the HW is capable, you can lower your XFB VS to use whatever the HW can do

15:02 Haaninjo has joined #dri-devel

15:06 <alyssa> idr: basically that

15:07 rkanwal has quit [Remote host closed the connection]

15:07 <alyssa> right now i'm just trying to replicate what panfrost had before

15:08 rkanwal has joined #dri-devel

15:09 <alyssa> oh ffs

15:13 pcercuei has quit [Quit: Lost terminal]

15:15 <alyssa> look I don't know what to tell you guys

15:15 <alyssa> I just want to land mali-g57 support

15:16 pcercuei has joined #dri-devel

15:16 <alyssa> panfrost's xfb implementation is totally broken, g57 won't be able to use it, might as well rewrite to be somewhat less broken

15:16 <alyssa> the CS approach is strictly less broken and opens the door to fixing harder

15:17 <alyssa> but it's still broken... indexed draws, strips/fans/quads/polygons, some details about bounds checking, this stuff still doesn't work

15:17 <alyssa> but at least with CS there's a way to make it work, with the nonsense "just alias the XFB buffer with an internal varying buffer" approach there's not

15:18 <alyssa> it won't work for GS/TS, guess what the hw don't support those

15:18 gawin has joined #dri-devel

15:18 <alyssa> mali is an es2 part at heart

15:19 jkrzyszt_ has quit [Ping timeout: 480 seconds]

15:21 <alyssa> idr: anything clever wrt primitive processing gets stomped over primitive restart

15:22 <alyssa> but sure throw util_draw_vbo_without_prim_restart in there as well

15:28 libv has joined #dri-devel

15:34 aravind has joined #dri-devel

15:34 bcheng has quit [Remote host closed the connection]

15:35 bcheng has joined #dri-devel

15:35 <alyssa> The MR provides an impl of xfb that's good enough to pass the GLES3.1 CTS.

15:35 <alyssa> Assuming ~nothing about the hardware other than compute support.

15:35 <alyssa> Playing nice with tilers that use bin shaders and have weird requirements around them

15:36 <alyssa> and is open for future extension to support full big GL xfb (per-primitive, etc)

15:38 <ajax> so i have this: https://paste.centos.org/view/raw/45d1382c

15:39 <ajax> and when i use zink and resize glxgears i get:

15:39 <ajax> glxgears: ../src/vulkan/runtime/vk_log.c:41: vk_object_to_device: Assertion `obj->device' failed.

15:40 <ajax> even though x11_swapchain has a vk_object_base as its first member, so, what am i missing?

15:41 <alyssa> ..and I need Midgard compiler changes. joy.

15:42 <alyssa> so much for no hw assumptions

15:45 <alyssa> wait maybe I don't

15:46 Duke`` has joined #dri-devel

15:47 gawin has quit [Ping timeout: 480 seconds]

15:55 maxzor has quit [Ping timeout: 480 seconds]

15:55 <Venemo> alyssa: thanks for your thoughts, I didn't mean to frustrate you, was merely curious

15:56 <Venemo> alyssa: if you can solve the ordering problem in CS but it's not solveable in VS, then for sure this approach does make sense for those drivers

16:00 nchery has joined #dri-devel

16:04 bcheng has quit [Remote host closed the connection]

16:06 naveenk2 has quit []

16:10 bcheng has joined #dri-devel

16:15 thellstrom1 has joined #dri-devel

16:15 thellstrom has quit [Read error: Connection reset by peer]

16:16 shankaru has joined #dri-devel

16:18 libv has quit [Remote host closed the connection]

16:24 mclasen has quit []

16:24 mclasen has joined #dri-devel

16:24 <jekstrand> Ugh... C's undefined behavior rules are making it nearly impossible to write ubsan-safe tests for timespec_add with overflow detection. :-(

16:25 <alyssa> :|

16:25 heat has quit [Remote host closed the connection]

16:26 heat has joined #dri-devel

16:35 <cheako> I'm testing flatpak today, does anyone know how to work with `org.freedesktop.Platform.GL.default`? Seems like an embeded copy of mesa, if true I'll need to know how to replace it.

16:36 aravind has quit [Ping timeout: 480 seconds]

16:38 jhli has quit [Quit: ZNC 1.8.2 - https://znc.in]

16:40 jhli has joined #dri-devel

16:41 camus has joined #dri-devel

16:47 lynxeye has quit [Ping timeout: 480 seconds]

16:49 MajorBiscuit has quit [Ping timeout: 480 seconds]

16:50 aravind has joined #dri-devel

16:58 aravind has quit [Ping timeout: 480 seconds]

16:59 garrison has quit [Remote host closed the connection]

17:00 <Venemo> cheako: Flatpak ships its own version of mesa. You should ask the details in their channel

17:00 i-garrison has joined #dri-devel

17:05 lynxeye has joined #dri-devel

17:05 jhli_ has joined #dri-devel

17:05 jhli_ has quit [Remote host closed the connection]

17:05 jhli_ has joined #dri-devel

17:07 <jekstrand> Of course I can't repro the lavapipe failures. Why would we want a SW rasterizer to be deterministic? *sigh*

17:08 jhli_ has quit []

17:09 <ajax> fast or deterministic, pick any one

17:11 shankaru has quit [Quit: Leaving.]

17:14 frieder has quit [Remote host closed the connection]

17:20 ybogdano has joined #dri-devel

17:23 thellstrom has joined #dri-devel

17:25 jhli_ has joined #dri-devel

17:25 jhli_ has quit []

17:28 thellstrom1 has quit [Read error: Connection reset by peer]

17:28 thellstrom has quit [Read error: Connection reset by peer]

17:29 dwlsalmeida has joined #dri-devel

17:29 thellstrom has joined #dri-devel

17:29 jhli has left #dri-devel [#dri-devel]

17:31 jhli has joined #dri-devel

17:34 camus has quit [Ping timeout: 480 seconds]

17:34 <alyssa> ajax: "or"

17:35 devilhorns has quit []

17:38 <ajax> i suppose a swrast whose only blend operation was | would be both fast and deterministic

17:40 LexSfX has quit [Ping timeout: 480 seconds]

17:40 LexSfX has joined #dri-devel

17:43 <danvet> javierm, just realized: to avoid regressions we only need to handle the platform_dev in sysfb.c

17:43 <danvet> and keep the platform_unregister for vga16fb (which has races, but meh)

17:43 <danvet> so not much needed to just unblock the revert

17:44 gouchi has joined #dri-devel

17:44 <jekstrand> zmike, airlied: Does lavapipe have any real concurrency between draw/compute jobs? Like can two small draws run in parallel?

17:45 <ajax> that was the "scene overlap" patch series from a bit ago iirc?

17:45 <zmike> I think that might have been reverted?

17:45 <zmike> I can't keep track of it

17:45 <jekstrand> Because we do nothing whatsoever for semaphore waits and I suspect that's the problem with this CI run.

17:46 <zmike> yeah that's probably not good

17:46 <ajax> well there's your answer, fishbulb

17:46 <zmike> I guess I need to reread common sync to see what entrypoint I missed there :/

17:49 <jekstrand> I found what old lavapipe was doing. It was calling wait_semaphores(). I missed that

17:49 <jekstrand> Just needed to ad a vk_sync_wait_many() call to the start of lvp_queue_submit().

17:57 anholt_ has joined #dri-devel

17:58 ngcortes has joined #dri-devel

18:00 <jekstrand> Ok, let's see if that fixes the test fails. Now I just have to figure out how to get my timespec_add overflow tests to pass ubsan. :-/

18:02 anholt has quit [Ping timeout: 480 seconds]

18:17 slattann has joined #dri-devel

18:18 thellstrom1 has joined #dri-devel

18:23 thellstrom has quit [Ping timeout: 480 seconds]

18:29 <javierm> danvet: yeah, not registering the platform dev in sysfb when a DRM driver already exists will cover most of the cases

18:35 mszyprow has joined #dri-devel

18:38 mszyprow has quit [Remote host closed the connection]

18:38 mszyprow has joined #dri-devel

18:47 mszyprow has quit [Ping timeout: 480 seconds]

18:52 <emersion> jekstrand: regular ping about That Kernel Patch for syncobj :)

18:54 <jekstrand> emersion: Yeah. It's blocking on a series from Christian König. I should go read those patches again.

19:00 slattann has quit []

19:08 pcercuei has quit [Quit: brb]

19:09 pcercuei has joined #dri-devel

19:16 rkanwal has quit [Quit: rkanwal]

19:25 mdnavare has quit []

19:26 mdnavare has joined #dri-devel

19:36 illwieckz has quit [Remote host closed the connection]

19:37 ybogdano has quit [Remote host closed the connection]

19:39 ybogdano has joined #dri-devel

19:58 shankaru has joined #dri-devel

20:03 heat_ has joined #dri-devel

20:03 heat has quit [Read error: Connection reset by peer]

20:07 i-garrison has quit [Remote host closed the connection]

20:08 i-garrison has joined #dri-devel

20:08 lynxeye has quit [Quit: Leaving.]

20:09 MajorBiscuit has joined #dri-devel

20:11 <danvet> emersion, syncobj? not dma-buf fence import/export?

20:11 <emersion> err yeah

20:11 <danvet> or is this about the vk syncobj stuff

20:11 <emersion> that one

20:11 <jekstrand> Well, once you have a sync_file, you can stuff it in a syncobj if you want.

20:11 <emersion> no syncobj, just good old sync_fence

20:11 <emersion> yup

20:11 <emersion> sync_file*

20:12 <jekstrand> Ooh, we should probably add timeline support to the sync_file import/export API...

20:12 <jekstrand> Export will fail if the point hasn't materialized, of course.

20:12 <emersion> i workaround this by converting to binary syncobj

20:12 <emersion> then inserting that binary syncobj into the timeline

20:12 <jekstrand> emersion: I'll try to pick that all back up once the big dma_resv rework lands.

20:12 <emersion> cool!

20:13 <jekstrand> Then maybe we can make the world of synchronization suck a bit less. :)

20:13 ybogdano has quit [Ping timeout: 480 seconds]

20:14 <danvet> emersion, how do you plan to use it? together with a vk renderer?

20:14 <emersion> danvet: i need it for https://gitlab.freedesktop.org/wlroots/wlroots/-/merge_requests/3282

20:15 <emersion> i've only implemented the GL side of things, but Vk should be pretty much "implement the missing hooks and you're good to go"

20:15 alyssa has left #dri-devel [#dri-devel]

20:16 <danvet> well is there an api in gl where you can import a dma-buf and it wont automatically do some funny version of implicit sync?

20:16 <danvet> but I guess this is for proto support, so you don't really care about oversync in that case I guess

20:17 <emersion> daniels might know better what exactly happens under the hood in the GL world

20:17 <danvet> implicit sync on everything the driver sends to the gpu

20:17 <danvet> flush out with glflush

20:17 <danvet> afaiui

20:18 <emersion> well that's the story when there's no wait on a fence

20:18 <emersion> i wonder if the driver skips implicit sync when there's a wait

20:18 <danvet> nah, those are on top

20:18 <emersion> eh

20:18 <emersion> at least we have the KMS side of things, where i think setting the FENCE_FD will make the driver ignore the implicit syncfile?

20:19 <danvet> at least with the drivers I know of

20:19 <jekstrand> danvet: Yeah, there's an explicit sync GL extension

20:19 <danvet> yeah, setting the fd should ignore implicit sync for kms

20:19 <jekstrand> Or EGL or something

20:19 <danvet> jekstrand, oh so you can tell it to not ever do implicit sync?

20:19 <jekstrand> danvet: Yeah, there's a thing. I don't remember what it's called off-hand

20:19 <emersion> i haven't found a gloabl "no implicit sync plz" switch

20:19 <danvet> pretty sure most drivers suck at implementing it :-)

20:20 <danvet> like amdgpu just cant

20:20 <emersion> EGL_KHR_fence_sync allows importing a sync_file

20:20 <danvet> and last time I looked at iris it didn't look like it does either

20:20 <emersion> EGL_KHR_wait_sync allows to wait

20:20 <danvet> yeah import/export is all fine

20:21 <emersion> jekstrand: but i don't recall a setting to disable implicit sync

20:21 <danvet> but if you pull the fences out of an implicit sync'ed buffer and expect to not stall after those completed

20:21 <jekstrand> EGL_EXT_image_implicit_sync_control

20:21 <danvet> probably a disappointment

20:21 <daniels> https://www.khronos.org/registry/EGL/extensions/EXT/EGL_EXT_image_implicit_sync_control.txt does exist, but there wasn't much point to it

20:21 <emersion> oh

20:21 <emersion> what do you mean?

20:21 <daniels> oh, the extension is fine

20:21 <daniels> it's just at the time, no-one was really meaningfully implementing explicit sync on Linux

20:21 <emersion> is it supported by mesa?

20:21 <danvet> yeah without dma-buf export/import it's somewhat useless

20:21 <daniels> the support wasn't typed out through winsys etc

20:22 <daniels> Mesa doesn't support it no, because at the time I only had Intel hardware, and i915's explicit-sync bypass was per-BO rather than per-CS

20:22 <emersion> 😥

20:23 <danvet> emersion, I guess wire it up, create a reason for mesa to implement it

20:23 <jekstrand> daniels: What do you mean by per-CS?

20:23 <daniels> I'm all for doing explicit sync properly now that people are interested :)

20:23 <danvet> daniels, isn't the spec per dma-buf import?

20:23 <daniels> danvet: it's per-EGLImage, of which you can have multiple

20:23 <emersion> yeah, i guess it's pretty simple to use after all

20:23 <danvet> so add flag to the iris_bo, make sure you don't sync in each execbuf that uses it?

20:24 <jekstrand> I'm pretty sure someone wired it up for i965

20:24 <jekstrand> Not sure if we ever cared for iris

20:24 <daniels> yeah, there is a per-intel_bo flag on i965

20:24 <danvet> jekstrand, git grep says no

20:24 <danvet> or I'm grepping the wrong thing

20:25 <jekstrand> danvet: I'm not finding it either

20:25 <emersion> yeah, grepping mesa just reveals khronos headers and XML

20:25 <jekstrand> And my memory of this is pre-GitLab so no MRs....

20:25 <jekstrand> Oh, hey, look! Patches from LFRB from 2017

20:25 <danvet> daniels, not finding anything here?

20:26 <jekstrand> Also, https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4205

20:27 <jekstrand> Including a recorded IRC conversation about why the extension is bogus.

20:27 <jekstrand> Awesome

20:27 <jekstrand> emersion: So, if you want to tilt at this particular windmill, I think !4205 contains all the information you need to get started.

20:27 <daniels> oh nice, Eric rescued it and added iris

20:28 <emersion> hmm

20:28 <daniels> yeah, I'd forgotten about the EGLImage thing as well - making it a per-dmabuf rather than per-surface attr would make some sense I think

20:28 <emersion> though i don't think it matters for my wlroots patchset

20:29 <emersion> i _always_ get a sync_file, thanks to jekstrand's kernel patch

20:29 <emersion> so i can unconditionally always disable implicit sync

20:29 <emersion> (without the kernel patch, i'll just fallback to implicit sync)

20:29 <danvet> jekstrand, while we discuss this, I just realized there's some entertainment with dma_resv_usage flags and the i915 execbuf flags

20:30 <danvet> atm ASYNC means "ignore the write fence"

20:30 <danvet> but we still set read fences

20:30 <daniels> emersion: oh yeah, that's a really good point!

20:30 <danvet> should ASYNC also mean "don't set a USAGE_READ fence, only a USAGE_BOOKKEEPING"?

20:30 <daniels> emersion: yet another reason to want that uAPI :)

20:30 <danvet> or would that like break the world

20:30 <danvet> emersion, \o/

20:31 <jekstrand> danvet: Hrm...

20:31 <danvet> emersion, the caveat with that MR might be that Kayden added support for separate ctx iirc

20:31 <daniels> emersion: so are you just doing sync_file for now, or looking at syncobj again?

20:31 <jekstrand> Yeah, ideally, I think ASYNC should mean BOOKKEEPING and we should do WRITE on EXEC_OBJECT_WRITE and READ otherwise.

20:31 <danvet> so just blindly setting ASYNC might break iris now, but I thought explicit sync is rolled out

20:31 <danvet> just more to check

20:31 <emersion> daniels: hm no i'm doing syncobj timelines, but using sync_file to import to various APIs

20:32 <jekstrand> danvet: Whether or not that'll break the universe, I don't know. I don't think it will but it would be a bit backwards incompatible.

20:32 <danvet> jekstrand, I guess we could just yolo and see what happens

20:32 <emersion> KMS, EGL, etc

20:32 <daniels> emersion: awesome :)

20:32 <danvet> jekstrand, in drm it's only a regression when someone sees the tree fall

20:32 turol has quit [Ping timeout: 480 seconds]

20:32 <jekstrand> danvet: That sounds a bit too much like Vulkan for my comfort. :P

20:32 <danvet> we're all gpu driver people, same disease I guess :-P

20:33 <danvet> but yeah maybe better to add a ASYNC_FOR_REAL flag

20:34 <jekstrand> Yeah

20:34 turol has joined #dri-devel

20:34 <danvet> I'm still tempted to just whack out a bunch of patches and see what happens

20:34 <danvet> könig might have simply done it

20:34 <jekstrand> Yeah, we've only go 8 EXEC_OBJECT bits used, we can burn another.

20:34 <danvet> well not for i915 because that's a giantic mess

20:35 <danvet> emersion, ^^ so maybe a bit more typing for the most glorious of futures

20:35 <Kayden> revenge of the explicit sync?

20:36 <jekstrand> danvet: If you want to type some patches for doing timelines etc. with just dma-buf import/export, v3dv needs "real" timeline support. :)

20:37 gawin has joined #dri-devel

20:37 <daniels> Kayden: this time for sure

20:39 <danvet> jekstrand, so does panfrost.ko, but they both use my drm/sched stuff so really might just write one helper and done

20:40 <danvet> jekstrand, on the in-fence side all drivers with implicit control completely ignore all fences

20:40 <danvet> so I think we could/should dare this on the out-fence side too with USAGE_BOOKKEEPING

20:41 <danvet> and for added fun, across the board

20:41 <danvet> i915 is a mess, but etnaviv, msm, lima all use drm/sched

20:41 <danvet> panfrost&v3d lack implicit control

20:41 <bnieuwenhuizen> FWIW for amdgpu I'm also targetting BOOKKEEPING, once I get it to work at least :)

20:42 <danvet> amdgpu is max entertainment still

20:42 <danvet> bnieuwenhuizen, yeah that's how it's all should work

20:42 <danvet> bnieuwenhuizen, and I guess import/export ioctl from jekstrand to control USAGE_READ/WRITE with radv?

20:42 <bnieuwenhuizen> yeah

20:42 <danvet> awesome

20:42 <bnieuwenhuizen> just trying to get implicit sync disabled first though

20:42 <bnieuwenhuizen> have some patches that should do it, but no evidence of explicit sync yet ...

20:42 <danvet> amgpu might actually leapfrog everyone else here from most bonkers implicit sync to "copypaste this pls"

20:43 <bnieuwenhuizen> still some rough edges on the dma-resv series as well (e.g. does the dma-buf poll syscall ignore BOOKKEEPING fences?)

20:44 <danvet> they should

20:44 <danvet> poll is about implicit sync

20:44 <danvet> like all BOOKKEEPING every holds up is the release of backing storage

20:44 <bnieuwenhuizen> agree, just an example of random bits that weren't quite hooked up

20:44 <danvet> which should be purely a kernel problem

20:44 <danvet> oh you mean the current code doesn't ignore BOOKKEEPING?

20:44 <daniels> danvet: panfrost has new uapi on the way anyway

20:45 <danvet> daniels, does it get all this stuff right?

20:45 <bnieuwenhuizen> daniels: wasn't even properly converted to usage flags

20:45 <bnieuwenhuizen> er, danvet

20:45 <danvet> bnieuwenhuizen, hm yeah I guess there's going to be a lot of that all over until it's sorted out

20:46 <bnieuwenhuizen> that said, IMO best way to get stuff like this stamped out is by testing & playing with it

20:46 <danvet> probably a few follow up patch sets and full on tree audit

20:46 <danvet> bnieuwenhuizen, yeah

20:46 <daniels> danvet: yep!

20:46 * danvet adds more reasons why i915 vm_bind cannot suck

20:47 MajorBiscuit has quit [Ping timeout: 480 seconds]

20:49 <anholt_> anyone able to take a look at https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15332 ? even r-bs on components would be nice

20:53 heat has joined #dri-devel

20:53 heat_ has quit [Read error: Connection reset by peer]

20:54 MajorBiscuit has joined #dri-devel

20:55 <gawin> is there a chance for CI for older intel platforms? currently testing is a painful experience (each gen is "big gen")

20:58 <airlied> gawin: there is some optional systems for g41/haswell

20:58 <airlied> gawin: how old do you want?

20:58 <airlied> there is also the intel mesa CI system which has some older ones

21:00 illwieckz has joined #dri-devel

21:00 Duke`` has quit [Ping timeout: 480 seconds]

21:02 <gawin> oh, it must be relatively new (don't remember seeing this)

21:02 <airlied> gawin: though anholt is moving so not sure of thier status at the moment

21:03 <danvet> robclark, poke for that legacy cursor testing on msm

21:04 nchery has quit [Ping timeout: 480 seconds]

21:06 <robclark> danvet: maybe abhinav__ can? I'm pretty sure CrOS userspace still uses legacy cursor

21:07 <danvet> robclark, oh that's why we should test whether I didn't put a stall in there accidentally

21:07 <jekstrand> karolherbst: test_basic readimage is failing on iris. Is this known and unfixed? Or do I have an old branch?

21:07 <danvet> robclark, at least for msm

21:07 <karolherbst> jekstrand: I never ran the CTS on iris, so no idea what fails or not :)

21:08 <karolherbst> but let's see what breaks

21:08 <karolherbst> jekstrand: mhh, passes for me. What branch are you on?

21:08 <danvet> abhinav__, https://lore.kernel.org/dri-devel/20220331152021.2671937-1-daniel.vetter@ffwll.ch/ <- patch I'd need a cros msm tested-by on pls

21:08 <jekstrand> karolherbst: Your latest rusticl/wip branch

21:08 <jekstrand> karolherbst: It crashed but I fixed that

21:08 <karolherbst> ohh wait... I selected CPU

21:08 * jekstrand doesn't have llvmpipe built in this worktree

21:09 <karolherbst> ahh yeah, it's a crash inside the backend compiler, right, I think I saw that once

21:09 <jekstrand> karolherbst: Ok, I've got a fix for that

21:09 <karolherbst> cool

21:10 <jekstrand> karolherbst: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15758

21:10 <anholt_> airlied: as mentioned in #crocus: g41 is back, hsw is in a box somewhere and I haven't found it. also, i915g is back too

21:10 <jekstrand> karolherbst: One line. :)

21:10 <jekstrand> karolherbst: But now it fails. I'll debug that next

21:11 <anholt_> nouveau CI is waiting on geothermal workers to be out of the area, since it's a bit more work to set up. also, nouveau CI is still a disaster so it's not super useful.

21:11 <karolherbst> jekstrand: wouldn't be surprised if iris assumes texture == sampler

21:11 <jekstrand> karolherbst: It doesn't

21:11 <karolherbst> ahh, cool

21:12 <jekstrand> karolherbst: It is telling me it has no samplers and no binding tables. :-/

21:12 <karolherbst> yep

21:12 <karolherbst> it's txf alright

21:14 <karolherbst> ehh wait, it's txs... mhh

21:14 <jekstrand> Kayden: Is there some reason we're not setting BindingTableEntryCount in GPGPU_WALKER->INTERFACE_DESCRIPTOR_DATA?

21:15 <gawin> anholt_: do I need something to trigger a run? (I don't have haswell, so it's gonna be helpful)

21:15 <anholt_> you click the play button in the pipeline.

21:15 <karolherbst> jekstrand: mhh weird.. I am luxmark works, so not sure if that test is doing something edgy

21:15 <karolherbst> although that's just write images afaik

21:16 <karolherbst> if at all

21:16 nchery has joined #dri-devel

21:17 <karolherbst> jekstrand: btw, I am considering moving towards iris for regression testing

21:17 <karolherbst> weird..

21:17 <karolherbst> ./build/test_conformance/images/kernel_read_write/test_image_streams passes here on iris as well

21:17 <karolherbst> well

21:17 gouchi has quit [Remote host closed the connection]

21:17 <karolherbst> partly

21:17 <karolherbst> mirroring is broken as it is on llvmpipe

21:18 <karolherbst> or something else

21:18 <jekstrand> I'm sure it's something dumb

21:20 MajorBiscuit has quit [Ping timeout: 480 seconds]

21:22 <jekstrand> zmike: Ok, CI is finally happy with the lavapipe MR. Changes since you reviewed: 1) Fixed ubsan issues with the timespec_add_nsec tests. 2) vk_sync_wait_many() for wait semaphores in lvp_queue_submit. 3) Delete some fails for lavapipe-asan test run because they no longer fail.

21:22 <zmike> I like the sound of all these things

21:22 shankaru has quit []

21:23 <jekstrand> zmike: Want to look again or should we assign marge?

21:23 <jekstrand> We really should fix the asan issues one of these days. Most of them seem to just be memory leaks.

21:23 <zmike> jekstrand: did you do a full cts run?

21:23 <jekstrand> zmike: No, I can, though.

21:23 <zmike> please do since otherwise I'll be doing it here

21:23 <zmike> slower

21:25 <anholt_> reminder for helping with slow lvp cts runs: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14999

21:25 libv has joined #dri-devel

21:25 <gawin> stupid me, there's something like "Job dependencies"

21:26 <jekstrand> anholt_: dang!

21:27 <airlied> anholt_: will look at it now

21:27 <anholt_> thanks!

21:27 <jekstrand> zmike: Running. Time to go turn my thermostat down to counter-act the space heater on my desk.

21:27 <zmike> just accept the warmth into your soul

21:28 <jekstrand> zmike: My soul has to be kept below -10C at all times.

21:29 <zmike> your electric bill must be hefty

21:31 <jekstrand> With all these lavapipe runs? Yeah, probably. :P

21:32 * airlied tries to only do lvp runs when solar is going :-P

21:34 <jekstrand> karolherbst: Surfaces and bindings look right. More digging!

21:37 <karolherbst> jekstrand: there are quite some regression with iris, mhh, but I fear that some/most is due to USE_HOST_PTR :/

21:38 <jekstrand> karolherbst: USE_HOST_PTR?

21:38 <karolherbst> jekstrand: userptr

21:38 <karolherbst> essentially

21:38 <karolherbst> just crappy and stupid

21:38 <karolherbst> soo CL allows you to back up cl_mem with a given pointer, but there are literally no requierements the client has to make sure of

21:39 <karolherbst> so if the driver needs page aligned pointers? too bad

21:39 <karolherbst> the runtime has to work it out

21:39 <jekstrand> karolherbst: I'm not seeing iris_resource_from_user_memory being called

21:39 <karolherbst> yes, and it fails

21:39 <karolherbst> or should

21:39 <jekstrand> karolherbst: It's not being called in this test

21:39 <karolherbst> ahh, then it's something else there :)

21:40 <karolherbst> soo.. CTS run completes in 7 minutes with iris here, nice

21:40 <karolherbst> takes roughly 25 minutes with llvmpipe

21:40 <karolherbst> let me see where iris regresses

21:40 <Kayden> jekstrand: No reason. We should be setting that. anv does, iris forgot I guess

21:42 <jekstrand> Kayden: Ok, I'll send a patch then.

21:42 <karolherbst> jekstrand: ohh.. CL also allows userptr for images :) "rc/gallium/drivers/iris/iris_resource.c:1178: iris_resource_from_user_memory: Assertion `templ->target == PIPE_BUFFER' failed."

21:42 <jekstrand> karolherbst: !

21:42 <karolherbst> we should think about something to support this for drivers not allowing that

21:43 <Kayden> I don't see why iris couldn't support that

21:43 <karolherbst> Kayden: what about random pointers (no alignment)?

21:43 <karolherbst> although CL allows handing in an alignment for images I think

21:44 <Kayden> I think we'd need cacheline (64B) alignment

21:44 <Kayden> for linear, or tile-size for tiled

21:44 <Kayden> but presumably that's linear

21:44 <karolherbst> what about buffers? I know that i915 (kernel) rejects anything which isn't page aligned

21:47 <karolherbst> anyway... most regressions are indeed because of crappy USE_HOST_PTR support :)

21:48 <karolherbst> Kayden: do you have an idea how we can properly support host pointers by lifting the page alignment requierement or is that something we have to deal with?

21:48 <jekstrand> karolherbst, Kayden: Looks like iris depends on textures_used, a 32-bit bitfield. That's not going to fly for CL.

21:48 <karolherbst> jekstrand: so, what clover is doing to support it, that is tries with resource_from_user_memory and if that fails, it crates a shadow buffer on the GPU and syncs it at certain points

21:48 <karolherbst> jekstrand: airlied has patches for that

21:49 <karolherbst> _but_ I think you don't need as many textures in CL, just many samplers

21:49 MajorBiscuit has joined #dri-devel

21:50 <jekstrand> karolherbst: I'm still digging through how iris handles samplers. I don't know that it's applicable to this particular test but you may be right that it assumes they're linked.

21:50 <jekstrand> The compiler doesn't but iris might in its binding table setup somewhere.

21:50 <karolherbst> yeah.. wouldn't surprise me

21:50 <karolherbst> does gallium nine runs on iris?

21:50 <Kayden> yes

21:50 <karolherbst> mhh

21:51 <karolherbst> ehh max_read_image_args is indeed 128 :( *sigh*

21:51 <karolherbst> jekstrand: anyway, it works with llvmpipe, and I didn't bumb the numbers there

21:51 <Kayden> karolherbst: I think the page alignment thing is a i915 restriction

21:51 <karolherbst> it's still on my todo list, but not that important atm

21:52 <karolherbst> Kayden: okay, glad to hear

21:52 <Kayden> I don't think the HW/iris should mind it being 64B

21:52 <karolherbst> I _reall_ don't want to deal with shadow buffers if I don't have to

21:52 <karolherbst> *_really_

21:53 <jekstrand> Yeah, we can get around the restriction

21:53 <jekstrand> Just pad out to a page on either side.

21:53 <Kayden> I don't think iris requires textures and samplers to be linked. it shouldn't, anyway, but there may be some stupidity in need of purging

21:53 <karolherbst> jekstrand: how?

21:53 <jekstrand> We know that whole pages exist. You can't have a partial page.

21:53 <Kayden> not aware of it being an issue

21:53 <karolherbst> I mean.. you can offset internally, sure

21:53 <karolherbst> if that's what you mean?

21:53 <jekstrand> So we can userptr the set of pages containing the address range. Then offset as needed.

21:53 <jekstrand> It'd be super annoying but we could do it.

21:53 <karolherbst> okay

21:53 <Kayden> right

21:53 <karolherbst> better than shadow buffers

21:53 <Kayden> re textures_used...yeah that'd need to get fixed

21:55 <karolherbst> jekstrand: there are some patches here: https://gitlab.freedesktop.org/airlied/mesa/-/commits/llvmpipe-cl-img-wip-2

21:55 <karolherbst> shader_info: stuff

21:56 * airlied has gone back and forward on how much those patches are needed

21:56 <karolherbst> airlied: yeah.. but we need 128 read images :(

21:56 <airlied> yes we do, but I'm not sure those patches are required

21:56 <karolherbst> ahh

21:56 <airlied> the use of those bitfields is quite a mess

21:56 <airlied> in some cases they seem only to be useful for GLSL type things

21:56 <jekstrand> Yeah...

21:57 <karolherbst> I see

21:57 <airlied> I dig in a few times get lost and run away

21:57 <jekstrand> I generally don't like textures_used

21:57 <karolherbst> shouldn't be all textures inside a shader be used? :P

21:57 <jekstrand> It's not used in Vulkan at all

21:58 <airlied> yeah it's very inconsistent

21:58 <jekstrand> We do need some way to set up the binding table in iris.

21:58 <jekstrand> Which means we at least need to know how many images and textures there are.

21:58 <karolherbst> you still have the texture vars, no?

21:58 <karolherbst> ehh samplers in gl

21:59 <jekstrand> Maybe? They may be eliminated by then.

21:59 <karolherbst> mhh

21:59 <jekstrand> The way all this stuff works with gl/gallium is very different from VK

21:59 <jekstrand> And I'm not convinced CL fits with the gallium model well.

21:59 <karolherbst> indirects are also annoying :(

21:59 <jekstrand> In some ways, I guess it fits fine.

22:00 <jekstrand> karolherbst: Actually, indirects aren't hard if we do it right.

22:00 <karolherbst> gallium is a bit too gl centric here, yes

22:00 <karolherbst> but nine has the same problem, no?

22:00 <karolherbst> jekstrand: yeah, CL doens't have those anyway

22:00 <karolherbst> well I guess we could optimize to indirects if that means dropping some code, but I don't think there is a nice way of actually doing indirects

22:01 <karolherbst> I think CL apps have to create arrays and shit

22:01 <jekstrand> My image handling pass already does indirects

22:01 <karolherbst> cool

22:01 <jekstrand> Like it'll handle it fine if you store a bunch of image pointers in an array and indirect the array.

22:01 <jekstrand> Not sure if that's legal in CL but we'd handle it.

22:01 <jekstrand> Assuming you're using the thing I wrote for clover

22:02 <karolherbst> I am sure it's legal in CL and the only way apps can do that

22:02 <Kayden> The thing about textures_used, is that at least in GL, you can bind a whole bunch of images or samplers, and the current shader may not actually use all the bindings

22:02 <Kayden> so you set-intersect and then only care about things that are both bound and referred to

22:02 <karolherbst> Kayden: sure.. but why would you care about what's bound?

22:03 <karolherbst> or can you not bind a sampler/texture and GL has to make sure it's not crashing?

22:03 <jekstrand> For binding table setup, iris doesn't. It only uses that to figure out which things are referenced and compact the table down.

22:04 ahajda_ has joined #dri-devel

22:04 ahajda has quit [Read error: Connection reset by peer]

22:04 <karolherbst> mhhh

22:04 <Kayden> back in a while

22:04 <karolherbst> GL has a strict indexing, right? you can't just reorder, or can you?

22:05 <karolherbst> although...

22:05 <jekstrand> You can do whatever you want in the driver as long as the client's bindings show up

22:06 <karolherbst> okay.. so you essentially just need to know how many textures there are or does texture_used has any benefits on top?

22:07 <karolherbst> like could you just DCE some, reindex and just move on?

22:07 MajorBiscuit has quit [Quit: WeeChat 3.4]

22:07 <jekstrand> Yeah, stuff can get DCEd

22:08 <karolherbst> okay

22:08 <jekstrand> zmike: CTS run looks as good as any other I've done

22:09 <karolherbst> jekstrand: anyway.. did you get the test to pass?

22:09 <zmike> jekstrand: 👍

22:11 <jekstrand> karolherbst: Not yet.

22:11 <jekstrand> zmike: ship it?

22:11 <zmike> 🚢 🚢 🚢

22:13 <karolherbst> jekstrand: does something in the stack do something funny with sampler vars?

22:14 <jekstrand> karolherbst: Good question

22:14 <karolherbst> "decl_var uniform INTERP_MODE_NONE sampler @2 (2, 8, 0)" I see that and I could imagine that the driver_location can mess things up

22:14 <jekstrand> karolherbst: It's using txl, not txf so it does use a sampler

22:14 <karolherbst> or well.. the location

22:14 fxkamd has quit []

22:14 <karolherbst> I still bind the sampler at index 0 though

22:14 <karolherbst> so I might have to reindex the sampler vars before passing it into drivers

22:15 <jekstrand> It's using the sampler at index 0 but maybe it's not getting bound?

22:15 * jekstrand looks

22:15 * jekstrand feels like he's debugged this before

22:15 <karolherbst> anyway, that reminds me that I have to do something with the sampler vars because atm they just take away space

22:16 <karolherbst> so if it's broken for iris, I can just do this tomorrow then

22:16 <jekstrand> karolherbst: I also need to come up with a less terrible buffer_clear implementation

22:16 <karolherbst> :)

22:16 <jekstrand> Really, I kind-of want rusticl to fall back to kicking off a kernel if buffer_clear doesn't exist.

22:16 danvet has quit [Ping timeout: 480 seconds]

22:17 <karolherbst> yeah... I didn't focus on any kind of fallbacks atm

22:17 <jekstrand> That's ok

22:17 <karolherbst> but I think doing fallbacks in kernels is better than doing them in sw :)

22:17 <jekstrand> Very much

22:17 <karolherbst> also for all those copy ops

22:17 <jekstrand> And I could implement something sensible in BLORP but I don't think it'd be any better than what we can do in rusticl for everyone.

22:18 <jekstrand> Yup

22:18 <karolherbst> well at least GPU to GPU copies we can't do on the GPU

22:18 <karolherbst> okay

22:18 * karolherbst adds stuff to an imaginary todo list

22:19 <karolherbst> would be fun to supply our own kernels

22:19 <jekstrand> karolherbst: Yeah, so iris uses textures_used for samplers. :(

22:19 <karolherbst> I guess we would just use nir_builder?

22:19 <karolherbst> jekstrand: figurew

22:19 <karolherbst> *figures

22:19 <jekstrand> karolherbst: Or OpenCL C and compile it with clc

22:19 <karolherbst> I'd rather not have to compile to much CLC code at runtime though

22:20 <karolherbst> also.. it's only for copies

22:20 <karolherbst> that's trivial stuff

22:20 <karolherbst> one memcpy intrinsic a loop and some casts :D

22:23 <karolherbst> I am also think about improving what I bind at kernel launching time, so we don't have to rebind everything

22:23 <karolherbst> and support partial updates

22:25 pcercuei has quit [Quit: dodo]

22:26 <jekstrand> Yeah, it can be done with nir_builder too

22:26 <jekstrand> We can also run clc at compile time and embed the SPIR-V

22:26 <jekstrand> Lots of options

22:26 <karolherbst> yeah..

22:26 <karolherbst> I will play around with it

22:27 <karolherbst> buffer <-> image copies I'd like to support with something like that :)

22:28 <karolherbst> but at this point I might even have to write a state tracking mechanism for kernels... will be fun

22:28 <karolherbst> or well.. went to

22:28 <karolherbst> *want

22:28 jfalempe has quit [Remote host closed the connection]

22:28 jfalempe has joined #dri-devel

22:29 <jekstrand> Ok, looks like samplers are uploading properly

22:29 icecream95 has joined #dri-devel

22:32 <jekstrand> karolherbst: Are we still using grid->input?

22:32 <karolherbst> for the time being, yes

22:32 <karolherbst> why?

22:33 <jekstrand> Because I seem to have dropped iris support for it

22:33 <jekstrand> But then how is anything working?!?!?

22:33 <karolherbst> well...

22:33 <karolherbst> I have no idea?

22:33 <karolherbst> I support it's still there?

22:33 <jekstrand> Hrm... iris is doing it via sysvals

22:33 <jekstrand> so cbuf0

22:34 <karolherbst> yeah, which is fine :)

22:38 * jekstrand wishes he could debug on hardware with int64 support

22:40 <zmike> are there any generic nir passes which can do array compaction for cross-stage i/o?

22:40 <jekstrand> zmike: look at brw_nir_link_shaders

22:41 <zmike> like if I have `out foo float[32]; out bar float[32];` it'll combine them using location_frac

22:41 <zmike> ?

22:41 <jekstrand> Oh, that? I don't think so.

22:41 <zmike> gah

22:41 <jekstrand> There might be something in the linking helpers but I don't remember.

22:42 <zmike> I looked, but they're all for compacting the location

22:42 <zmike> or so it seemed to me

22:42 <zmike> a project for tomorrow I guess

22:42 <bnieuwenhuizen> IIRC tarceris linking pass can combine varyings tightly in components

22:42 <jekstrand> karolherbst: Well, I may have found the bug. Looks like load_input is getting turned into load_ubo with an offset of 4B for no reason.

22:42 <karolherbst> jekstrand: uhm...

22:43 <bnieuwenhuizen> but you'll need to check that the array is getting lowered to separate varyings first

22:43 <bnieuwenhuizen> (IIRC there was something if all indexing was constant based but ...)

22:43 <zmike> bnieuwenhuizen: hm I'll have to look closer at that tomorrow I guess

22:43 <karolherbst> jekstrand: how can I turn on shader debugging for iris again?

22:43 <jekstrand> karolherbst: INTEL_DEBUG=cs

22:43 <jekstrand> karolherbst: Also, MESA_SHADER_CACHE_DISABLE=1

22:44 <zmike> it seems like I should be able to do this trivially by analysis since I can just set location_frac on teh whole variable and then export it with location+component

22:44 <karolherbst> yeah, I have that one already though :)

22:44 <karolherbst> jekstrand: mhh.. where is the offset added?

22:44 OftenTimeConsuming has quit [Remote host closed the connection]

22:44 <jekstrand> karolherbst: Working on that

22:44 <karolherbst> I see 1, 0 and 1, 0x20 as args

22:44 <karolherbst> but not 1, 4

22:44 <bnieuwenhuizen> zmike: I think the question is "what does that even look like" if you keep the array as an array

22:44 OftenTimeConsuming has joined #dri-devel

22:45 <jekstrand> hrm... No. No offset. I was looking at the binding table index. :-/

22:45 <zmike> bnieuwenhuizen: that's the adventure!

22:45 <karolherbst> ohhh

22:45 <bnieuwenhuizen> like AFAIU what you'd want is compact the 32-entry array in e.g. 8 locations of 4 components each right?

22:45 anholt__ has joined #dri-devel

22:45 <karolherbst> jekstrand: so the offset is inside the binding table for the cb? ehh...

22:46 <zmike> not entirely sure what I want at this point? in theory a 32-entry array is based on the driver's io capabilities

22:46 anholt__ is now known as anholt

22:46 <bnieuwenhuizen> I think how some of this might be happening on radv is that we lower io to a separate tmp array + copies, which means that for final output & initial input all accesses are constant indexed, which helps lowering arrays

22:47 <zmike> so either flattening into 8x4 (x2) or merging with the other array like 32x2

22:47 anholt_ has quit [Ping timeout: 480 seconds]

22:47 <zmike> I'm not looking at changing any of the shader, just the variable decls (ideally)

22:48 <zmike> I only stubbed my toe on this a minute ago, haven't really figured out what I want to do about it

22:48 rasterman has quit [Quit: Gettin' stinky!]

22:48 * zmike grumbles about tessellation shaders

23:05 <marex> time to exercise drm-misc commit access and apply a bugfix, stress level -> 11

23:05 ahajda_ has quit []

23:16 <marex> whew ...

23:16 tursulin has quit [Read error: Connection reset by peer]

23:17 bcheng has quit [Remote host closed the connection]

23:23 bcheng has joined #dri-devel

23:26 heat has quit [Remote host closed the connection]

23:26 soreau has quit [Read error: Connection reset by peer]

23:26 soreau has joined #dri-devel

23:31 iive has quit []

23:34 Haaninjo has quit [Quit: Ex-Chat]

23:46 alanc has quit [Remote host closed the connection]

23:46 mbrost has quit [Ping timeout: 480 seconds]

23:46 alanc has joined #dri-devel

23:48 mbrost has joined #dri-devel

23:49 HankB_ has quit [Remote host closed the connection]

23:50 HankB_ has joined #dri-devel

23:52 gawin has quit [Ping timeout: 480 seconds]

23:52 camus has joined #dri-devel

23:53 krushia has joined #dri-devel

23:54 thellstrom1 has quit [Ping timeout: 480 seconds]