#dri-devel on 2022-05-25 — irc logs at oftc.irclog.whitequark.org

2022-03-22 11:57 ChanServ changed the topic of #dri-devel to: <ajax> nothing involved with X should ever be unable to find a bar

00:00 co1umbarius has quit [Ping timeout: 480 seconds]

00:02 nchery is now known as Guest189

00:02 nchery has joined #dri-devel

00:07 co1umbarius has joined #dri-devel

00:08 Guest189 has quit [Ping timeout: 480 seconds]

00:12 heat has quit [Remote host closed the connection]

00:13 alyssa has left #dri-devel [#dri-devel]

00:15 aravind has joined #dri-devel

00:15 simon-perretta-img has quit [Ping timeout: 480 seconds]

00:25 hch12907 has joined #dri-devel

00:32 aravind has quit [Ping timeout: 480 seconds]

00:44 apinheiro has quit [Ping timeout: 480 seconds]

00:48 sdutt has quit [Ping timeout: 480 seconds]

01:06 sdutt has joined #dri-devel

01:06 aravind has joined #dri-devel

01:10 heat has joined #dri-devel

01:24 sdutt has quit []

01:25 sdutt has joined #dri-devel

01:25 <robclark> the problem with user signaled fences is not *just* memory management... userspace can also indefinitely block atomic commits from wq, and eventually (depending on kernel config) things will reboot due to hung task in kernel.. we occasionally have that problem in CrOS when there are compositor bugs (because sw_sync and how it is used to paper over android<->wayland<->compositor impedance mismatches (ie. you can end up w/ gpu

01:25 <robclark> waiting on fence that never signals, and atomic commit waiting on fence that gpu would have signaled if it wasn't stuck, etc.. it can result in fence dependency changes that are hard to understand and look like kernel bugs when in fact they are not)

01:27 <HdkR> Just convert surfaceflinger to Wayland. EZ PZ ;)

01:28 mwk has quit [Remote host closed the connection]

01:28 mwk has joined #dri-devel

01:28 <robclark> heh, if android were sane, things would be much easier

01:29 <HdkR> So true

01:29 <graphitemaster> airlied, Thanks for the link.

01:29 <graphitemaster> I wish more was known about how WDDM and the graphics stack works on Windows

01:30 <graphitemaster> It does feel like they nailed the design of that right because all issues pertaining to sync and preemption and hw-scheduling appear solved there.

01:30 <graphitemaster> They just have the usual WSI and UI issues of HDR and DPI that everyone else has.

01:31 <robclark> I think if we had two things, user signaled fences could *perhaps* be sane: (1) some sort of way to dump out chain of fence dependencies when something goes wrong, which means dma-fence needs to somehow know when signaling fence B depends on fence A being signaled, and (2) some sort of reasonably shorts (couple seconds at most) give-up timer in kernel that goes ahead and signals unsignaled user fences for userspace

01:32 <robclark> graphitemaster: preemption is an orthogonal issue.. it "just" needs a combination of driver and hw support, not really anything in terms of core framework

01:33 <airlied> graphitemaster: wddm rebooted their ecosystem, so they didn't have to deal with it

01:33 <airlied> it's a bit hard to that on Linux, land of compositor choice :-P

01:35 <airlied> robclark: I think as long as you have compositors checking fence status on the cpu before using submitted buffers things should work

01:35 <airlied> it's just a lot of compositors don't work like that

01:35 <graphitemaster> I was told here from others that preemption requires a replumb of the entire Linux graphics stack so I feel like it's only orthogonal in specifics but not in the scope and amount of work necessary to get it like is the case with explicit sync here, plus there's subtle areas of overlap I believe with it, like the notion of long-running compute (and draw dispatches) which do complete but shouldn't just be tore down by the kernel as

01:35 <graphitemaster> it signals unsignaled fences (in your (2) want there), some sort of queue-flag that says "don't signal unsignal fences to avoid dead locks, preempt instead"

01:36 <robclark> I mean, that isn't going to help if you have long running shaders (like compute things if the hw doesn't have an independent ring for them).. that actually needs a combination of driver and hw support to preempt a running draw/grid

01:36 <robclark> ignoring that problem, it is just a compositor issue, not a driver issue ;-)

01:39 mhenning has joined #dri-devel

01:40 <airlied> robclark: well you definitely want separate compute queues for long running tasks, otherwise it's nuts

01:41 <robclark> until you encounter a creative shadertoy, then even that doesn't help ;-)

01:41 <graphitemaster> Sure, there's more work involved. Ideally what my wants are (which are not strictly part of any API yet) is something in Vulkan that expooses the handling of dead locks to the user as a queue initialization flag. So you have the DONTCARE type which treats work on that queue how it currently is, or what ever the default should be, a TIMEOUT type which is sort of like TDR (but also for compute) so this would be the kernel signaling

01:41 <graphitemaster> unsignaled fences and tearing down / resetting the context if need be (with user defined timeout, but a min timeout policy in the kernel that you cannot go below), and then a third option type of PREEMPT which is basically the same semantics of TIMEOUT except it doesn't tear down the context, it just preempts, and these queue types/options would be exposed on a case by case basis by driver+hw support

01:42 <graphitemaster> So if devs want PREEMPT, they query support for it, then can initialize their queue that way and begin recording commands into it and get those semantics.

01:43 <graphitemaster> The spec would then put a min requirement on DONTCARE and TIMEOUT, with PREEMPT being something only supported by modern GPUs and drivers that put the effort in.

01:45 <jekstrand> graphitemaster: Oh, there are lots of problems they have. :)

01:46 <robclark> What happens when some rude process sets a very long timeout ;-)

01:46 <jekstrand> Don't get me wrong. WDDM2 is definitely a step forward and better than what we have on most of Linux today.

01:46 <jekstrand> But to say they solved all the problems is a bit mutch.

01:46 <jekstrand> *much

01:46 <karolherbst> robclark: don't let processes do it :P

01:46 <graphitemaster> robclark, min **and** max timeout policy in the kernel most likely :D

01:46 <graphitemaster> Set some reasonable defaults

01:47 <karolherbst> OpenCL kind of has this issue and the solution is: don't do it in the kernel

01:47 <karolherbst> or on hw even

01:47 <jekstrand> The solution to OpenCL's problems is mid-kernel preemption

01:47 <karolherbst> that's not what I meant

01:47 <karolherbst> I meant user signalled fences

01:47 <robclark> what is a reasonable default, though.. it kinda depends on use-case, which isn't a thing kernel knows

01:47 <jekstrand> Or userspace

01:47 <jekstrand> At least not in drivers

01:47 <graphitemaster> I mean TDR is like 5 seconds on Windows right?

01:48 <robclark> the only "true" soln is preemption.. and that can be "hard"

01:48 <karolherbst> so in CL the application can signal events/fences, good luck with coming up with anything reasonable here

01:48 <jekstrand> graphitemaster: I think that's the default but there's registry keys for it

01:48 <robclark> I mean 1/5 fps isn't great, right?

01:48 <graphitemaster> I mean you could degrade anything taking longer to use PREEMPT by default if the HW supports it

01:49 <graphitemaster> So if you hit TDR, switch queue to PREEMPT

01:49 <karolherbst> and if the hw doesn't?

01:49 <graphitemaster> Then do what is currently done, reset

01:49 <karolherbst> which isn't legal in the CL world :P

01:49 <karolherbst> all I try to say here is, there are use cases we can't solve this way

01:50 <karolherbst> and sometimes stuff can run for minutes on purpose

01:50 <graphitemaster> OpenCL code kernels can be massages to be made reentrant and preemptive at the actual runtime layer though.

01:50 <graphitemaster> *messaged

01:50 <karolherbst> that's kernel stuff

01:50 <karolherbst> but what about user fences?

01:50 <graphitemaster> Oh I dunno about user fences :|

01:51 <karolherbst> well, the application can control certain events :)

01:51 <graphitemaster> *massaged

01:51 <graphitemaster> I can't type

01:51 <karolherbst> I've talked with jekstrand about it and if all this fancy kernel stuff could be used for it. In the end the only reasonable answer is: no

01:51 <karolherbst> it won't work

01:52 <graphitemaster> I tmight just be time to put all this stuff in a sarcophagus and define new things that only apply to new APIs and extensions, screw back compat.

01:52 <graphitemaster> Sort of what Windows did with WDDM

01:52 <karolherbst> yeah

01:53 <graphitemaster> jekstrand, Well if it does have issues, they've done a remarkably good job hiding them from user-space and developers of applications.

01:54 <graphitemaster> While everything in the Linux graphic stack seems to constantly butt heads with developers writing applications.

01:54 <graphitemaster> Which is a terrible leaky abstraction I might add.

01:54 <karolherbst> anyway... for CL you can split up work in smaller pieces and just execute the same kernel multiple times, which is good enough (tm)

01:55 <jekstrand> graphitemaster: Don't get me wrong, WDDM2 is much better

01:55 <jekstrand> But it's not flawless. :)

01:55 <graphitemaster> Nothing is flawless.

01:55 <jekstrand> But, frankly, if we could get there on Linux, I wouldn't worry too much about also trying to fix the flaws. It'd still be way better than where we are.

01:58 <graphitemaster> karolherbst, This is what I had to do at work, with GLSL compute shaders, basically instead of running my whole sim frame I'm just measuring each dispatch, dispatch indirect, draw, and draw indirect and trying to keep the actual calls for a sim step below 10ms so I have 6ms for rendering to keep things running at 60fps and the desktop from locking up, it's so much additional complexity and work, but it does work, there's a whole

01:58 <graphitemaster> time prediction thing in there too since the queries take a couple frames to return their results.

01:58 <graphitemaster> It's totally unnecessary on Windows, but it does a better job than Window's preemption so I'm just doing it by default now.

01:58 <karolherbst> graphitemaster: well for compute we don't need to have such tight schedules. But often those kernels can also be super huge...

01:58 <karolherbst> or well.. not even huge, just stupid

01:59 <karolherbst> one benchmarks just copyes TB of memory within a loop :)

01:59 <karolherbst> *copies

01:59 <graphitemaster> We're a GL application, it's remarkable that at least Windows NV is able to run compute separate from draws ...

01:59 <graphitemaster> Relying on a lot of driver magic there.

01:59 <graphitemaster> AMD runs at 1/38th the speed

02:00 <graphitemaster> The preemption helps.

02:00 <karolherbst> yeah.. new hw being able to preempt is really nice, even though there are really strict rules to that

02:02 <graphitemaster> Anyways I do think if users could control scheduling of work queues based on their needs rather than the dumb thing we have now, at least in APIs like Vulkan where the norm is "yeah you can BSOD or black screen a running OS from user-space, we're not safe or secure" ... it feels at home and gives developers the control they actually want. It's already bad enough that the only control we have in Vulkan is separation of compute and

02:02 <graphitemaster> graphics commands to achieve some magical "async compute" which is not even guaranteed either, since there's a ton of drivers that only report graphics+compute queue as one (NV)

02:03 <karolherbst> yep

02:03 <karolherbst> it's not for free

02:03 <graphitemaster> There should be queue priorities, scheduling types (deadline, immediate, deferred, preempt, timeout), and control of those timeouts, etc.

02:03 <karolherbst> makes the hw more complex

02:04 <karolherbst> context switching is really expensive, and on nv hw it's an opt in feature

02:04 <jekstrand> scheduling isn't the biggest problem. Preemption and dealing with long-running jobs is.

02:04 <graphitemaster> I mean you only need one in the hw, instruction-level preemption, you can do all other forms of scheduling in software ontop.

02:04 <karolherbst> ehh mid kernel/shader preemption is opt in I mena

02:04 <karolherbst> *mean

02:04 <jekstrand> graphitemaster: Hah You say instruction-level like that's the easy one.

02:04 <karolherbst> graphitemaster: bye bye performance

02:05 <karolherbst> we should stop trying to do CPU like things on GPUs :P

02:06 <graphitemaster> The hardware can do a lot of the work of context management, like a full shadow copy of all state for a context is probably remarkably simple to do with modern GPU designs where there's a lot of video memory and now things like direct storage to SSDs, I would say to hell with the idea of preemption on GPUs with limited memory / shared memory and tilers. I'd keep preemption strictly a desktop class feature.

02:07 <graphitemaster> I only care about BFGPUs :P

02:07 <karolherbst> :D

02:07 <karolherbst> if it would only be that simple

02:07 <karolherbst> we got firmware doing just context switching

02:08 <karolherbst> if that's what you mean by "the hardware can" sure, if not, then well.. ;)

02:08 <graphitemaster> I think for draws it's probably quite expensive, there's a lot of state there.

02:09 <graphitemaster> But for compute work I feel like the amount of state needed is probably less than actual modern x86 CPUs today

02:09 <karolherbst> it's probably simplier for compute, yes

02:09 <karolherbst> I wouldn't say it's not much, but

02:09 <icecream95> On at least Mali GPUs you could probably preempt fragment jobs just by disabling anything that hasn't been rendered yet in the tile enable map

02:09 <karolherbst> but for compute we can do different things. Worst case we schedule one block at a time and check if we should switch over to graphics or something :P

02:11 LexSfX has quit []

02:11 sdutt has quit [Ping timeout: 480 seconds]

02:13 <robclark> icecream95: modern adreno has kinda two levels of preemption, the "small hammer" cooperative preemption in between tile passes, and the "big hammer" which is the fallback if you don't reach end of tile pass in time, which involves saving/restoring all the gpu state as well as gmem (tile buffer) save/restore.. although there is some not-insignificant driver work to support both modes. The latter is defn way more expensive than

02:13 <robclark> cpu task preemption ;-)

02:14 <robclark> there is quite a bit of sqe fw involved in the latter

02:15 <graphitemaster> there'

02:15 <graphitemaster> I keep hearing the existence of pre-emption in some hardware and drivers on Linux

02:15 <graphitemaster> Yet I've yet to find a desktop setup of Linux on a configuration that doesn't lock up the moment you run anything on it that eats a lot of GPU time.

02:16 <graphitemaster> So I'm still strongly in the camp of it doesn't exist because no one can prove to me it exists sort of like other written about things that don't have proof :P

02:16 <robclark> like I said, newer hw + fw can do it.. but there is a lot of driver work needed too.. that is the missing piece.. and absolutely nothing at drm framework level needed ;-)

02:18 <graphitemaster> By driver work, do you mean KMD for the GPU or mesa here?

02:18 <robclark> some desktop gpu's make it easier in the more limited case of compute vs 3d by having separate rings where 3d can preempt compute.. but that only solves the more limited problem of long running compute jobs.. (but tbf that is the easier problem to tackle since *way* less state to save/restore)

02:18 <graphitemaster> Or both

02:18 <robclark> both

02:18 LexSfX has joined #dri-devel

02:18 <graphitemaster> Right

02:19 <graphitemaster> How exactly does GPU virtualization work on Linux at all if you do not have pre-emption?

02:20 <airlied> SRIOV

02:20 <airlied> they just parcel out the hw units

02:20 <airlied> I'm surprised the nvidia driver doesn't get it right on linux

02:20 <graphitemaster> Would it be possible to solve pre-emption in a similar way, parcel out the hw units per queue :P

02:21 <graphitemaster> Run the whole OS and driver stack per application lol

02:21 <robclark> virtualization is also more or less orthogonal.. (but the answer to your question is more or less: either (a) api level virtualization which sucks for performance or (b) vendor specific soln... I've been spending more of my time on making virt work decently, otherwise I might have been spending time on preemption ;-))

02:21 <robclark> airlied: *if* the hw supports partitioning like that

02:22 <airlied> robclark: yeah and sriov hw is fairly limited to servers

02:23 <robclark> because CrOS is a big fan of VM's on things that are very much $$$ is no object servers, I've been having fun in that area ;-)

02:24 <robclark> *very much not...

02:24 sdutt has joined #dri-devel

02:24 aravind has quit [Ping timeout: 480 seconds]

02:24 <airlied> like for VMs just running apps on a desktop, fair resource sharing really isn't required

02:25 <airlied> an android app inside a VM can work the same as a CrOS native app or a linux app in another vm

02:25 <robclark> yeah, it is exact same problem if there is more than one GPU user, regardless if one is in VM or not

02:26 <robclark> (but VMs do make memory management and things like cpufreq and scheduling much more entertaining)

02:27 <graphitemaster> They have hot-swap GPUs, pre-emption has to be possible for that.

02:27 <graphitemaster> You can physically pre-empt one

02:27 <graphitemaster> With your hands, mechanically

02:27 <graphitemaster> Server people have it good

02:28 <robclark> I mean, *that* sort of preemption doesn't make a great user experience ;-)

02:28 <graphitemaster> Just unplug and plug the GPU every 16ms

02:29 <robclark> I mean, sure.. we can also just kill gpu jobs that take more than 16ms.. which is fine as long as you don't have any gpu jobs that take more than 16ms ;-)

02:29 cef is now known as Guest199

02:30 cef has joined #dri-devel

02:30 Guest199 has quit [Ping timeout: 480 seconds]

02:32 <graphitemaster> robclark, oh it doesn't resume when you physically remove and insert a gpu?

02:32 <graphitemaster> damn, that's not as cool as I thought then

02:32 <graphitemaster> Like even if the software handles context reset event correctly?

02:32 <robclark> not unless userspace handles it by starting again from scratch.. and then you are back at the same problem, only 16ms later ;-)

02:32 <graphitemaster> Right

02:33 <robclark> if you intend to make forward progress, it is not the approach I would recommend ;-)

02:34 <graphitemaster> It should be as seamless as plugging in headphones. I just resume the music from where it was, I don't have to listen to the song from the beginning again :D

02:34 <graphitemaster> Ironically I imagine stateful sound cards of the 80s and 90s had similar problems, ones with midi hardware support and what not.

02:34 <robclark> to make forward progress you need to be able to save current state in some way that it can be restored and resumed.. and GPUs have a *lot* of state

02:35 <robclark> (less so for compute.. compute is a simpler subset of the same basic problem)

02:35 <graphitemaster> We solved all of this by taking it out of hardware and doing it all in software.

02:35 mhenning has quit [Quit: mhenning]

02:35 <graphitemaster> I mean that's a possibility too, you could immediately switch to a software implementation when the hardware resets to make forward progress

02:35 <graphitemaster> Then reupload all state when it comes back online

02:36 <graphitemaster> Keep a shadow copy of all state on the CPU

02:36 <graphitemaster> Quite expensive but somewhat neat.

02:36 <graphitemaster> Windows sort of does that doesn't it

02:36 <graphitemaster> When you have no graphics drivers it reinitializes the whole graphics subsystem when the drivers are installed / up to date

02:36 <graphitemaster> In the mean time it runs purely in software

02:36 <jekstrand> Not at all

02:37 <jekstrand> If you TDR on Windows, you get a TDR. You're done.

02:37 <jekstrand> Apple's GL implementation carried around shadow copies so they could migrate apps between GPUs but that's the only one I know of that's actually ever done full shadowing.

02:37 <graphitemaster> Sure, with TDR. I'm talking about when you have a non-accelerated desktop and install graphics drivers or update them, it can re-init the stack

02:37 <jekstrand> If you update your drivers, I think all active apps get a TDR

02:38 <HdkR> Any X.Org sysadmins? Apparently https://www.x.org/wiki/Events/History/ The links here to XDC 2019 and XDC 2020 now fail and redirect to LPC.

02:38 <jekstrand> HdkR: Lots of people can edit the wiki

02:39 <HdkR> Bigger thing is likely that https://xdc2020.x.org/ and https://xdc2019.x.org/ are now gone

02:39 <jekstrand> Oh, yeah, that'll take an admin

02:39 <airlied> yeah apple's optimus stuff was pretty scary

02:39 <jekstrand> Scary and probably also part of why Apple's OpenGL was like 40% of metal perf

02:40 * airlied contemplated implementing that in mesa for a few days :-P

02:41 ppascher has joined #dri-devel

02:41 <jekstrand> And we're all very glad you didn't. :)

02:42 <airlied> I only wish I'd added device lost to x11

02:43 <zmike> sounds like we need x12

02:43 <graphitemaster> y11

02:44 <HdkR> y2k

02:44 <airlied> xwaylandland is x11 :-P

02:45 <icecream95> yax11: yet another x11

03:00 haasn has quit [Quit: ZNC 1.7.5+deb4 - https://znc.in]

03:01 haasn has joined #dri-devel

03:19 heat has quit [Remote host closed the connection]

03:19 heat has joined #dri-devel

03:26 <graphitemaster> I do find it funny that everyone is like "x is bad, lets replace it", then they replace it with something worse or something that still has to speak x

03:28 Company has quit [Quit: Leaving]

03:31 bmodem has joined #dri-devel

03:55 Daanct12 has joined #dri-devel

04:01 <jekstrand> IDK that Wayland is worse. It's different. It has a different set of problems.

04:01 <jekstrand> And, yeah, it has to speak X because backwards compatibility forever!

04:03 lromwoo^ has quit [Ping timeout: 480 seconds]

04:17 lromwoo^ has joined #dri-devel

04:22 aravind has joined #dri-devel

04:50 Duke`` has joined #dri-devel

04:50 Daanct12 has quit [Remote host closed the connection]

04:58 sdutt_ has joined #dri-devel

04:58 sdutt has quit [Read error: Connection reset by peer]

04:59 aravind has quit [Remote host closed the connection]

04:59 aravind has joined #dri-devel

05:01 Daanct12 has joined #dri-devel

05:15 heat has quit [Ping timeout: 480 seconds]

05:16 itoral has joined #dri-devel

05:32 aravind has quit [Ping timeout: 480 seconds]

05:38 itoral has quit [Remote host closed the connection]

05:39 itoral has joined #dri-devel

05:40 Daanct12 has quit [Quit: Leaving]

05:44 aravind has joined #dri-devel

05:47 Duke`` has quit [Ping timeout: 480 seconds]

05:51 mvlad has joined #dri-devel

05:53 ppascher has quit [Ping timeout: 480 seconds]

05:54 itoral has quit [Remote host closed the connection]

05:56 itoral has joined #dri-devel

06:21 ahajda__ has joined #dri-devel

06:25 lemonzest has joined #dri-devel

06:28 Surkow|laptop has quit [Ping timeout: 480 seconds]

06:35 bmodem has quit [Ping timeout: 480 seconds]

06:42 AndrewR has quit [Ping timeout: 480 seconds]

06:53 danvet has joined #dri-devel

07:01 <dj-death> any knows whether the fd passed to driver screen_create vfunc in gallium are owned by the driver?

07:01 <dj-death> s/any/anybody/

07:12 nchery has quit [Read error: Connection reset by peer]

07:17 sdutt_ has quit [Remote host closed the connection]

07:17 zackr has quit [Remote host closed the connection]

07:17 sdutt_ has joined #dri-devel

07:17 abhinav__ has quit [Quit: Ping timeout (120 seconds)]

07:17 abhinav__ has joined #dri-devel

07:17 zackr has joined #dri-devel

07:17 jessica_24 has quit [Quit: Ping timeout (120 seconds)]

07:17 kj has quit [Remote host closed the connection]

07:18 exit70 has quit [Quit: ZNC 1.8.2 - https://znc.in]

07:19 anarsoul|2 has joined #dri-devel

07:19 exit70 has joined #dri-devel

07:20 jessica_24 has joined #dri-devel

07:21 dri-logg1r has quit [Remote host closed the connection]

07:21 dri-logger has joined #dri-devel

07:21 mslusarz has quit [Remote host closed the connection]

07:21 mslusarz has joined #dri-devel

07:22 anarsoul has quit [Read error: Connection reset by peer]

07:25 mclasen has quit []

07:25 mclasen has joined #dri-devel

07:26 agd5f has quit [Remote host closed the connection]

07:26 tlwoerner has quit [Remote host closed the connection]

07:26 agd5f has joined #dri-devel

07:27 tlwoerner has joined #dri-devel

07:28 ceyusa has quit [Remote host closed the connection]

07:28 gpiccoli has quit [Quit: Bears...Beets...Battlestar Galactica]

07:28 samueldr_ has quit [Remote host closed the connection]

07:29 samueldr has joined #dri-devel

07:30 ceyusa has joined #dri-devel

07:31 gpiccoli has joined #dri-devel

07:31 cheako has quit [Quit: Connection closed for inactivity]

07:40 tursulin has joined #dri-devel

07:50 <MrCooper> graphitemaster: preemption is mostly a HW / driver problem, not really related to explicit vs implicit sync (preemption is working with the latter with some drivers)

07:55 apinheiro has joined #dri-devel

08:01 lynxeye has joined #dri-devel

08:02 jkrzyszt has joined #dri-devel

08:03 AndrewR has joined #dri-devel

08:17 <MrCooper> graphitemaster: also, note that explicit vs implicit sync isn't one global binary choice; implicit sync is perfectly adequate for some things, and can be emulated just for those things using explicit sync at lower levels

08:17 bmodem has joined #dri-devel

08:24 rasterman has joined #dri-devel

08:25 gouchi has joined #dri-devel

08:26 bmodem has quit [Remote host closed the connection]

08:30 gouchi has quit [Quit: Quitte]

08:38 pcercuei has joined #dri-devel

09:06 jhli has quit [Remote host closed the connection]

09:06 jhli has joined #dri-devel

09:07 mdnavare has quit [Remote host closed the connection]

09:09 mdnavare has joined #dri-devel

09:09 mwalle has quit [Quit: WeeChat 3.0]

09:09 mwalle has joined #dri-devel

09:15 LexSfX has quit [Remote host closed the connection]

09:16 LexSfX has joined #dri-devel

09:18 sdutt_ has quit [Ping timeout: 480 seconds]

09:31 mi6x3m has joined #dri-devel

09:32 <mi6x3m> hey, i need some info as to what is going on. I start something with MESA_LOADER_DRIVER_OVERRIDE=swrast then i get this output https://pastebin.com/mSMSQ9jG but LLVM is used normally

09:39 simon-perretta-img has joined #dri-devel

09:45 <pq> mi6x3m, maybe also tell what you actually want, and how you ended up with that variable?

09:45 <pq> I mean, is "swrast" even a driver name?

09:46 <mi6x3m> it is, I am trying to test out different games with hardware and software rendering

09:46 <pq> swrast is usually just a generic term referring to some software rasterizer, or maybe explicitly to the "classic swrast" which I don't think exists anymore.

09:46 <mi6x3m> so I override the crocus driver with swrast

09:47 <mi6x3m> well what is the name of the software rasterizer in driver terms?

09:47 <pq> I don't know

09:47 <pq> not in terms of that variable at least

09:47 <mi6x3m> is there another var to override the driver?

09:48 <pq> also llvmpipe *is* a swrast

09:48 <mi6x3m> yes I know but it's selected automatically after the system reports that swrast can't be loaded

09:48 <mi6x3m> so swrast is considered a driver but llvmpipe isn't, very weird

09:49 <mi6x3m> if I set the var to be =llvmpipe it says no driver with that name exists

09:49 <pq> aha

09:51 <pq> ok, I do see that swrast_dri.so exists at least

09:52 <pq> the way I've forced software renderer is LIBGL_ALWAYS_SOFTWARE=1 which gest me llvmpipe then.

09:52 <pq> or "true", says Mesa docs

09:53 <pq> and GALLIUM_DRIVER when you want softpipe

09:55 rkanwal has joined #dri-devel

09:59 yogesh_mohan has joined #dri-devel

10:00 vyivel has quit [Read error: No route to host]

10:00 vyivel has joined #dri-devel

10:00 <mi6x3m> ah, interesting, might be a remnant of the past then

10:01 <pq> it's a good question why it doesn't work

10:01 <pq> maybe swrast loading code is the remnant of a past and doesn't work through a more standard mechanism

10:02 hch12907 has quit [Ping timeout: 480 seconds]

10:04 hch12907 has joined #dri-devel

10:12 <mi6x3m> pq, seems to be the case indeed

10:22 icecream95 has quit [Ping timeout: 480 seconds]

10:26 <MrCooper> pq: FWIW, there are cases where LIBGL_ALWAYS_SOFTWARE=1 has never had an effect, but MESA_LOADER_DRIVER_OVERRIDE=swrast works

10:26 <MrCooper> mi6x3m: those error messages are a red herring AFAICT, it ends up falling back to swrast anyway :)

10:26 <pq> what cases would those be?

10:27 <pq> yeah, it fails loading swrast, so it falls back to swrast :-P

10:27 <mi6x3m> thanks MrCooper :)

10:27 <MrCooper> pq: not sure exactly, but jadahl / swick were hitting it for mutter testing

10:27 <mi6x3m> my use case is also rather extreme as will be unveiled shortly but I do wanna test all paths

10:27 <MrCooper> you know how to set up suspense

10:28 <jadahl> MrCooper: "never" - it has, at least long long ago

10:28 <MrCooper> jadahl: we talked about this before :) it never worked for those particular cases

10:28 <jadahl> but I have memories (at least from years ago) that it worked :P

10:28 <jadahl> you're telling me I'm crazy?

10:28 <MrCooper> unless I misinterpreted the Git history

10:28 <pq> but swrast is not just one driver, is it? There's more than llvmpipe, or have all the others been deleted by now?

10:29 <MrCooper> jadahl: I suspect you were hitting a different case back then

10:29 <jadahl> MrCooper: it's true it didn't when I thought it did (when adding that documentation)

10:29 <jadahl> perhaps

10:29 <mi6x3m> MrCooper, it's a project so absurd it'll be made illegal by sane world governments

10:29 <jadahl> but that env var should probably eithe be removed, or at least removed from the documentation, unless it's actually fixed

10:29 <MrCooper> pq: there was only ever one swrast_dri.so

10:30 <pq> MrCooper, containing multiple "drivers", right?

10:31 <MrCooper> yes, multiple Gallium drivers now (or the classic swrast driver before)

10:31 <mi6x3m> i think the driver is swrast and it has 2 options

10:31 <mi6x3m> so GALLIUM_DRIVER should be =swrast with LLVMPIPE_ENABLED=0/1

10:32 <MrCooper> jadahl: somebody who cares would just need to add the handling where it's missing

10:32 <pq> swr is gone now, right?

10:32 <MrCooper> pq: MESA_LOADER_DRIVER_OVERRIDE is for selecting the DRI driver, there's GALLIUM_DRIVER for selecting the Gallium driver

10:32 <pq> confusing

10:33 <MrCooper> e.g. MESA_LOADER_DRIVER_OVERRIDE=swrast GALLIUM_DRIVER=softpipe

10:33 <mi6x3m> quite confusing

10:33 <jadahl> perhaps reading LIBGL_ALWAYS_SOFTWARE at the same place as MESA_LOADER_DRIVER_OVERRIDE is read would be enough

10:34 <pq> so there are or have been at least 5 software rasterizers: classig swrast, softpipe, llvmpipe, swr, and zink+lavapipe. Did I miss any? :-)

10:34 <MrCooper> yeah, seems like it should, in all the same places

10:35 <mi6x3m> thanks friends, this gives me some overview!!!

10:35 <MrCooper> pq: I think that covers them

10:36 <airlied> mi6x3m: try kms_swrast maybe

10:36 <MrCooper> oh yeah, no libGL errors with that

10:37 <mi6x3m> airlied, this worked :)

10:37 <mi6x3m> what's the difference

10:37 <pq> but then the "kms" in it is a lie? and both still exist as .so files.

10:38 <MrCooper> I wonder if there's any point still in having them as separate names

10:38 <pq> they are the same file, yes, but both file names are installed in file system

10:39 <MrCooper> all *_dri.so are hardlinks to the same mega driver

10:40 <pq> Debian stable disagrees by having two different mega drivers, but I suppose that's just legacy.

10:41 <pq> hmm, classic drivers I guess

10:41 rsalvaterra_ has joined #dri-devel

10:41 rsalvaterra_ is now known as rsalvaterra

10:42 <MrCooper> interestingly, they expose different sets of GLX extensions

10:43 <MrCooper> only swrast_dri.so exposes GLX_EXT_buffer_age and sync/swap control extensions, only kms_swrast_dri.so exposes GLX_ARB_context_flush_control & GLX_ARB_create_context_robustness

10:44 <MrCooper> seems like there's a mess to be cleaned up here

10:46 <mi6x3m> i discovered it, i want a share of the loot

10:47 <MrCooper> mi6x3m: you get to file a GitLab issue :P

10:48 flacks has quit [Quit: Quitter]

10:50 flacks has joined #dri-devel

10:50 <mi6x3m> i'll be a president of issues then

10:52 lromwoo^ has quit [Ping timeout: 480 seconds]

10:56 slattann has joined #dri-devel

10:59 kisak has quit [Quit: leaving]

11:03 kisak has joined #dri-devel

11:20 alanc has quit [Remote host closed the connection]

11:20 mattst88 has quit [Read error: Connection reset by peer]

11:20 alanc has joined #dri-devel

11:21 mattst88 has joined #dri-devel

11:25 sagar_ has quit [Remote host closed the connection]

11:25 sagar_ has joined #dri-devel

11:34 lromwoo^ has joined #dri-devel

11:36 Company has joined #dri-devel

11:57 aravind has quit [Ping timeout: 480 seconds]

12:06 Danct12 has quit [Quit: Quitting]

12:20 `join_subline has quit [Remote host closed the connection]

12:20 siqueira has quit []

12:20 flacks has quit [Quit: Quitter]

12:21 flacks has joined #dri-devel

12:24 siqueira has joined #dri-devel

12:27 anarsoul|2 has quit [Ping timeout: 480 seconds]

12:27 anarsoul has joined #dri-devel

12:29 lromwoo^ has quit [Ping timeout: 480 seconds]

12:34 `join_subline has joined #dri-devel

12:35 rsalvaterra has quit [Ping timeout: 480 seconds]

12:36 itoral has quit [Remote host closed the connection]

12:37 rsalvaterra has joined #dri-devel

13:05 lemonzest has quit [Quit: WeeChat 3.5]

13:05 lromwoo^ has joined #dri-devel

13:29 RSpliet has quit [Quit: Bye bye man, bye bye]

13:29 sdutt has joined #dri-devel

13:30 ppascher has joined #dri-devel

13:30 RSpliet has joined #dri-devel

13:31 lemonzest has joined #dri-devel

13:33 bcheng has quit [Remote host closed the connection]

13:34 bcheng has joined #dri-devel

13:38 lromwoo^ has quit [Ping timeout: 480 seconds]

13:41 alyssa has joined #dri-devel

13:41 * alyssa wonders how to evaluate latency scheduling

13:41 <HdkR> microprofiling!

13:41 <HdkR> :)

13:42 <alyssa> HdkR: sounds interesting, details? :p

13:42 <alyssa> what specifically in the perf counters etc is interesting?

13:43 <HdkR> Probably shader execution cycles

13:45 lromwoo^ has joined #dri-devel

13:46 bertje__ has joined #dri-devel

13:52 lromwoo^ has quit [Remote host closed the connection]

13:53 <dolphin> alyssa: step A) hope there is a free running clock on the HW that you can access from the executing workload

13:53 <dolphin> and also from the CPU via some MMIO

13:55 <dolphin> see i-g-t tests/i915/gem_exec_latency

13:59 <alyssa> Yes, there's a shader clock although I never finished wiring it up because it needed kernel changes

13:59 <alyssa> OTOH, those changes are needed for vulkan too I think

14:00 Surkow|laptop has joined #dri-devel

14:00 <dolphin> by my experience, that's the best way to get some real wall clock numbers

14:00 <dolphin> we also have further micros to split that number into smaller items

14:00 <dolphin> but it's at least a reasonable way to check that all the other micros add up to the total latency

14:00 lynxeye has quit [Quit: Leaving.]

14:01 <alyssa> OK

14:01 <dolphin> and that's actually the latency folks care about, wall clock time from when you submit from userspace to when the workload starts on the GPU/whatever

14:01 <alyssa> to be clear I'm talking about latency within the shader

14:01 <dolphin> oh :)

14:01 <alyssa> i.e. scheduling texture instructions early enough in the shader that we don't stall accessing the results

14:02 <dolphin> I guess that answer will be much more hardware specific

14:02 bertje__ has quit [Ping timeout: 480 seconds]

14:03 bertje__ has joined #dri-devel

14:03 <alyssa> yeah, for sure

14:04 bertje__ has quit [Remote host closed the connection]

14:05 agx has joined #dri-devel

14:06 mi6x3m has quit [Quit: Leaving]

14:14 Haaninjo has joined #dri-devel

14:18 mriesch has quit [Quit: http://quassel-irc.org - Chat comfortably. Anywhere.]

14:18 ella-0 has joined #dri-devel

14:19 mriesch has joined #dri-devel

14:21 ella-0_ has quit [Read error: Connection reset by peer]

14:40 janesma has joined #dri-devel

14:45 heat has joined #dri-devel

14:58 JohnnyonFlame has joined #dri-devel

15:02 JohnnyonF has quit [Ping timeout: 480 seconds]

15:02 ahajda__ has quit [Read error: Connection reset by peer]

15:04 ahajda__ has joined #dri-devel

15:17 <Venemo> alyssa, jekstrand can you guys pls finish your review on this MR? https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13080

15:25 JohnnyonF has joined #dri-devel

15:28 <zmike> anholt: would still like to do something with https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13439

15:28 saurabhg has joined #dri-devel

15:31 ahajda_ has joined #dri-devel

15:31 JohnnyonFlame has quit [Ping timeout: 480 seconds]

15:38 ahajda__ has quit [Ping timeout: 480 seconds]

15:40 <alyssa> Venemo: what'd i do

15:40 <alyssa> i thought i did?

15:40 <alyssa> did something change?

15:42 <alyssa> r-b assuming that's what I read before

15:43 <alyssa> which it seems to be, bikeshed colours aside

15:43 <alyssa> so r-b i think

15:45 tzimmermann has joined #dri-devel

15:48 <Venemo> alyssa: can you comment that on the MR too, please?

15:48 <Venemo> alyssa: thank you

15:53 <danvet> jekstrand, I guess we should have an ack from bnieuwenhuizen and dj-death and then we're good to go?

15:54 <bnieuwenhuizen> ack on what?

15:54 <danvet> import/export dma-buf ioctl

15:54 <danvet> ack that it looks reasonable for radv

15:54 <jekstrand> danvet: Yeah, acks on the kernel series would be good

15:54 <jekstrand> They've RB'd the Mesa patches but an actual kernel ack would be good too.

15:55 <danvet> yup

15:55 <danvet> jekstrand, you can push the igt side and that's reviewed, right?

15:55 anarsoul|2 has joined #dri-devel

15:55 <jekstrand> danvet: Is it reviewed? I've not payed attention. :joy:

15:55 <danvet> jekstrand, can you?

15:55 <danvet> :-P

15:56 <danvet> iirc I did comment on some early versions ...

15:56 anarsoul has quit [Read error: Connection reset by peer]

15:56 JohnnyonF has quit [Read error: Connection reset by peer]

15:57 <jekstrand> danvet: Yeah, you commented on some early versions but not the latest which are significantly reworked.

15:57 <danvet> ...

15:57 <jekstrand> danvet: And I got comments from Kamil asking me to add igt_describe(). I'll do that now.

15:57 <danvet> I guess annoy me when you have that and I'll ack

16:01 nchery has joined #dri-devel

16:10 kts has joined #dri-devel

16:11 <jekstrand> danvet: Looks like I do have RB tags from you. Want to look again or just assume they're recent enough?

16:11 <jekstrand> I'll send-email anyway

16:21 Duke`` has joined #dri-devel

16:22 mbrost has joined #dri-devel

16:23 anarsoul has joined #dri-devel

16:25 anarsoul|2 has quit [Read error: Connection reset by peer]

16:42 cheako has joined #dri-devel

16:47 <danvet> jekstrand, I have faith :-)

16:47 bertje__ has joined #dri-devel

16:47 <danvet> jekstrand, test-with might be good on the kernel side to make sure the tests actually work everywhere ...

16:47 <danvet> so that intel ci can pick up both igt and kernel and run them together

16:47 <danvet> jekstrand, https://intel-gfx-ci.01.org/test-with.html

16:48 <jekstrand> danvet: Pretty sure I've already been doing that

16:48 <jekstrand> Kernel cover letters have had Test-with on them

16:48 bertje__ has quit [Remote host closed the connection]

16:55 jkrzyszt has quit [Ping timeout: 480 seconds]

17:10 saurabhg has quit [Ping timeout: 480 seconds]

17:19 hch12907 has quit [Ping timeout: 480 seconds]

17:19 <jekstrand> Ok, time to see how robust our RADV CI is. :D

17:20 <zmike> I'll give you one guess

17:22 <jekstrand> Not robust enough for my patches, probably. :D

17:25 <zmike> make sure you run the manual jobs using the override token

17:34 sdutt has quit []

17:34 sdutt has joined #dri-devel

17:36 <jekstrand> ?

17:44 <bnieuwenhuizen> zmike: I think the collabora radv runners run by default?

17:45 <bnieuwenhuizen> the Valve ones need some kind of manual triggers

17:47 janesma has quit [Remote host closed the connection]

17:47 janesma has joined #dri-devel

17:50 <daniels> yeah

18:12 alyssa has left #dri-devel [#dri-devel]

18:15 nchery has quit [Quit: Leaving]

18:26 nchery has joined #dri-devel

18:38 tlwoerner has quit [Quit: Leaving]

18:38 tlwoerner has joined #dri-devel

19:03 slattann has quit [Quit: Leaving.]

19:11 lumag_ has quit [Ping timeout: 480 seconds]

19:20 Akari has quit [Ping timeout: 480 seconds]

19:29 <jenatali> Oof, https://en.cppreference.com/w/cpp/utility/unreachable is breaking my Mesa build because I have it set using c++latest

19:30 <jekstrand> Ouch

19:31 <jenatali> I can work around it (c++20 or 17) for now but we'll have to resolve that at some point probably

19:36 `join_subline has quit [Remote host closed the connection]

19:36 mbrost has quit [Read error: Connection reset by peer]

19:36 kts_ has joined #dri-devel

19:40 kts has quit [Ping timeout: 480 seconds]

19:50 lemonzest has quit [Quit: WeeChat 3.5]

19:50 ahajda_ has quit [Read error: Connection reset by peer]

19:52 * jekstrand is really starting to hate the RADV meta code...

19:55 ahajda_ has joined #dri-devel

19:56 `join_subline has joined #dri-devel

19:56 <jekstrand> Not that it's necessarily bad code or anything. It's just that begin/end rendering are hopelessly intertwined.

20:00 <airlied> well if vulkan had just done no subpasses up front :-P

20:02 <HdkR> Just means we need to add vkBegin and vkEnd.

20:02 <jekstrand> airlied: It's not that. I've got that *mostly* detangled.

20:03 <jekstrand> With the renderpass code, I'm resetting everything when you save the state as a sanity measure. Turns out a lot of things expect to be able to rely on the old render pass or attachments right up until the last moment. :-/

20:03 lumag_ has joined #dri-devel

20:16 ahajda_ has quit [Remote host closed the connection]

20:17 icecream95 has joined #dri-devel

20:17 tzimmermann has quit [Quit: Leaving]

20:24 `join_subline has quit [Remote host closed the connection]

20:34 `join_subline has joined #dri-devel

20:35 camus has joined #dri-devel

20:37 Danct12 has joined #dri-devel

20:39 rkanwal has quit [Read error: No route to host]

20:39 rkanwal has joined #dri-devel

20:40 camus1 has quit [Ping timeout: 480 seconds]

20:43 `join_subline has quit [Remote host closed the connection]

20:45 Duke`` has quit [Ping timeout: 480 seconds]

21:06 <jekstrand> cwabbott: Ping https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16111

21:08 `join_subline has joined #dri-devel

21:08 rkanwal has quit [Read error: Connection reset by peer]

21:09 rkanwal has joined #dri-devel

21:10 <zmike> jekstrand: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16638

21:12 Haaninjo has quit [Quit: Ex-Chat]

21:12 heat has quit [Read error: Connection reset by peer]

21:12 heat has joined #dri-devel

21:33 <cwabbott> jekstrand: I'm gone for the next few weeks

21:34 <jekstrand> cwabbott: :-/

21:40 mclasen_ has joined #dri-devel

21:45 mclasen has quit [Ping timeout: 480 seconds]

22:04 oneforall2 has quit [Remote host closed the connection]

22:12 jfalempe has quit []

22:12 oneforall2 has joined #dri-devel

22:15 <jstultz> danvet: jekstrand: hey! I'm trying to get my head around some of the umf / drm_syncobj discussions.. curious if there was a sense how non drm drivers (cameras, decoders, etc) would interplay with the move the drm_syncobj for everything? At least w/ sync_files were not so subsystem specific.

22:20 <jekstrand> jstultz: I don't think we're yet set on drm_syncobj for everything

22:22 <jstultz> jekstrand: ah, sorry, i'm maybe over-emphasizing some of danvet's comments. But are folks thinking wider than the drm/ dir w/ umf?

22:23 <jekstrand> I don't think we have a clear path, TBH.

22:23 <airlied> drm_syncobj for anything that is in drm

22:23 <jekstrand> We need to keep DRM working with other components

22:23 <airlied> if it has to interact with other stuff outside the drm device then sync files is fine

22:23 <airlied> but drm interfaces should be restricted to syncobjs and sync_files should be explicit conversions

22:24 <jekstrand> And if we move to UMF, the dma_fence finite time guarantees get tricky but maybe, in a UMF world, with "modern" fencing, we can do that with a simple timeout.

22:24 <jekstrand> But I don't think there's any assumption that things like v4l will need to start interacting with UMF any time soon.

22:24 <jekstrand> Or that it's even tractable, honestly.

22:25 <jekstrand> The goal is to get DRM drivers running as fast as possible with all the fancy stuff for compute etc.

22:25 <jekstrand> Once you leave the DRM world, I think we're ok with converting to sync_file.

22:25 <jekstrand> I think

22:25 <jekstrand> That's a load bearing "I think"

22:25 <jstultz> heh

22:25 <jekstrand> There's a whole lot of design space to trim down still.

22:26 <jekstrand> But I don't see us being able to ever get away from having to interact with sync_file so we're going to have to figure out how to make that work somehow.

22:26 <jekstrand> If the best solution we can find sucks too much than maybe plumb UMF through to other things.

22:27 <jekstrand> Hopefully, we'll find something that doesn't suck too much.

22:27 <jstultz> jekstrand: so, the sync_file API seems pretty bounded. In a lot of the discussions issues w/ dma_fences are transposed onto sync_files, but couldn't sync_files be backed by something else?

22:28 <jstultz> jekstrand: or is there some aspect where existing userland understands it as a dma_fence and so behavior is fixed?

22:29 <jekstrand> jstultz: Possibly but the guarantees we provide to userspace are basically the same guarantees as we have for dma_fence.

22:30 <jekstrand> Which is to say that while we certainly could swap out the internals, I don't see it helping solve the fundamental problem.

22:30 <jstultz> jekstrand: ok, so behavioral guarantees leaked.

22:30 <jekstrand> Because if it works for sync_file, it works for dma_fence and so we may as well just wrap <SYNC_THING> in a dma_fence.

22:30 <jekstrand> jstultz: I don't know that it's so much a leak as just very sensible semantics that both have.

22:31 <jekstrand> The primary such semantic being the finite time guarantees.

22:33 <danvet> jekstrand, timeout alone can deadlock

22:33 <danvet> it's like "mutex_lock_timeout fixes my deadlock" approach to locking design

22:33 <jekstrand> danvet: Yes, I know timeout isn't enough.

22:33 <jekstrand> I'm waving my hands and hoping that future me is somehow smarter.

22:34 <danvet> jekstrand, yeah just wanted to make sure, because the siren calls of "this should be simpler, why isn't it" is so strong on this topic :-)

22:34 <danvet> jekstrand, same

22:34 <jekstrand> danvet: Oh, sure.

22:38 mvlad has quit [Remote host closed the connection]

22:55 bgs has quit [Read error: Connection reset by peer]

22:55 bgs has joined #dri-devel

23:05 <danvet> jstultz, I'm kinda assuming that socs and smaller systems will stay with the dma_fence/sync_file semantics for quite some more time

23:06 heat_ has joined #dri-devel

23:07 heat has quit [Read error: No route to host]

23:07 <airlied> let's just create drm2, definitely be simpler :-P

23:08 morphis has quit [Ping timeout: 480 seconds]

23:08 <daniels> jstultz: well for V4L2, the UMF story is exactly the same as the pre-UMF story - V4L2 doesn't do anything at all and it's your problem to sort out :P

23:09 <daniels> (unless something's changed since I last looked)

23:09 morphis has joined #dri-devel

23:10 pcercuei has quit [Quit: dodo]

23:11 <jstultz> daniels: are solutions wanted in that space?

23:12 <jenatali> Hm... is there still a need for libglapi.so now that all drivers are Gallium drivers?

23:15 <jstultz> daniels: maybe that sounded snarky - not my intent. trying to better understand the reason wider solution might not be useful between the subsystems, seems like buffers being filled by v4l2 devices heading to the gpu or display would want to provide similar signaling

23:17 <jstultz> daniels: if its just different subsystems focused on their squares of space, or if there are other politics involved

23:18 kts_ has quit []

23:19 <airlied> jenatali: I think it's used on the xserver side

23:19 <jenatali> airlied: Oh like used directly rather than as an implementation detail of GLX/EGL?

23:20 <airlied> I think it uses dri drivers by both paths and it needs to be shared

23:20 <airlied> so it's own loader and via glx/egl

23:20 <jenatali> I see

23:21 * airlied forgets why shared glapi exists though now

23:22 <jenatali> It made sense to me when I was thinking about GL+GLES dispatching to either a classic or gallium driver

23:22 <jenatali> Having it as a muxer essentially

23:22 <jenatali> But now you could just embed the shared glapi in the gallium megadriver and be done with it

23:23 <idr> Yeah... it was at least partially there to make sure that functions that existed in both GL and GLES had the same dispatch offset.

23:23 <jenatali> I'm debating doing that for Windows - now that I split the gallium megadriver out there too, there's no reason for libglapi.dll to exist I think

23:26 <danvet> jstultz, v4l2 had some patches to add sync_file support but they never landed

23:26 <danvet> so someone once cared enough to type some code, but never enough to merge it

23:27 <danvet> otoh the people using sync_file tend to not use v4l2 much, for whatever reasons

23:30 <airlied> I wonder if that patch is shipping in android kernels

23:33 <jstultz> danvet: and if that support was revived, you don't see it as problematic integrating w/ the umf later on? Some of the interesting bits from my understanding the timeline semaphore stuff is how you can set things up and repeat the pipeline over and over. v4l2 frames coming in seems similarly repetitive and it might be nice to be able to avoid the buys work of having to re-generate single-shot sync_files over and over.

23:34 <jstultz> s/buys/busy/

23:36 rasterman has quit [Quit: Gettin' stinky!]

23:36 <danvet> jstultz, so in an ideal world v4l would adopt drm_syncobj (we can rename it) and android would have mesa vk stacks reusing all the drm_syncobj infrastructure

23:36 <danvet> in reality I think android doesn't use v4l at all and adopts upstream gpu stuff at a snails pace at best, so none of this matters

23:38 <danvet> jstultz, for interop the really nasty part of dma_fence is the interaction with memory shrinkers

23:38 <danvet> but v4l pre-reserves all buffers, so doing umf interop once you have drm_syncobj on v4l side should be trivial

23:39 <danvet> any time you have a v4l driver that does dynamic memory management it probably should have been a drm driver instead :-)

23:39 <airlied> like I think the biggest drm_syncobj block is they are identified by handing off the drm file descriptor

23:39 <jstultz> danvet: is renaming drm_syncobj sufficient? the ioctls all take drm devices, no?

23:39 <danvet> anyway I should have started sleeping like 2-3 hours ago ...

23:39 <airlied> we'd have to introduce fd semantics somewhere

23:39 <danvet> jstultz, it's a stand-alone fd too

23:40 <danvet> like sync_file

23:40 tursulin has quit [Read error: Connection reset by peer]

23:40 <danvet> jstultz, but yeah need some decoupling, maybe even a syscall

23:41 <jstultz> danvet: ok

23:41 <airlied> oh yeah we do share them as fds as well, it's just to make sure they aren't sync_file by accident

23:41 <danvet> jstultz, kinda like the gem_bo -> dma_buf fd trick we played

23:45 <daniels> jstultz: no snark taken :) from my understanding V4L2 isn't optimistic when you dequeue output (no signal before completion), and so far they haven't seen much need to push fencing down into the queue path either, because their usecases are so straightline that you don't gain much from having done so

23:46 <daniels> jstultz: I have no meaningful opinion on whether this is good or bad myself

23:46 <jstultz> danvet: ok, sounds good. i really do appreciate the discussion (I know the nihilism is strong), and this helps me paint a more coherent picture.

23:46 <daniels> (I'm sure this is to some extent coloured by how difficult it is to introduce new V4L2 uAPI ...)

23:55 danvet has quit [Ping timeout: 480 seconds]

23:57 <jekstrand> Did I break multiview?