#dri-devel on 2022-05-04 — irc logs at oftc.irclog.whitequark.org

2022-03-22 11:57 ChanServ changed the topic of #dri-devel to: <ajax> nothing involved with X should ever be unable to find a bar

00:02 nchery is now known as Guest3491

00:02 nchery has joined #dri-devel

00:02 <jekstrand> karolherbst: Sounds about right

00:02 <jekstrand> karolherbst: It's probably timing out

00:02 <karolherbst> ahh

00:03 <karolherbst> that fast though?

00:03 <karolherbst> although I think something is not right :D

00:03 <karolherbst> with llvmpipe I am getting "write: 512 GB in 368.0 ms: 1391.4 GB/s"

00:04 <karolherbst> what the heck is that thing doing

00:05 <karolherbst> jekstrand: anything I can do so it doesn't time out?

00:06 iive has quit []

00:06 co1umbarius has joined #dri-devel

00:08 Guest3491 has quit [Ping timeout: 480 seconds]

00:08 columbarius has quit [Ping timeout: 480 seconds]

00:09 <Ristovski> lol I love "Cool contexts are too cool to be banned! (Used for reset testing.)"

00:10 <karolherbst> wait..

00:10 <karolherbst> it simply creates a 1GB buffer and copies that all over again

00:11 <karolherbst> yeah okay.. I guess llvm could optimize that shit away

00:29 <karolherbst> dcbaker: do we support build profiles in meson for rust?

00:30 <karolherbst> I might want to start adding debug specific code, because some issues start to become impossbiel to debug, but I also don't want release builds to suffer from it

00:30 <karolherbst> and I need some workarounds where functions are static inline in release, but real function in debug

00:32 LexSfX has quit []

00:37 LexSfX has joined #dri-devel

00:46 <jekstrand> karolherbst: Yeah, there are limits. Letting compute contexts run forever is on the ToDo list for i915 but it's a long road.

00:46 <karolherbst> mhh, how is intels compute runtime handling it?

00:47 <jekstrand> Getting banned unless you're running on a juiced kernel or with cmdline parameters set

00:47 <karolherbst> mhhh

00:47 <jekstrand> There was theoretical support for long-running contexts in theory for a while but it got pulled because it didn't actually work properly.

00:47 <karolherbst> strange.. because I tihnk that test just works as is with their runtime

00:48 <karolherbst> "128 GB in 6230.1 ms (20.5 GB/s)"

00:48 <karolherbst> the bandwidth even seems reasonable

00:49 <karolherbst> but I suspect that we might be doing something incorrectly somewhere... I just don't know what yet

00:49 <karolherbst> the test is super trivial though

00:53 Company has quit [Quit: Leaving]

00:55 Kayden has quit [Quit: Leaving]

01:00 <karolherbst> noooo.. now I can't lock my screen.. fun

01:04 columbarius has joined #dri-devel

01:05 co1umbarius has quit [Ping timeout: 480 seconds]

01:23 Kayden has joined #dri-devel

01:48 stuart has quit []

02:14 jewins has quit [Ping timeout: 480 seconds]

02:19 jimjams has joined #dri-devel

02:28 nchery has quit [Ping timeout: 480 seconds]

03:11 JohnnyonFlame has quit [Read error: Connection reset by peer]

03:15 heat has quit [Ping timeout: 480 seconds]

04:15 lemonzest has joined #dri-devel

04:15 yoslin has quit [Quit: WeeChat 3.4.1]

04:19 yoslin has joined #dri-devel

04:43 consolers has joined #dri-devel

04:44 <consolers> no clues for me about the opencl segfaults since mesa 21?

04:46 <consolers> afaict all my mesa builds since i moved from 20.3 to 21.2 and since have the problem. if it is a problem with my setup, i need a clue about where to look

04:59 sdutt_ has joined #dri-devel

04:59 sdutt has quit [Remote host closed the connection]

05:08 Duke`` has joined #dri-devel

05:12 mdroper has quit [Read error: Connection reset by peer]

05:27 itoral has joined #dri-devel

05:29 danvet has joined #dri-devel

05:33 itoral_ has joined #dri-devel

05:39 <HdkR> consolers: I'd recommend building a debug version of mesa and getting a backtrace and creating an issue on it.

05:39 Kayden has quit [Read error: Connection reset by peer]

05:39 K`den has joined #dri-devel

05:39 itoral has quit [Ping timeout: 480 seconds]

05:39 K`den is now known as Kayden

05:40 <HdkR> Three days asking about a segfault and no answers. Needs more information and better visibility tracking :)

05:40 <consolers> ok i'll try that. should . with rc3 or relase

05:41 <consolers> its happening since i moved to mesa-21

05:41 <airlied> install debug symbols and get a backtrace in gdb

05:41 <consolers> i'm on gentoo, it'll be a huge build

05:43 <consolers> the question is should i rebuild my present media-libs/mesa-22.0.0 or get new sources

05:43 <airlied> rebuild the one that is crashing seems like a good idea

05:46 <consolers> the problem is i wont be able to test it without installing

05:46 <consolers> if i use the gentoo framework

05:46 <consolers> let me see...

05:46 frieder has joined #dri-devel

05:54 reductum has joined #dri-devel

05:56 <airlied> Ristovski: yeah rusticl/amd is a fair bit away no matter which path I take or leave other people to take :-P

06:00 libv_ is now known as libv

06:03 cheako has quit [Quit: Connection closed for inactivity]

06:03 lumag_ has joined #dri-devel

06:08 frieder_ has joined #dri-devel

06:08 frieder_ has quit [Remote host closed the connection]

06:08 frieder has quit [Quit: Leaving]

06:09 frieder has joined #dri-devel

06:10 mvlad has joined #dri-devel

06:17 Daanct12 has joined #dri-devel

06:27 mceier has quit [Quit: Reconnecting]

06:27 mceier has joined #dri-devel

06:33 mszyprow has joined #dri-devel

06:39 consolers has quit [Ping timeout: 480 seconds]

06:42 zzoon[m] is now known as zzoon_holidays_till_8th[m]

06:45 Daanct12 has quit [Quit: Leaving]

06:46 Daanct12 has joined #dri-devel

06:53 MajorBiscuit has joined #dri-devel

06:55 jkrzyszt has joined #dri-devel

06:59 tursulin has joined #dri-devel

07:07 tzimmermann has joined #dri-devel

07:13 i-garrison has quit [Ping timeout: 480 seconds]

07:14 <pq> HdkR, maybe you could replace "disable autodetect" with "force mode"? I have a feeling that might be more likely to exist in any compositor.

07:27 <HdkR> Apparently wlroots has some understanding of virtual output as well

07:30 i-garrison has joined #dri-devel

07:35 <pq> if it's virtual output you actually want, then sure

07:35 <pq> I assumed this was about some KVM switch that messes up your whatever when you switch :-)

07:36 <HdkR> yes, that's the exact problem case

07:38 mairacanal[m] has quit []

07:38 arisu has quit []

07:38 tintou has quit [Quit: Bridge terminating on SIGTERM]

07:38 Eighth_Doctor has quit []

07:38 go4godvin has quit [Quit: Bridge terminating on SIGTERM]

07:38 tomba has quit [Quit: Bridge terminating on SIGTERM]

07:38 Tooniis[m] has quit []

07:38 dcbaker has quit [Quit: Bridge terminating on SIGTERM]

07:38 MatrixTravelerbot[m] has quit []

07:38 kallisti5[m] has quit []

07:38 bylaws has quit [Quit: Bridge terminating on SIGTERM]

07:38 exit70[m] has quit []

07:38 unrelentingtech has quit []

07:38 Guest331 has quit []

07:38 gagallo7[m] has quit []

07:38 Anson[m] has quit []

07:38 onox[m] has quit []

07:38 gnustomp[m] has quit []

07:38 YaLTeR[m] has quit []

07:38 hasebastian[m] has quit []

07:38 cwfitzgerald[m] has quit []

07:38 yshui` has quit []

07:38 jenatali has quit [Quit: Bridge terminating on SIGTERM]

07:38 pushqrdx[m] has quit []

07:38 reactormonk[m] has quit []

07:38 nielsdg has quit []

07:38 masush5[m] has quit []

07:38 Dylanger has quit [Quit: Bridge terminating on SIGTERM]

07:38 heftig has quit [Quit: Bridge terminating on SIGTERM]

07:38 danylo has quit [Quit: Bridge terminating on SIGTERM]

07:38 neobrain[m] has quit []

07:38 Newbyte has quit [Quit: Bridge terminating on SIGTERM]

07:38 DrNick has quit []

07:38 MrR[m] has quit []

07:38 PiGLDN[m] has quit []

07:38 DavidHeidelberg[m] has quit []

07:38 ralf1307[theythem][m] has quit []

07:38 Sumera has quit [Quit: Bridge terminating on SIGTERM]

07:38 T_UNIX has quit []

07:38 doras has quit [Quit: Bridge terminating on SIGTERM]

07:38 robertmader[m] has quit []

07:38 kusma has quit [Quit: Bridge terminating on SIGTERM]

07:38 aura[m] has quit []

07:38 Andy[m] has quit []

07:38 Strit[m] has quit []

07:38 chema has quit [Quit: Bridge terminating on SIGTERM]

07:38 znullptr[m] has quit []

07:38 undvasistas[m] has quit []

07:38 jasuarez has quit [Quit: Bridge terminating on SIGTERM]

07:38 mripard has quit [Quit: Bridge terminating on SIGTERM]

07:38 cleverca22[m] has quit []

07:38 zamundaaa[m] has quit [Quit: Bridge terminating on SIGTERM]

07:38 tonyk has quit [Quit: Bridge terminating on SIGTERM]

07:38 LaughingMan[m] has quit []

07:38 shadeslayer has quit [Quit: Bridge terminating on SIGTERM]

07:38 x512[m] has quit []

07:38 Mis012[m] has quit [Quit: Bridge terminating on SIGTERM]

07:38 martijnbraam has quit [Quit: Bridge terminating on SIGTERM]

07:38 egalli has quit []

07:38 Vin[m] has quit []

07:38 RAOF has quit []

07:38 gdevi has quit []

07:38 chivay has quit []

07:38 halfline[m] has quit []

07:38 zzoon_holidays_till_8th[m] has quit []

07:38 JosExpsito[m] has quit []

07:38 jekstrand[m] has quit []

07:38 Mershl[m] has quit []

07:38 sigmoidfunc[m] has quit []

07:38 moben[m] has quit []

07:38 naheemsays[m] has quit []

07:38 bluepenquin has quit [Quit: Bridge terminating on SIGTERM]

07:38 dhanuka[m] has quit []

07:38 mighty17 has quit []

07:38 unevenrhombus[m] has quit []

07:38 ramacassis[m] has quit []

07:40 itoral_ has quit [Remote host closed the connection]

07:40 itoral_ has joined #dri-devel

07:44 arisu has joined #dri-devel

07:52 <tzimmermann> hi! i'm looking to trade reviews for https://lore.kernel.org/dri-devel/20220502142514.2174-1-tzimmermann@suse.de/

08:02 lynxeye has joined #dri-devel

08:02 <javierm> tzimmermann: I'll prepare some coffee and then take a look

08:02 <tzimmermann> javierm, do you want to have anything reviewed?

08:04 <javierm> tzimmermann: I don't have nothing to trade, but maybe picking your brain to discuss how to move https://lists.freedesktop.org/archives/dri-devel/2022-April/353442.html forward ?

08:07 <tzimmermann> javierm, again? ok, where are we with this problem?

08:07 <javierm> tzimmermann: the three first patches have been superseded by https://lists.freedesktop.org/archives/dri-devel/2022-May/353872.html, but the rest is still needed

08:10 <javierm> tzimmermann: so there are two things that are still missing 1) make sysfb handle the unregistration of the platform devices registered by it and 2) disable sysfb when a driver is probed

08:11 <javierm> (2) also should unregister the device if sysfb registered it, that's why we need (1)

08:12 consolers has joined #dri-devel

08:13 <consolers> ok i have a backtrace for the clinfo segfault: http://ix.io/3WVD

08:13 <jfalempe> tzimmermann, your patch may conflict with https://lists.freedesktop.org/archives/dri-devel/2022-April/352966.html

08:14 <javierm> jfalempe: those have been reviewed by Lyude already right? Maybe we could land that so tzimmermann can re-send on top ?

08:14 <javierm> actually, better if tzimmermann reviews those ones and then you can trade with him :)

08:15 <jfalempe> Yes, Lyude wanted tzimmermann to review as well ;)

08:15 <jfalempe> I'm preparing another patch for better gamma support for mgag200 too.

08:15 <javierm> jfalempe: you also will be a much better reviewer for his patches than me, since you are already familiar with the driver code

08:16 <tzimmermann> jfalempe, i didn't see your patch. i'll review soon

08:16 <javierm> tzimmermann: just ignore this problem is also an acceptable answer for me. It's just that danvet wanted to land the last two patches of that series, and can't be done until we fix the race

08:17 <jfalempe> tzimmermann, thanks, that's a good trade ;)

08:17 <tzimmermann> jfalempe, ah yep. the gamma-lut setup is horrible

08:17 <tzimmermann> it's still from the times when mgag200 was non-atomic

08:18 <jfalempe> yes, that don't work with gnome3 night-time setup.

08:19 <jfalempe> also there are some special case for 16bits. I'm not sure if there are really needed.

08:19 <tzimmermann> jfalempe, BTW i intent to replace mgag200's simple-kms with regular atomic helpers. and also rework the way differnt models are handled. but please don't wait for these changes

08:20 <tzimmermann> i'm looking forward to the gamma changes. i always wanted to fix that, but never had the time. thanks for working on this

08:21 <tzimmermann> javierm, we wanted to disable sysfb when we register the first native driver, right?

08:21 <javierm> tzimmermann: yes

08:21 apinheiro has joined #dri-devel

08:21 pcercuei has joined #dri-devel

08:21 <jfalempe> I've looked into this, because there is a bug in mutter, where it always try to set gamma, even if driver doesn't support it.

08:22 <javierm> ah, matrox cards are common in server hardware. I wondered why you folks had so much interest in this driver :)

08:23 <javierm> tzimmermann: and that's what I did in v4

08:23 <tzimmermann> it has a retro feeling to it :)

08:24 <javierm> haha

08:26 <tzimmermann> javierm, can we do without DRIVER_FIRMWARE?

08:27 itoral_ has quit [Remote host closed the connection]

08:27 <javierm> tzimmermann: not really, because otherwise the DRM core has no way to know that simledrm is the driver registered the DRM device and will attempt to remove its own platform device

08:27 itoral_ has joined #dri-devel

08:27 <javierm> tzimmermann: so is either add a new DRIVER_FIRMWARE capability or do it at remove_conflicting_framebuffers() time

08:28 <javierm> since simpledrm and other drivers using a firmware provided fb won't call that

08:28 <javierm> I'm leaning towards the latter

08:28 <danvet> lynxeye, if you want me to just apply your patch and do the additional fix as a follow up I guess just tell me

08:29 rasterman has joined #dri-devel

08:29 <tzimmermann> javierm, you mean the patch at https://patchwork.freedesktop.org/patch/484027/?series=103319&rev=1 ?

08:29 <javierm> tzimmermann: yes

08:30 <tzimmermann> javierm, i have a meeting now. i'll get back to you later today

08:30 <javierm> tzimmermann: doing it at register_framebuffer() (for fbdev) and drm_dev_register() (for DRM) is more correct, agree but a good compromise is doing it at remove_conflicting_framebuffers() to avoid a new cap

08:30 <javierm> tzimmermann: Ok, later!

08:39 itoral_ has quit [Remote host closed the connection]

08:39 itoral_ has joined #dri-devel

08:40 hch12907 has quit [Ping timeout: 480 seconds]

08:42 jimjams has quit [Quit: Connection closed for inactivity]

08:50 consolers has quit [Ping timeout: 480 seconds]

08:53 <danvet> javierm, trying again to catch up a bit, which patch set should I look at?

08:54 itoral_ has quit [Remote host closed the connection]

08:54 itoral_ has joined #dri-devel

08:56 itoral_ has quit [Remote host closed the connection]

08:57 hch12907 has joined #dri-devel

08:57 itoral_ has joined #dri-devel

08:58 itoral_ has quit [Remote host closed the connection]

08:58 itoral_ has joined #dri-devel

08:58 <javierm> danvet: latest is https://lists.freedesktop.org/archives/dri-devel/2022-April/353442.html

09:01 <javierm> danvet: and the question was whether we want to add a new DRM_FIRMWARE cap (like the patch-set does) or just do the sysfb disable and pdev removal at remove_conflicting_framebuffers(), as was done in v2

09:02 <javierm> I'm leaning towards the latter, but tzimmermann suggested the former so I wanted to get an agreegment with him about the preferred approach

09:03 <danvet> javierm, maybe I get it all wrong, but I thought we have to do the removal upfront at remove_conflicting_fb time

09:03 <danvet> by drm_dev_register time it's too late

09:03 <danvet> maybe something uber-clever like doing it at drm_dev_alloc time might work, but it seems very wonky to make an alloc function change stuff like that

09:03 <javierm> danvet: right. I should be more precise. We are doing the removal at remove_conflicting_fb time but the question is about the disable

09:04 <javierm> but the disable also implies a removal if that wasn't done before

09:04 <danvet> hm but if you disable later, wont there be a race?

09:04 <danvet> or I'm confused

09:08 <javierm> danvet: right. No, you are not confused... that's true

09:09 <javierm> between removing the conflicting framebuffers and registering the DRM device there's a critical section where sysfb could register a "simple-framebuffer" device and that match simpledrm driver

09:11 itoral_ has quit [Remote host closed the connection]

09:12 maxzor has joined #dri-devel

09:12 itoral_ has joined #dri-devel

09:12 <javierm> danvet: that's an easy answer then, we must do the disable at remove_conflicting_framebuffers() time, and since that's not called by drivers using a firmware-provided FB, there's no need for DRIVER_FW cap

09:13 itoral_ has quit [Remote host closed the connection]

09:13 itoral_ has joined #dri-devel

09:15 Daanct12 has quit [Quit: Leaving]

09:16 itoral_ has quit [Remote host closed the connection]

09:17 itoral_ has joined #dri-devel

09:19 itoral_ has quit [Remote host closed the connection]

09:20 itoral_ has joined #dri-devel

09:24 itoral_ has quit [Remote host closed the connection]

09:25 itoral has joined #dri-devel

09:25 <danvet> javierm, well that leaves FB_INFO_MISC_FIRMWARE, but I'm not sure that actually matters when we nuke simpledrm through the sysfb device

09:25 <danvet> since "is it the driver bound against the sysfb device" is a much more precise check

09:26 <danvet> javierm, or is there some other use for the drm fw driver cap flag?

09:26 <danvet> (that I'm missing I mean)

09:27 itoral has quit [Remote host closed the connection]

09:28 itoral has joined #dri-devel

09:29 <javierm> danvet: FB_INFO_MISC_FIRMWARE was actually handled with a different approach in https://lists.freedesktop.org/archives/dri-devel/2022-May/353872.html

09:29 <javierm> danvet: so if that lands, then there's no need anymore for the drm fw driver cap flag

09:30 <danvet> javierm, do we need to set that flag even?

09:32 <danvet> it seems to impact only two places: the fb removal (where we don't need it when we do it all through sysfb)

09:32 <danvet> and some really funky font freeing special case on the virtual console

09:32 <javierm> danvet: I believe we do, for the corner case where you have simpledrm but then a real fbdev driver is probed that would want to kick out simpledrm

09:32 <javierm> danvet: because the remove confliciting fb loop has:

09:32 <javierm> if (!(registered_fb[i]->flags & FBINFO_MISC_FIRMWARE))

09:32 <danvet> uh

09:33 <danvet> can we just not care about that case?

09:33 <danvet> or if we do, teach fbdev to also nuke sysfb as needed?

09:34 <javierm> danvet: fbdev will nuke sysfb if we set FBINFO_MISC_FIRMWARE for simpledrm

09:34 <javierm> or rather, will nuke the pdev associated with the fbdev registered by simpledrm

09:34 itoral has quit [Remote host closed the connection]

09:34 <javierm> danvet: but I don't think that could cause any harm to set FBINFO_MISC_FIRMWARE, that feels the correct thing to do

09:35 <javierm> danvet: and the other two patches in that series have merit on its own IMO

09:35 itoral has joined #dri-devel

09:36 <javierm> danvet: what I would like is to land https://lists.freedesktop.org/archives/dri-devel/2022-May/353872.html and then rebase https://lists.freedesktop.org/archives/dri-devel/2022-May/353872.html on top

09:36 preda has joined #dri-devel

09:36 <javierm> the first 3 patches from that series could be dropped and the DRIVER_FW cap not needed since we will disable sysfb at remove conflicting fb time

09:36 <danvet> javierm, twice the same link?

09:36 <javierm> gah

09:37 <danvet> javierm, 9a45ac2320d0a just stumbled over this

09:37 <javierm> danvet: rabase https://lists.freedesktop.org/archives/dri-devel/2022-April/353442.html on top

09:37 <danvet> agd5f, ^^ did we figure out more what's going on there with efifb?

09:37 itoral has quit [Remote host closed the connection]

09:37 <danvet> javierm, yeah I guess makes sense

09:38 <danvet> javierm, note that default bpp is a different can of worms, I kinda want to outright nuke that entire thing because it's so much wrong

09:38 <danvet> but never got anywhere

09:38 <danvet> we mix up bpp and depth

09:38 itoral has joined #dri-devel

09:38 <javierm> danvet: yes, I noticed the FIXME in drm_fbdev_generic_setup()

09:39 <javierm> danvet: but this is actually in preparation of nuking it. Since then drm_fbdev_generic_setup() won't have a bpp param anymore and can be removed from "options"

09:40 Andy[m] has joined #dri-devel

09:40 aura[m] has joined #dri-devel

09:40 bylaws has joined #dri-devel

09:40 Guest26 has joined #dri-devel

09:40 chema has joined #dri-devel

09:40 chivay has joined #dri-devel

09:40 RAOF has joined #dri-devel

09:40 Eighth_Doctor has joined #dri-devel

09:40 cleverca22[m] has joined #dri-devel

09:40 cwfitzgerald[m] has joined #dri-devel

09:40 dcbaker has joined #dri-devel

09:40 Anson[m] has joined #dri-devel

09:40 dhanuka[m] has joined #dri-devel

09:40 Guest21 has joined #dri-devel

09:40 doras has joined #dri-devel

09:40 danylo has joined #dri-devel

09:40 Dylanger has joined #dri-devel

09:40 itoral has quit [Remote host closed the connection]

09:40 egalli has joined #dri-devel

09:40 exit70[m] has joined #dri-devel

09:40 gagallo7[m] has joined #dri-devel

09:40 gdevi has joined #dri-devel

09:40 gnustomp[m] has joined #dri-devel

09:40 Guest29 has joined #dri-devel

09:40 halfline[m] has joined #dri-devel

09:40 hasebastian[m] has joined #dri-devel

09:40 heftig has joined #dri-devel

09:40 zzoon_holidays_till_8th[m] has joined #dri-devel

09:40 jasuarez has joined #dri-devel

09:40 jekstrand[m] has joined #dri-devel

09:40 jenatali has joined #dri-devel

09:40 JosExpsito[m] has joined #dri-devel

09:40 kallisti5[m] has joined #dri-devel

09:40 kusma has joined #dri-devel

09:40 LaughingMan[m] has joined #dri-devel

09:40 mairacanal[m] has joined #dri-devel

09:40 martijnbraam has joined #dri-devel

09:40 masush5[m] has joined #dri-devel

09:40 Mershl[m] has joined #dri-devel

09:40 mighty17 has joined #dri-devel

09:40 Mis012[m] has joined #dri-devel

09:40 moben[m] has joined #dri-devel

09:40 mripard has joined #dri-devel

09:40 Vin[m] has joined #dri-devel

09:41 naheemsays[m] has joined #dri-devel

09:41 neobrain[m] has joined #dri-devel

09:41 Newbyte has joined #dri-devel

09:41 nielsdg has joined #dri-devel

09:41 DavidHeidelberg[m] has joined #dri-devel

09:41 onox[m] has joined #dri-devel

09:41 PiGLDN[m] has joined #dri-devel

09:41 pmoreau has joined #dri-devel

09:41 pushqrdx[m] has joined #dri-devel

09:41 r[m] has joined #dri-devel

09:41 ralf1307[theythem][m] has joined #dri-devel

09:41 ramacassis[m] has joined #dri-devel

09:41 reactormonk[m] has joined #dri-devel

09:41 robertmader[m] has joined #dri-devel

09:41 shadeslayer[m] has joined #dri-devel

09:41 itoral has joined #dri-devel

09:41 sigmoidfunc[m] has joined #dri-devel

09:41 Strit[m] has joined #dri-devel

09:41 Sumera[m] has joined #dri-devel

09:41 T_UNIX has joined #dri-devel

09:41 tintou has joined #dri-devel

09:41 tomba has joined #dri-devel

09:41 tonyk has joined #dri-devel

09:41 Tooniis[m] has joined #dri-devel

09:41 undvasistas[m] has joined #dri-devel

09:41 unevenrhombus[m] has joined #dri-devel

09:41 unrelentingtech has joined #dri-devel

09:41 MatrixTravelerbot[m] has joined #dri-devel

09:41 x512[m] has joined #dri-devel

09:41 <javierm> danvet: notice that is_firmware_framebuffer() also checks for if (!(registered_fb[i]->flags & FBINFO_MISC_FIRMWARE))

09:41 YaLTeR[m] has joined #dri-devel

09:41 yshui` has joined #dri-devel

09:41 zamundaaa[m] has joined #dri-devel

09:41 znullptr[m] has joined #dri-devel

09:41 pmoreau is now known as Guest33

09:42 itoral has quit [Remote host closed the connection]

09:43 itoral has joined #dri-devel

09:43 <javierm> so setting that for the simpledrm emulated fbdev really feels idiomatic

09:43 rkanwal has joined #dri-devel

09:43 <tzimmermann> javierm, danvet, about the DRIVER_FIRMWARE flag: maybe let's rather use a dedicated dev_register function in simpledrm that does not disable sysfb

09:44 <tzimmermann> example code at https://paste.opensuse.org/18607394

09:44 <danvet> tzimmermann, dev_register should never disable sysfb for anyone is my take

09:44 <javierm> tzimmermann: see danvet's comment about being too late at that point

09:45 <javierm> tzimmermann: we really should do it at remove_conflicting_fb time

09:47 <tzimmermann> why would that be it too late?

09:48 <tzimmermann> javierm, saw your comment on that

09:49 <tzimmermann> so we have to disable sysfb first and then kick out the existing firmware fb's

09:51 <danvet> javierm, caught up on some other discussions and I'm not sure that fb_release fix is sound

09:51 <javierm> tzimmermann: yes, or at least in the same section holding the registration lock

09:52 <javierm> danvet: oh, really? already landed in -fixes :/

09:57 <danvet> javierm, yeah hence the ping here

09:58 <javierm> danvet: just saw your email, let me look at the code again

09:58 <javierm> danvet: btw, this is a bug reported by 3 different people so even when papering over the issue, it prevents a NULL pointer deref on fbdev close

10:00 Duke`` has quit [Ping timeout: 480 seconds]

10:08 sdutt_ has quit [Ping timeout: 480 seconds]

10:09 <danvet> javierm, yeah leaking helps to paper over null deref :-)

10:09 Duke`` has joined #dri-devel

10:09 <danvet> javierm, do you know on which exact pointer we're blowing up on?

10:10 <javierm> danvet: yes, struct fb_info * const info in fb_release()

10:10 <javierm> danvet: I stand that the fix is the best we can do given the current situation

10:13 <javierm> danvet: the whole "fb_info can change and then you need to check if file->private_data is still valid" is insane really, but that's how things are

10:15 <danvet> well fbdev is wonky at best

10:15 itoral has quit [Remote host closed the connection]

10:15 <danvet> javierm, can I convince you for a revert? I really don't think this is the right fix

10:16 itoral has joined #dri-devel

10:16 <javierm> danvet: sure, but can we first agree on the right fix? I just don't want to do a revert and then a revert revert if we find that's the proper workaround

10:18 <danvet> I haven't seen yet where exactly we blow up

10:18 sagar_ has joined #dri-devel

10:19 <javierm> danvet: AFAICT https://elixir.bootlin.com/linux/latest/source/drivers/video/fbdev/core/fbmem.c#L1443

10:19 <javierm> info is NULL at this point and &info->lock is the NULL deref

10:19 sagar__ has quit [Ping timeout: 480 seconds]

10:20 itoral has quit [Remote host closed the connection]

10:20 <danvet> that looks very fishy

10:20 <danvet> did we confirm this?

10:20 itoral has joined #dri-devel

10:20 <danvet> like I have no idea how you'd even manage to clear file->private_data

10:21 <danvet> some pointer within fb_info become NULL sounds plausible

10:22 <javierm> danvet: we (thomas and me) weren't able to reproduce it but the report is https://github.com/raspberrypi/linux/issues/5011

10:22 itoral has quit [Remote host closed the connection]

10:25 icecream95 has quit [Ping timeout: 480 seconds]

10:26 <danvet> javierm, I have no idea

10:26 <danvet> javierm, minimally this needs a giantic comment that fbdev is too screwed and it's easier to just leak when we race against removal

10:27 <danvet> it's definitely a very wrong fix, that's for sure

10:30 flacks has quit [Quit: Quitter]

10:30 <danvet> javierm, I think I have it

10:30 <danvet> most drivers are bs when their driver remove callback is called

10:30 <danvet> instead of proper refcounting, they just unconditionally nuke the underlying fb_info

10:31 <danvet> any driver which has framebuffer_release() called from their ->remove callback instead of ->fb_destroy callback is busted

10:32 <danvet> javierm, note that the new drm fbdev emulation built on top of drm_client is I think the only fbdev implementation which gets this right

10:33 <danvet> and the reason this is regressing due to 27599aacbaefcbf2af7b06b0029459bbf682000d is because that switched from unregister_framebuffer to removething the device

10:33 <danvet> the former simply leaks the entire driver crap, the latter calls into the driver's ->remove which then releases the fb_info, but way too early

10:33 <danvet> boom

10:33 <danvet> 90% confident this is t

10:33 <danvet> *it

10:33 * danvet off for lunch now

10:34 <javierm> danvet: interesting, that makes sense. I'll take a look

10:34 <javierm> danvet: enjoy!

10:35 flacks has joined #dri-devel

10:36 <danvet> javierm, I expect if you dig into the assembly of the splat it blows up in some debug pointer in struct mutex or so

10:37 <danvet> which has become garbage

10:37 <danvet> but I don't really do arm assembly :-)

10:38 <javierm> danvet: efifb has the same issue btw

10:38 <javierm> but in this case the bug happened with simplefb

10:39 <javierm> fbdev is really wicked

10:41 <danvet> yeah

10:41 <danvet> luckily we only have to fix the FBINFO_MISC_FIRMWARE drivers

10:42 <javierm> yeah

10:42 <javierm> danvet: and in the future only simpledrm

10:44 <javierm> danvet: it's somehow ironic that the reason why tzimmermann and me are fixing all this fbdev issues is because we want to get rid of it :)

10:53 MajorBiscuit has quit [Ping timeout: 480 seconds]

10:57 <karolherbst> jekstrand: deadlock on the bufmgr :(

10:58 MajorBiscuit has joined #dri-devel

10:59 <karolherbst> no clue on how that can happen though

11:00 <danvet> karolherbst, userspace bufmgr?

11:00 <karolherbst> yes

11:00 <danvet> impressive indeed :-)

11:00 <karolherbst> probably I am doing something stupid :)

11:00 <karolherbst> CL contrary to GL is actually heavily threaded

11:00 <karolherbst> so.. most APIs are just thread safe

11:01 <karolherbst> and here is the fun part: they are not dead lock safe

11:04 <danvet> javierm, well it's the same with Xorg

11:05 <danvet> the people who really know why it should be nuked are also the only ones qualified to fix any bugs in there

11:05 * dv_ is now reminded of the X.org presentation by daniels :)

11:05 <danvet> karolherbst, there's like cl callbacks which allow the driver to call into cl again

11:05 <dv_> err, wayland presentation

11:05 Company has joined #dri-devel

11:06 <karolherbst> danvet: yeah, I know :)

11:06 morphis has quit [Ping timeout: 480 seconds]

11:06 <danvet> karolherbst, oh I mean this was a question?

11:06 <danvet> if yes, that sounds a bit cursed :-)

11:06 <karolherbst> it is, the API spec even is saying so: you might dead lock, be careful

11:06 <karolherbst> :D

11:06 morphis has joined #dri-devel

11:08 mclasen has quit [Remote host closed the connection]

11:08 <emersion> daniels: re wl presentation protocol, how can we unblock the situation? would anyone from the weston side have time for this?>

11:09 consolers has joined #dri-devel

11:11 <javierm> danvet: this then? https://paste.centos.org/view/raw/252de441

11:11 <javierm> danvet: also posted the revert already

11:13 <danvet> javierm, yup

11:13 <danvet> see also my reply to your revert, I think we should put a check into framebuffer_release for safety and easier debugging

11:13 <tzimmermann> javierm, is that unplug bug fixable? you mentioned that drivers need an update. (i'm somewhat out of the loop)

11:14 <danvet> and if we detect an issue, leak instead of calling kfree

11:14 <javierm> tzimmermann: https://paste.centos.org/view/raw/252de441, I plan to do the same for efifb

11:14 <danvet> javierm, simplefb is kinda finny since it drops the iomap from fb_destroy, despite that hw stuff should be dropped from ->remove

11:15 <danvet> so it's kinda exactly the opposite of what it should be

11:15 <danvet> but also given that simplefb doesn't tear down the mmap it's meh anyway

11:15 <danvet> imo not worth fixing

11:15 <karolherbst> ahh.. memory corruption.. nice

11:16 <javierm> danvet: yeah... it seems that for every patch I posted, I end with couple of patches more needed

11:16 <danvet> but it's the "devm for hw, drmm for sw" topic all over again

11:16 <javierm> danvet: the branching factor in fbdev is a thing :)

11:16 <danvet> javierm, well we only need to fix the bugs we uncover and get regression reports for

11:16 <danvet> not all the others

11:16 <tzimmermann> javierm, danvet: that last put can be quite some time later?

11:16 <danvet> especially for hotplug lifetim lolz I think "use drm with fbdev emulation" is totally fine answer

11:16 <danvet> tzimmermann, yeah

11:17 <javierm> tzimmermann: yes, because it may be that you remove the driver but still user-space has a reference to fb_info

11:17 <danvet> whenever userspace closes the last /dev/fb/* file

11:17 <javierm> i.e: can close the fbdev fd much later

11:17 hch12907 has quit [Ping timeout: 480 seconds]

11:17 <javierm> danvet: that open, mmap, write, close uAPI is really terrible

11:17 <tzimmermann> does fbdev release the resources in time? because that's why we added hot-unplug in the first place

11:18 <tzimmermann> vmwgfx tried to acquire the framebuffer that was still reserved by simplefb; hence failed to do so

11:18 <javierm> tzimmermann: I don't think it does, that's why danvet suggested to make the mmap'ed writes to do a SIGBUS

11:18 sagar_ has quit [Remote host closed the connection]

11:18 Namarrgon has quit [Ping timeout: 480 seconds]

11:18 <danvet> tzimmermann, the unregister_framebuffer needs to happen synchronously

11:18 <javierm> tzimmermann: ah, you mean the I/O mem region. Yes, I believe it does in remove

11:18 sagar_ has joined #dri-devel

11:18 <danvet> it's the fb_info kfree which needs to be deleayed

11:19 <tzimmermann> ok

11:19 <danvet> javierm, yeah tbh I'm tempted for a FBINFO_MISC_NOT_SHIT flag which just blindly uses the fb_info from file->private_data

11:19 <tzimmermann> otherwise, we'd be back to the original problem

11:19 <danvet> which we set for drm_client fbdev emulation

11:19 <danvet> since that has a) proper lifetime and b) drm drivers should take care of hotunplug races with drm_dev_enter already

11:20 <javierm> danvet: btw, I just noticed today that fbdev emulation is implemented as a drm_client

11:20 <javierm> so cool, that blew my mind

11:20 <danvet> and leave the horror show uapi for "real" fbdev drivers

11:20 <danvet> javierm, not for all drivers

11:20 <tzimmermann> javierm, it's the only drm_client :)

11:20 <danvet> there's still a pile which hand-roll iirc, and those tend to have a bunch of issues all over

11:20 <danvet> tzimmermann, there were patches for a nice boot splash using drm_client

11:20 <tzimmermann> indeed

11:20 <danvet> and also I think some kgdb resurrection using that

11:20 <tzimmermann> one day....

11:20 <javierm> danvet, tzimmermann: yes, and also for a drmlog

11:21 <danvet> yeah one day :-)

11:21 * danvet also hopefully, a lot has moved already

11:21 <javierm> or drmcon, can't remember

11:21 <danvet> yeah one of them

11:22 <danvet> javierm, uh just realized, we have to move that iounmap to ->remove

11:22 <danvet> since currently it is actually done there due to the unconditional call to framebuffer_release

11:23 <danvet> javierm, so yeah actually everything in simplefb_destroy needs to be called from _remove

11:23 Duke`` has quit [Ping timeout: 480 seconds]

11:23 <danvet> I think so at least

11:23 <danvet> maybe I'm confusing myself again

11:23 <daniels> emersion: yep, I'm definitely willing to put time into it - I think the best next steps are to figure out a) exactly what we need to make Vulkan FIFO work without blocking, and b) find out from media people exactly what they want for their queueing and the semantics, and just do the most achievable thing

11:24 <daniels> I've been doing a little bit of groundwork on Weston to make it easier to experiment with

11:24 <danvet> ah no framebuffer_release does not call ->fb_destroy

11:24 <tzimmermann> jfalempe, do you understand mga_vga_calculate_mode_bandwidth() ? https://elixir.bootlin.com/linux/v5.17.5/source/drivers/gpu/drm/mgag200/mgag200_mode.c#L683

11:24 <danvet> javierm, your patch is fine

11:24 <javierm> danvet: I looked at the order in which the resources are acquired and ioremap_wc() happen after framebuffer_alloc()

11:24 <danvet> daniels, mbox/queue in atomic kms or what's the context?

11:24 <javierm> danvet: yeah, I believe so

11:25 <danvet> javierm, I mean it's horrible, but we knew that going in :-)

11:25 <javierm> danvet: haha yeah

11:25 <jfalempe> tzimmermann, I didn't look into this function yet ;)

11:25 <javierm> danvet: at least is less horrible that the workaround I pushed and reverted :)

11:25 consolers has quit [Ping timeout: 480 seconds]

11:25 <javierm> danvet: sorry for pushing that so eagerly, but we had several reports about the crash

11:26 <tzimmermann> jfalempe, it appears to be a dotclock computation, but i cannot make sense of all these constants

11:26 <tzimmermann> 1024? why?

11:26 neonking_ has joined #dri-devel

11:30 <emersion> daniels: cool!

11:30 <daniels> danvet: in Wayland protocol

11:32 <danvet> daniels, ah cool so you figure this out and then we just implement whatever comes out of that in kms?

11:32 <danvet> or is the idea to fully absorb this in the compositor?

11:32 <jfalempe> tzimmermann, yes it multiply by 1000 and by 100, and divide by 1024, not sure why

11:32 neonking__ has quit [Ping timeout: 480 seconds]

11:33 Duke`` has joined #dri-devel

11:34 <jfalempe> some maybe needed for rounding issue with integer

11:35 <jfalempe> it does (active_area*clock*1000) / total_area

11:35 <tzimmermann> jfalempe, the callers of this function compare the result with some constant that's multiplied by 1024

11:35 <tzimmermann> that's part of a dotclock computation

11:36 <jfalempe> maybe it returns a result in kB

11:36 <tzimmermann> yeah, i guess.

11:37 mclasen has joined #dri-devel

11:37 <tzimmermann> it appears to compute some sort of required memory bandwidth for the mode and the caller compares it to the hardware limit

11:38 <jfalempe> yes, but what is strange to me is to divide by the total_area ?

11:40 <jfalempe> I would say bandwith should be roughly pixel_area * bytes_per_pixels * frame_per_seconds

11:42 <tzimmermann> jfalempe, i thought that was explained in an old xfree86 howto, but i cannot find it any longer https://tldp.org/HOWTO/XFree86-Video-Timings-HOWTO/

11:44 <tzimmermann> it could be some soft limit, so that videomode isn't to close to the actual hardware limits (i.e., use only 80% of the available bandwidth)

11:44 <tzimmermann> but really, i don't understand what this function really does

11:45 <tzimmermann> and i couldn't find similar code in the old matroxfb or x11 drivers

11:45 <tzimmermann> and mgag200 appears to be the only driver tha does this test

11:47 pcercuei has quit [Quit: brb]

11:57 Duke`` has quit [Ping timeout: 480 seconds]

11:58 shadeslayer[m] has quit []

11:58 shadeslayer[m]1 has joined #dri-devel

12:00 shadeslayer[m]1 has quit []

12:00 shadeslayer[m]1 has joined #dri-devel

12:06 pcercuei has joined #dri-devel

12:09 Namarrgon has joined #dri-devel

12:10 Namarrgon has quit []

12:10 consolers has joined #dri-devel

12:11 Namarrgon has joined #dri-devel

12:12 ivyl has quit [Quit: end of flowers]

12:12 <consolers> so it looks like a gcc bug? i'm on gcc-11.2.0 - in gdb mesa-22.0.0/src/gallium/drivers/iris/iris_disk_cache.c:276, note = 0x0 and there is an assert (not && build_id_length(note) == 20) there which does not trigger

12:13 <consolers> oh no not again

12:13 <consolers> this was with -O2 -g

12:14 <consolers> but the crash is a few lines down

12:15 <consolers> isp is flakey again. i have a matrix acct on intel gfx but not here

12:16 <jfalempe> tzimmermann, sometime I just copy this function in a test program, and see what it gives with real-world value. It helps to decide on the brokenness of the code ;)

12:18 <consolers> looking at mesa-22.0.0/src/util/build_id.c:118 (build_id_find_nhdr_for_addr): it looks like dl_iterate_phdr(build_id_find_nhdr_callback, &data) succeeds but data.note which is returned is 0x0

12:18 <consolers> is there some env variable to disable shader cache?

12:22 <consolers> nothing relevant has changed in the mesa side between mesa-20.2.0 and mesa-21.2.1 which is where i first started encountering the crash

12:22 <consolers> maybe i had a gcc upgrade at that point?

12:25 ivyl has joined #dri-devel

12:26 <consolers> maybe i'll try rebuilding with -O0 later

12:34 <daniels> danvet: I think KMS semantics would fall out of the compositor, but it's different enough it wouldn't be a carbon copy

12:36 <danvet> daniels, yeah and I guess the only reason to add fifo to the kernel is to expose the hw fifos

12:36 <danvet> otherwise not much point really

12:37 <danvet> and I have no idea how to expose the hw fifo flip queues since the limitations are tricky

12:37 <danvet> so maybe kms needs a "queue this as fifo, but only if you can put it into the hw fifo queue completely, otherwise don't bother"

12:38 <zmike> dcbaker: pushed

12:38 <consolers> that might solve the halting problem

12:40 consolers has quit [Quit: /l]

12:49 rkanwal has quit [Quit: rkanwal]

12:51 sagar_ has quit [Remote host closed the connection]

12:51 sagar_ has joined #dri-devel

13:00 devilhorns has joined #dri-devel

13:01 <agd5f> danvet, I don't think it was anything wrong with efifb. seems to be related to runtime pm and amdgpu. At least the issue I was fixing with the fbdev patch a few kernels ago

13:02 <agd5f> feel free to drop that patch if you need to. We have a better fix in amdgpu now

13:02 apinheiro has quit [Ping timeout: 480 seconds]

13:05 MajorBiscuit has quit [Quit: WeeChat 3.4]

13:08 <danvet> agd5f, nah if it's already solved then that's all good, it's not getting in the way anywhere

13:08 <danvet> was just reviewing users of FBINFO_MISC_FIRMWARE

13:09 <danvet> so if the amdgpu caller of the is_firmware_fb helper is already out and that's all unexported again then perfect

13:09 <danvet> agd5f, the maybe issue is that simpledrm doesn't set this flag, so if that's loaded instead of efifb it might upset things a bit

13:09 <danvet> maybe, not sure really

13:09 <agd5f> it still calls it, but it's no longer necessary

13:10 <agd5f> I can drop it

13:10 <danvet> agd5f, hm if you can gc that code would be nice

13:10 <danvet> thx

13:10 <agd5f> np

13:12 mihai has joined #dri-devel

13:17 <MrCooper> agd5f: glad you guys found a better solution for that

13:18 preda has quit [Ping timeout: 480 seconds]

13:24 sdutt has joined #dri-devel

13:26 shadeslayer[m]1 has quit []

13:28 shadeslayer[m] has joined #dri-devel

13:30 shadeslayer[m] has quit []

13:30 shadeslayer[m] has joined #dri-devel

13:31 mihai has quit []

13:31 jewins has joined #dri-devel

13:39 moony has quit [Read error: Connection reset by peer]

13:39 moony has joined #dri-devel

13:41 sdutt has quit []

13:42 sdutt has joined #dri-devel

13:46 shadeslayer has joined #dri-devel

13:47 iive has joined #dri-devel

13:51 alyssa has joined #dri-devel

13:51 <alyssa> karolherbst: it works!

13:51 <karolherbst> alyssa: \o/

13:58 hch12907 has joined #dri-devel

14:05 <zmike> pepp: are you planning to submit a fix for that subgroup test?

14:14 kchibisov_ has joined #dri-devel

14:14 kchibisov has quit [Read error: Connection reset by peer]

14:18 Haaninjo has joined #dri-devel

14:22 <karolherbst> "128 GB in 362.1 ms (353.5 GB/s)" this is actually correct...

14:22 <karolherbst> the benchmark is just shitty

14:22 <karolherbst> (it writes 128 times into the same buffer with the same values)

14:23 <karolherbst> I don't think nir can look behind that, but llvm seems to be able to

14:31 <alyssa> what benchmarks *aren't* shitty

14:31 <alyssa> is gfxbench any good?

14:31 <karolherbst> yeah

14:31 <karolherbst> I think..

14:31 <karolherbst> gputest is also quite nice

14:32 <alyssa> I really need to get FEX set up so I can run things that aren't glmark2 and neverball

14:32 * karolherbst starts looking into why darktable renders garbage

14:32 <alyssa> HdkR: ^^ Is there a nice way to slot in my own mesa buidlds into FEX? (Keeping in mind I do trickery with LIBGL_DRIVERS_PATH etc for dev)

14:36 Duke`` has joined #dri-devel

14:49 consolers has joined #dri-devel

14:49 <consolers> its probably not a gcc bug. apparently the build-id is not being found

14:50 shadeslayer is now known as Guest90

14:50 shadeslayer[m] is now known as shadeslayer

14:51 <consolers> meson prints out: Compiler for C supports link arguments -Wl,--build-id=sha1: YES

14:52 Guest29 is now known as go4godvin

14:52 <consolers> and the -Wl,--build-id=sha1 is there in the parameters for -o src/gallium/targets/dri/libgallium_dri.so

14:52 <consolers> can i use nm or something to check the build-id on the .so directly?

14:54 <consolers> ah but no -Wl.--build-id when generating pipe_iris.so - that would do it?

14:55 <jekstrand> karolherbst: That should be unpossible

14:55 <karolherbst> jekstrand: mhh? what?

15:00 <consolers> i cant see which meson.build is building pipe_iris.so

15:01 ella-0 has joined #dri-devel

15:02 <consolers> i think that is missing a ld_args_build_id, but how did i not get a segfault in 20.x

15:03 <jekstrand> karolherbst: deadlocking in bufmgr

15:03 <karolherbst> jekstrand: ahh yeah.. it was a use after free

15:04 <karolherbst> atm I am debugging darktable now to figure out why scaling doesn't work :(

15:04 ella-0_ has quit [Read error: Connection reset by peer]

15:05 <karolherbst> it actually does some serious business compared to anything I tried rusticl on before

15:06 <karolherbst> those are the kernel launched in one "scaling" op: https://gist.githubusercontent.com/karolherbst/cb6f569f72957700ecc3618f852c5d33/raw/6c0e8a6523ff49c483178a03710dbb44d2c81fc0/gistfile1.txt

15:06 <karolherbst> :(

15:07 <karolherbst> ahh.. let me check if there is a difference between 50% and 200% (200% works, like any multiples of 100%)

15:08 <karolherbst> ahh

15:08 <consolers> success! after this patch http://ix.io/3WXJ i don't get the segfault anymore

15:08 <karolherbst> broken scalings have a weird __wrapped_interpolation_resample kernel

15:09 <consolers> i still cant explain how it worked with 20.x

15:10 <consolers> can someone take a look at that and my backtrace posted earlier http://ix.io/3WVD

15:10 anarsoul has quit [Ping timeout: 480 seconds]

15:10 anarsoul has joined #dri-devel

15:11 <karolherbst> uhhh

15:11 <karolherbst> shared

15:11 cheako has joined #dri-devel

15:12 sdutt has quit [Ping timeout: 480 seconds]

15:12 <consolers> this is also opencl?

15:13 <karolherbst> I think I found it..

15:14 <karolherbst> nope.. must be sometihng else

15:15 heat has joined #dri-devel

15:21 <karolherbst> but I am sure something is up with coords

15:22 <karolherbst> sooo..

15:22 maxzor has quit [Ping timeout: 480 seconds]

15:22 <karolherbst> every line on the x axis gets all values from the right border

15:30 sdutt has joined #dri-devel

15:34 consolers has quit [Quit: /]

15:36 Guest21 is now known as DrNick

15:42 <pepp> zmike: probably at some point but it's not a priority

15:42 <karolherbst> jekstrand: ... barriers are only legal outside of CF structures, right?

15:42 <karolherbst> well at least in glsl

15:43 <karolherbst> sooo... what if we have a scoped_barrier inside a loop with an divergent if right in front of it

15:50 <cwabbott> karolherbst: barriers for compute shaders in glsl can be anywhere, but they can't be in divergent control flow

15:50 <karolherbst> right...

15:52 <karolherbst> I guess then that's file what's happening as long as all threads enter the barrier, no?

15:52 <karolherbst> or would they ahve to enter the barrier at the same time?

15:52 <cwabbott> control flow has to reconverge

15:53 <cwabbott> so I guess they have to enter "at the same time"

15:53 <karolherbst> mhh

15:54 <karolherbst> so an if without an else before the barrier would mean the threads are divergent, correct? (it's probably impossible to write ifs in a way you can make sure they converge after, but)

15:55 <karolherbst> it's the scoped_barrer inside __wrapped_interpolation_resample in https://gist.githubusercontent.com/karolherbst/cb6f569f72957700ecc3618f852c5d33/raw/6c0e8a6523ff49c483178a03710dbb44d2c81fc0/gistfile1.txt

15:55 <karolherbst> *scoped_barrier

15:55 <karolherbst> the last one actually

15:56 <karolherbst> ohh heck

15:56 <karolherbst> there is a break

15:56 <karolherbst> but I think that one is uniform

15:56 <karolherbst> anyhow.. I think that would be not legal in GL compute

15:56 <karolherbst> *GLSL

15:58 <karolherbst> it's the only kernel having that and the one added for scalings... so I wouldn't be surprised if that's indeed the issue

16:03 <karolherbst> ahh, glsl is quite clear: "Calls to barrier may not be placed within any control flow."

16:03 <karolherbst> in compute you can put them into the non main function, but that's it

16:04 <karolherbst> although calling functions is kind of control flow?

16:04 <karolherbst> I am confused

16:04 <karolherbst> anyway... I think to fix that for iris, we might have to converge threads around barriers if that doesn't happen automatically, no?

16:09 <karolherbst> do we have something like a warp sync or something?

16:10 <karolherbst> ahh, control_barrier

16:11 <jekstrand> karolherbst: yes

16:11 <karolherbst> mhh, but the execution scope is set to workgroup

16:12 <karolherbst> intrinsic scoped_barrier () (execution_scope=WORKGROUP /*4*/, memory_scope=WORKGROUP /*4*/, mem_semantics=ACQ|REL /*3*/, mem_modes=shared /*16384*/)

16:12 <jekstrand> karolherbst: I think there are rules in CL around this but they may be tricky and our structurization may not be aware of them.

16:12 <jekstrand> s/may/is/

16:12 <karolherbst> jekstrand: in CL you can place it anywhere

16:13 <karolherbst> so we have to make sure we converge the threads

16:14 <karolherbst> anyway.. the nir matches the OpenCL C code

16:14 <karolherbst> it's just that the threads diverge because of the if I think...

16:14 <karolherbst> not 100% sure

16:14 <karolherbst> yeah.. so the if depends on the thread id

16:15 <jenatali> karolherbst: "If the barrier is inside a conditional statement, then all work-items in the work-group must enter the conditional if any work-item in the work-group enters the conditional statement and executes the barrier."

16:15 <karolherbst> and the loop variable

16:15 <jenatali> Looks like has to be uniform control flow to me

16:15 <karolherbst> jenatali: I tihnk this is more a req to the runtime

16:15 <karolherbst> the runtime has to make sure that this happens

16:15 <jenatali> Hm? I'm looking at the CL C spec

16:16 <karolherbst> but the barrier is inside a loop, not an if

16:16 <karolherbst> "If the barrier is inside a loop, then all work-items in the work-group must execute the barrier on each iteration of the loop if any work-item executes the barrier on that iteration."

16:16 <karolherbst> which they actually do

16:16 <karolherbst> just not converged

16:16 <jenatali> Yeah that's the same thing

16:17 <karolherbst> jenatali: that's the loop btw: https://github.com/darktable-org/darktable/blob/23e006beca02be0d92f37b5c548c301facbdd785/data/kernels/basic.cl#L2971

16:17 <jenatali> If a loop iteration causes one thread to hit the barrier, then all threads have to hit the barrier on that iteration too

16:17 <jenatali> I.e. uniform control flow

16:17 <karolherbst> they all do

16:17 <karolherbst> just not converged

16:17 <jenatali> I don't know what you mean by "just not converged"?

16:17 <karolherbst> the if diverges control flow

16:18 <karolherbst> some threads might enter the barrier before others (as they are still inside the if)

16:18 <jenatali> Yeah but the barrier is outside of the if?

16:18 <karolherbst> but that kind of depends on how hw handles that stuff

16:18 <karolherbst> _but_ on nv those threads can diverge and run independently

16:18 <karolherbst> just not at the same time anymore

16:18 <jenatali> Yeah but that's the point of the barrier then, to stall the threads that ran ahead to wait for the other ones to catch up, isn't it?

16:19 <karolherbst> yes

16:19 <karolherbst> but that's not defined in GLSL

16:19 <karolherbst> in GLSL that would be not legal

16:19 <karolherbst> mhh, maybe in vulkan, but not in OpenGL at least

16:19 <jenatali> Why not? Isn't that convergent control flow at that point?

16:20 <karolherbst> doesn't matter

16:20 <karolherbst> it's control flow

16:20 <karolherbst> so it's not legal

16:20 <karolherbst> I had enough fun with that when writing the nir backend for nouveau

16:20 <jenatali> Oh, you're right, wow that's really restrictive

16:20 <karolherbst> so diverging _after_ a loop is enough

16:21 <karolherbst> ehh

16:21 <karolherbst> converging

16:21 <karolherbst> as barriers won't be inside one

16:21 <karolherbst> jenatali: well.. having to converge threads is expensive

16:22 <karolherbst> well.. not in itself, but it hurts perf

16:22 <jenatali> Sure. Which is why app developers should take care with barriers. At least that's what I thought

16:22 <karolherbst> yeah...

16:22 <karolherbst> well

16:22 <karolherbst> it's not so much that any of that itself is expensive, just compilers can be smart if there are no barriers inside loops

16:23 <jenatali> FWIW HLSL I believe allows barriers in non-divergent control flow

16:23 <karolherbst> ahh

16:23 <karolherbst> well GL compute allows them inside funcitons

16:24 <karolherbst> just not inside ifs and loops

16:24 <karolherbst> :(

16:24 <karolherbst> fun.. src/intel/compiler/brw_nir_lower_scoped_barriers.c

16:24 <karolherbst> ahh, it's just splitting them

16:25 <mlankhorst> danvet: can we disable gtt relocations on pre-ppgtt platforms?

16:25 <mlankhorst> https://gitlab.freedesktop.org/drm/intel/-/issues/5806

16:26 mszyprow_ has joined #dri-devel

16:26 <mlankhorst> Problem is now that we have removed pinning, pinning to ggtt may kill our existing vma

16:27 <karolherbst> ahhh

16:27 <karolherbst> yeah.. intel is wrong :)

16:28 <karolherbst> https://gitlab.freedesktop.org/mesa/mesa/-/blob/main/src/intel/compiler/brw_fs_nir.cpp#L3825

16:28 <karolherbst> that sounds like the assumption doens't hold for OpenCL

16:29 <karolherbst> ehh.. workgroup_size_variable is true though

16:31 mszyprow has quit [Ping timeout: 480 seconds]

16:31 <karolherbst> mhh

16:33 nchery has joined #dri-devel

16:33 <karolherbst> maybe I should mess with the darktable kernels a little and figure out if that's indeed the kernel causing issues

16:36 alyssa has left #dri-devel [#dri-devel]

16:36 <dcbaker> karolherbst: the `-Doptimization` flag is supported, but meson doesn't have anything like Rust's customizable profiles, and the `-Dbuildtype` is kinda sorta deprecated

16:36 <dcbaker> zmike: thanks!

16:36 gouchi has joined #dri-devel

16:36 <danvet> mlankhorst, I'm a bit lost?

16:37 <danvet> and I'm not sure what you mean with gtt relocations

16:38 <karolherbst> dcbaker: right... that was kind of my worry

16:38 <danvet> I think I have a vague idea, but it all sounds really scary

16:38 <dcbaker> karolherbst: that's an intentional design decision because of how much of a disaster customizable profiles are in cmake

16:38 <karolherbst> dcbaker: I think it makes sense to support it at least for "test", but...

16:39 <karolherbst> ahhh :( it's indeed that kernel causing issues

16:40 <karolherbst> if I just pipe through the input kernel, the output looks fine (but wrong)

16:41 <karolherbst> but the good news is.. besides that little detail.. rusticl is able to run darktable CL kernels :)

16:45 apinheiro has joined #dri-devel

16:46 <karolherbst> yeah.. it's that loop

16:47 <karolherbst> annoying

16:48 <danvet> agd5f, thx for spinning the patches, I dropped some comments since I guess you need even more work to really polish this :-)

16:48 <danvet> but yeah one step at a time and all that

16:49 gouchi has quit [Ping timeout: 480 seconds]

16:49 <danvet> agd5f, unfortunately I don't think lockdep can catch these, since it's another case of cross release dependencies

16:50 <danvet> also in practice probably impossible to hit

16:51 FireBurn has joined #dri-devel

16:54 apinheiro has quit [Ping timeout: 480 seconds]

16:54 <zmike> dcbaker: will probably have a final batch in the next couple days, which I guess should be just in time for the scheduled release next week

16:56 <agd5f> danvet, thanks. I think the proper fix is to not just send hotplugs even when we resume (which was good enough for system suspend), but to compare the presuspend display state with the postsuspend display state and only send a hotplug event if anything changed, but I haven't had time to page atomic into my head again recently.

16:57 <agd5f> danvet, preference on tree for those patches?

16:58 <danvet> agd5f, oh wherever you like most, either yours or drm-misc

16:58 <agd5f> thanks

16:59 <danvet> agd5f, so on rpm vs connector hotplug

16:59 <danvet> one annoying thing kinda is that you don't get interrupts anymore when the chip is fully off

16:59 <agd5f> right

16:59 <danvet> so strictly speaking we should put the detect logic into polling mode again (but maybe not too often)

17:00 <danvet> i915 has been trying to get all these corner rights, but it's a bit an endless thread

17:00 <danvet> i915 has it's own detect loop (mostly due to the interrupt storm issues)

17:00 <danvet> but might be worth to push some kind of function for this into probe helpers since it's not entirely trivial

17:01 <danvet> and then maybe you can drop the unconditional uevent from rpm resume and then maybe things even work?

17:03 <agd5f> danvet, that's the hope

17:14 angerctl has joined #dri-devel

17:19 Namarrgon has quit [Ping timeout: 480 seconds]

17:19 gawin has joined #dri-devel

17:20 jagan_ has joined #dri-devel

17:21 Duke`` has quit [Ping timeout: 480 seconds]

17:27 hch12907 has quit [Ping timeout: 480 seconds]

17:29 sdutt has quit [Ping timeout: 480 seconds]

17:30 Duke`` has joined #dri-devel

17:30 <jekstrand> karolherbst: Any new thoughts on my poor RTX 2060?

17:31 <karolherbst> jekstrand: I'd wait until Ben manages to publish his patches :D he promised me to do that last week though. At least last time I spoke to him (that was after you pinged me on the bug) he said, that's close to ready

17:31 <jekstrand> hehe, ok.

17:32 <karolherbst> mhh.. that kernel is annoying :( it's broken on llvmpipe as well

17:32 * karolherbst wants darktable to run perfectly

17:33 <karolherbst> jekstrand: you don't happen to know how all that barrier/thread converging/whatever stuff works on intel?

17:33 <karolherbst> although it could be something else in the kernel, it's just not a very complicated one

17:33 <karolherbst> just tons of shared mem stuff

17:33 <jekstrand> No, I don't

17:34 <karolherbst> heh.. wait...

17:34 <karolherbst> there are three local mem buffers

17:34 <karolherbst> I hope I didn't messed this up

17:35 <karolherbst> ahhhhh

17:35 <karolherbst> crap

17:35 <karolherbst> I think I found it

17:35 ppascher has quit [Ping timeout: 480 seconds]

17:35 <karolherbst> I don't offset the buffers :)

17:35 <karolherbst> so all three shared mem buffers start at 0x0

17:35 <karolherbst> that's not good

17:36 <karolherbst> (and how did the CTS not catch this)

17:36 <jekstrand> :D

17:39 <karolherbst> yay

17:39 <karolherbst> it works

17:40 <jekstrand> :+1:

17:45 <karolherbst> it feels slower than the CPU though

17:46 <karolherbst> yeah.. CPU gets it done close to instantly, but with rusticl it takes a while on iris

17:46 <karolherbst> oh well

17:47 <karolherbst> I hope that's because of debug builds

17:47 <jenatali> karolherbst: I had that same bug! :P

17:47 <karolherbst> noooooo :D

17:47 <karolherbst> it's a nasty one

17:47 <jenatali> https://github.com/KhronosGroup/OpenCL-CTS/issues/1208

17:48 ppascher has joined #dri-devel

17:48 <javierm> danvet: is this what you meant with your latest suggestion to include in the series to fix properly the uaf ? https://paste.centos.org/view/raw/8a065558

17:48 <javierm> danvet: if that's the case I'll post this and the fixes for efifb and simplefb drivers

17:49 <karolherbst> jenatali: I think they do have tests checking alignment on multiple buffers though, and I am sure my impl fails those now :D

17:49 rasterman has quit [Quit: Gettin' stinky!]

17:49 <karolherbst> at the time where I allocate the input buffer I already lost all information about types

17:49 <karolherbst> but I think I can store the alignment somewhere

17:50 Namarrgon has joined #dri-devel

17:50 <karolherbst> jekstrand: I am thinking about dropping the first local mem arg and insert a constant 0...

17:51 <karolherbst> could be a fun optimization

17:51 <danvet> javierm, yup

17:51 <javierm> danvet: cool, thanks for the confirmation

17:51 <jekstrand> karolherbst: Not sure what you mean

17:51 <danvet> javierm, feel free to include r-b: me right away

17:52 <karolherbst> uhm... actually I don't know if we can actually do that

17:52 <danvet> javierm, uh actually no

17:52 <karolherbst> API buffers have to come after internal and kernel buffers

17:52 <danvet> javierm, using file_fb_info is wrong, we do not want to recheck here at all

17:52 <danvet> otherwise file_fb_info will make this NULL after unregister_framebuffer, which is totally fine

17:53 <karolherbst> jekstrand: nvm.. my idea was just the first local* arg could be a constant 0, but that only works if the shader itself has a shared-size of 0

17:53 <danvet> so only the WARN_ON is needed really and should be enough

17:53 <karolherbst> or well.. I could insert shared-size as the constant

17:53 <karolherbst> so kernels with one local mem buffer don't have to load the offset we already know what it is at compile time

17:54 <danvet> javierm, wait my brain isn't working

17:54 <karolherbst> (and do pointer math)

17:54 <jenatali> That's not a bad idea for optimizations

17:54 <jenatali> We just eat the load of the offset for all of 'em

17:54 <karolherbst> yeah

17:54 <karolherbst> it's a small opt

17:55 <karolherbst> one could even reorder if there are multiple and constant fold it for the one with the most ops on the offset

17:55 mszyprow_ has quit [Ping timeout: 480 seconds]

17:55 <javierm> danvet: no, I think you are correct... hmm

17:56 <danvet> https://paste.debian.net/1239943/ this is what I meant

17:56 <danvet> javierm, ^^

17:56 <karolherbst> although I can see that for some hardware saving that one indirect doesn't mean much, but some other hw it might?

17:56 <danvet> calling kfree too early is the bug, not fb_destroy being called at the wrong time

17:56 angerctl has quit [Ping timeout: 480 seconds]

17:56 <danvet> javierm, if you want your patch you could also check with file_fb_info, but only in the WARN_ON, not in the actual code

17:57 <javierm> danvet: ahh, in framebuffer_release(), I thought you meant in fb_release()

17:57 <danvet> yeah naming isn't the most awesome with these :-/

17:57 <karolherbst> okay yeah.. it's slower than the CPU impl :(

17:57 <karolherbst> but intels runtime is slower as well

17:58 <danvet> javierm, actually for correct drivers we always expect file_fb_info() to return NULL from fb_release()

17:58 <karolherbst> I don't think rusticl is slower than intel here as well :D

17:58 <danvet> so checking there isn't any good I think?

17:58 <airlied> is it faster than llvmpipe? :-p

17:58 <karolherbst> yes it is

17:58 jagan_ has quit [Remote host closed the connection]

17:58 <javierm> danvet: yeah

17:58 <karolherbst> llvmpipe is unbearable slow here

17:58 <javierm> danvet: do you mind I get that diff and write a patch with your authorship and proper subject, commit message, etc ?

17:58 <karolherbst> not sure it the kernels are crappy or...

17:59 <danvet> javierm, it's a bit much for authorship, but if you feel like sure :-)

17:59 <danvet> sob: me <- so you don't have to forge it :-P

18:00 <danvet> javierm, also maybe test it, hold fbdev chardev open somehow and then unloading efifb or so should be easy to demo it

18:01 <javierm> danvet: yeah, I'll do it

18:01 <danvet> cat > /dev/fb/0 & ; rmmod efifib; kill %1

18:01 <danvet> or something like that

18:01 <javierm> was planning to boot the rpi4 anyways to test the simplefb and efifb patches

18:01 <airlied> karolherbst: unbearbaly slow should be about right

18:02 <daniels> karolherbst, jenatali: Panfrost for a while ignored the offset you passed in to EGLImage dmabuf import, which was great on planar YUV when your luma was luma and your chroma was also your luma

18:02 <jenatali> Heh, that's a good one

18:03 <karolherbst> airlied: sure.. but I compare non CL CPU with CL CPU

18:03 <javierm> danvet: and on the FBINFO_MISC_FIRMWARE topic, you forgot about OF/DT... the "simple-framebuffer" pdev is registered by OF core and bound to simpledrm

18:03 <jenatali> My "all offsets are 0" bug resulted in computed images that looked mostly correct except for random black spots, and since it was GPU workgroup shared memory causing the problem, it was basically impossible to debug

18:04 <javierm> danvet: so you could have OF -> "simple-framebufer" -> simpledrm -> "real dev" -> "real fbdev driver" that wants to kick out simpledrm fb

18:04 <javierm> danvet: happy to also ignore that case though if you think is not worth it...

18:05 <karolherbst> jenatali: yeah....

18:07 <danvet> javierm, maybe I only thought about it, but my idea was that in the remove_conflicting_fb loop we pull the "nuke sysfb device" case out from under the FBINFO_MISC_FIRMWARE check

18:07 <jekstrand> danvet: Where are we at on the fence reworks from König?

18:07 <danvet> jekstrand, dma_resv_usage you mean for rebasing dma-buf fence import/export?

18:07 <danvet> that fully landed

18:07 <jekstrand> danvet: Yup.

18:07 <danvet> but make sure you use latest drm-tip or things will go boom, there were bugs

18:07 <jekstrand> Sounds like I should go rebase patches.

18:07 <jekstrand> Ok, will do.

18:07 <danvet> +1

18:07 <jekstrand> I need to go find the patches....

18:07 <danvet> it should be a lot simpler with the new approach

18:07 <javierm> danvet: yes, but what I meant is that "simple-framebuffer" pdev in DT nodes are not registered by sysfb

18:08 <danvet> jekstrand, don't you love m-l development

18:08 <javierm> danvet: so nuke sysfb wouldn't help in that case

18:08 <danvet> javierm, uh

18:08 <danvet> javierm, can't we teach sysfb to recognize these?

18:09 <javierm> danvet: https://elixir.bootlin.com/linux/latest/source/drivers/of/platform.c#L543

18:09 <danvet> having drivers add a flag if they bind against sysfb device so that some other places knows which ones to nuke feels a bit silly

18:10 <danvet> javierm, like glue that of/platform.c code up with sysfb.c?

18:10 <danvet> I was kinda assuming that's already how it works, but I guess not

18:11 <jekstrand> danvet: The best/worst part is that I don't have a branch anymore to rebase so I've got to try to get patches to apply. :cry:

18:11 <javierm> danvet: that FBINFO_MISC_FIRMWARE flag is not really "this driver binds to a device registered by sysfb" but rather "this driver uses a firmware provided framebuffer, but I don't know how we got here"

18:11 <zmike> anholt: I've fixed the zink flakes in ci

18:13 <danvet> javierm, yeah and I'm kinda arguing for a bit more structure

18:13 <danvet> but inflicting structure onto fbmem.c is a lot of work :-(

18:13 <javierm> danvet: yes, I understand your point and agree but think that cleanup should be a follow-up

18:14 <danvet> javierm, yeah totally agreed, I think for now we can just go with "wont care"

18:14 <javierm> danvet: my point is that right now simpledrm fbdev is the only one of the firmware-provided fb that doesn't set FBINFO_MISC_FIRMWARE

18:14 <danvet> if you use simpledrm and an fbdev driver you just get the pieces

18:14 <javierm> danvet: that works for me too :)

18:15 devilhorns has quit []

18:15 <javierm> danvet: let's ignore it for now then But probably we want all the platform code to have a central place where "simple-framebuffer" pdev is registered

18:16 <danvet> javierm, yeah I think as a goal at least that sounds like a plan

18:16 <danvet> maybe also check it with gregkh

18:16 <javierm> danvet: Ok

18:21 angerctl has joined #dri-devel

18:21 * jekstrand hates that drm-tip rebases. All the old commits get lost. :(

18:22 <bnieuwenhuizen> jekstrand: I have rebased patches, sec

18:22 <bnieuwenhuizen> at least rebased on top of the dma_resv_usage work as of patchwork

18:23 <jekstrand> I found a sha where they apply

18:24 <airlied> jekstrand: it doesn't really rebase

18:24 <airlied> it regenerates from scratch everytime

18:24 <jekstrand> airlied: And that's better?

18:24 <airlied> there's another way?

18:24 <bnieuwenhuizen> jekstrand: https://github.com/BNieuwenhuizen/linux/commits/no-implicit-sync-import if that saves you any work

18:25 <airlied> don't base work on drm-tip if at all possible not to, base it on one of the trees included into drm-tip

18:27 Namarrgon has quit [Ping timeout: 480 seconds]

18:28 kts has joined #dri-devel

18:28 sdutt has joined #dri-devel

18:31 lynxeye has quit [Quit: Leaving.]

18:34 alyssa has joined #dri-devel

18:34 <alyssa> anholt: v3d has a magical incantation for allocating for scanout

18:35 <alyssa> format=RGBA8, width=1024, height = div_round_up(size, 4096)

18:35 <alyssa> (and pass to renderonly_scanout_for_resource)

18:35 <alyssa> if I'm not mistaken, there's nothing v3d specific in there (except maybe 4K pages but eh)

18:35 <alyssa> it's just appeasing non-Mesa consumers of the buffer

18:36 <alyssa> In that light, do you think it makes sense to move to a new renderonly API?

18:36 <alyssa> (I'll write the patch if you review and CI tests ;) )

18:39 <alyssa> panfrost has something similar, but worse.

18:40 <alyssa> and I think any driver supporting framebuffer compression needs something like it

18:40 alanc has quit [Remote host closed the connection]

18:41 alanc has joined #dri-devel

18:42 <airlied> tzimmermann: that mga function is probably xf86ModeBandwidth ported to the kernel

18:42 frieder has quit [Remote host closed the connection]

18:42 <tzimmermann> airled, thanks, i'll take look

18:42 <tzimmermann> airlied ^

18:43 <tzimmermann> jfalempe ^

18:44 jagan_ has joined #dri-devel

18:44 jkrzyszt has quit [Ping timeout: 480 seconds]

18:44 <airlied> yeah looks almost exactly like it

18:45 <airlied> https://gitlab.freedesktop.org/xorg/xserver/-/blob/master/hw/xfree86/modes/xf86Modes.c#L108

18:46 <tzimmermann> indeed, except for the returned value's unit

18:46 <tzimmermann> i guess, i'll add that comment to the kernel as well

18:46 <tzimmermann> again, thanks a lot

18:55 <airlied> tzimmermann: yeah fixed pt maths suck :-P

19:01 <alyssa> airlied: sucks less than fp! :p

19:03 <Sachiel> fp math is terrible, but fp math is worse

19:05 tzimmermann has quit [Quit: Leaving]

19:05 mbrost has joined #dri-devel

19:06 <alyssa> true

19:16 gawin has quit [Ping timeout: 480 seconds]

19:18 Namarrgon has joined #dri-devel

19:25 angerctl has quit [Ping timeout: 480 seconds]

19:25 mbrost has quit [Ping timeout: 480 seconds]

19:26 <karolherbst> jenatali: where are you parsing the required alignment of a local mem buffer passed as an kernel arg?

19:27 <karolherbst> because I think we don't have this information in nir

19:27 <jenatali> https://gitlab.freedesktop.org/mesa/mesa/-/blob/main/src/microsoft/clc/clc_compiler.c#L1140

19:27 <jenatali> You're right, we don't

19:28 <karolherbst> ahh.. you use the size

19:28 <jenatali> Yep. Good enough. Worst case it over-aligns but that's fine

19:28 <karolherbst> yeah...

19:34 soreau has quit [Read error: No route to host]

19:38 soreau has joined #dri-devel

19:38 <karolherbst> airlied: I am actually wondering.. does llvmpipe cache shader variants?

19:39 <karolherbst> because it sometimes feels like that llvmpipe does a lot of recompilations

19:45 mszyprow_ has joined #dri-devel

19:54 angerctl has joined #dri-devel

19:54 CATS has quit [Read error: Connection reset by peer]

19:55 CATS has joined #dri-devel

19:57 TMM has joined #dri-devel

19:59 <airlied> karolherbst: yeah it should

19:59 <TMM> hi all! I got myself an HP zbook with an Radeon pro W6600M in it and it appears that amdgpu doesn't like it very much. https://paste.centos.org/view/7ac88ec7 Is there someone here who would be willing to help me?

19:59 <airlied> but lots of things around imgs and samplers cause recompiles

20:01 Namarrgon has quit [Ping timeout: 480 seconds]

20:01 <karolherbst> airlied: what about variable local sizes?

20:01 <karolherbst> but okay.. if stuff around img and samplers can cause recompiles then that's a bit annoying :(

20:03 <TMM> If I disable 'hybrid graphics' it does work

20:07 <TMM> Perhaps it doesn't like the mux chip HP chose?

20:08 <TMM> It *seems* that what is happening based on the logs I see that amdgpu tries to read some kind of configuration from the GPU's vram but the card is perhaps powered off?

20:08 <jekstrand> danvet: drm-tip doesn't load i915 :-/

20:08 <daniels> alyssa: that isn't generic scanout uAPI

20:09 <daniels> alyssa: it's scanout uAPI that works if you know that your GPUs will only ever be integrated with two display controllers which can be satisfied by those constraints :P

20:09 <jekstrand> uh, what?!? I used a Fedora config and it didn't build i915?

20:10 <daniels> jekstrand: make modules

20:10 <alyssa> daniels: ah..

20:10 <daniels> alyssa: I can assure you I'd be doing more interesting things if it was :)

20:11 <alyssa> I admit I don't know what assumptions that makes

20:11 <alyssa> it seems to work for rockchip, at least

20:11 <jekstrand> daniels: i915 was turned off in the config. For whatever reason, when I pulled the Fedora config and did "make menuconfig" it ended up off.

20:11 <jekstrand> Maybe someone renamed an option?

20:12 <jekstrand> /o\

20:13 deathmist1 has quit [Remote host closed the connection]

20:13 deathmist1 has joined #dri-devel

20:15 <daniels> alyssa: as a lowest common denominator it's not the worst; as a universal axiom it really fails

20:15 <alyssa> hm, ok

20:15 <daniels> what's the problem you're facing that this would solve?

20:15 <daniels> currently we get away with assuming that the display controller is the most constricted, so kmsro allocating from there and importing to GPU is very likely to succeed

20:16 <daniels> but there are definitely cases where you want something more co-operative

20:16 <alyssa> so for... some... reason

20:16 <alyssa> the "GPU render ---AFBC---> display controller" path works by allocating a... dumb buffer of all things

20:17 <alyssa> (a dumb buffer on the display controller, imported to the GPU)

20:17 <airlied> jekstrand: make localmodconfig

20:17 <alyssa> but the dumb buffer path wants a format/width/height

20:17 <alyssa> so for AFBC, currently we make one up.

20:17 <alyssa> in particular, AFBC is "like" the regular image with an extra row

20:18 <alyssa> so panfrost allocates a dumb buffer of the regular image size (rounded up) plus a number of extra rows for the header blocks

20:18 <karolherbst> geekbench5 still crashes :(

20:18 <alyssa> that... whole dance is batshit

20:18 Duke`` has quit [Ping timeout: 480 seconds]

20:18 <alyssa> if we're going to be making up dimensions anyway, we might as well do it more consistently like v3d does

20:19 <karolherbst> jekstrand: okay.. back to that iris intel context crash.. is it plausible that a timeout would cause that if that happens like under a second?

20:20 <alyssa> At least the v3d way means the "gpu driver creates resource on the scanout device and imports it to the gpu" routine doesn't need to special case AFBC, it just decides a layout for the resource (using the common layout code which is extensively tested) with the same code as "allocate an internal resource on the gpu" and only differs in where it gets the BO from

20:20 <karolherbst> although it seems like that something about preemption doesn't really work regardless :(

20:20 Namarrgon has joined #dri-devel

20:20 <jekstrand> karolherbst: The timeout is something like 0.5s for a single compute job and then 5s for a batch.

20:21 sdutt has quit []

20:21 sdutt has joined #dri-devel

20:21 <karolherbst> jekstrand: oh wow.. that's not much

20:21 <karolherbst> is that something userspace can configure?

20:21 <alyssa> I don't actually care about fighting the WSI battle. I just want to get rid of the AFBC special case so I can extend AFBC support in the common panfrost surface layout code (shared with panvk, unit tested, written for correctness rather than happening to work)

20:21 <alyssa> and not have to add even more special cases to the GL WSI path

20:22 <karolherbst> but I did notice that with intels runtime my desktop doesn't get laggy, so maybe they either split up the work or... set some magic bit? dunno

20:23 <jekstrand> karolherbst: Nope

20:23 <karolherbst> huh...

20:23 orbea has quit [Read error: Connection reset by peer]

20:24 <karolherbst> now I am confused

20:24 <pepp> TTM: can you open a bug report (https://gitlab.freedesktop.org/drm/amd/-/issues)?

20:24 <karolherbst> what's intel doing to not hit that timeout then

20:27 angerctl has quit [Ping timeout: 480 seconds]

20:31 <TMM> pepp: was that for me?

20:31 <karolherbst> mhh, I might want to look into non uniform work group sizes at some point

20:32 <airlied> karolherbst: don't think it rebuilds for variable group size

20:34 <jekstrand> bnieuwenhuizen, danvet: Sent v13. Also available here: https://gitlab.freedesktop.org/jekstrand/linux/-/commits/dma-buf/sync-import-export

20:35 <karolherbst> airlied: okay

20:35 orbea has joined #dri-devel

20:39 <karolherbst> the heck intel, nobody can figure anything out from your code :(

20:40 <jekstrand> bnieuwenhuizen, danvet: WSI patches are also rebased. \o/

20:40 <jekstrand> karolherbst: wha?

20:40 <karolherbst> jekstrand: soo... I want to figure out what intel is doing so the context doesn't crash

20:41 <dj-death> karolherbst: trash the context, create a new one

20:41 <dj-death> karolherbst: keep going

20:41 <karolherbst> dj-death: ehh... no

20:41 <karolherbst> it's one huge compute job

20:42 <karolherbst> and they succeed with that

20:42 <karolherbst> the compute job is writing "128GB" of memory in a stupid way

20:42 <airlied> danvet, jekstrand : btw want to make sure you saw ckonig posted a series for fencing

20:43 <jekstrand> airlied: Yeah, I saw. Read the cover letter. Noodling.

20:43 <karolherbst> I wouldn't be surprised if they simply split it up and do multiple compute jobs

20:43 <dj-death> karolherbst: hmm don't know then

20:44 <dj-death> karolherbst: can you see a hang in the dmesg?

20:44 <karolherbst> I don't

20:44 <karolherbst> but probably splitting it up doens't even work, because it's only 64 threads in total

20:44 <airlied> then it's unlikely to be hanging it

20:45 <karolherbst> yeah.. that's my assumption as well

20:45 <karolherbst> they do something, I just don't know this something

20:46 <karolherbst> ehh, it's actually a bite more threads, guess I checked incorrectly

20:46 <karolherbst> threads in blocks: 64, blocks: 128

20:47 <karolherbst> so that would be possible to divide in small bits

20:47 <bnieuwenhuizen> jekstrand: do you also have WSI patches for importing semaphores/fences into the dmabuf or should I clean up mine?

20:47 <karolherbst> airlied: btw, that benchmark is so silly, that llvmpipe performs quite well: 128 GB in 323.0 ms (396.3 GB/s)

20:47 <karolherbst> the value is no lie

20:47 <jekstrand> bnieuwenhuizen: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11323

20:48 <jekstrand> bnieuwenhuizen: I just don't like it as much because it's potentially a lot of ioctls to merge all those sync_files if they use multiple wait semaphores. :-/

20:48 <jekstrand> I should rebase that one too

20:48 <bnieuwenhuizen> jekstrand: ah, for my patches I just take the fence after the dummy submit and use that as a single sync_file

20:48 <jekstrand> bnieuwenhuizen: Oh. Well, that works. :)

20:49 <jekstrand> bnieuwenhuizen: Yeah, if you could rebase that and throw it on top of the MR, that'd be great.

20:49 <jekstrand> I'll pull and test on ANV

20:49 <jekstrand> Not sure why I didn't think of that...

20:50 <bnieuwenhuizen> at this point in radv the dummy submit is just a merger based on timeline points on a per queue timeline semaphore anyway

20:51 <jekstrand> yeah

20:51 <karolherbst> is there a good way to figure out what intel is doing?

20:51 <jekstrand> ANV actually does an exec ioctl but it's trivial.

20:51 <jekstrand> karolherbst: What the Intel CL driver is doing?

20:51 <karolherbst> yeah

20:52 <jekstrand> karolherbst: They probably have mid-object preemption enabled which gets them 5s, not 0.5s.

20:52 <jekstrand> If I had to take a blind guess

20:52 <karolherbst> mhhh

20:52 <karolherbst> I don't think so, but let me try something

20:53 <karolherbst> ahh.. maybe that's indeed it

20:53 <karolherbst> got a "[6734410.537203] Fence expiration time out i915-0000:00:02.0:cl-mem[3129763]:4!" now

20:54 <karolherbst> still feels like they do something else

20:54 <karolherbst> "256 GB in 10422.9 ms (24.6 GB/s)"

20:54 <karolherbst> 512 reps == 512GB makes it timeout

20:57 <karolherbst> anyway.. is there an easy way for iris to turn that on?

21:01 rasterman has joined #dri-devel

21:06 mvlad has quit [Remote host closed the connection]

21:14 <agd5f> karolherbst, for long running compute work, it might be better to use the KFD user queues. Those can be context switched.

21:14 <karolherbst> agd5f: but that's amdgpu domain, isn't it?

21:16 <agd5f> karolherbst, you'd just need a new winsys for KFD

21:17 <karolherbst> I am confused... how would that help on iris? or is KFD some opaque term which would apply to i915 as well? I mean I know that people try to figure out how to do compute properly, but afaik it's all atm amdgpu only, no?

21:18 aswar002 has joined #dri-devel

21:19 <agd5f> karolherbst, sorry, was mixing up contexts. hadn't read enough of the backlog

21:27 kts has quit [Quit: Konversation terminated!]

21:28 Haaninjo has quit [Quit: Ex-Chat]

21:29 <danvet> airlied, I replied already

21:29 <danvet> airlied, jekstrand I'm not really clear on what he's trying to solve, and I think the one thing that's clear with adding umf is that all approaches suck one way or the other

21:30 <danvet> if it's just to make amd stack work with some new hw then I still don't get why we can't just add dma_fence with the old semantics on top of userspace memory fences

21:30 <danvet> and if the goal is to actually roll out umf for real, then use drm_syncobj since that has the right semantics already

21:31 <danvet> which means winsys/compositor work and everything instead of being really clever with shoehorning something into what we have that doesn't fit

21:32 deathmist1 has quit [Read error: Connection reset by peer]

21:39 deathmist1 has joined #dri-devel

21:50 mszyprow_ has quit [Ping timeout: 480 seconds]

21:54 lemonzest has quit [Quit: WeeChat 3.4]

22:03 mszyprow_ has joined #dri-devel

22:04 * jekstrand should blog about the new ioctls....

22:30 <HdkR> Wait, new ioctls?

22:30 <HdkR> jekstrand: Tell me more

22:32 <karolherbst> mhhh

22:32 <jekstrand> HdkR: "dma-buf: Add an API for exporting sync files (v13)" on dri-devel

22:32 <jekstrand> Weirdly, it's not showing up in patchwork or the ML archives

22:32 <karolherbst> somehow those darktable benchmarks are weird

22:32 <karolherbst> 1200% CPU, but using iris CL...

22:32 <karolherbst> either I do a crappy job or something weird is happening

22:35 <karolherbst> heh.. same with intels stack

22:36 <HdkR> Hm, didn't show up in my email either

22:36 pcercuei has quit [Quit: dodo]

22:37 <HdkR> But I see the DMA_BUF_IOCTL_EXPORT_SYNC_FILE_WSI

22:37 <jekstrand> That's the ioctl

22:38 <jekstrand> Maybe the cover letter didn't go through? I accedentally prefixed it with "*" which may have screwed things up. :joy:

22:41 <HdkR> Fantastic struct packing. No need for me to add a new dma-buf handler :D

22:41 <jekstrand> ?

22:41 <bnieuwenhuizen> jekstrand: I only have your cover letter

22:42 <jekstrand> bnieuwenhuizen: :cry:

22:42 <HdkR> jekstrand: Anything that touches ioctls I always need to check if the struct packing is hecked up between 32bit and 64bit

22:42 <jekstrand> HdkR: Right. Yeah, I know better. :)

22:42 mszyprow_ has quit [Ping timeout: 480 seconds]

22:43 <HdkR> Even the best people mess it up sometimes. I still need to implement an emulation path for aarch64 -> x86_64 because there is a struct that is messed up there :|

22:43 <danvet> jekstrand, way too late but just scrolled through your patches

22:44 * jekstrand re-sends

22:44 <danvet> I think all the work from könig was worth it, looks so much neater

22:44 <jekstrand> danvet: Yeah, it's massively simpler and obviously safe now.

22:44 <danvet> (or I just misrember what the old stuff looked like)

22:44 <jekstrand> The new patches are "drp... Yup. That's how that works."

22:44 <danvet> jekstrand, yeah now it should be a joy to review instead of just "uh ... do I want to really think this through"

22:45 <jekstrand> danvet: Review away then. :P

22:46 <karolherbst> heh

22:46 icecream95 has joined #dri-devel

22:46 <karolherbst> what's the point of this -d opencl thing if it ends up rendering on the CPU anyway

22:53 <danvet> jekstrand, done

22:53 * danvet ^Z now for real

22:54 <karolherbst> I am sure there is some nice blender stuff doing cl things, no?

22:54 <airlied> cl got removed from blender

22:54 <jekstrand> karolherbst: I think cycles still has a CL back-end but it's deprecated, last I knew. Worth trying if it's still there.

22:55 <karolherbst> ohhh shit, right

22:55 <karolherbst> "OpenCL support was removed in Blender 3.0." :(

22:55 <karolherbst> "Instead there are HIP and Metal backends." ....

22:55 <karolherbst> so you replace something useless by something even more useless

22:55 <jekstrand> Yup

22:55 <jekstrand> cuda, HIP, Metal. All the vendor lock-in APIs.

22:55 <karolherbst> we should make blender devs wanting to revert that decision

22:55 <karolherbst> that's my new life goal

22:56 <jekstrand> Yup

22:56 <jekstrand> That's one of my goals too!

22:56 <karolherbst> nice

22:56 <karolherbst> intel function callin when :P

22:57 * karolherbst to busy fixing multithreading on nouveau obviously

22:57 <karolherbst> but actually.. we should prototype it with llvmpipe

22:58 <karolherbst> I guess that wouldn't be _too_ painful

22:58 <karolherbst> I know Dave is busy atm so I won't ping him and ask how much work that would be

23:01 danvet has quit [Ping timeout: 480 seconds]

23:02 <karolherbst> okay... soo.. how to fix that shit with llvm...

23:04 * karolherbst thinks about requiring llvm-14 for rusticl

23:07 tursulin has quit [Ping timeout: 480 seconds]

23:09 morphis has quit [Ping timeout: 480 seconds]

23:09 morphis has joined #dri-devel

23:10 <karolherbst> I want to use opencl-c-base.h so hard

23:11 <karolherbst> printf caching disabled:

23:12 <karolherbst> opencl-c.h: 2 minutes

23:12 <karolherbst> opencl-c-base.h: 2 seconds

23:12 <jekstrand> Yeah...

23:12 <karolherbst> the only blocker is, that those vload/vstore_half APIs are missing

23:13 <karolherbst> and we need llvm-14, but.. that's just how it is

23:13 <karolherbst> but now that I have llvm and clang built locally...

23:14 <karolherbst> I am sure those builtins are somwhere lost, because the fp16 ext isn't enabled or whatever reason there might be

23:18 <anholt> alyssa: sorry, I have very little context for v3d any more. what's special about it?

23:18 <alyssa> anholt: the context is/was https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16334 which daniels already said is probably a bad idea for any driver but v3d \s/

23:21 CATS has quit [Ping timeout: 480 seconds]

23:21 <anholt> curious what daniels dislikes about it

23:21 <anholt> it doesn't give your display a chance to align stride for linear, I guess.

23:22 <alyssa> the width0=1024 bit, i think

23:24 CATS has joined #dri-devel

23:27 rasterman has quit [Quit: Gettin' stinky!]

23:29 <karolherbst> okay.. fixing that half stuff invovles tablegen

23:29 <karolherbst> I've heard it's a nice thing llvm uses

23:35 <daniels> anholt: yeah, telling your display controller that the width is 1024 when the width is 1920 is ... not ideal

23:36 <anholt> it's just the create_dumb. does anyone store anything during create_dumb? The only thing i know of anyone doing special there is aligning for linear.

23:36 <daniels> I'm not saying that current kmsro is the absolute ideal, but this is not a forward step

23:36 <daniels> anholt: it's create_dumb if your kmsro does create_dumb?

23:38 <anholt> I'll say it more explicitly: I believe that the only side effect of lying about the width with all current kmsro display hosts is that you don't get stride alignment for linear. do you know of another problem?

23:39 <daniels> the immediate thing that makes me twitch is the mainline KMS driver (might be OMAP now I think of it, but might not) that requires width aligned to 4096px (yes really) when you want the display controller to do rotation

23:40 <karolherbst> aaaannnnndddd fixed

23:41 <daniels> the long-term reflex that makes me think something better is possible is that it would be good to have an actual negotiation between GPU & display, rather than starting with kmsro's model of the display being the lowest common denominator so ignoring the GPU, then deciding that no wait actually the GPU is the lowest common denominator so let's ignore the display controller and lie to kmsro

23:41 <daniels> like if we're deciding that we don't want to be constrained by the display anymore, then a large chunk of kmsro can just disappear and we don't even have to bump soversion

23:42 <anholt> bump soversion?

23:42 <anholt> huh?

23:42 <daniels> I mean that kmsro is not ABI

23:43 <daniels> so it seems weird to go out of our way to be lying to kmsro about dimensions, rather than just explicitly hobbling kmsro to be unaware of dimensions

23:43 <karolherbst> https://github.com/karolherbst/llvm-project/commit/2b17270e1790e198167fa06f4630e93bde9d519d :3

23:44 <karolherbst> I am sure it breaks tons of other stuff

23:45 <anholt> I think that 3d driver doing layout makes a lot of sense for modifiers. If you've got a modifier, trust 3d, and ask your allocating device to allocate that many bytes. i think the flip side is if you have linear, we should probably have display allocate it since then it gets a chance to round up stride (dumb ioctl).

23:46 <anholt> though, you've still got the vc4 exception where vc4 generally has to be the allocating device

23:46 <anholt> so you can't just dumb allocate on display.

23:48 iive has quit []

23:50 <alyssa> "3d driver doing layout makes a lot of sense"

23:50 <alyssa> gotta say, I trust Panfrost's (now unit tested!) layout code far more than I trust every display driver under the sun that might be hooked up to Mali someday....

23:50 <jekstrand> Oh, really? You don't say....

23:51 <alyssa> jekstrand: Lol

23:51 <alyssa> I should fix that Rockchip display driver bug I found, it's only the second one so far from the same test...

23:51 <alyssa> It's admittedly pretty obscure

23:52 <alyssa> Requires doing something so bizarre as using a modifiers-aware compositor with a 4K display

23:53 <daniels> oh yeah, that should fail AddFB2

23:53 <daniels> but KMS has no way to express 'you can do this modifier, but not w>2560' to userspace

23:54 <daniels> anholt: ^ I'd like to say this is the reason I pushed back against that patch, but realistically it's just another corner case I forgot

23:54 <karolherbst> https://github.com/llvm/llvm-project/issues/55275

23:55 <anholt> daniels: you're thinking resource create should try an addfb and fail on failure of that?

23:56 <daniels> anholt: that's not what I said

23:57 <daniels> anholt: I'm saying that it suggests that Mesa's GPU allocation layer telling Mesa's display allocation layer that the width is always 1024, is not a great idea

23:58 <daniels> and given that the GPU<->display allocation layer (kmsro) is not a stable ABI which cannot be changed, that changing it where necessary beats lying to it where unnecessary

23:58 <daniels> especially where lying to it precludes making it actually function at all

23:58 <anholt> I'm confused how the kernel's knowledge about w>2560 would get up to mesa allocation here.

23:59 <anholt> I thought you were saying to addfb2 to test. or are you saying just have a little bit of display code in mesa that knows about tricks like that? (not opposed)

23:59 <daniels> like I said, KMS has no way to express that RK3399's display controller can do KMS but only for sub-2560 width

23:59 <daniels> so that's obviously a non-starter, and trying AddFB2 in resource_create is also quite silly