#dri-devel on 2022-09-27 — irc logs at oftc.irclog.whitequark.org

2022-08-14 19:45 ChanServ changed the topic of #dri-devel to: <ajax> nothing involved with X should ever be unable to find a bar

00:00 <mareko> anholt_: ^^

00:00 <ajax> i guess it depends how flexible your blit-at-the-end instruction is

00:00 <karolherbst> some old ones

00:00 <karolherbst> https://gitlab.freedesktop.org/mesa/mesa/-/jobs/29049739

00:00 <karolherbst> https://gitlab.freedesktop.org/mesa/mesa/-/jobs/29049742

00:01 <anholt_> sounds like runners dying or something?

00:01 <karolherbst> all on the same runner: 2605 (Jda81xmt) fdo-equinix-m3l-9

00:02 <karolherbst> https://gitlab.freedesktop.org/mesa/mesa/-/jobs/29048306

00:02 <anholt_> that runner has passed some other jobs in this timeframe

00:02 <karolherbst> https://gitlab.freedesktop.org/mesa/mesa/-/jobs/29048304

00:02 <karolherbst> and I have two more

00:02 <karolherbst> all the same runner

00:03 <karolherbst> and that's just the fails within the last two hours

00:03 <karolherbst> not sure if that's the runner being flaky or something else, but those are hte only hangs within that time

00:06 <anholt_> sorry, I don't have digging into this right now. hit up #freedesktop I guess.

00:07 <karolherbst> on my current MR all virgl jobs on other runners work normally :(

00:07 <mareko> can the runner be turned off?

00:08 <mareko> and removed from the CI

00:08 <karolherbst> I guess admins can?

00:10 <karolherbst> finally managed to get my job on a different runner...

00:11 <karolherbst> yeah.. and now it works

00:11 <karolherbst> after I restarted it like ~6 times it always hung on the broken one

00:13 <karolherbst> tarceri__: you want to restart the virgl jobs on your !18587 to move them to a different runner unless you want the stuff to timeout again

00:16 <karolherbst> but it's true.. not all jobs hand on that runner

00:16 <karolherbst> *hang

00:25 nchery_ has joined #dri-devel

00:27 nchery_ has quit []

00:28 nchery has quit [Read error: No route to host]

00:36 aravind has quit [Ping timeout: 480 seconds]

00:38 <mareko> does any other driver plan to enable glthread?

00:39 <karolherbst> I'd plan to do that in a newly written GL driver for nouveau

00:39 <mareko> nvc0 rewrite? or a fork for ampere/ada?

00:40 <karolherbst> nvc0 rewrite, but I doubt this is going to happen this year or next year... but I'd like to use more of those modern features (tc as well), but I don't think it's worth putting that effort into nvc0

00:41 <alyssa> karolherbst: which driver?

00:41 <alyssa> zink-on-nvk?

00:41 <alyssa> that newly written GL driver for nouveau?

00:41 <alyssa> I heard it's going to be a blast

00:41 <alyssa> supports tc

00:41 <karolherbst> :P maybe it will just be that

00:41 <karolherbst> though not sure how much work it would be to enable glthread

00:41 <karolherbst> nvc0 would probably benefit from it, because it currently busy waits on fences (uhhhh)

00:43 <karolherbst> I rebuilt llvm with a different flag and now stuff doesn't want to link against spirv-tools anymore....

00:43 <karolherbst> I am not sure if I should be mad or actually impressed

00:44 <mareko> supporting tc is recommended before glthread

00:45 <karolherbst> if a god exists, I am sure that one tries to convince me to just go stright with zink, and I don't listen

00:45 <karolherbst> figures...

00:45 <zmike> mareko: I've thought about it, but I don't have any data regarding always-on enablement

00:45 <mareko> zmike: generally, drawoverhead is worse, everything else is better

00:45 <zmike> it works fine when it's active though

00:45 <zmike> hm

00:45 <karolherbst> and I suspect supporting tc is a significant amount of work.. though maybe with fixed threading it wouldn't be too painful in nvc0

00:45 <karolherbst> dunno

00:46 <alyssa> karolherbst: as someone who really REALLY likes writing GL drivers

00:46 <alyssa> starting a new GL driver in 2022 for VK capable hardware makes increasingly little sense

00:46 <karolherbst> alyssa: you said you like writing GL drivers? I might have something for you there then

00:46 <zmike> mareko: I guess I'll test some stuff out, but probably not for a couple weeks because xdc

00:46 <alyssa> I have half a mind to scrap gallium/drivers/asahi

00:47 <zmike> it'd just be a one-liner for me

00:47 <mareko> tc isn't required for glthread

00:47 co1umbarius has joined #dri-devel

00:47 <karolherbst> yo.. but I was also thinking of supporting TC in rusticl, but after thinking about it I concluded it makes 0 sense

00:47 <mareko> you need 3 caps for good glthread perf: PIPE_CAP_ALLOW_MAPPED_BUFFERS_DURING_EXECUTION, PIPE_CAP_SIGNED_VERTEX_BUFFER_OFFSET, PIPE_CAP_MAP_UNSYNCHRONIZED_THREAD_SAFE

00:47 <zmike> hm

00:48 <zmike> not sure I have any of those

00:48 <zmike> time to make a ticket so I don't forget

00:48 <mareko> and ARB_buffer_storage

00:49 columbarius has quit [Ping timeout: 480 seconds]

00:49 <mareko> the first 2 caps are needed by u_vbuf and GL in general

00:49 <zmike> got that one

00:49 <karolherbst> nvc0 has PIPE_CAP_ALLOW_MAPPED_BUFFERS_DURING_EXECUTION

00:50 <karolherbst> PIPE_CAP_SIGNED_VERTEX_BUFFER_OFFSET seems to be not that much work, but maybe it would be? dunno

00:50 <karolherbst> PIPE_CAP_MAP_UNSYNCHRONIZED_THREAD_SAFE is probably a noop here

00:50 <karolherbst> but then again: performance and nouveau :'(

00:51 <zmike> will check pipe caps tomorrow, probably all no-ops

00:52 <mhenning> noob question: what is TC in this context?

00:52 <karolherbst> threaded context

00:52 <mhenning> ah

00:52 ngcortes has quit [Remote host closed the connection]

00:52 <karolherbst> basically serializes all pipe_context interactions and offloads it to a thread if I am not mistaken

00:54 <mareko> yes, but TC also behaves like a driver when it decides when to sync or not sync on buffer_map, etc. you could have a dumb buffer_map implementation in the driver and it wouldn't matter

00:55 <mareko> glthread is not that smart

00:56 <karolherbst> I already see it coming that I have to dig into TC one way or the other

00:59 Jeremy_Rand_Talos__ has quit [Remote host closed the connection]

01:00 Jeremy_Rand_Talos__ has joined #dri-devel

01:00 <karolherbst> llvm is the gift that keeps on giving....

01:01 <karolherbst> so I literally only changed -DLLVM_ENABLE_EXPENSIVE_CHECKS to NO and mesa links again...

01:08 chip_x has quit [Remote host closed the connection]

01:15 mhenning has quit [Quit: mhenning]

01:16 <alyssa> mareko: I wonder how "panfrost(no tc)" will compare in perf to "zink(tc)/panvk" ... ;-)

01:17 nchery has joined #dri-devel

01:22 yuq825 has joined #dri-devel

01:36 lemonzest has joined #dri-devel

01:42 ella-0_ has joined #dri-devel

01:45 ella-0 has quit [Read error: Connection reset by peer]

01:50 <mareko> zink+anything is always an interesting combination

02:16 pallavim has joined #dri-devel

02:24 Ristovski has quit [Remote host closed the connection]

02:25 Ristovski has joined #dri-devel

02:26 Leopold_ has quit [Remote host closed the connection]

02:46 jrayhawk has quit [Quit: Lost terminal]

02:48 jrayhawk has joined #dri-devel

02:54 Company has quit [Quit: Leaving]

02:56 pallavim has quit [Ping timeout: 480 seconds]

02:57 jewins has joined #dri-devel

02:58 bmodem has joined #dri-devel

02:58 <mareko> also angle+anything

03:00 <HdkR> That one is interesting in a different direction sadly

03:00 <mareko> is angle android-only?

03:01 <HdkR> No, it runs on pretty much everything

03:01 <HdkR> Windows, Linux, MacOS, ChromeOS, Android, Fuchhsia, and apparently iOS is coming up

03:01 <mareko> why a different direction then?

03:03 <HdkR> Seems to performance quite badly in real games unlike Zink

03:03 <HdkR> s/performance/perform

03:03 <mareko> I see

03:03 <HdkR> Also ES only means it is quite limited

03:04 <karolherbst> but it's also always clearing any resource so you don't have data leaks, no?

03:07 <HdkR> Hopefully only in a browser use-case then

03:07 <mareko> I would expect Zink to beat ANGLE in CPU overhead, but GPU overhead?

03:08 <HdkR> zmike: ^ Sounds like you need to find some GLES games to benchmark the two

03:09 <mareko> most GLES benchmarks are CPU-bound on big GPUs, so they are not useful for GPU overhead testing

03:09 <karolherbst> HdkR: question is, if there is a flag for it

03:09 <HdkR> yea

03:09 <karolherbst> but even on android you don't want apps to snoop memory of others

03:10 <karolherbst> but maybe on android drivers are also required to not leak data

03:10 <HdkR> ...Where it is fine on Linux?

03:10 <karolherbst> well.. nobody fixed it, so yeah, seems that way

03:10 <karolherbst> :P

03:10 <clever> karolherbst: that reminds me, i found a texture atlas cache in my minecraft folder years ago, in the undefined corners, i could see some memes i have browsed, and my irc client

03:10 <karolherbst> something something "inpractical attacks" something

03:10 <HdkR> Wayland fans over here crying

03:11 <karolherbst> yeah..

03:11 <karolherbst> if you allocate memory, it's not cleared

03:11 <karolherbst> it's a huge security issue these days, but nobody cares

03:11 <clever> karolherbst: more freaky though, if i reboot from windows to linux (dual boot), and then login, i briefly see the windows wallpaper on my desktop

03:11 <karolherbst> nice....

03:11 <mareko> radeonsi clears it if you're lucky to use DCC

03:11 <clever> thats not just uncleared memory, thats USING uncleared memory to render

03:12 <karolherbst> mareko: on every new allocation?

03:12 <mareko> only when DCC is enabled for that image

03:12 <karolherbst> mhhh

03:12 <clever> 01:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Bonaire XTX [Radeon R7 260X/360]

03:12 <clever> mareko: with the amdgpu driver

03:12 <karolherbst> guess that maleware snooping credit card infos via AI or something just won't clear it then :P

03:13 <karolherbst> though I suspect none of them would go to such lengths and not use GL/VK and goes a more direct route

03:13 <mareko> Bonaire doesn't have DCC, also if you see an old screen at boot, radeonsi isn't being used anyway, it's used after you log in

03:13 <clever> karolherbst: some examples: https://imgur.com/a/p5t650A

03:14 <karolherbst> well.. it's a kernel bug

03:14 <clever> i suspect a decent amount of that corruption is either the wrong tiling mode or the wrong stride

03:14 <karolherbst> yeah

03:14 <clever> and fragments being reused for other stuff

03:14 <karolherbst> or overwritten parts

03:14 <karolherbst> or something

03:14 <karolherbst> still.. it's a kernel bug and should be fixed

03:14 <clever> yeah

03:15 <karolherbst> (in the kernel)

03:15 <clever> i'm thinking, that the kernel should track what pages have been written to, and just deny reading back pages you havent written to

03:15 <clever> or redirect those reads to /dev/zero

03:15 <karolherbst> that's actually way too hard

03:15 <karolherbst> because that would require scanning command buffers and stuff

03:16 <karolherbst> the kernel allocates the physical memory anyway, and it could just track if it comes from a different process or not or something

03:16 <karolherbst> and jsut clear it if it does

03:16 <mareko> clever: try to put radeonsi_zerovram=true into /etc/environment and reboot

03:16 <mareko> it only works with amdgpu

03:17 <karolherbst> that doesn't fix the data leak problem though :P

03:17 <clever> i assume thats implemented in userland and just memset's?

03:18 <karolherbst> afaik the uapi has a flag to clear on allocation, but userspace can decide to not clear

03:18 <mareko> clever: yes, but the memset is in the kernel

03:18 <karolherbst> could probably patch the kernel to always clear

03:18 <karolherbst> probably a one line thing

03:18 <mareko> not recommended

03:18 <karolherbst> yeah.. it needs some tracking to be low overhead

03:18 <clever> i can see that wasting bandwidth

03:18 <mareko> our clear implementation is very very slow

03:18 <mareko> in the kernel

03:19 <karolherbst> I suspect you don't want to run shaders in the kernel

03:19 <clever> what if you have a list of image buffers, and when you allocate a buffer, tag it as un-initialized

03:19 <mareko> I do, but nobody has done it yet :)

03:19 <clever> but yeah, you still need to read the command stream, to know when something is rendered into the buffer...

03:19 <karolherbst> :D

03:19 <karolherbst> well.. in nouveau we actually use the accel engines, but we have that nice 2D engine still, so no need to actually run shaders

03:19 <clever> at least on a dumb core like v3d, the command stream operates entirely on physical addresses

03:20 <clever> so the kernel MUST parse the command stream, and replace object handles with addresses

03:20 <karolherbst> we still build command buffers and all that stuff though

03:20 <karolherbst> clever: yeah, but that's like super expensive

03:20 <karolherbst> CPU bound games won't like that

03:20 <clever> the rpi also has a nice sprite based 2d engine, you just give it the x/y/addr/stride of each image

03:21 <clever> and it will dynamically composite it on the fly, racing ahead of the electron beam

03:21 <karolherbst> anyway.. you only need to clear physical mem used by other processes and you could reuse memory for the same one and skip clearing

03:21 <karolherbst> that shouldn't be all too bad then

03:21 <clever> there is no framebuffer with the final image

03:21 <clever> only the input buffers

03:21 <karolherbst> anyway.. unless there is a nifty exploit and a high prio CVE filed, I guess nobody will fix it for real

03:22 <clever> in the case of the leakage i found in minecraft, i would need to first un-scramble the images

03:22 <karolherbst> that's why you use AI :P

03:22 <clever> but its running under the same user with xorg, so the screenshot api is available

03:22 <karolherbst> you need to parse the data anyway

03:22 <clever> so i could just ignore gl and capture the screen directly

03:23 <karolherbst> mhhhh... probably

03:23 <karolherbst> do a screenshot every second :P

03:23 * clever points to obs-studio

03:23 <karolherbst> I think it would be more impressive to do that from a flatpak or something

03:23 <karolherbst> have a shitty pay to win game you download from steam...

03:24 <karolherbst> and it just collects credit card info, because why not

03:24 <zmike> HdkR mareko: afaik it's the opposite: zink wins in gpu and overall but loses in cpu (due mainly to the total mismatch of gallium vertex api vs vulkan api)

03:24 <HdkR> zmike: Neat!

03:24 <zmike> on desktop anyway

03:25 <zmike> zink doesn't fully support tilers yet

03:25 <clever> karolherbst: surprisingly, i found a dead-simple way to gather online banking info, decades ago, when i was trying to only get my online banking app to work in my tablet, lol

03:25 <karolherbst> ...lol

03:25 <clever> the hard part isnt gathering the info, its doing something with it after you have it

03:26 <karolherbst> sure

03:26 <karolherbst> but people are sensitive to leaked credit card info data

03:26 <clever> basically, my local bank uses a google mapping library, to show nearby ATM's

03:26 <clever> and that mapping library must be installed by the android oem

03:26 <clever> amazon didnt include it on kindle

03:26 <clever> so, i decompiled the banking app, removed the atm map support, and recompiled it, and boom, it just worked

03:27 <clever> then i realized, how hard would it be to log the name/pw?

03:27 <clever> i bet there are other kindle fire users, who also want this app

03:27 <clever> pop it on amazons app store

03:27 <karolherbst> the problem isn't snooping data from your devices, it's more complicated once you start doing it to others :P

03:28 <clever> how well do they review their apps? could an outsider create an online banking app?

03:28 <karolherbst> nope

03:29 <karolherbst> well.. normally no, but some still get them in

03:29 aravind has joined #dri-devel

03:29 dakr has quit [Ping timeout: 480 seconds]

03:30 <karolherbst> thing is.. there are people checking those apps for fun and once you get noticed for being a fake app you get pulled

03:30 <clever> yep

03:32 a-865 has quit [Ping timeout: 480 seconds]

03:32 <clever> the other weirdness with that minecraft texture atlas

03:32 <clever> ive heard elsewhere, that opengl should just be generating the atlas automatically

03:33 <clever> so why is minecraft even having access to the full atlas, and why is it saving it to disk?

03:34 slattann has joined #dri-devel

03:35 yshui` has quit [Quit: Reconnecting]

03:35 yshui` has joined #dri-devel

03:35 <clever> they are also stored in several LOD variants, 512x512, 256x256, 128, 64, and even 32

03:36 <clever> the 32x32 atlas, is only 2x2 per texture!!

03:36 yshui` has quit []

03:36 yshui` has joined #dri-devel

03:39 <mareko> GL doesn't have any atlas

03:40 <clever> mareko: but might an implementation maybe batch several textures into an atlas to better utilize hw?

03:43 a-865 has joined #dri-devel

03:43 kts has joined #dri-devel

03:44 <mareko> nope

03:44 kts has quit []

03:45 * clever heads off to bed

04:05 aravind has quit [Ping timeout: 480 seconds]

04:05 aravind has joined #dri-devel

04:09 nchery has quit [Quit: Leaving]

04:10 pcercuei has quit [Quit: dodo]

04:38 slattann has quit [Quit: Leaving.]

04:39 slattann has joined #dri-devel

04:45 bmodem1 has joined #dri-devel

04:45 <airlied> alyssa: I do think you should scrap it

04:51 soreau has quit [Remote host closed the connection]

04:51 bmodem has quit [Ping timeout: 480 seconds]

04:52 soreau has joined #dri-devel

04:52 Duke`` has joined #dri-devel

04:59 heat has quit [Ping timeout: 480 seconds]

05:04 kts has joined #dri-devel

05:04 kts has quit []

05:05 tzimmermann has joined #dri-devel

05:07 tzimmermann_ has joined #dri-devel

05:07 tzimmermann has quit [Read error: Connection reset by peer]

05:14 soreau has quit [Remote host closed the connection]

05:15 soreau has joined #dri-devel

05:16 itoral has joined #dri-devel

05:23 soreau has quit [Remote host closed the connection]

05:24 soreau has joined #dri-devel

05:49 <mareko> would zink->dozen make sense?

05:53 <airlied> zink->moltenvk does, so why not

05:56 jewins has quit [Ping timeout: 480 seconds]

05:56 bmodem has joined #dri-devel

05:56 bmodem1 has quit [Read error: Connection reset by peer]

06:00 sdutt has quit [Read error: Connection reset by peer]

06:01 <mareko> zmike: FYI, radeonsi might close the perf gap with radv with this MR: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17109#note_1562239

06:02 gnuiyl has joined #dri-devel

06:17 danvet has joined #dri-devel

06:29 gouchi has joined #dri-devel

06:29 gouchi has quit [Remote host closed the connection]

06:37 jkrzyszt has joined #dri-devel

06:43 nchery has joined #dri-devel

07:04 a-865 has quit [Quit: ChatZilla 0.13 [SeaMonkey 2.53.13/20220704103305]]

07:08 tursulin has joined #dri-devel

07:26 mvlad has joined #dri-devel

07:38 <javierm> tzimmermann_: I wonder if there would be any value of posting https://paste.centos.org/view/raw/0992a7f3

07:39 tzimmermann_ has quit []

07:39 <javierm> tzimmermann_: I was trying to get a dma-buf shared buffer between a v4l2 and DRM device but for example `gst-launch-1.0 v4l2src io-mode=4 ! kmssink` wouldn't work without format conversion

07:39 tzimmermann has joined #dri-devel

07:40 <tzimmermann> javierm, of course there is.

07:40 <javierm> since most v4l2 devices output YUV and ssd130x will only accept XRGB, so you need to use ! videoconvert

07:40 <tzimmermann> but the code is wrong :)

07:40 <javierm> oh, is it?

07:40 <tzimmermann> the out_free label does mark the kfree statement

07:40 <javierm> ohh, indeed LOL

07:41 <javierm> tzimmermann: I meant to be before the kfree() but pasted in the wrong place

07:41 <tzimmermann> and the call to drm_gem_fb_end_cpu_access should probably come after the update_rect function; even though it's not required

07:42 <javierm> tzimmermann: I noticed that other drivers do end_cpu_access as soon as are done copying the buffer

07:42 <javierm> but sure, no strong opinion on when to do it

07:42 <tzimmermann> then just keep it like this, if you prefer it

07:43 <javierm> tzimmermann: yeah, I like to keep things consistent with other drivers. I think it makes it easier to read

07:44 <javierm> tzimmermann: I guess that if there would be a device that can output XRGB (i.e: a v4l2 mem2mem HW decoder maybe?) then you could use dma-buf to display without copying the buffers

07:44 <javierm> tzimmermann: which is also a valid use case for your simpledrm patch

07:45 <tzimmermann> yes

07:45 <tzimmermann> i'm busy ATM. if you post the patch, i'll review later

07:45 <javierm> tzimmermann: sure. Just wanted to know your thoughts about its value. Thanks!

07:52 rasterman has joined #dri-devel

07:57 camus has quit [Remote host closed the connection]

07:57 camus has joined #dri-devel

08:01 lynxeye has joined #dri-devel

08:04 Duke`` has quit [Ping timeout: 480 seconds]

08:05 swalker_ has joined #dri-devel

08:05 any1 has joined #dri-devel

08:06 swalker_ is now known as Guest1529

08:06 <tzimmermann> javierm, we should add such logic to all affected drivers IMHO

08:06 <javierm> tzimmermann: agreed

08:07 <javierm> tzimmermann: maybe you can post a patch to todo.rst? Seems to be a good janitorial task

08:07 pH5 has joined #dri-devel

08:07 swalker__ has joined #dri-devel

08:08 <tzimmermann> well, there's more to it IMHO. we'd ideally have new plane_helpers begin_access and end_access. those would be called from the commit tail before/after a plane update.

08:08 <tzimmermann> they'd do the drm_gem_fb_begin_access and the vmap things

08:09 <javierm> tzimmermann: makes sense

08:09 Lynne has quit [Remote host closed the connection]

08:10 <tzimmermann> we already have prepare_fb/release_fb. but release_fb is only called when the framebuffer is being replaced/removed on a plane. (i.e., at the next page flip)

08:10 <tzimmermann> that's too late for drm_gem_fb_end_cpu_access

08:11 <tzimmermann> maybe we should discuss such callbacks on dri-devel

08:12 <tzimmermann> i also made a atomic_enable for plane helpers; as we talked about recently. there aren't many users of this, but its nicer for the few that use it

08:12 <pq> swick, emersion, I don't really mind a libdisplay-info release without a high-level API as long as the low-level API is clearly documented as something that compositors should use only when nothing else is possible, and it you really need to use it, please consider hard about improving the high-level API first.

08:13 <any1> What exactly does the time stamp given to drmEventContext.page_flip_handler2 represent? I've noticed that it lags behind drm_vblank_event (according to perf record) by around 500µs on my system.

08:13 Guest1529 has quit [Ping timeout: 480 seconds]

08:15 bmodem has quit []

08:15 fahien has joined #dri-devel

08:15 <emersion> any1: it's supposed to be the time at which the first pixel of the frame hits the screen

08:17 <pq> emersion, swick, I have no objections if you want to mark the low-level API stable now and do a release. I believe it's a reflection of the EDID spec, so it's hard to imagine breaking changes, since I think you addressed extensibility fine. But also because I don't really know that API much.

08:17 Lucretia has quit [Ping timeout: 480 seconds]

08:18 Lynne has joined #dri-devel

08:19 <pq> emersion, swick, I'm fine with statically linking libdisplay-info into Weston, too, until it matures more.

08:19 bmodem has joined #dri-devel

08:20 <any1> emersion: Is there a similar event that can tell me when the kernel starts processing vblank?

08:21 <emersion> you mean, when the driver starts programming the hardware?

08:21 <emersion> wouldn't that be right at atomic commit time?

08:21 <any1> Yeah, that's probably a better place

08:21 Leopold_ has joined #dri-devel

08:22 <any1> mmmm, well, I'm wondering about the deadline for the gpu to finish rendering

08:22 <emersion> i don't believe any deadline is exposed at the moment

08:22 <vsyrjala> the deadline is when the event gets emitted

08:23 <vsyrjala> well, ignoring irq latencies/etc.

08:23 <any1> vsyrjala: That's what I thought. Thanks.. :)

08:23 <emersion> i mean, it's already too late when the event gets emitted, no?

08:24 Leopold_ has quit [Remote host closed the connection]

08:24 <emersion> if you define the deadline as "the absolute last instant when a page-flip can still make it"

08:24 <vsyrjala> yes, you need to guesstimate some kind of earlier useful deadline

08:25 <emersion> makes sense

08:25 <emersion> any1: i wonder if we can easily detect missed frames

08:25 <any1> emersion: we can

08:25 <emersion> any1: maybe with the presentation seq

08:25 <emersion> any1: also need to be careful about VRR

08:27 <any1> emersion: Maybe we can add 100µs to the delay every time that a missed frame is detected?

08:27 vliaskov has joined #dri-devel

08:28 Leopold_ has joined #dri-devel

08:28 <emersion> we probably should yeah. make sure the delay doesn't get out of hand, and have a plan to reduce back the delay...

08:29 <emersion> hm

08:29 <emersion> that sounds a bit complicated

08:29 <any1> I'd say cap it at 2ms

08:30 <emersion> I'd prefer not to assume 2ms represents anything

08:30 <emersion> because this behaves poorly with high refresh ratea

08:30 <emersion> rates*

08:31 <vsyrjala> could max (whatever that is) it on a miss, and then let it converge slowly towards some sweet spot?

08:31 <vsyrjala> i guess you'd still want the sweet spot to have a bit of buffer for small spikes

08:32 <any1> Or we could just use the time at which the flip callback is received and find out how much of a delay there is between the kernel irq and the callback being received on a few systems and just make that the constant which is subtracted

08:33 <any1> ... ignoring the timestamp given to the callback

08:33 <any1> For this purpose at least

08:34 <vsyrjala> the latency is dynamic

08:35 <any1> There's no reasonable upper limit?

08:36 <any1> The best thing to do would be to add an event that can tell us the time at which the irq happened, right?

08:37 <vsyrjala> well, there is irq latency which depends mostly on cpu c-states, someone could have disabled irqs for a bit which also causes latency, and to get the flip into the hardware you need to execute a bunch of code whch is subject to scheduler decisions/possibly lock contention/etc.

08:39 <vsyrjala> and cpu speed of course

08:40 <vsyrjala> the system isn't deterministic is what i'm saying i guess. and not all machines are the same

08:40 MajorBiscuit has joined #dri-devel

08:42 <any1> So, ideally, for a good estimate of the upper limit of the deadline, we would have to sample a sliding window of deadline times and calculate the maximum jitter over that window

08:42 <emersion> that's what my MR does, FWIW

08:44 <MrCooper> any1, emersion: FWIW, see clutter/clutter/clutter-frame-clock.c:clutter_frame_clock_compute_max_render_time_us in mutter for how it handles that

08:44 <emersion> mutter uses max(previous N render times) + 2ms

08:45 <MrCooper> and https://gitlab.gnome.org/GNOME/mutter/-/merge_requests/2500 for a simplification I'm proposing

08:45 <emersion> with a fallback of .875 * refresh period

08:45 <emersion> ah, thanks for the MR link, hadn't seen that one

08:45 <any1> emersion: Huh, I'll have to take a closer look at your MR. I thought we had established that there was no event which can give us the actual deadline.

08:45 <emersion> any1: i mean the sliding window thing

08:45 <MrCooper> right, though "render time" is actually the maximum of multiple values, mainly the GPU render time and CPU processing time

08:46 <emersion> MrCooper: CPU time can be included in the GPU time

08:46 <emersion> if you read the GPU clock at the start of rendering

08:46 <emersion> any1 had this simplification idea

08:49 <vsyrjala> you could calculate a timestamp for the start of vblank from the event timestamp. that would typically be the deadline to get the thing into the hardware

08:50 <vsyrjala> assuming you actualy got the dotclock you asked for (or close enough)

08:50 <jadahl> mdnavare: IIRC we concluded that i915 should configure itself so that VRR doesn't require modeset, but in a way that the maximum refresh rate is the one set as the selected mode. then we didn't agree on how to set modes with higher refresh rates, that would only be that high if vrr was toggled on

08:51 <any1> vsyrjala: How? :)

08:51 <emersion> vsyrjala: you mean clock_gettime()+refresh_period when we get the event?

08:52 <emersion> the event timestamp itself can be in the past or the future

08:52 <emersion> this is page_flip_handler we're talking about right?

08:53 <vsyrjala> the even timestamp is for the first active pixel. just subtract the vblank length from it

08:53 <emersion> vblank length?

08:54 <emersion> you mean pixel_clock * vblank_sync_width?

08:54 <emersion> err

08:54 <emersion> pixel_clock * vert_sync_width?

08:54 * emersion doesn't remember the vblank graph very well

08:55 <emersion> (docs are here https://dri.freedesktop.org/docs/drm/gpu/drm-kms.html#vertical-blanking)

08:56 <vsyrjala> somehthing like frame_time*(vtotal-vactive)/vtotal, keep substituting stuff until you have it in the form you like :)

08:56 <emersion> ahah

08:56 <any1> Perfect!

08:56 <vsyrjala> though vrr is probably going to screw you over :P

08:57 <emersion> i *think* it should be fine

08:57 <any1> Well, there's always the option of manually setting max_render_time when auto fails

08:57 <emersion> for VRR we just don't repaint if there's no damage

08:58 <emersion> ie, stop the rendering loop

08:58 <emersion> and resume it as soon as something is damaged

09:01 <any1> vyivel: I suppose that's really what I was looking for to begin with. Thanks!

09:02 <eric_engestrom> PSA: anyone who had an MR blocked on a failing docs job should try again; the issue should be fixed, ping me if not :)

09:02 <eric_engestrom> (credit where credit is due: fixed by lygstate)

09:05 <any1> err, s/vyivel/vsyrjala. Sorry for the ping vyivel :p

09:08 kts has joined #dri-devel

09:08 kts has quit []

09:44 YuGiOhJCJ has quit [Quit: YuGiOhJCJ]

09:48 nchery has quit [Remote host closed the connection]

09:49 nchery has joined #dri-devel

09:56 i-garrison has quit [Ping timeout: 480 seconds]

09:59 chipxxx has joined #dri-devel

10:00 chipxxx has quit [Remote host closed the connection]

10:00 chipxxx has joined #dri-devel

10:03 Lucretia has joined #dri-devel

10:05 TMM has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

10:05 TMM has joined #dri-devel

10:05 slattann has quit []

10:08 devilhorns has joined #dri-devel

10:28 whald has joined #dri-devel

10:47 Soroush has quit []

10:47 Soroush has joined #dri-devel

10:50 Company has joined #dri-devel

10:54 Leopold_ has quit [Remote host closed the connection]

10:54 Leopold_ has joined #dri-devel

11:01 Lucretia has quit [Ping timeout: 480 seconds]

11:02 Lucretia has joined #dri-devel

11:07 i-garrison has joined #dri-devel

11:09 fahien has quit []

11:15 chaim has joined #dri-devel

11:17 Lynne has quit [Quit: Lynne]

11:17 Lynne has joined #dri-devel

11:26 bmodem has quit [Ping timeout: 480 seconds]

11:29 <bl4ckb0ne> are the files in src/mapi/glapi/gen/ generated or hand written?

11:31 mwk has quit [Remote host closed the connection]

11:31 mwk has joined #dri-devel

11:43 kts has joined #dri-devel

11:43 kts has quit []

12:11 ahajda has joined #dri-devel

12:12 itoral has quit [Remote host closed the connection]

12:22 gawin has joined #dri-devel

12:24 lemonzest has quit [Quit: WeeChat 3.5]

12:25 srslypascal has joined #dri-devel

12:26 khfeng has joined #dri-devel

12:31 <khfeng> Lyude: hello, I think this may interest you: https://gitlab.freedesktop.org/drm/amd/-/issues/2171, seems to be similar to "drm/nouveau: Only release VCPI slots on mode changes"

12:35 srslypascal has quit [Ping timeout: 480 seconds]

12:38 srslypascal has joined #dri-devel

12:39 lemonzest has joined #dri-devel

12:43 alatiera has quit [Ping timeout: 480 seconds]

12:46 cengiz_io has joined #dri-devel

12:47 <pinchartl> when COLOR_ENCODING and COLOR_RANGE are not exposed by a plane, what is the expected default YCbCr encoding and quantization range ?

12:48 alyssa has left #dri-devel [#dri-devel]

12:49 <pq> pinchartl, I would have no idea. I think I just wouldn't even try to use it if I cared about those.

12:50 <pinchartl> ok :-)

12:51 <pinchartl> so it's best to be explicit in drivers

12:53 fxkamd has joined #dri-devel

12:54 <emersion> it would probably be worth it to expose these props even with a single possible enum value

12:54 <emersion> just to let user-space know what's up

12:56 bmodem has joined #dri-devel

12:59 <pinchartl> marex: I'm looking at the lcdif driver and the RGB to YUV conversion. the driver hardcodes BT.601 coefficients with a full quantization range. doesn't HDMI use limited range for YUV ?

13:01 <zmike> mareko: close the perf gap? but why would you make your driver slower?

13:07 <vsyrjala> pinchartl: default is limited. i *think* the ycc quantization range knobs should apply to all ycbcr formats though, so in some cases you can override it

13:08 <vsyrjala> on a related note, cta-861-h apparently makes selectable quantization range mandatory \o/

13:08 <pinchartl> oh my... :-)

13:08 <vsyrjala> now we just have to wait a few decades for actual imlementations catch up

13:09 <pinchartl> I'll start simple with just one supported option, and expose it to userspace through the COLOR_ENCODING and COLOR_RANGE properties. more options can be added later

13:11 <pq> sounds fine to me

13:11 kts has joined #dri-devel

13:12 <vsyrjala> lol. cta-861-h blames *sources* for not following the standard. we follow it in i915, and it's half the *sinks* that don't follow it

13:13 <vsyrjala> i wonder if we should really consider a quirk for it... the list will be massive

13:17 <vsyrjala> either that or we add more heuristics. but my only idea left is to check the monitor name descriptor for the string "TV"

13:23 kts has quit [Quit: Konversation terminated!]

14:03 <swick> pretty sure that some modes default to limited and some to full

14:05 Labnan[m] has joined #dri-devel

14:08 yuq825 has left #dri-devel [#dri-devel]

14:08 Surkow|laptop has quit [Ping timeout: 480 seconds]

14:15 <JoshuaAshton> fdo ded?

14:15 <swick> works for me

14:16 <JoshuaAshton> oh its back now

14:16 <JoshuaAshton> still cant push tho

14:16 <JoshuaAshton> remote: GitLab: Internal API unreachable

14:17 Surkow|laptop has joined #dri-devel

14:18 jewins has joined #dri-devel

14:22 Haaninjo has joined #dri-devel

14:26 mlankhorst has quit [Quit: Reconnecting]

14:26 mlankhorst has joined #dri-devel

14:32 dakr has joined #dri-devel

14:36 gawin has quit [Remote host closed the connection]

14:44 pcercuei has joined #dri-devel

14:56 chipxxx has quit [Read error: Connection reset by peer]

15:02 <illwieckz> hi! is there any hope to get this merged? https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18761

15:03 sdutt has joined #dri-devel

15:04 <zmike> I have a tab open for it

15:04 <zmike> but it got lost

15:10 <illwieckz> that happens to me as well! 😁️

15:12 Duke`` has joined #dri-devel

15:14 alanc has quit [Remote host closed the connection]

15:15 cengiz_io_ has joined #dri-devel

15:15 alanc has joined #dri-devel

15:16 cengiz_io has quit [Ping timeout: 480 seconds]

15:17 <mdnavare> jadahl: vsyrjala: So at init, always compute VRR parameters vmn, vmax, flipline based on the timings of the requested mode whenever a mode is requested. And only on the flip with VRR enabe requested is when we set vrr en reg and push to terminate vblank appropriately assuming now there is no mode change

15:20 Danct12 has quit [Remote host closed the connection]

15:22 bmodem has quit [Ping timeout: 480 seconds]

15:29 kem has quit [Ping timeout: 480 seconds]

15:35 tzimmermann has quit [Quit: Leaving]

15:35 <illwieckz> zmike, it looks like lavapipe now builds without patch, so I closed the MR, but I don't know what fixed it. mesa/src/vulkan/wsi/meson.build already had dep_libudev since almost a year so the bug was probably elsewhere

15:35 <zmike> illwieckz: yeah no idea

15:37 <eric_engestrom> illwieckz: I was also looking into that and couldn't figure out how the flag could be missing

15:37 <illwieckz> unfortunately I have not written the commit I was building when I opened the MR so I will probably never know

15:37 <illwieckz> git reflog should print the date when the commits are changed

15:37 devilhorns has quit []

15:38 <illwieckz> oh WAIT

15:38 kem has joined #dri-devel

15:39 <illwieckz> I may have a workaround in my env…

15:39 tobiasjakobi has joined #dri-devel

15:39 tobiasjakobi has quit [Remote host closed the connection]

15:40 srslypascal has quit [Ping timeout: 480 seconds]

15:41 <karolherbst> huh.. radeonsis nir to llvm code doesn't seem to understand "vec4 16 div ssa_22 = vec4 ssa_21.x, ssa_20.x, ssa_21.y, ssa_20.y" :(

15:41 <karolherbst> is there some lowering code I might have to urn?

15:41 <ajax> illwieckz: git-reflog takes all the options git-log does, try 'git reflog --pretty=medium'

15:41 <karolherbst> *run

15:41 <illwieckz> zmike, the bug is still there

15:42 <illwieckz> I had a *FLAGS workaround in my env

15:42 <ajax> or =full if you want both dates

15:42 gawin has joined #dri-devel

15:44 <illwieckz> ajax, thanks, that will be useful in other cases =)

15:45 <illwieckz> it looks like I won't bisect a fix today… as the bug is still there 😅️

15:46 <eric_engestrom> illwieckz, ajax: or `git reflog --date=iso` to just add the dates

15:47 <illwieckz> anyway ajax that seems to print the commit date

15:47 <illwieckz> eric_engestrom, is it the date of checkout?

15:47 swalker__ has quit [Ping timeout: 480 seconds]

15:47 <ajax> hmmmm.

15:47 <eric_engestrom> no, the date when HEAD was moved to that commit

15:47 <eric_engestrom> (I think? I'm not 100% sure actually)

15:47 <illwieckz> eric_engestrom, yes, that's what I was looking for, maybe I was not clear, thank you very much!

15:48 <illwieckz> the data I checkout a commit == the date when HEAD was moved to that commit

15:48 <illwieckz> the date*

15:48 <ajax> oh nice

15:48 <illwieckz> that's very good to know

15:49 <eric_engestrom> yeah, I just verified, it's the date when HEAD was moved

15:49 <illwieckz> that's good!

15:58 fab has joined #dri-devel

16:00 <eric_engestrom> illwieckz: are you on an old meson version? perhaps it's a meson bug, I seem to remember some issues with its handling of link_whole or something around that

16:02 whald has quit [Remote host closed the connection]

16:02 tursulin has quit [Ping timeout: 480 seconds]

16:07 MajorBiscuit has quit [Ping timeout: 480 seconds]

16:12 <illwieckz> eric_engestrom, I'm on meson master

16:12 paulk has quit [Remote host closed the connection]

16:12 paulk has joined #dri-devel

16:12 <illwieckz> I also suspected meson

16:15 DemiMarieObenour[m] is now known as DemiMarie

16:16 srslypascal has joined #dri-devel

16:18 fab has quit [Quit: fab]

16:26 <karolherbst> mareko: I have this annoying pattern going on and radeonsi trips on that because it requires vecX sources to be scalar: https://gist.githubusercontent.com/karolherbst/9814991fb091b3dfaec074d755c7d30b/raw/1bc59d16667c6305a57e11d1d8132e694c5d69ab/gistfile1.txt

16:26 <karolherbst> so it trips over that "vec4 16 ssa_24 = vec4 ssa_27.x, ssa_26.x, ssa_27.y, ssa_26.y"

16:26 <karolherbst> was wondering if making nir_to_llvm just handle swizzles on vec

16:27 <karolherbst> radeonsi seems to force 2 components for 16 bit values

16:35 heat has joined #dri-devel

16:39 ngcortes has joined #dri-devel

16:45 <mareko> how does it trip?

16:45 <mareko> LLVM assertion failure/crash?

16:48 <mareko> it wouldn't surprised me if store_global didn't fully support 16-bit types

16:52 <karolherbst> mareko: https://gitlab.freedesktop.org/mesa/mesa/-/blob/main/src/amd/llvm/ac_nir_to_llvm.c#L601 and the assert insode get_alu_src

16:53 <karolherbst> "assert(src.swizzle[i] < src_components);"

16:58 <qyliss> Is dri-devel the place to report a userspace regression? And is there anything I should check prior to reporting, apart from checking whether drm-tip fixes it?

16:58 <qyliss> (A userspace regression caused by a kernel change, that is)

16:59 <mareko> karolherbst: it's not immediately obvious to me why it trips, the code looks correct

17:00 gouchi has joined #dri-devel

17:00 gouchi has quit [Remote host closed the connection]

17:00 <karolherbst> ssa_27.y

17:00 <karolherbst> swizzle is 1

17:00 <karolherbst> src_compoennts is 1

17:00 <karolherbst> src_component is set to 1 because it's a vec4

17:01 <mareko> unpack_32_2x16_split_x should return vec1 16, this is a NIR bug

17:01 <karolherbst> nope

17:01 <mareko> no?

17:02 <karolherbst> nir_opt_vectorize makes it to vec2

17:02 <karolherbst> and radeonsi wants 16 bit things to be vec2

17:02 <karolherbst> dunno if we want nir_opt_vectorize to skip over vectorizing unpack_32_2x16_split_y , but at least nir is doing what it is told, no?

17:03 <karolherbst> si_vectorize_callback is the cb which deals with that

17:03 <mareko> I see

17:04 <mareko> ac_nir_to_llvm doesn't support vec2 unpack_32_2x16_split_x/y, and I wonder why it didn't crash there

17:04 <karolherbst> mhh.. good question

17:05 <karolherbst> I do have some other crashes and even GPU resets going on, so maybe it is indeed not caught

17:06 <karolherbst> could check for nir_op_unpack_32_2x16_split_x in the callback and keep it scalar for now

17:06 <karolherbst> and _y

17:06 <mareko> yes

17:08 <mareko> I think you need to build LLVM with LLVM_ENABLE_ASSERTIONS=ON to get a failure in unpack_32_2x16_split_* translation

17:08 <karolherbst> ahh probably

17:09 <karolherbst> I should enable that again

17:10 lynxeye has quit [Quit: Leaving.]

17:11 <FLHerne> qyliss: probably yes; some drivers have issue tracking under https://gitlab.freedesktop.org/drm

17:11 <karolherbst> okay cool.. keeping those scalar fixes the crash :)

17:11 soreau has quit [Read error: Connection reset by peer]

17:12 soreau has joined #dri-devel

17:14 jkrzyszt has quit [Ping timeout: 480 seconds]

17:14 <Lyude> khfeng: ack, will try to take a look today

17:14 jkrzyszt has joined #dri-devel

17:16 kem has quit [Ping timeout: 480 seconds]

17:16 <qyliss> FLHerne: ah, thanks

17:16 iive has joined #dri-devel

17:16 <qyliss> this is in dma-buf, so I guess there's no special issue tracker

17:20 aravind has quit [Ping timeout: 480 seconds]

17:23 neoXite has joined #dri-devel

17:24 kem has joined #dri-devel

17:30 <marex> vsyrjala: maybe you can implement some detection code in BPF ?

17:30 <marex> pinchartl: I wonder if that's what the YUV/YCbCr selection setup is for in the HDMI block control

17:30 TMM has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

17:30 TMM has joined #dri-devel

17:33 <gawin> if one of nir passes fails due to lack of pointcoords/texcoords, then is it better to debug why this doesn't work without them or try to implement them?

17:36 <pinchartl> marex: that may be related indeed, but I think the coefficients also need to be changed

17:37 <pinchartl> if I can figure it out, would you accept a patch that switches to limited range unconditionally ?

17:38 <marex> pinchartl: only if it contains a Fixes: tag

17:40 Danct12 has joined #dri-devel

17:43 <pinchartl> ok :-)

17:51 illwieckz_ has joined #dri-devel

17:52 illwieckz has quit [Read error: Connection reset by peer]

17:55 illwieckz_ has quit []

17:55 illwieckz has joined #dri-devel

18:00 <vsyrjala> marex: why is bpf hepful? does it have a working crystal ball in there somewhere?

18:02 soreau has quit [Remote host closed the connection]

18:03 soreau has joined #dri-devel

18:04 mbrost has joined #dri-devel

18:06 ybogdano has joined #dri-devel

18:07 warpme___ has quit []

18:23 Leopold_ has quit []

18:28 <bl4ckb0ne> if anybody has a few minutes to spare for a piglit test review pls https://gitlab.freedesktop.org/mesa/piglit/-/merge_requests/734

18:32 ybogdano has quit [Quit: The Lounge - https://thelounge.chat]

18:43 tobiasjakobi has joined #dri-devel

18:43 tobiasjakobi has quit []

18:56 <mdnavare> vsyrjala: I am going to start working on adding a debugfs node that taps into crtc_state->vrr.enable to indicate the status of vrr enable that userspace can tap on to check that when userspace requested VRR if it was turned on in the driver. Does this make sense?

18:57 <vsyrjala> is there some reason you wouldn't get vrr if you have a vrr monitor?

19:01 dakr has quit [Read error: Connection reset by peer]

19:03 <mdnavare> vsyrjala: No this is mainly for a bad userspace request to enable VRR on a non VRR monitor - Either we reject this or just indicate through crtc_state->vrr.enable

19:04 <vsyrjala> but what does it have to do with debugfs?

19:05 dakr has joined #dri-devel

19:05 <mdnavare> some way for userspace to tap in and check the status, or we could change the crtc vrr prop to be bidirectional and set that to indicate the status

19:05 <mdnavare> what would be your recommendation here

19:06 <mdnavare> since you suggested we should not reject the modeset

19:07 <vsyrjala> why would userspace want to look into debugfs to confirm that it did something?

19:07 <mdnavare> actually we would need to use the HW state readout since now we are thinking of switching to a design in i915 where we will always calculate VRR params and only enable in HW when requested directly in atomic tail

19:08 Leopold has joined #dri-devel

19:08 <mdnavare> vsyrjala: So can we just set a prop to indicate the HW state of VRR to indicate that back to UMD?

19:08 nchery_ has joined #dri-devel

19:09 <vsyrjala> again, what umd would want to know this and why?

19:11 <mdnavare> vsyrjala: Dont you think its a good idea to somehow let the userspace know that the request was ignored and actually the result is not as expected because now userspace is expecting something else and kernel has done something else

19:11 <mdnavare> vsyrjala: This simply came up through VRR negative testing

19:13 <vsyrjala> if userspace is stupid enough to expect vrr when there is no vrr monitor then i think it deserves to suffer

19:14 nchery has quit [Ping timeout: 480 seconds]

19:18 <emersion> mdnavare: fwiw, user-space can never rely on debugfs, because debugfs has no stable API promise and is only accessible by root

19:19 <emersion> IOW, debugfs is for debugging, but cannot be used for more

19:19 Guest1427 is now known as pmoreau

19:22 <mdnavare> emersion: Agree thanks!

19:22 <HdkR> Should mesa enable GLVND by default these days since most distros enable it by default, and users doing self-building tend to forget to enable it?

19:23 <mdnavare> vsyrjala: So you think kernel should not do anything to let userspace know that vrr was not enabled, no negative VRR testing needed in userspace?

19:24 <mdnavare> vsyrjala: May be we can just add this negative test internally in vrr IGT so we can check that kernel is doing the right thing

19:25 <mdnavare> vsyrjala: Okay on the other topic, I am going start making VRR kernel changes to always compute vrr params, ad then in atomic ommit tail is where it will check the vrr crtc prop and commit the VRR params to HW if this is set and then set push

19:25 <vsyrjala> for testing just issue a few flips to make sure vrr gets properly enabled/disabled?

19:26 <mdnavare> vsyrjala: well for testing we wanted to add a negative test that vrr doesnt get enabled if it is non VRR monitor but currently we have no means toc heck this

19:26 <vsyrjala> you can flip and see if you get vrr or not

19:27 gawin has quit [Ping timeout: 480 seconds]

19:27 <vsyrjala> no need to trust some software thing you read from somewhere when you can just probe the hardware behaviour

19:29 <emersion> you mean by checking the event timestamps i assume

19:30 <mdnavare> vsyrjala: So just issue a few flips and if the vblank event timestamps are all corresponding to same 60Hz and conclude vrr not enabled?

19:30 <emersion> similarly to async flip tests

19:30 <mdnavare> Yes emersion that is how I interpreted this, is this correct vsyrjala?

19:33 <anholt_> for anyone else: if you see "Can't make sense of .rodata section mapping" from valgrind, that's valgrind choking on valid mold output, rerun your build with another linker.

19:35 <vsyrjala> mdnavare: yeah, you can check the timestamps. and could also confirm the events were delivered at the right schedule more or less

19:35 <swick> emersion: have you looked at the CVT stuff yet?

19:35 <vsyrjala> starting to sound a bit like kms_flip...

19:36 <emersion> swick: nope

19:36 <mdnavare> vsyrjala: Yea cool got it, so no need for any kernel changes for negative testing, now the only change I have to work on is get rid of the crtc_state->vrr.enable , always compute params and in commit tail, check for crtc vrr enabled prop and then commit to HW and send push

19:36 cengiz_io has joined #dri-devel

19:36 <mdnavare> so that full modeset not needed

19:37 <vsyrjala> and someone needs to fix the bugs

19:37 <swick> emersion: alright, I'll take a look then. last base edid feature \o/

19:37 <swick> (manufacturer specifc stuff doesn't count :P)

19:39 cengiz_io_ has quit [Ping timeout: 480 seconds]

19:39 alyssa has joined #dri-devel

19:40 <alyssa> anholt_: giving noltis ffma a try on panfrost

19:41 cengiz_io has quit []

19:42 mbrost has quit [Ping timeout: 480 seconds]

19:43 <anholt_> alyssa: so, things feel a little fragile with trying to get it to happen at the right point in late algebraic

19:43 gawin has joined #dri-devel

19:43 <anholt_> some of which is: well, if you had those other late alg things expressed in noltis, then the right behavior would fall out without clever ordering.

19:44 <anholt_> but we do some stuff in algebraic that you can't do in noltis (repeatedly apply this transform to slide a constant up a chain of fmuls), so this is not a silver bullet.

19:44 warpme___ has joined #dri-devel

19:45 <alyssa> OK

19:45 <alyssa> (I'm still in the "information gathering" phase of NOLTIS opinion formation)

19:45 <alyssa> (So this is good to know)

19:46 jfalempe has quit [Ping timeout: 480 seconds]

19:51 alatiera has joined #dri-devel

19:52 <mdnavare> Thanks a lot vsyrjala and emersion for VRR discussion

19:52 <ajax> HdkR: probably

19:57 <HdkR> blammo, PR opened. Let the bikeshedding occur

19:59 <eric_engestrom> anholt_: for that .rodata problem, mold changed its behaviour in 1.5 to avoid this, so an alternative to changing linkers is to update it :)

20:01 <alyssa> noltis ffma is a loss on valhall with the simple cost function, will need to investigate why I guess (and if I need a more tailored cost function?)

20:08 oneforall2 has quit [Remote host closed the connection]

20:10 nchery has joined #dri-devel

20:10 oneforall2 has joined #dri-devel

20:15 <gawin> anholt_: amazing work, the amount of presubs on r300 is crazy

20:15 oneforall2 has quit [Remote host closed the connection]

20:15 jkrzyszt has quit [Ping timeout: 480 seconds]

20:15 nchery_ has quit [Ping timeout: 480 seconds]

20:15 <alyssa> Regressions because I set has_fsub and copied the late/ffma/late pattern from ntt which doesn't work

20:16 <alyssa> Reordered to ffma/late/late and the regressions go away

20:16 <alyssa> that is a *very* slight win over main

20:16 <alyssa> (but a loss compared to !18814)

20:18 <alyssa> I'm honestly unsure what more sophisticated cost function I could use here, 1 FMA + 1 MOV is usually better than 1 FMUL + 1 FADD and the backend has lots of smarts to avoid the MOV in common cases

20:21 mvlad has quit [Remote host closed the connection]

20:24 <alyssa> .

20:25 ngcortes has quit [Ping timeout: 480 seconds]

20:26 YuGiOhJCJ has joined #dri-devel

20:26 oneforall2 has joined #dri-devel

20:32 Haaninjo has quit [Quit: Ex-Chat]

20:44 <anholt_> alyssa: interesting that you saw much of a difference compared to !18814 -- it seems like it should get the same answer that that MR did for the case the MR was trying to solve?

20:45 <ccr> alyssa, perhaps you just need to ... (f)mull over this thing ..

20:46 <airlied> hmm stk has a 1kx1k DXT3 cubemap, I think this why llvmpipe dislikes it

20:46 <zmike> that's a big cubemap

20:46 <airlied> and compressed

20:46 ngcortes has joined #dri-devel

20:47 <airlied> for some reason two of the 12 threads I have here really hit decoding it

20:47 <anholt_> skybox?

20:48 <airlied> yeah probably

20:58 <airlied> I'd consider in the past just always decompressing dxt upfront to rgba, but then was sad about memory consumption

21:00 <pinchartl> marex: I figured out the difference between YUV and YCbCr in the LCDIF

21:00 <pinchartl> it's related to how the U/Cb and V/Cr values are interpreted

21:01 <pinchartl> for the LCDIF, YCbCr means that the Cb/Cr value range [0, 255] maps to [-0.5, +0.5] with 120 == 0.0

21:02 <pinchartl> while YUV seems to interpret the 8-bit value as a signed integer

21:03 <pinchartl> [0, 127] maps to [0.0, +0.5] and [128, 255] to [-0.5, 0.0[

21:03 <pinchartl> (give or take the off-by-one errors on those calculations)

21:03 <pinchartl> so what we need is YCbCr, not YUV

21:04 <pinchartl> furthermore, there's an error in the RGB <- YUV equations in the documentation of CSC0_CTRL

21:04 <pinchartl> the D1, D2 and D3 coefficients are added, not subtracted

21:06 <alyssa> anholt_: guess I should look at the shaders !18814 helps and see why noltis isn't figuring it out

21:06 <alyssa> I might still have a pass ordering problem, who knows

21:16 ybogdano has joined #dri-devel

21:16 <marex> pinchartl: nice

21:16 <marex> pinchartl: I am starting to feel like I cannot write a patch without bugs

21:16 <marex> :)

21:20 <pinchartl> do you know anyone who can ? :-)

21:20 <pinchartl> and the driver correctly uses RGB2YCbCr, there was no bug there

21:22 <marex> pinchartl: it might make sense to document the above findings in some comment to the patches you likely plan to submit

21:23 <pinchartl> yep, I'll do so

21:23 <marex> pinchartl: it seems like a useful information

21:23 <marex> thanks

21:24 Duke`` has quit [Ping timeout: 480 seconds]

21:25 danvet has quit [Ping timeout: 480 seconds]

22:09 <airlied> okay the current cache code doesn't little to make stk happier

22:30 ahajda has quit [Ping timeout: 480 seconds]

22:37 danilo has joined #dri-devel

22:37 eukara has quit []

22:39 dakr has quit [Ping timeout: 480 seconds]

22:39 danilo has quit []

22:39 dakr has joined #dri-devel

22:46 Leopold___ has joined #dri-devel

22:49 Leopold has quit [Ping timeout: 480 seconds]

22:59 lemonzest has quit [Quit: WeeChat 3.5]

23:01 iive has quit [Quit: They came for me...]

23:04 rasterman has quit [Quit: Gettin' stinky!]

23:06 vliaskov has quit [Remote host closed the connection]

23:12 lygstate has quit [Remote host closed the connection]

23:16 <airlied> oh wow stk eventually rendered a scene

23:18 cheako has quit [Quit: Connection closed for inactivity]

23:19 <FLHerne> hours per frame?

23:20 <airlied> yeah it might have been about 40 minutes

23:22 <airlied> I even can "drive" the car a little bit

23:24 <alyssa> airlied: have you tried nitro?

23:24 <alyssa> it speeds the car up a lot

23:24 <alyssa> hit "n"

23:24 <alyssa> should solve your performance problems

23:25 <airlied> alyssa: I think it caused a shader recompile :-P

23:25 <alyssa> okay but after that car go brrrr

23:33 bgs has joined #dri-devel

23:38 <ccr> :]

23:44 digetx has quit [Ping timeout: 480 seconds]

23:44 <zmike> airlied: so it sounds like you're hot in pursuit of that perf

23:44 gawin has quit [Remote host closed the connection]

23:47 <airlied> zmike: the perp^Hf got away from me

23:47 Leopold_ has joined #dri-devel

23:47 <airlied> by my metric of does it render eventually, this seems fine :-)

23:48 Leopold___ has quit [Ping timeout: 480 seconds]

23:48 <zmike> oof

23:50 <airlied> though it's wierd that every so often one or two of the fs rendering threads get so smashed, assuming they are rendering the skybox

23:52 <zmike> and here I was hoping I'd have an actual app to test

23:52 <airlied> the heaven demo is all anyone needs :-P

23:52 <zmike> does that actually work on sw?

23:53 <airlied> yes, it's how I wrote llvmpipe tessellation support

23:53 <airlied> and inspired overlapping

23:53 <airlied> overlapping took it from 2 spf to 1.5 spf :-P

23:54 <karolherbst> though I suspect llvmpipe could be make signfiicantly faster there, though not sure by how much, and one really would have to want improve perf there :P

23:57 <airlied> overlapping got most of the easy wins, once the frag shader hits memory bw it's hard to move the needle