#zink on 2025-03-04 — irc logs at oftc.irclog.whitequark.org

2024-07-16 04:51 ChanServ changed the topic of #zink to: official development channel for the mesa3d zink driver || https://docs.mesa3d.org/drivers/zink.html

00:08 <fdobridge> <redsheep> Thanks, now that I'm not going office space on a printer I can head home and test

00:09 <fdobridge> <gfxstrand> @zmike. No courage to review my damage MR? 😛

00:09 <fdobridge> <zmike.> look if I couldn't do math 3 years ago when my brain actually worked I'm not gonna be able to do it now when I can barely remember what I had for lunch

00:09 <fdobridge> <redsheep> Almost exactly getting PC load letter, felt like the twilight zone

00:11 <fdobridge> <gfxstrand> But I fixed it by deleting all your math. 😛

00:12 <fdobridge> <gfxstrand> Maybe we can con Ken into reviewing

00:12 <fdobridge> <zmike.> maybe contact mensa

00:12 <fdobridge> <zmike.> simple arithmetic seems like something they could handle

00:12 <fdobridge> <zmike.> but me?

00:12 <fdobridge> <zmike.> no way

00:13 <fdobridge> <gfxstrand> lmao

00:13 gfxstrand has joined #zink

00:14 <fdobridge> <gfxstrand> I poked Ken on IRC.

00:16 <fdobridge> <gfxstrand> He might have the courage to review damage code. Or the blind *faith* to just trust me. :frog_upside_down:

00:16 <fdobridge> <zmike.> I trusted @fooishbar once

00:16 <fdobridge> <zmike.> look where that got me

00:16 <fdobridge> <zmike.> I can't trust anyone

00:16 <fdobridge> <zmike.> not ever again.

00:18 <fdobridge> <gfxstrand> https://tenor.com/view/nick-furry-trust-trust-issues-winter-soldier-gif-22608292

00:19 <fdobridge> <zmike.> https://media3.giphy.com/media/v1.Y2lkPTc5MGI3NjExNzZuN3I2Y244eHV4MHdmZW45MHAwNHM2bW1vejBtc2pjcHBtNndodCZlcD12MV9pbnRlcm5hbF9naWZfYnlfaWQmY3Q9Zw/1012ANeLJ8unAY/giphy.gif

00:19 <fdobridge> <zmike.> any time I have to think about math now

00:22 <fdobridge> <ermine1716> My maths major isn't helping >_<

00:22 <fdobridge> <zmike.> I used to be a math major in another lifetime

00:22 <fdobridge> <gfxstrand> It wasn't even really math

00:22 <fdobridge> <zmike.> are there math operators involved?

00:23 <fdobridge> <ermine1716> There are words like "union" and "intersection"

00:23 <fdobridge> <zmike.> yeah also trying to sneak that entire patch adding a new math function in

00:23 <fdobridge> <ermine1716> Smells math

00:23 <fdobridge> <zmike.> what a load of bull hockey

00:23 <fdobridge> <gfxstrand> Can you spot the bug?

00:23 <fdobridge> <gfxstrand> ```C++

00:23 <fdobridge> <gfxstrand> res->damage.extent.width = u.y1 - u.y0;

00:23 <fdobridge> <gfxstrand> res->damage.extent.height = u.x1 - u.x0;

00:23 <fdobridge> <gfxstrand> res->damage.offset.x = u.x0;

00:23 <fdobridge> <gfxstrand> res->damage.offset.y = u.y0;

00:23 <fdobridge> <gfxstrand> ```

00:23 <fdobridge> <gfxstrand> Can you spot the bug?

00:24 <fdobridge> <gfxstrand> ```C++

00:24 <fdobridge> <gfxstrand> res->damage.extent.width = u.y1 - u.y0;

00:24 <fdobridge> <gfxstrand> res->damage.extent.height = u.x1 - u.x0;

00:24 <fdobridge> <gfxstrand> res->damage.offset.x = u.x0;

00:24 <fdobridge> <gfxstrand> res->damage.offset.y = u.y0;

00:24 <fdobridge> <gfxstrand> ``` (edited)

00:24 <fdobridge> <zmike.> smh trying to drive this full load of bull hockey right past my nose

00:24 <fdobridge> <zmike.> THAT'S MATH

00:26 <fdobridge> <ermine1716> Wild uneducated guess: wrong offset? Something about those egl origins

00:26 <fdobridge> <zmike.> well I'll tell you the wrong answer

00:26 <fdobridge> <zmike.> the wrong answer is that it's using y coords for width and x coords for height

00:26 <fdobridge> <ermine1716> Also width and height are messed up

00:26 <fdobridge> <zmike.> so it's not that

00:27 <fdobridge> <ermine1716> Okay

00:27 <fdobridge> <zmike.> has to be something more clever

00:27 <fdobridge> <zmike.> something that requires at least a PhD to figure out

00:27 <fdobridge> <zmike.> check for acrostics

00:27 <fdobridge> <zmike.> or maybe one of those diagonal messages in the letters

00:36 <fdobridge> <gfxstrand> Took a while but I think I hit another one. 😖

00:41 <fdobridge> <gfxstrand> The good news is that they're getting harder and harder to hit.

00:42 <fdobridge> <gfxstrand> The bad news is that that also means they're getting harder and harder to debug. :frog_upside_down:

00:43 <fdobridge> <gfxstrand> I think there was a bug in my async patch

00:44 <fdobridge> <gfxstrand> But also, IDK how to actually hit it bsides run FF and surf the web for a while

00:45 <fdobridge> <gfxstrand> In any case, my all-the-fixes branch has the latest version (as does the MR) so people should just pull that fresh.

00:45 <fdobridge> <gfxstrand> IDK how long I'll still be at my PC. I should re-heat supper

00:46 <fdobridge> <Owo> why is the width y and height x :ferrisballSweat:

00:46 <fdobridge> <Owo> who wrote this

00:46 <fdobridge> <zmike.> I'm telling you that can't be the problem

00:47 <fdobridge> <zmike.> there's no way something like that could've slipped past both me and @fooishbar

00:47 <fdobridge> <gfxstrand> Clearly, it needed a math PhD to figure out.

00:50 <fdobridge> <Owo> unrelated, kind of. Is there a way I can disable damage regions in zink

00:50 <fdobridge> <Owo> https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27867/

00:50 <fdobridge> <Owo> didn't see anything here

00:51 <fdobridge> <Owo> I wanna do some testing with having mutter re-render the ENTIRE surface

00:51 <fdobridge> <gfxstrand> Just put a `return;` at the top of the function

00:51 <fdobridge> <Owo> ~~forgot to add a "without patching the code" clause~~

00:51 <fdobridge> <gfxstrand> But also that doesn't guarantee anything. The damage is just a hint. It doesn't actually make mutter re-draw

00:52 <fdobridge> <gfxstrand> In order to do that, you need to disable the buffer age extension

00:52 <fdobridge> <Owo> if I disable passing damage regions in zink, mutter *should* redraw everything, unless it does a diff on the buffers on its end, right?

00:53 <fdobridge> <gfxstrand> You need to disable the EGL_buffer_age extension

00:53 <fdobridge> <Owo> grr

00:53 <fdobridge> <Owo> lemme take a look

00:53 <fdobridge> <gfxstrand> platform_wayland.c:3093

00:54 <fdobridge> <Owo> no environment variable I can use to disable them without recompiling?

00:54 <fdobridge> <gfxstrand> I don't think so

00:54 <fdobridge> <gfxstrand> Or at least not that I'm aware of

00:55 <fdobridge> <Owo> maybe I should make that one of my first few Mesa PRs :wires:

00:56 <fdobridge> <gfxstrand> Also, I think we still have some sort of buffer_age/damage issue. It's way more minor but I'm still sometimes seeing my screen go back by a frame or two.

00:58 <fdobridge> <redsheep> I'm so glad you're looking at these because you're describing exactly the things that have been the most annoying on nvk+zink for like... probably the whole last year

00:58 <fdobridge> <Owo> hmmm

00:58 <fdobridge> <Owo> https://docs.mesa3d.org/envvars.html#envvar-MESA_EXTENSION_OVERRIDE

00:58 <fdobridge> <gfxstrand> Right now I'm just poking around, waiting to see if I get another hang.

00:59 <fdobridge> <gfxstrand> I think that's for GL extensions, not EGL

00:59 <fdobridge> <Owo> *ah*

00:59 <fdobridge> <gfxstrand> The firefox issue really gave me something to latch onto

01:00 <fdobridge> <Owo> so I should add something like EGL_EXTENSION_OVERRIDE with the same formatting as MESA_EXTENSION_OVERRIDE

01:00 <fdobridge> <gfxstrand> Sure. Why not?

01:00 <fdobridge> <Owo> I'll take a look then

01:01 <fdobridge> <Owo> currently having some GBM issues with flatpak, so can't test zink stuff right now

01:01 <fdobridge> <Owo> (NixOS sets GBM stuff and flatpak doesn't override it :CatAngy:)

01:02 <fdobridge> <redsheep> Yeah involving flatpak when you're trying to debug driver stuff is not a terribly comfortable experience, it's not just nix

01:02 <fdobridge> <Owo> flatpak is fine, since I just build mesa manually and set FLATPAK_GL_DRIVERS to the name of my custom package

01:02 <fdobridge> <Owo> I did that for asahi before

01:02 <fdobridge> <Owo> https://github.com/orowith2os/mesa-asahi-flatpak

01:02 <fdobridge> <Owo> Asahi Linux is still using this as a bse ^^

01:02 <fdobridge> <Owo> *base

01:03 <fdobridge> <Owo> NixOS is weird here because, i think, I'm using mesa-git from chaotic-nyx and I'm pretty sure it's setting the GBM variables itself

01:04 <fdobridge> <Owo> unless I want to override the nixpkgs mesa package, and then I rebuild the world

01:05 <fdobridge> <gfxstrand> Is Firefox going to survive a suspend cycle? We're about to find out!

01:05 <fdobridge> <gfxstrand> It did!

01:06 <fdobridge> <gfxstrand> There was some SIGPIPE I had to continue through but it survived.

01:10 <fdobridge> <Owo> ah, Electron my beloved.

01:10 <fdobridge> <Owo> MESA: error: ZINK: vkCreateInstance failed (VK_ERROR_INCOMPATIBLE_DRIVER)

01:11 <fdobridge> <Owo> nope, nevermind, Firefox falls back to sw too.

01:12 <fdobridge> <gfxstrand> nix being cursed again?

01:12 <fdobridge> <Owo> this time, nope

01:12 <fdobridge> <Owo> booted into stable-mesa

01:12 <fdobridge> <Owo> env vars are fine

01:12 <fdobridge> <Owo> mesa-git is set

01:12 <fdobridge> <Owo> MESA_LOADER_DRIVER_OVERRIDE is set to zink

01:14 <fdobridge> <redsheep> I feel like we're well on our way to a blogpost full of memes

01:14 <fdobridge> <redsheep> https://cdn.discordapp.com/attachments/1209954375766908948/1346289640688320555/image.png?ex=67c7a56f&is=67c653ef&hm=47087895808eb5d662dffd52968ce927a4c7a2f2ccecbeb4fc6ce592c6661c19&

01:14 <fdobridge> <redsheep> About like, thanos snapping the zink bugs in half a few times

01:16 <fdobridge> <redsheep> At least when it comes to sessions and ui

01:17 <fdobridge> <Owo> ugh

01:17 <fdobridge> <Owo> even unsetting all the variables, firefox is still on sw

01:22 <fdobridge> <gfxstrand> Actually, I'm starting to suspect this one is an sRGB issue. Usually when I see a flash, it's mostly the right thing, just not quite right. That doesn't scare the hell out of me...

01:22 <fdobridge> <Owo> oh. haha.

01:22 <fdobridge> <Owo> Flatpak didn't have any GL drivers installed.

01:33 <fdobridge> <Owo> and it still doesn't work

01:33 <fdobridge> <Owo> whyyyyyy

01:33 <fdobridge> <Owo> Vulkan works fine

01:36 <fdobridge> <mhenning> what are you running under flatpak? I don't actually know how to get flatpak to use a user-built graphics driver

01:37 <fdobridge> <Owo> uhm, everything

01:37 <fdobridge> <mhenning> I tend to avoid flatpak for that reason

01:37 <fdobridge> <Owo> you just build it yourself under org.freedesktop.Platform.GL and set FLATPAK_GL_DRIVERS to the name

01:37 <fdobridge> <Owo> I got it working, `flatpak update` worked

01:37 <fdobridge> <Owo> turns out I needed the 23.08 branch, Firefox was still on that

01:38 <fdobridge> <mhenning> Okay, as long as you know you need to do additional steps for flatpak to get the new driver

01:39 <fdobridge> <Owo> everything app-wise on my system is running under Flatpak except a system resources monitor (which has some issues under flatpak due to glibc shenanigans)

01:39 <fdobridge> <Owo> well, it's working now

01:39 <fdobridge> <Owo> let me go build that zink branch

01:42 <fdobridge> <Owo> If anybody ever wants help with Flatpak bs btw, just @ me.

01:42 <fdobridge> <Owo> The situation is either underdocumented (and I probably have docs saved in my brain or smth), or `flatpak update`

01:49 <fdobridge> <Owo> https://cdn.discordapp.com/attachments/1209954375766908948/1346298559951147079/image.png?ex=67c7adbd&is=67c65c3d&hm=defcf81d8d5d5a0a8c3d76c5b6a5cd43ba61bc482c735fec26ddf9f18ff3ab25&

01:49 <fdobridge> <Owo> :hammy:

01:51 <fdobridge> <gfxstrand> Okay, Wayland explicit sync is hanging again. 😫

01:56 <fdobridge> <Owo> ugh

01:56 <fdobridge> <Owo> building complains about pyyaml

01:57 <fdobridge> <Owo> I think I got it now

01:57 <fdobridge> <gfxstrand> Okay, the hang in seeing is Zink not handling a DEVICE_LOST well. That's a different issue...

01:59 <fdobridge> <gfxstrand> Switching back to iris for the moment

01:59 <fdobridge> <gfxstrand> At least now I'm not freaking out about my MRs

02:38 MoeIcenowy has quit [Quit: ZNC 1.8.2 - https://znc.in]

02:49 MoeIcenowy has joined #zink

03:19 <fdobridge> <Owo> @gfxstrand it's working!

03:19 <fdobridge> <Owo> https://cdn.discordapp.com/attachments/1209954375766908948/1346321099792584786/image.png?ex=67c7c2bb&is=67c6713b&hm=19ac68bba252477e998e81d420293186c1a52ded3bc1ca9028b7af2e2aaa44dc&

03:19 <fdobridge> <Owo> thanks so much!

03:19 <fdobridge> <gfxstrand> Hooray! 🎉

03:21 <fdobridge> <Owo> OH MY GOD

03:21 <fdobridge> <Owo> THE EXPLICIT SYNC THING IS GONE TOO

03:21 <fdobridge> <Owo> IT DOESNT CRASH

03:30 <fdobridge> <Owo> https://gitlab.freedesktop.org/mesa/mesa/-/issues/12194#note_2807958

03:30 <fdobridge> <Owo> @redsheep @ermine1716 if yall want to test Firefox on your end with the fixes ^^

03:31 <fdobridge> <Owo> (or just strip the sources from the linked file if you want to do it system-wide)

03:33 <fdobridge> <Owo> (yall might have to change things up for NVK and ANV, since my manifest there only builds AMD and zink)

03:34 <fdobridge> <redsheep> I can get to it in a little bit. I never was having any crashing so I can probably just use the arch package just the same as before

03:36 <fdobridge> <Owo> this is peak Zink

03:36 <fdobridge> <Owo> https://cdn.discordapp.com/attachments/1209954375766908948/1346325396055920842/image.png?ex=67c7c6bb&is=67c6753b&hm=6a2c0556d3a7fd391f80e7fa5b5f42f4e7cb6e8c10da15711d6403df5db89e43&

03:41 <fdobridge> <gfxstrand> Remember to grab the branch with everything instead of pulling individual MRs.

03:42 <fdobridge> <gfxstrand> Yay

03:43 <fdobridge> <Owo> The Mutter bug is still there, I guess, but we're not hitting it anymore?

03:43 <fdobridge> <Owo> Is that what's going on?

03:43 <fdobridge> <gfxstrand> I'm not sure. I need to dig into that tomorrow. It may have been a ghost.

03:43 <fdobridge> <gfxstrand> I thought doing double commits made everything worse but I'm not convinced anymore.

04:33 <fdobridge> <redsheep> Hmm. I must be having different issues. My desktop is quite unresponsive intermittently, which seems to be new. Could be an update to something else, so if I am the only one seeing that on nvk+zink in particular probably no cause for concern yet

04:33 <fdobridge> <redsheep> But also, my particular firefox rendering issues remain unchanged, so far as I can tell

04:35 <fdobridge> <Sid> what did I miss

04:35 <fdobridge> <redsheep> https://gitlab.freedesktop.org/gfxstrand/mesa/-/tree/zink/all-the-fixes?ref_type=heads

04:35 <fdobridge> <redsheep> New mega-fix branch from Faith bug grinding seemingly all day

04:36 <fdobridge> <Sid> interesting

04:36 <fdobridge> <redsheep> I was pretty hopeful all of my issues overlapped with these fixes, but unfortunately it appears they do not

04:36 <fdobridge> <Sid> f

04:38 <fdobridge> <redsheep> it's annoying how fast my ssd and filesystem are, I can only get like 15 good seconds of fast scrolling out of my terminal by typing tree

04:39 <fdobridge> <redsheep> I'm attempting to replicate the more general desktop flicker. I think that miiiight be fixed for me? That is good if true but was also so difficult for me to replicate I can't really call it yet

04:39 <fdobridge> <Sid> let's try the branch

04:43 <fdobridge> <redsheep> Wow this last time I went to open firefox it briefly had a load of corrupted stuff appearing that pulled from discord on my other monitor. It flickered that stuff in and out for a few seconds, before stabilizing on just being firefox

04:45 <fdobridge> <redsheep> If you don't find anything wrong try playing some games. I am getting wicked long stutters, and pretty frequently when under load.

04:45 <fdobridge> <Sid> opengl games?

04:46 <fdobridge> <redsheep> Well I typicall test minecraft when I am doing a quick game but I don't think it's specific to that, I think it's something hitching the display server

04:46 <fdobridge> <redsheep> Well I typically test minecraft when I am quickly checking a game but I don't think it's specific to that, I think it's something hitching the display server (edited)

04:47 <fdobridge> <Sid> so opengl games, got it

04:47 <fdobridge> <Sid> (since they go through zink)

04:48 <fdobridge> <redsheep> Right but with how the freezing behaves it's clearly not the game freezing, it's the game loading *something* and causing the entire session to briefly stall

04:49 <fdobridge> <Sid> fwiw MR 33855 regressed plasma wayland for me

04:49 <fdobridge> <Sid> oh wait wtf

04:50 <fdobridge> <Sid> nvm I'm still on x11 apparentl

04:50 <fdobridge> <Sid> nvm I'm still on x11 apparently (edited)

04:50 <fdobridge> <Sid> :doomthink:

04:51 <fdobridge> <redsheep> So far I am testing x11 as well since that is the more stable session for me and that is the baseline for most of my testing, I will do wayland next. I believe I have confirmed the hanging is not specific to gl games going through zink

04:51 <fdobridge> <Sid> ogay

04:52 <fdobridge> <gfxstrand> I'm starting to think there's something even more seriously wrong with your multi-monitor setup. Possibly affecting other stuff. If Firefox is pulling from other Windows that's either memory that should have been zeroed and wasn't or something is broken with process isolation.

04:54 <fdobridge> <redsheep> It's an X session so I just assumed it was compositing breaking, but now that you mentioned it if it is app specific that does seem bad

04:54 <fdobridge> <gfxstrand> Yeah, with X weird things can happen

04:54 <fdobridge> <redsheep> Moving onto wayland now

04:55 <fdobridge> <redsheep> Oh dear. Um. Yeah I can't test a damn thing

04:56 <fdobridge> <mhenning> the nouveau kernel driver doesn't zero vram. It's pretty common to get garbage across processes with it

04:56 <fdobridge> <redsheep> https://cdn.discordapp.com/attachments/1209954375766908948/1346345524713885707/rn_image_picker_lib_temp_935a8beb-f6f0-459a-a96e-f64612b20998.jpg?ex=67c7d97a&is=67c687fa&hm=9320052888f67298b175ada68629feceaa3ae0396809b4cfa8ee8af722b4dc5e&

04:56 <fdobridge> <gfxstrand> I was seeing some GPU hangs with Zink on ANV today, which I think are unrelated to the WSI stuff I fixed. We may not be out of the woods on Zink bugs just yet.

04:57 <fdobridge> <redsheep> The Wayland session is pretty angry

04:57 <fdobridge> <gfxstrand> Uh... Pretty colors?

04:57 <fdobridge> <Sid> bisexual lighting

04:57 <fdobridge> <Sid> anyway, reboot and log in again

04:57 <fdobridge> <redsheep> I think it's my background just going apeshit

04:57 <fdobridge> <Sid> there's a plasma bug that breaks the wayland session if you log out of an x11 session and switch to wayland

04:58 <fdobridge> <Sid> I've hit it on nv prop too, where it fails to put up the wayland session entirely

05:01 <fdobridge> <redsheep> Wow this lack of zeroing is really something. I soft rebooted so I could have a fresh go at x11 but I accidentally did Wayland again, and it shows the same pixels... Including the ones that appear to be vram from discord... From before the soft reboot

05:01 <fdobridge> <redsheep> If so that bug also survives soft reboots

05:01 <fdobridge> <Sid> session specific?

05:01 <fdobridge> <redsheep> Wayland was the first session for that

05:02 <fdobridge> <Sid> ..nuts

05:02 <fdobridge> <Sid> bah

05:02 <fdobridge> <redsheep> Well it was x11 since Wayland is kill

05:02 <fdobridge> <Sid> debuginfod is taking five billion years to download

05:03 <fdobridge> <redsheep> I'll try a complete reboot and start with wayland

05:03 <fdobridge> <redsheep> Just have to wait 75 years for my bios

05:03 <fdobridge> <Sid> w/ the all-the-fixes branch my x11 session has become worse :ahh:

05:03 <fdobridge> <Sid> kwin has honest-to-god crashes now

05:04 <fdobridge> <redsheep> Oh wow, I'm not crashing at all. How are we all seeing such wildly different results?

05:05 <fdobridge> <Sid> am🅱️ere vs a🇼a

05:05 <fdobridge> <Sid> /j

05:06 <fdobridge> <redsheep> Yeah no, Wayland first hits the same issue, just a lot more black without lots of garbage already in vram

05:06 <fdobridge> <Sid> https://pastebin.com/1Xb9tKeq

05:06 <fdobridge> <Sid> kwin crash

05:06 <fdobridge> <Owo> Uh oh.

05:06 <fdobridge> <Owo> Well, good luck y'all.

05:07 <fdobridge> <Owo> :hammy:

05:07 <fdobridge> <Sid> waylate time

05:08 <fdobridge> <redsheep> It is my bedtime, true

05:08 <fdobridge> <Sid> wayland is even more broken than before

05:09 <fdobridge> <Sid> https://cdn.discordapp.com/attachments/1209954375766908948/1346348783172063322/rn_image_picker_lib_temp_fb06fb45-5413-4983-8bfc-80143a789cf0.jpg?ex=67c7dc83&is=67c68b03&hm=bc0cec4f94d926a7bbde8cf4e5d1c7d53ce8cac97406e1079013dc747a42db75&

05:09 <fdobridge> <Sid> fresh boot btw

05:09 <fdobridge> <Sid> that's supposed to be spider-man diving

05:10 <fdobridge> <Owo> Random idea. What if y'all run kmscube or some other direct DRM client that isn't just a Wayland compositor?

05:10 <fdobridge> <Sid> like that

05:10 <fdobridge> <Sid> https://cdn.discordapp.com/attachments/1209954375766908948/1346349038005391382/rn_image_picker_lib_temp_00492b5e-7ef8-48d6-aed9-2f24073f44ff.jpg?ex=67c7dcc0&is=67c68b40&hm=e09e39f918b05db1c8494e3b51e5f7bab0bfd9cbe4f1e41f5267990c1f9e8d48&

05:10 <fdobridge> <redsheep> Interestingly this profoundly corrupt session is actually functional, I just have to be kind of blindly using it. I managed to get Firefox open and um. Yeah I can't tell if it's flickering through the more complete corruption covering it

05:10 <fdobridge> <Sid> funnily enough

05:11 <fdobridge> <Sid> sddm-wayland is perfectly fine

05:11 <fdobridge> <redsheep> How do

05:12 <fdobridge> <Owo> I'd try gamescope with an OpenGL client (fuck it, Vulkan too, just to see if anything changes), and I'm pretty sure there are other KMS clients out there

05:12 <fdobridge> <Sid> https://aur.archlinux.org/packages/kmscube-git

05:12 <fdobridge> <Owo> That one is what I'm talking about ^

05:12 <fdobridge> <Owo> Not sure if it uses a graphics API tho

05:12 <fdobridge> <mhenning> Are y'all connecting your displays to your nvidia gpu, or is a different card driving the display?

05:12 <fdobridge> <Sid> :frog_nvidia:

05:13 <fdobridge> <Sid> over displayport

05:13 <fdobridge> <Owo> Ah, yeah, gl, it's good then

05:13 <fdobridge> <redsheep> No other GPU enabled

05:15 <fdobridge> <Sid> ~~gamescope is also a compositor btw~~

05:15 <fdobridge> <gfxstrand> I think some of what we're seeing is some sort of KMS interaction.

05:15 <fdobridge> <gfxstrand> Does Weston work?

05:15 <fdobridge> <Sid> let me check

05:16 <fdobridge> <Owo> That's partially why I recommended gamescope, because I expected it to be simpler than kwin or mutter in terms of how it does things under kms

05:16 <fdobridge> <gfxstrand> (I ask because that's a little easier to repro with than GNOME or KDE)

05:16 <fdobridge> <Owo> Weston completely left my mind :wires:

05:17 <fdobridge> <redsheep> Appears the kmscube aur package doesn't work

05:17 <fdobridge> <Sid> BESTon seems to be fine

05:17 <fdobridge> <Sid> like, perfectly fine

05:17 <fdobridge> <Sid> let's try sway for the hell of it

05:17 <fdobridge> <Sid> wlroots based

05:17 <fdobridge> <Sid> and also try gnome

05:17 <fdobridge> <Sid> :ha:

05:18 <fdobridge> <redsheep> Um I don't have Weston but I have openbox? It's just a black screen with cursor but I don't remember how to use this at all so for all I know that's normal

05:18 <fdobridge> <gfxstrand> So that's definitely a tiling issue. It's rendering to tiled and displaying linear or vice versa. The good news is that shouldn't be total hell to track down if we can reproduce with something I can reasonably debug.

05:19 <fdobridge> <Sid> openbox is x11 however

05:19 <fdobridge> <redsheep> Oh like 3 minutes later it went gray and started responding to the right click context menu, it does work. It's odd that it was delayed

05:19 <fdobridge> <Sid> does look like it, yeah

05:19 <fdobridge> <Sid> gonna try sway and mutter too

05:19 <fdobridge> <Sid> to check one wlroots comp and mutter

05:20 <fdobridge> <Sid> sway is also fine

05:20 <fdobridge> <gfxstrand> As long as it doesn't crash, debugging KWin might not be terrible. I haven't done much with it before, though.

05:21 <fdobridge> <Sid> kwin does crash on x11 sometimes, wayland doesn't however

05:21 <fdobridge> <Sid> for me at least

05:21 <fdobridge> <redsheep> Fascinating, openbox seems to flicker Firefox more rapidly than plasma did, but it's about the same effect otherwise

05:21 <fdobridge> <redsheep> I think that may just be the compositor going faster

05:21 <fdobridge> <Sid> ~~mother~~ mutter is also fine

05:22 <fdobridge> <gfxstrand> If it's a regression, you might be able to bisect. Nothing I did today affects tiling.

05:22 <fdobridge> <Sid> so looks like only 🅱️lasma is running into this tiling issue

05:22 <fdobridge> <Sid> will do

05:22 <fdobridge> <Sid> though, it only broke on your branch :D

05:22 <fdobridge> <Sid> can try current main again to confirm

05:22 <fdobridge> <gfxstrand> Did you test main from today?

05:23 <fdobridge> <gfxstrand> Because I'm gonna be surprised if a tiling issue bisects to one of today's patches.

05:23 <fdobridge> <Sid> on it

05:24 <fdobridge> <gfxstrand> Anyway, I'm off to sleep now. I'll check back in in the morning.

05:24 <fdobridge> <Sid> have yesterday's main on disk, will check today's

05:24 <fdobridge> <Sid> goob night :hug:

05:29 <fdobridge> <Sid> it is a regression

05:29 <fdobridge> <Sid> happened somewhere in the last 50 commits

05:29 <fdobridge> <ermine1716> Good night

05:59 <fdobridge> <Sid> this regression might be unrelated to mesa

05:59 <fdobridge> <Sid> :doomthink:

06:03 <fdobridge> <Sid> euh

06:03 <fdobridge> <Sid> :ConfusedDoggy:

06:07 <fdobridge> <Sid> oh, nvm

06:07 <fdobridge> <Sid> am dumb

06:07 <fdobridge> <Sid> time to restart this bisect

06:17 <fdobridge> <ermine1716> firefox doesn't seem to crash anymore for me

06:18 <fdobridge> <ermine1716> i can try to run sway

06:20 <fdobridge> <ermine1716> it seems to work

06:30 <fdobridge> <Sid> tempted to buy a fingerprint reader for my pc so I don't have to enter my password so many times

06:31 <fdobridge> <ermine1716> i have one on my laptop but i don't use it

06:34 <fdobridge> <ermine1716> KWin crashes though

06:41 <fdobridge> <Sid> @gfxstrand here's the offending commit that breaks kwin wayland

06:41 <fdobridge> <Sid> ```

06:41 <fdobridge> <Sid> df1ff3c711459467432fcd48f7348a8aa78de814 is the first bad commit

06:41 <fdobridge> <Sid> commit df1ff3c711459467432fcd48f7348a8aa78de814

06:42 <fdobridge> <Sid> Author: Mike Blumenkrantz <michael.blumenkrantz@gmail.com>

06:42 <fdobridge> <Sid> Date: Fri Feb 21 12:35:03 2025 -0500

06:42 <fdobridge> <Sid> zink: enable single-plane modifiers for generic 2D exports

06:42 <fdobridge> <Sid>

06:42 <fdobridge> <Sid> this should be fine; multi-plane ones won't work because not all callers

06:42 <fdobridge> <Sid> expect to get multiple fds back

06:42 <fdobridge> <Sid>

06:42 <fdobridge> <Sid> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33675>

06:42 <fdobridge> <Sid> src/gallium/drivers/zink/zink_resource.c | 21 ++++++++++++++++-----

06:42 <fdobridge> <Sid> 1 file changed, 16 insertions(+), 5 deletions(-)

06:42 <fdobridge> <Sid> ```

06:42 <fdobridge> <Sid> cc: @zmike.

06:53 <fdobridge> <ermine1716> How do i make gdb to show symbols from the right libegl_mesa ? meson devenv gdb doesn't help

09:55 mceier has joined #zink

11:21 <austriancoder> what options do I have to get libOpenGL.so on a system that uses zink+kopper?

11:29 <DodoGTA> austriancoder: GLVND should already provide that

12:33 karolherbst has quit [Quit: bye bye]

12:33 karolherbst has joined #zink

12:53 <austriancoder> DodoGTA: will give it a try - thanks

13:34 <fdobridge> <zmike.> are you mixing zink and non-zink?

13:40 <fdobridge> <gfxstrand> Did you file a bug?

13:49 <fdobridge> <Sid> no

13:49 <fdobridge> <Sid> not yet

13:58 <fdobridge> <zmike.> if all the other compositors work (especially the reference compositor) then my guess is a kwin bug

14:01 <fdobridge> <Sid> yeah, sway, gnome, and weston are all fine

14:04 <fdobridge> <Sid> want me to go bother xaver w/ it, or should I file it under mesa?

14:05 <fdobridge> <zmike.> bother xaver first

14:05 <fdobridge> <zmike.> I would think if every other compositor works then kwin should too

14:05 <fdobridge> <Sid> ogay

14:10 <fdobridge> <gfxstrand> Yeah, there's a decent chance KWin is just dropping a modifier on the floor somewhere.

14:10 <fdobridge> <zmike.> that'd be my guess

14:11 <fdobridge> <Sid> put it up in kwin's matrix channel

14:13 airlied_ has joined #zink

14:14 airlied has quit [Ping timeout: 480 seconds]

14:56 Sid127 has quit [Quit: ZNC - https://znc.in]

14:57 <austriancoder> DodoGTA: looking at mesa's sources I do not see any clear path that libglvnd is used for opengl entry points (without using using glx)

14:59 Sid127 has joined #zink

15:02 <fdobridge> <Sid> austriancoder: have you looked at the glvnd source/readme?

15:03 <fdobridge> <Sid> > libglvnd is a vendor-neutral dispatch layer for arbitrating OpenGL API calls between multiple vendors. It allows multiple drivers from different vendors to coexist on the same filesystem, and determines which vendor to dispatch each API call to at runtime.

15:03 <fdobridge> <Sid> > Both GLX and EGL are supported, in any combination with OpenGL and OpenGL ES.

15:03 <fdobridge> <Sid> https://github.com/NVIDIA/libglvnd/

15:04 <austriancoder> I know

15:04 <austriancoder> I even have build it from the sources for my non-linux target :)

15:04 <austriancoder> and even gdb'ed into it

15:05 <austriancoder> and even looked at the mesa sources I am building

15:05 <austriancoder> thats why I am asking such a technical question

15:06 <austriancoder> for libegl i see how its done in mesa - but not for libgl without any glx impl

15:07 <zmike> probably ask in #dri-devel

15:08 <austriancoder> zmike: will do

15:09 <fdobridge> <Sid> ah, apologies 😅

15:17 <austriancoder> np - been doing too much gdb'ing lately .. need a coffee :)

15:19 <fdobridge> <gfxstrand> @zmike. Is that patch doing what I think it's doing? If Zink gets a dma-buf import without modifiers specified by the client, it now looks at the gallium driver and uses whatever modifiers it supports? If so, that's very very bogus.

15:20 <fdobridge> <zmike.> **export**

15:23 <fdobridge> <gfxstrand> That's still very surprising from an EGL clien'ts PoV. It created an image without modifiers, goes to export it, and boom! modifiers.

15:23 <fdobridge> <fooishbar> that's why that extension is criminally terrible and should not exist

15:23 <fdobridge> <gfxstrand> which extension?

15:24 <fdobridge> <Sid> https://tenor.com/view/fps-doug-boom-headshot-headshot-pure-pwnage-gif-4455343198282149618

15:24 <fdobridge> <fooishbar> https://registry.khronos.org/EGL/extensions/MESA/EGL_MESA_image_dma_buf_export.txt

15:25 <fdobridge> <gfxstrand> Really, EGLImage is the terrible thing.

15:26 <fdobridge> <fooishbar> EGLImage is fine

15:26 <fdobridge> <fooishbar> just use it as a reference to something you've exported, rather than trying to turn it into some kind of external-facing allocation API

15:27 <fdobridge> <fooishbar> anyway, I have nfi the implications of that commit, but if it's to support that export, it should only apply to a resource which was created internally with no modifiers set

15:28 <fdobridge> <fooishbar> anyway, I have nfi the implications of that commit, but if it's to support that export, it should only apply to a resource which was created internally with no modifiers set, and also never previously exported (edited)

15:28 <fdobridge> <zmike.> and this is what it does

15:29 <fdobridge> <zmike.> it's for enabling compression on CL exports

15:29 <fdobridge> <zmike.> though also it will potentially impact anyone using this

15:29 <fdobridge> <fooishbar> I can't see KWin calling ExportDMABUF thus far

15:29 <fdobridge> <zmike.> but idgaf too much there

15:30 <MoeIcenowy> EGLImage is surely the terrible thing

15:30 <MoeIcenowy> but it has to exist

15:30 <MoeIcenowy> KWin uses it to import windows contents

15:31 <MoeIcenowy> Xorg uses it to implement front buffer rendering with EGL

15:31 <fdobridge> <fooishbar> @tiredchiku I think you want to insert a breakpoint in the changed bit and get a backtrace from there

15:31 <MoeIcenowy> EGL_MESA_image_dma_buf_export isn't so bad then -- it does not promise success

15:32 <fdobridge> <gfxstrand> That would help. Though I think I may have an idea what's going on.

15:32 <fdobridge> <fooishbar> it doesn't seem like KWin calls ExportDMABUF anywhere, and it's presumably not trying to share an image between GL & CL, so presumably 'let's funge modifiers' is happening in a context it shouldn't

15:32 <MoeIcenowy> and surely we should be able to export something

15:32 <fdobridge> <fooishbar> export it to where?

15:32 <MoeIcenowy> any other APIs

15:32 <MoeIcenowy> or even other devices

15:32 <fdobridge> <fooishbar> right

15:33 <fdobridge> <fooishbar> so do you export in the optimal format that is going to work well for you, or do you assume everyone always supports linear so only ever export to that?

15:33 <MoeIcenowy> Well you can say that EGLImages should be allocated with specialized APIs

15:33 <MoeIcenowy> such as GBM

15:33 <fdobridge> <fooishbar> either you're broken or you're slow

15:33 <fdobridge> <fooishbar> which is why every other API has explicit negotiation

15:33 <fdobridge> <gfxstrand> @zmike. What's the difference between `zink_resource` and `zink_resource_object`?

15:33 <fdobridge> <Sid> :birdnotes:

15:33 <fdobridge> <zmike.> the former is the state tracker object, the latter is the internal state tracker object

15:34 <fdobridge> <zmike.> the former is the public state tracker object, the latter is the internal state tracker object (edited)

15:34 <MoeIcenowy> fooishbar: for other APIs on the same device, the answer is the former; for other devices, it's the latter

15:34 <fdobridge> <Sid> I'll do it tomorrow morning, I was gonna go ~~play~~ test a game for a bit and then wind down with S2 of The Expanse

15:34 <fdobridge> <fooishbar> yes, but when you say 'export this image as a dmabuf', you don't know which device(s) or API(s) it'll be imported into

15:34 <fdobridge> <Sid> (it's 2104 where I am)

15:35 <fdobridge> <fooishbar> so Mesa can pick the option which might not work, or the option that will be slow

15:35 <fdobridge> <zmike.> S2 is when it starts to get good

15:35 <fdobridge> <fooishbar> hence why that API is fundamentally misconceived

15:35 <MoeIcenowy> well, do we have some more thing than EGL_MESA_image_dma_buf_export ?

15:35 <fdobridge> <fooishbar> gbm

15:35 <fdobridge> <Sid> I've heard 😅

15:35 <MoeIcenowy> gbm can only allocate things

15:35 <fdobridge> <fooishbar> also vk alloc, or pretty much everything else

15:35 <fdobridge> <fooishbar> yes

15:35 <MoeIcenowy> it cannot export things already in some API

15:36 <MoeIcenowy> (and I think the implementation of gbm is also a hack

15:36 <fdobridge> <fooishbar> that's why you allocate with gbm (a real allocator) and import into whatever 'some API' is

15:36 <MoeIcenowy> gbm cannot cooperate with another libEGL

15:36 <MoeIcenowy> e.g. gbm is built from one Mesa version and libEGL is built from another

15:36 <fdobridge> <fooishbar> you can use gbm to allocate without having to touch any of the parts of GBM which interact with EGL

15:37 <MoeIcenowy> then export dmabuf from gbm and import dmabuf in EGL?

15:37 <fdobridge> <fooishbar> anyway, between an API that has an implementation which could be improved for the edge case where you have multiple versions of things and you only want to install some of them, or a completely conceptually broken API, I'm going to take the first one

15:37 <fdobridge> <fooishbar> yes, that

15:38 <fdobridge> <Sid> just so I know I understand correctly, I have to set the breakpoint w/ `break add_resource_bind`?

15:38 <MoeIcenowy> BTW I think EGL_KHR_platform_gbm is broken too

15:39 <fdobridge> <fooishbar> break on the line where it hits the 'I'm going to invent my own modifiers' case

15:39 <fdobridge> <fooishbar> you don't have to use it if you don't like it?

15:39 <fdobridge> <zmike.> it's a bit more complex; you need to also check what modifier the import is using

15:40 <fdobridge> <Sid> okay, I think I got you

15:40 <fdobridge> <Sid> hmm

15:40 <MoeIcenowy> fooishbar: well yes

15:40 <fdobridge> <zmike.> I'm guessing kwin is assuming that if it didn't explicitly create with a modifier then it's safe to use LINEAR

15:40 <fdobridge> <zmike.> or something similar

15:41 <fdobridge> <gfxstrand> That seems plausible

15:41 <fdobridge> <Sid> how would I do that :derproo:

15:41 <fdobridge> <zmike.> :grimace:

15:41 <fdobridge> <Sid> https://tenor.com/view/patrick-star-dumb-duh-gif-1800223369574447801

15:41 <fdobridge> <Sid> am learning!

15:42 <fdobridge> <zmike.> I guess you break on like... the `PIPE_RESOURCE_PARAM_MODIFIER` case in `zink_resource_get_param()` for export

15:42 <fdobridge> <zmike.> and then you can check `whandle->modifier` in `zink_resource_from_handle()` for import

15:43 <fdobridge> <zmike.> and these may not be in the same processes

15:43 <fdobridge> <Sid> :birdnotes:

15:43 <fdobridge> <Sid> I will try my best :salute:

15:44 <fdobridge> <Sid> ..tomorrow

15:44 <fdobridge> <zmike.> I'll see if I can take a look today if I ever dig myself out of refcounting

15:45 <fdobridge> <gfxstrand> kwin doesn't seem to be using either the import or export API

15:46 <fdobridge> <fooishbar> `EglDisplay::importDmaBufAsImage` calls `eglCreateImageKHR(EGL_LINUX_DMA_BUF_EXT)`

15:46 <fdobridge> <fooishbar> which definitely seems like an improvement over swrast

15:46 <MoeIcenowy> (offtopic) I start to wonder whether the usage of EGL_KHR_platform_gbm in Glamor is valid -- it does not use gbm_surface at all, only gbm_bo, but I don't think the definition of EGL_KHR_platform_gbm mentions BO at all

15:47 <fdobridge> <fooishbar> you need an EGLDisplay to have a GL context, and you need a GL context to do any rendering

15:48 <fdobridge> <gfxstrand> I've definitely found their GBM code

15:48 <fdobridge> <gfxstrand> Okay, there's the EGL code

15:49 <fdobridge> <gfxstrand> Okay, so here's a theory: Maybe they support modifiers but only for clients and not for scanout and they're assuming that a blind `gbm_bo_create()` and `gbm_bo_get_fd()` will give them an image they can pass to KMS?

15:50 <fdobridge> <gfxstrand> That's starting to look likely...

15:51 <fdobridge> <zmike.> the world is not ready for zink

15:51 <fdobridge> <gfxstrand> Nah, the legacy paths just aren't capable of modifiers.

15:52 <fdobridge> <gfxstrand> The legacy paths suck

15:52 <fdobridge> <gfxstrand> And I'm a little annoyed kwin is maybe still using them.

15:52 <MoeIcenowy> BTW I wonder why Zink cannot work with a X server capable of modifiers

15:52 <MoeIcenowy> but it works when the DDX gains a dummy support for modifiers (which only allows linear/invalid)

15:52 <MoeIcenowy> well I may say Kopper instead of Zink here?

15:53 <fdobridge> <zmike.> ...?

15:53 <MoeIcenowy> s/may say/should say/

15:54 <fdobridge> <fooishbar> `src/core/gbmgraphicsbufferallocator.cpp` calls `gbm_bo_create_with_modifiers()`, if any modifiers are supplied

15:54 <fdobridge> <fooishbar> `src/core/gbmgraphicsbufferallocator.cpp` calls `gbm_bo_create_with_modifiers()`, if any modifiers are advertised by the implementation (edited)

15:57 <MoeIcenowy> ( I did such a dummy implementation of modifiers in https://github.com/revyos/xf86-video-thead/commit/5432fd83c60e205e032bcd39b2ec9f4b62826cd9 , although the TH1520 SoC has a too out-of-date PVR blob that makes Zink not seriously usable at all

15:58 <fdobridge> <zmike.> did you perhaps mean why zink does not work on an xserver that is NOT capable of modifiers?

15:58 <fdobridge> <zmike.> because if so, it's the same issue that is being investigated now

16:00 <MoeIcenowy> zmike.: in the case that things are NOT capable of modifiers, should everything be linear?

16:01 <fdobridge> <zmike.> ideally, though some drivers do not universally interpret INVALID = LINEAR

16:01 <MoeIcenowy> or is everything some undefined format that can be exported by X and imported by app by accident?

16:01 <fdobridge> <zmike.> without explicit modifers, DRM_FORMAT_MOD_INVALID is used

16:01 <fdobridge> <zmike.> which is "idklol maybe use a modifier or don't?"

16:02 <fdobridge> <zmike.> zink maintains a list of platforms which have consistent behavior here

16:02 <MoeIcenowy> so is these "some drivers" why explicit LINEAR is here?

16:02 <fdobridge> <zmike.> drivers which aren't on the list don't work

16:02 <fdobridge> <zmike.> and need to use LIBGL_KOPPER_DRI2=1

16:03 <MoeIcenowy> well my dummy DRI 1.2 implementation only returns INVALID here...

16:03 <fdobridge> <zmike.> yes, so you need to set that env var

16:03 <fdobridge> <zmike.> or, if your platform guarantees that INVALID==LINEAR, add yourself to the list of allowed drivers

16:03 <MoeIcenowy> zmike: well with this env var I can get around w/o LIBGL_KOPPER_DRI2

16:04 <MoeIcenowy> even no LINEAR (although I know these buffers are linear because of being used for exchanging data between three IP cores from two vendors (3D GPU from IMG and 2D GPU / Disp from VeriSilicon

16:05 <fdobridge> <zmike.> you can try adding your driver to the `can_do_invalid_linear_modifier` case and see what happens

16:07 <fdobridge> <gfxstrand> Yeah, reading the code it looks like kwin should be doing modifiers all the way through. I think we need a backtrace to know what's going on.

16:13 <fdobridge> <gfxstrand> It's more "This is ancient decision made by monks in the Himalayas that somehow the kernel and userspace have agreed on and it probably works for scanout."

16:14 <fdobridge> <gfxstrand> It's more "This is ancient decision, made by monks in the Himalayas that somehow the kernel and userspace have agreed on and it probably works for scanout." (edited)

16:14 <fdobridge> <gfxstrand> It's more "This is an ancient decision, made by monks in the Himalayas that somehow the kernel and userspace have agreed on and it probably works for scanout." (edited)

17:47 <fdobridge> <zmike.> were there any example apps that are supposed to exhibit the tiling issue?

17:47 <fdobridge> <zmike.> I'm trying weston demo apps on kwin+drm and it seems to work

17:54 <fdobridge> <Sid> no, it's kwin itself

17:54 <fdobridge> <zmike.> hm

17:55 <fdobridge> <zmike.> seems to be working on radv, but maybe I need to do a full plasma session and not just kwin

18:07 <fdobridge> <zmike.> plasma works great too

18:08 <fdobridge> <Sid> :hmmmg:

18:08 <fdobridge> <Sid> nvk issue? :doomthink:

18:08 <fdobridge> <zmike.> installing there next

18:19 <fdobridge> <zmike.> can't actually get it to start :migraine:

18:47 <fdobridge> <zmike.> oh I see, on radv it wasn't actually using the mesa I told it to use

18:49 <fdobridge> <zmike.> ...and even when I tell it to use the right mesa it still refuses

18:50 <fdobridge> <zmike.> think that's a sign that it's time to go back to the things I'm able to sometimes handle: eating lunch

18:52 <fdobridge> <redsheep> The sometimes is concerning. I wonder if that's the same qt bug that's causing us to have to set the vulkan icds explicitly?

19:02 DodoGTA has left #zink [#zink]

19:08 <fdobridge> <gfxstrand> kwin is working okay for me

19:08 <fdobridge> <gfxstrand> https://cdn.discordapp.com/attachments/1209954375766908948/1346560005066981519/snapshot.png?ex=67c8a13b&is=67c74fbb&hm=aee81f41c6ca21c1530fd913eb2695b56426e9dfc584d71f3e877ec817141a20&

19:09 <fdobridge> <zmike.> are you sure it's using the right mesa

19:10 <fdobridge> <zmike.> i.e., the right libgallium

19:12 <fdobridge> <gfxstrand> Hrm... It's not the right mesa

19:12 <fdobridge> <gfxstrand> Now to figure out how to fix that

19:15 <fdobridge> <gfxstrand> Is it trashing my environment variables?

19:21 DodoGTA has joined #zink

19:21 <fdobridge> <gfxstrand> Not all of them, apparently. `NOUVEAU_USE_ZINK` goes through.

19:22 <fdobridge> <gfxstrand> But I'm getting Zink+NVK from my system, not the ones I told it to use.

19:24 <fdobridge> <Sid> where are you setting the env vars

19:24 <fdobridge> <Sid> and how are you launching kwin

19:24 <fdobridge> <gfxstrand> I have a script that sets them and then launches whatever

19:24 <fdobridge> <gfxstrand> I'm launching kwin with `kwin`

19:31 <fdobridge> <gfxstrand> IDK why it's picking up `NOUVEAU_USE_ZINK` and not `VK_ICD_FILENAMES` or `LD_LIBRARY_PATH`.

19:34 <fdobridge> <gfxstrand> scrubbing `LD_LIBRARY_PATH` doesn't surprise me too much. But `VK_ICD_FILENAMES`?

19:39 <fdobridge> <gfxstrand> I made my script call `env` before invoking kwin

19:39 <fdobridge> <gfxstrand> ```

19:39 <fdobridge> <gfxstrand> LD_LIBRARY_PATH=/home/faith/projects/mesa/main/_install/lib64:

19:39 <fdobridge> <gfxstrand> VK_ICD_FILENAMES=:/home/faith/projects/mesa/main/_install/share/vulkan/icd.d/intel_icd.x86_64.json:/home/faith/projects/mesa/main/_install/share/vulkan/icd.d/nouveau_icd.x86_64.json

19:39 <fdobridge> <gfxstrand> ```

19:39 <fdobridge> <gfxstrand> It's all right there

19:39 <fdobridge> <gfxstrand> Maybe it's a libglvnd issue?

19:39 <fdobridge> <gfxstrand> It works for everything else, though.

19:40 <fdobridge> <gfxstrand> I ran with `VK_LOADER_DEBUG=all` and it didn't see `VK_ICD_FILENAMES` either

19:44 airlied_ is now known as airlied

19:46 <fdobridge> <gfxstrand> So it picks up `VK_LOADER_DEBUG` but not `VK_ICD_FILENAMES`

19:47 <fdobridge> <gfxstrand> But I see nothing in the kwin code that looks like it's purging specific environment variables

19:49 <fdobridge> <gfxstrand> /proc/PID/environ says it has them

19:50 <fdobridge> <gfxstrand> I hate debugging compositors....

19:51 <fdobridge> <mhenning> yeah that all sounds terrifying

19:52 <fdobridge> <gfxstrand> Though maybe /proc/PID/environ is just what it was launched with and not what it has now?

19:53 <fdobridge> <gfxstrand> ```

19:53 <fdobridge> <gfxstrand> (gdb) p getenv("VK_ICD_FILENAMES")

19:53 <fdobridge> <gfxstrand> $1 = 0x7ffe611edc10 ":/home/faith/projects/mesa/main/_install/share/vulkan/icd.d/intel_icd.x86_64.json:/home/faith/projects/mesa/main/_install/share/vulkan/icd.d/nouveau_icd.x86_64.json"

19:53 <fdobridge> <gfxstrand> (gdb) p getenv("LD_LIBRARY_PATH")

19:53 <fdobridge> <gfxstrand> $2 = 0x0

19:53 <fdobridge> <gfxstrand> ```

19:53 <fdobridge> <gfxstrand> Okay, so someone is trashing `LD_LIBRARY_PATH` but not `VK_ICD_FILENAMES`

19:53 <fdobridge> <gfxstrand> And yet it's still loading the wrong Vulkan driver

19:53 <fdobridge> <gfxstrand> blarg!

19:55 <fdobridge> <gfxstrand> Good to know `LD_LIBRARY_PATH` is being trashed. But who's trashing it?!?

19:58 <fdobridge> <mhenning> `strings blah.so | grep LD_LIBRARY_PATH` maybe

20:01 <fdobridge> <gfxstrand> nothing interesting

20:01 <fdobridge> <gfxstrand> Not in kwin, not in libkwin.so

20:02 <fdobridge> <gfxstrand> Hrm... libsystemd has LD_LIBRARY_PATH

20:05 <fdobridge> <mhenning> kwin isn't setuid, is it?

20:05 <fdobridge> <gfxstrand> ```

20:05 <fdobridge> <gfxstrand> faith@zoot% ls -hl /usr/bin |grep kwin

20:05 <fdobridge> <gfxstrand> lrwxrwxrwx. 1 root root 12 Feb 26 18:00 kwin -> kwin_wayland

20:05 <fdobridge> <gfxstrand> -rwxr-xr-x. 1 root root 1.5M Feb 26 18:00 kwin_wayland

20:05 <fdobridge> <gfxstrand> -rwxr-xr-x. 1 root root 49K Feb 26 18:00 kwin_wayland_wrapper

20:05 <fdobridge> <gfxstrand> ```

20:05 <fdobridge> <gfxstrand> Nope

20:06 <fdobridge> <gfxstrand> But I think it uses logind to get access to the tty and that might be scrubbing things

20:08 <fdobridge> <gfxstrand> But again, it's not scrubbing `VK_ICD_FILENAMES` and yet it's loading the system Vulkan driver. This all makes no sense!

20:10 <fdobridge> <gfxstrand> Looks like it talks to logind itself so no library loaded for that

20:41 <fdobridge> <!DodoNVK (she) 🇱🇹> It looks like that the Vulkan Mesa device select layer may be problematic with zink in some cases (its Wayland event loop may freeze a Wayland compositor)

20:43 <fdobridge> <Owo> @gfxstrand off work now. Any new things I should test on radv+Zink?

20:43 <fdobridge> <Owo> I'm about to see if I can repro the performance stuff in a bit and file an issue if so

20:46 <fdobridge> <gfxstrand> Yeah, depending on when it gets invoked with respect to other stuff in the compositor setup.

20:47 <fdobridge> <zmike.> pretty sure I already fixed all of that

20:48 <fdobridge> <!DodoNVK (she) 🇱🇹> In Git code (or the 25.0 release)?

20:48 <fdobridge> <zmike.> I haven't touched it in a while

21:17 karolherbst has quit [Quit: Ping timeout (120 seconds)]

21:21 <fdobridge> <gfxstrand> Wait... Is kwin implicitly using Flatpack or something crazy like that?

21:22 <fdobridge> <gfxstrand> I see a lot of flatpack in the strace

21:23 <fdobridge> <redsheep> I don't think I even have flatpak installed, so I doubt it

21:26 chiku has joined #zink

21:27 karolherbst has joined #zink

21:33 <fdobridge> <Owo> what about it?

21:34 <fdobridge> <Owo> kwin shouldn't have anything to do with flatpak, aside from maybe security-context stuff

21:48 <fdobridge> <gfxstrand> I saw a bunch of flatpack stuff in the strace and it threw me

21:48 <fdobridge> <gfxstrand> I don't think it actually is.

21:48 <fdobridge> <gfxstrand> Something somewhere is just stomping LD_LIBRARY_PATH

21:53 <fdobridge> <gfxstrand> Okay, I think I got it to load my libEGL but now it's failing. 😢

22:05 <fdobridge> <gfxstrand> The Vulkan loader is using `secure_getenv()`

22:10 <fdobridge> <gfxstrand> But also, regular `getenv()` is returning NULL for `LD_LIBRARY_PATH`

22:20 <fdobridge> <gfxstrand> I'm pretty close to giving up for the moment

22:20 <fdobridge> <gfxstrand> Or maybe I have to just bite my lip and install system-wide

22:25 <fdobridge> <gfxstrand> Here's hoping I don't regret this...

22:31 <fdobridge> <gfxstrand> Okay, system-wide install and I'm now getting my custom Zink+NVK and... kwin is fine.

22:31 <fdobridge> <zmike.> what about:

22:31 <fdobridge> <zmike.> the issue is caused by kwin being system mesa and apps being non-system mesa

22:32 <fdobridge> <zmike.> and somehow it's a local issue that doesn't affect us

22:33 <fdobridge> <gfxstrand> possible?

22:33 <fdobridge> <gfxstrand> https://gitlab.freedesktop.org/gfxstrand/mesa

22:34 <fdobridge> <gfxstrand> https://cdn.discordapp.com/attachments/1209954375766908948/1346611680276254742/snapshot.png?ex=67c8d15b&is=67c77fdb&hm=caedb818e4b160a2cc1b690e6df50dbbd2ea33f949eb9495908b887264e8d71b&

22:34 <fdobridge> <gfxstrand> Multi-monitor even seems to work.

22:34 <fdobridge> <gfxstrand> Maybe I need a full KDE session?

22:34 <fdobridge> <zmike.> 💪

22:34 <fdobridge> <zmike.> idk I tried full plasma and that worked fine too

22:34 <fdobridge> <zmike.> maybe you'll have ~~better~~worse luck

22:36 <fdobridge> <gfxstrand> Pardon me. Downloading half a gig of KDE...

22:41 <fdobridge> <gfxstrand> Okay, I've blown away my system GL with my zink/all-the-fixes branch + a hack to use Zink by default.

22:42 <fdobridge> <gfxstrand> https://cdn.discordapp.com/attachments/1209954375766908948/1346613806226473010/snapshot.png?ex=67c8d356&is=67c781d6&hm=a452bf773baf3c33b1292c00585c92ac4739d69a6a786cf6a4f700081e794c79&

22:42 <fdobridge> <mhenning> There's an actual zink by default MR, no need to hack it

22:42 <fdobridge> <zmike.> if you're talking about doing things "the right way" you're in the wrong channel

22:43 <fdobridge> <gfxstrand> Wow. It's been a while since I've used KDE...

22:43 <fdobridge> <mhenning> sorry, I'll only do things the wrong way from now on

22:44 <fdobridge> <zmike.> thank you

22:44 <fdobridge> <gfxstrand> https://cdn.discordapp.com/attachments/1209954375766908948/1346614264215109735/snapshot.png?ex=67c8d3c3&is=67c78243&hm=3765bba56813552febe9f8d776d37cb862a3b66a21efe743c3b9dfae6c16674e&

22:48 <fdobridge> <Owo> I'm going to run into the woods and never appear again if that's a problem

22:48 <fdobridge> <zmike.> I'm skeptical that it is

22:48 <fdobridge> <gfxstrand> Hrm... That might not be the right Mesa sha

22:48 <fdobridge> <Owo> kwin shouldn't have any problem with it, it's just dmabufs getting passed to it that it composites on its own

22:49 <fdobridge> <Owo> :Shrug:

22:52 <fdobridge> <gfxstrand> Got it!

22:52 <fdobridge> <gfxstrand> https://cdn.discordapp.com/attachments/1209954375766908948/1346616205653446727/snapshot.png?ex=67c8d592&is=67c78412&hm=27340d5baa7ba142e308b9675ccdac01ae19eb292a56dd4b1dbc02c18ed374eb&

22:52 <fdobridge> <gfxstrand> Now to figure out why...

22:52 <fdobridge> <magic_rb.> Pretty!

22:53 <fdobridge> <gfxstrand> I failed at git. :frog_upside_down:

22:53 <fdobridge> <gfxstrand> Doing things incorrectly, just like @zmike. said!

22:53 <fdobridge> <zmike.> that's the spirit

22:57 hikiko has joined #zink

22:58 <fdobridge> <gfxstrand> ```

22:58 <fdobridge> <gfxstrand> (gdb) bt

22:58 <fdobridge> <gfxstrand> #0 add_resource_bind (ctx=0x55a32b4f7e10, res=0x55a32b825940, bind=536870912)

22:58 <fdobridge> <gfxstrand> at ../src/gallium/drivers/zink/zink_resource.c:1733

22:58 <fdobridge> <gfxstrand> #1 0x00007fa5c48b7232 in zink_resource_get_handle

22:58 <fdobridge> <gfxstrand> (pscreen=0x55a32a627260, context=0x0, tex=0x55a32b825940, whandle=0x7fffaa7436d0, usage=2)

22:58 <fdobridge> <gfxstrand> at ../src/gallium/drivers/zink/zink_resource.c:1948

22:58 <fdobridge> <gfxstrand> #2 0x00007fa5c48b7017 in zink_resource_get_param

22:58 <fdobridge> <gfxstrand> (pscreen=0x55a32a627260, pctx=0x0, pres=0x55a32b825940, plane=0, layer=0, level=0, param=PIPE_RESOURCE_PARAM_HANDLE_TYPE_KMS, handle_usage=2, value=0x7fffaa7437c0) at ../src/gallium/drivers/zink/zink_resource.c:1896

22:58 <fdobridge> <gfxstrand> #3 0x00007fa5c361460e in dri2_resource_get_param

22:58 <fdobridge> <gfxstrand> (image=0x55a32a75cf30, param=PIPE_RESOURCE_PARAM_HANDLE_TYPE_KMS, handle_usage=2, value=0x7fffaa7437c0)

22:58 <fdobridge> <gfxstrand> at ../src/gallium/frontends/dri/dri2.c:1288

22:58 <fdobridge> <gfxstrand> #4 0x00007fa5c36146dd in dri2_query_image_by_resource_param (image=0x55a32a75cf30, attrib=8193, value=0x55a32b5d71e8)

22:58 <fdobridge> <gfxstrand> at ../src/gallium/frontends/dri/dri2.c:1332

22:58 <fdobridge> <gfxstrand> #5 0x00007fa5c3614824 in dri2_query_image (image=0x55a32a75cf30, attrib=8193, value=0x55a32b5d71e8)

22:58 <fdobridge> <gfxstrand> at ../src/gallium/frontends/dri/dri2.c:1370

22:58 <fdobridge> <gfxstrand> #6 0x00007fa5e4cb6400 in gbm_dri_bo_create

22:58 <fdobridge> <gfxstrand> (gbm=0x55a32a61ac10, width=1280, height=720, format=875713112, usage=5, modifiers=0x0, count=0)

22:58 <fdobridge> <gfxstrand> at ../src/gbm/backends/dri/gbm_dri.c:995

22:58 <fdobridge> <gfxstrand> #7 0x00007fa5e0ce9ddf in gbm_bo_create (gbm=0x55a32a61ac10, width=1280, height=720, format=875713112, flags=5)

22:58 <fdobridge> <gfxstrand> at ../src/gbm/main/gbm.c:498

22:58 <fdobridge> <gfxstrand> #8 0x00007fa5e42681d6 in KWin::allocateDmaBuf(gbm_device*, KWin::GraphicsBufferOptions const&) () at /lib64/libkwin.so.6

22:58 <fdobridge> <gfxstrand> #9 0x00007fa5e4353a63 in KWin::EglSwapchain::acquire() () at /lib64/libkwin.so.6

22:58 <fdobridge> <gfxstrand> #10 0x00007fa5e4502d2a in KWin::EglGbmLayerSurface::startRendering(QSize const&, KWin::OutputTransform, QHash<unsigned int, QList<unsigned long> > const&, KWin::ColorDescription const&, QVector3D const&, std::shared_ptr<KWin::IccProfile> const&, double, KWin::Output::ColorPowerTradeoff) () at /lib64/libkwin.so.6

22:58 <fdobridge> <gfxstrand> #11 0x00007fa5e44fa6ca in KWin::EglGbmLayer::doBeginFrame() () at /lib64/libkwin.so.6

22:58 <fdobridge> <gfxstrand> #12 0x00007fa5e4272e88 in KWin::OutputLayer::beginFrame() () at /lib64/libkwin.so.6

22:59 <fdobridge> <gfxstrand> #13 0x00007fa5e42617a7 in KWin::WaylandCompositor::composite(KWin::RenderLoop*) () at /lib64/libkwin.so.6

22:59 <fdobridge> <gfxstrand> #14 0x00007fa5e155a26e in void doActivate<false>(QObject*, int, void**) () at /lib64/libQt6Core.so.6

22:59 <fdobridge> <gfxstrand> #15 0x00007fa5e42759b4 in KWin::RenderLoop::frameRequested(KWin::RenderLoop*) () at /lib64/libkwin.so.6

22:59 <fdobridge> <gfxstrand> #16 0x00007fa5e427a9d2 in KWin::RenderLoopPrivate::dispatch() () at /lib64/libkwin.so.6

22:59 <fdobridge> <gfxstrand> #17 0x00007fa5e155a26e in void doActivate<false>(QObject*, int, void**) () at /lib64/libQt6Core.so.6

22:59 <fdobridge> <gfxstrand> Looks like it's using `gbm_bo_create()` for display buffers and not using modifiers

23:00 <fdobridge> <gfxstrand> Which works fine on every driver except Zink

23:01 <fdobridge> <gfxstrand> Why isn't it using modifiers? I have no idea...

23:02 <fdobridge> <gfxstrand> Pardon me while gdb downloads debug symbols...

23:03 <fdobridge> <gfxstrand> And it's KDE so it's a lot of C++. :frog_weary:

23:29 <fdobridge> <gfxstrand> Okay, I know what's going on.

23:33 <fdobridge> <gfxstrand> Yeah, we need to just revert that commit

23:38 <fdobridge> <zmike.> But why?

23:38 <fdobridge> <gfxstrand> When KWin can't find usable atomic mode-setting, it disables modifiers for display and falls back to the legacy APIs. This means `gbm_bo_create()` and `gbm_bo_get_fd()` without querying the modifier. This is perfectly reasonable because those APIs are expected to magically create scanout-capable buffers which you can pass off to KMS. When Zink picks a random modifier and assigns it, there's no guarantee that the resulting buffer will be scan

23:38 <fdobridge> <zmike.> Hmmm

23:38 <fdobridge> <zmike.> Seems like that should just add a case depending on the export handle type

23:39 <fdobridge> <gfxstrand> Well, the handle type in this case will be dma-buf

23:40 <fdobridge> <zmike.> 🤕

23:41 <fdobridge> <zmike.> Alright so @karolherbst you really need to plumb that CL screen create param

23:41 <fdobridge> <karolherbst> for the refcounting rework you are doing?

23:41 <fdobridge> <zmike.> No for this^

23:42 <fdobridge> <karolherbst> ohh.. modifier stuff broke?

23:42 <fdobridge> <zmike.> Yeah because GL sucks

23:42 <fdobridge> <karolherbst> ...

23:42 <fdobridge> <karolherbst> *sigh*

23:43 <fdobridge> <zmike.> Though I still think there should be a case to handle this normally

23:43 <fdobridge> <gfxstrand> https://cdn.discordapp.com/attachments/1209954375766908948/1346629259841765386/snapshot.png?ex=67c8e1ba&is=67c7903a&hm=2e4b6d7e55f0755f7f6971ebd492b324e82267a5e253ed6a5ba86425773f5dbb&

23:46 <fdobridge> <karolherbst> why is that such a mess

23:46 <fdobridge> <gfxstrand> Why can we not tell the difference between CL<->GL sharing and GBM?

23:47 <fdobridge> <gfxstrand> Why does CL<->GL sharing have to go through the import/export APIs anyway? Isn't this just for Zink to share with itself?

23:47 <fdobridge> <karolherbst> we could pass in a special usage flag?

23:48 <fdobridge> <karolherbst> isolation mostly, but also because it's all a pain

23:48 <fdobridge> <zmike.> Yeah maybe add a different handle type or something

23:49 <fdobridge> <karolherbst> just a custom `PIPE_HANDLE_USAGE_*` no?

23:50 <fdobridge> <karolherbst> though I think dma-buf is saner here, because I don't really want to share the pipe_resource the GL side is using and hope that nothing fucks it up badly

23:51 <fdobridge> <karolherbst> but I'm considering doing it anyway, because mipmaps are pure pain (tm)

23:51 <fdobridge> <gfxstrand> There's no safe way to share images without sharing the pipe_resource

23:51 <fdobridge> <gfxstrand> Not outside of 2D and maybe 2D arrays (but I wouldn't bet on it)

23:52 <fdobridge> <gfxstrand> CL<->GL sharing was a mistake

23:52 <fdobridge> <karolherbst> that's why zink does linear except for 2D

23:52 <fdobridge> <karolherbst> or something

23:52 <fdobridge> <gfxstrand> The one thing NVIDIA can do linear on. :frog_upside_down:

23:52 <fdobridge> <karolherbst> there are new sharing extensions and they just use dma_buf on linux. Were mostly added for vulkan

23:52 <fdobridge> <gfxstrand> Yes there are

23:53 <fdobridge> <gfxstrand> Those are basically the CL version of GL_EXT_external_objects

23:53 <fdobridge> <gfxstrand> Only maybe better? I'm not actually sure if they're better.

23:53 <fdobridge> <karolherbst> yeah.. and atm they force linear, becuase somehow modifiers aren't part of it

23:53 <fdobridge> <gfxstrand> But they're definitely more sane than the legacy mess.

23:53 <fdobridge> <karolherbst> oh, for sure

23:53 <fdobridge> <karolherbst> but also they don't support modifiers yet

23:53 <fdobridge> <gfxstrand> Last I knew, they used the same driver/deviceUUID stuff and did it the Vulkan way.

23:54 <fdobridge> <karolherbst> well.. you just tell the CL runtime to import an fd

23:54 <fdobridge> <gfxstrand> But it's been a loooong time since I looked at a draft of that extension

23:54 <fdobridge> <gfxstrand> Like Ben and I talked about it 6 years ago sort of long. 😅

23:54 <fdobridge> <karolherbst> at least there is also an external semaphore part of it, so syncing isn't a disaster

23:55 <fdobridge> <karolherbst> at some point I want to support it, but we'd still end up doing dma-buf

23:55 <fdobridge> <karolherbst> so not sure if special casing GL is really such a great idea there

23:56 <fdobridge> <karolherbst> also because I think having proper isolation there is actually not a terrible idea... it's just...

23:56 <fdobridge> <karolherbst> if you ignore zink it all works well

23:56 <fdobridge> <karolherbst> just zink is a problem child

23:56 <fdobridge> <gfxstrand> Hey, look! My name is on that extension. 🙈

23:56 <fdobridge> <karolherbst> native gallium drivers can just give all the information to properly import the resource

23:57 <fdobridge> <karolherbst> there is `mesa_glinterop_export_out` and `mesa_glinterop_export_in` to deal with all that

23:57 <fdobridge> <gfxstrand> Yeah, so if it's imported with `CL_EXTERNAL_MEMORY_HANDLE_OPAQUE_FD_KHR`, it works the same way as Vulkan. You just assume that, given identical parameters, the two drivers will compute identical image layouts, cross your fingers, and hope for the best.

23:58 <fdobridge> <karolherbst> well..

23:58 <fdobridge> <karolherbst> sure, but...

23:58 <fdobridge> <karolherbst> I think the only sane way is to do linear unless you use private exts to transport the metadata

23:58 <fdobridge> <gfxstrand> Nope

23:58 <fdobridge> <gfxstrand> You assume both drivers do identical calculations

23:58 <fdobridge> <karolherbst> pain

23:58 <fdobridge> <gfxstrand> Yup

23:59 <fdobridge> <karolherbst> I'd rather use a private ext as we do for cl_gl_sharing 🙃

23:59 <fdobridge> <gfxstrand> That's why all the drivers have image layout libraries that are separate from the GL/Vulkan driver now.

23:59 <fdobridge> <gfxstrand> One of the reasons, anyway.

23:59 <fdobridge> <gfxstrand> But, annoyingly, there are ways that those can break because image creation is annoyingly complicated.

23:59 <fdobridge> <karolherbst> I'm wondering why we only have one stride field and not two 🙃