#dri-devel on 2022-09-19 — irc logs at oftc.irclog.whitequark.org

2022-08-14 19:45 ChanServ changed the topic of #dri-devel to: <ajax> nothing involved with X should ever be unable to find a bar

00:09 dakr has quit [Ping timeout: 480 seconds]

00:24 kts has joined #dri-devel

00:46 linearcannon has joined #dri-devel

00:51 TMM has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

00:51 TMM has joined #dri-devel

00:56 co1umbarius has joined #dri-devel

00:57 columbarius has quit [Ping timeout: 480 seconds]

01:08 Thymo_ has joined #dri-devel

01:10 Thymo has quit [Ping timeout: 480 seconds]

01:11 <linearcannon> is there any technical reason that a fairly simple framebuffer driver like "ast" could not support PRIME?

01:19 <airlied> in theory it doesn't have hw accel so it's a lot of CPU overhead

01:19 <linearcannon> for context, i'm doing some research and development work involving pure software-rendered graphics, on a SuperMicro board which uses that driver for the onboard graphics. i want to run Sway, which currently seems to require PRIME support, and i want to be able to test vgem.

01:20 <linearcannon> if possible, i'd rather do some kernel hacking and let my (rather beefy) cpu handle it, than grab a GPU that will barely actually be used

01:20 <airlied> I think there are some patches posted for ast to enable it

01:20 <airlied> no idea how it works in practice

01:21 <linearcannon> ah, so there are! somehow i missed that in my initial round of searching, i'll have to give that a shot

01:40 kts has quit [Ping timeout: 480 seconds]

01:51 <alyssa> jekstrand: We need to land https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16676 or something like it

01:52 <alyssa> I think any backend that wants OpenCL needs some flavour of that pass ... It's blocking OpenCL on Valhall at any rate

01:52 <alyssa> for naming "lower_mem_width" is my preferred bikeshed flavour (and I have my own version of !16676 with some panfrost fixes), I don't really care as long as we figure something out

01:53 <alyssa> You put a preliminary... de facto nak? on there for 2 reasons

01:53 <alyssa> #2 was about nir_opt_load_store_vectorize, I don't see any connection tbh

01:54 <alyssa> ld/st vectorize is fundamentally about combining instructions, this is fundamentally about splitting them up

01:54 <alyssa> (In other words, an optimized OpenCL implementation will want to use both)

01:55 <alyssa> #1 ("It's Intel specific") is the bigger issue ... I don't really know how to square this, because I don't know what other backends want

01:57 linearcannon has quit [Remote host closed the connection]

01:57 pallavim has joined #dri-devel

02:06 camus has joined #dri-devel

02:15 Jeremy_Rand_Talos__ has quit [Remote host closed the connection]

02:16 Jeremy_Rand_Talos__ has joined #dri-devel

02:16 camus has quit [Remote host closed the connection]

02:18 camus has joined #dri-devel

02:23 pallavim has quit [Ping timeout: 480 seconds]

02:32 nchery_ has joined #dri-devel

02:33 pallavim has joined #dri-devel

02:38 nchery__ has quit [Ping timeout: 480 seconds]

02:59 ella-0 has joined #dri-devel

03:02 ella-0_ has quit [Read error: Connection reset by peer]

03:05 camus1 has joined #dri-devel

03:09 camus has quit [Ping timeout: 480 seconds]

03:41 <jekstrand> alyssa: Yeah...

03:41 <jekstrand> alyssa: That wasn't a nak it was a "this needs clean-up on its way to core".

03:42 <jekstrand> alyssa: It pretty much just needs a callback function which says how wide to go.

03:42 <jekstrand> And we'll have to figure out how to make the wider load hack work.

03:43 <jekstrand> alyssa: Very much not a nak
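
A minimal sketch of the callback idea mentioned above, assuming a pass that splits one memory access into pieces no wider than what the backend reports. The names `mem_width_cb`, `split_access`, and `max_4_aligned` are hypothetical and are not the interface of !16676 or of any existing NIR pass:

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical callback: given the bytes still to access and the byte
 * offset of the next piece, return how many bytes the backend can access
 * with a single instruction (must be greater than zero). */
typedef unsigned (*mem_width_cb)(unsigned bytes_left, unsigned offset);

/* Split an access of `total_bytes` starting at `offset` into pieces.
 * Piece sizes are written to `out` (capacity `cap`); the return value is
 * the number of pieces produced. */
static size_t split_access(unsigned total_bytes, unsigned offset,
                           mem_width_cb cb, unsigned *out, size_t cap)
{
    size_t n = 0;

    while (total_bytes > 0) {
        unsigned w = cb(total_bytes, offset);

        assert(w > 0 && w <= total_bytes);
        assert(n < cap);

        out[n++] = w;
        offset += w;
        total_bytes -= w;
    }
    return n;
}

/* Example policy (an assumption, not any real hardware's rule): at most
 * 4 bytes per access, and each access naturally aligned to its width. */
static unsigned max_4_aligned(unsigned bytes_left, unsigned offset)
{
    unsigned w = 4;

    /* Halve the width until it fits and the offset is aligned to it.
     * This always stops at w == 1 at the latest. */
    while (w > bytes_left || (offset % w) != 0)
        w /= 2;
    return w;
}
```

With this policy, a 7-byte access at offset 1 would be split into pieces of 1, 2, and 4 bytes. The sketch only covers the splitting direction and says nothing about the "wider load" case.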

03:54 oneforall2 has quit [Ping timeout: 480 seconds]

03:54 <alyssa> :-D

03:54 oneforall2 has joined #dri-devel

04:07 LexSfX has quit []

04:11 LexSfX has joined #dri-devel

04:20 slattann has joined #dri-devel

04:24 nchery_ has quit [Ping timeout: 480 seconds]

04:39 aravind has joined #dri-devel

04:41 bmodem has joined #dri-devel

04:48 bmodem has quit []

04:52 danvet has joined #dri-devel

04:59 Duke`` has joined #dri-devel

05:17 itoral has joined #dri-devel

05:20 nchery_ has joined #dri-devel

05:21 adavy has joined #dri-devel

05:23 Namarrgon has quit [Ping timeout: 480 seconds]

05:24 tzimmermann has joined #dri-devel

05:24 fab has joined #dri-devel

05:26 dviola has joined #dri-devel

05:27 Namarrgon has joined #dri-devel

05:32 nchery_ has quit [Ping timeout: 480 seconds]

05:35 mszyprow has joined #dri-devel

05:36 Duke`` has quit [Ping timeout: 480 seconds]

05:39 srslypascal is now known as Guest914

05:39 srslypascal has joined #dri-devel

05:40 Guest914 has quit [Ping timeout: 480 seconds]

05:42 srslypascal has quit []

05:47 srslypascal has joined #dri-devel

06:09 everfree has quit [Quit: leaving]

06:10 everfree has joined #dri-devel

06:10 dviola has quit [Quit: WeeChat 3.6]

06:12 kts has joined #dri-devel

06:14 dviola has joined #dri-devel

06:25 JohnnyonFlame has quit [Ping timeout: 480 seconds]

06:29 fab has quit [Quit: fab]

06:38 itoral_ has joined #dri-devel

06:44 itoral has quit [Ping timeout: 480 seconds]

06:47 nchery_ has joined #dri-devel

06:52 jfalempe has joined #dri-devel

06:53 <tzimmermann> javierm, did you ever understand the purpose of the line at https://elixir.bootlin.com/linux/latest/source/drivers/gpu/drm/drm_simple_kms_helper.c#L112 ?

07:10 fab has joined #dri-devel

07:14 <pq> daniels, javierm, with weston's kiosk-shell, don't you need to explicitly configure the apps to run in weston.ini? Simply running a Wayland client manually won't work? Or does it?

07:15 itoral__ has joined #dri-devel

07:16 Lucretia has quit []

07:17 jkrzyszt has joined #dri-devel

07:20 Lucretia has joined #dri-devel

07:21 itoral_ has quit [Ping timeout: 480 seconds]

07:22 sergi3 has quit []

07:22 sergi has joined #dri-devel

07:24 sergi has quit []

07:24 sergi has joined #dri-devel

07:27 kts has quit [Ping timeout: 480 seconds]

07:32 lemonzest has joined #dri-devel

07:34 <javierm> pq: I have no idea :)

07:35 nchery_ has quit []

07:35 <javierm> it certainly didn't in my tests. The foot terminal started and I could see it in the scene-graph but it wasn't displayed

07:35 <javierm> daniels: oh, it seems the paste expires too quickly

07:36 nchery has joined #dri-devel

07:36 <javierm> tzimmermann: the drm_atomic_add_affected_planes() call ?

07:36 <tzimmermann> yes

07:37 <javierm> I'm reading that function now

07:37 <tzimmermann> it adds the planes for all of the state's crtcs

07:40 vliaskov has joined #dri-devel

07:42 sdutt has quit [Ping timeout: 480 seconds]

07:48 kts has joined #dri-devel

07:50 MajorBiscuit has joined #dri-devel

07:50 <javierm> tzimmermann: hmm, it seems it is already done by drm_atomic_helper_check(), so maybe it isn't needed for drivers whose struct drm_mode_config_funcs .atomic_check is set to that helper?

07:51 kts has quit []

07:53 <tzimmermann> javierm, calling it results in an atomic_update for all planes. but atomic_check for crtcs runs after atomic_check for planes. so these atomic_updates run without atomic_check. that is suspicious :/

07:54 <tzimmermann> see https://elixir.bootlin.com/linux/latest/source/drivers/gpu/drm/drm_atomic_helper.c#L895 for the atomic_check logic

07:55 dviola has quit [Quit: WeeChat 3.6]

07:55 <tzimmermann> javierm, my next thought was that it triggers the simplekms update helper at https://elixir.bootlin.com/linux/latest/source/drivers/gpu/drm/drm_simple_kms_helper.c#L243

07:55 <tzimmermann> but that should happen in any case

07:56 itoral_ has joined #dri-devel

07:57 <tzimmermann> javierm, i copied the call into several drivers and i saw that ssd130x also has it. but i think it's not required. we should be able to leave it out until we figure out what it does

07:57 <tzimmermann> i thought you might have seen its purpose

07:58 <javierm> tzimmermann: no, in ssd1306 it's cargo culting from looking at what other drivers do

07:58 <tzimmermann> :)

07:58 lynxeye has joined #dri-devel

07:59 <tzimmermann> these simplekms drivers only have one plane, so the call seems unnecessary. when i recently worked on ast, which uses two planes, it didn't do what i expected.

08:00 <javierm> tzimmermann: agreed that for simplekms it's not needed

08:01 <javierm> nor should be needed for ssd130x and simpledrm that also have 1 plane

08:02 itoral__ has quit [Ping timeout: 480 seconds]

08:05 <tzimmermann> melissawen, please see the discussion above. do you know the purpose of the call to add_affected_planes at https://elixir.bootlin.com/linux/latest/source/drivers/gpu/drm/vkms/vkms_crtc.c#L190 ?

08:11 <javierm> tzimmermann: there's also a drm_atomic_add_affected_connectors() but that's much less used by drivers

08:11 <javierm> both are called by drm_atomic_helper_check_modeset()

08:11 <tzimmermann> i know, but i've not come across it much

08:13 <javierm> the question is why the atomic state needs to be recalculated for all planes in a CRTC check? Is it because you could change the CRTC and it needs to add all planes associated with that CRTC?

08:13 <javierm> and wouldn't this be done already by drm_atomic_helper_check_modeset() if that helper is used by the driver?

08:14 <tzimmermann> from what i can tell, the calls make sense in check_modeset. connectors and planes are atomic_checked afterwards in the codepath

08:14 digetx has quit [Remote host closed the connection]

08:14 <javierm> tzimmermann: yes, that's my thinking too

08:14 <javierm> but calling it in the CRTC check seems superfluous to me

08:15 <tzimmermann> javierm, for example: if you enable a crtc, it most likely needs an active primary plane. adding the planes will guarantee that

08:15 <tzimmermann> so it's in check_modeset()

08:16 digetx has joined #dri-devel

08:19 <javierm> tzimmermann: yes, I understand why it's done by drm_atomic_helper_check(), and makes sense because after that it calls drm_atomic_helper_check_planes(dev, state)

08:20 <javierm> but don't see why drivers would need to call it again in their crtc atomic_check handler

08:20 thaytan has quit [Ping timeout: 480 seconds]

08:21 <javierm> in any case they should do in their drm_mode atomic_check if they have a custom one
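
The ordering concern discussed above can be illustrated with a toy userspace model. This is only a sketch under stated assumptions: the `toy_*` types and functions are hypothetical stand-ins and not the kernel's data structures or helpers. It models a state that tracks planes as a bitmask, a plane-check step that runs before the CRTC check, and an "add affected planes" step that can run either before the plane checks or from within the CRTC check:

```c
#include <assert.h>
#include <stdint.h>

#define TOY_MAX_PLANES 8

/* Toy stand-in for a device: which CRTC each plane is attached to. */
struct toy_device {
    int plane_crtc[TOY_MAX_PLANES];   /* CRTC index, or -1 if unattached */
};

/* Toy stand-in for an atomic state: planes that are part of the commit. */
struct toy_state {
    uint32_t plane_mask;
};

/* Pull every plane currently attached to `crtc` into the state. */
static void toy_add_affected_planes(const struct toy_device *dev,
                                    struct toy_state *state, int crtc)
{
    for (int i = 0; i < TOY_MAX_PLANES; i++)
        if (dev->plane_crtc[i] == crtc)
            state->plane_mask |= UINT32_C(1) << i;
}

/* Returns the mask of planes whose check ran. Plane checks run first and
 * the CRTC check runs afterwards, so planes added from the CRTC check
 * join the state after the plane-check step has already finished. */
static uint32_t toy_check(const struct toy_device *dev,
                          struct toy_state *state, int crtc,
                          int add_before_plane_checks,
                          int add_in_crtc_check)
{
    uint32_t checked;

    if (add_before_plane_checks)
        toy_add_affected_planes(dev, state, crtc);

    checked = state->plane_mask;      /* plane checks see these planes */

    if (add_in_crtc_check)
        toy_add_affected_planes(dev, state, crtc);

    return checked;
}
```

In this model, adding planes only from the CRTC check leaves `state->plane_mask` with bits that are missing from the returned mask, which corresponds to planes that would be updated without having been checked.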

08:28 garrison has joined #dri-devel

08:28 i-garrison has quit [Read error: Connection reset by peer]

08:29 garrison has quit []

08:30 i-garrison has joined #dri-devel

08:32 thaytan has joined #dri-devel

08:33 garnet has joined #dri-devel

08:37 jkrzyszt has quit [Remote host closed the connection]

08:40 jkrzyszt has joined #dri-devel

08:41 pcercuei has joined #dri-devel

08:42 bmodem has joined #dri-devel

08:45 <Venemo> karolherbst: who the hell is this guy who's still trolling on the rusticl MR?

08:47 bmodem1 has joined #dri-devel

08:47 pa- has quit [Ping timeout: 480 seconds]

08:50 kts has joined #dri-devel

08:52 bmodem has quit [Ping timeout: 480 seconds]

08:59 garnet has quit [Remote host closed the connection]

09:11 JohnnyonFlame has joined #dri-devel

09:25 pallavim has quit [Remote host closed the connection]

09:26 pallavim has joined #dri-devel

09:26 <MrCooper> Venemo: curro is the clover maintainer

09:27 <Venemo> MrCooper: how is that possible? he hasn't made any single commit to clover for many years

09:29 bmodem has joined #dri-devel

09:30 YuGiOhJCJ has joined #dri-devel

09:32 bmodem1 has quit [Ping timeout: 480 seconds]

09:35 <MrCooper> even so, I doubt anyone else would want to claim that title :)

09:43 <Venemo> hehe

09:45 rsalvaterra has quit []

09:46 rsalvaterra has joined #dri-devel

09:48 lemonzest has quit [Remote host closed the connection]

09:49 devilhorns has joined #dri-devel

09:50 <daniels> pq: nope, you can just start clients externally - worked fine for me

09:51 <daniels> javierm: can you please start weston as weston --log=/path/to/foo.log --logger-scopes=log,proto,drm-backend, and attach the foo.log

09:51 <pq> alright

09:56 pa has joined #dri-devel

09:59 lemonzest has joined #dri-devel

10:03 JohnnyonFlame has quit [Ping timeout: 480 seconds]

10:08 pallavim has quit [Remote host closed the connection]

10:08 pallavim has joined #dri-devel

10:14 chip_x has joined #dri-devel

10:16 <javierm> daniels: https://javierm.fedorapeople.org/weston/

10:16 chipxxx has quit [Read error: Connection reset by peer]

10:16 <javierm> to make sure that this time won't go away :)

10:17 kts has quit [Quit: Konversation terminated!]

10:19 <javierm> I've also copied there the systemd unit files I'm using to test starting weston and foot on boot

10:19 lemonzest has quit [Quit: WeeChat 3.5]

10:20 mvlad has joined #dri-devel

10:20 <javierm> pq: about seatd vs systemd the other day, I noticed that with XDG_SEAT=seat0 and PAMName=login weston can be started by systemd

10:27 <melissawen> tzimmermann, javierm, for the vkms case, we are using it to have a link between all plane_state to crtc_state and get active planes for the planes composition that the driver does and compute crc in the end

10:27 <melissawen> well... afaik

10:27 kts has joined #dri-devel

10:27 <melissawen> this commit explains the context a little: https://cgit.freedesktop.org/drm/drm-misc/commit/?id=8b1865873651d

10:27 <pq> javierm, I think XDG_SEAT should be already set by systemd. But yeah, the running Weston doc has lots about having Weston in a service unit.

10:28 <melissawen> maybe danvet has more thoughts

10:28 <pq> javierm, ideally there would be a specific PAMName for weston only, with the appropriate PAM configuration to go with it. I've been told "login" is not quite right.

10:29 <pq> javierm, but that's all more in the distro integration domain.

10:29 <emersion> i think it's to be able to configure special auth rules for weston only

10:29 <pq> yeah

10:29 <emersion> so yeah, you'd need to ship a weston PAM file as well, which includes login

10:30 <javierm> pq: AFAICT it's not in the docs yet, but in an open MR https://gitlab.freedesktop.org/wayland/weston/-/merge_requests/439

10:30 <emersion> /etc/pam.d/weston with `auth include login`

10:31 <javierm> pq: and yes, I read that specifying the seat shouldn't be needed anymore but I guess is because I'm trying to not run it from logind ?

10:31 <pq> javierm, https://wayland.pages.freedesktop.org/weston/toc/running-weston.html#running-weston-from-a-systemd-service does have some stuff

10:31 <pq> javierm, wait, so are you using seatd with weston in a systemd service unit?

10:31 <tzimmermann> melissawen, ok. thanks for your answer

10:31 <dolphin> javierm: I think you would prefer to run an user session and start foot from there

10:32 <javierm> dolphin: probably yes. I'm currently just experimenting

10:32 rasterman has joined #dri-devel

10:32 <dolphin> I'm actually doing much of the same but with sway and cog

10:32 <pq> there are three ways: weston as root from system service, weston as a user from system service, and weston as a user service (and you need to arrange that user to auto-login)

10:33 fahien has joined #dri-devel

10:33 <javierm> pq: there's a seat0 already that's started by systemd-user-sessions.service IIUC

10:33 <javierm> I'm using neither seatd nor logind

10:33 <javierm> pq: currently option 1 from your list

10:33 <dolphin> I have found the path of least resistance is to add a systemd override for getty@tty1 to autologin

10:34 <pq> javierm, yes. PAMName ideally starts a new session for the named user, activates all user services for that user, and then runs weston.

10:34 <javierm> pq: I see. Let me remove the explicit seat0 but I think that was failing without it...

10:34 <pq> what PAMName actually does depends on how PAM is configured for that PAMName

10:34 dviola has joined #dri-devel

10:35 <pq> javierm, you might be missing the service directives that get a seat and user for the service.

10:36 <javierm> Sep 19 12:35:02 fedora weston[1432]: [12:35:02.670] [libseat/backend/logind.c:704] The sd_session_get_seat() failed: -61

10:36 <pq> or it might even be dependent on VTs

10:36 <javierm> Sep 19 12:35:02 fedora weston[1432]: [12:35:02.670] [libseat/libseat.c:76] Backend 'logind' failed to open seat, skipping

10:36 <javierm> pq: well, I'm trying to run without VTs :)

10:36 <pq> I know, that's why I said the normal procedure might not work.

10:37 <javierm> pq: ah, sorry. Misunderstood

10:37 <javierm> so yeah, probably is TTYPath=/dev/tty7 what ties it with the seat

10:38 <pq> possibly, yes

10:38 <javierm> pq: wonder then if no tty should mean seat0 by default to weston

10:38 bmodem has quit [Ping timeout: 480 seconds]

10:39 <pq> I don't think it's Weston who needs XDG_SEAT set, it defaults to seat0 anyway.

10:40 <pq> it might be any of the other components

10:40 <pq> like a PAM plugin

10:40 <javierm> pq: I see. Thanks for the pointer

10:40 <javierm> I've so many knowledge gaps in the user-space graphics stack...

10:41 <pq> it's not even that, this all is session management stuff

10:41 <pq> I have huge gaps too

10:42 <kennylevinsen> javierm: want logind or seatd?

10:42 <javierm> pq: and it seems there are too many assumptions about a VT / tty? being always present in the system

10:42 <javierm> kennylevinsen: logind

10:42 <pq> also since stuff like PAM configs are distribution-specific mostly, it's even more vague

10:42 <javierm> yeah

10:43 <javierm> pq: so it seems I need to do some reading before doing more random experiments :P

10:43 <kennylevinsen> for logind you need the service to use a PAM stack (through PAMName) that calls out to pam_systemd.so

10:43 <javierm> kennylevinsen: yes, but I thought PAMName=login would be enough

10:43 <javierm> although it seems that relies on the tty to figure out the seat

10:43 <kennylevinsen> If on a systemd distro it should be enough. Try to make the service start bash instead and check loginctl output

10:44 <javierm> kennylevinsen: Ok, thanks

10:44 <kennylevinsen> then you can see what logind thinks about things, show-session is useful and sometimes you get more info by looking in /run/systemd stuff

10:45 <javierm> got it. I'll figure out the session management part then

10:45 <pq> kennylevinsen, the quirk is, javierm wants this with CONFIG_VT=n, and I don't know how to define which seat a service should take over so that logind would be happy to let it take control.

10:46 tobiasjakobi has joined #dri-devel

10:46 <kennylevinsen> XDG_SEAT I believe

10:46 itoral__ has joined #dri-devel

10:46 <kennylevinsen> Systemd-specific of course, as the name suggests >.>

10:47 <kennylevinsen> And requires udev tags set appropriately

10:47 <pq> so you need XDG_SEAT set before pam_systemd.so runs?

10:47 <kennylevinsen> yes, although I only remember faintly

10:47 <pq> well, it's the default seat, so no udev tags needed

10:47 <kennylevinsen> Also seat0 should still be default regardless

10:48 <kennylevinsen> I can have a look at it later if it still fails, need to go to a meeting

10:48 <pq> but with CONFIG_VT=n, you cannot assign a TTY to it

10:48 pallavim_ has joined #dri-devel

10:49 pallavim_ has quit [Remote host closed the connection]

10:50 <pq> pam_env.so could be used to set environment variables...

10:50 pallavim_ has joined #dri-devel

10:51 <emersion> pq, it's deprecated

10:51 <kennylevinsen> just use Environment= in the unit

10:52 <pq> kennylevinsen, does Environment apply before or after PAM stack?

10:52 <pq> since this env var is needed in the PAM stack and not in weston

10:52 <kennylevinsen> It's my best bet before I look at source. I think you can pass a debug flag to the systemd module ("debug")

10:53 itoral_ has quit [Ping timeout: 480 seconds]

10:53 tobiasjakobi has quit []

10:53 <javierm> kennylevinsen: using Environment="XDG_SEAT=seat0" in the unit file is what I did but thought that's a hacky way to do it

10:54 <pq> It's not hacky if it actually convinced logind to give control on that seat.

10:55 <pq> the service necessarily needs to take one specific seat

10:55 pallavim has quit [Ping timeout: 480 seconds]

10:56 <pq> I guess what confused me is that "normally" pam_systemd.so uses a heuristic to *set* XDG_SEAT rather than *use* XDG_SEAT.

10:57 <javierm> pq: it convinced and it starts weston on seat0, but thought that there would be another place to put that policy

10:57 <javierm> i.e: pam or logind setting a default XDG_SEAT=seat0 if no TTY or something like that

10:57 <pq> and I suppose setting the TTY is a way to trigger that heuristic.

10:58 <daniels> javierm: that's strange - are you using a packaged version of foot or git? I was running from git

10:58 <pq> I don't think "give ownership of the default seat" is something one would do by default.

10:58 <daniels> I can see that foot is making all the right requests, but for some reason it's just disappearing into the void

10:58 <pq> like if you log in via ssh, it's not seat0 - it's not any seat, IIRC

10:59 <javierm> daniels: from the f36 package: foot-1.13.1-1.fc36.x86_64

11:00 <javierm> but since it was working with the weston desktop shell, I thought that foot wouldn't be to blame

11:00 <daniels> javierm: do you mind trying from git please? it just built ootb for me at least

11:00 <daniels> yeah

11:00 <javierm> daniels: sure, one min

11:00 <daniels> thanks!

11:03 ahajda has joined #dri-devel

11:05 <javierm> daniels: same result with latest HEAD, commit https://gitlab.com/dnkl/foot/-/commit/debf1b8453ada57e69ec86fcb3fcb9ebf140d218

11:05 <javierm> daniels: this is a VM with virtio GPU in case that matters

11:06 <javierm> pq: got it. Makes sense then to explicitly set that for this service file

11:08 <pq> javierm, if you suspect GPU might have a problem, pass --use-pixman to Weston. Then it will use software rendering and DRM dumb buffers.

11:09 <pq> that then causes foot to use software rendering as well, if it wasn't already.

11:14 bmodem has joined #dri-devel

11:14 <javierm> pq: thanks, same result with pixman backend. Need to dig further but the strange thing is that foot is happy running but just not displayed

11:15 <javierm> anyways, need to work on more boring stuff now :) thanks to all folks for your assistance

11:15 <pq> no prob

11:15 <pq> it could be just kiosk-shell deciding to not show foot for some reason, if it's not present in the scenegraph.

11:18 <daniels> javierm: oh, I see the issue ...

11:19 <daniels> [atomic] drmModeAtomicCommit

11:19 <daniels> [repaint] flushed pending_state 0x20a2260

11:20 <daniels> somehow we've got ourselves into a state where we haven't committed anything, but seem to still be expecting a repaint event

11:20 <daniels> (this exact usecase wfm on i915 btw)

11:20 <daniels> hmm, maybe

11:27 vliaskov has quit [Remote host closed the connection]

11:28 vliaskov has joined #dri-devel

11:30 <vsyrjala> airlied: danvet: could you lay down some law for christian koenig? really getting fed up with him constantly pushing untested patches and breaking i915

11:30 Daanct12 is now known as Danct12

11:34 bmodem1 has joined #dri-devel

11:40 bmodem has quit [Ping timeout: 480 seconds]

11:44 pallavim_ has quit [Ping timeout: 480 seconds]

11:44 pallavim_ has joined #dri-devel

11:53 dakr has joined #dri-devel

11:58 abws has joined #dri-devel

11:59 pallavim_ has quit [Remote host closed the connection]

11:59 pallavim_ has joined #dri-devel

12:08 itoral__ has quit [Remote host closed the connection]

12:15 bmodem has joined #dri-devel

12:15 <kennylevinsen> pq, javierm: pam_systemd.so reads a lot of environment variables: https://github.com/systemd/systemd/blob/main/src/login/pam_systemd.c#L762

12:18 <kennylevinsen> and it has a bunch of "fun" heuristics, like seeing if there's an X11 server running, figuring out its controlling tty, converting that to a VT number and deciding the seat from that

12:18 <kennylevinsen> telling it explicitly is *definitely* better than relying on all that magic :P

12:20 bmodem1 has quit [Ping timeout: 480 seconds]

12:22 fahien has quit [Ping timeout: 480 seconds]

12:24 <javierm> kennylevinsen: yes, agree

12:25 <javierm> kennylevinsen: I guess this is similar to when we disabled all fbdev drivers in favour of simpledrm in fedora, a bunch of stuff broke and needed fixes because it was assuming that an early fbdev would exist

12:25 <javierm> gdm, plymouth, etc

12:27 fahien has joined #dri-devel

12:37 <pq> kennylevinsen, that makes sense now. Before I had no idea you even could tell pam_systemd the seat. :-)

12:37 <kennylevinsen> The More You (never wanted to) Know™

12:47 saurabhg has joined #dri-devel

12:50 fab has quit [Ping timeout: 480 seconds]

12:57 fab has joined #dri-devel

13:01 jewins has joined #dri-devel

13:23 mbrost has joined #dri-devel

13:25 abws has quit [Quit: abws]

13:26 mattst88 has quit [Read error: Connection reset by peer]

13:26 mattst88 has joined #dri-devel

13:32 Company has joined #dri-devel

13:35 rgallaispou has quit [Quit: Leaving.]

13:38 kts has quit [Ping timeout: 480 seconds]

13:41 <karolherbst> sooo.. to get iris pass the CL CTS I need !15811 !16442 and !18670

13:41 bmodem has quit [Read error: Connection reset by peer]

13:41 <karolherbst> would be cool if somebody could take a look to review/merge those

13:42 bmodem has joined #dri-devel

13:44 rgallaispou has joined #dri-devel

13:44 kts has joined #dri-devel

13:44 pcercuei has quit [Read error: Connection reset by peer]

13:45 pcercuei has joined #dri-devel

14:01 fxkamd has joined #dri-devel

14:03 chip_x has quit [Read error: No route to host]

14:09 Dr_Who has joined #dri-devel

14:15 anarsoul has quit [Quit: ZNC 1.8.2 - https://znc.in]

14:15 mbrost has quit [Read error: Connection reset by peer]

14:15 anarsoul has joined #dri-devel

14:18 fab has quit [Quit: fab]

14:19 fxkamd has quit []

14:21 sinatosk has joined #dri-devel

14:23 sdutt has joined #dri-devel

14:24 <DemiMarie> Could drmlog be used on panic? Windows at least manages to get text out on BSOD.

14:25 lemonzest has joined #dri-devel

14:25 <karolherbst> there were some ideas on showing a QR code on panics, but not sure where that went

14:25 YuGiOhJCJ has quit [Quit: YuGiOhJCJ]

14:25 sinatosk has quit []

14:26 sinatosk has joined #dri-devel

14:26 <DemiMarie> Another thought I had was for userspace to provide the kernel with an asymmetric encryption key during boot. In the case of a panic, Linux would use that to encrypt a crash dump to swap.

14:26 <karolherbst> there are ways of doing that already, but nothing of that is really user friendly atm

14:26 <DemiMarie> yeah

14:27 sinatosk has quit []

14:27 <karolherbst> but doing anything fs related once you've crashed the kernel is also quite dangerous

14:27 <karolherbst> what if you trash the fs?

14:27 <DemiMarie> I was thinking swap partition

14:27 <DemiMarie> but kexec might be the better solution, as is so often the case.

14:27 <karolherbst> well... you can have memory corruptions or weirdo locks taken

14:27 <karolherbst> yeah, kexec already solved some of the issues

14:28 <karolherbst> just that it's a mess if the graphics driver crashed :)

14:28 <karolherbst> or the GPU driver not able to use the GPU after kexec

14:28 <karolherbst> DemiMarie: there is kdum btw

14:28 <karolherbst> *kdump

14:29 rgallaispou has quit [Read error: Connection reset by peer]

14:29 <DemiMarie> karolherbst: yes, and as far as I can tell it is unencrypted so nobody shipping to end-users actually uses it

14:29 <DemiMarie> Encryption would make it possible to use in practice

14:30 <DemiMarie> karolherbst: How common is this?

14:30 <karolherbst> it's more confusing than a real problem

14:30 <karolherbst> so when kdump takes the dump, users might force reboot because black screen and such

14:30 <DemiMarie> What is?

14:30 <karolherbst> crashed GPU driver

14:31 alyssa has left #dri-devel [#dri-devel]

14:39 alyssa has joined #dri-devel

14:40 JohnnyonFlame has joined #dri-devel

14:40 <alyssa> jekstrand: So, panfrost needs to disable some optimizations for compute kernels if shared memory is used

14:40 <alyssa> (workgroup local memory)

14:41 <alyssa> namely, if workgroup local memory is used, then the hardware cannot split up or merge together workgroups that are too small or too large

14:41 fahien1 has joined #dri-devel

14:41 <alyssa> so we need a way to detect whether shared memory is used

14:41 <alyssa> nominally, `nir->info.shared_size > 0` is that check ... but that doesn't work!

14:42 <alyssa> because in OpenCL, `nir->info.shared_size` can be 0, with a variable amount of shared memory allocation at enqueue time!

14:42 <alyssa> so I guess I have 2 options here

14:42 <alyssa> one is to extend nir_gather_info to statically check for any shared memory intrinsics and just report a bool of "can this shader possibly use shared memory?"

14:43 os369510 has joined #dri-devel

14:43 fahien has quit [Ping timeout: 480 seconds]

14:43 <alyssa> the other is to pinky-promise not to ever look at nir->info.shared_size at compile-time and move the "can shared memory be used?" check to enqueue-time when we actually have the shared size.

14:43 <alyssa> the latter is a lot lazier and should work fine for my hardware

14:44 <alyssa> unsure whether we want this solved in NIR more properly, though, because nir->info.shared_size is a huge footgun for every driver that wants to eventually support CL
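
The two options above can be sketched in a toy form. This is an illustration only: the `toy_op` enum and the functions below are hypothetical and do not reflect NIR's intrinsics or the nir_gather_info interface. The first function is the static scan ("can this shader possibly use shared memory?"); the second is the enqueue-time check, which assumes the dynamic allocation size is known by then:

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical opcode set standing in for a shader IR's operations. */
enum toy_op {
    TOY_OP_ALU,
    TOY_OP_LOAD_GLOBAL,
    TOY_OP_STORE_GLOBAL,
    TOY_OP_LOAD_SHARED,
    TOY_OP_STORE_SHARED,
    TOY_OP_SHARED_ATOMIC,
};

/* True for any operation that reads or writes workgroup-local memory. */
static bool toy_op_touches_shared(enum toy_op op)
{
    switch (op) {
    case TOY_OP_LOAD_SHARED:
    case TOY_OP_STORE_SHARED:
    case TOY_OP_SHARED_ATOMIC:
        return true;
    default:
        return false;
    }
}

/* Option 1: compile-time scan over the whole (fully inlined) shader. */
static bool toy_may_use_shared(const enum toy_op *ops, size_t count)
{
    assert(ops != NULL || count == 0);

    for (size_t i = 0; i < count; i++)
        if (toy_op_touches_shared(ops[i]))
            return true;
    return false;
}

/* Option 2: enqueue-time check, once the variable allocation is known. */
static bool toy_shared_in_use(unsigned static_size, unsigned dynamic_size)
{
    return static_size > 0 || dynamic_size > 0;
}
```

The static scan is conservative: it reports true even if the shared access is never reached at run time, while the enqueue-time check depends on the caller passing the real sizes.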

14:49 cphealy has joined #dri-devel

14:49 rgallaispou has joined #dri-devel

14:50 fahien1 is now known as fahien

14:52 anarsoul has quit [Read error: Connection reset by peer]

14:53 anarsoul has joined #dri-devel

14:56 <jekstrand> alyssa: Yeah...

14:56 <jekstrand> alyssa: Does the workgroup merging stuff involve re-compiles?

15:01 <alyssa> No, not for Mali at least

15:02 <alyssa> ir3 is the only hw in-tree that actually requires recompiles based on shared size

15:02 kts has quit [Ping timeout: 480 seconds]

15:03 xroumegue has quit [Ping timeout: 480 seconds]

15:03 mszyprow has quit []

15:07 <karolherbst> alyssa: this is totally fine though, CL allows you to report the max workgroup size based on what the shader has

15:07 <karolherbst> but yeah.. having a variable amount of shared mem can be problematic, as you have to assume the worst case I think

15:10 <karolherbst> there is one thing I am still wondering about: apparently you can declare a sized local mem array in a CL kernel, but I don't actually know what the compiler is making out of this

15:11 alanc has quit [Remote host closed the connection]

15:11 alanc has joined #dri-devel

15:12 xroumegue has joined #dri-devel

15:12 <karolherbst> ehh maybe not

15:13 <karolherbst> alyssa: I think it might make sense to add another info field: has_variable_shared_mem and nir_gather_info could set it

15:14 <karolherbst> it's a bit tricky to figure that out though

15:14 Duke`` has joined #dri-devel

15:15 mbrost has joined #dri-devel

15:15 <karolherbst> the frontend knows this though, so we could make it part of the gallium API

15:17 bmodem has quit []

15:18 bmodem has joined #dri-devel

15:21 * alyssa shrugs

15:21 <alyssa> karolherbst: The commit I linked from the low-overhead MR deals with my panfrost problem

15:21 <karolherbst> yeah, I already saw it

15:21 <alyssa> I don't *want* to kick the can down the road but also I am quickly running out of OpenCL time for the month :-p

15:21 <karolherbst> I think it's fine to make runtime decisions at runtime and not compile time

15:22 <karolherbst> what does that merging affect btw, block size?

15:22 <karolherbst> I am planning to rework all the workgroup info stuff based on my MR to actually allow drivers to report back runtime info so I don't have to assume the worst case (subgroup size)

15:23 <karolherbst> and I also want to hook up last_block :)

15:23 <alyssa> nod

15:26 bmodem has quit [Ping timeout: 480 seconds]

15:29 mbrost has quit [Ping timeout: 480 seconds]

15:29 slattann has quit []

15:30 rgallaispou has left #dri-devel [#dri-devel]

15:31 os369510 has quit [Remote host closed the connection]

15:33 rgallaispou has joined #dri-devel

15:36 tzimmermann has quit [Quit: Leaving]

15:39 <pinchartl> sravn: "[PATCH v1 0/12] drm bridge updates" looks very nice. sorry for not noticing it earlier. I only have a small comment on 05/12. I think you can apply patches 01/12 to 11/12 (excluding 07/12 as the issue it addresses has already been fixed in drm-misc)

15:41 ybogdano has joined #dri-devel

15:45 anarsoul has quit []

15:45 anarsoul has joined #dri-devel

15:51 rgallaispou has left #dri-devel [#dri-devel]

15:53 <jenatali> alyssa: sounds like what you want is a scan for whether there's any ops on shared memory

15:53 <alyssa> jenatali: that's the gather_info change I suggested

15:54 <jenatali> (our backend has to do that currently because DXIL is invalid if it declares unused shared memory...)

15:54 <alyssa> but then I realized instead of typing out a 50 line patch to common code I can just do a 1 line patch to panfrost and ignore the whole mess :D

15:54 <jenatali> So if you did put it in common I could use it instead of our current scan, but it's not complicated so w/e

15:55 <alyssa> your scan seems to be all _dxil ops

15:55 <jenatali> Oh, sure, but those are just the same as the common ones but with uint offsets iirc

15:56 <jenatali> Instead of byte offsets

15:56 <alyssa> nod

15:56 <karolherbst> jenatali: it will become complicated once we don't inline everything

15:56 <alyssa> That reminds me I need to introduce load_global_agx and friends..

15:57 <jenatali> karolherbst: Yeah, DXIL requires everything to be inlined currently though (I think) so if the frontend didn't, I'd do it in the backend for now

15:57 <karolherbst> but it's really easy to actually know this: if the kernel has local mem args, there is variable shared mem

15:57 <karolherbst> end of story
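The rule karolherbst states can be sketched in a few lines of C. This is only an illustration, not Mesa or rusticl code: `arg_kind`, `kernel_arg` and `has_variable_shared_mem` are hypothetical names. The one external fact it relies on is that in OpenCL the size of a `__local` pointer argument is supplied by the host at `clSetKernelArg` time, so it is not known when the kernel is compiled.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical classification of kernel arguments, for illustration only. */
enum arg_kind { ARG_VALUE, ARG_GLOBAL_PTR, ARG_CONSTANT_PTR, ARG_LOCAL_PTR };

struct kernel_arg {
    enum arg_kind kind;
};

/* A kernel has a variable amount of shared memory exactly when at least one
 * argument is a pointer into local memory: the host picks that size later. */
static bool has_variable_shared_mem(const struct kernel_arg *args, size_t count)
{
    for (size_t i = 0; i < count; i++) {
        if (args[i].kind == ARG_LOCAL_PTR)
            return true;
    }
    return false;
}
```

Since this check needs only the argument list, a frontend could evaluate it without scanning the shader body, which would fit the earlier remark that the frontend already knows the answer.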

15:58 <alyssa> actually what I want is a formatted memory load for AGX

15:58 <karolherbst> huh?

15:58 <alyssa> karolherbst: AGX's memory loads are formatted

15:58 <karolherbst> formatted how?

15:59 <alyssa> i8, i16, f16, i32, rgba8, rgb10a2, ...

15:59 <karolherbst> ahh images

15:59 <alyssa> no

15:59 <robclark> karolherbst: re: crashes vs fs.. chromebooks have console-ramoops so most cases we can get previous dmesg after a (warm) reboot.. which along w/ suzyq is a pretty big improvement vs debugging windows laptops ;-)

15:59 <alyssa> just memory

15:59 <karolherbst> alyssa: like.. you have to load differently depending on the datatypes?

15:59 <karolherbst> so loading a f32 with a i8v4 load would get you different results or something?

16:00 <karolherbst> robclark: sure.. but samsung (or some OEM) had buggy UEFI so people don't rely on that in distributions...

16:00 <karolherbst> there is actually UEFI storage for this purpose

16:00 <karolherbst> but no

16:00 <karolherbst> one had to screw it up

16:00 <alyssa> output_type load(base_address, offset, extra_shift, format) {

16:00 <alyssa> format *array = (format *) base_address;

16:00 <alyssa> return array[offset << extra_shift] as output_type;

16:00 <alyssa> }

16:01 <alyssa> that is roughly the hardware behaviour

16:01 <karolherbst> okay.. so the format doesn't matter

16:01 <alyssa> yes, it does

16:01 <alyssa> there's a format conversion from the memory format to the register format

16:01 <karolherbst> would it break stuff if you load all 32 bit types as u32?

16:02 <alyssa> never mind
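For readers following the exchange above, alyssa's pseudo-code can be turned into compilable C roughly as follows. It is a sketch under assumptions: the `mem_format` enum covers only three integer formats, the register is modelled as a sign-extended 32-bit value, and none of the names come from the AGX driver.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical subset of memory formats; names are illustrative only. */
enum mem_format { FMT_I8, FMT_I16, FMT_I32 };

static size_t format_size(enum mem_format fmt)
{
    switch (fmt) {
    case FMT_I8:  return 1;
    case FMT_I16: return 2;
    case FMT_I32: return 4;
    }
    return 0;
}

/* Treat base as an array of `fmt` elements, read element
 * (offset << extra_shift), and convert it to the register type.
 * Sign extension to 32 bits is an assumption made for this sketch. */
static int32_t formatted_load(const void *base, uint32_t offset,
                              unsigned extra_shift, enum mem_format fmt)
{
    assert(extra_shift < 32);
    size_t index = (size_t)offset << extra_shift;
    const uint8_t *addr = (const uint8_t *)base + index * format_size(fmt);

    switch (fmt) {
    case FMT_I8:  { int8_t v;  memcpy(&v, addr, sizeof v); return v; }
    case FMT_I16: { int16_t v; memcpy(&v, addr, sizeof v); return v; }
    case FMT_I32: { int32_t v; memcpy(&v, addr, sizeof v); return v; }
    }
    return 0;
}
```

The format sets both the element size used for addressing and the conversion applied to the loaded value, which is why reading the same memory with a different format does not give the same register contents.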

16:03 <karolherbst> the heck... the luxmark v3.1 C++ impl is faster than CL even on intel :D

16:08 heat has joined #dri-devel

16:09 heat has quit [Remote host closed the connection]

16:10 heat has joined #dri-devel

16:10 aravind has quit [Ping timeout: 480 seconds]

16:15 <zmike> is there like a msaa version of glxgears somewhere?

16:15 <zmike> or some other very simple msaa-using app?

16:16 <karolherbst> zmike: glxgears -samples ?

16:16 <zmike> oh wow

16:16 <zmike> incredible

16:17 <alyssa> wild

16:18 TMM has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

16:18 <dj-death> OMG

16:18 TMM has joined #dri-devel

16:19 <karolherbst> okay.. so luxmark v3.1 ranking: 1. pocl 2. iris 3. llvmpipe 4. intel NEO: it crashes

16:20 lynxeye has quit [Quit: Leaving.]

16:20 <karolherbst> uhm.. well their C++ is even faster than pocl, but that's not CL, so it's clearly sheating

16:20 <karolherbst> *cheating

16:20 <karolherbst> also.. a good data point on the state of CL in general

16:27 <anholt> would be curious to see clvk in that mix

16:31 ybogdano has quit [Ping timeout: 480 seconds]

16:32 <karolherbst> yeah... wouldn't be surprised if it's faster than Intel's stack actually, but it's crashing and I have no idea why...

16:34 jhli has quit [Remote host closed the connection]

16:36 saurabh_1 has joined #dri-devel

16:36 fab has joined #dri-devel

16:38 MajorBiscuit has quit [Ping timeout: 480 seconds]

16:41 saurabhg has quit [Ping timeout: 480 seconds]

16:48 <MrCooper> karolherbst: FYI, F37 has LLVM 15 but still spirv-llvm-translator 14, so rusticl (or opencl-spirv) doesn't build

16:48 <karolherbst> uhhh.. :(

16:49 <karolherbst> guess we'll have to make sure the toolchain people keep an eye on that and don't break it :/

16:49 <karolherbst> MrCooper: though there are llvm-14 packages around, no??

16:50 <karolherbst> well.. llvmpipe is busted with llvm-15 anyway

16:50 <karolherbst> maybe only for CL

16:50 <MrCooper> indeed, there are llvm14-* packages

16:51 ybogdano has joined #dri-devel

16:54 <kisak> Debian/Ubuntu also has strange spirv-llvm tooling

16:56 danvet has quit [Read error: No route to host]

16:59 heat_ has joined #dri-devel

17:00 <tjaalton> kisak: strange how?

17:00 heat has quit [Read error: No route to host]

17:00 devilhorns has quit []

17:01 danvet has joined #dri-devel

17:07 <alyssa> panfrost/rusticl needs llvm 15 for conformance

17:09 <karolherbst> yeah... though I think llvm-14 is enough

17:09 saurabh_1 has quit [Ping timeout: 480 seconds]

17:09 <karolherbst> actually...

17:09 <karolherbst> yeah.. should be

17:10 <alyssa> I think there was some bug fix in -15 we needed

17:10 <karolherbst> not directly

17:10 <karolherbst> the fix we needed was when dropping opencl-c.h

17:11 <karolherbst> but I gated that with llvm-15

17:11 <karolherbst> so older llvm should be fine, it just takes longer to compile kernels

17:13 <alyssa> oh right yes

17:16 <karolherbst> atm I am on llvm-14, because that's where llvmpipe isn't broken and it seems like none of the fails are llvm related

17:16 <kisak> tjaalton: llvm-toolchain-# needs to be built for spirv-llvm-translator-# to be built to *rebuild* llvm-toolchain-# with all the bits needed for libclc and it's not using the same release iteration between the two.

17:17 <karolherbst> kisak: well.. it hardly matters from which version libclc is from though

17:17 <turol> zmike: I don't see any way to make vk_dispatch_table const in vk_device

17:18 <turol> the compiler won't allow it even with egregious abuse of pointer casts

17:19 <DavidHeidelberg[m]> Should we start looking for LLVM 15 in CI?

17:19 <kisak> karolherbst: right, what matters is that there's *.spv bits non-deterministically missing from the llvm build, depending on whether there was a test rebuild of the distro release before reaching production. This is quite nasty in a PPA environment where I want to get rid of the retired llvm version, but it's technically needed for the older llvm-spirv-#

17:19 <zmike> turol: I wonder if we're going about that wrong and should instead collapse the table so it isn't a pointer

17:20 ngcortes has joined #dri-devel

17:20 <turol> i think it's not possible to do it this way because of c aliasing rules

17:21 <karolherbst> DavidHeidelberg[m]: after llvmpipe is fixed https://gitlab.freedesktop.org/mesa/mesa/-/issues/6735

17:21 <turol> since the calls to vulkan functions are external, it doesn't matter how much restrict or const we put there

17:22 <zmike> it shouldn't matter whether they're local or external, only whether the pointer gets reevaluated

17:22 <turol> the compiler is not allowed to treat the pointer as hoistable

17:24 <zmike> adding to just dev and the function pointers themselves wasn't enough, I assume?

17:24 iive has joined #dri-devel

17:24 <turol> i did not try changing the function pointers

17:25 <turol> adding restrict to the function pointer members is apparently not allowed

17:26 <zmike> oh huh

17:27 <turol> since vk_dispatch_table is directly part of vk_device it should have worked when changing the definition of dev if it was going to
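One way to read turol's point about hoisting is sketched below in C. The types are stand-ins invented for this note (`dispatch_table`, `device`, `draw_v1`, `draw_v2`); they are not the real `vk_dispatch_table` or `vk_device` definitions. The sketch shows a call made through a table slot that rewrites that same slot, which is the kind of behaviour a compiler has to allow for whenever the callee is opaque.

```c
#include <assert.h>

struct device;

/* Stand-in for a dispatch table: a struct of function pointers. */
struct dispatch_table {
    int (*draw)(struct device *dev);
};

/* Stand-in device with the table embedded directly in it. */
struct device {
    struct dispatch_table table;
    int calls;
};

static int draw_v2(struct device *dev)
{
    dev->calls++;
    return 2;
}

/* An entry point that rewrites the slot it was called through. */
static int draw_v1(struct device *dev)
{
    dev->calls++;
    dev->table.draw = draw_v2;
    return 1;
}

/* Two calls through the same slot. The slot must be read again for the
 * second call, because the first call was allowed to change it. */
static int draw_twice(struct device *dev)
{
    int first = dev->table.draw(dev);
    int second = dev->table.draw(dev);
    return first * 10 + second;
}
```

Here `draw_twice` returns 12 rather than 11. With external callees the compiler cannot rule this pattern out, so it keeps the second load of the slot; qualifying the pointer used to reach the table does not change what the callee is permitted to do.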

17:27 anholt has quit [Ping timeout: 480 seconds]

17:37 anholt has joined #dri-devel

17:56 <swick> emersion: am I blind or is the DRA format layout not documented?

18:05 pallavim_ has quit [Ping timeout: 480 seconds]

18:10 <airlied> MrCooper: someone is fixing f37, just had a build dep to sort out

18:27 jkrzyszt has quit [Ping timeout: 480 seconds]

18:35 jhli has joined #dri-devel

18:35 <karolherbst> airlied: how does it look like with the coro stuff btw?

18:37 <karolherbst> Venemo: I think it's best to ignore the rusticl MR now.. at least I won't respond because it's really just a waste of time :(

18:40 <Venemo> karolherbst: yeah the dude seems to be just trolling now

18:41 <Venemo> at least it is difficult to believe that he truly doesn't understand what everyone else there is saying

18:41 <karolherbst> Venemo: the sad part I think is, that I don't think it's trolling... :(

18:42 <karolherbst> just a huge disconnect between "technical reasons" and social aspects + project governance/maintenance

18:42 <Venemo> yeah but his 'technical concerns' were already discussed to death

18:42 <eric_engestrom> might not be intended as trolling, but it's basically indistinguishable from it now

18:42 <karolherbst> if one thinks "technical" points stand above all you get such a discussion

18:43 <karolherbst> anyway.. it's pointless and I just hope that similar damage isn't being done in the intel compiler space by the same person :/

18:44 <alyssa> Venemo: FWIW I'm inclined to nak rust gfx frontends for the "bindings hell" reason

18:44 <alyssa> then again I'm also of the boring school of thought that gallium is for gl+cl and nothing else :-p

18:44 <karolherbst> I was so close 🤏 to just saying the "damage" which was done was that I wasn't willing to put up the emotional strength to convince him of any important clover change, but then I was thinking: why should I even bother now

18:45 <Venemo> yeah

18:45 <karolherbst> alyssa: let's.... talk about this once somebody suggests it :D

18:45 <alyssa> karolherbst: glide

18:45 <karolherbst> jo.. right

18:45 <karolherbst> though not sure if we really want to take that one :D

18:46 <airlied> karolherbst: I got distracted building llvm yesterday, but it looks like something I fixed previously which makes me wonder if some version confusion is happening

18:46 <alyssa> Venemo: I'm also very reluctant around binding NIR to Rust, which is a shame because Rust has a lot of nice features for backend compilers

18:46 <ajax> i just had the thought "do i need to become the glide maintainer so we can stop arguing about this"

18:46 <alyssa> (ADTs/match alone is a Big thing. I guess C++ can do that these days.)

18:46 <ajax> and had horrible vertigo from being so glad to finally get to _stop_ shipping glide

18:46 <alyssa> ajax: wait which side are you on? :-p

18:46 <Venemo> alyssa: I see

18:47 <karolherbst> well.. nir bindings is a problem for future us and I'd keep it like this until we cross that bridge :)

18:47 <alyssa> Yeah

18:47 <karolherbst> I do use some stuff of nir and I suspect it might become more and more over time

18:47 <karolherbst> and we'll get a better feeling around stuff over time

18:47 <alyssa> Practically I want to leave rusticl in tree but not merge new Rust components in the medium term, so we can figure out as a community what the actual pain points are instead of the theoretical ones

18:48 <karolherbst> anyway.. until bindgen supports static inlines it's way too painful anyway

18:48 <karolherbst> correct

18:48 <alyssa> and after rusticl has survived a few "rip up all of mesa and rewrite it" MRs from zmike or mareko, we'll have a lot more data to work with for the future :P

18:48 <karolherbst> also I am sure it will make more sense in the future if more people are used to seeing, reading and changing rust code

18:48 <zmike> got one of those coming in hot

18:49 <karolherbst> joooo

18:49 <zmike> bout to end this frontend's whole career

18:49 <karolherbst> though rusticl's surface area is really not that huge

18:49 <karolherbst> :D

18:49 <karolherbst> that reminds me.. I still wanted to wire up spirv and the more I think about it the less I am convinced it's not a huge pita

18:50 <alyssa> zmike: gogogogo

18:50 <alyssa> karolherbst: good luck lol

18:50 <alyssa> for zink+rusticl to replace clvk, or?

18:50 <karolherbst> so one of the biggest advantages of doing all that funny stuff in nir is, that I can do proper DCE of kernel params

18:50 <anholt> novice question: I've got a dlopen("libEGL.so"), and LD_LIBRARY_PATH at the point of the call is pointing to my mesa build dir that does have a libEGL.so, and yet /home/anholt/src/angle/out/arm64-Release/libEGL.so gets loaded. what could get in the way of my LD_LIBRARY_PATH?

18:50 <karolherbst> but with spir-v.... that might become more of an issue

18:50 <karolherbst> though I _could_ do some optimizations on a spir-v level

18:50 <anholt> (this is all in service of doing angle vs zink shootout on real workloads)

18:50 <zmike> anholt: not seeing icd params?

18:50 <zmike> setting*

18:50 <anholt> what do you mean by icd params?

18:51 <zmike> like __EGL_VENDOR_LIBRARY_FILENAMES

18:51 <karolherbst> alyssa: yes... and the idea was that we just pass in the CL SPIRV into the vulkan runtime

18:51 <anholt> zmike: dlopen() doesn't look at that

18:51 <alyssa> anholt: strace?

18:51 <zmike> ohh I see

18:51 <jenatali> karolherbst: I really don't think that's a good idea, personally

18:51 <karolherbst> I'd be inclined to ignore that DCE issue, but I know kernels where like 80% of the params are actually dead :(

18:52 <zmike> if it's directly linked?

18:52 <karolherbst> yes

18:52 <zmike> otherwise I'd check LD_DEBUG=all

18:52 <zmike> which I'd guess you've done

18:52 <alyssa> or that. usually I just use strace because I am a horse who only knows 1 trick :-p

18:52 <zmike> so...I'm out of ideas if it's not any of those

18:52 <karolherbst> ahh.. different problem

18:52 <anholt> zmike: info sharedlibrary shows it not loaded before the call. it shows the wrong egl loaded after the call.

18:52 <jenatali> karolherbst: You've seen how massive/complex some of those kernels can be, and you'd just be pushing the burden down to the Vk driver to deal with it instead of handling it in zink

18:52 <alyssa> (and strace shows exactly what paths get tried and the errnos)

18:52 <karolherbst> anyway... being able to cut the input buffers size by quite a lot is a huge advantage

18:52 <zmike> the exe might just be directly linked to angle

18:52 <zmike> I've had issues with that in the past

18:53 <karolherbst> jenatali: yeah....

18:53 <anholt> alyssa: yeah, strace shows it looking in exactly that directory.

18:53 <zmike> not sure I ever resolved it

18:53 <karolherbst> but zink has to convert that nir to spir-v which might be fine actually

18:53 <anholt> zmike: again, ldd and gdb's info sharedlibrary show it not linked to it at the point of dlopen() being called.

18:53 Haaninjo has joined #dri-devel

18:53 <alyssa> zmike: anholt is talking about .so's, I don't think you can directly link dynamic libraries (not .a's)

18:53 <jenatali> Yeah you get a round trip through nir, but as a result you get one place to deal with all of CL's craziness

18:53 <karolherbst> now that I replaced load_kernel_input by load_ubo it's not even a huge deal anymore, except load_global lowering

18:53 <alyssa> anholt: How did you teach your system about that ANGLE path in the first place?

18:54 <karolherbst> _but_

18:54 <alyssa> (I've never built ANGLE)

18:54 fahien has quit [Ping timeout: 480 seconds]

18:54 <karolherbst> maybe we can just use a few CL features in vk spv and write a vk_spv_cl_instructions extension or something...

18:54 <anholt> alyssa: the angle path is getting injected somehow by the angle build system.

18:54 <karolherbst> and that's limited to deal with global loads and other trivial things

18:54 <karolherbst> so we don't need ssbo lowering

18:54 <jenatali> That sounds like a better idea

18:55 <alyssa> anholt: dealing with bazel is above my pay grade

18:55 * alyssa taps out

18:55 <zmike> yeah now the ptsd is coming back to me

18:55 <karolherbst> also.. now that I create the CSO when the kernel is created, all that conversion overhead doesn't even matter :)

18:55 <zmike> I don't think I ever solved this issue when I ran into it previously

18:55 <alyssa> karolherbst: you lower load_kernel_input to load_ubo and then agx backend will lower load_ubo to load_global_constant and then agx backend pass 2 will lower load_global_constant to load_global_constant_agx ... layers, lol

18:55 <karolherbst> :P

18:55 Haaninjo has quit [Read error: Connection reset by peer]

18:55 <alyssa> (I guess the latter lowerings could be combined, meh)

18:56 <karolherbst> I want to get rid of load_global_constant anyway I think

18:56 Haaninjo has joined #dri-devel

18:56 <alyssa> why?

18:56 <alyssa> replace it with access flags on load_global?

18:56 camus1 has quit [Read error: Connection reset by peer]

18:56 <karolherbst> because it's literally load_global, just promising the data won't change

18:56 camus has joined #dri-devel

18:56 <alyssa> the load_global -> load_global_agx lowering will be needed to handle 8-bit at any rate

18:56 <anholt> ah. rpath is the answer. and today I learned that rpath beats ld_library_path.

18:56 <alyssa> AGX doesn't do 8-bit at all

18:57 rasterman has quit [Quit: Gettin' stinky!]

18:57 <jenatali> alyssa: DXIL doesn't either :(

18:57 <karolherbst> do we have an access flag for constant data?

18:57 <alyssa> nir's alu bitsize lowering can get rid of all the 8-bit ALU except for u2u8/u2u16

18:57 <karolherbst> guess we could add one...

18:57 <alyssa> but you're still expected to handle the conversions and be able to do 8-bit loads/stores

18:57 <karolherbst> anyway

18:57 <karolherbst> load_global_constant should really become load_ubo as well

18:58 <karolherbst> rusticl could even keep track of what buffers are accessed the most and lower some to load_global if we get above limits

18:58 <alyssa> AGX doesn't do 8-bit loads/stores either in terms of registers -- but it has an i8 memory format!

18:58 <turol> question for radv developers

18:58 <karolherbst> or it's all indirect ubo

18:58 <turol> https://gitlab.freedesktop.org/mesa/mesa/-/blob/main/src/amd/vulkan/si_cmd_buffer.c#L895

18:58 <alyssa> OCOC

18:58 <turol> checks wd_switch_on_eop

18:58 <turol> but it's set after that on line 914

18:59 <karolherbst> I don't have a good idea on how to deal with constants, because a driver can literally bind the same constant buffer unlimited times

18:59 <turol> i would expect first all things which set it, then things which check it

18:59 <turol> bug or intentional?

18:59 <alyssa> so can lower `8 ssa_1 = load_global` to `16 ssa_0 = load_global format i8; 8 ssa_1 = i2i8 ssa_0`, and then the optimizer can clean up the conversions

18:59 <alyssa> I think.

18:59 <jenatali> alyssa: Stores are harder, you can't lower to 16bit stores

19:00 <alyssa> jenatali: Yes, I can. Because it's a formatted store.

19:00 <karolherbst> heh wait...

19:00 <jenatali> Because you can't modify the neighboring bytes

19:00 <karolherbst> actually... the limit is args.. not buffers

19:00 <karolherbst> and it's 8 or more

19:00 <karolherbst> well.. some hardware doesn't support 8 :(

19:00 <alyssa> jenatali: We can lower to a formatted store. The memory format is i8, but the register format is i16.

19:00 <jenatali> Oh I see, that makes sense

19:00 <alyssa> Yep
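The formatted load/store lowering alyssa and jenatali just agreed on can be modelled in plain C as below. This is a behavioural sketch only: the function names are made up, the register is modelled as `int16_t`, and sign extension on load is an assumption.

```c
#include <stdint.h>

/* Formatted load: memory format i8, register format i16.
 * Reads exactly one byte and sign-extends it into the wider register. */
static int16_t load_i8_to_i16(const uint8_t *base, uint32_t index)
{
    int16_t value = base[index];
    if (value >= 128)
        value -= 256;          /* manual sign extension of the byte */
    return value;
}

/* Formatted store: register format i16, memory format i8.
 * Only the addressed byte is written, so neighbouring bytes are untouched;
 * a plain 16-bit store would overwrite two bytes instead. */
static void store_i16_as_i8(uint8_t *base, uint32_t index, int16_t reg)
{
    base[index] = (uint8_t)reg;   /* keep the low 8 bits */
}
```

Because the memory format fixes the access width, the store side never needs a read-modify-write of the surrounding word, which is the concern jenatali raised about lowering to 16-bit stores.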

19:01 <jenatali> As long as you can still specify a byte-aligned address even with an i16 register?

19:01 <karolherbst> jenatali: did you play around with using actual ubos for constant buffers?

19:01 <jenatali> karolherbst: Yeah, we didn't use it though because D3D drivers (notably our software driver (WARP)) had bugs when dynamically selecting a ubo

19:01 <karolherbst> heh...

19:01 <jenatali> So we just lower it to ssbo the same as global

19:02 <karolherbst> why selecting one dynamically though?

19:02 <karolherbst> so I see that CL_DEVICE_MAX_CONSTANT_ARGS has to be at least 8, which is quite low

19:02 <karolherbst> so you can make it all static

19:03 <karolherbst> and if you need a pointer to inside a constant buffer you get an idx/offset vec2, which should be fine.. unless that's what's also causing you issues

19:03 <karolherbst> I think I'll play around with this and see how it goes

19:07 <jenatali> karolherbst: I mean in a shader, an app doing `foo ? constant1[x] : constant2[x]`

19:07 <karolherbst> okay.. mhhh

19:07 <karolherbst> annoying

19:07 <jenatali> Or other types of insidious things like dynamically computing a constant pointer that could point to one buffer or another

19:07 <karolherbst> don't have indirect ubos in d3d?

19:07 <jenatali> Smuggling it through shared or global memory to break pointer tracking, etc

19:08 <jenatali> We do, but they're apparently so rarely used that there's driver bugs

19:08 <karolherbst> uhhh... well.. it still fits in 64 bit unless you have 32 bit pointers :(

19:08 <karolherbst> ahh

19:08 <karolherbst> makes sense ...

19:08 <karolherbst> I suspect though it matters for performance :(

19:08 <airlied> vsyrjala: should we be requesting he cc intel-gfx for CI?

19:08 <Venemo> turol: as far as I see it is set a few lines above the highlighted line

19:08 <airlied> or did he do that and ignore the results?

19:08 <jenatali> Yeah we use 32bit buffer index, 32bit buffer offset, but if you can't track down a literal constant for that 32bit index due to the pointer smuggling, then you hit issues

19:09 <turol> Venemo: yes but also below

19:09 <karolherbst> yeah...

19:09 <turol> that looks suspicious to me

19:09 <karolherbst> iuf you can't use indirect ubos, because of bugs you really are in a world of pain there :(

19:09 <karolherbst> s/iuf/if/
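The idx/offset encoding discussed above (32-bit buffer index plus 32-bit byte offset in one 64-bit pointer value) can be sketched in C as follows. Which half holds the index, the limit of 8 bound buffers, and all identifiers are assumptions chosen for illustration; this is not the DXIL backend's or rusticl's actual layout.

```c
#include <assert.h>
#include <stdint.h>

#define MAX_CONSTANT_BUFFERS 8u   /* assumed limit for this sketch */

/* Pack a constant-memory pointer as index (high half) + offset (low half). */
static uint64_t const_ptr_pack(uint32_t buffer_idx, uint32_t byte_offset)
{
    return ((uint64_t)buffer_idx << 32) | byte_offset;
}

static uint32_t const_ptr_index(uint64_t ptr)  { return (uint32_t)(ptr >> 32); }
static uint32_t const_ptr_offset(uint64_t ptr) { return (uint32_t)ptr; }

/* The constant buffers currently bound to the kernel. */
struct bound_constants {
    const uint8_t *buffers[MAX_CONSTANT_BUFFERS];
};

/* Load through a packed pointer. When the index is only known at run time,
 * this is the dynamically selected buffer case that needs indirect UBOs. */
static uint8_t const_load_u8(const struct bound_constants *bound, uint64_t ptr)
{
    uint32_t idx = const_ptr_index(ptr);
    assert(idx < MAX_CONSTANT_BUFFERS);
    return bound->buffers[idx][const_ptr_offset(ptr)];
}
```

A select such as `foo ? constant1[x] : constant2[x]` becomes a select between two packed values. If the compiler can trace each one back to a literal index, the access stays static; once the value has been passed through shared or global memory, the index is only available at run time.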

19:10 <Venemo> turol: so what is your question?

19:10 <turol> the question is: is this intentional or a bug

19:10 <Venemo> I don't know

19:11 <karolherbst> jenatali: the other annoying part is... you can set the constant* arg to NULL and one also needs to encode that

19:11 <turol> neither do I, hence asking for radv developers

19:11 <Venemo> I'm a radv developer, hence tried to answer

19:11 <karolherbst> so I suspect we always want to encode the idx/offset in the input buffer, and just have a static access to be an optimization or something.. how annoying

19:12 <alyssa> jenatali: Yes. AGX requires addresses (and offsets) to be aligned to the alignment of the memory format, not the register format.

19:12 <karolherbst> which insane hw doesn't :P

19:13 <Venemo> turol: that code looks like it was copied from radeonsi a long time ago, best would be to check how radeonsi handles that now and see if it matches. but i wouldn't worry about it unless you suspect that this causes issues on affected hw

19:16 <alyssa> Venemo: Speaking of, any interest in standardizing on formatted loads/stores in NIR?

19:17 <alyssa> and on VS input lowering?

19:18 <alyssa> not 100% sure what that would look like yet

19:18 <turol> Venemo: the matching radeonsi code appears to be https://gitlab.freedesktop.org/mesa/mesa/-/blob/main/src/gallium/drivers/radeonsi/si_state_draw.cpp#L949

19:18 <turol> there appears to be a mismatch so this is probably a bug

19:22 <Venemo> alyssa: same answer as last time, I'm open to suggestions :)

19:22 <Venemo> turol: feel free to open a bug report against radv on the mesa gitlab then

19:22 <alyssa> Venemo: fair enough

19:23 <alyssa> okay, how about this: I'll do something that makes sense for AGX and vendor it, and if you think you can use it in radv, you'll extend it and move it to common and drop the aco lowering? :)

19:23 <Venemo> alyssa: my plan was to add something like load/store_buffer_amd and add a format field to it

19:23 <alyssa> yeah, same here

19:23 <alyssa> load_global_agx

19:24 <Venemo> this would have a constant offset, scalar offset, vector offset, and a vector index

19:24 <alyssa> load_global_agx taking two parameters "base address" and "offset" with FORMAT, SHIFT, MASK immediates I guess

19:26 <Venemo> I think the mismatching requirements for the sources may be an issue here

19:30 <Venemo> alyssa: we would most likely want to use the vector index for these (we would pass the vertex id or instance id)

19:30 <alyssa> yeah, same here

19:30 <Venemo> does agx also have an index src?

19:31 <Venemo> you didn't say so

19:31 <alyssa> that's my offset source

19:31 <alyssa> offset in units of alignment(FORMAT)

19:31 heat has joined #dri-devel

19:31 heat_ has quit [Read error: No route to host]

19:31 <Venemo> okay, so your offset is not the same as our offset?

19:31 <alyssa> probably not

19:31 <alyssa> in practice it probably is? or maybe my offset is your vector index?

19:32 <Venemo> I think it is

19:32 <Venemo> do you not have a normal byte offset?

19:32 <alyssa> nope

19:32 <Venemo> or that's the base address?

19:32 <alyssa> well, I guess the base address

19:33 <alyssa> as I wrote earlier today, it's literally:

19:33 <alyssa> format_t *array = (format_t *) base_address;

19:33 <alyssa> return array[index << extra_shift];

19:33 <Venemo> well, one thing we could do is have all srcs and you would emit an extra add in your backends

19:33 <alyssa> there's no add needed in the usual case

19:34 <alyssa> E.g. for an rgba8 vertex buffer at base address B with stride #S and no instancing, the load is a single instruction:

19:34 <Venemo> I mean when the intrin has both the scalar and vector offset, you'd add those in your backend and we'd emit those as part of the instr

19:34 gouchi has joined #dri-devel

19:34 <alyssa> load B, vertex ID << (log2(S / 4)), rgba8

19:34 <alyssa> Oh, er, right ok

19:34 <Venemo> sorry I can't type on my phone...

19:35 <DavidHeidelberg[m]> I was thinking about how to avoid restricted traces. One of the ideas is that instead of just asking developers of games/benchmarks/apps permission to use trace, we could offer them something like certification that Mesa3D supports (is tested with) their product. How does that sound?

19:35 <alyssa> right, if the stride is not power of two or there's an extra constant offset, we end up emitting an extra imad instruction, sure
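The address arithmetic alyssa describes for an rgba8 vertex buffer can be written out as a small C sketch. It assumes, as in the chat, that the load's offset is counted in units of the 4-byte format; the helper names are hypothetical and the multiply-add branch merely stands in for the extra imad instruction.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

#define RGBA8_BYTES 4u   /* size of one rgba8 element */

static bool is_pow2(uint32_t x)
{
    return x != 0 && (x & (x - 1)) == 0;
}

static unsigned log2_u32(uint32_t x)
{
    unsigned n = 0;
    while (x >>= 1)
        n++;
    return n;
}

/* True when the fetch fits the single-instruction form
 * "load B, vertex_id << shift, rgba8"; writes the shift if so. */
static bool fetch_shift(uint32_t stride_bytes, uint32_t const_offset_bytes,
                        unsigned *shift)
{
    if (const_offset_bytes != 0 || stride_bytes % RGBA8_BYTES != 0)
        return false;
    uint32_t stride_elems = stride_bytes / RGBA8_BYTES;
    if (!is_pow2(stride_elems))
        return false;
    *shift = log2_u32(stride_elems);   /* log2(S / 4) */
    return true;
}

/* Element index passed to the load, in units of the format size. */
static uint32_t fetch_index(uint32_t vertex_id, uint32_t stride_bytes,
                            uint32_t const_offset_bytes)
{
    assert(stride_bytes % RGBA8_BYTES == 0);
    assert(const_offset_bytes % RGBA8_BYTES == 0);

    unsigned shift;
    if (fetch_shift(stride_bytes, const_offset_bytes, &shift))
        return vertex_id << shift;                       /* folded into the load */
    return vertex_id * (stride_bytes / RGBA8_BYTES)      /* extra multiply-add */
         + const_offset_bytes / RGBA8_BYTES;
}
```

For a 16-byte stride the shift is 2, so vertex 5 reads element 20. A 12-byte stride, or any non-zero constant offset, takes the multiply-add path.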

19:35 Dr_Who has quit [Ping timeout: 480 seconds]

19:35 <Venemo> alyssa: maybe it's also OK to keep vendored intrinsics and once we see how the backends look we can decide if we can make them common

19:35 <alyssa> Nod

19:36 <alyssa> I want to say "sticking in an extra constant offset is just one ssa_scalar_chase away" but I guess you want to keep down compile times as always

19:36 <Venemo> well, the intrinsic would have a base, which would be the constant offset

19:36 <Venemo> similar to load_buffer_amd

19:36 <alyssa> nod

19:36 <Venemo> or do you mean you need an extra one besides that?

19:37 <alyssa> maybe this won't work out nicely then

19:37 <alyssa> could you write out pseudo-C code for what the AMD instruction does with all the sources? like I wrote for AGX? thanks

19:37 <Venemo> uhhh

19:39 <Venemo> alyssa: it's complicated, can you look at the RDNA2 shader isa chapter 8.1?

19:41 <Venemo> 8.1.5 describes the addressing, and 8.1.1 shows a simplified version of the formula

19:42 Haaninjo has quit [Quit: Ex-Chat]

19:43 Haaninjo has joined #dri-devel

19:43 <Venemo> alyssa: I would rather not try to type it on my phone sorry

19:43 <alyssa> fair enough, will try to remember to look when I get a chance

20:20 heat has quit [Read error: No route to host]

20:21 heat has joined #dri-devel

20:25 oneforall2 has quit [Remote host closed the connection]

20:28 oneforall2 has joined #dri-devel

20:39 Duke`` has quit [Ping timeout: 480 seconds]

20:42 Nimr-alIslam has joined #dri-devel

20:42 fab has quit [Quit: fab]

20:47 Nimr-alIslam has quit [autokilled: This host violated network policy. Contact support@oftc.net for further information and assistance. (2022-09-19 20:47:12)]

20:48 mvlad has quit [Remote host closed the connection]

20:52 ngcortes has quit [Read error: Connection reset by peer]

20:55 lemonzest has quit [Quit: WeeChat 3.5]

21:06 ngcortes has joined #dri-devel

21:22 danvet has quit [Ping timeout: 480 seconds]

21:25 gouchi has quit [Remote host closed the connection]

21:45 mbrost has joined #dri-devel

22:04 mbrost has quit [Ping timeout: 480 seconds]

22:07 ahajda has quit [Ping timeout: 480 seconds]

22:11 ybogdano has quit [Ping timeout: 480 seconds]

22:12 mbrost has joined #dri-devel

22:26 ybogdano has joined #dri-devel

22:34 ybogdano is now known as Guest966

22:34 Guest966 has quit [Read error: Connection reset by peer]

22:34 ybogdano has joined #dri-devel

22:39 vliaskov has quit [Remote host closed the connection]

22:46 ybogdano has quit [Ping timeout: 480 seconds]

22:51 paulk-bis has joined #dri-devel

22:52 paulk has quit [Ping timeout: 480 seconds]

23:01 Haaninjo has quit [Quit: Ex-Chat]

23:02 pcercuei has quit [Quit: dodo]

23:05 mbrost has quit [Ping timeout: 480 seconds]

23:08 Weiss-Fder[m] has joined #dri-devel

23:27 iive has quit [Quit: They came for me...]