#dri-devel on 2023-09-05 — irc logs at oftc.irclog.whitequark.org

2022-12-21 00:45 ChanServ changed the topic of #dri-devel to: <ajax> nothing involved with X should ever be unable to find a bar

00:00 Bill has joined #dri-devel

00:01 Bill is now known as Guest1818

00:01 Guest1818 has quit []

00:03 Company has quit [Remote host closed the connection]

00:07 sarnex has quit [Read error: Connection reset by peer]

00:08 sarnex has joined #dri-devel

00:18 mareko has quit [Ping timeout: 480 seconds]

00:18 dri-logger has quit [Ping timeout: 480 seconds]

00:18 glisse has quit [Ping timeout: 480 seconds]

00:19 quantum5 has quit [Quit: ZNC - https://znc.in]

00:19 quantum5 has joined #dri-devel

00:26 mareko has joined #dri-devel

00:26 dri-logger has joined #dri-devel

00:31 glisse has joined #dri-devel

00:35 <alyssa> karolherbst: I gave some more thought to the "62-bit generic pointers make a mess if you don't inline"

00:35 <alyssa> Here's a crude take

00:37 <alyssa> If a function has any _private arguments, we really want to inline, because we can likely copy prop away the scratch memory access entirely, and eliminating scratch should outweigh the cost of inlining

00:37 <alyssa> If a function has any local arguments, we probably also want to inline, since local mem access behaves differently from global/scratch mem in terms of performance characteristics (e.g. on AGX there's no need to insert a wait after reading from local mem)

00:38 <alyssa> also, I suspect that passing in a pointer to local mem to a generic ptr function is.. probably pretty rare in practice

00:38 <alyssa> so it might not matter what we do

00:38 <karolherbst> that all sounds fine in theory, but I have shaders with millions of SSA values

00:38 <alyssa> So then the heuristic is

00:38 <alyssa> inline if any arguments are private or local ptrs, and then whatever you didn't inline assume everything is _global

00:39 <karolherbst> we can't always inline anything, because there will be situations we can't

00:39 <karolherbst> what if a 500 loc function is called 100 times

00:39 <alyssa> (I imagine it doesn't /quite/ work like that because generic ptrs mean we can't necessarily know the address spaces at compile-time. So might need to template stuff, but. shrug)

00:39 <alyssa> then you have a 50,000 line kernel \shrug/

00:39 dri-logg1r has joined #dri-devel

00:40 <karolherbst> well.. sure, but the point I was making here, I already have shaders with _millions_ of SSA values

00:40 <karolherbst> literally

00:40 <alyssa> I mean

00:40 <alyssa> if you have shaders executing millions of instructions I kinda feel like you're already toast?

00:40 <karolherbst> it's not millions of instructions

00:40 <karolherbst> it's just control flow

00:40 <alyssa> oh

00:40 <alyssa> meh?

00:40 <karolherbst> and very nasty one

00:41 <karolherbst> the thing is.. those shaders usually run if LLVM compiles them to AMD

00:41 <karolherbst> but on mesa you OOM your system

00:41 <karolherbst> so any heurestic where we always inline functions based on argument types won't work

00:41 glisse has quit [Ping timeout: 480 seconds]

00:41 <karolherbst> because what if that function is called bazillion times?

00:41 <karolherbst> then we are again toast

00:42 mareko has quit [Ping timeout: 480 seconds]

00:42 <karolherbst> some shaders also do switches on type parameters to call into certain functions and other unky bits

00:42 dri-logger has quit [Ping timeout: 480 seconds]

00:42 <karolherbst> like hand rolled function tables

00:43 <karolherbst> some of the compute kernels are just massive and wild

00:44 <karolherbst> but if we allow function calls, we can also just duplicate functions with generic arguments and call the variant we actually need

00:44 <karolherbst> might be better than if-else-ladders resolving generic pointers

00:44 <karolherbst> but we also kinda want to make use of hardware supporting generic pointers natively

00:45 <karolherbst> which is the best case and solves a lot of the pain points here

00:45 mareko has joined #dri-devel

00:48 yyds has joined #dri-devel

00:48 columbarius has joined #dri-devel

00:49 glisse has joined #dri-devel

00:50 co1umbarius has quit [Ping timeout: 480 seconds]

01:06 jewins has quit [Ping timeout: 480 seconds]

01:08 ungeskriptet0 has joined #dri-devel

01:13 <alyssa> sure

01:13 <alyssa> I suspect they're in the minority, though?

01:13 <alyssa> I mean, Mali does but it's deeply terrible

01:13 ungeskriptet has quit [Ping timeout: 480 seconds]

01:14 <alyssa> and honestly i'd be tempted to do 62-bit on mali

01:15 flynnjiang has quit [Remote host closed the connection]

01:16 flynnjiang has joined #dri-devel

01:16 <alyssa> karolherbst: FWIW, Apple claims that they force inline functions that read stack or constant mem

01:17 <alyssa> citing "SROA, Buffer preloading"

01:25 <DemiMarie> karolherbst: is this some sort of numerical algorithm or scientific computing code? If so, this would not surprise me at all.

01:26 quantum5 has quit [Quit: ZNC - https://znc.in]

01:27 flynnjiang has quit [Remote host closed the connection]

01:28 flynnjiang has joined #dri-devel

01:28 quantum5 has joined #dri-devel

01:37 flynnjiang has quit [Remote host closed the connection]

01:40 flynnjiang has joined #dri-devel

01:43 sassefa has joined #dri-devel

01:49 kts has quit [Ping timeout: 480 seconds]

01:53 <youmukonpaku1337> hey guys

01:53 <youmukonpaku1337> so im trying to do a bit of trickery with my mainlined ebook and get usb display

01:53 <youmukonpaku1337> and it WORKS but its using llvmpipe instead of lima and i get this

01:53 <youmukonpaku1337> libGL error: failed to load driver: gud

01:53 <youmukonpaku1337> libGL error: MESA-LOADER: failed to open gud: /usr/lib/dri/gud_dri.so: cannot open shared object file: No such file or directory (search paths /usr/lib/arm-linux-gnueabihf/dri:\$${ORIGIN}/dri:/usr/lib/dri,

01:53 <youmukonpaku1337> suffix _dri)

01:53 <youmukonpaku1337> am i missing something? am using mesa from debian repos

01:54 <youmukonpaku1337> oh and the way i got GUD is very fucky (compiling out of tree with kernel headers) but it seems fine

01:56 <youmukonpaku1337> i have the module and theres a drm device at card1

01:59 <youmukonpaku1337> oh

01:59 <youmukonpaku1337> right

01:59 <youmukonpaku1337> i probs need gl4es lol

02:01 flynnjiang has quit [Ping timeout: 480 seconds]

02:02 flynnjiang has joined #dri-devel

02:03 <youmukonpaku1337> es2gears also works but uhh

02:03 <youmukonpaku1337> same err

02:09 youmukon1 has joined #dri-devel

02:09 youmukonpaku1337 is now known as Guest1834

02:09 youmukon1 is now known as youmukonpaku1337

02:11 Guest1834 has quit [Ping timeout: 480 seconds]

02:17 <kode54> I have to test a regression in ANV since 23.1.6

02:18 <kode54> It renders that game, The Spirit and The Mouse, into a colorful and flickery mess

02:20 ayaka_ has joined #dri-devel

02:32 dviola has quit [Quit: WeeChat 4.0.4]

02:39 sassefa has quit [Quit: sassefa]

02:39 crabbedhaloablut has joined #dri-devel

02:43 yuq825 has joined #dri-devel

02:58 yyds has quit [Quit: Lost terminal]

02:58 yyds has joined #dri-devel

02:59 yyds has quit []

02:59 yyds has joined #dri-devel

03:01 yyds has quit []

03:01 yyds has joined #dri-devel

03:05 Daanct12 has joined #dri-devel

03:16 flynnjiang has quit [Ping timeout: 480 seconds]

03:17 flynnjiang has joined #dri-devel

04:02 Duke`` has joined #dri-devel

04:09 ohmltb^ has quit [Remote host closed the connection]

04:23 ayaka_ has quit [Ping timeout: 480 seconds]

04:28 bmodem has joined #dri-devel

04:29 anarsoul has quit [Ping timeout: 480 seconds]

04:33 yyds has quit [Quit: Lost terminal]

04:40 flynnjiang has quit [Ping timeout: 480 seconds]

04:41 flynnjiang has joined #dri-devel

04:45 yyds has joined #dri-devel

04:45 camus has quit [Ping timeout: 480 seconds]

04:55 flynnjiang has quit [Ping timeout: 480 seconds]

04:56 flynnjiang has joined #dri-devel

04:57 camus has joined #dri-devel

05:01 flynnjiang has quit [Remote host closed the connection]

05:02 flynnjiang has joined #dri-devel

05:05 fab has joined #dri-devel

05:09 anarsoul has joined #dri-devel

05:16 junaid has joined #dri-devel

05:18 Duke`` has quit [Ping timeout: 480 seconds]

05:19 junaid has quit [Remote host closed the connection]

05:23 youmukonpaku1337 has quit [Quit: WeeChat 4.0.4]

05:24 kzd has quit [Ping timeout: 480 seconds]

05:26 sima has joined #dri-devel

05:48 ayaka_ has joined #dri-devel

05:53 * airlied fails to get a gitlab container to run deqp tests locally, the docs don't seem to be up to date, or just don't tell you how to run deqp/piglit tests against a build

05:54 <airlied> or at least the docs explain builds, but not how to test already built artifacts

05:54 youmukonpaku1337 has joined #dri-devel

06:03 rasterman has joined #dri-devel

06:03 tzimmermann has joined #dri-devel

06:04 fab has quit [Ping timeout: 480 seconds]

06:07 mszyprow has joined #dri-devel

06:10 <airlied> okay hacked it around, and now the tests don't hit the assert in my container they hit in the CI one

06:12 <youmukonpaku1337> huh lima DOES have desktop gl

06:12 <youmukonpaku1337> why is mesa looking for a gud.so though

06:12 <youmukonpaku1337> *gud_dri.so

06:13 <youmukonpaku1337> what did i mess up lmao

06:13 <Sachiel> what the hell is gud?

06:13 <youmukonpaku1337> generic usb display

06:13 <youmukonpaku1337> essentially a way to get display output with a pi turned into a usb gadget

06:14 <youmukonpaku1337> it *works* (kinda) but mesa freaks out and spits this: libGL error: failed to load driver: gud

06:14 <youmukonpaku1337> libGL error: MESA-LOADER: failed to open gud: /usr/lib/dri/gud_dri.so: cannot open shared object file: No such file or directory (search paths /usr/lib/arm-linux-gnueabihf/dri:\$${ORIGIN}/dri:/usr/lib/dri,

06:14 <Sachiel> oh, if that's using a specific kernel driver, then something that doesn't recognize it might be trying to find a userspace driver matching the name, thus the failed search for gud_dri.so

06:14 <youmukonpaku1337> suffix _dri)

06:14 <youmukonpaku1337> ah

06:15 <youmukonpaku1337> any way to make it not do that?

06:15 <youmukonpaku1337> but yea theres no userspace driver

06:15 <Sachiel> try MESA_LOADER_DRIVER_OVERRIDE=whateveryouexpecttowork

06:15 <youmukonpaku1337> oh true

06:15 <youmukonpaku1337> also for some reason even with an override es2gears and glxgears run at 15fps

06:15 <youmukonpaku1337> ;-;

06:16 mszyprow has quit [Ping timeout: 480 seconds]

06:16 <youmukonpaku1337> i doubt the mali 400 is *that* bad

06:16 <youmukonpaku1337> maybe i should test with wayland instead of X

06:18 mszyprow has joined #dri-devel

06:19 <youmukonpaku1337> anyway i guess ill test once im home lol

06:19 <youmukonpaku1337> still kinda cool that im able to get any display output at all on an ebook

06:20 <youmukonpaku1337> using wifi pins for usb lol

06:20 <youmukonpaku1337> https://youmu.i-am-in-your.systems/EzbdBigmCqRj

06:21 <youmukonpaku1337> i changes out the about-to-short usb port for a breakout board now but its still about as cursed

06:22 <youmukonpaku1337> also had to compile GUD out of tree because it isnt enabled in linux-image-armmp :(

06:31 mszyprow has quit [Ping timeout: 480 seconds]

06:38 yyds has quit [Remote host closed the connection]

06:39 yyds has joined #dri-devel

06:42 <kode54> cool

06:42 <kode54> I found the bad commit, or commits

06:42 <kode54> it outright crashes on them

06:42 <kode54> I'm building a full debug build now to produce proper backtraces

06:43 <kode54> the thing I hate about debug builds of mesa is that this full build results in about a 2GB install footprint

06:43 <kode54> most of which is the debugging symbols package

06:43 An0num0us has joined #dri-devel

06:43 <kode54> it also takes upwards of 10-15 minutes for the strip/objcopy process that pulls the debug data off the binaries and stuffs it into a debug package

06:44 <kode54> the default mesa-tkg-git package config and script, the PKGBUILD hardcodes b_ndebug=true, and the config file defaults to --strip --buildtype release

06:47 itoral has joined #dri-devel

07:00 sghuge has quit [Remote host closed the connection]

07:00 <kode54> crap

07:00 <kode54> doesn't crash in debug build

07:00 sghuge has joined #dri-devel

07:00 <kode54> but it does have the rendering bugs

07:01 fab has joined #dri-devel

07:07 flynnjiang has quit [Ping timeout: 480 seconds]

07:08 flynnjiang has joined #dri-devel

07:09 mwk_ has quit [Ping timeout: 480 seconds]

07:10 youmukon1 has joined #dri-devel

07:10 youmukonpaku1337 has quit [Read error: Connection reset by peer]

07:17 Ahuj has joined #dri-devel

07:20 flynnjiang has quit [Ping timeout: 480 seconds]

07:20 flynnjiang has joined #dri-devel

07:25 mwk has joined #dri-devel

07:31 flynnjiang has quit [Ping timeout: 480 seconds]

07:32 flynnjiang has joined #dri-devel

07:32 Omax has quit [Ping timeout: 480 seconds]

07:33 Omax has joined #dri-devel

07:37 frieder has joined #dri-devel

07:40 swalker_ has joined #dri-devel

07:41 youmukonpaku1337 has joined #dri-devel

07:41 swalker_ is now known as Guest1860

07:42 swalker__ has joined #dri-devel

07:44 An0num0us has quit [Ping timeout: 480 seconds]

07:45 youmukon1 has quit [Read error: Connection reset by peer]

07:48 <karolherbst> DemiMarie: ray tracer

07:48 Guest1860 has quit [Ping timeout: 480 seconds]

07:49 <karolherbst> alyssa: I'm sure we could do it for _most_ functions, but we have to be mindful about how we do it all. There will be situation we can't simply inline certain type of functions, because it would blow up the kernel. If inlining works for 99% of the applications, good, but we need a fallback for the 1%

07:50 vliaskov has joined #dri-devel

07:52 youmukon1 has joined #dri-devel

07:53 mripard has joined #dri-devel

07:55 lynxeye has joined #dri-devel

07:56 <pq> youmukonpaku1337, I don't think that USB display drivers (that is, *not* USB-C DP alt mode) would support hardware rendered content (dmabuf), which is the reason why you'd get software rendering on GUD. There could be hardware+display specific exceptions, but I don't know about those.

07:56 mripard has quit []

07:57 youmukonpaku1337 has quit [Ping timeout: 480 seconds]

07:57 mripard has joined #dri-devel

07:57 <pq> youmukonpaku1337, a Wayland compositor could implement hardware rendering and then do a CPU copy into GUD's buffers, but I don't know if anyone implemented that.

07:59 <pq> oh right, Mutter does at least

08:02 <pq> youmukonpaku1337, a USB display driver is probably always going to shovel pixels with the CPU, so that will always hurt.

08:03 donaldrobson has joined #dri-devel

08:05 rgallaispou has joined #dri-devel

08:16 samuelig_ has quit []

08:17 samuelig has joined #dri-devel

08:19 Haaninjo has joined #dri-devel

08:21 youmukon1 has quit [Remote host closed the connection]

08:22 youmukonpaku1337 has joined #dri-devel

08:30 youmukonpaku1337 has quit [Remote host closed the connection]

08:30 youmukonpaku1337 has joined #dri-devel

08:43 youmukonpaku1337 has quit [Remote host closed the connection]

08:44 youmukonpaku1337 has joined #dri-devel

08:46 flynnjiang has quit [Ping timeout: 480 seconds]

08:47 flynnjiang has joined #dri-devel

08:54 kts has joined #dri-devel

09:01 bmodem has quit [Ping timeout: 480 seconds]

09:02 danylo has quit [Quit: Ping timeout (120 seconds)]

09:02 danylo has joined #dri-devel

09:07 <youmukonpaku1337> pq: oh yea i see (btw is there a way to use mutter alone without gnome)

09:08 <pq> umm... mutter can run without gnome-shell, but I'm not sure how useful that is

09:08 <pq> other than testing

09:10 <youmukonpaku1337> pq: though i probs gotta test weston too, might work

09:10 <youmukonpaku1337> ~~as long as it doesnt use waaaay too much ram~~

09:10 <pq> who knows, maybe you could configure even Xorg to render with lima and copy to GUD...

09:11 <pq> I don't remember Weston having such copy you'd need, but it has had some multi-DRM-device patches I haven't really looked into what they do.

09:12 <pq> for Xorg, if you can get it to recognize both rendering and GUD devices, playing with xrandr --setprovideroutputsource / --setprovideroffloadsink

09:13 <pq> ..might do something maybe

09:20 Haaninjo has quit [Quit: Ex-Chat]

09:23 ced117 has quit [Ping timeout: 480 seconds]

09:29 flynnjiang has quit [Read error: Connection reset by peer]

09:29 ap51 has joined #dri-devel

09:29 flynnjiang has joined #dri-devel

09:29 turol_ has joined #dri-devel

09:33 <turol_> is it possible for non-developers to get rights to add tags to issues?

09:33 <turol_> labels, whatever gitlab calls them

09:37 kts has quit [Ping timeout: 480 seconds]

09:40 flynnjiang has quit [Ping timeout: 480 seconds]

09:48 simon-perretta-img has joined #dri-devel

10:14 tristan has joined #dri-devel

10:14 tristan is now known as Guest1874

10:16 <DavidHeidelberg[m]> to everyone who running manually pipelines for testing their MR: We currently have too many rootfs images hiting the caches, please always rebase before running pipeline, if you can.

10:21 youmukonpaku1337 has quit [Remote host closed the connection]

10:21 youmukonpaku1337 has joined #dri-devel

10:24 pekkari has joined #dri-devel

10:49 alkisg_irc has joined #dri-devel

10:49 alkisg_irc is now known as alkisg

10:51 Company has joined #dri-devel

10:55 youmukonpaku1337 has quit [Remote host closed the connection]

10:55 youmukonpaku1337 has joined #dri-devel

10:57 bmodem has joined #dri-devel

11:00 bbhtt- is now known as bbhtt

11:06 An0num0us has joined #dri-devel

11:06 padovan has joined #dri-devel

11:07 <alyssa> DavidHeidelberg[m]: rebase on upstream to pick up the latest image?

11:09 <turol_> alyssa: the nir if condition change also seems to apply to loops

11:09 <turol_> that caused a regression

11:09 <turol_> was is intended to apply to loops?

11:10 <turol_> issue 9750 if you want more details

11:12 <alyssa> uh oh

11:13 ayaka_ has quit [Remote host closed the connection]

11:13 ayaka_ has joined #dri-devel

11:13 <turol_> it triggered unrolling of a loop that previously wasn't

11:13 <alyssa> turol_: What's the regression?

11:13 <alyssa> Being able to unroll more loops is a good thing..

11:13 <turol_> causing increased register pressure and lowered subgroups per SIMD

11:13 <turol_> not when there's a texture read inside the loop

11:14 <alyssa> ok, but that's a deficiency in the loop unrolling heuristic then (deciding to unroll loops when it's not beneficial)

11:15 <alyssa> not the fault of last night's patch

11:15 <alyssa> and also, unrolling loops with a texture read inside may *still* be a win in practice?

11:15 <alyssa> you get lower occupancy, but you get more ILP to hide the latency, and might come out ahead despite the pipeline stats

11:16 <DavidHeidelberg[m]> alyssa: if you have older MR, which using .gitlab-ci/image-tags.yml which been produced long time ago, the CI (until assigned to Marge) will use the old images

11:16 <DavidHeidelberg[m]> and the old images aren't usually that much cached

11:16 <alyssa> DavidHeidelberg[m]: +1, got it2

11:16 <alyssa> thx

11:17 <alyssa> turol_: see the discussion in https://gitlab.freedesktop.org/mesa/mesa/-/issues/7161

11:17 <turol_> just tried, 142 fps unrolled, 146 fps not unrolled

11:17 <alyssa> OK. That's a more interesting statistic then

11:17 <turol_> it's the slowest shader of SMAA

11:17 <turol_> the others are pretty simple

11:18 <turol_> it's actually a little bit infamous for causing issues in both spirv-tools and spirv-cross

11:18 <pendingchaos> I'm not sure this particular form of loop unrolling is a good idea (it's that weird nested if form) since it usually doesn't overlap iterations

11:18 <pendingchaos> but we would need LICM/GCM to fully replace it

11:18 <pendingchaos> it's not very beneficial for this particular shader (just doing LICM for a descriptor load)

11:20 youmukon1 has joined #dri-devel

11:21 <pendingchaos> (complex_unroll() in nir_opt_loop_unroll.c, I think)

11:23 youmukonpaku1337 has quit [Ping timeout: 480 seconds]

11:27 <turol_> on nvidia proprietary driver unrolling or not affects the binary size but not register count

11:27 <turol_> fps seems identical

11:27 <turol_> don't have other amds to easily test

11:28 <turol_> does someone have instructions for setting up a chroot/vm for compiling mesa for the steam deck?

11:28 <pendingchaos> you can use RADV_FORCE_FAMILY and Fossilize to loop at how shaders compile for other gpus

11:29 <turol_> but that doesn't let me test the fps

11:29 <alyssa> pendingchaos: will NIR ever grow a dedicated LICM? or is that purely part of nir_opt_gcm?

11:30 <alyssa> (Every time I look at opt_gcm, it blows up my reg pressure and slows things down)

11:30 <pendingchaos> no idea

11:31 <turol_> and like i mentioned in the issue while i can fix this shader for myself that doesn't help everyone else who's used it in their proprietary game

11:31 <turol_> on the other hand in more complicated render it's proportionately less important

11:31 <pendingchaos> maybe nir_opt_gcm can be modified so that it can only do LICM

11:32 <alyssa> pendingchaos: fair

11:32 <alyssa> the other case that comes up is duplicated stuff on both sides of an if

11:33 <alyssa> another case that's not nearly as problematic of gcm's usual thing of "move EVERYTHING!!"

11:33 <alyssa> but opt_gcm seems like a blunt hammer, idk

11:33 youmukon1 has quit [Remote host closed the connection]

11:33 youmukonpaku1337 has joined #dri-devel

11:42 ayaka_ has quit [Ping timeout: 480 seconds]

11:44 pekkari has quit [Ping timeout: 480 seconds]

11:47 youmukonpaku1337 has quit [Remote host closed the connection]

11:47 youmukonpaku1337 has joined #dri-devel

11:54 alkisg has left #dri-devel [#dri-devel]

12:01 yyds has quit [Remote host closed the connection]

12:23 Guest1874 has quit [Remote host closed the connection]

12:24 tristan has joined #dri-devel

12:25 tristan is now known as Guest1914

12:25 itoral has quit [Remote host closed the connection]

12:31 youmukonpaku1337 has quit [Remote host closed the connection]

12:31 hansg has joined #dri-devel

12:31 youmukonpaku1337 has joined #dri-devel

12:34 hansg has quit []

12:41 youmukonpaku1337 has quit [Remote host closed the connection]

12:42 youmukonpaku1337 has joined #dri-devel

12:45 <pq> swick[m], what property setting ioctls did you refer to in the email?

12:50 <swick[m]> pq: DRM_IOCTL_MODE_SETPROPERTY, etc

12:50 <pq> why would you use those?

12:51 <swick[m]> to set the property of a connector?

12:51 <swick[m]> it's all hidden in libdrm

12:51 <pq> no, that's atomic commit ioctl

12:51 <pq> let's see...

12:51 <swick[m]> mhh, is it?

12:52 <pq> it wouldn't be atomic, if each property was set with a separate ioctl

12:52 <swick[m]> oh, you're right...

12:52 <swick[m]> I mean, it could still be atomic

12:54 <pq> atomic commit ioctl argument is struct drm_mode_atomic, and it seems to contain the whole lot.

12:55 <swick[m]> yes, it actually only issues one ioctl

12:55 <swick[m]> my bad

12:56 <pq> I thought I missed something :-)

12:56 <swick[m]> just saying, that's not a requirement for it to be atomic, just like in wayland where we built up state in the compositor and then start using it on a commit message

12:57 <pq> right, if DRM_IOCTL_MODE_SETPROPERTY staged stuff

12:58 mauld has quit [Ping timeout: 480 seconds]

12:58 <zamundaaa[m]> Please don't reinvent the atomic API in worse

12:58 youmukonpaku1337 has quit [Remote host closed the connection]

12:58 <pq> we're not

12:59 youmukonpaku1337 has joined #dri-devel

13:02 mauld has joined #dri-devel

13:05 Daanct12 has quit [Quit: WeeChat 4.0.4]

13:06 turol_ has quit [Quit: Leaving]

13:13 <pq> Is it so that KMS has no way of choosing BT.2100 ICtCp as video stream colorimetry?

13:15 <pq> not in v6.5 it seems

13:16 <swick[m]> do sinks support that?

13:17 <pq> I dunno, but CTA-861 defines it

13:18 <pq> no hits in linuxhw/EDID, so I guess not

13:18 <swick[m]> oh, in 861-H

13:18 <swick[m]> pretty new then

13:19 <pq> oh, yeah, I'm reading H, and wasn't there I already too?

13:19 youmukonpaku1337 has quit [Remote host closed the connection]

13:19 youmukonpaku1337 has joined #dri-devel

13:19 <swick[m]> only YCbCr in 861-G

13:21 <pq> there is no RGB variant of it, is there?

13:21 <pq> or you mean BT2020_YCC?

13:22 <pq> swick[m], t

13:23 <pq> swick[m], this reminds me, should the new color pipeline UAPI replace the automatic RGB/YCC selection from the start?

13:23 <swick[m]> yeah, bt 2020 YCC is defined in CTA-861-G already but not ICtCp

13:24 <swick[m]> it's only for the plane right now, so I don't think so

13:24 <pq> right, memory is slowly coming back

13:24 <pq> and it can be added later with "auto"

13:29 <pq> were the diagrams supposed to appear as rendered images in https://dri.freedesktop.org/docs/drm/gpu/drm-kms.html ? I see source, e.g. Overview.

13:30 youmukonpaku1337 has quit [Remote host closed the connection]

13:31 youmukonpaku1337 has joined #dri-devel

13:32 yyds has joined #dri-devel

13:34 ced117 has joined #dri-devel

13:37 Danct12 has quit [Read error: Connection reset by peer]

13:41 fab has quit [Quit: fab]

13:57 heat has joined #dri-devel

14:00 JTL has quit []

14:02 YuGiOhJCJ has quit [Quit: YuGiOhJCJ]

14:15 fab has joined #dri-devel

14:15 JTL has joined #dri-devel

14:18 hansg has joined #dri-devel

14:19 rasterman has quit [Quit: Gettin' stinky!]

14:20 youmukonpaku1337 has quit [Remote host closed the connection]

14:20 youmukonpaku1337 has joined #dri-devel

14:26 Guest1914 has quit [Remote host closed the connection]

14:30 xzhan34_ has joined #dri-devel

14:30 xzhan34 has quit [Remote host closed the connection]

14:35 mripard has quit [Quit: mripard]

14:36 mripard has joined #dri-devel

14:36 agd5f has quit [Remote host closed the connection]

14:36 heat has quit [Remote host closed the connection]

14:38 ap51 has quit [Ping timeout: 480 seconds]

14:39 kzd has joined #dri-devel

14:39 agd5f has joined #dri-devel

14:41 <mareko> llvmpipe-traces times out randomly: https://gitlab.freedesktop.org/mesa/mesa/-/jobs/48524383

14:41 <zmike> yeah something to do with new infra

14:41 tristan has joined #dri-devel

14:41 <zmike> being discussed in #freedesktop

14:42 tristan is now known as Guest1928

14:43 <karolherbst> jasuarez: I'm kinda looking into compute stuff for v3d, but I'm running issues with fences. At least it looks like some aren't signaled and I wonder what's the best approach here to debug it

14:45 fab has quit [Ping timeout: 480 seconds]

14:46 mszyprow has joined #dri-devel

14:46 jewins has joined #dri-devel

14:47 pekkari has joined #dri-devel

14:48 <jasuarez> I never deal with such issues so not sure what's the best approach

14:49 <jasuarez> I don't remember to have anything special for that

14:54 <karolherbst> mhh.. maybe I'm doing something incorrectly, but I also don't see the GPU faulting, or at least nothing in dmesg

14:54 mripard has quit [Quit: mripard]

14:56 yuq825 has quit []

14:56 <karolherbst> jasuarez: do you know if all memory (a.k.a. pipe_resources) need to be referenced before work can be launched/waited on or somehting odd like that? I'm currently not doing this, so maybe I want to figure out how to properly do it in v3d

14:56 Guest1928 has quit [Remote host closed the connection]

14:56 <karolherbst> but then again it's a bit odd to not see any errors

14:59 <karolherbst> yeah mhh.. doesn't seem to be it either

15:00 alpalcone has quit [Quit: WeeChat 3.8]

15:00 pekkari has quit [Quit: Konversation terminated!]

15:19 ap51 has joined #dri-devel

15:20 <jasuarez> Pretty sure Iago dealt with then when developing v3dv, but he is not connected now. I could ping him tomorrow

15:20 <karolherbst> cool

15:23 agd5f has quit [Remote host closed the connection]

15:25 tzimmermann has quit [Quit: Leaving]

15:25 agd5f has joined #dri-devel

15:27 pekkari has joined #dri-devel

15:32 yyds has quit [Remote host closed the connection]

15:43 frieder has quit [Remote host closed the connection]

15:43 tristan has joined #dri-devel

15:44 tristan is now known as Guest1931

15:46 pekkari has quit [Ping timeout: 480 seconds]

15:50 jessica_24 has joined #dri-devel

15:56 bmodem has quit [Ping timeout: 480 seconds]

16:01 bmodem has joined #dri-devel

16:03 junaid has joined #dri-devel

16:09 hansg has quit [Remote host closed the connection]

16:10 hansg has joined #dri-devel

16:15 <mareko> rustcuda when

16:16 <karolherbst> I wonder if layering HIP on CL is good enough here, at least that's my hope and that the project in question works out :D

16:16 <karolherbst> but I also don't know if AMD plans to stay compatible with CUDA forever or not

16:16 <youmukonpaku1337> amd is compatible with cuda??

16:16 <youmukonpaku1337> the hell

16:17 <mareko> youmukonpaku1337: it's called HIP

16:17 <karolherbst> well.. HIP is basically `s/cu/hip/` + some mistakes or something

16:17 <youmukonpaku1337> i see

16:17 <mareko> I'm hearing nobody uses OpenCL

16:18 <karolherbst> yeah, hearing that a lot from AMD people

16:18 <youmukonpaku1337> yep thats true

16:18 <youmukonpaku1337> most stuff uses cuda

16:18 <karolherbst> yeah, but the reason is, that all the CL stacks were horrible in the past :D

16:18 <karolherbst> but yeah..

16:19 <karolherbst> at least there are a couple of companies still invested in CL.. anyway.. I think layering CUDA/HIP/whatever on top of CL or whatever is probably the best strategy here

16:19 <karolherbst> and such projects already exists

16:19 <youmukonpaku1337> yep that could work

16:19 <youmukonpaku1337> ~~the zink of opencl~~

16:19 <karolherbst> HIP on CL on zink on....

16:20 <mareko> .. glide

16:20 <zmike> nope shut it down

16:20 <karolherbst> *layers weren't supposed to be layered on top of layers*

16:20 <mareko> or zink on r600

16:20 <zmike> don't encourage them

16:20 <karolherbst> uhhh

16:21 <karolherbst> anyway... all those HIP on CL layers require insane extensions

16:21 <karolherbst> e.g. SVM

16:21 <karolherbst> :')

16:21 kts has joined #dri-devel

16:21 <youmukonpaku1337> anyway unrelated but why the hell does an e reader need a dedicated video encoder/decoder chip on the soc LOL

16:21 <karolherbst> mhh

16:21 <karolherbst> copyrighted embedded videos?

16:21 <youmukonpaku1337> am not complaining but its kind of funny

16:21 <youmukonpaku1337> nope

16:21 <karolherbst> (with DR)

16:22 <karolherbst> *DR

16:22 <youmukonpaku1337> the reader never plays any videos of sort

16:22 <karolherbst> ... my M is stuck

16:22 <karolherbst> huh.. weird

16:22 <karolherbst> adds?

16:22 <karolherbst> :D

16:22 <karolherbst> ehh

16:22 <karolherbst> ads

16:22 <youmukonpaku1337> its just the allwinner a13 has a cedar VPU and they couldnt be bothered to get anothee soc lol

16:23 <karolherbst> uhh

16:23 <youmukonpaku1337> ads are impossible, this device doesnt have wifi (by default at least)

16:23 <youmukonpaku1337> HOWEVER

16:23 <youmukonpaku1337> you can uh

16:23 <youmukonpaku1337> do something so utterly cursed

16:23 <karolherbst> like using the sound card?

16:23 <youmukonpaku1337> that its just plain insane

16:23 <youmukonpaku1337> karolherbst: no sound card to speak of

16:23 <mareko> lavacuda would be interesting, zink can help I'm sure

16:24 <karolherbst> sooo... there is this cuda driver API we could potentially implement

16:24 <karolherbst> which is libcuda.so

16:24 <karolherbst> but I have no idea how painful that would be

16:24 yyds has joined #dri-devel

16:24 <youmukonpaku1337> karolherbst: check out this monstrosity i made https://youmu.i-am-in-your.systems/MtWKwDaqmTye https://youmu.i-am-in-your.systems/fnpkpGMuwCpB

16:25 <youmukonpaku1337> so technically this system doesnt have wifi *right*

16:25 <youmukonpaku1337> BUT the usb interface used for it works and is very easy to use

16:25 <youmukonpaku1337> so its like a nice usb interface

16:25 fab has joined #dri-devel

16:26 <youmukonpaku1337> i should probs boost the 3.3v it provides to 5v

16:26 <youmukonpaku1337> instead of external power

16:26 <karolherbst> cursed

16:26 Guest1931 has quit [Ping timeout: 480 seconds]

16:26 <youmukonpaku1337> very

16:26 <youmukonpaku1337> but it works (horribly)

16:26 <youmukonpaku1337> i love compiling wifi drivers for an hour

16:26 <youmukonpaku1337> best pastime

16:27 <youmukonpaku1337> thank god i didnt have to cross comp mesa and lima is included in stock debian mesa lol

16:30 <youmukonpaku1337> i just hope that i can get desktop gl lima to work

16:30 swalker__ has quit [Remote host closed the connection]

16:30 <youmukonpaku1337> because if so this makes this actually workable instead of pure hell

16:37 Duke`` has joined #dri-devel

16:38 haagch has quit [Quit: http://quassel-irc.org - Chat comfortably. Anywhere.]

16:40 haagch has joined #dri-devel

16:44 kts has quit [Ping timeout: 480 seconds]

16:44 tristan has joined #dri-devel

16:45 Jeremy_Rand_Talos__ has quit [Remote host closed the connection]

16:45 Jeremy_Rand_Talos__ has joined #dri-devel

16:45 tristan is now known as Guest1936

16:46 Jeremy_Rand_Talos__ has quit [Remote host closed the connection]

16:46 Jeremy_Rand_Talos__ has joined #dri-devel

16:50 <gildekel> @pq @emersion Hi! I am currently going through review with Intel on a series that suggest a fix around complete link-training failures, in which in these cases, the effective bandwidth of a connector is set to 0Gbps, which will cause all its modes to be pruned in in the next probing. The risk here is introducing a change that userspaces are not expecting. The intuition suggests that connectors without modes should be ignored...

16:50 <gildekel> The series is here: https://patchwork.freedesktop.org/series/122850/

16:50 <gildekel> I would love to get your input as weston/sway maintainers (hope I got it right)

16:52 <gildekel> And, needless to say, anyone else here who feel like this change is relevant to their product stability

16:52 Danct12 has joined #dri-devel

16:54 yyds has quit [Remote host closed the connection]

16:59 Guest1936 has quit [Ping timeout: 480 seconds]

17:06 <zamundaaa[m]> For KWin connectors with zero modes would be fine; this already happened in the past (don't remember in what circumstances though) so we have a workaround in place

17:12 <gildekel> That's good. The approach here is that upon link-training failure, userspace will get a uevent in which it will see the failed connector is "sterile", so ignoring it, or marking it in a bad state is the goal. At least that's what we would like to see in ChromeOS.

17:13 lynxeye has quit [Quit: Leaving.]

17:17 jljusten has quit [Quit: WeeChat 3.8]

17:20 jljusten has joined #dri-devel

17:24 kxkamil has quit []

17:33 anarsoul|2 has joined #dri-devel

17:34 anarsoul has quit [Read error: No route to host]

17:35 rasterman has joined #dri-devel

17:37 kxkamil has joined #dri-devel

17:39 alanc has quit [Remote host closed the connection]

17:40 Ahuj has quit [Ping timeout: 480 seconds]

18:00 agd5f has quit [Read error: Connection reset by peer]

18:02 agd5f has joined #dri-devel

18:08 bmodem has quit [Ping timeout: 480 seconds]

18:10 flto has quit [Read error: Connection reset by peer]

18:14 flto has joined #dri-devel

18:21 ungeskriptet0 has quit []

18:21 ungeskriptet0 has joined #dri-devel

18:25 mszyprow has quit [Ping timeout: 480 seconds]

18:35 alanc has joined #dri-devel

18:38 gouchi has joined #dri-devel

18:48 Haaninjo has joined #dri-devel

18:55 ap51 has quit [Ping timeout: 480 seconds]

19:06 <airlied> karolherbst: ptx parser in rust?

19:07 <HdkR> Don't even need to parse PTX, plenty of Switch emulators proved you can just take the raw ISA and translate it :P

19:07 <karolherbst> airlied: why not tho...

19:07 <airlied> HdkR: that assumes yoy have raw isa though

19:08 <karolherbst> I just think it makes more sense to have an open ecosystem besides CUDA, so opting in into supporting CUDA is kinda a double edge one here

19:08 <airlied> but yeah sass to nir translator

19:08 <karolherbst> HdkR: right... we could even pattern match commong lowering, it shouldn't be all too hard

19:08 <karolherbst> for compute almost none of this exists anyway

19:09 <karolherbst> though cuda supports texgrad and other evil things :')

19:10 <airlied> also does nvidia have a d9cumented calling convention to their sass kernels?

19:12 <karolherbst> uhm... no idea

19:12 <karolherbst> airlied: you mean sass kernels as in normal compute shaders?

19:13 <karolherbst> the elf binaries usually document all the constant buffers

19:13 <karolherbst> but not sure how flexible they are with that

19:13 <karolherbst> but there doesn't really exist any kinda of calling convention here besides some internal data passed in via const buffers

19:13 <karolherbst> at fixed locations

19:16 <airlied> dont they have libs you link against, or is it just pre made kernels?

19:17 <karolherbst> they have some internal binaries and I'm sure there is some kinda of calling convention for those, but I don't actually know what they are doing there

19:18 <karolherbst> anyway, nvidia does not want you to target SASS

19:18 <karolherbst> so I doubt they document anything

19:20 <karolherbst> and I'm sure I wont' even be allowed to help out writing a SASS parser....

19:20 <karolherbst> or at least that might bring me in a icky legal situation

19:23 rauji___ has joined #dri-devel

19:47 rasterman has quit [Quit: Gettin' stinky!]

19:48 fab has quit [Quit: fab]

19:51 junaid has quit [Remote host closed the connection]

19:54 youmukonpaku1337 has quit [Remote host closed the connection]

19:55 youmukonpaku1337 has joined #dri-devel

19:56 <jtatz[m]> cuobjdump can dump SASS, and the ISA is somewhat documented https://docs.nvidia.com/cuda/cuda-binary-utilities/index.html#instruction-set-reference

19:59 <jtatz[m]> Also for JIT kernels you can use CUPTI to grab it at runtime

20:06 <alyssa> considerably more docs than I was expecting, neat

20:08 <youmukonpaku1337> okay this is not working out

20:08 <youmukonpaku1337> cant change even resolution

20:08 <youmukonpaku1337> no gles2 in es2info with lima

20:08 <youmukonpaku1337> no desktop gl either

20:08 <youmukonpaku1337> and X is buggy as all hell

20:09 <youmukonpaku1337> and gud + lima arent in xrandr providers

20:09 <DemiMarie> youmukonpaku1337: try Wayland

20:10 <youmukonpaku1337> weston literally locks up the system

20:10 <youmukonpaku1337> so uh

20:10 <youmukonpaku1337> anything else

20:11 iive has joined #dri-devel

20:14 <youmukonpaku1337> DemiMarie: can i run mutter separate from gnome? it seems to support this kind of trickery

20:14 mszyprow has joined #dri-devel

20:16 <karolherbst> "somehwat documented" :D

20:16 <karolherbst> yeah...

20:18 <DemiMarie> youmukonpaku1337: Try Sway or KWin.

20:18 <youmukonpaku1337> kwin... definitely no

20:18 <youmukonpaku1337> am gonna try mutter first because pq mentioned it having support for trickery like what im doing

20:19 <DemiMarie> And report a kernel bug, because Weston should not lock up the system in a way that killing it cannot correct.

20:19 <youmukonpaku1337> i mean, system is more or less alive but it crashes GUD it seems

20:20 <DemiMarie> GUD?

20:23 Duke`` has quit [Ping timeout: 480 seconds]

20:25 melonai3 has quit []

20:25 <youmukonpaku1337> Generic USB Display

20:26 melonai3 has joined #dri-devel

20:32 melonai3 has quit []

20:32 melonai3 has joined #dri-devel

20:33 hansg has quit [Quit: Leaving]

20:35 crabbedhaloablut has quit []

20:40 <DemiMarie> Ah

20:40 <DemiMarie> Probably a kernel bug; I would report it to the relevant mailing lists.

20:42 <youmukonpaku1337> hmm

20:42 <youmukonpaku1337> how can i run mutter with PRIME

20:48 youmukon1 has joined #dri-devel

20:49 <youmukon1> youch

20:49 <youmukon1> 4fps under mutter with es2gears wayland

20:49 <youmukon1> also permission denied with kmscube

20:50 <youmukon1> oh nvm

20:51 <youmukon1> okay so i can get 2fps in kmscube if i run it on card0 which is software accelerated (and badly at that) GUD

20:52 <youmukon1> question is how can i use lima to render to card0

20:52 <youmukon1> if i do mesa loader override to use lima it just throws an invalid modeset argument error

20:53 youmukonpaku1337 has quit [Ping timeout: 480 seconds]

20:54 <Sachiel> mesa drivers expect to talk to their corresponding kernel driver, not just some random out of tree thing, so if you are having issues with some random out of tree thing, go ask their authors for support. I don't think you'll find much help here with that

20:55 <youmukon1> GUD is in mainline lol

20:56 <zmike> sounds like bugs

20:57 <youmukon1> ehhh

20:57 <youmukon1> its probably intended and it uses sw accel by default

20:57 <youmukon1> question is how do i make it offload to lima

21:00 <airlied> might have to hack gud kmsro, not sure if that would help

21:01 <youmukon1> what's kmsro

21:02 <youmukon1> as long as it isn't *too* difficult im fine with a little trickery

21:03 <airlied> kmsro is mesa internal thing to link accel and display drivers

21:03 <youmukon1> aha

21:03 <youmukon1> i see

21:04 <airlied> i think you might need to write code in mesa, but that is close to the limit of what i know about it

21:04 <youmukon1> oh fuck

21:04 <youmukon1> i have almost 0 knowledge of how to write C lol

21:05 <karolherbst> does setting `DRI_PRIME=1` help or did you already try that?

21:05 <youmukon1> tried that

21:05 <youmukon1> nope

21:05 <youmukon1> also

21:05 <youmukon1> kmscube shows renderer as mali400

21:05 <youmukon1> but uhh i somehow doubt that's right

21:06 <karolherbst> why not?

21:06 <youmukon1> 2.5 frames per second

21:06 <karolherbst> there might be a different reason it's so slow

21:06 <karolherbst> maybe it's CPU overhead

21:07 <youmukon1> hm

21:07 <karolherbst> the content of the frames kinda need to be copied over to the display driver

21:07 <karolherbst> and if there is no accelerated path for that the performance is kinda toast

21:07 <airlied> also copied over usb

21:07 <karolherbst> try LIBGL_ALWAYS_SOFTARE=1 and see if that changes antyhing

21:07 <youmukon1> thats possible but also es2 info and glxinfo list driver as llvmpipe

21:07 <youmukon1> oh

21:07 <youmukon1> will test

21:07 <youmukon1> in a sec

21:08 <karolherbst> LIBGL_ALWAYS_SOFTWARE=1 I mean

21:08 <karolherbst> but kmscube is kinda special

21:08 <karolherbst> there might be a different way for kmscube to use llvmpipe

21:12 <youmukon1> karolherbst: libgl always software makes kmscube throw "failed to set mode: invalid argument"

21:13 <karolherbst> fair

21:13 <youmukon1> it does show that jts using llvmpipe before that

21:13 <youmukon1> hm

21:14 <youmukon1> i would go the kmsro route but i have absolutelt no idea how to program C so i suppose thats not an option lol

21:16 <youmukon1> is there a way to test rendering speed headlessly?

21:16 <karolherbst> I think it's already working as intented

21:16 <karolherbst> it's just that the kernel driver doesn't provide what we need for proper offloading here

21:16 <karolherbst> at least that's my working theory

21:16 <karolherbst> did you check the CPU load?

21:16 <youmukon1> am about to do that

21:16 <karolherbst> and where it spends most of the CPU cycles at

21:17 <karolherbst> or rather what process uses most of the CPU

21:20 <youmukon1> kmscube was not using much cpu at all

21:20 <karolherbst> yeah.. so it's indeed not softare rendering, or if it is, the bottleneck is something else

21:21 <youmukon1> and how would i go about finding it i guess?

21:21 <karolherbst> is your CPU busy nonetheless?

21:21 <youmukon1> what do you consider busy

21:22 <youmukon1> around 10% with htop open

21:22 <karolherbst> mhh, that's not much

21:22 <karolherbst> like total or is one core at 100%?

21:22 <youmukon1> theres a single core lol

21:22 <karolherbst> heh

21:22 <karolherbst> yeah.. so I guess something with the GUD driver and usb and... other things is why it's slow

21:23 <daniels> there’s no magic bullet here - the only possible solution (pipelining rendering) absolutely requires knowing C

21:23 <youmukon1> ugh

21:23 <daniels> fundamentally, you have a very slow GPU rendering to system memory, then copying back out over USB, and waiting for this to take effect, every frame

21:24 youmukon1 has quit [Read error: Connection reset by peer]

21:25 youmukonpaku1337 has joined #dri-devel

21:26 <youmukonpaku1337> yeah i can see why itd be slow...

21:26 <youmukonpaku1337> wait

21:26 <youmukonpaku1337> how can i check usb bandwidth?

21:27 <youmukonpaku1337> i may have a hunch something went horribly wrong and im running over usb1.1 bandwidth

21:27 <karolherbst> uhh.. that would be terrible indeed

21:28 <youmukonpaku1337> very

21:28 <karolherbst> but would the bandwidth be enough for displaying anything?

21:28 <glennk> lsusb -t should show the theoretical bandwidth for each port

21:28 <youmukonpaku1337> karolherbst: for a tty should be lol

21:28 <glennk> i'm also guessing this platform is stuck with single channel memory too?

21:29 <youmukonpaku1337> OH

21:29 <youmukonpaku1337> LMFAO

21:29 <youmukonpaku1337> it is running at 12mbit bandwidth

21:29 <youmukonpaku1337> the entire hub

21:29 <karolherbst> RIP

21:29 <youmukonpaku1337> ugh

21:29 <youmukonpaku1337> guess im gonna have to somehow use the main port

21:29 <glennk> all pixels, line up in single file...

21:30 <youmukonpaku1337> glennk: i mean i doubt it would have dual channel 256mb ram lol

21:30 <youmukonpaku1337> and no its single channel

21:30 <youmukonpaku1337> but yea

21:30 <glennk> gpu + cpu + usb memory access

21:30 <youmukonpaku1337> i think i found the problem lmao

21:31 <youmukonpaku1337> lemme try uh

21:31 <youmukonpaku1337> the main micro b port

21:31 <youmukonpaku1337> it didnt work before but you never know

21:32 mszyprow has quit [Remote host closed the connection]

21:32 mszyprow has joined #dri-devel

21:33 glennk has quit [Remote host closed the connection]

21:33 glennk has joined #dri-devel

21:34 <youmukonpaku1337> oh GREAT

21:34 <youmukonpaku1337> guess we're back to the roots of this

21:34 <youmukonpaku1337> ;-;

21:35 iive has quit [Quit: They came for me...]

21:35 <youmukonpaku1337> how can i check mode of a usb port?

21:39 <youmukonpaku1337> i do have dr_mode set to host in DT but i dont think it works lol

21:40 gouchi has quit [Quit: Quitte]

21:40 <glennk> cat /sys/bus/usb/devices/<device>/speed and version is one way

21:41 <youmukonpaku1337> mode as in peripheral or host

21:41 <youmukonpaku1337> not speed

21:47 <youmukonpaku1337> hm

21:47 rauji___ has quit []

21:47 <youmukonpaku1337> it should be host

21:47 <youmukonpaku1337> this is quite weird

21:59 <alyssa> karolherbst: hmm, this is a spicy problem

21:59 <alyssa> I /really/ want read-only buffers to be read with load_global_Constant, not load_global

21:59 <alyssa> Is that even a thing in CL? I guess _constant?

22:00 <alyssa> doesn't work with generic ptrs, though

22:03 <alyssa> ok I guess I can just not use generic ptrs, fine

22:05 sassefa has joined #dri-devel

22:05 sassefa has quit []

22:05 sassefa has joined #dri-devel

22:05 sassefa has quit []

22:06 sassefa has joined #dri-devel

22:06 sassefa has quit []

22:06 sassefa has joined #dri-devel

22:06 sassefa has quit []

22:07 sassefa has joined #dri-devel

22:07 sassefa has quit []

22:07 sassefa has joined #dri-devel

22:07 sassefa has quit []

22:08 sassefa has joined #dri-devel

22:08 ngcortes has joined #dri-devel

22:08 sassefa has quit [Read error: Connection reset by peer]

22:09 sassefa has joined #dri-devel

22:09 sassefa has quit []

22:09 <alyssa> OK, yeah, using constant does what I need. Cool

22:09 <alyssa> thanks

22:09 sassefa has joined #dri-devel

22:09 sassefa has quit []

22:10 sassefa has joined #dri-devel

22:10 sassefa has quit []

22:10 <karolherbst> alyssa: yeah.. I want to use ubos for those things in the near future

22:10 <alyssa> I don't =D

22:10 sassefa has joined #dri-devel

22:10 * alyssa flexes her agx hardwrae

22:10 sassefa has quit []

22:11 <karolherbst> ehh...

22:11 sassefa has joined #dri-devel

22:11 <karolherbst> it should at least use load_global_constant though I think

22:11 <alyssa> yeah I just wrote that patch

22:11 sassefa has quit []

22:11 <alyssa> https://rosenzweig.io/0001-nir-lower_io-Use-load_global_constant-for-OpenCL.patch

22:11 <karolherbst> but some hardware benefits from those being actual ubos

22:12 <karolherbst> yeah... I think I have a patch like that somewhere as well

22:12 sassefa has joined #dri-devel

22:12 An0num0us has quit [Ping timeout: 480 seconds]

22:12 <karolherbst> I suspect you lower ubos to load_global_constant in agx?

22:12 <alyssa> yes

22:12 sassefa has quit []

22:12 <alyssa> in the gl driver

22:12 sassefa has joined #dri-devel

22:12 <karolherbst> do you load the descriptor at runtime?

22:13 sassefa has quit []

22:13 <alyssa> there is no descriptor

22:13 <karolherbst> I mean.. the actual address

22:13 sassefa has joined #dri-devel

22:13 <alyssa> it's pushed in

22:13 <karolherbst> mhhh

22:13 sassefa has quit []

22:13 <alyssa> AGX is literally the CL model

22:13 <karolherbst> right...

22:13 <alyssa> pass __constant pointers in and read em

22:13 sassefa has joined #dri-devel

22:14 sassefa has quit []

22:14 <karolherbst> yeah, that's fair, I'm just wondering if there is a significant overhead when using ubos and if we want to make that optional

22:14 sassefa has joined #dri-devel

22:14 <karolherbst> on nvidia it really should be an ubo e.g.

22:14 sassefa has quit []

22:14 sassefa has joined #dri-devel

22:14 <karolherbst> I probably don't even need a new cap for, if the constant buffer size is below 1M it's probably a hardware thing :D

22:15 sassefa has quit []

22:15 sassefa has joined #dri-devel

22:15 sassefa has quit []

22:15 sassefa has joined #dri-devel

22:15 <karolherbst> `PIPE_CAP_MAX_SHADER_BUFFER_SIZE_UINT` is what I'm currently reporting as constant memory size

22:16 sassefa has quit []

22:17 sima has quit [Ping timeout: 480 seconds]

22:17 <karolherbst> alyssa: the thing on nvidia is, that you can literally use ubos as sources for alu instructions

22:17 <karolherbst> and they are as fast as gprs

22:17 <karolherbst> but I know that some drivers really just map them to raw memory

22:18 <karolherbst> so I guess this situation all warrants a flag (which st/mesa might also do in the far future if at all)

22:18 <karolherbst> though I thikn there are also robustness arguments, so drivers are supposed to bound check them?

22:23 Haaninjo has quit [Quit: Ex-Chat]

22:24 <alyssa> i can literally use arbitrarily complicated expressions on UBOs as sources for alu instructions as fast as gpr :~)

22:24 <karolherbst> sounds cursed

22:25 <alyssa> nir_opt_preamble

22:28 mszyprow has quit [Ping timeout: 480 seconds]

22:28 soreau has quit [Ping timeout: 480 seconds]

22:29 soreau has joined #dri-devel

22:40 <karolherbst> right.. but I guess you don't have like 1MB of space there

22:41 <karolherbst> anyway, you still bind it as a normal buffer

22:42 <karolherbst> not as some special ubo thing

22:43 <alyssa> 1KiB, so not quite as big as you ;)

22:44 <karolherbst> right.. I'm just wondering if it makes sense to add complexity to bind constant buffers as ubos or normal global memory depending on what the driver wants or if it's good enough to always bind them as ubos in the future

22:47 <alyssa> always UBOs is fine

22:48 <karolherbst> okay, cool

23:53 <emersion> gildekel: yeah i think 0 modes is a kernel bug

23:59 vliaskov has quit []