#dri-devel on 2022-05-03 — irc logs at oftc.irclog.whitequark.org

2022-03-22 11:57 ChanServ changed the topic of #dri-devel to: <ajax> nothing involved with X should ever be unable to find a bar

00:00 rasterman has quit [Quit: Gettin' stinky!]

00:01 <karolherbst> ahh a crash :)

00:02 nchery is now known as Guest3390

00:02 nchery has joined #dri-devel

00:03 ybogdano has joined #dri-devel

00:04 rkanwal has quit [Quit: rkanwal]

00:04 rkanwal has joined #dri-devel

00:07 columbarius has joined #dri-devel

00:07 neonking__ has joined #dri-devel

00:08 Guest3390 has quit [Ping timeout: 480 seconds]

00:09 co1umbarius has quit [Ping timeout: 480 seconds]

00:09 <karolherbst> airlied: ehh.. does this scratch code even work if the values types are different?

00:09 <karolherbst> like this scratch area contains of a 64 bit and a 32 bit value

00:09 <karolherbst> s/of//

00:10 <karolherbst> and I think you place the elements in a "vector", no

00:10 <karolherbst> ?

00:11 <karolherbst> so I think one thread writes into offset 0x8

00:11 <karolherbst> and another thread reads 0x0 (as 64 bit) and gets a garbaged pointer

00:11 neonking__ has quit [Remote host closed the connection]

00:11 neonking__ has joined #dri-devel

00:11 <airlied> karolherbst: yes it should work for 64-bit or 32-bit values

00:11 <karolherbst> well.. it has both

00:12 <airlied> assuming the bit shift is in the right place :-P

00:12 <karolherbst> I don't think it is

00:13 <karolherbst> it really looks like the content of the scratch buffer gets corrupted

00:13 <karolherbst> airlied: https://gist.githubusercontent.com/karolherbst/69bae372c3a03e670f589d3dc348fa86/raw/73b87471f3a88b663d7c08220ed2428f486f3ad6/gistfile1.txt

00:13 <karolherbst> see those non ptr looking 32 bit values mixed in?

00:13 neonking_ has quit [Ping timeout: 480 seconds]

00:14 <karolherbst> at idx 8

00:14 <karolherbst> and 11

00:15 <karolherbst> 7 and 8 are clearly a heap pointer, but 0x7fffb00000003 is kind of a bad pointer

00:15 <airlied> btw which luxmark scene are you testing with?

00:15 <karolherbst> luxball

00:15 <karolherbst> the others won't compile :D

00:15 <karolherbst> well... I think they would compile at some point

00:15 <karolherbst> anyway

00:15 <karolherbst> I think the offset calculation in load/store scratch is wrong

00:16 <karolherbst> I am sure it all works if _all_ values are either 32 or 64 bit within scratch

00:16 <karolherbst> but not if it's mixed

00:17 <airlied> karolherbst: so it casts the scratch ptr to a 32-bit or 64-bit ptr

00:17 <airlied> then adds the offset to it

00:17 <karolherbst> I mean the thread_offsets value

00:17 <karolherbst> but weird...

00:17 <airlied> that is done before the shift though

00:18 <karolherbst> right...

00:18 <karolherbst> yeah so shift_val is different mhh

00:19 <karolherbst> yeah.. shift_val being wrong _would_ be a valid explanation here I think?

00:21 <airlied> when I break in there, I get scratch at 8 bytes for that demo

00:21 <karolherbst> it launches multiple kernels

00:21 <karolherbst> the second or third one with scratch space has 12

00:21 <airlied> oh i see it now

00:22 * airlied assumes it's not doing unaligned 64-bit loads

00:22 <karolherbst> I am sure it is :P

00:23 <karolherbst> the offsets are also a bit odd

00:23 <karolherbst> store offset = 0 1 3 4 6 7 9 10

00:23 <karolherbst> the 32 bit value gets store offset = 2 5 8 11 14 17 20 23

00:24 <karolherbst> maybe the size of 12 confuses it

00:24 <karolherbst> maybe something should align it?

00:25 <airlied> oh maybe

00:25 <karolherbst> let me try that

00:25 <airlied> might be worth trying to get that to 16 somewhere

00:25 <karolherbst> yeah

00:26 <karolherbst> but that would also explain why it works on iris
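
The two offset lists quoted above are what you get if a per-thread scratch size of 12 bytes is turned into an element index by integer division with the access size. A minimal sketch of that arithmetic, assuming the index is (thread * scratch_size + byte_offset) / access_size; the function names are hypothetical and this is not llvmpipe's actual code:

```c
#include <stdint.h>

/* Hypothetical model, not llvmpipe code: element index used by `thread`
 * when scratch is viewed as an array of `access_size`-byte elements and
 * every thread owns `scratch_size` bytes. */
static inline uint32_t scratch_index(uint32_t thread, uint32_t scratch_size,
                                     uint32_t byte_offset, uint32_t access_size)
{
    return (thread * scratch_size + byte_offset) / access_size;
}

/* First byte actually touched by that access. */
static inline uint32_t scratch_first_byte(uint32_t thread, uint32_t scratch_size,
                                          uint32_t byte_offset, uint32_t access_size)
{
    return scratch_index(thread, scratch_size, byte_offset, access_size) * access_size;
}

/* Round `size` up to a multiple of `align` (align must be a power of two). */
static inline uint32_t align_pot(uint32_t size, uint32_t align)
{
    return (size + align - 1) & ~(align - 1);
}
```

With a size of 12, thread 1's 64-bit value lands on bytes 8..15, which overlaps thread 0's 32-bit value at bytes 8..11. Rounding the size up to 16 with align_pot(12, 8) moves thread 1's 64-bit value to bytes 16..23, so the overlap disappears in this model.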

00:32 <karolherbst> airlied: question is now, should llvmpipe or the frontend work around that?

00:32 <airlied> llvmpipe I think

00:32 <karolherbst> seems to fix it

00:32 <karolherbst> but llvmpipe doesn't know the alignment of the biggest thing :(

00:33 <karolherbst> worst case you need to align to long16, no?

00:33 <airlied> should I align to 8 or just next power of two it?

00:33 <karolherbst> next power of two can hurt if you got huge scratch space

00:33 <airlied> no I think we don't ever vector load

00:33 <airlied> we always only load 32-bit or 64-bit

00:33 <karolherbst> maybe next pot _or_ long16?

00:33 <karolherbst> depending on what's smaller

00:34 <airlied> so I think 8 is probably fine

00:34 <karolherbst> airlied: CL has some stupid reqs on pointer alignments and shit though

00:34 <airlied> but when the IR gets to llvmpipe, it's pretty much load a 32-bit or load a 64-bit this number of components

00:34 <airlied> so you have to iterate it

00:34 <karolherbst> right

00:34 <airlied> it's not like we do a 256-bit fetch even if we could

00:35 <karolherbst> I am just wondering if some of the alignment fails I see are caused by this

00:35 <karolherbst> basic kernel_memory_alignment_constant

00:35 <karolherbst> basic kernel_memory_alignment_global

00:35 <karolherbst> but...

00:35 <karolherbst> could be just llvmpipe

00:36 <karolherbst> sooo.. let's benchmark on my non crappy desktop? :D

00:36 <airlied> https://paste.centos.org/view/raw/ef9a1b1e

00:38 <karolherbst> I am sure this also fixes other random crashes with fp64

00:38 <karolherbst> what a pita of a bug

00:39 <airlied> thanks for digging in!

00:39 <karolherbst> airlied: yeah.. that works :)

00:39 <airlied> 16288 has the patch

00:44 <karolherbst> heh.. my ADL-S doesn't seem so much faster than my CML-H

00:46 <karolherbst> ahh.. it's GT-1 vs GT-2

00:47 <karolherbst> mhh, it should still be faster..

00:48 <airlied> make sure you took off the LP_NUM_THREADS :-P

00:48 <karolherbst> airlied: how can I make llvmpipe use more threads? :D

00:48 <karolherbst> airlied: nah.. I was testing iris first

00:49 <karolherbst> heh.. LP_NUM_THREADS=24 and still only 1100%

00:49 <karolherbst> where is my perf

00:51 <karolherbst> airlied: so uhm... how do I get more perf out of llvmpipe on my machine? :D

00:52 <karolherbst> guess local size of 32 isn't helping? dunno

00:52 <karolherbst> 468 points and image validation seems happy

00:52 <airlied> not really sure, it's probably limited by launch params

00:53 <karolherbst> yeah... so luxmark uses 32 threads on CPU devices

00:53 <karolherbst> and 64 on GPUs

00:53 <HdkR> Is llvmpipe still bounded by vertex heavy jobs rather than fragment?

00:53 <karolherbst> that's on CL :P

00:53 <HdkR> oh wow

00:54 <karolherbst> so.. llvmpipe is a GPU now

00:54 <karolherbst> still only 1100%

00:54 * karolherbst doesn't have 20 cores for nothing

00:55 <karolherbst> but iris seems a little slow

00:56 <karolherbst> so iris on my desktop should be around 50% faster

00:56 <karolherbst> but is only 10%

00:57 <karolherbst> intel_gpu_top says 99%

00:57 <karolherbst> ¯\_(ツ)_/¯

00:58 <karolherbst> maybe I hurt perf

00:59 <karolherbst> yeah.. no

00:59 ybogdano has quit [Ping timeout: 480 seconds]

01:01 <karolherbst> ADL-S GT1: 2719

01:01 <karolherbst> CML GT2: 2305

01:02 <karolherbst> airlied: guess I have to figure out the llvm header situation

01:02 <karolherbst> and I think I might even require llvm-14, because the opencl header stuff isn't as terribly broken there...

01:02 <karolherbst> it still is, but.. uhhh

01:03 * airlied is going to go dig into coroutines

01:03 <karolherbst> good luck

01:03 <karolherbst> ADL-S GT1 + LP: 3139

01:06 mclasen_ has quit [Ping timeout: 480 seconds]

01:07 rkanwal has quit [Quit: rkanwal]

01:17 kts has quit [Quit: Konversation terminated!]

01:22 <karolherbst> airlied: btw, your skynet email is dead

01:27 <airlied> yeah need to chase down where that server went, might have to retire it

01:28 kts has joined #dri-devel

01:32 nchery has quit [Ping timeout: 480 seconds]

01:36 kts has quit [Quit: Konversation terminated!]

02:06 elongbug__ has quit [Ping timeout: 480 seconds]

03:23 rsalvaterra_ has joined #dri-devel

03:23 rsalvaterra is now known as Guest3407

03:23 rsalvaterra_ is now known as rsalvaterra

03:28 Guest3408 has quit [Ping timeout: 480 seconds]

04:01 fxkamd has quit []

04:10 sdutt has quit []

04:15 jimjams has joined #dri-devel

04:23 mwalle has quit [Quit: WeeChat 3.0]

04:38 Duke`` has joined #dri-devel

04:44 famfo has joined #dri-devel

05:23 itoral has joined #dri-devel

05:30 consolers has joined #dri-devel

05:32 <consolers> i think since i moved from mesa-20.2 to 21.2 clinfo started segfaulting: now it segfaults when loading /usr/lib64/gallium-pipe/pipe_iris.so

05:44 <consolers> that was on 22.0 i think i hit this before and figured something out but my mind is a blank

05:51 mhenning has quit [Quit: mhenning]

05:51 jewins has quit [Read error: Connection reset by peer]

05:52 Duke`` has quit [Ping timeout: 480 seconds]

05:58 lemonzest has quit [Quit: WeeChat 3.4]

06:00 consolers has quit [Ping timeout: 480 seconds]

06:07 ppascher has joined #dri-devel

06:10 danvet has joined #dri-devel

06:26 frieder has joined #dri-devel

06:34 garrison has joined #dri-devel

06:34 i-garrison has quit [Read error: Connection reset by peer]

06:41 consolers has joined #dri-devel

06:41 <consolers> any clues on troubleshooting why clinfo is just crashing with mesa?

06:42 mvlad has joined #dri-devel

06:42 <consolers> i know it worked with mesa-20.2.0

06:42 <consolers> but apparently not since, then when i've had 21.2.1 and 22.2.0

06:42 digetx has quit [Ping timeout: 480 seconds]

06:43 digetx has joined #dri-devel

06:48 MajorBiscuit has joined #dri-devel

06:51 consolers has quit [Ping timeout: 480 seconds]

07:01 tzimmermann has joined #dri-devel

07:06 <airlied> danvet: fyi I backmerged rc5, I had an arm build fail it had a fix for

07:07 cheako has quit [Quit: Connection closed for inactivity]

07:10 <dolphin> airlied, danvet: no patches got picked up for drm-intel-fixes this week

07:23 ppascher has quit [Ping timeout: 480 seconds]

07:26 tursulin has joined #dri-devel

07:31 thellstrom has joined #dri-devel

07:33 lumag_ has joined #dri-devel

07:34 mwalle has joined #dri-devel

07:34 tzimmermann has quit [Quit: Leaving]

07:35 tzimmermann has joined #dri-devel

07:37 <tzimmermann> javierm, if you have a bit, could you please comment on https://patchwork.freedesktop.org/series/103222/ ?

07:38 <javierm> tzimmermann: sure, let me do that now

07:39 <tzimmermann> no hurries

07:39 <javierm> tzimmermann: no worries, is that I happen to have time now :)

07:43 thellstrom has quit [Remote host closed the connection]

07:43 rsripada_ has quit [Remote host closed the connection]

07:44 rsripada has joined #dri-devel

07:50 <javierm> tzimmermann: are you familiar with https://www.kernel.org/doc/html/latest/dev-tools/kunit/index.html ?

07:51 <tzimmermann> javierm, no sorry

07:51 <javierm> yeah, me neither. But I think that would be nice to have kunits for all the conversion helpers

07:51 xperia64_ has joined #dri-devel

07:52 <javierm> tzimmermann: I'll add that to my TODO to look at some point, which just keeps growing :)

07:53 <tzimmermann> that's a good idea with these unit tests

07:53 xperia64 has quit [Ping timeout: 480 seconds]

07:54 lynxeye has joined #dri-devel

07:54 <javierm> tzimmermann: Ok, I comment in the list too

07:55 <mripard> javierm: I had to use it a bit recently for the clocks framework, so I can help if needed

07:55 <mripard> (it's awesome)

07:55 <javierm> mripard: great

07:56 <javierm> mripard: yes, I was in a talk about kunit at some conference (plumbers in lisbon maybe?) and thought that was awesome but never had the time to dig deeper

07:57 <javierm> mripard: thanks for the offering, I'll for sure bug you if want to write some unit tests with kunit :)

07:57 nvishwa1 has quit [Read error: Connection reset by peer]

07:58 Lyude has quit [Ping timeout: 480 seconds]

07:58 mattrope has quit [Ping timeout: 480 seconds]

07:58 Lyude has joined #dri-devel

07:59 mattrope has joined #dri-devel

08:01 vyivel has quit [Read error: Connection reset by peer]

08:01 vyivel has joined #dri-devel

08:06 <mripard> I wanted to write some infrastructure for drivers to create unit tests in KMS, but got distracted

08:06 <mripard> maybe that would be worth adding in the TODO too

08:09 <javierm> mripard: Ok, I'll see to add that too when writing the patch for Documentation/gpu/todo.rst

08:10 <mripard> for vc4 for example, we have an atomic_check function that I have unit-tests for, but on my workstation, and it "works" with me copy/pasting the source code each and every time I need to rework it

08:11 <mripard> it's very far from optimal :)

08:12 <javierm> :D

08:13 <javierm> mripard: now you made me even more curious about kunit, gah I wish that had more time

08:13 <javierm> tzimmermann: what a nice patch series, the diff stat speaks for itself. And is great to see that much of code duplication going away

08:13 <tzimmermann> thanks :)

08:14 <tzimmermann> as i said before, i'd like to make these helpers composable, so that complex conversions can be assembled from multiple simple ones. we're not there yet, but it's a big step

08:15 <javierm> tzimmermann: it is a big step indeed

08:16 <javierm> specially since then someone reading these helpers will have to just understand drm_fb_xfrm() (which is complex, true) rather than the small differences between the different conversion helpers

08:20 <javierm> tzimmermann: and the diffstat after your patches speak for itself :)

08:24 <tzimmermann> javierm, the next step is to use iosys_map for the pointer arguments. iosys_map will be amended with caching information. from this, we can easily detect which dbuf/sbuf need temporary buffers and which can be used as-is. we should also be able to merge drm_fb_xfrm() and drm_fb_xfrm_toio() into a single function

08:25 <javierm> tzimmermann: yup, I remember you mentioned that. Will speed up for the cases that don't use CMA/need a temp buffer

08:25 <javierm> since currently we are always doing the extra copy just in case

08:30 maxzor has joined #dri-devel

08:35 mszyprow has joined #dri-devel

08:44 pcercuei has joined #dri-devel

08:45 mszyprow has quit [Ping timeout: 480 seconds]

08:47 <pq> tzimmermann, javierm, mripard, FYI https://lists.freedesktop.org/archives/dri-devel/2022-April/349437.html has also per-line pixel conversion operations.

08:48 <pq> it's that the source or dest is always an internal 16 bpc representation used for blending in VKMS

08:48 rasterman has joined #dri-devel

08:49 jimjams has quit [Quit: Connection closed for inactivity]

08:50 <pq> tzimmermann, drm_fb_xrgb8888_to_rgb565_swab_line() sounds confusing. On one hand, the pixel formats are absolutely defined. OTOH, you add a swab.

08:51 <pq> or are these not reference to DRM_FORMAT_XRGB8888 and DRM_FORMAT_RGB565?

08:54 siqueira has quit []

08:54 lemes has quit []

08:54 melissawen has quit [Quit: ZNC 1.8.2+deb2+b1 - https://znc.in]

08:54 exit70 has quit [Quit: ZNC 1.8.2 - https://znc.in]

08:55 exit70 has joined #dri-devel

08:55 lemes has joined #dri-devel

08:57 siqueira has joined #dri-devel

08:58 melissawen has joined #dri-devel

08:58 <tzimmermann> pq, there are drivers that want a conversion+byteswap. i think we can already express that with the proper 4cc code. but conversion helpers are not there yet. i've been unifying these functions for some time and still in the middle of it. for now, i'd prefer to keep it as-is
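
For readers unfamiliar with what a per-line "conversion+byteswap" does, here is a user-space sketch of an XRGB8888 to RGB565 line conversion with an optional 16-bit byte swap. The function names are hypothetical stand-ins, not the kernel helpers, and the values are treated as native integers rather than as bytes in a particular memory order:

```c
#include <stddef.h>
#include <stdint.h>

/* Pack one XRGB8888 pixel (0xXXRRGGBB) into RGB565 by keeping the top
 * 5 bits of red, 6 bits of green and 5 bits of blue. */
static inline uint16_t xrgb8888_to_rgb565(uint32_t pix)
{
    return (uint16_t)(((pix & 0x00F80000) >> 8) |
                      ((pix & 0x0000FC00) >> 5) |
                      ((pix & 0x000000F8) >> 3));
}

/* Swap the two bytes of a 16-bit value. */
static inline uint16_t swab16(uint16_t v)
{
    return (uint16_t)((v << 8) | (v >> 8));
}

/* Convert one scanline; `swab` selects the byte-swapped variant.
 * Hypothetical helper for illustration only. */
static inline void xrgb8888_to_rgb565_line(uint16_t *dst, const uint32_t *src,
                                           size_t pixels, int swab)
{
    for (size_t i = 0; i < pixels; i++) {
        uint16_t v = xrgb8888_to_rgb565(src[i]);
        dst[i] = swab ? swab16(v) : v;
    }
}
```

The swapped variant produces the same 16-bit pixels with their two bytes exchanged, which is why pq asks whether the result still corresponds to the absolutely defined DRM_FORMAT_RGB565.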

08:59 <tzimmermann> pq, i'll see if some of that vkms code can go into generic helpers

09:00 <pq> tzimmermann, cool, thanks :-)

09:01 <pq> also, someone who actually does kernel dev would be nice to check by review comments on that series, since I'm not familiar with kernel practises

09:01 <pq> *my review comment

09:03 <pq> tzimmermann, for now, the VKMS intermediate pixel format is not defined as a 4cc in order to use a struct conveniently.

09:06 apinheiro has joined #dri-devel

09:30 ppascher has joined #dri-devel

10:00 digetx has quit [Ping timeout: 480 seconds]

10:06 mclasen has joined #dri-devel

10:07 echoed has joined #dri-devel

10:10 echoed has left #dri-devel [#dri-devel]

10:10 consolers has joined #dri-devel

10:10 Lucretia has quit []

10:11 <consolers> could it be some thread thing that causes any opencl thing to segfault when loading the mesa iris gallium dll?

10:12 <consolers> i cant spot any reports on it either - except 2 on libreoffice/opencv i think from 2021 which were solved with downgrades

10:14 <consolers> and if i search for opencl google is giving me results for opened, like some ocr typo

10:15 digetx has joined #dri-devel

10:25 devilhorns has joined #dri-devel

10:26 Lucretia has joined #dri-devel

10:26 consolers has quit [Ping timeout: 480 seconds]

10:59 sagar__ has quit [Remote host closed the connection]

10:59 sagar__ has joined #dri-devel

11:00 consolers has joined #dri-devel

11:00 MajorBiscuit has quit [Ping timeout: 480 seconds]

11:01 rkanwal has joined #dri-devel

11:15 rasterman has quit [Quit: Gettin' stinky!]

11:16 rasterman has joined #dri-devel

11:21 Lucretia has quit []

11:22 Lucretia has joined #dri-devel

11:37 lemonzest has joined #dri-devel

11:43 itoral has quit [Remote host closed the connection]

11:43 itoral has joined #dri-devel

11:47 itoral has quit [Remote host closed the connection]

11:47 itoral has joined #dri-devel

11:49 consolers has quit [Ping timeout: 480 seconds]

11:50 ppascher has quit [Ping timeout: 480 seconds]

11:51 MajorBiscuit has joined #dri-devel

11:53 anarsoul has quit [Quit: ZNC 1.8.2 - https://znc.in]

11:53 anarsoul has joined #dri-devel

12:06 rgallaispou has quit [Remote host closed the connection]

12:29 maxzor has quit [Ping timeout: 480 seconds]

12:41 rpigott has quit [Read error: Connection reset by peer]

12:43 <javierm> tzimmermann: https://github.com/raspberrypi/linux/issues/5011#issuecomment-1116048782

12:44 icecream95 has quit [Ping timeout: 480 seconds]

12:50 itoral has quit []

12:51 <HdkR> robclark: I noticed you wanted a way to determine big.little cores. Welcome to the pain train, there is no exact way to determine this, you must use heuristics. You could peek at what FEX-Emu does to classify big versus little but it leaves open the possibility of getting things wrong.

12:51 alyssa has left #dri-devel [#dri-devel]

12:59 sdutt has joined #dri-devel

13:03 <tzimmermann> javierm, sounds good

13:05 <tzimmermann> javierm, same here: https://lore.kernel.org/linux-fbdev/BN9PR11MB537070E36D25158B265C8490ECC09@BN9PR11MB5370.namprd11.prod.outlook.com/T/#m48bb67f14303608a73ab955c24255c46d78aa91a

13:06 ppascher has joined #dri-devel

13:09 rgallaispou has joined #dri-devel

13:10 hch12907 has joined #dri-devel

13:12 MajorBiscuit has quit [Ping timeout: 480 seconds]

13:16 MajorBiscuit has joined #dri-devel

13:16 <robclark> HdkR: I believe "capacity" is exposed in sysfs.. crosvm (or rather the thing that launches it) looks at this to setup big and little vcpu's.. but haven't had a chance to look more closely at that

13:22 <MrCooper> daniels: do you remember what happened to https://lists.x.org/archives/xorg-devel/2017-November/055172.html (v3 of DRI3 v1.2: DMA fences), i.e. why it wasn't applied or followed up on?

13:28 <daniels> MrCooper: I think we already had this discussion on IRC a couple of years ago?

13:29 <daniels> MrCooper: by the time the dust had settled between keithp and ickle, it sounded like the conclusion was that it wouldn't be landed until xserver had a smart scheduler which would wait until the fences had actually signaled before doing anything

13:29 <daniels> unfortunately mine & lfrb's time on this earth is but finite

13:31 <MrCooper> thanks, don't remember such a fight, maybe I forgot about it :/

13:32 <MrCooper> that argument does make sense to me though

13:32 <daniels> yeah, it sounded like there was zero support for simply shuttling fences through, i.e. acting as we currently do with implicit sync

13:32 <daniels> and that nothing would be landable until the server was taking decisions itself

13:33 <daniels> it does sound like a good idea in isolation, but given the time that would require, and that you'd only really see any benefit if you weren't simply proxying via Xwl, or if you were mixing present + core rendering ... eh

13:35 <MrCooper> the same thing could be done with implicit sync in principle, using dma-buf fds

13:35 rgallaispou has quit [Read error: Connection reset by peer]

13:38 <daniels> it could!

13:38 <daniels> many things are possible

13:38 <daniels> a superset of the things which are sensible :P

13:51 <MrCooper> it's arguably a requirement for proper mailbox behaviour

13:52 * daniels shrugs

13:52 rpigott has joined #dri-devel

13:53 jewins has joined #dri-devel

13:58 <javierm> tzimmermann: interesting, should we just land it then ?

13:59 <tzimmermann> javierm, sure, why not.

13:59 <javierm> tzimmermann: I wondered the same that Junxiao asked but just did the minimum change to fix this particular issue

14:00 mszyprow has joined #dri-devel

14:00 <tzimmermann> well, he has a point

14:01 <tzimmermann> then maybe do a v2 with the other interfaces fixed.

14:01 Company has joined #dri-devel

14:02 maxzor has joined #dri-devel

14:05 <daniels> MrCooper: it sounds like a good thing to do and I'm certainly not going to talk you out of it :)

14:10 <zmike> anholt: what's the deqp-runner syntax for multiple --env options?

14:11 <zmike> or any deqp-runner expert

14:12 <javierm> tzimmermann: sure. It can't do any harm I guess^Whope :)

14:24 consolers has joined #dri-devel

14:25 alyssa has joined #dri-devel

14:27 alyssa has left #dri-devel [#dri-devel]

14:30 tzimmermann has quit [Quit: Leaving]

14:41 consolers has quit [Ping timeout: 480 seconds]

14:41 mszyprow has quit [Ping timeout: 480 seconds]

14:42 fxkamd has joined #dri-devel

14:45 MajorBiscuit has quit [Ping timeout: 480 seconds]

14:46 rgallaispou has joined #dri-devel

14:49 <pepp> zmike: I think you just pass multiple "--env name=value" param

14:50 <zmike> hm

14:50 <zmike> maybe I was doing something else wrong

14:51 mszyprow has joined #dri-devel

14:51 MajorBiscuit has joined #dri-devel

14:54 <robclark> HdkR: fwiw: cat /sys/devices/system/cpu/cpu*/cpu_capacity
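
One way to use those cpu_capacity values, as a rough sketch only: treat every CPU whose capacity equals the maximum as "big". The function name and the rule itself are assumptions made for illustration. The sketch takes the values as an array, does not read sysfs itself, and the rule can misclassify systems with more than two cluster types or with missing capacity data:

```c
#include <stddef.h>

/* Hypothetical heuristic: mark CPUs whose capacity equals the maximum
 * capacity as "big" (is_big[i] = 1), everything else as little (0).
 * `capacity` holds one value per CPU, as one would read them from the
 * per-CPU cpu_capacity files. Returns the number of big CPUs. */
static inline size_t classify_big_cpus(const unsigned *capacity, size_t n,
                                       unsigned char *is_big)
{
    unsigned max = 0;
    size_t big = 0;

    for (size_t i = 0; i < n; i++)
        if (capacity[i] > max)
            max = capacity[i];

    for (size_t i = 0; i < n; i++) {
        is_big[i] = (n > 0 && capacity[i] == max) ? 1 : 0;
        big += is_big[i];
    }
    return big;
}
```

On a machine where every CPU reports the same capacity, this marks all of them as big, so a caller would need to treat that case as "no distinction".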

15:00 ella-0_ has joined #dri-devel

15:03 ella-0 has quit [Read error: Connection reset by peer]

15:03 mszyprow has quit [Ping timeout: 480 seconds]

15:13 i-garrison has joined #dri-devel

15:13 garrison has quit [Read error: Connection reset by peer]

15:16 nvishwa1 has joined #dri-devel

15:22 <jekstrand> daniels, anholt: https://gitlab.freedesktop.org/mesa/mesa/-/jobs/22105382

15:23 <jekstrand> It's an infra problem. Not sure if it's fd.o or Google

15:25 <daniels> jekstrand: it's google, cf. #freedreno and also #freedesktop :P

15:25 <daniels> robclark is trying to fix it

15:26 tobiasjakobi has joined #dri-devel

15:26 tobiasjakobi has quit []

15:28 <jekstrand> :(

15:29 * jekstrand goes back to fixing nir_lower_blend

15:29 <daniels> sozzers

15:32 alyssa has joined #dri-devel

15:33 <alyssa> Static functions (not marked inline) in a header don't compile in release builds, but are fine in debug builds.

15:33 <alyssa> Any idea why that might be? Ideally any code that builds in debug also builds in release.

15:33 <alyssa> (In this case, it sounds like that should've failed in a local debug build.)

15:33 <alyssa> (Example: https://gitlab.freedesktop.org/mesa/mesa/-/jobs/22103773 )

15:34 <ajax> because release builds enforce -Werror=unused-function. so any file you include the header in, must reference all the statics therein

15:34 <ajax> unless they're inline, or __attribute__((unused)), or whatever

15:34 <alyssa> ajax: ok.. is there a reason debug builds don't enforce -Werror=unused-function?

15:35 <daniels> alyssa: you're calling it from an assert()

15:35 <daniels> which makes it used in debug builds, and unused in release builds

15:35 <ajax> hah, indeed, i skipped a step there

15:35 <alyssa> aaaah

15:35 <daniels> but anyway, just make it inline ... ?

15:35 <alyssa> Yeah, the correct fix is to make it inline
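
A minimal sketch of the situation described above, with hypothetical function names. The helper is only referenced from assert(), so when NDEBUG is defined the call disappears; a plain `static` definition would then be unused, while GCC's -Wunused-function only covers non-inline static functions:

```c
#include <assert.h>
#include <stdbool.h>

/* Only referenced from assert() below. With NDEBUG defined the assert
 * expands to nothing, so a plain `static` version of this helper would be
 * reported by -Wunused-function (an error with -Werror=unused-function).
 * Marking it `static inline` avoids that. */
static inline bool is_power_of_two(unsigned v)
{
    return v != 0 && (v & (v - 1)) == 0;
}

/* Integer log2 of a power-of-two value. */
static inline unsigned log2_of_pot(unsigned v)
{
    unsigned shift = 0;

    assert(is_power_of_two(v));
    while (v > 1) {
        v >>= 1;
        shift++;
    }
    return shift;
}
```

This also shows why the debug build was happy: there the assert keeps the helper referenced, so nothing is unused.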

15:36 <alyssa> I'm more baffled why it went through my local build (and made it to CI at all)

15:36 Lucretia has quit [Remote host closed the connection]

15:36 <alyssa> instead of gcc screaming at me to mark it inline

15:36 <alyssa> I'm the kind of person that needs compilers to scream at me :p

15:37 <karolherbst> alyssa: some people implement things in headers and include those

15:37 <karolherbst> so if gcc would scream, it would break code :)

15:37 <alyssa> sounds like a good thing to break ;)

15:38 <karolherbst> if you are ready for that bikeshedding, please write the patch and explain why violating some weird spec is fine :D

15:38 <karolherbst> let's make it a daily thing: shitting on C and be annoyed by how bad it is or something

15:38 <alyssa> which is the spec violation?

15:38 <karolherbst> does C even know about headers ?

15:38 <karolherbst> the pre processor is probably a spec on its own

15:40 <ajax> would be somewhat weird for the C standard to both define what goes in what standard headers and not know what headers are

15:41 <karolherbst> I am sure the standard lib is another spec

15:41 <karolherbst> maybe it isn't.. :D

15:42 <ajax> open-std.org is down atm so i can't pull up n1570.pdf to check, but

15:43 <karolherbst> the C spec is cursed

15:43 <karolherbst> "??=define arraycheck(a, b) a??(b??) ??!??! b??(a??)"

15:44 <karolherbst> who is a C expert and knows what that resolves to?

15:44 Lucretia has joined #dri-devel

15:44 <karolherbst> that's right, it's #define arraycheck(a, b) a[b] || b[a]
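
The replacement above can be reproduced mechanically. Below is a small sketch of a translator for the nine standard trigraphs; the function name is made up for illustration, and the lookup table is keyed on the third character so that the source itself contains no trigraph sequences:

```c
#include <stddef.h>
#include <string.h>

/* Replace the nine C trigraphs in `in`, writing at most cap-1 characters
 * plus a terminating NUL into `out`. third[i] is the character following
 * the two question marks, result[i] is what the sequence stands for. */
static inline void replace_trigraphs(const char *in, char *out, size_t cap)
{
    static const char third[]  = "=(/)'<!>-";
    static const char result[] = "#[\\]^{|}~";
    size_t o = 0;

    if (cap == 0)
        return;

    while (*in != '\0' && o + 1 < cap) {
        const char *hit = NULL;

        /* The in[2] != '\0' guard keeps strchr from matching the
         * table's own terminator. */
        if (in[0] == '?' && in[1] == '?' && in[2] != '\0')
            hit = strchr(third, in[2]);

        if (hit != NULL) {
            out[o++] = result[hit - third];
            in += 3;
        } else {
            out[o++] = *in++;
        }
    }
    out[o] = '\0';
}
```

When feeding the quoted macro to this function from C source, writing the second question mark of each pair as `\?` keeps a compiler with trigraphs enabled from translating the test string itself.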

15:44 <daniels> trigraphs are so awesome

15:45 <karolherbst> I didn't even know they existed

15:45 <ajax> they don't anymore iirc

15:45 <karolherbst> ajax: I have the C17 spec here :(

15:45 <daniels> in fairness gcc warns when you use trigraphs unless you specifically suppress it

15:46 <daniels> 'you 100% do not mean this, if you did then you can enable it but you didn't'

15:46 <karolherbst> what's the reason to add those anyway?

15:46 <ajax> i thought they were getting dropped in c23 was the rumor

15:46 <ajax> because ebcdic doesn't have all of the basic character set for c89 in its minimal subset

15:47 <ajax> so depending which s360 you find yourself on you might not have [] as, like, keys on the keyboard

15:47 <karolherbst> ohh wow.. "The trigraph sequences enable the input of characters that are not defined in the Invariant Code Set as described in ISO/IEC 646, which is a subset of the seven-bit US ASCII code set."

15:47 <alyssa> karolherbst: "I have the C17 spec here :(" ditching clang+llvm-spirv are we now?

15:47 <karolherbst> alyssa: :D

15:47 <karolherbst> I won't comment on that

15:47 <ajax> so you cannot write those characters into files, which makes it hard for the compiler to tokenise them

15:47 <karolherbst> uhhh

15:48 <ajax> i'm blaming ebcdic here and i think there's at least one other non-ascii encoding that was partly to blame here, but

15:48 <jekstrand> hrm... nir_lower_blend really shouldn't require 32-bit for logic ops...

15:48 * karolherbst should use unicode emoticons as function names more often

15:48 nchery has joined #dri-devel

15:49 <ajax> greek alphabet in math functions please

15:49 <karolherbst> good idea actually

15:49 <karolherbst> assert becomes 🔥

15:50 <karolherbst> we have those joke programming languages, but maybe there needs to be one where ANSI chars are invalid

15:51 <ajax> every character must be from a unicode codepoint > 0xff

15:51 <alyssa> karolherbst: do it in rust :p

15:52 <karolherbst> I am not sure if the world is ready for that yet

15:52 <alyssa> a C->NIR compiler written in Rust? how hard can it be?

15:52 <alyssa> famous last

15:52 <karolherbst> mhhh

15:52 <karolherbst> don't tempt me

15:52 <alyssa> You've been tempted! :-p

15:53 <karolherbst> how much is rust self hostet, if llvm is still written in C anyway

15:53 <karolherbst> *hosted

15:56 stuart has joined #dri-devel

15:57 Duke`` has joined #dri-devel

16:04 <rgallaispou> Hi. I'm struggling with gamma again...

16:04 <rgallaispou> In drm_atomic_uapi.c:384, what is the point of this test ? Is it only to test data alignment ? Because it won't pass any error to userland if the data is aligned according to 'expected_elem_size' but out of the struct (let it be 2048 + 8). This is my current issue, shown by kms_color@pipe-a-invalid-gamma-lut-sizes: the ioctl returns 0 when it should not. How does it go on Intel/AMD sides ?

16:06 <jekstrand> Ok, here's a fun question: If someone doesn't write to gl_FragData.w but blending is such that w doesn't matter, do they get well-defined results? I think the answer is yes, unfortunately.

16:06 <hch12907> alyssa: I think I had a C parser somewhere, written in rust... maybe we can repurpose that and make a C->NIR compiler, lol

16:07 * karolherbst doesn't think he is ready for linking inside nir yet

16:08 <karolherbst> heck, not even vtn would be ready

16:10 <alyssa> jekstrand: I think so. Why is that unfortunate?

16:10 <jekstrand> alyssa: Just more juggling we have to do in nir_lower_blend

16:11 <alyssa> right, okay

16:11 <jekstrand> I think the easy thing to do is just make the variable always match the format. Then we'll even get some dead-code action happening, maybe.

16:15 <vsyrjala> rgallaispou: sounds like you're not checking that the blob has the correct size

16:16 <alyssa> jekstrand: hm, alright

16:16 <alyssa> it might be nice to nir_lower_blend for radeonsi-style shader epilogs on AGX

16:16 <alyssa> but.. meh, tbh

16:16 <jekstrand> Sure

16:17 <jekstrand> Doesn't sound like a terrible idea

16:17 <alyssa> actually, jank from shader variants on AGX with AAA games sounds like a great problem to have, don't worry about it ;)

16:17 <alyssa> (and presumably that's all Vulkan content when someday asahivk is a thing)

16:20 MajorBiscuit has quit [Ping timeout: 480 seconds]

16:28 <rgallaispou> vsyrjala: it seems it resolves to drm_atomic_replace_property_blob_from_id(), but I don't see any call to the stm driver

16:28 <rgallaispou> vsyrjala: did you mean on a userland level or on the kernel side ?

16:29 <vsyrjala> kernel. driver needs to check that
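
The difference between the two checks can be sketched in plain C. This is an illustration under assumptions, not kernel code: the struct only mirrors the four 16-bit fields of the UAPI struct drm_color_lut, the function names are hypothetical, and the 256-entry hardware LUT is assumed from the 2048-byte figure in the question:

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Same layout as the UAPI struct drm_color_lut: four 16-bit fields. */
struct color_lut_entry {
    uint16_t red, green, blue, reserved;
};

/* Shape of the generic check being asked about: the blob length only has
 * to be a whole number of elements. */
static inline bool blob_is_element_aligned(size_t blob_len, size_t elem_size)
{
    return elem_size != 0 && blob_len % elem_size == 0;
}

/* Shape of a driver-side check: the number of entries must also match
 * what the hardware LUT holds. */
static inline bool lut_has_expected_entries(size_t blob_len, size_t hw_entries)
{
    return blob_is_element_aligned(blob_len, sizeof(struct color_lut_entry)) &&
           blob_len / sizeof(struct color_lut_entry) == hw_entries;
}
```

A blob of 2048 + 8 bytes passes the first check because it is a multiple of 8, and only the second check rejects it. That matches the symptom of the ioctl returning 0 when the driver does not validate the size.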

16:46 hikiko_ has joined #dri-devel

16:46 <rgallaispou> vsyrjala: okay, I'll check that, thanks

16:50 hikiko has quit [Ping timeout: 480 seconds]

16:52 <MrCooper> daniels: FWIW, assuming a fence fd becomes readable when the fence is signalled, it shouldn't require a "smart scheduler": IgnoreClient if fence isn't signalled yet, AttendClient when the fd becomes readable
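
The "fd becomes readable" part of that idea can be sketched with a zero-timeout poll(). The helper name is hypothetical, and the sketch only covers the readiness check; it does not call IgnoreClient or AttendClient:

```c
#include <poll.h>
#include <stdbool.h>

/* Non-blocking readiness check: returns true if `fd` is readable right
 * now. Under the assumption stated in the message (a fence fd becomes
 * readable once the fence has signalled), a false result is where the
 * server would ignore the client, and readability is where it would
 * attend to the client again. */
static inline bool fd_is_readable(int fd)
{
    struct pollfd pfd = { .fd = fd, .events = POLLIN, .revents = 0 };

    /* Timeout 0: report the current state without blocking. */
    return poll(&pfd, 1, 0) == 1 && (pfd.revents & POLLIN) != 0;
}
```

Any pollable fd can stand in for a fence when trying this out, for example the read end of a pipe before and after one byte is written to the other end.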

16:52 gouchi has joined #dri-devel

16:54 <daniels> MrCooper: sure

16:55 gouchi has quit []

16:58 hikiko has joined #dri-devel

17:02 hikiko_ has quit [Ping timeout: 480 seconds]

17:03 <MrCooper> daniels: FWIW, the context for my question is https://gitlab.freedesktop.org/xorg/xserver/-/issues/1317

17:08 <zmike> mareko: you good with https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15504 ?

17:19 slattann has joined #dri-devel

17:21 <ajax> why are there two generated copies of vk_cmd_queue.h in my build directory

17:21 <ajax> and why do they have different content

17:22 devilhorns has quit []

17:23 <ajax> and, most importantly, why for me does lavapipe include the one that doesn't declare everything

17:31 slattann has quit [Remote host closed the connection]

17:33 <zmike> rm -r build/src/vulkan

17:37 alyssa has left #dri-devel [#dri-devel]

17:40 <daniels> MrCooper: NV being special again then

17:46 frieder has quit [Remote host closed the connection]

17:59 <jenatali> Huh... I think there's a double-close fd bug for Android native fences...

18:00 imirkin_ has joined #dri-devel

18:00 <jenatali> Oh, no I just got the semantics wrong, nevermind

18:09 nvishwa1 has quit [Read error: Connection reset by peer]

18:14 imirkin_ has quit [Quit: Leaving]

18:15 lynxeye has quit [Quit: Leaving.]

18:21 tjmercier has joined #dri-devel

18:22 krushia has joined #dri-devel

18:40 alanc has quit [Remote host closed the connection]

18:40 alanc has joined #dri-devel

18:48 Haaninjo has joined #dri-devel

18:54 gawin has joined #dri-devel

19:02 stuart has quit [Ping timeout: 480 seconds]

19:11 <zmike> jenatali: can I get an ack on https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16153

19:13 eukara has quit []

19:14 <zmike> and https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16311

19:17 eukara has joined #dri-devel

19:23 apinheiro has quit [Ping timeout: 480 seconds]

19:25 <jenatali> zmike: What does has_alpha control?

19:25 <zmike> jenatali: whether the swapchain has alpha

19:26 <zmike> XRGB or ARGB basically

19:26 <jenatali> Ah, sure, yeah I don't see any reason to not always have alpha

19:36 rasterman has quit [Quit: Gettin' stinky!]

19:40 stuart has joined #dri-devel

19:46 <HdkR> robclark: sadly capacity only works if that is actually filled out. Also only gives you an idea, you still need to make a choice in big.bigger.biggest or small.smaller.smallest weirdo clustering setups :|

19:47 nchery has quit [Ping timeout: 480 seconds]

19:50 alyssa has joined #dri-devel

19:50 <alyssa> if an OpenCL shader gets a pointer to something on the stack, what does that look like in (optimized, lowered) NIR?

19:51 <alyssa> I guess load_scratch_base_ptr

19:55 apinheiro has joined #dri-devel

19:58 <karolherbst> alyssa: yes and no

19:58 <karolherbst> I think we have enough opt passes by now to resolve a lot of those things

19:59 <karolherbst> but yes.. if it ends up as function_temp memory, that gets lowered to scratch

19:59 <karolherbst> alyssa: I just don't think we end up with nir_load_scratch_base_ptr in CL

20:00 <alyssa> Hmm

20:00 <karolherbst> the base_ptr is only relevant for shader_calls as it seems

20:00 <alyssa> huh, ok

20:00 <karolherbst> for CL we just have a scratch space starting at 0 and the driver has to allocate that

20:01 <alyssa> that's better for Mali, I guess

20:01 <karolherbst> llvmpipe just mallocs :)

20:01 <karolherbst> on Nvidia we'd use local memory

20:01 <alyssa> though er how does that work

20:01 <karolherbst> like the same as for spilled memory

20:01 <karolherbst> ehh... spilled registers

20:01 <alyssa> yeah, but those aren't spilled to 0x0

20:01 <alyssa> ..

20:01 <karolherbst> the address doesn't matter

20:01 <alyssa> even if you take an address of it..?

20:02 <karolherbst> the only thing CL cares about is alignment of the address

20:02 <karolherbst> alyssa: the neat part is, sharing those pointers across invocations is just undefined behavior

20:02 <alyssa> ...oh, I see the trick you're doing now.

20:03 <alyssa> so even if the app does something cruel like *(&x[63] + y)

20:03 <karolherbst> anyway.. we have deref_cast to get the actual pointer value

20:03 <alyssa> it still just turns into load_scratch, never load_global

20:03 <karolherbst> yep

20:03 <alyssa> excellent

20:03 <alyssa> will do the easy thing then

20:03 <karolherbst> yeah

20:03 <karolherbst> just use whatever stuff you use for indirect arrays

20:04 mvlad has quit [Remote host closed the connection]

20:04 <karolherbst> and if you don't support scratch mem yet, just port your driver over to it :D

20:04 <alyssa> heh, we have scratch

20:04 <karolherbst> ahh

20:04 <karolherbst> excellent

20:04 <alyssa> but the hw likes to mangle the addresses for cache reasons

20:04 <karolherbst> then it should just work, no?

20:04 <karolherbst> yeah.. shouldn't matter

20:04 <alyssa> and at first blush it looks like that mangling needs to be disabled for CL

20:04 <alyssa> but yeah, ok

20:04 <karolherbst> as long as the alignment stays the same

20:04 <alyssa> Yep

20:05 <karolherbst> CL has strict rules though

20:05 <alyssa> each 16 byte chunk remains as-is

20:05 <karolherbst> so int16 is 0x80 aligned

20:05 <karolherbst> ehh

20:05 <karolherbst> long16
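
The 0x80 figure follows from the OpenCL C rule that a vector type is aligned to its size in bytes, with 3-component vectors taking the size of the 4-component type. A small sketch of that arithmetic; the helper name `cl_vector_alignment` is made up for illustration and is not a Mesa or OpenCL function.

```c
#include <assert.h>
#include <stddef.h>

/* Alignment in bytes of an OpenCL C vector type with the given element
 * size and component count. Vectors are aligned to their own size, and
 * 3-component vectors are sized and aligned like 4-component ones. */
size_t cl_vector_alignment(size_t elem_size, unsigned components)
{
    assert(components == 2 || components == 3 || components == 4 ||
           components == 8 || components == 16);
    if (components == 3)
        components = 4;
    return elem_size * components;
}
```

With 8-byte elements and 16 components (long16) this gives 0x80; with 4-byte elements (int16) it gives 0x40, which matches the correction from int16 to long16 above.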

20:05 <alyssa> grumble. guess the mangling goes.

20:05 <karolherbst> it should be fine though

20:05 <alyssa> though... maybe not..?

20:05 <karolherbst> I think if the kernel wants the address we simply use the offset

20:05 <alyssa> because the app can never get to the physical pointer, only the virtual pointer starting at 0, which is aligned?

20:06 <karolherbst> deref_cast (ssa_x) whatever

20:06 <karolherbst> and that's just casting the deref thing to the constant

20:06 <karolherbst> alyssa: yeah, I think so

20:06 <karolherbst> the nir shader doesn't know the physical pointer anyway

20:06 <karolherbst> you get the offset into load_scratch/store_scratch

20:06 <karolherbst> what you do with that is up to you
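
One way a driver might turn the zero-based scratch offset into a real address, as a sketch only. The layout (one contiguous slab per invocation) and every name here (`scratch_physical_address`, `scratch_base`, `per_invocation_size`) are assumptions for illustration; real hardware, including the address mangling alyssa mentions, may lay scratch out quite differently.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical layout: each invocation owns one contiguous slab of
 * per_invocation_size bytes, and the shader only ever sees `offset`,
 * which starts at 0 for every invocation. */
uint64_t scratch_physical_address(uint64_t scratch_base,
                                  uint32_t invocation_index,
                                  uint32_t per_invocation_size,
                                  uint32_t offset)
{
    assert(offset < per_invocation_size);
    return scratch_base
         + (uint64_t)invocation_index * per_invocation_size
         + offset;
}
```

Because the shader never observes the result of this mapping, the driver is free to change it as long as the alignment seen through the offset is preserved.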

20:07 nchery has joined #dri-devel

20:07 <karolherbst> I don't think we even get a load_scratch_base_ptr at all

20:07 <karolherbst> alyssa: ahhhh.. I know why I never saw any nir_load_scratch_base_ptr

20:08 <karolherbst> using nir_address_format_32bit_offset_as_64bit, for temp memory :)

20:08 <karolherbst> if you'd use nir_address_format_64bit_global _then_ you'd get load_scratch_base_ptr

20:09 <alyssa> Sure, that works great for us :)

20:09 <karolherbst> yeah.. I don't know who even wants real pointers on temp mem

20:09 <jenatali> Intel does

20:09 <karolherbst> well.. except llvmpipe

20:09 <jenatali> That's why jekstrand added the scratch base ptrs IIRC

20:09 <karolherbst> jenatali: seems to work fine without it?

20:10 <jenatali> Or maybe it was only for making it work with generic pointers

20:10 <karolherbst> maybe

20:10 <karolherbst> ahh yeah

20:10 <karolherbst> I think that's it

20:10 <alyssa> generic pointers...?

20:11 <karolherbst> because you need to allow drivers to map it into global mem

20:11 <alyssa> why does CL do this to us

20:11 <karolherbst> so load_scratch_base_ptr is the pointer into _global_ mem of the scratch space

20:11 <jenatali> It's optional in 3.0 at least

20:11 <alyssa> jenatali: optional means wontfix! :p

20:11 <jenatali> Yeah until you find some app that needs it

20:11 <jenatali> Which I hope there aren't any?

20:11 <karolherbst> alyssa: I got CL C 2.0 kernels using generic pointers to compile without any of this mess though :D

20:12 <karolherbst> jenatali: luxmark 3.1

20:12 <karolherbst> but...

20:12 <karolherbst> nir was able to resolve all generics to their original types

20:12 <jenatali> Huh really? It uses generic?

20:12 <karolherbst> yeah

20:12 <jenatali> Interesting

20:12 <karolherbst> that's why we added that alu of cast optimization

20:12 <karolherbst> so we can optimize away NULL checks on generics

20:13 <karolherbst> well.. if NULL is passed as an arg that is

20:13 <karolherbst> anyway.. my hope is, that we can always resolve those... but I am sure that function calling will make that impossible

20:13 <karolherbst> or we duplicate...

20:13 <karolherbst> dunno

20:14 <karolherbst> not a fan of having to generate worse code, just because of generics

20:15 <karolherbst> jenatali: my mistake was to expose CL C 3.0 as the "default" language, turns out, some applications assume you support CL C 2.0 then :)

20:15 <karolherbst> and the spec specifically says to only do that if you support _all_ CL C 2.0 features

20:15 <jenatali> Ah, yeah that makes sense

20:15 <karolherbst> you can still expose it in the list property

20:15 <karolherbst> just not through that single value one

20:16 <karolherbst> CL_DEVICE_OPENCL_C_VERSION needs to be 1.2

20:16 <karolherbst> CL_DEVICE_OPENCL_C_ALL_VERSIONS can list 3.0

20:16 <airlied> i think we will need generic addresses for sycl

20:16 <karolherbst> airlied: that's fine

20:16 <karolherbst> I don't claim support for generics, not even using that address mode, but it still works fine

20:17 <karolherbst> there are just really rare corner cases where that would be required

20:17 <karolherbst> like storing it into global mem and loading it later

20:17 <karolherbst> but I think for most applications just implementing functions with generic args we can probably wing it and hope it works out

20:18 <karolherbst> airlied: is there a sycl CTS or something btw?

20:18 <karolherbst> :D

20:19 <zmike> dcbaker: I'll have a couple more backports for the next rc

20:19 <zmike> will prob do them tomorrow morning before you get up

20:20 <karolherbst> alyssa: anyway.. once you get rusticl working, I'd be interested how much breaks :D

20:20 <karolherbst> my hope is that stuff simply passes, but...

20:21 <karolherbst> alyssa: btw.. I have a patch which uses an ubo for the input buffer

20:21 <karolherbst> https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15439/diffs?commit_id=eb614b928d4a528c06f06a417d96eea3d3b84e2b

20:22 <karolherbst> airlied: we might want to do the same in clover and get rid of that input stuff :D

20:22 <karolherbst> ehh wait.. radeon...

20:22 <karolherbst> *sigh*

20:23 mszyprow has joined #dri-devel

20:26 <alyssa> what's the deal with radeon cl

20:26 <karolherbst> alyssa: it uses llvm directly

20:27 <karolherbst> like directly directly

20:27 <karolherbst> has to use the AMD ABI and stuff

20:27 <alyssa> yeah... why do we support that again?

20:27 <alyssa> :p

20:27 <karolherbst> because there is no other way

20:27 <karolherbst> ask airlied for details

20:27 <alyssa> :V

20:27 <alyssa> i'm better off not knowing

20:27 <karolherbst> I am not going to support anything besides nir anyway

20:28 <karolherbst> so...

20:28 <alyssa> and panfrost isn't going to support clover once rusticl is merged, I think ;)

20:28 <karolherbst> :D

20:28 <karolherbst> but I'd be really curious how well it works

20:29 <dcbaker> @zmike: sounds good

20:29 <karolherbst> I still want to wire it up with nouveau, but for that I need to fix multithreading

20:30 <karolherbst> actually.. let me try it with my patches and see what happens

20:32 MrCooper has quit [Ping timeout: 480 seconds]

20:38 <alyssa> karolherbst: ooi what are the blockers for rusticl in-tree?

20:38 <karolherbst> alyssa: mostly just reviews?

20:39 <karolherbst> I do have a bunch of fixes for random stuff in tree though

20:39 <karolherbst> alyssa: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6311#note_1340336

20:39 <karolherbst> there is a list of MRs

20:39 <karolherbst> but I think I need to create more

20:39 <alyssa> ah

20:39 <mlankhorst> danvet: ping?

20:39 <karolherbst> there are 161 commits, and only 95 do rusticl stuff

20:40 <alyssa> delight

20:40 <karolherbst> most of it is bumping texture/sampler view limits

20:40 <karolherbst> and some iris fixes

20:41 <karolherbst> we also need to fix llvm for conformance, but..

20:41 <alyssa> 22.3 then?

20:42 <karolherbst> maybe?

20:42 <danvet> mlankhorst, too late here, pls ping me again tomorrow ...

20:42 <karolherbst> though 22.2 should be possible

20:42 <karolherbst> we just need reviews

20:43 <karolherbst> alyssa: most of the stuff isn't really needed though.. I could do a run without any of those patches and see how bad it would be :D

20:43 cheako has joined #dri-devel

20:43 <alyssa> "rusticl: the CTS is a piece of shit"

20:43 <alyssa> maybe some git rebase needed too? :p

20:44 <karolherbst> no, that's intentional

20:44 <karolherbst> :D

20:44 <karolherbst> although I think we might get that fixed in the CTS

20:44 <karolherbst> there are other applications broken by it though

20:44 <karolherbst> it's all so terrible

20:44 <karolherbst> really hate that we have to do it like that

20:45 <karolherbst> yeah.. I guess I'll change that at some point

20:45 nchery has quit [Ping timeout: 480 seconds]

20:46 Duke`` has quit [Ping timeout: 480 seconds]

20:47 nchery has joined #dri-devel

20:51 MrCooper has joined #dri-devel

20:54 <jekstrand> jenatali: base pointers are for making it work with generic pointers and for making it work with ray-tracing.

20:56 <karolherbst> jekstrand: you need it for ray tracing? :( sounds awful

20:58 <alyssa> is unaligned access with load/store_scratch defined?

20:58 <karolherbst> nope

20:58 <alyssa> excellent

20:58 <karolherbst> at least not inside llvmpipe as we figured out yesterday :)

20:58 <jekstrand> karolherbst: Yup. RT kernels do scratch totally differently for $REASONS

20:58 Haaninjo has quit [Quit: Ex-Chat]

20:59 <karolherbst> alyssa: anyway.. you can assume that you'll get correct alignments for everything

20:59 <karolherbst> if not, we messed up

20:59 <jekstrand> Well, actually, the reason is really simple: Scratch offsets are assigned per logical invocation, not per physical thread because invocations may move around between threads as shaders are dispatched, rays are traced, continuations happen, etc.

20:59 <jekstrand> Ok, maybe that's not simple. (-:

20:59 <karolherbst> sounds horrible

20:59 <jekstrand> It's a pretty straightforward consequence of the API

20:59 <karolherbst> I bet it was sure fun to implement all of that

21:01 <alyssa> raytracing sounds awful

21:01 <jekstrand> Eh, it's kinda fun, actually.

21:01 <karolherbst> implementing OpenCL is also kind of fun :P

21:02 * alyssa fixes piles of spilling bugs on Valhall

21:04 <karolherbst> yay

21:04 <karolherbst> are you running luxmark yet?

21:13 mszyprow has quit [Ping timeout: 480 seconds]

21:15 rasterman has joined #dri-devel

21:16 stuart has quit [Ping timeout: 480 seconds]

21:17 <alyssa> no, ES3.1 cts

21:21 danvet has quit [Ping timeout: 480 seconds]

21:21 anarsoul has quit [Ping timeout: 480 seconds]

21:34 ppascher has quit [Ping timeout: 480 seconds]

21:35 rasterman has quit [Quit: Gettin' stinky!]

21:35 heat has joined #dri-devel

21:36 <anholt> danylo: does gfxreconstruct have a way to look at the state (particularly image contents) along the way of rendering a frame?

21:37 rasterman has joined #dri-devel

21:39 iive has joined #dri-devel

21:40 stuart has joined #dri-devel

21:44 rasterman has quit [Quit: Gettin' stinky!]

21:46 rasterman has joined #dri-devel

21:47 ppascher has joined #dri-devel

21:48 nchery has quit [Ping timeout: 480 seconds]

21:49 <danylo> anholt: nope, no way to look at any state there

21:50 nchery has joined #dri-devel

21:51 <danylo> only way is to make a renderdoc capture and inspect it there, which could be tricky when you're trying to debug a hang...

22:07 fxkamd has quit []

22:08 rasterman has quit [Quit: Gettin' stinky!]

22:09 <anholt> luckily not a hang on this one, just the first 2kb of gfxbench vk-5-normal's screen being corrupted.

22:11 apinheiro has quit [Quit: Leaving]

22:22 maxzor has quit [Ping timeout: 480 seconds]

22:27 <HdkR> Is there any way to get wayland to not autodetect monitor/output removal like X?

22:31 <daniels> HdkR: ask your compositor

22:32 <HdkR> hmmm

22:32 <daniels> Wayland only does what it’s told to

22:33 <HdkR> Sadly I don't think sway has a swaymsg command to disable autodetect

22:39 lemonzest has quit [Quit: WeeChat 3.4]

22:49 <HdkR> Oh well, I'll wait for that part of the ecosystem to mature some more :)

22:57 pcercuei has quit [Quit: dodo]

22:58 eukara has quit []

23:00 <Ristovski> karolherbst: Where can I find progress on radeonsi support for rusticl? In the draft comments you mentioned that airlied is working on that part?

23:01 <karolherbst> Ristovski: dunno.. but talking with airlied on this made it sound like it would take a while, because of how messy AMD's way of doing compute is

23:01 <Ristovski> Heh, sounds about right

23:01 <karolherbst> they have their own kernel ABI and stuff

23:01 <Ristovski> Hmm, as in amdkfd?

23:02 <karolherbst> no, shader ABI

23:02 Kayden has quit [Quit: go to office]

23:02 <Ristovski> Aaah, that makes more sense

23:02 <karolherbst> so the idea would be to wire up ACO or something, but that also sounds like a ton of work

23:03 * Ristovski reads discussion from logs

23:05 alyssa has left #dri-devel [#dri-devel]

23:07 icecream95 has joined #dri-devel

23:07 icecream95 has quit []

23:08 icecream95 has joined #dri-devel

23:08 eukara has joined #dri-devel

23:14 mclasen has quit []

23:14 mclasen has joined #dri-devel

23:14 <karolherbst> anybody ever used phoronix-test-suite with their own compiled binaries? I think it just cleans the environment making it a pita to use

23:14 <Ristovski> I guess most of the stuff is in https://gitlab.freedesktop.org/airlied/mesa/-/commits/radeonsi-aco-clover :D

23:14 <karolherbst> probably

23:15 <dschuermann> will definitely take some time to land. we first have to get rid of the remaining radv bits in aco

23:15 <Ristovski> Hmm, does it only support recent GFX or does it go all the way back to GCN1?

23:15 <karolherbst> yeah.. sounds like quite the project

23:15 <karolherbst> Ristovski: probably the same hardware radv runs on

23:16 heat has quit [Remote host closed the connection]

23:16 heat has joined #dri-devel

23:16 <Ristovski> I see, that should cover GCN1 as well then

23:16 <Ristovski> (asking since I saw MCBP mentioned and that is GFX8+)

23:18 <karolherbst> "The test run did not produce a result." *sigh*

23:18 <karolherbst> ahh works with system bins

23:18 <karolherbst> "fun"

23:19 <karolherbst> well.. it doesn't after all

23:19 mdroper has joined #dri-devel

23:19 <karolherbst> "The test run ended quickly" yeah well...

23:20 <karolherbst> wow.. it does crash the GPU context

23:20 anarsoul has joined #dri-devel

23:22 <karolherbst> ehh "write: 512 GB in 736.9 ms: 694.8 GB/s" I have questions

23:28 gawin has quit [Ping timeout: 480 seconds]

23:28 tursulin has quit [Read error: Connection reset by peer]

23:29 <Ristovski> lol

23:30 <karolherbst> either we are that good or something is fishy

23:32 <karolherbst> I suspect we are not handling 64 bit sized things all that well

23:34 <karolherbst> "Test buffers will use GB" well..

23:34 <karolherbst> what crappy code is that

23:44 morphis has quit [Ping timeout: 480 seconds]

23:44 <karolherbst> ahh yeah.. it passes a null buffer in? wtf

23:45 morphis has joined #dri-devel

23:45 <Ristovski> unrelated PSA: https://github.com/iovisor/bpftrace is seriously OP, I just used it as a no-mess `initcall_debug` alternative and it's probably much less overhead as well. Possibilities are truly endless *goes back to profiling random crap*

23:48 <karolherbst> jekstrand: I can trigger a "[drm] rusticl queue t[1648304 context reset due to GPU hang" reliably :(

23:48 <karolherbst> something with loops

23:48 <karolherbst> like.. _long_ loops

23:48 <karolherbst> like millions of iterations

23:48 <karolherbst> "for (block = 0; block < ((1024*1024*1024/sizeof(ulong))/32); block += 256)"
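
Taking the quoted loop header at face value, and assuming `ulong` is 8 bytes as in OpenCL C, the bound and the number of trips of this outer loop can be worked out as below. The helper names are hypothetical, and the loop body is not shown in the log, so the total amount of work per invocation cannot be derived from this alone.

```c
#include <assert.h>
#include <stdint.h>

/* The loop bound from the quoted header, with OpenCL's ulong modelled
 * as uint64_t: (1 GiB / 8 bytes) / 32. */
uint64_t outer_loop_bound(void)
{
    return (1024ull * 1024 * 1024 / sizeof(uint64_t)) / 32;
}

/* Number of iterations of `for (i = 0; i < bound; i += step)`. */
uint64_t outer_loop_trips(uint64_t bound, uint64_t step)
{
    assert(step > 0);
    return (bound + step - 1) / step;
}
```

The bound comes out at 4194304 and, with a step of 256, the outer loop runs 16384 times.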

23:50 lumag_ has quit [Ping timeout: 480 seconds]

23:50 jhli has quit [Quit: ZNC 1.8.2 - https://znc.in]

23:50 rkanwal has quit [Ping timeout: 480 seconds]

23:51 jhli has joined #dri-devel

23:53 Kayden has joined #dri-devel

23:54 <karolherbst> yeah.. just iterating more makes it crash

23:57 <heat> Ristovski, ebpf is singlehandedly the best and worst thing in the linux kernel :D

23:58 <heat> but yeah, pretty pretty nifty. especially on networking stuff