#dri-devel on 2023-11-18 — irc logs at oftc.irclog.whitequark.org

2022-12-21 00:45 ChanServ changed the topic of #dri-devel to: <ajax> nothing involved with X should ever be unable to find a bar

00:05 iive has quit [Quit: They came for me...]

00:23 pcercuei has quit [Quit: dodo]

00:25 Jeremy_Rand_Talos_ has quit [Remote host closed the connection]

00:26 Jeremy_Rand_Talos_ has joined #dri-devel

00:55 alanc has quit [Remote host closed the connection]

00:55 alanc has joined #dri-devel

01:15 co1umbarius has joined #dri-devel

01:16 columbarius has quit [Ping timeout: 480 seconds]

01:24 crabbedhaloablut has quit []

01:24 kts has joined #dri-devel

01:26 jkhsjdhjs has quit [Quit: Error: Leaving not permitted]

01:27 jkhsjdhjs has joined #dri-devel

02:20 Emantor has quit [Quit: ZNC - http://znc.in]

02:20 Emantor has joined #dri-devel

02:36 kts has quit [Ping timeout: 480 seconds]

03:02 jolan_ has quit []

03:05 sumits has quit [Quit: ZNC - http://znc.in]

03:05 andrey-konovalov has quit [Quit: ZNC - http://znc.in]

03:07 jolan has joined #dri-devel

03:13 andrey-konovalov has joined #dri-devel

03:14 jolan has quit [Quit: leaving]

03:48 TMM has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

03:48 TMM has joined #dri-devel

03:49 heat has quit [Ping timeout: 480 seconds]

03:49 YuGiOhJCJ has joined #dri-devel

04:02 i509vcb has joined #dri-devel

04:16 glennk has joined #dri-devel

05:45 airlied has joined #dri-devel

06:03 fab has joined #dri-devel

06:27 Company has quit [Remote host closed the connection]

06:28 airlied has quit [Ping timeout: 480 seconds]

06:33 Company has joined #dri-devel

06:40 Company has quit [Quit: Leaving]

07:07 kts has joined #dri-devel

07:37 glennk has quit [Ping timeout: 480 seconds]

07:41 macslayer has quit [Ping timeout: 480 seconds]

08:00 sghuge has quit [Remote host closed the connection]

08:00 sghuge has joined #dri-devel

08:22 crabbedhaloablut has joined #dri-devel

08:29 apinheiro has joined #dri-devel

08:39 ungeskriptet has quit [Quit: The Lounge - https://thelounge.chat]

08:48 sima has joined #dri-devel

08:53 JohnnyonFlame has joined #dri-devel

09:03 kts has quit [Ping timeout: 480 seconds]

09:07 ungeskriptet has joined #dri-devel

09:15 kts has joined #dri-devel

09:15 kts has quit []

09:28 yyds has joined #dri-devel

09:45 kzd has quit [Ping timeout: 480 seconds]

09:52 camus1 has quit [Remote host closed the connection]

09:53 camus has joined #dri-devel

10:07 gouchi has joined #dri-devel

10:14 pcercuei has joined #dri-devel

10:37 flom84 has joined #dri-devel

10:40 i509vcb has quit [Quit: Connection closed for inactivity]

10:55 camus has quit []

11:10 flom84 has quit [Ping timeout: 480 seconds]

11:19 glennk has joined #dri-devel

11:21 simon-perretta-img has joined #dri-devel

11:27 cdslooef^ has joined #dri-devel

11:32 sgruszka has joined #dri-devel

11:53 sgruszka has quit [Read error: Connection reset by peer]

12:01 kts has joined #dri-devel

12:27 rasterman has joined #dri-devel

12:28 gouchi has quit [Remote host closed the connection]

13:14 Company has joined #dri-devel

13:22 kts has quit [Ping timeout: 480 seconds]

13:47 mceier has quit [Quit: leaving]

13:49 mceier has joined #dri-devel

14:26 <jenatali> karolherbst: I think I just figured out a way to do BDA in dzn, the exact same way that CLOn12 emulates pointers (idx+offset pairs), but instead of index being a locally bound array index, just make it a global resource ID which we already have for descriptor_indexing

14:27 <jenatali> I don't think I would've come up with that if you hadn't suggested rusticl on zink on dzn, so thanks for that

14:46 YuGiOhJCJ has quit [Quit: YuGiOhJCJ]

15:03 <karolherbst> jenatali: cool. But would that also work with arbitrary pointers?

15:04 <karolherbst> though _might_ be fine

15:04 <karolherbst> or at least for most things

15:05 <karolherbst> jenatali: I think the only problem you'd need to solve is to keep the idx valid across kernel invocations for things like global variables or funky stuff kernels might do

15:05 <jenatali> What do you mean arbitrary pointers? When the app asks for a buffer address, I just give back an index and offset

15:05 <jenatali> Yeah, that's what I mean by a global index

15:05 <karolherbst> sure, but applications can do random C nonsense

15:05 <karolherbst> right..

15:05 <karolherbst> yeah.. then it should be fine

15:06 <jenatali> Yeah, it won't be stable for capture/replay but that's a different feature so that's fine

15:06 <karolherbst> so set_global_bindings would return an index and offset packed into 64 bits, I pass this into the kernel via ubo0 (kernel arguments) and then it should be good to go

15:06 <jenatali> Right
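
[editor's note] The "index and offset packed into 64 bits" idea discussed above can be sketched roughly as follows. This is only an illustration: the 32/32 bit split, the choice of putting the index in the high half, and all function names are assumptions made for the example, not something stated in the conversation or taken from dzn, CLOn12, or rusticl.

```python
# Hypothetical layout: high 32 bits = global resource index, low 32 bits = byte offset.
# The split and the names below are assumptions for illustration only.
INDEX_BITS = 32
OFFSET_BITS = 32
OFFSET_MASK = (1 << OFFSET_BITS) - 1


def pack_pointer(index: int, offset: int) -> int:
    """Pack a (resource index, byte offset) pair into one 64-bit integer."""
    if not 0 <= index < (1 << INDEX_BITS):
        raise ValueError("index does not fit in the index field")
    if not 0 <= offset <= OFFSET_MASK:
        raise ValueError("offset does not fit in the offset field")
    return (index << OFFSET_BITS) | offset


def unpack_pointer(ptr: int) -> tuple[int, int]:
    """Recover the (resource index, byte offset) pair from a packed value."""
    return ptr >> OFFSET_BITS, ptr & OFFSET_MASK


def add_offset(ptr: int, delta: int) -> int:
    """Emulated pointer arithmetic: only the offset field changes, the index is kept."""
    index, offset = unpack_pointer(ptr)
    return pack_pointer(index, (offset + delta) & OFFSET_MASK)
```

With this layout a kernel argument is just a 64-bit value, and a load through it would first look up the resource by index and then read at the offset.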

15:07 <karolherbst> and gallium doesn't use load_global(_constant) and store_global for anything, so you can deal with the madness there

15:07 neniagh_ has quit []

15:07 neniagh has joined #dri-devel

15:07 <karolherbst> I wonder if I want to support different pointer layouts directly, but....

15:08 <jenatali> Well I don't have that bindless path in the gallium driver currently, only in dozen

15:08 kts has joined #dri-devel

15:08 yyds has quit [Remote host closed the connection]

15:08 <karolherbst> the CL path is really special sadly

15:08 <karolherbst> we have this `set_global_bindings` api which is a bit funky...

15:08 <karolherbst> but that's everything you'd need

15:08 <jenatali> Yeah makes sense

15:09 <karolherbst> luckily there are no bindless images or anything

15:09 <karolherbst> and `set_global_bindings` basically means: give me the GPU address for those pipe resources, and make them available on compute dispatches

15:09 <karolherbst> *for

15:10 <karolherbst> there is also some funky offset business going on, but iris/radeonsi/zink have it correctly implemented

15:11 <karolherbst> jenatali: uhm.. there is another thing: `pipe_grid_info::variable_shared_mem`, no idea if you can support that

15:12 <karolherbst> how are CL local memory kernel parameters currently implemented on your side?

15:12 <jenatali> Only by recompiling shaders

15:12 <karolherbst> mhhh

15:12 <jenatali> Same with local group size because that's a compile-time param in D3D

15:13 <karolherbst> I see, so you have to deal with pain like that already anyway

15:13 <jenatali> Yeah

15:14 <karolherbst> kinda sucks, but not much you or I could do about it...

15:15 <jenatali> karolherbst: btw, I noticed you're computing a dynamic local size by using gcd() with the SIMD (wave) size and the global size. That's always going to return 2 for even global sizes and 1 for odd, since SIMD sizes are powers of 2

15:16 <jenatali> I was looking because CLOn12's handling of odd global dimensions was... Bad

15:16 <karolherbst> yeah...

15:16 <karolherbst> I reworked that code tho, just never landed it as it was part of non uniform workgroup support

15:16 <jenatali> Cool

15:16 <karolherbst> it doesn't matter anyway as most applications aren't silly enough to run into this edge case

15:17 <karolherbst> can you support non uniform work groups?

15:17 <karolherbst> if so.. doesn't matter long term anyway

15:17 <jenatali> Not natively

15:18 <karolherbst> mhhh

15:18 <jenatali> karolherbst: apparently Photoshop does

15:18 <karolherbst> figures...

15:18 <jenatali> At least that's what one of our teams is telling me

15:18 <karolherbst> yeah.. it makes perfect sense if they use image sizes for stuff
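
[editor's note] For readers unfamiliar with the term: with non-uniform work groups, a global size that is not a multiple of the local size is covered by full groups plus one smaller trailing group. A minimal 1D sketch, assuming a hypothetical helper name and ignoring the multi-dimensional case, might look like this:

```python
def split_work_groups(global_size: int, local_size: int) -> list[int]:
    """Return the size of each work group covering a 1D global range.

    All groups have local_size items except possibly the last one,
    which takes whatever remains (the "non-uniform" group).
    """
    if global_size <= 0 or local_size <= 0:
        raise ValueError("sizes must be positive")
    full_groups, remainder = divmod(global_size, local_size)
    sizes = [local_size] * full_groups
    if remainder:
        sizes.append(remainder)
    return sizes
```

For an image-derived global size such as 500 with a local size of 64, this gives seven groups of 64 and one group of 52.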

15:20 <karolherbst> but uhhh.. why do you think I'm using the simd size with gcd? I'm using the thread count and the grid size

15:20 <karolherbst> subgroups only as a last resort if things align really terribly

15:20 <karolherbst> *SIMD size

15:21 <karolherbst> `optimize_local_size` is what I'm looking at

15:23 <karolherbst> so if you have 512 threads and a grid of 500x1x1, you'd get 500x1x1 still

15:24 <karolherbst> it just has some weirdo edge cases where it uses terrible local sizes

15:24 <karolherbst> I don't like the third part of that function and it could be better, but it's not _as_ bad

15:29 simon-perretta-img has quit [Ping timeout: 480 seconds]

15:30 <jenatali> Hmm ok, I thought I saw SIMD size in there

15:30 simon-perretta-img has joined #dri-devel

15:30 <jenatali> The gcd is still always going to be 2 or 1 though, since that thread count will also be a power of 2

15:33 neniagh has quit [Ping timeout: 480 seconds]

15:38 simon-perretta-img has quit [Ping timeout: 480 seconds]

15:39 simon-perretta-img has joined #dri-devel

15:42 neniagh has joined #dri-devel

15:44 <karolherbst> it can be any pot number

15:44 <karolherbst> if your gpu supports 1024 threads, you have 2^10 on one side, and anything else on the other one

15:44 <jenatali> ... Yeah that's what I meant

15:44 <jenatali> A power of 2 or 1

15:44 <karolherbst> ahh yeah, fair
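
[editor's note] The point agreed on above, that the gcd of a power-of-two thread count and an arbitrary grid size is itself a power of two (possibly 1), can be checked with a short sketch. This covers only the gcd step in isolation; it is not a reproduction of `optimize_local_size`, and the helper name is made up for the example.

```python
import math


def largest_pot_divisor(n: int) -> int:
    """Largest power of two that divides n (for n > 0), via the lowest set bit."""
    return n & -n


# With a power-of-two thread count, gcd(threads, grid) is the largest power of
# two dividing the grid size, capped at the thread count.
max_threads = 1024  # example value, 2**10
for grid in (7, 6, 96, 500, 2048):
    g = math.gcd(max_threads, grid)
    assert g == min(max_threads, largest_pot_divisor(grid))
    print(grid, g)
```

So the result is 1 for odd grid sizes and can be larger than 2 for even ones, for example 4 for a grid of 500 and 32 for a grid of 96.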

15:45 <karolherbst> the last block is supposed to fill it up if the middle one couldn't find a pot of a SIMD size or bigger

15:46 <karolherbst> so if the loop manages to set local to the SIMD size, fine, nothing else to do. I just wanted to prevent sub optimal distribution of threads

15:46 <karolherbst> _however_

15:46 <karolherbst> threads doesn't have to be pot

15:46 <karolherbst> intel is kinda weird there...

15:47 <karolherbst> jenatali: https://github.com/KhronosGroup/OpenCL-CTS/issues/1716

15:48 <karolherbst> there are some intel extensions to make better use of it, and I also kinda have to take that into account

15:48 <jenatali> Fun

15:48 <karolherbst> but I also kinda wanted to finish non uniform first

15:48 <karolherbst> the intel extension e.g. allows you to set the subgroup size

15:49 <karolherbst> but yeah.. that part of the code has a big TODO to take all of that into account

16:23 fab has quit [Quit: fab]

16:24 fab has joined #dri-devel

16:28 simon-perretta-img has quit [Ping timeout: 480 seconds]

16:29 simon-perretta-img has joined #dri-devel

16:37 Duke`` has joined #dri-devel

17:13 gouchi has joined #dri-devel

17:27 heat has joined #dri-devel

17:32 maxzor has joined #dri-devel

17:41 kts has quit [Ping timeout: 480 seconds]

17:42 RSpliet has quit [Quit: Bye bye man, bye bye]

17:52 maxzor has quit []

17:53 maxzor has joined #dri-devel

17:54 RSpliet has joined #dri-devel

17:58 glennk has quit [Ping timeout: 480 seconds]

18:01 glennk has joined #dri-devel

18:05 kzd has joined #dri-devel

18:13 Aura has joined #dri-devel

18:38 rasterman has quit [Quit: Gettin' stinky!]

18:40 macslayer has joined #dri-devel

19:07 maxzor has quit [Ping timeout: 480 seconds]

19:08 maxzor has joined #dri-devel

19:29 sima has quit [Ping timeout: 480 seconds]

19:47 heat has quit [Remote host closed the connection]

19:47 heat has joined #dri-devel

19:50 maxzor_ has joined #dri-devel

20:26 glennk has quit [Ping timeout: 480 seconds]

20:36 glennk has joined #dri-devel

20:54 simon-perretta-img has quit [Ping timeout: 480 seconds]

20:55 simon-perretta-img has joined #dri-devel

21:18 Duke`` has quit [Ping timeout: 480 seconds]

21:21 fab has quit [Quit: fab]

21:25 Ristovski has quit [Quit: 0]

21:26 Ristovski has joined #dri-devel

21:32 Ristovski has quit [Quit: 0]

21:36 sravn_ is now known as sravn

21:40 simon-perretta-img has quit [Ping timeout: 480 seconds]

21:41 simon-perretta-img has joined #dri-devel

22:00 gouchi has quit [Remote host closed the connection]

22:21 maxzor has quit []

22:29 maxzor has joined #dri-devel

22:37 maxzor has quit [Remote host closed the connection]

22:42 oneforall2 has quit [Remote host closed the connection]

22:45 oneforall2 has joined #dri-devel

23:03 apinheiro has quit [Quit: Leaving]

23:09 TMM has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

23:09 TMM has joined #dri-devel

23:56 crabbedhaloablut has quit []