<karolherbst>
forget the benchmark, I want to see the simulation
<karolherbst>
I wonder if with rusticl it's faster than ROCm...
<karolherbst>
I think I have one of the GPUs listed there
<DottorLeo>
karolherbst: i asked you in the past whether rusticl can, in theory, use different gpus simultaneously, right?
<DottorLeo>
let's say nvidia+amd+intel igpu
<karolherbst>
yeah... though I think the memory model is a bit broken on this
<karolherbst>
not sure, but it e.g. worked with luxmark just fine
<karolherbst>
so if you see any issues there feel free to report it
<DottorLeo>
so that software could use ALL the compute units in a PC, CPU + all the gpus? :D
<karolherbst>
yeah
<karolherbst>
just
<karolherbst>
llvmpipe is slower than pocl :D
<karolherbst>
it got better, but there are still some issues left to resolve
<karolherbst>
llvmpipe is really bad at utilizing the CPU
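For reference, the "use every compute unit in the box" idea above is just standard OpenCL platform/device enumeration; the following is a minimal sketch (plain OpenCL host API, nothing rusticl-specific, error handling mostly omitted):

    /* list_devices.c - enumerate every OpenCL platform and device on the system.
     * Build (assuming an ICD loader is installed): cc list_devices.c -lOpenCL */
    #include <stdio.h>
    #include <CL/cl.h>

    int main(void)
    {
        cl_platform_id platforms[16];
        cl_uint num_platforms = 0;

        clGetPlatformIDs(16, platforms, &num_platforms);

        for (cl_uint p = 0; p < num_platforms; p++) {
            char name[256];
            clGetPlatformInfo(platforms[p], CL_PLATFORM_NAME, sizeof(name), name, NULL);
            printf("platform %u: %s\n", p, name);

            cl_device_id devices[16];
            cl_uint num_devices = 0;
            /* CL_DEVICE_TYPE_ALL picks up CPU implementations (pocl, llvmpipe)
             * as well as every GPU */
            if (clGetDeviceIDs(platforms[p], CL_DEVICE_TYPE_ALL, 16, devices,
                               &num_devices) != CL_SUCCESS)
                continue;

            for (cl_uint d = 0; d < num_devices; d++) {
                char dev_name[256];
                clGetDeviceInfo(devices[d], CL_DEVICE_NAME, sizeof(dev_name),
                                dev_name, NULL);
                printf("  device %u: %s\n", d, dev_name);
            }
        }
        return 0;
    }

Note that devices from different platforms can't share a single cl_context, so an application like luxmark has to create one context per platform and split the work across them itself.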
<DottorLeo>
wow, you know that once rusticl is conformant on all the platforms it will be seen as the second rebirth of OpenCL? :D
<karolherbst>
maybe?
<karolherbst>
my goal is just to make _some_ compute API available to the linux desktop
<karolherbst>
as in: people can rely on it being functional
<karolherbst>
at this point, only Nvidia's and Intel's stacks are what I'd consider somewhat functional
<DottorLeo>
it's interesting that the author of FluidX3D used OpenCL instead of just CUDA, and not only for the multivendor support. He says that if done right, OpenCL on Nvidia is as good as CUDA
<karolherbst>
yeah, it is
<karolherbst>
most code is just bad
<karolherbst>
and most runtimes are
<karolherbst>
nvidia's CL impl is really the best so far
<karolherbst>
but they also have the compiler to back it up
<karolherbst>
I mean.. there are computationally heavy benchmarks where rusticl outperforms ROCm by 20%
<karolherbst>
it's a bit disappointing to be honest
<DottorLeo>
why?
<karolherbst>
I expected that a serious company like AMD would put more effort into this
<penguin42>
but there again I've got kernels where ROCm wins for me; so shrug
<DottorLeo>
Maybe rusticl will be used on some GPU compute farms instead of ROCm; i think it doesn't matter to the end user, only speed and correctness matter :)
<karolherbst>
penguin42: yeah.. but rusticl isn't optimized at all
<DottorLeo>
did you try it with blender?
<karolherbst>
blender dropped CL
<karolherbst>
so.. it's either CUDA or HIP
<karolherbst>
there is a HIP on CL implementation, but it's not ready for blender
<DottorLeo>
yeah, sorry, I meant the HIP implementation
<karolherbst>
but rusticl also still has huge issues so it's still gonna take a while
<DottorLeo>
and SYCL from intel?
<karolherbst>
it's progressing
<karolherbst>
the issue with SYCL from intel is that they produce invalid spir-v
<karolherbst>
a lot
<DottorLeo>
karolherbst: one last thing, when you merged the optional image support, did it also enable it on r600? it was one of the missing things in clover for those cards
<karolherbst>
ohh, images have been supported since day 1
<karolherbst>
the optional stuff are just more formats
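The distinction (baseline image support vs. the extra optional formats) is visible from the client side; a rough sketch using only standard OpenCL queries, where `ctx` is assumed to be a context that contains the device `dev`:

    /* Check whether a device exposes images at all, then count the image
     * formats the implementation offers; the "optional stuff" shows up as
     * additional entries in this list. */
    #include <stdio.h>
    #include <CL/cl.h>

    static void dump_image_support(cl_context ctx, cl_device_id dev)
    {
        cl_bool has_images = CL_FALSE;
        clGetDeviceInfo(dev, CL_DEVICE_IMAGE_SUPPORT, sizeof(has_images),
                        &has_images, NULL);
        if (!has_images) {
            printf("no image support\n");
            return;
        }

        cl_uint num_formats = 0;
        clGetSupportedImageFormats(ctx, CL_MEM_READ_WRITE, CL_MEM_OBJECT_IMAGE2D,
                                   0, NULL, &num_formats);
        printf("%u read/write 2D image formats\n", num_formats);
    }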
<DottorLeo>
yes, but when you merge a feature, is it enabled for all the supported platforms that use rusticl? Sorry, i'm trying to understand how it works when you add new stuff to rusticl :)
<karolherbst>
yeah
<karolherbst>
sometimes there are driver bits to it
<karolherbst>
but we try to be accurate in the features.txt file
<DottorLeo>
because @gerddie said on the r600 MR that image support was missing
<karolherbst>
doesn't list r600 yet, because it's broken
<karolherbst>
ehh.. should be fine
<karolherbst>
there is a bit missing for r600, I just don't have the hardware to test it
<penguin42>
karolherbst: If you need a test run on r600 I can do that for you, this <--- laptop has one
<DottorLeo>
i should have an old 5450 (cedar) to test it :D
<DottorLeo>
@illwieckz probably has all the R600 cards :D
<DottorLeo>
it's impressive
<penguin42>
'AMD Thames [Radeon HD 7550M/7570M/7650M]'
<karolherbst>
amazing.. Intel's CL stack ooms my system
<penguin42>
(Very oddly configured HP Elitebook I found in a 2nd hand shop; nice i7, 8G RAM, Radeon, every interface you can imagine, and a shit 1366x768 display...)
<karolherbst>
hashcat benchmarks in the most silly way though
<karolherbst>
so whatever
<karolherbst>
penguin42: yeah.. so somebody needs to implement the `get_compute_info` hook
* penguin42
tries to get himself past his existing patches first
<karolherbst>
the key to the compute info stuff is really calculating how many threads can be launched
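The launch limits such a hook ultimately has to report are what clients see through the standard work-group queries; a minimal sketch of those queries (plain OpenCL, with `dev` and `kernel` assumed to already exist):

    /* A work-group launch has to respect both the device-wide limit and the
     * per-kernel limit; the per-kernel value is usually lower once register
     * and shared-memory pressure are taken into account. */
    #include <stdio.h>
    #include <CL/cl.h>

    static void print_launch_limits(cl_device_id dev, cl_kernel kernel)
    {
        size_t dev_max_wg = 0;
        clGetDeviceInfo(dev, CL_DEVICE_MAX_WORK_GROUP_SIZE,
                        sizeof(dev_max_wg), &dev_max_wg, NULL);

        size_t kernel_max_wg = 0;
        clGetKernelWorkGroupInfo(kernel, dev, CL_KERNEL_WORK_GROUP_SIZE,
                                 sizeof(kernel_max_wg), &kernel_max_wg, NULL);

        printf("device max work-group size: %zu, this kernel: %zu\n",
               dev_max_wg, kernel_max_wg);
    }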
<penguin42>
karolherbst: Nora is asking for 'Btw, please add the CL_PLATFORM_HOST_TIMER_RESOLUTION and CL_PLATFORM_HOST_TIMER_RESOLUTION device info queries in api/device.rs' -- aren't those platform.rs queries?
<karolherbst>
yeah looks like CL_PLATFORM_HOST_TIMER_RESOLUTION is indeed a platform query
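For reference, CL_PLATFORM_HOST_TIMER_RESOLUTION is an OpenCL 2.1 platform query returning the host timer resolution in nanoseconds; a minimal sketch:

    #include <stdio.h>
    #include <CL/cl.h>

    static void print_host_timer_resolution(cl_platform_id platform)
    {
        cl_ulong res_ns = 0;  /* resolution in nanoseconds */
        if (clGetPlatformInfo(platform, CL_PLATFORM_HOST_TIMER_RESOLUTION,
                              sizeof(res_ns), &res_ns, NULL) == CL_SUCCESS)
            printf("host timer resolution: %llu ns\n",
                   (unsigned long long)res_ns);
    }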
<karolherbst>
jenatali: sooo.. I looked a bit into which LLVM passes actually help: EarlyCSEPass gives roughly a 10% cut in spir-v size, MergeFunctions roughly a 50% cut. A 10% cut is great, so enabling EarlyCSE is probably what we should do. However, MergeFunctions can generate function pointers, and the translator only lets us use those with SPV_INTEL_function_pointers
<karolherbst>
but a 50% reduction is kinda neat... though a lot of LLVM passes just generate random stuff we can't handle, so maybe it's better to rely on spirv-opt here instead :/
<jenatali>
I don't know that I see much point in running spirv-opt, vtn is pretty lightweight I feel like
<karolherbst>
it's more about reducing the size of the spir-v
<karolherbst>
like.. hashcat generates a 7MB spirv by default
<karolherbst>
for one hash function
<karolherbst>
but smaller spirv also means less time spent in clc_parse_spirv, which.. makes up a huge amount of the CPU overhead at that size
<karolherbst>
but smaller spirv also helps with the disk cache and everything
<karolherbst>
and I also suspect compilation gets quicker the earlier we drop massive amounts of code
<karolherbst>
but anyway...
<karolherbst>
would be cool to just be able to use MergeFunctions on the LLVM IR level
<karolherbst>
but... it generates function pointers :(
<karolherbst>
sometimes
<karolherbst>
another problem is that linking spirvs isn't cheap either :/ and even with a single spirv file we kinda have to do it, because... random nonsense
<karolherbst>
mhh GVN seems to also help a lot, nice
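A rough sketch of what running those passes could look like through the LLVM-C new-pass-manager entry point (LLVM 13+); the pipeline string uses the same pass names as `opt -passes=`, and this is only an illustration of the idea, not how rusticl actually wires it up:

    #include <stdio.h>
    #include <llvm-c/Core.h>
    #include <llvm-c/Error.h>
    #include <llvm-c/Transforms/PassBuilder.h>

    /* `mod` is an already-built LLVMModuleRef. early-cse and gvn are the
     * "safe" size cuts; appending ",mergefunc" gives the bigger reduction but
     * may introduce function pointers, which the SPIR-V translator only
     * accepts with SPV_INTEL_function_pointers. */
    static void shrink_module(LLVMModuleRef mod)
    {
        LLVMPassBuilderOptionsRef opts = LLVMCreatePassBuilderOptions();

        LLVMErrorRef err = LLVMRunPasses(mod, "early-cse,gvn", NULL, opts);
        if (err) {
            char *msg = LLVMGetErrorMessage(err);
            fprintf(stderr, "pass pipeline failed: %s\n", msg);
            LLVMDisposeErrorMessage(msg);
        }

        LLVMDisposePassBuilderOptions(opts);
    }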
<DemiMarie>
Is the compilation happening ahead of time or at runtime? If the latter, could LLVM IR be translated directly to NIR, without going through SPIR-V?
<karolherbst>
no
<karolherbst>
the point is to use spirv
<DemiMarie>
Ah
<DemiMarie>
sorry, I was missing some context