#dri-devel on 2022-12-15 — irc logs at oftc.irclog.whitequark.org

2022-08-14 19:45 ChanServ changed the topic of #dri-devel to: <ajax> nothing involved with X should ever be unable to find a bar

00:00 Jeremy_Rand_Talos_ has quit [Ping timeout: 480 seconds]

00:12 <jenatali> Ugh does the Vulkan CTS really override stdout redirection?

00:15 <alyssa> jenatali: just be grateful it's not the CL CTS

00:15 <jenatali> Yeah... different problems

00:15 <jenatali> I just want to be able to debug a NIR shader, but writing to stdout overflows my terminal buffer size

00:16 <jenatali> So I want to dump that to a file... but apparently I can't do that

00:16 <alyssa> ooof

00:16 <alyssa> at least it's not the CL CTS

00:16 <HdkR> change to stderr instead? :D

00:17 <alyssa> "oh yes we are just going to test every single possible input how many could there be?"

00:17 <jenatali> alyssa: Yeah the CL CTS isn't too bad when you use wimpy mode though

00:18 <alyssa> even with wimpy it's pretty bad ;p

00:24 <jenatali> Weird, looks like my debugger was messing with it, nevermind

00:28 ybogdano has joined #dri-devel

00:30 Haaninjo has quit [Quit: Ex-Chat]

00:35 camus has quit [Remote host closed the connection]

00:35 camus has joined #dri-devel

00:44 camus has quit [Remote host closed the connection]

00:45 camus has joined #dri-devel

00:55 camus has quit [Remote host closed the connection]

00:56 camus has joined #dri-devel

00:59 cengiz_io has quit [Quit: Connection closed for inactivity]

01:03 yuq825 has joined #dri-devel

01:04 ahajda_ has quit []

01:12 mattrope has quit [Remote host closed the connection]

01:19 Leopold_ has quit [Remote host closed the connection]

01:20 Leopold_ has joined #dri-devel

01:23 ngcortes has joined #dri-devel

01:32 ybogdano has quit [Ping timeout: 480 seconds]

01:33 jkrzyszt has joined #dri-devel

01:34 co1umbarius has joined #dri-devel

01:36 columbarius has quit [Ping timeout: 480 seconds]

01:42 mattrope has joined #dri-devel

01:52 ngcortes has quit [Ping timeout: 480 seconds]

01:53 ppascher has quit [Ping timeout: 480 seconds]

01:54 jkrzyszt has quit [Ping timeout: 480 seconds]

01:55 Akari has quit [Quit: segmentation fault (core dumped)]

02:18 camus1 has joined #dri-devel

02:22 camus has quit [Ping timeout: 480 seconds]

02:40 persise[m] has joined #dri-devel

02:46 oneforall2 has quit [Remote host closed the connection]

02:47 oneforall2 has joined #dri-devel

02:50 persise[m] has quit [autokilled: This host has violated network policy. Mail support@oftc.net if you feel this is in error. (2022-12-15 02:50:02)]

02:50 Company has quit [Quit: Leaving]

03:06 <Lynne> highly informative post by nvidia engineer on DPB reference indices

03:08 pjakobsson has joined #dri-devel

03:11 pjakobsson_ has quit [Ping timeout: 480 seconds]

03:29 camus1 has quit [Remote host closed the connection]

03:31 camus has joined #dri-devel

03:41 kailanqq[m] has joined #dri-devel

03:42 kailanqq[m] has quit [autokilled: This host has violated network policy. Mail support@oftc.net if you feel this is in error. (2022-12-15 03:42:50)]

03:43 heat has quit [Ping timeout: 480 seconds]

03:57 camus has quit [Remote host closed the connection]

03:59 camus has joined #dri-devel

04:08 <airlied> agd5f: yeah not sure how patching vulkan cmd bufs will go, I suppose I get to find out when I hit encode

04:17 <airlied> Lynne: oh I also was to fix separate dpb for h265 on navi21+

04:27 <airlied> uggh fixing it breaks h264, need to figure out the reference pics there

04:30 <airlied> Lynne: be interesting to see if that helps the speed any, fixes are pushed

04:32 <Lynne> neat, give me a sec

04:35 <Lynne> sadly no, speed is the same as before

04:36 <Lynne> at least it works, no corruption or crashes

04:37 bmodem has joined #dri-devel

04:39 <Lynne> ...just as I wrote that, I tried h264, and got a crash bad enough to have to reboot

04:43 <Lynne> yup, it's consistent, reverting the last commit fixes it

04:53 <airlied> Lynne: dang what video? I had just tested some 264

04:55 <airlied> okay fantastic four trailer killed it for me

04:55 <Lynne> some 4k high bitrate I had

04:56 <Lynne> aww, good thing you didn't try fan4stic, you may have needed a new card after it

05:05 <airlied> Lynne: is this legal?

05:05 <airlied> (gdb) print *frame_info->pSetupReferenceSlot

05:05 <airlied> $6 = {sType = VK_STRUCTURE_TYPE_VIDEO_REFERENCE_SLOT_INFO_KHR, pNext = 0x7fffd804a850, slotIndex = 0, pPictureResource = 0x7fffd8049bf0}

05:05 <airlied> (gdb) print *frame_info->pReferenceSlots

05:05 <airlied> $7 = {sType = VK_STRUCTURE_TYPE_VIDEO_REFERENCE_SLOT_INFO_KHR, pNext = 0x7fffd804abc8, slotIndex = 0, pPictureResource = 0x7fffd8049c40}

05:05 <airlied> for h264

05:09 <Lynne> the slotIndex?

05:09 <airlied> yeah

05:09 <airlied> seems wrong to have them both be 0

05:09 <airlied> though maybe it's fine

05:09 <Lynne> I have a not sure mark on it in my code, because vaapi uses "pic->long_ref ? pic->pic_id : pic->frame_num;" for it

05:09 <Lynne> it didn't change anything last time I tried it

05:09 pjakobsson_ has joined #dri-devel

05:10 <Lynne> output looks identical before and after, just tried it

05:11 <airlied> yeah just trying to see why this separate stuff is getting angry and noticed it

05:12 <airlied> still hangs when I fix it :-P

05:12 pjakobsson has quit [Ping timeout: 480 seconds]

05:13 * airlied backs out the change for now

05:14 <Lynne> I'll make the slotIndex change, just seems more correct as that's what other code uses
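For reference, the vaapi expression Lynne quotes at 05:09 can be sketched as below. This is a hedged Python illustration, not FFmpeg code: `H264Picture` and `vaapi_style_frame_index` are hypothetical names, and only the three fields come from the quoted expression. Whether the result is the right value for `slotIndex` is exactly what the discussion leaves open.

```python
from dataclasses import dataclass


@dataclass
class H264Picture:
    # Hypothetical stand-in holding only the fields named in the quoted expression.
    long_ref: bool
    pic_id: int
    frame_num: int


def vaapi_style_frame_index(pic: H264Picture) -> int:
    # Mirrors "pic->long_ref ? pic->pic_id : pic->frame_num":
    # long-term references are identified by pic_id, short-term ones by frame_num.
    return pic.pic_id if pic.long_ref else pic.frame_num
```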

05:30 <airlied> Lynne: I think we still have some bugs around h264 dpb indexing

05:32 <airlied> Lynne: is there somewhere that tracks the DPB in h264?

05:33 <airlied> I think we should do the same thing for h264 as for 265 using DPB

05:35 <Lynne> I'm not sure, haven't seen anything like that in other hwaccels

05:37 <Lynne> I don't remember any dpb bugs in h264

05:38 <Lynne> but I do believe that's it for me tonight

05:38 <airlied> yeah I'm just seeing this when I try to control the dpb placements manually

05:38 <airlied> or sparsely rather

05:39 <airlied> the slotindex just don't make sense

05:39 <airlied> I'll dig around and see what I can figure out now

05:45 <airlied> Lynne: yeah I think that is correct

05:46 <airlied> https://paste.centos.org/view/dc92ac2b

05:46 <airlied> Lynne: other APIs have never had explicit DPB mgmt

05:47 <airlied> Lynne: pushed the corresponding radv fixes to my branch

05:54 kts has joined #dri-devel

05:55 lemonzest has joined #dri-devel

06:03 YuGiOhJCJ has joined #dri-devel

06:10 bgs has joined #dri-devel

06:16 Duke`` has joined #dri-devel

06:18 itoral has joined #dri-devel

06:24 <airlied> Lynne: https://paste.centos.org/view/e2cfd025 rebased version

06:25 aravind has joined #dri-devel

06:29 camus has quit [Remote host closed the connection]

06:30 camus has joined #dri-devel

06:38 fab has joined #dri-devel

06:38 Duke`` has quit [Ping timeout: 480 seconds]

06:45 yuq825 has quit []

06:51 danvet has joined #dri-devel

06:54 bgs has quit [Remote host closed the connection]

06:55 yuq825 has joined #dri-devel

06:57 YuGiOhJCJ has quit [Quit: YuGiOhJCJ]

07:10 bgs has joined #dri-devel

07:14 dv_ has quit [Quit: WeeChat 3.0]

07:15 dcz_ has joined #dri-devel

07:20 bgs has quit [Remote host closed the connection]

07:21 dv_ has joined #dri-devel

07:25 dv_ has quit []

07:25 dv_ has joined #dri-devel

07:26 fab has quit [Quit: fab]

07:30 alanc has quit [Remote host closed the connection]

07:30 alanc has joined #dri-devel

07:31 frieder has joined #dri-devel

07:47 maxzor has quit [Ping timeout: 480 seconds]

07:49 rasterman has joined #dri-devel

07:52 <mripard> airlied: danvet : There's nothing in drm-misc-next-fixes, so there won't be a PR

07:55 tobiasjakobi has joined #dri-devel

07:56 mvlad has joined #dri-devel

07:58 tobiasjakobi has quit []

07:58 <airlied> mripard: cool!

08:00 off^ has quit [Remote host closed the connection]

08:01 ppascher has joined #dri-devel

08:09 fab has joined #dri-devel

08:13 jkrzyszt has joined #dri-devel

08:17 lanodan has quit [Ping timeout: 480 seconds]

08:17 camus has quit [Remote host closed the connection]

08:18 * airlied has paged back in enough of intel avc decode to realise I've no idea how it works

08:18 camus has joined #dri-devel

08:22 tursulin has joined #dri-devel

08:33 lanodan has joined #dri-devel

08:36 tzimmermann has joined #dri-devel

08:47 camus has quit [Remote host closed the connection]

08:47 camus has joined #dri-devel

08:50 <ccr> via eldritch dark magic?

08:52 sarahwalker has joined #dri-devel

08:54 Akari has joined #dri-devel

08:54 swalker_ has joined #dri-devel

08:55 swalker_ is now known as Guest2216

08:57 lynxeye has joined #dri-devel

08:58 vliaskov has joined #dri-devel

09:00 sarahwalker has quit [Ping timeout: 480 seconds]

09:07 nchery has quit [Remote host closed the connection]

09:07 nchery has joined #dri-devel

09:11 ahajda_ has joined #dri-devel

09:19 yuq825 has quit []

09:21 jagan_ has joined #dri-devel

09:23 camus has quit [Remote host closed the connection]

09:24 camus has joined #dri-devel

09:27 yuq825 has joined #dri-devel

09:38 frieder has quit [Quit: Leaving]

09:38 frieder has joined #dri-devel

09:40 Leopold_ has quit [Remote host closed the connection]

09:41 Leopold_ has joined #dri-devel

09:47 camus1 has joined #dri-devel

09:49 camus has quit [Read error: No route to host]

09:52 jkrzyszt has quit [Remote host closed the connection]

09:56 tursulin has quit [Quit: Konversation terminated!]

09:58 tursulin has joined #dri-devel

10:13 MajorBiscuit has joined #dri-devel

10:14 itoral has quit []

10:21 fab has quit [Quit: fab]

10:21 fab has joined #dri-devel

10:26 jkrzyszt has joined #dri-devel

10:33 Akari has quit [Quit: segmentation fault (core dumped)]

10:34 kts has quit [Read error: Connection reset by peer]

10:41 kts has joined #dri-devel

10:43 yuq825 has quit []

10:47 pjakobsson has joined #dri-devel

10:50 pjakobsson_ has quit [Ping timeout: 480 seconds]

10:52 kts has quit [Quit: Leaving]

10:53 warpme_____ has joined #dri-devel

10:54 pcercuei has joined #dri-devel

11:00 MajorBiscuit has quit [Quit: WeeChat 3.6]

11:01 MajorBiscuit has joined #dri-devel

11:11 devarsht[m] has joined #dri-devel

11:12 aradhya7[m] has joined #dri-devel

11:23 LordKalma has quit [Quit: Server has probably crashed]

11:24 LordKalma has joined #dri-devel

11:24 lanodan has quit [Quit: WeeChat 3.6]

11:29 JohnnyonFlame has joined #dri-devel

11:31 Haaninjo has joined #dri-devel

11:40 srslypascal is now known as Guest2231

11:40 srslypascal has joined #dri-devel

11:40 lanodan has joined #dri-devel

11:44 Guest2231 has quit [Ping timeout: 480 seconds]

11:47 Leopold_ has quit [Remote host closed the connection]

11:47 Leopold_ has joined #dri-devel

11:50 srslypascal has quit [Ping timeout: 480 seconds]

11:50 xantoz has quit [Remote host closed the connection]

11:51 zzag_ has quit []

11:52 Company has joined #dri-devel

11:52 zzag has joined #dri-devel

11:52 xantoz has joined #dri-devel

11:53 srslypascal has joined #dri-devel

11:55 kts has joined #dri-devel

12:00 djbw has quit [Read error: Connection reset by peer]

12:27 f11f12 has joined #dri-devel

12:33 maxzor has joined #dri-devel

12:39 egbert has quit [Remote host closed the connection]

12:43 yuq825 has joined #dri-devel

12:44 lynxeye has quit [Ping timeout: 480 seconds]

12:49 <graphitemaster> Looking forward to bringing down Linux distributions one stray long running compute shader at a time loaded directly in your browser without you even being aware of it, activated all on the same day, served from a CDN silently, stealthily, the day every Linux freezes. https://twitter.com/Tojiro/status/1603087438150217728

12:49 <graphitemaster> Sorry, I'm writing a horror novel, let me know if you like my premise.

12:52 <ishitatsuyuki> wasn't that basically the same with webgl anyway

12:56 agd5f has quit [Remote host closed the connection]

12:59 cengiz_io has joined #dri-devel

13:05 lynxeye has joined #dri-devel

13:10 <zamundaaa[m]> graphitemaster: Wouldn't a shader that runs long enough to freeze other GPU tasks just trigger a GPU reset?

13:11 <FLHerne> reliability of GPU resets for most Linux kernel drivers is quite poor

13:12 bgs has joined #dri-devel

13:14 devilhorns has joined #dri-devel

13:20 maxzor has quit [Remote host closed the connection]

13:20 maxzor has joined #dri-devel

13:23 alarumbe has joined #dri-devel

13:37 maxzor has quit [Ping timeout: 480 seconds]

13:51 kts has quit [Quit: Leaving]

13:53 navi has joined #dri-devel

13:54 agd5f has joined #dri-devel

14:05 yuq825 has left #dri-devel [#dri-devel]

14:10 <ishitatsuyuki> it's just amdgpu that is bad lol

14:10 <ishitatsuyuki> friendly reminder that GL CTS has a test that contains infinite loop to test resets

14:15 tzimmermann has quit [Quit: Leaving]

14:15 tzimmermann has joined #dri-devel

14:24 kts has joined #dri-devel

14:24 jagan_ has quit [Remote host closed the connection]

14:26 mbrost has joined #dri-devel

14:28 camus1 has quit [Remote host closed the connection]

14:29 camus has joined #dri-devel

14:31 heat has joined #dri-devel

14:32 <FLHerne> It's not just amdgpu; nouveau is pretty hopeless and intel used to be unreliable but it's less bad now

14:32 OftenTimeConsuming has quit [Quit: OftenTimeConsuming]

14:35 <DemiMarie> Does Intel support preemption?

14:36 OftenTimeConsuming has joined #dri-devel

14:38 <MrCooper> yes

14:40 <MrCooper> mutter 44 will take full advantage of this (as Wayland compositor), it can run at full frame rate even while there are GPU-limited clients at much lower frame rate

14:49 fxkamd has joined #dri-devel

14:50 Leopold_ has quit [Ping timeout: 480 seconds]

14:57 JohnnyonFlame has quit [Read error: Connection reset by peer]

14:58 <zamundaaa[m]> ishitatsuyuki: amdgpu gpu resets work mostly fine here

14:59 <zamundaaa[m]> the bigger problem is userspace...

15:15 MajorBiscuit has quit [Read error: Connection reset by peer]

15:19 jkrzyszt has quit [Remote host closed the connection]

15:19 jkrzyszt has joined #dri-devel

15:27 The_ASV has joined #dri-devel

15:28 The_ASV has quit [Remote host closed the connection]

15:28 The_ASV has joined #dri-devel

15:30 The_ASV has quit []

15:30 kts has quit [Quit: Leaving]

15:32 The_ASV has joined #dri-devel

15:35 fab has quit [Quit: fab]

15:35 The_ASV has quit [Read error: Connection reset by peer]

15:37 The_ASV has joined #dri-devel

15:41 Testing has joined #dri-devel

15:42 Testing has quit []

15:42 The_ASV has quit [Remote host closed the connection]

15:44 jkrzyszt has quit [Remote host closed the connection]

15:44 The_ASV has joined #dri-devel

15:45 Akari has joined #dri-devel

15:45 f11f12 has quit [Quit: Leaving]

15:49 <ishitatsuyuki> that too

15:55 <Ristovski> I patiently await the day some new webgpu exploit with a witty name comes out and causes all drivers to implement mitigations that butcher performance

15:55 <ishitatsuyuki> I don't know why you need to shit on the API so hard

15:55 <ishitatsuyuki> it's fine and it has some legitimate use cases

15:56 <Ristovski> it was mainly a joke :P I am actually pretty excited for webgpu myself

15:56 jkrzyszt has joined #dri-devel

15:56 <ishitatsuyuki> ah ok ;)

16:01 <Ristovski> Hopefully then Firefox will improve the efficiency of their GFX stack on Linux, chrome still beats it (esp. on older hardware)

16:02 <Ristovski> the webgl perf difference between firefox and chrome on my old Haswell is day and night

16:06 <ishitatsuyuki> did that change with dmabuf stuff or the disparity is still there

16:08 <Ristovski> dmabuf helped a bit (especially in vaapi video decode for example), but in general the difference is still there

16:10 <ishitatsuyuki> ok

16:13 <MrCooper> Ristovski: out of curiosity, what site(s) do you use for webgl perf testing?

16:15 kts has joined #dri-devel

16:16 <Ristovski> MrCooper: iirc even something like https://webglsamples.org/aquarium/aquarium.html (obviously selecting a higher num of fish makes the effect more apparent). Firefox also has higher CPU usage when playing a VAAPI decoded video for example

16:18 <Ristovski> given that machine is now my "spare", I could maybe try gathering some perf data hmmm

16:19 JohnnyonFlame has joined #dri-devel

16:19 <Ristovski> speaking of, can one hook up a spare machine as extra CI for mesa? I apparently have very cursed HW :)

16:20 lynxeye1 has joined #dri-devel

16:21 fab has joined #dri-devel

16:26 <MrCooper> Ristovski: indeed, Firefox runs at ~40 fps with 5000 fish, chromium with 15000

16:26 lynxeye has quit [Ping timeout: 480 seconds]

16:29 djbw has joined #dri-devel

16:30 <Ristovski> checks out

16:31 junaid has joined #dri-devel

16:32 Duke`` has joined #dri-devel

16:41 jkrzyszt has quit [Remote host closed the connection]

16:41 jkrzyszt has joined #dri-devel

16:43 * pinchartl is scared to open the aquarium

16:44 <pinchartl> opening the shadertoys website (in Firefox or Chrome) locks up my i915 GPU after a few seconds

16:44 <MrCooper> not really the same thing :)

16:44 <pinchartl> and the deadlock detection and recovery doesn't work, I have to hard reboot

16:44 frieder has quit [Remote host closed the connection]

16:45 <MrCooper> shadertoys is nasty by design, "normal" webgl sites are generally safe

16:45 alyssa has quit [Quit: leaving]

16:46 <Ristovski> opening shadertoy is always fun, I can "hear" the shaders thanks to shitty filtering on my (old) mobo

16:46 kts has quit [Quit: Leaving]

16:46 <pinchartl> MrCooper: if it's that easy to lock up i915 from any untrusted website, there's a problem though

16:47 <Ristovski> (yes, vblank_mode=0 glxgears does make it _scream_)

16:49 <Ristovski> pinchartl: Hmm, might be worth filing a bug report then

16:49 mvlad has quit [Ping timeout: 480 seconds]

16:49 LordKalma has quit [Quit: Server has probably crashed]

16:49 <MrCooper> pinchartl: welcome to the wonderful world of GPUs

16:50 LordKalma has joined #dri-devel

16:51 OftenTimeConsuming has quit [Remote host closed the connection]

16:52 OftenTimeConsuming has joined #dri-devel

16:54 vliaskov has quit [Remote host closed the connection]

16:56 navi has quit [Ping timeout: 480 seconds]

16:56 rasterman has quit [Quit: Gettin' stinky!]

16:58 LordKalma has quit [Quit: Server has probably crashed]

16:58 LordKalma has joined #dri-devel

16:58 <pinchartl> MrCooper: I used to dream of working on GPUs when I was a student, I'm now relieved I didn't succeed :-)

17:00 <MrCooper> FWIW, I'm on the second page of shadertoy.com Browse, no hang yet with an AMD GPU

17:00 jkrzyszt has quit [Remote host closed the connection]

17:01 <MrCooper> some amazing stuff on there

17:04 <javierm> I'm curious but also have a i915 so I'm afraid to have a hard lock up as pinchartl mentioned :)

17:07 devilhorns has quit []

17:07 Guest2216 has quit [Remote host closed the connection]

17:08 <Ristovski> I have a Haswell running i915 as well, and it works fine for me

17:09 mvlad has joined #dri-devel

17:20 jkrzyszt has joined #dri-devel

17:24 fxkamd has quit []

17:24 fxkamd has joined #dri-devel

17:25 pcercuei has quit [Ping timeout: 480 seconds]

17:26 tursulin has quit [Ping timeout: 480 seconds]

17:28 bmodem has quit [Ping timeout: 480 seconds]

17:37 rgallaispou has quit [Ping timeout: 480 seconds]

17:46 jhli_ has quit [Remote host closed the connection]

17:52 JohnnyonFlame has quit [Ping timeout: 480 seconds]

17:52 The_ASV has quit [Read error: Connection reset by peer]

18:05 pcercuei has joined #dri-devel

18:05 heat has quit [Remote host closed the connection]

18:06 heat has joined #dri-devel

18:07 Akari has quit [Quit: segmentation fault (core dumped)]

18:11 camus has quit [Remote host closed the connection]

18:11 camus has joined #dri-devel

18:12 maxzor has joined #dri-devel

18:13 aravind has quit [Ping timeout: 480 seconds]

18:18 ybogdano has joined #dri-devel

18:19 junaid has quit [Remote host closed the connection]

18:34 The_ASV has joined #dri-devel

18:35 camus has quit [Remote host closed the connection]

18:37 camus has joined #dri-devel

18:38 <Lynne> airlied: thanks, merged

18:38 <Lynne> also fixed the last todo in h264, scaling_list_present_mask

18:39 <Lynne> mesa doesn't use it, and doesn't fix nvidia, but it's now complete

18:40 The_ASV has quit [Read error: Connection reset by peer]

18:40 bnieuwenhuizen has quit [Quit: Bye]

18:40 bnieuwenhuizen has joined #dri-devel

18:41 <Lynne> also tried host mapping the slices buffer to avoid uploading, but didn't make a difference to speed

18:46 LordKalma has quit [Quit: Server has probably crashed]

18:46 LordKalma has joined #dri-devel

18:47 <Lynne> (also, should've linked to my blog post in yours so folks know how to run the code, but w/e, too late for that now)

18:48 LordKalma has quit []

18:49 LordKalma has joined #dri-devel

19:00 lemonzest has quit [Quit: WeeChat 3.6]

19:09 <DemiMarie> MrCooper: if https://shadertoy.com (or any other website, for that matter) can lock a user’s machine and killing the browser does not fix the problem, this is a security vulnerability in the GPU driver.

19:16 ahajda_ has quit []

19:22 <airlied> Lynne: my current blogpost is kinda a placeholder

19:24 <airlied> the other blogpost needed a place to point to

19:24 <airlied> ill rewrite it next week

19:30 The_ASV has joined #dri-devel

19:44 <Lynne> yeah, I can get away with publishing bad blogs because spreading the news or not won't change the fact no one reads them

19:47 <airlied> Lynne: except people unhappy about the benchmark figures :-P

19:48 <airlied> Lynne: why do you create sps/pps lists for every frame? do they really change that often?

19:49 <airlied> the idea behind the vulkan api is that you should just create those as they come out of the stream, not on every frame

19:49 <psykose> i was going to say the other day you should be careful with the numbers since they're going to instantly become a phoronix headline :p

19:50 <airlied> yeah I'd back off on them a bit until we actually spend time on perf, so far getting things correct has been more important

19:51 <mlankhorst> My disk space is now 99% utilized!

19:51 <airlied> but also I suspect nvidia beta drivers might come with requests to not benchmark things :-P

19:57 <Ristovski> mlankhorst: Remember, free space is wasted space!

20:00 <graphitemaster> re: the webgpu stuff, I think pushing it will help Linux along in getting resources put towards solving the whole userspace graphics preemption thing

20:09 tzimmermann has quit [Quit: Leaving]

20:13 <Lynne> airlied: https://github.com/KhronosGroup/Vulkan-Docs/issues/1694#issuecomment-1353653150

20:13 <Lynne> of course my voice falls on deaf ears

20:14 <Lynne> I have complete respect for someone's opinion, who gave up on testing my code within 2 minutes and didn't bother to investigate or attempt again

20:18 <Lynne> as for why I create SPS/PPS/headers on every frame - *state*

20:19 <Lynne> managing state in a decoder is the hardest part

20:19 <Lynne> you want to give every single frame the maximum chance of being decoded

20:20 <airlied> I don't see why that should matter though, you are reading a stream, every sps/pps will be processed in order won't it?

20:20 <Lynne> packets get lost, get corrupted, rotational velocidensity takes its toll on 10 year old mp3

20:20 <airlied> if there's a missing sps/pps then it'll screw up whether you handed it to the decoder sw once or every frame

20:21 <Lynne> the decoder internally does magic to compensate

20:21 <Lynne> we don't want another layer of magic on top

20:21 <Lynne> besides, it's really not that much data, most of it is a memcpy

20:22 <Lynne> couple of kilobytes, and the allocation is pooled just in case we get crafted input

20:22 <Lynne> I looked at whether it's a bottleneck but it's too insignificant to detect

20:23 The_ASV has quit []

20:24 <Lynne> by the decoder, I meant the ffmpeg parser really, there's a good reason why we call hwaccels hwaccels rather than decoders

20:24 <airlied> Lynne: like if sps/pps was done optimally I could optimise it a lot more in the driver

20:24 <airlied> I've just been lazy so far

20:25 <Lynne> (when cuda had its own decoder, complete with a parser, we had a separate h264_cuda decoder, and it was bad enough that no one really considered using it)

20:28 <Lynne> done optimally where?

20:28 lynxeye1 has quit []

20:29 <DemiMarie> Lynne: I really wish openH264 had HW accel support ☹️

20:30 <Lynne> the last original h264 patent should expire next year, I think?

20:31 <airlied> Lynne: by the decoder

20:31 <Lynne> well, how would you suggest that's done? like I said, it's not much overhead for us to set it?

20:31 <airlied> it's still memory allocation overhead for the backend

20:32 <Lynne> can't you pool it like we do?

20:32 <airlied> it creates vulkan objects

20:32 <airlied> those are usually just malloced

20:33 <airlied> the idea behind the API is you get the sps/pps sets you need and batch create them if you can

20:33 <airlied> then have the decode video call just reference the entries in the set

20:34 <Lynne> is that what the session template field is for when creating it?

20:34 <Lynne> other than that, you'd have to compare pointers the user gives you, which is ghetto

20:36 <Ristovski> Is there some other interface one can use instead of /sys/kernel/debug/ttm/page_pool_shrink?

20:39 * airlied isn't 100% sure how the templating bit is expected to work

20:39 <airlied> I think it's just a start from this point with a new set of parms

20:41 <airlied> but the idea is you should create a sized set, and update them as new ones come in

20:41 <airlied> not generate one every frame

20:42 <Lynne> how would the driver know they haven't been updated?

20:44 <airlied> the driver shouldn't have to care, the decoder would tell it when it gets updated ones

20:46 <airlied> like if seq_parameter_set_id is needed for a frame, make sure you've given it to the driver before that

20:46 <airlied> but if the video just uses seq_parameter_set_id == 0 then just give it once
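The "give it once, update only when it changes" approach airlied describes can be sketched roughly as follows. This is a minimal Python illustration under assumptions: `ParameterSetTracker` and `submit` are hypothetical names, parameter sets are treated as opaque byte strings keyed by their id, and no Vulkan call is made. Where `submit` returns True, a real decoder would hand the set to the driver through its video session parameters object.

```python
class ParameterSetTracker:
    """Remembers which parameter sets (keyed by id) were already handed over."""

    def __init__(self):
        # Maps a parameter set id to the raw payload last handed to the driver.
        self._known = {}

    def submit(self, set_id: int, payload: bytes) -> bool:
        """Return True if the caller should hand this set to the driver.

        A set is handed over when its id is new or its contents differ
        from what was last recorded for that id; otherwise nothing is done.
        """
        if self._known.get(set_id) == payload:
            return False
        self._known[set_id] = payload
        return True
```

With a stream that only ever uses `seq_parameter_set_id == 0` and never changes it, `submit` returns True for the first frame and False for every later one, so the per-frame cost is reduced to a comparison.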

20:47 <airlied> Lynne: I wonder how much overheads there are just in driver init

20:59 <Lynne> in mesa, I didn't see much at all in perf, it was mostly in the kernel

21:03 <Lynne> still how would the decoder tell the driver the sets have been updated?

21:03 <Lynne> the reset flag?

21:04 <Lynne> OH, videoSessionParameters?

21:05 <Lynne> so video session == SPS/PPS/VPS data?

21:05 <airlied> videosessionparameters yes

21:06 macromorgan_ has joined #dri-devel

21:06 <Lynne> I thought it was decoding context, not a compiled list of headers

21:06 macromorgan is now known as Guest2265

21:06 Guest2265 has quit [Read error: Connection reset by peer]

21:06 macromorgan_ is now known as macromorgan

21:06 flibit has joined #dri-devel

21:10 <airlied> Lynne: in theory the driver can prebake part of the hw packets from those, at the moment I don't yet

21:10 <airlied> which means submitting individual decode video cmds should be faster

21:12 <Lynne> I can do it, I guess, though it'll be a per-frame memcmp of ~400kib on average

21:12 flibitijibibo has quit [Ping timeout: 480 seconds]

21:13 <Lynne> because due to frame threading, non-consecutive frames can be submitted out of order for decoders

21:13 <airlied> Lynne: do you not have the sps id from the stream? or is this part of the decoder working around broken things?

21:13 <Lynne> I'm not sure if it's trustworthy

21:14 <airlied> what do you do if a frame references an sps you haven't seen?

21:14 <Lynne> it's handled by the decoder

21:15 <airlied> though I don't reckon that is going to save a massive amount of clock time, it's just one place I know there is some

21:16 dcz_ has quit [Ping timeout: 480 seconds]

21:16 alyssa has joined #dri-devel

21:16 <alyssa> robclark: Any reason not to use BLIT for TEXTURE_TRANSFER_MODES on freedreno?

21:16 <airlied> Lynne: what's the equiv cmd line to time vaapi btw?

21:17 <alyssa> The gallium docs say to use BLIT for dGPUs and default for swrasts and don't give any guidance for iGPUs, and the drivers in tree all do different things

21:17 <airlied> just remove the init_hw_device and change others to vaapi?

21:18 <alyssa> otoh ccce9409470 ("v3d: Disable PIPE_CAP_BLIT_BASED_TEXTURE_TRANSFER.")

21:18 <alyssa> i guess it's going to depend entirely on whether you can be smart about the cache

21:21 JohnnyonFlame has joined #dri-devel

21:21 <Lynne> airlied: yup, "./ffmpeg_g -init_hw_device "vaapi=vk:/dev/dri/renderD128" -hwaccel vaapi -hwaccel_output_format vaapi -i TEST -an -sn -"

21:22 <Lynne> you can omit the init_hw_device (the vk is just a label)

21:22 <Lynne> -an -sn prevent decoding of audio and subtitles

21:23 <Lynne> err, there should be an -f null - at the end there to use the null muxer which just frees packets
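Putting Lynne's command and the `-f null -` correction together, a timing helper might look like the sketch below. It is a hedged Python illustration: the function names are hypothetical, the flags are only the ones quoted in the log, and `./ffmpeg_g` and the render node path are simply the values from that command line, which may differ on other machines.

```python
import subprocess
import time


def vaapi_decode_benchmark_argv(input_path, ffmpeg="./ffmpeg_g",
                                render_node="/dev/dri/renderD128"):
    # Builds the argument list from the flags quoted in the log.
    return [
        ffmpeg,
        "-init_hw_device", f"vaapi=vk:{render_node}",  # "vk" is just a label
        "-hwaccel", "vaapi",
        "-hwaccel_output_format", "vaapi",
        "-i", input_path,
        "-an", "-sn",          # skip audio and subtitle decoding
        "-f", "null", "-",     # null muxer: decoded output is discarded
    ]


def time_decode(input_path):
    # Runs the command once and returns the wall-clock duration in seconds.
    start = time.perf_counter()
    subprocess.run(vaapi_decode_benchmark_argv(input_path), check=True)
    return time.perf_counter() - start
```

As Lynne notes, the `-init_hw_device` pair can be omitted; it is kept here so the list matches the quoted command.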

21:23 <robclark> alyssa: we'll end up doing a blit internally to tiled and/or ubwc in many cases (depends on format/target/etc).. so using BLIT probably ends up doing two blits. or at the very least something not better than the current setup

21:23 ahajda_ has joined #dri-devel

21:24 <alyssa> robclark: hum, okay... it's not clear to me that should be the case though (though it may be)

21:24 <alyssa> without blit-based transfer, mesa will do BGRA->RGBA conversion on the CPU which isn't great

21:24 <alyssa> (versus a straight memcpy)

21:24 * airlied will see if I can hunt perf issue once I get kids sorted

21:25 <alyssa> AFAICT:

21:25 <alyssa> blit-based: 1 GPU blit + 1 memcpy from STAGING (i.e. write-back)

21:25 <robclark> I guess that at least hasn't shown up in the things I've profiled

21:26 <alyssa> transfer-based: 1 GPU blit (internal blit for ubwc -> linear) + 1 BGRA->RGBA conversion on the CPU

21:26 <robclark> but game startup isn't a thing I've looked at a whole lot

21:26 <alyssa> this is from profiling screenshots on wayland

21:26 <alyssa> which can spend a whole lot of time in glReadPixels if you're unlucky

21:26 <robclark> oh, for the readback?

21:26 <alyssa> ye

21:27 <alyssa> I guess both paths are doing 2 blits effectively

21:27 <alyssa> but at least you get the format conversion for free along one of them

21:28 <alyssa> the case where blit-based loses is if the framebuffer is linear in memory

21:28 <alyssa> blit-based is 1 blit to format convert on the GPU and 1 memcpy after

21:29 <alyssa> transfer-based is converting into the dest buffer in 1 pass

21:29 <alyssa> (on the CPU)

21:29 <alyssa> but if the framebuffer is write-combine, that might actually be slower.

21:32 <robclark> I think w/ blit based you'd get two blits, but from a quick look that could probably be solved if mesa/st used PIPE_BIND_LINEAR for the staging buffer.. but either way we are using cached coherent for the staging buffer which probably makes the format conversion less painful

21:34 <alyssa> right, I see

21:35 <alyssa> for a ubwc framebuffer, I see why 1 blit + 1 format conversion on the CPU as you get from transfer-based would win, yeah

21:36 <alyssa> or at least be no worse than the 1 blit + 1 memcpy on the CPU you'd get from blit-based + the BIND_LINEAR fix

21:36 <alyssa> (although maybe usage==STAGING should imply LINEAR, because STAGING is defined by gallium to be optimized for CPU access)
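The two CPU-side steps alyssa compares (a straight memcpy after a GPU blit versus a BGRA->RGBA conversion on the CPU) can be illustrated as below. This is a minimal Python sketch with hypothetical function names, operating on a tightly packed 4-bytes-per-pixel buffer; it shows only what each step does to the bytes and says nothing about relative speed, cache behaviour, or what Mesa actually executes.

```python
def straight_copy(pixels: bytes) -> bytes:
    # Blit-based path, CPU side: the GPU already converted the format,
    # so the CPU only copies the staging data out unchanged.
    return bytes(pixels)


def bgra_to_rgba(pixels: bytes) -> bytes:
    # Transfer-based path, CPU side: swap the B and R channels of every pixel.
    if len(pixels) % 4 != 0:
        raise ValueError("expected 4 bytes per pixel")
    out = bytearray(pixels)
    out[0::4] = pixels[2::4]  # destination R comes from source byte 2
    out[2::4] = pixels[0::4]  # destination B comes from source byte 0
    return bytes(out)         # G (byte 1) and A (byte 3) are left as copied
```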

21:38 bgs has quit [Remote host closed the connection]

22:10 Duke`` has quit [Ping timeout: 480 seconds]

22:12 alyssa has left #dri-devel [#dri-devel]

22:21 mvlad has quit [Remote host closed the connection]

22:22 camus1 has joined #dri-devel

22:24 <Lynne> I gave calling vkCreateVideoSessionParametersKHR only if sps/pps/vps updated a go, but it didn't improve performance on nvidia

22:24 <Lynne> so I guess they haven't done that optimization either

22:25 ybogdano3 has joined #dri-devel

22:25 quantum5_ has joined #dri-devel

22:25 fxkamd1 has joined #dri-devel

22:26 tchar_ has joined #dri-devel

22:26 narmstrong_ has joined #dri-devel

22:26 camus has quit [Ping timeout: 480 seconds]

22:26 hfink_ has joined #dri-devel

22:26 cengiz_io_ has joined #dri-devel

22:26 SanchayanMaity_ has joined #dri-devel

22:26 danvet has quit [Ping timeout: 480 seconds]

22:27 flto_ has joined #dri-devel

22:27 sarnex_ has joined #dri-devel

22:27 zf_ has joined #dri-devel

22:28 cengiz_io has quit [Ping timeout: 480 seconds]

22:28 SanchayanMaity has quit [Ping timeout: 480 seconds]

22:28 sarnex has quit [Read error: Connection reset by peer]

22:28 Mangix has quit [Read error: Connection reset by peer]

22:28 Mangix has joined #dri-devel

22:28 cengiz_io_ is now known as cengiz_io

22:28 quantum5 has quit [Ping timeout: 480 seconds]

22:28 hfink has quit [Ping timeout: 480 seconds]

22:28 dviola has quit [Remote host closed the connection]

22:28 agd5f has quit [Read error: Connection reset by peer]

22:28 mslusarz has quit [Remote host closed the connection]

22:28 mslusarz has joined #dri-devel

22:29 dviola has joined #dri-devel

22:29 djbw_ has joined #dri-devel

22:29 djbw has quit [Read error: Connection reset by peer]

22:29 fxkamd has quit [resistance.oftc.net larich.oftc.net]

22:29 mbrost has quit [resistance.oftc.net larich.oftc.net]

22:29 warpme_____ has quit [resistance.oftc.net larich.oftc.net]

22:29 alanc has quit [resistance.oftc.net larich.oftc.net]

22:29 tchar has quit [resistance.oftc.net larich.oftc.net]

22:29 zehortigoza has quit [resistance.oftc.net larich.oftc.net]

22:29 zf has quit [resistance.oftc.net larich.oftc.net]

22:29 flto has quit [resistance.oftc.net larich.oftc.net]

22:29 narmstrong has quit [resistance.oftc.net larich.oftc.net]

22:29 sumits has quit [resistance.oftc.net larich.oftc.net]

22:29 soreau has quit [resistance.oftc.net larich.oftc.net]

22:29 ybogdano has quit [resistance.oftc.net larich.oftc.net]

22:29 Mangix has quit [resistance.oftc.net larich.oftc.net]

22:30 agd5f has joined #dri-devel

22:30 Mangix has joined #dri-devel

22:30 mbrost has joined #dri-devel

22:30 fxkamd has joined #dri-devel

22:30 alanc has joined #dri-devel

22:30 zehortigoza has joined #dri-devel

22:30 warpme_____ has joined #dri-devel

22:30 flto has joined #dri-devel

22:30 zf has joined #dri-devel

22:30 sumits has joined #dri-devel

22:30 soreau has joined #dri-devel

22:30 sumits has quit [Ping timeout: 482 seconds]

22:31 zf has quit [Ping timeout: 482 seconds]

22:31 agd5f has quit [Remote host closed the connection]

22:31 fxkamd has quit [Ping timeout: 482 seconds]

22:31 mbrost has quit [Ping timeout: 482 seconds]

22:31 flto has quit [Ping timeout: 482 seconds]

22:31 agd5f has joined #dri-devel

22:32 flto_ has quit []

22:32 flto has joined #dri-devel

22:34 sumits has joined #dri-devel

22:37 pcercuei has quit [Quit: dodo]

22:51 <airlied> Lynne: yeah I'm going to guess it's something sync related, so we are blocked rather than burning CPU

22:51 flibit has quit []

22:52 flibitijibibo has joined #dri-devel

22:52 <airlied> Lynne: how hard would it be to pool the slice data buffer memory?

22:52 <airlied> Lynne: so you'd do one larger memory allocation and do smaller updates to it?

22:53 <airlied> you also shouldn't destroy the slice buffer until semaphores have signalled

22:58 <airlied> ah you don't, just reading the code now :)

22:58 ahajda_ has quit []

23:05 Mangix has quit []

23:05 Mangix has joined #dri-devel

23:07 JohnnyonFlame has quit [Read error: Connection reset by peer]

23:13 <Lynne> yeah, I forgot to write that, updated it

23:19 <Lynne> as for locking, hmm, I remember the kernel doesn't have a native representation of timeline semaphores

23:19 <Lynne> and converts them to regular binary semaphores internally

23:20 <Lynne> it has to wait for #semaphores == #refs+2

23:28 <Lynne> the ref semaphores are pretty much guaranteed to be signalled by the time they have to be waited on

23:29 <Lynne> as there's only a single decode queue index

23:29 <Lynne> but we still have to wait on them

23:30 <Lynne> maybe explicit (read: actual) synchronization has its price?

23:31 <airlied> yeah I don't see any stalls around waiting there

23:31 fab has quit [Quit: fab]

23:33 <airlied> Lynne: ff_vk_free_buf does a full device idle

23:33 <airlied> that doesn't seem optimal

23:35 <airlied> not sure removing it helps, but it's definitely not a good idea

23:39 <Lynne> good point, done
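
One way to honour "don't destroy the slice buffer until semaphores have signalled" without idling the whole device is a deferred-free queue keyed on a timeline value. The sketch below models only the bookkeeping: `FreeQueue`, `free_queue_push`, `free_queue_collect`, and the integer `buffer_id` are hypothetical stand-ins, and `completed_value` is assumed to come from something like `vkGetSemaphoreCounterValue` on the decode timeline semaphore.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define PENDING_MAX 16  /* hypothetical queue depth */

typedef struct PendingFree {
    int buffer_id;        /* stand-in for a real buffer handle */
    uint64_t done_value;  /* timeline value signalled after the last use */
} PendingFree;

typedef struct FreeQueue {
    PendingFree items[PENDING_MAX];
    size_t count;
} FreeQueue;

/* Queue a buffer for later release. Returns 0 on success, -1 if full. */
static int free_queue_push(FreeQueue *q, int buffer_id, uint64_t done_value)
{
    assert(q != NULL);
    if (q->count == PENDING_MAX)
        return -1;
    q->items[q->count++] = (PendingFree){ buffer_id, done_value };
    return 0;
}

/* Release every queued buffer whose last use has completed and keep the
 * rest in order. Returns the number of buffers released. */
static size_t free_queue_collect(FreeQueue *q, uint64_t completed_value,
                                 void (*release)(int buffer_id))
{
    size_t kept = 0, released = 0;

    assert(q != NULL);
    for (size_t i = 0; i < q->count; i++) {
        if (q->items[i].done_value <= completed_value) {
            if (release)
                release(q->items[i].buffer_id);
            released++;
        } else {
            q->items[kept++] = q->items[i];
        }
    }
    q->count = kept;
    return released;
}
```

Calling `free_queue_collect` once per frame with the current counter value never blocks, so the free path no longer serialises against unrelated work on the device.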

23:40 heat_ has joined #dri-devel

23:40 <DavidHeidelberg[m]> added possibility to drop --rev for the ci_run_n_monitor.py into MR fixing unicode parsing: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20271 .

23:40 <DavidHeidelberg[m]> Any review welcome.

23:40 <Lynne> buffer uploading, then, maybe? it has to be done right before decoding, maybe there's a stall there

23:41 <airlied> Lynne: the allocate/free memory paths are definitely going to be slow

23:41 <airlied> vulkan really expects those to be managed and pooled in the app

23:44 heat has quit [Ping timeout: 480 seconds]

23:44 <Lynne> fair enough, menial, but I can quickly write something up
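
The pooling airlied describes (one larger allocation, smaller updates into it) can be sketched as a bump sub-allocator that hands out offsets into a single backing allocation. Only the offset arithmetic is shown: `SlicePool`, `slice_pool_alloc`, and `slice_pool_reset` are hypothetical names, the backing memory is assumed to be allocated and bound once up front, and the reset is assumed to happen only after every decode that reads from the pool has completed.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Returned when the pool cannot satisfy a request. */
#define SLICE_POOL_FAIL UINT64_MAX

/* Hypothetical pool over one large allocation; tracks offsets only. */
typedef struct SlicePool {
    uint64_t capacity;  /* size in bytes of the backing allocation */
    uint64_t head;      /* offset of the next free byte */
} SlicePool;

/* Reserve `size` bytes at an offset aligned to `align`, which must be a
 * power of two. Returns the offset, or SLICE_POOL_FAIL if it does not fit. */
static uint64_t slice_pool_alloc(SlicePool *p, uint64_t size, uint64_t align)
{
    assert(p != NULL);
    assert(align != 0 && (align & (align - 1)) == 0);

    uint64_t start = (p->head + align - 1) & ~(align - 1);
    if (start > p->capacity || size > p->capacity - start)
        return SLICE_POOL_FAIL;

    p->head = start + size;
    return start;
}

/* Make the whole pool available again once no submitted work uses it. */
static void slice_pool_reset(SlicePool *p)
{
    assert(p != NULL);
    p->head = 0;
}
```

On failure a caller could fall back to a dedicated allocation or grow the pool; either way the per-frame allocate and free calls that airlied calls slow drop out of the steady-state path.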