#dri-devel on 2021-10-12 — irc logs at oftc.irclog.whitequark.org

2021-07-26 22:56 ChanServ changed the topic of #dri-devel to: <ajax> nothing involved with X should ever be unable to find a bar

00:07 mattrope has joined #dri-devel

00:12 co1umbarius has joined #dri-devel

00:14 columbarius has quit [Ping timeout: 480 seconds]

00:21 kenjigashu has joined #dri-devel

00:32 nchery has quit [Quit: Leaving]

00:33 kenjigashu has quit []

00:38 ngcortes has quit [Ping timeout: 480 seconds]

00:41 Company has quit [Read error: Connection reset by peer]

01:09 i-garrison has quit [Read error: Connection reset by peer]

01:10 i-garrison has joined #dri-devel

01:33 Akari` has joined #dri-devel

01:34 Akari has quit [Remote host closed the connection]

01:37 linearcannon has quit [Quit: Textual IRC Client: www.textualapp.com]

01:46 dongwonk has joined #dri-devel

01:49 linearcannon has joined #dri-devel

01:56 mattrope has quit [Remote host closed the connection]

01:59 lemonzest has joined #dri-devel

02:01 camus has joined #dri-devel

02:04 Kayden has joined #dri-devel

02:15 JohnnyonFlame has quit [Ping timeout: 480 seconds]

02:29 flto has quit [Remote host closed the connection]

02:35 aravind has joined #dri-devel

02:38 flto has joined #dri-devel

03:04 camus1 has joined #dri-devel

03:08 camus has quit [Ping timeout: 480 seconds]

03:17 ybogdano has joined #dri-devel

03:24 flto has quit [Quit: Leaving]

03:25 ybogdano has quit []

03:28 flto has joined #dri-devel

03:32 Yuriy has joined #dri-devel

03:35 Yuriy has quit []

03:35 ybogdano has joined #dri-devel

03:37 ybogdano has quit []

03:38 ybogdano has joined #dri-devel

03:38 ybogdano is now known as Yuriy

03:39 Yuriy has quit []

03:39 ybogdano has joined #dri-devel

03:43 mbrost_ has joined #dri-devel

03:48 ybogdano has quit [Ping timeout: 480 seconds]

03:50 mbrost has quit [Ping timeout: 480 seconds]

04:13 slattann has joined #dri-devel

04:17 camus has joined #dri-devel

04:21 camus1 has quit [Ping timeout: 480 seconds]

04:32 Duke`` has joined #dri-devel

04:45 flto has quit [Remote host closed the connection]

04:46 flto has joined #dri-devel

04:55 mbrost_ has quit []

05:06 <Ristovski> mareko: Missed your reply yesterday. The rather cursed mesa issue I was experiencing was due to me having forgotten to remove amdgpu.mcbp=1 from my kernel cmdline. It caused the gfx ring to timeout when running anything, including `glxinfo`. Since I am on GFX6, I assume it was trying to use MCBP even though it wasn't properly implemented?

05:08 camus1 has joined #dri-devel

05:09 <Ristovski> I suspect its this, https://cgit.freedesktop.org/mesa/mesa/commit/?id=205e8cd09354422a8f1b80aaea49e3e0c770f972, haven't bisected and likely won't (since technically working as intended..), but thats the only commit that stands out. Also lol at "This fixes arb_compute_shader-dlist with mcbp enabled." :D

05:13 camus has quit [Ping timeout: 480 seconds]

05:29 sdutt has quit [Ping timeout: 480 seconds]

05:36 Duke`` has quit [Ping timeout: 480 seconds]

05:41 illwieckz has quit [Ping timeout: 480 seconds]

05:44 mbrost has joined #dri-devel

05:50 illwieckz has joined #dri-devel

05:59 thellstrom has quit [Quit: thellstrom]

05:59 thellstrom has joined #dri-devel

06:00 mlankhorst has joined #dri-devel

06:04 danvet has joined #dri-devel

06:16 <mareko> Ristovski: I think MCBP doesn't even exist on gfx6

06:18 <Ristovski> mareko: I see. Any clue why it would break with amdgpu.mcbp=1 then? Or does that override somehow present fake mcbp support to mesa?

06:18 <mareko> gfx8 is the first hw with MCBP

06:18 <mareko> that codepath is probably messed up everywhere

06:18 <mareko> not all kernel options are supposed to work at all times

06:20 <Ristovski> Figured, I guess it makes sense if mesa doesn't check for >= GFX8 and tries to use MCBP

06:21 <Ristovski> I assume it gets the "is mcbp supported" bit from libdrm?

06:28 unsolo_ has joined #dri-devel

06:34 unsolo has quit [Ping timeout: 480 seconds]

06:35 Erandir has quit [Ping timeout: 480 seconds]

06:39 pnowack has joined #dri-devel

06:42 <CounterPillow> Does anyone happen to know where the radeon driver has its PCI IDs? I have an Evergreen Cedar card "PCI edition" connected to a PCIe-to-PCI bridge (don't ask) and it just enumerates as "Non-VGA unclassified device: Comp. & Comm. Research Lab Device 8112 (rev 2a)" (1035:8112)

06:44 frieder has joined #dri-devel

06:45 <CounterPillow> No clue whether I stumbled into some engineering sample or something or if an error is being read as the PCI id, because that is a very weird vendor id.

06:48 camus1 has quit [Remote host closed the connection]

06:48 Company has joined #dri-devel

06:48 unsolo_ has quit [Ping timeout: 480 seconds]

06:48 camus has joined #dri-devel

06:50 * thellstrom Fixing drm-tip

06:55 <Ristovski> CounterPillow: the device id is weird too, unless I'm missing something obvious

06:56 <Ristovski> you can get a full list of pci vids/pids here: https://pci-ids.ucw.cz/pci.ids

06:57 <Ristovski> yours should be "cedar"

07:01 camus1 has joined #dri-devel

07:02 camus has quit [Ping timeout: 480 seconds]

07:13 <thellstrom> done fixing drm-tip.

07:13 <CounterPillow> 8112 PEX8112 x1 Lane PCI Express-to-PCI Bridge

07:13 <CounterPillow> that's an interesting one to have an ID of 8112. Not the bridge I use, but maybe one the card internally uses.

07:15 tursulin has joined #dri-devel

07:18 rgallaispou has joined #dri-devel

07:18 mbrost has quit [Read error: Connection reset by peer]

07:18 rasterman has joined #dri-devel

07:21 <airlied> CounterPillow: you should still be able to see the VGA device

07:21 <airlied> if the bridge is working an enumerated

07:21 <CounterPillow> Hmmm, true.

07:21 unsolo has joined #dri-devel

07:22 <airlied> I assume there's a PCI->PCIe bridge on the card

07:22 <airlied> which is that device

07:27 tzimmermann has joined #dri-devel

07:32 lynxeye has joined #dri-devel

07:33 dongwonk has quit [Remote host closed the connection]

07:33 Ahuj has joined #dri-devel

07:49 rgallaispou has quit [Read error: Connection reset by peer]

07:51 robher has quit [Read error: Connection reset by peer]

07:51 SanchayanMaity has quit [Read error: Connection reset by peer]

07:51 SanchayanMaity has joined #dri-devel

07:51 robher has joined #dri-devel

07:51 lileo_ has quit [Read error: Connection reset by peer]

07:51 dianders_ has quit [Read error: Connection reset by peer]

07:51 lileo_ has joined #dri-devel

07:51 krh has quit [Read error: Connection reset by peer]

07:51 dianders_ has joined #dri-devel

07:51 krh has joined #dri-devel

07:51 jonmason has quit [Read error: Connection reset by peer]

07:52 jonmason has joined #dri-devel

07:52 aswar002 has quit [Quit: No Ping reply in 180 seconds.]

07:52 rsripada_ has quit [Remote host closed the connection]

07:53 aswar002 has joined #dri-devel

07:53 rsripada has joined #dri-devel

07:54 elongbug has quit [Ping timeout: 480 seconds]

07:56 <CounterPillow> I think this really is a bridge device, just with a different vendor id. I shall try hacking in code to enable the bridge chip during probe at a later date.

08:00 rgallaispou has joined #dri-devel

08:07 shashanks has quit [Read error: Connection reset by peer]

08:07 shashanks has joined #dri-devel

08:08 rgallaispou has left #dri-devel [#dri-devel]

08:11 <CounterPillow> so in short, I am on an arm64 board probing a Radeon HD 5450 through a PCIe<->PCI bridge connected to another PCIe<->PCI bridge. Things are going well.

08:21 <arnd> CounterPillow: it's likely that there is a bug in your firmware or the PCIe host bridge driver that prevents bridges from getting probed right

08:21 <arnd> which SoC is this?

08:22 <CounterPillow> RK3566. I think it's the bridge actually, the device ID is referenced in oxygen_lib.c as a bridge, where it gets some manual configuration.

08:25 <CounterPillow> the reason why I'm not PCIe-ing directly is because of a silicon bug on the SoC making it unable to satisfy the cache coherency requirements of PCIe.

08:29 gawin has joined #dri-devel

08:30 shashanks has quit [Remote host closed the connection]

08:30 shashanks has joined #dri-devel

08:31 <arnd> CounterPillow: I don't support for rk3566 in mainline yet, only rk3399. Are you using any out-of-tree patches for the SoC support, or does the host bridge claim compatibility with rockchip,rk3399-pcie?

08:32 <arnd> CounterPillow: I don't know what the cache coherency requirements are here, but it seems unlikely that going through two extra bridges helps there, that usually only makes things worse ;-)

08:32 rgallaispou has joined #dri-devel

08:33 <arnd> the rk3399 pcie support has no cache coherency at all, but that's how most arm64 SoCs operate and not considered a bug, it's just slow

08:34 frieder has quit [Ping timeout: 480 seconds]

08:34 <arnd> it means all DMA that the device does needs to be done to uncached memory, or it needs additional cache flushes when accessed by the kernel

08:35 <CounterPillow> It's a different controller than the rk3399, but it has mainline support that was merged in 5.15.

08:35 <arnd> CounterPillow: which driver is it?

08:36 <CounterPillow> https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=0e898eb8df4e34c7b129452444eb7cef68a11f43

08:37 <arnd> ok, got it. I haven't merged the dts changes for that yet, but I see the driver now

08:37 hansg has joined #dri-devel

08:37 <CounterPillow> basically, what I'm doing is plugging an Analogix PCIe/PCI bridge into it, end then the PCI gpu into that. It's quite the unstable stack in all meanings of the word.

08:38 <CounterPillow> err, not Analogix. ASMedia, sorry.

08:39 <CounterPillow> ASM1083/1085, so notably one with special quirk handling. Might be part of the issue as well.

08:41 <arnd> I see the DT binding at https://lore.kernel.org/all/20210818093406.157788-1-xxm@rock-chips.com/#t

08:41 <arnd> which also (unsurprisingly) has it as non-coherent, but (slightly more surprising) looks like it does not support any legacy IRQs either

08:42 <arnd> Not sure if passing MSIs through your stack of bridges works as intended

08:42 <gawin> in my experience bridges pci <-> pci express are very tricky (at least inside usb controllers), I wasn't able to get it running with VFIO (device was always busy)

08:42 <arnd> though that wouldn't cause the probing to fail entirely

08:43 frieder has joined #dri-devel

08:44 <CounterPillow> the device vendor id being something unexpected but the device id being one that also is a bridge from some other vendor makes me think someone did an acquisition, though do the bridges even have drivers that match against certain IDs?

08:44 <arnd> CounterPillow: since this is now a dwc PCIe, I would at least expect the bridges to work, as the actual probing is done in the common dwc-pcie code, not in the rockchip specific parts

08:45 <CounterPillow> oh yeah the PCI bridge I plug in works with other PCI devices (I used an ASUS Xonar PCI soundcard to test at one point)

08:45 <arnd> bridges are meant to be probed by generic code looking at the device classes, not PCIe vendor/device IDs

08:45 <CounterPillow> I see

08:46 <arnd> CounterPillow: what does 'lspci -t -v' show? Do you see both bridge, but not the device behind the second bridge, or do you only see the first bridge?

08:47 <CounterPillow> -[0000:00]---00.0-[01-ff]----00.0-[02]----00.0 Comp. & Comm. Research Lab Device 8112

08:47 <CounterPillow> I see both bridges, but not the device behind the second bridge.

08:47 <CounterPillow> Assuming that this still is a bridge

08:47 <CounterPillow> it enumerates as a "Non-VGA unclassified device", so if it is a bridge then it's not saying that it's a bridge

08:48 <arnd> try 'lspci -vv' as root for more detailed output about both bridges

08:48 unsolo_ has joined #dri-devel

08:48 <CounterPillow> https://gist.githubusercontent.com/CounterPillow/acae1aa99f7f1baf67e2ea54bdf08b05/raw/ea2d7cd2db0b3a1874e110132522af82d0166e01/gistfile1.txt

08:49 elongbug has joined #dri-devel

08:50 rgallaispou has quit [Read error: Connection reset by peer]

08:50 unsolo has quit [Ping timeout: 480 seconds]

08:50 <arnd> "Memory behind bridge: [disabled]" looks like a problem

08:50 <arnd> so even the host bridge has no access to MMIO registers

08:51 <CounterPillow> Oh well

08:51 <arnd> CounterPillow: do you see any errors in the boot log for the PCIe probe?

08:51 <CounterPillow> [ 1.280445] pci 0000:02:00.0: [1035:8112] type 00 class 0x060400

08:51 <CounterPillow> [ 1.281060] pci 0000:02:00.0: ignoring class 0x060400 (doesn't match header type 00)

08:51 <CounterPillow> [ 1.279344] pci_bus 0000:02: extended config space not accessible

08:51 <CounterPillow> [ 1.279980] pci_bus 0000:02: scanning bus

08:52 <arnd> ok, so the last line explains why it ignores the second bridge, it just doesn't know what this is

08:54 <arnd> the first line might be the cause of that problem, it's possible that you can't configure the first bridge without extended config space (not sure, that's where my pcie knowledge definitely hits its limits)

08:54 <CounterPillow> Oh well, this was a fun poke at things, I'll just write this one off :)

08:54 rgallaispou has joined #dri-devel

08:55 <arnd> CounterPillow: do you know what the original problem was that prevents you from using a normal pcie card? That would likely be the easier problem to work out.

08:55 f11f12 has joined #dri-devel

08:56 <arnd> I think the probing here can be fixed by digging into it, it certainly sounds like something wrong with the host bridge driver, but most likely after you fix that you end up in the same situation that you'd be in with a normal pcie card

08:56 <CounterPillow> I think it was that the amd drivers will not support missing cache coherency

08:56 <arnd> right, in that case, there is no hope

08:56 <arnd> if the pcie host bridge is not coherent, then adding bridges would not make it any less broken

08:56 <CounterPillow> oops

08:57 <arnd> you might still be able to use it as a dumb framebuffer if you could get the bootloader to post the device, but there are probably better ways of getting a dumb framebuffer

08:58 <CounterPillow> Well yeah, there's an integrated GPU that's just sitting there waiting for a vop2 driver

08:58 <CounterPillow> It's not that I need to do this, it's just that I thought it would be funny if it ended up working

09:02 <arnd> CounterPillow: I suppose there is a chance of the amdgpu driver getting fixed at some point. Samsung and AMD already announced a phone SoC based on a more modern AMD gpu, so if that is noncoherent as well (most phone chips are), they might have to fix the driver after all

09:04 <CounterPillow> Does nouveau work with noncoherent devices? Since Tegra is an ARM SoC family.

09:04 <HdkR> Or they just make it coherent out of sanity :)

09:06 <CounterPillow> I guess I could test the coherency requirements of nouveau after lunch, I've got an 8800 GTX as well as a GTX 480 laying about that should be supported.

09:18 lynxeye has quit []

09:20 <tjaalton> will mesa 21.3 branch tomorrow?

09:21 <tjaalton> eric_engestrom: ^

09:23 <eric_engestrom> tjaalton: yes, tomorrow at around 6pm UTC :)

09:23 <tjaalton> eric_engestrom: cool, thanks

09:25 <arnd> CounterPillow: maybe ask on #armlinux or #aarch64-laptops, I'm sure someone there has tried it before

09:26 <pinchartl> danvet: can I (gently) ping you on "[GIT FIXES FOR v5.15] R-Car DU fix" ?

09:27 <danvet> airlied, ^^

09:27 <pinchartl> actually, scratch that, it seems I miesed up

09:27 <pinchartl> messed

09:27 <pinchartl> the same fix was included by mistake in my -next pull request

09:27 <pinchartl> which has been merged already

09:27 <pinchartl> so that will conflict in Linus' tree, not nice

09:28 <pinchartl> I suppose it's best to skip the v5.15 fix and get it backported in the v5.15.x stable branch ?

09:32 <pinchartl> airlied: ^^ I'll let you decide what's best

09:42 <gawin> may be stupid question how can I debug vram corruption? unfortunately there's no asan/valgrind for vram

09:43 <HdkR> renderdoc? :)

09:45 <gawin> thanks, gonna try

10:04 <gawin> " In particular on desktop only modern GL is supported - legacy GL that is only available via the compatibility profile in OpenGL 3.2 is not supported." sad r300 noises

10:07 <gawin> recently debugging d3d9 or gl2 became difficult (even if you're on app's side)

10:09 <gawin> I mean even just getting tools is problematic (iirc amd has removed their older tools for windows)

10:19 camus has joined #dri-devel

10:22 slattann has quit []

10:24 camus1 has quit [Ping timeout: 480 seconds]

10:25 <MrCooper> gawin: you can enable Option "TearFree" in xorg.conf without forcing the driver, or you can enable TearFree at runtime with xrandr

10:36 flacks has quit [Quit: Quitter]

10:38 flacks has joined #dri-devel

10:45 <gawin> MrCooper: just this?

10:45 <gawin> Section "Device"

10:45 <gawin> Option "TearFree" "true"

10:45 <gawin> EndSection

10:45 <MrCooper> and an Identifier

10:46 <gawin> it can be anything? or needs to match something?

10:46 <MrCooper> anything

10:46 <gawin> thanks

10:47 <MrCooper> np

10:58 <hansg> vsyrjala, can you review my "[PATCH 10/10] drm/i915: Add privacy-screen support (v3)" patch please ? I've addressed your request to move drm_privacy_screen_get() call to intel_ddi_init_dp_connector(). That is the last one of the series needing a review, then I can push the series.

11:17 <emersion> hansg: btw, in case you've missed it, i've sent a RFC for the CLOSEFB stuff. feedback welcome!

11:17 <hansg> emersion, yeah I've seen it. I've been out with the f

11:18 <hansg> Ugh. What I wanted to write is I've been out with the flu for 10 days, so I'm currently catching up on the backlog. And I really want to get the drm-privacy stuff wrapped up before starting something new.

11:18 <emersion> oh, no worries, glad you're back!

11:19 <hansg> With that all said I've looking at your v2 / CLOSEFB proposal on my to do list.

11:19 <emersion> yeah, no rush, just wanted to make sure it's not falling through the cracks :)

11:19 <hansg> ack

11:45 imre has quit [Quit: leaving]

11:56 imre has joined #dri-devel

11:57 flto has quit [Read error: Connection reset by peer]

12:05 lynxeye has joined #dri-devel

12:06 vivijim has joined #dri-devel

12:06 gruetze_ has joined #dri-devel

12:07 flto has joined #dri-devel

13:00 sdutt has joined #dri-devel

13:01 sdutt has quit []

13:01 sdutt has joined #dri-devel

13:20 sukualam has joined #dri-devel

13:24 shashank_sharma has joined #dri-devel

13:25 camus1 has joined #dri-devel

13:27 camus has quit [Ping timeout: 480 seconds]

13:29 camus has joined #dri-devel

13:29 camus1 has quit [Read error: Connection reset by peer]

13:30 shashanks has quit [Ping timeout: 480 seconds]

13:30 sukualam has quit [Ping timeout: 480 seconds]

13:35 leandrohrb1 has joined #dri-devel

13:35 <gawin> btw mareko if you have time now what do you think about this one? https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13226

13:36 leandrohrb has quit [Read error: Connection reset by peer]

13:58 fxkamd has joined #dri-devel

14:02 unsolo has joined #dri-devel

14:04 unsolo_ has quit [Ping timeout: 480 seconds]

14:08 pushqrdx has quit [Ping timeout: 480 seconds]

14:23 camus1 has joined #dri-devel

14:23 camus has quit [Read error: Connection reset by peer]

14:24 FireBurn has quit [Quit: Konversation terminated!]

14:24 thellstrom has quit [Ping timeout: 480 seconds]

14:29 pushqrdx has joined #dri-devel

14:40 pushqrdx has quit [Ping timeout: 480 seconds]

14:43 mattrope has joined #dri-devel

14:44 <hwentlan> Plagman, emersion: was off for canadian thanksgiving yesterday. i'm okay taking the chrome workaround upstream or for chrome guys to take it in the chrome tree. i hear chrome devs are working on a new compositor, so maybe in a year this won't be needed anyways

14:45 thellstrom has joined #dri-devel

14:46 thellstrom1 has joined #dri-devel

14:51 <Venemo> daniels: the latest pipeline is green, but marge says "CI is taking too long" here: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13121 what should I do about it?

14:52 Venemo is now known as Venemo__

14:53 thellstrom has quit [Ping timeout: 480 seconds]

14:54 Venemo__ is now known as Venemo

15:03 BobBeck has joined #dri-devel

15:03 Duke`` has joined #dri-devel

15:05 unsolo has quit [Ping timeout: 480 seconds]

15:05 flto has quit [Ping timeout: 480 seconds]

15:06 cengiz_io_ has quit []

15:06 cengiz_io has joined #dri-devel

15:11 <bnieuwenhuizen> Venemo: reassign to marge

15:11 <bnieuwenhuizen> ?

15:11 <zmike> can confirm^

15:11 <bnieuwenhuizen> looks like the pipeline took 56 min, which happens to be pretty long

15:11 <bnieuwenhuizen> but it succeeded even though marge had given up

15:15 <Venemo> freaking bot

15:26 pushqrdx has joined #dri-devel

15:29 Ahuj has quit [Ping timeout: 480 seconds]

15:29 nchery has joined #dri-devel

15:30 mbrost has joined #dri-devel

15:40 tzimmermann has quit [Quit: Leaving]

15:49 thellstrom1 has quit []

15:54 kts has joined #dri-devel

15:54 kts has quit []

15:55 kts has joined #dri-devel

15:56 adjtm has quit [Quit: Leaving]

16:09 <jekstrand> karolherbst: Can you take a look at the clover patches in !4743?

16:09 <jekstrand> karolherbst: Should be pretty trivial

16:09 <jekstrand> But stupid

16:11 <karolherbst> jekstrand: that MR is already on my todo list :D

16:11 <jekstrand> karolherbst: Cool.

16:15 frieder has quit [Remote host closed the connection]

16:48 gouchi has joined #dri-devel

16:49 ngcortes has joined #dri-devel

16:50 <anholt> still have a cts uprev waiting, and it fixes up iris manual expectations to be more stable. Anyone for review? https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13253

16:53 <ajax> anholt: rb

16:54 <anholt> thanks

16:55 tobiasjakobi has joined #dri-devel

16:55 tobiasjakobi has quit [Remote host closed the connection]

17:02 macromorgan has quit [Read error: Connection reset by peer]

17:02 macromorgan has joined #dri-devel

17:03 agd5f has quit [Remote host closed the connection]

17:04 jstultz has quit [Read error: Connection reset by peer]

17:04 jstultz has joined #dri-devel

17:04 ezequielg has quit [Read error: Connection reset by peer]

17:04 ezequielg has joined #dri-devel

17:05 agd5f has joined #dri-devel

17:36 xexaxo has quit [Remote host closed the connection]

17:37 xexaxo has joined #dri-devel

17:39 ybogdano has joined #dri-devel

17:44 mlankhorst has quit [Ping timeout: 480 seconds]

17:50 <gawin> anholt: (if you have some free time) can you also take a look at this one? https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12995 thanks

17:56 <gawin> :)

17:56 <anholt> (done)

17:57 <gawin> thanks once again

18:04 camus has joined #dri-devel

18:04 camus1 has quit [Read error: Connection reset by peer]

18:29 gawin has quit [Ping timeout: 480 seconds]

18:30 aravind has quit [Ping timeout: 480 seconds]

18:31 aravind has joined #dri-devel

18:31 hansg has quit [Quit: Leaving]

18:34 <anholt> agd5f: we get a lot of kernel warnings in ci, and I don't see an obvious commit fixing it. Who should this be reported to? https://gitlab.freedesktop.org/anholt/mesa/-/jobs/14638956#L4662

18:36 <agd5f> anholt, https://gitlab.freedesktop.org/drm/amd/-/issues

18:36 Ahuj has joined #dri-devel

18:39 <anholt> agd5f: thanks. https://gitlab.freedesktop.org/drm/amd/-/issues/1747

18:40 lynxeye has quit []

18:41 pnowack has quit [Quit: pnowack]

18:41 aravind has quit [Ping timeout: 480 seconds]

18:46 gpoo has quit [Remote host closed the connection]

18:50 <airlied> hwentlan, agd5f : https://paste.centos.org/view/raw/9ab2f8f4 amd vs intel display code in a race to 10000 :-P, though the intel one is after I've refactored 1000 more lines out

18:50 <airlied> but 10,000 loc in one file seems a bit unwieldly

18:54 camus1 has joined #dri-devel

18:56 camus has quit [Read error: Connection reset by peer]

18:59 gawin has joined #dri-devel

19:03 <hwentlan> airlied: yeah, we need to break up that file some more

19:05 JohnnyonFlame has joined #dri-devel

19:08 <emersion> i always end up having to touch amdgpu_dm.c

19:08 <emersion> was wondering if it was just bad luck, doesn't seem like it :P

19:11 ybogdano has quit [Ping timeout: 480 seconds]

19:11 flto has joined #dri-devel

19:15 pnowack has joined #dri-devel

19:25 camus has joined #dri-devel

19:29 camus1 has quit [Ping timeout: 480 seconds]

19:40 alanc has quit [Remote host closed the connection]

19:41 alanc has joined #dri-devel

19:47 heat has joined #dri-devel

19:47 dianders_ has left #dri-devel [#dri-devel]

19:49 dianders has joined #dri-devel

19:50 lemonzest has quit [Quit: WeeChat 3.2]

19:55 unsolo has joined #dri-devel

19:58 pnowack has quit [Quit: pnowack]

19:58 pnowack has joined #dri-devel

20:02 gawin has quit [Ping timeout: 480 seconds]

20:05 hfink_ has quit []

20:05 hfink has joined #dri-devel

20:16 heat has quit [Ping timeout: 480 seconds]

20:16 mbrost has quit [Remote host closed the connection]

20:20 ybogdano has joined #dri-devel

20:30 Duke`` has quit [Ping timeout: 480 seconds]

20:33 Ahuj has quit [Ping timeout: 480 seconds]

21:01 rasterman has quit [Quit: Gettin' stinky!]

21:15 <graphitemaster> So the amdgpu driver does support mid command buffer preemption, it's just not default - loading the module with mcbp=1 does make AMD systems so much more responsive to long running compute kernels

21:15 <graphitemaster> Why is this not default?

21:22 <HdkR> Buggy, crashy, not fully validated, all of the above? :P

21:23 <graphitemaster> Okay well someone here previously told me this is a hard problem requiring a re-plumb of the entire Linux graphics stack and apparently there's already a working implementation of it that has been here for quite awhile, just needs some massaging. I don't know who to believe anymore XD

21:24 <airlied> graphitemaster: why not both?

21:25 <graphitemaster> https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5798/commits

21:25 <bnieuwenhuizen> what is your compute work? rocm?

21:26 <airlied> mcbp doesn't fix any of the problems with the Linux stack replumbing

21:26 <airlied> it's merely a component of the fix

21:27 gouchi has quit [Remote host closed the connection]

21:28 <bnieuwenhuizen> I think if the compute work is rocm, and you want to preempt that the replumbing might actually not be needed, but that is quite a limited scope of work

21:28 <agd5f> graphitemaster, all mcbp=1 does is allow preemption to work. Someone still has to actually do request it.

21:28 <agd5f> bnieuwenhuizen, ROCm can already preempt

21:29 vivijim has quit [Ping timeout: 480 seconds]

21:29 <pinchartl> airlied: I've made a mistake and included the same fix in both "[GIT FIXES FOR v5.15] R-Car DU fix" (which hasn't been pulled yet) and in a branch that you have merged for v5.16. how would you like to proceed with that, dropping the v5.15 fix pull request (the fix, which is for a v5.15 regression, would then end up in the stable branch once v5.16 is out), or still merge the fix for v5.15 ?

21:31 <graphitemaster> No, not ROCm, I was just doing my own research and investigation into it when I felt people here weren't that interested or serious in fixing what I believed was a fairly serious problem. Came across the only driver that had any mention of more granular preemption and decided to try it for myself and I can report on a regular X desktop install with a long running GLSL compute shader, the desktop is more responsive than without it.

21:31 <graphitemaster> It's still not Windows good though. I'm curious to know what plumbing is needed when this already has a pretty obvious interactivity improvement?

21:32 <graphitemaster> If it's just a part of the equation and more work is needed, this is already a pretty damn good start.

21:35 <agd5f> graphitemaster, nothing actually gets preempted unless someone says preempt that job. that part is missing. On windows the OS does it. On Linux no one does.

21:37 <graphitemaster> Is it so hard to hack that pre-empt request in the kernel/mesa?

21:37 <airlied> pinchartl: how big is the fix? I think I can cherry-pick it back from the v5.16 branch into the v5.15 branch like we do for other fixes

21:37 <bnieuwenhuizen> I'm just wondering what changed if there is no pre-emption happening yet

21:37 <airlied> it shouldn't mess git up as much

21:38 <graphitemaster> It doesn't need to be perfect here, a really gnarly criterion that just keeps the desktop interactive is infinitely better than the current "hah your desktop is hosed, lets hold the power button down because not even alt+sysrq+r works"

21:39 <karolherbst> soooo.. I want to look into using marge for nouveau kernel patches (doing the rebase + adding Link: tags etc...)

21:39 <karolherbst> is there anybody around to give me some pointers on this? :D

21:41 <pinchartl> airlied: 3 files changed, 28 insertions(+), 4 deletions(-)

21:42 <karolherbst> btw... how strong are we about those Link: tags inside the commits?

21:43 <pinchartl> airlied: 187502afe87a in your tree

21:43 <airlied> pinchartl: okay let me give this a go

21:44 <airlied> karolherbst: we like the Link tags to exist, it's not end of the world if they don't

21:45 <airlied> karolherbst: is marge going to get a signed-off-by line or how are you going to deal with that requirement?

21:46 * pinchartl wonders why after all this year git..b don't support adding tags to commits

21:46 <karolherbst> airlied: well, people can also just set them manually and point towards the MR

21:46 <karolherbst> pinchartl: because you have to figure out to which patch the tag applies to

21:46 <karolherbst> does it apply to all? or just a few?

21:47 <pinchartl> that's why I'd like git..b to support per-commit approval

21:47 <pinchartl> instead of a single button to approve the whole request

21:47 <karolherbst> airlied: atm I push the patches to drm-misc myself and I don't plan to automate this with marge

21:47 <pinchartl> I'm certainly biased by the kernel work flow, but it seems such a core feature to me

21:47 <karolherbst> marge just pushes to nouveau-next/fixes and then somebody would move it higher up

21:48 <karolherbst> pinchartl: sure, but again.. how do you map language/comments to tags added to which commit

21:48 <pinchartl> (and if per-commit review became a first-class citizen, it would also be good to have the ability to comment on the commit message itself)

21:48 <airlied> pinchartl: per-commit is harder for gitlab/hub to track, esp as an MR evolves

21:48 <karolherbst> yeah.. and that as well

21:48 <airlied> gerritt does the crazy change-id thing

21:49 <pinchartl> when reviewing a merge request, you get the list of commits, you can open them individually and it wouldn't be difficult in the UI to support adding tags

21:49 <airlied> pinchartl: how do you track the tags across iterations though

21:49 <airlied> the commit ids change

21:49 <airlied> so do the subjects

21:49 <airlied> and the contents

21:49 <karolherbst> airlied: so, if a Link: tags points towards a merge request, that would be totally acceptable for everybody I guesS?

21:49 <airlied> karolherbst: yes as long as you can find discussion

21:49 <pinchartl> that's why the author would likely need to pull the updated branch

21:49 <pinchartl> to work on vn+1

21:50 * pinchartl goes back to his e-mail client for reviews

21:50 <airlied> yeah it's more the whole database behind gitlab tracking it, github used to choke on force pushes for the same reason

21:50 <airlied> it could no longer match the existing commentary to the new push

21:51 <karolherbst> yeah.. it's all not very nice atm

21:51 <karolherbst> but it also never was with literally any other tooling

21:52 <karolherbst> the only thing I think could work is, people approving it via UI and then it applies to all patches

21:52 <karolherbst> and this marge could add

21:52 <karolherbst> but anything beyond that?

21:52 <pinchartl> for large patch series it's common to approve some of them only, I like tags to figure out what I've reviewed already

21:53 <pinchartl> I need to start using b4 and public-inbox more seriously, there are very good ideas there

21:53 <karolherbst> yeah, but I don't see a way to do that automatically

21:53 ybogdano has quit [Ping timeout: 480 seconds]

21:53 <airlied> like email only works there if the original author remembers to add all tags manually before sending a v2

21:54 <karolherbst> or use proper reply-to stuff...

21:54 <airlied> which is no different than using gitlab and having the author manually add tags

21:54 <karolherbst> point is: every solution sucks

21:54 <airlied> and also the v1-Rb: type of thing

21:54 <airlied> gets messy

21:54 <karolherbst> yeah...

21:54 <airlied> it's like how much change invalidates an r-b

21:55 <karolherbst> it's a social issue you try to find technical solutions for :p

21:55 <pinchartl> everything sucks, we should all go farm tomatoes in portugal (or herd goats in the French Larzac, it seems quite popular for people who are fed up with humanity :-))

21:55 <airlied> there's a lot of guesswork and workarounds even for the glorious email syustem

21:55 <airlied> pinchartl: they suck more

21:55 <airlied> ever try and a make a profit from farming?

21:55 <pinchartl> maybe "profit" is what we should reconsider :-)

21:55 <airlied> pinchartl: mostly just pointing out why "why can't g..* just do X" is because X is really hard to do right

21:55 <karolherbst> or even a living wage without subsidies

21:55 <airlied> pinchartl: you can't live on tomatoes

21:56 <airlied> or goats

21:56 <pinchartl> you can't eat patches :-D

21:56 <karolherbst> pinchartl: it's not even enough to live unless you get subsidies :D

21:56 danvet has quit [Ping timeout: 480 seconds]

21:56 <pinchartl> (while a goat meat stew with tomatoes...)

21:56 <karolherbst> airlied: well.. actually...

21:56 <airlied> so you are fed for one week a year when your tomatoes aren't eaten by random goats :-P

21:57 <karolherbst> :P

21:57 <pinchartl> hmmmm... right, goats eating tomatoes is an issue we'd have to solve

21:57 <karolherbst> after that you eat the goats

21:57 <pinchartl> I wonder if there's a project on git..b for that :-D

21:57 <karolherbst> anyway..... for now I just want marge to add tags :D

21:58 <karolherbst> another problematic thing would be s-by tags

21:58 <karolherbst> should we add them from the person "accepting" the MR even though somebody else pushes it up to drm-misc?

21:58 <pinchartl> agreed, we don't need to solve the tomatoes and goats issues as part of your tag handling problem. that would be an extreme case of yak shaving, or most likely goat shaving

21:59 <karolherbst> atm I just want to script away the annoying bits of all of this

22:02 <airlied> karolherbst: yes automated s-o-b tags are kinda legally dubious

22:03 <karolherbst> *sigh*

22:03 <karolherbst> airlied: well.. but we could say that the person accepting the MR ....

22:03 <karolherbst> dim also adds the tag automated

22:04 <karolherbst> so I don't realy see the big difference here

22:04 <karolherbst> if a maintainer accepts patches they are not comfortable adding a s-by, then I'd question the maintainer why the patches were accepted in the first place :p

22:05 <pinchartl> it may be fine, but it should be discussed with the kernel community I think

22:06 <karolherbst> I mean... I can also add the s-by through dim...

22:06 <robclark> karolherbst: what about marge pushing to `${driver}-next-staging` and then real human doing `git rebase --sign-off` and pushing to the real -next branch?

22:06 <karolherbst> dim apply-from-gitlab $dest_Branch $src_branch

22:06 <karolherbst> robclark: and then pushing to drm-misc again?

22:06 <karolherbst> that's two steps

22:07 <airlied> karolherbst: someone applying a patch is when the s-o-b from them is applied

22:07 <karolherbst> so atm I am just talking about MR against nouveau-next/fixes and then somebody pushes against drm-misc-next/fixes

22:07 <robclark> well, like marge would push to drm-misc-staging but then maintainer does the manual step of adding s-o-b and pushing to drm-misc

22:07 <airlied> they've signed off that they are legally allowed to apply this patch

22:07 <airlied> now having a bot do that makes me a bit skeptical

22:08 <airlied> having a bot do it on behalf of someone when they click a merge button I suppose might be okay

22:08 <karolherbst> airlied: yeah

22:08 <karolherbst> that's my thinking

22:08 <airlied> but I think there has to be a considered action on behalf of a user

22:08 <karolherbst> why is marge different to dim

22:08 <karolherbst> it's all triggered by a person

22:08 <airlied> does marge know which person to apply them for?

22:08 <karolherbst> I have no idea

22:08 <pinchartl> there has to be a human action I think. git rebase --sign-off already automates adding the SoB line, having another tool doing it isn't very different, as long as it's triggered by a human

22:09 <karolherbst> let's see

22:09 <pinchartl> but it should still be discussed with the kernel community I think

22:09 <airlied> dim is all done locally on a developers machine by them, it's just a wrapper around it

22:09 <karolherbst> airlied: "Emma Anholt @anholt assigned to @marge-bot and unassigned @anholt 2 hours ago"

22:09 <karolherbst> at least gitlab knows

22:09 <airlied> yeah I'm guessing you'd have to fork marge-bot anyways to do this sort of thing

22:09 <karolherbst> probably

22:10 <karolherbst> okay

22:10 <karolherbst> so uncontroversial is adding Link: tags to patches

22:10 <airlied> yeah that should be fine

22:10 <karolherbst> and we could script adding the s-by tags in dim for now

22:10 <karolherbst> having some "apply those patches to this branch" kind of thing checking all patches a last time or whatever

22:11 <robclark> adding s-o-b is easy, git-rebase can do it

22:11 <robclark> so rebase drm-misc-staging on drm-misc, and then push

22:11 <karolherbst> yeah..

22:11 <karolherbst> if people don't forget

22:12 <pinchartl> if gitlab can't do it by itself, what's the process to capture review "tags" ? or is the plan to drop that information altogether, not recording review and test information in commit messages ?

22:12 <robclark> true.. it is at least a step in the right direction because we get some CI and automation of most of the process

22:12 <karolherbst> yeah.. I think it would be good to know what the kernel community thinks about adding those tags automatically, because then it's easy

22:13 <karolherbst> pinchartl: you have to collect those tags yourself sadly :/

22:13 <karolherbst> well.. at least something patchwork wasn't that terrible at

22:13 <pinchartl> that doesn't differ from today, so it's not a regression, even if automation could be nice

22:13 <pinchartl> but how do you get them in the first place if reviews happen on gitlab ?

22:14 <bnieuwenhuizen> on mesa people just comment Rb or whatever and you just amend that manually

22:14 <agd5f> graphitemaster, there is a debugfs file, amdgpu_preempt_ib, if you want to play with it.

22:15 <JoshuaAshton> karolherbst: It'd be nice if we could make Marge collate strings like "Reviewed-by: " for every commit or "[sha] - Reviewed-by:" for a single commit

22:16 <JoshuaAshton> It could even refer to the mail map contributors thing to work for just "Rb" or "[sha] - Rb"

22:17 <karolherbst> JoshuaAshton: I'd try those things out inside mesa first though

22:17 <pinchartl> it's indeed better to test the process first before pushing it towards the kernel community, or you'll risk some serious backlash

22:18 <robclark> it is a bit hard for a script to decide if r-b applies to the whole series or just individual patches..

22:18 <karolherbst> yeah..

22:19 <karolherbst> I think it's fine to add Link: and s-b-o tags, but everything else should be heavily tested before, as this gets quite complicated real quick

22:19 <karolherbst> airlied: do you want to start the discussion with the kernel folks?

22:19 <karolherbst> also.. where to get marge? and how to deploy it and where? :D

22:20 <robclark> I guess to start, it is submitter's responsibility to append r-b/t-b/etc tags and re-push the MR

22:20 <airlied> karolherbst: not really sure where best to bring in kernel ppl, cc'ing lkml is pretty futile :-P

22:20 <airlied> tbh I'm not sure you want to engage too much with the lkml community before you've got a proof of concept

22:21 * airlied isnt sure whree our marge-bot comes from or is hosted

22:21 <airlied> anholt: knows more

22:21 <bnieuwenhuizen> the bot is hosted on fdo

22:21 <bnieuwenhuizen> https://github.com/smarkets/marge-bot is the upstream source

22:22 <bnieuwenhuizen> and https://gitlab.freedesktop.org/freedesktop/helm-gitlab-config/-/tree/master/gitlab-bots has our config

22:22 <pinchartl> airlied: maybe workflows@vger.kernel.org ?

22:23 camus1 has joined #dri-devel

22:23 <karolherbst> airlied: I can work on the proof of concept part, but it's really not that much I guess? Just a bot adding s-b-o tags from people saying "merge it!". But I gues I _could_ make it work first

22:24 <airlied> pinchartl: oh indeed I forgot about that

22:24 <pinchartl> there's users@linux.kernel.org and tools@linux.kernel.org too

22:24 <pinchartl> not sure which ones are the most appropriate

22:24 <airlied> karolherbst: yeah a bot doing it under user direction seems like a proper answer

22:25 <karolherbst> yeah, wouldn't do anything more and it's still not into drm-misc directly yet, so there is a person in between making sure everything is alright

22:25 camus has quit [Ping timeout: 480 seconds]

22:25 <karolherbst> we could also say, MRs against drm-misc or drm _need_ those s-b-o tags

22:26 <karolherbst> and what drivers do internally? nobody cares :p

22:26 <karolherbst> but I guess we want to come to a situation where we only have one repo?

22:26 <karolherbst> dunno

22:27 <karolherbst> or well.. drm-misc accepts whatever

22:27 <graphitemaster> agd5f, I don't really want to play with it, so much as I want to see the Linux desktop have it :P Basically need to take this from toy status to every desktop install in 2022 can actually just deal with long running compute shaders. WebGPU is approaching dangerously so at minimum the denial of service attack bug reports going to be piling up on both sides as regular users navigate to URLs that system lock their PC.

22:27 <karolherbst> but drm requires s-b-o tags for all MRs

22:27 <bnieuwenhuizen> graphitemaster: webgl can already do this, don't worry

22:28 <airlied> karolherbst: linux needs s-o-b on every commit from everyone who handles it

22:28 <airlied> until it lands in a git tree

22:29 <karolherbst> airlied: sure, that's not what I mean

22:29 <karolherbst> I mean until it gets to drm, bots/scripts can add those tags

22:29 <robclark> bnieuwenhuizen: I did notice the other day that shadertoy seems to not try to compile all the shadertoy's on it's front page these days

22:29 <graphitemaster> bnieuwenhuizen, Only a problem on Linux too.

22:29 <karolherbst> but drm _requires_ those to exist before merging

22:29 <karolherbst> ohh wait

22:29 <karolherbst> then it needs yours or danvets tag...

22:29 <karolherbst> ehh

22:29 <karolherbst> annoying

22:30 <karolherbst> or doesn't it?

22:32 ngcortes has quit [Remote host closed the connection]

22:35 ybogdano has joined #dri-devel

22:45 <airlied> karolherbst: when I merge an MR I only add my tag to the merge

22:45 <graphitemaster> Hah, NV's 496.13 driver completely breaks all OpenGL applications.

22:45 <jekstrand> woohoo

22:46 <jekstrand> quality

22:46 <airlied> karolherbst: I assume that whoever merged the patches or done intermediate rebases have added their s-o-b targs

22:46 <airlied> tags

22:47 adjtm has joined #dri-devel

22:48 <graphitemaster> jekstrand, conspiracy to kill opengl by making bad gl drivers is amd's stichk

22:49 pnowack has quit [Quit: pnowack]

22:50 <jekstrand> graphitemaster: Yeah, nvidia typically loves GL

22:51 <graphitemaster> They still probably only test like one CAD suite and the original GLQuake

22:51 <graphitemaster> If they stay working then GL QoL passes.

22:52 <graphitemaster> I want the job of the guy who just plays GLQuake all day testing drivers.

22:54 <anholt> mareko: are there any nice tools for tracking/assigning blame for gpu memory usage with radeonsi?

22:55 <HdkR> graphitemaster: Their automated testing suite is definitely more encompassing than that.

22:55 <karolherbst> airlied: mhh, okay

22:56 <karolherbst> so I guess we could do it this way then

22:56 <karolherbst> drm is doing real merges and you can add your tag yourself before sending it out or something.. but dunno

22:56 <karolherbst> maybe you can also just merge locally :D

22:56 <karolherbst> but the MR could verify that we have s-b-o tags from allowed people

22:56 <bnieuwenhuizen> anholt: what granularity?

22:56 <bnieuwenhuizen> between processes or within a process?

22:57 <anholt> bnieuwenhuizen: within a process

22:57 <bnieuwenhuizen> none I think

22:57 <bnieuwenhuizen> what do you need to trace it back to?

22:57 <anholt> trying to run test_va_api, we're ooming on grunt with it looks like 2.5GB of memory used between 4 test processes.

22:58 <anholt> I've grabbed a few testcases and checked libc allocations with massif and it's ~16MB.

22:58 <graphitemaster> HdkR, I was making a joke lol. I know the coverage of it is more than that XD

22:59 flto has quit [Remote host closed the connection]

22:59 <anholt> bnieuwenhuizen: so, if there was some logging of BO allocation, I might be able to use that to find troublesome testcases, or find leaks.

23:00 <bnieuwenhuizen> anholt: my random guess would be the encoder has a scratch buffer that IIRC was huge in the past due to no mid-stream resize capabilities

23:00 flto has joined #dri-devel

23:00 <bnieuwenhuizen> assuming a new enough kernel /sys/kernel/debug/dri/0/amdgpu_vm_info might give you the current state

23:01 <bnieuwenhuizen> just no really useful metadata per buffer object beside size and memory type

23:05 <airlied> pinchartl: pushed that rcar fix to drm-fixes

23:05 <anholt> bnieuwenhuizen: thanks, that should help me at least figure out if I'm on the right track with BOs

23:16 <anholt> bnieuwenhuizen: yep. 587MB of BOs showing up in some subset of the tests.

23:16 tursulin has quit [Read error: Connection reset by peer]

23:19 xyene has quit []

23:19 quantum5 has quit [Quit: ZNC - https://znc.in]

23:22 xexaxo_ has joined #dri-devel

23:22 xexaxo has quit [Read error: No route to host]

23:37 <pinchartl> airlied: thanks a lot

23:43 tarceri has quit [Remote host closed the connection]