#freedesktop on 2023-01-19 — irc logs at oftc.irclog.whitequark.org

2022-12-21 00:45 ChanServ changed the topic of #freedesktop to: https://www.freedesktop.org infrastructure and online services || for questions about freedesktop.org projects, please see each project's contact || for discussions about specifications, please use https://gitlab.freedesktop.org/xdg or xdg@lists.freedesktop.org

00:47 <pendingchaos> this channel is mostly for freedesktop infrastructure and online services, rather than hosted projects

00:47 <pendingchaos> I don't know what would be a good channel for d-bus

01:29 <daniels> Harzilein: yeah, the dbus@ mailing list is the best forum for it

01:44 ybogdano has quit [Ping timeout: 480 seconds]

02:18 damian has quit [Read error: Connection reset by peer]

02:33 damian has joined #freedesktop

02:45 anholt has quit [Ping timeout: 480 seconds]

02:48 ofourdan has quit [Ping timeout: 480 seconds]

03:51 anholt has joined #freedesktop

05:43 jarthur has quit [Quit: Textual IRC Client: www.textualapp.com]

05:55 systwi_ has joined #freedesktop

05:59 agd5f_ has joined #freedesktop

06:00 systwi has quit [Ping timeout: 480 seconds]

06:06 agd5f has quit [Ping timeout: 480 seconds]

06:14 danvet has joined #freedesktop

06:20 aswar002 has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

06:25 aswar002 has joined #freedesktop

06:32 agd5f has joined #freedesktop

06:38 agd5f_ has quit [Ping timeout: 480 seconds]

07:06 itoral has joined #freedesktop

07:31 alanc has quit [Remote host closed the connection]

07:32 alanc has joined #freedesktop

07:53 SirNeo has left #freedesktop [#freedesktop]

08:11 agd5f_ has joined #freedesktop

08:18 agd5f has quit [Ping timeout: 480 seconds]

08:33 mvlad has joined #freedesktop

08:35 phasta has joined #freedesktop

08:42 <MrCooper> bentiss: the -12 runner seems to have similar issues as the -11 one before: https://gitlab.freedesktop.org/iv-m/xserver/-/jobs/35008122

08:42 <bentiss> MrCooper:all the runners are now using podman

08:43 <MrCooper> meson tests are timing out for no apparent reason

08:43 <bentiss> MrCooper: the "glamor" part makes me think it needs to have access to the host

08:43 <bentiss> and that's bad anyway

08:43 <bentiss> so I would ask to use boot2container to start the tests in a VM

08:43 <MrCooper> not sure what you mean

08:44 <MrCooper> the glamor tests use llvmpipe

08:45 <MrCooper> I saw similar issues yesterday (in Mesa IIRC) with the -11 runner before you took it out

08:45 <bentiss> MrCooper: previously, we had full access to the host with docker as the containers are privileged. IOn the short future I want to switch to rootless podman, so we better fix our CI instead.

08:46 <bentiss> This way we could securely do releases through CI

08:46 <MrCooper> I get that, but I don't see how this issue is related

08:46 <MrCooper> it doesn't have anything to do with docker or host access

08:47 <bentiss> SocketCreateListener() failed -> probably a permission issue?

08:47 ___nick___ has joined #freedesktop

08:48 <bentiss> MrCooper: can you point at failing mesa jobs?

08:49 <MrCooper> some of the same tests passed fine in https://gitlab.freedesktop.org/iv-m/xserver/-/jobs/35007730

08:50 <MrCooper> bentiss: I don't remember I'm afraid, it was likely one mentioned here or on #dri-devel

08:50 <bentiss> MrCooper: so far, all of the mesa failing ones I have seen have been fixed by using a more recent podman version

08:51 <bentiss> so it's weird to me that only xserver is having such issues

08:51 <bentiss> gstreamer, mesa, networkmanager all are fine with podman

08:52 <MrCooper> so is xserver, in some jobs, just not always

08:53 ofourdan has joined #freedesktop

08:53 <bentiss> testing locally on rootless podman, on fedora

09:00 <bentiss> seems perfectly fine on local container...

09:01 <bentiss> but I am not running 10 containers with the same test suite at the same time

09:04 <MrCooper> it seems to be "meson dist" which fails (not always though, even on the same runner), wonder if that does something special

09:08 <bentiss> it might be interesting to test on an alpine base image, instead of debian stable. maybe we have less tuning to do there

09:09 <bentiss> cause we don't have the ability to freshly install a debian 12 system

09:10 <bentiss> and all of the other images and quite old too (beside ubuntu 22.04 but I'd rather not go there

09:16 <MrCooper> nope, not specific to meson dist: https://gitlab.freedesktop.org/daenzer/xserver/-/jobs/35011895

09:18 <MrCooper> bentiss: "RuntimeError: can't start new thread" on https://daenzer.pages.freedesktop.org/-/xserver/-/jobs/35011895/artifacts/build/meson-logs/testlog.txt

09:21 <MrCooper> looks like it's hitting some limit on the number of threads maybe

09:23 <bentiss> does the test suite in xserver handles FDO_CI_CONCURRENT?

09:23 <MrCooper> actually one issue in xserver's CI is that it uses ninja test, which ends up spawning the maximum number of test threads regardless of -j

09:23 <MrCooper> would need to use meson test directly instead

09:27 <MrCooper> still seems like the limit is too low though, if a few xserver jobs hit it

09:28 <bentiss> MrCooper: if you point at the setting, I would happily change it

09:29 <MrCooper> I would if I knew it offhand :)

09:32 <bentiss> damn, fpaste is not responding

09:33 <bentiss> https://paste.debian.net/1267768/ -> doesn't look so bad

09:56 pixelcluster has quit [Quit: Goodbye!]

09:56 <MrCooper> all I can say is it was never an issue before, and now it suddenly is

09:56 pixelcluster has joined #freedesktop

10:20 a-l-e has joined #freedesktop

10:26 <MrCooper> bentiss: https://gitlab.freedesktop.org/xorg/xserver/-/merge_requests/1041 avoids it

10:27 <MrCooper> this might affect other projects as well though

10:32 <bentiss> MrCooper: testing tright now the current pipeline on a runner that has no load whatsoever, and it's already failing :)

10:32 <daniels> MrCooper: setting LP_NUM_THREADS=1 is probably also a good idea

10:34 <MrCooper> hmm yeah, maybe it's the <number of test processes> * <number of llvmpipe threads> explosion?

10:34 <MrCooper> still not clear why it's suddenly an issue now, not before

10:37 <MrCooper> actually only (some of) the X server processes should spawn llvmpipe threads, not every test process

10:37 <bentiss> that's weird. On that runner, with htop I don ´t even see the CPU getting used when everything explodes

10:37 <bentiss> (without your patch)

10:39 <MrCooper> it feels like some kind of artificial limit

10:41 <bentiss> yeah, I just don't know where it comes from, and why it wasn't an issue before

10:43 <bentiss> and OTOH, if it forces people to have a fair usage of CI, that's not so bad :)

10:46 <MrCooper> true, but I'm worried that we can hit the limit even with reasonable utilization

10:49 <bentiss> TBH, I am starting to wonder if we should keep debian testing as a base for those runners. Something like CoreOS or Silverblue (even flatcar linux if it weren't using docker) would be easier to maintain IMO

11:03 hikiko has joined #freedesktop

11:05 vbenes1 has joined #freedesktop

11:05 vbenes has quit [Read error: Connection reset by peer]

11:16 AbleBacon has quit [Read error: Connection reset by peer]

11:31 vbenes2 has joined #freedesktop

11:31 vbenes1 has quit [Read error: Connection reset by peer]

11:32 a-l-e has quit [Quit: Leaving]

11:33 hikiko has quit []

11:38 vbenes2 has quit []

11:39 vbenes has joined #freedesktop

11:40 phasta has quit [Ping timeout: 480 seconds]

11:46 phasta has joined #freedesktop

12:53 vladh has joined #freedesktop

12:53 <vladh> I signed up for an account two hours ago but it's still "blocked", any tips?

12:54 <bentiss> vladh: you should have received a verification email

12:54 <bentiss> vladh:if not I can manually allow you in

12:54 <vladh> bentiss: I haven't :( my email is vlad@vladh.net

12:55 <bentiss> vladh: done

12:55 <vladh> bentiss: thank you very much :)

12:56 <bentiss> you're welcome (and sorry if it doesn't work quite well, it's an attempt at fighting bots)

13:18 pjakobsson has joined #freedesktop

13:22 <kbingham> Aha... so signup is limited at the moment? My colleague djrscally also just tried to signup and hasn't been able to yet.

13:22 djrscally has joined #freedesktop

13:23 <bentiss> kbingham: it's not. it's supposed to be working, with a delay. But it seems people who are using non standrad domains are having issues

13:23 <bentiss> kbingham: if you need I can also unlock the situation for that person

13:23 <kbingham> Ah. What's 'non-standard domain'? I expect he signed up with dan.scally@ideasonboard.com

13:24 <kbingham> bentiss, Yes please!

13:24 <bentiss> kbingham: approved

13:25 <kbingham> bentiss, Thankyou

13:25 <bentiss> kbingham: honestly no idea what would be non standard

13:25 vladh has left #freedesktop [Igloo IRC: https://iglooirc.com]

13:25 <kbingham> :-)

13:25 <bentiss> maybe it's time for me to check if that manual registration is still working

13:46 itoral has quit [Remote host closed the connection]

13:57 <kbingham> Is there anyway with gitlab to 'add' tags to a commit message? (I'm guessing not, so I'm wondering what equivalent options there are)

13:58 <kbingham> Trying things out djrscally sent me a merge request. Previously I would have added Reviewed-by: tags....

13:58 <kbingham> But now - it's just 'punch the merge button' ...

13:59 <bentiss> kbingham: projects like mesa and gstreamers are using bots for that

13:59 <ofourdan> for the xserver, this is still a manual process

13:59 <bentiss> the bot rewrite the commits, force pushes the branch, and wait for the CI before merging the code

13:59 <kbingham> are the bots gitlab side?

14:00 <kbingham> sounds like what I'd be after.

14:00 <bentiss> kbingham: they are hosted by us, but not part of gitlab (same infra but not pure gitlab)

14:00 <__tim> in GStreamer we just add Part-of: <merge-request-url> to the commit message, we don't add reviewed-by / approved-by etc.

14:00 <__tim> (that info is in gitlab)

14:02 <ofourdan> yeah, gnome does the same

14:04 <kbingham> Thanks - lots to experiment with then.

14:06 <DavidHeidelberg[m]> Anyone could review https://gitlab.freedesktop.org/freedesktop/ci-templates/-/merge_requests/162 it's for making s3cp more reliable, and also resistant against 50x errors

14:07 <DavidHeidelberg[m]> Need to merge it ASAP for Mesa3D; but not risking that without review from someone with deeper knowledge of ci-fairy + Python :)

14:08 <bentiss> DavidHeidelberg[m]: dumb question, have you tried it with a large data to send?

14:09 <DavidHeidelberg[m]> nah, but I can do in 20 minutes

14:09 <bentiss> I am a little bit worry about the "So if you’re making several requests to the same host, the underlying TCP connection will be reused"

14:09 <bentiss> I am not sure how this is seen on the host side

14:09 <bentiss> server

14:09 <DavidHeidelberg[m]> yes, if the connection works? anyway, let me run whole mesa against it

14:10 <bentiss> that would be nice

14:10 <bentiss> I can monitor the logs on the server side

14:17 <DavidHeidelberg[m]> how to use ci-fairy from my fork? I tried https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20788/diffs?commit_id=b014ef19ffaae0351a7e5cbc0b6ef14461b9c4e1 but it's no go

14:17 <bentiss> you need to also overwrite the ci-fairy image

14:18 <bentiss> DavidHeidelberg[m]: let me find an example for you

14:18 ds` has quit [Quit: ...]

14:18 ds` has joined #freedesktop

14:20 <bentiss> DavidHeidelberg[m]: https://gitlab.freedesktop.org/mesa/mesa/-/commit/6fd67774ef6f87e8147faf984fb5f67e2376bc73

14:21 <DavidHeidelberg[m]> bentiss: oh you saved me! thanks

14:23 <bentiss> make it harbor.freedesktop.org/cache/okias/ci-templates/ci-fairy:sha256-ef74372a6713cd93228278bacd7d6c8f09de0de0e50cc2ad8c9f61d92a45d35c in your case

14:23 <DavidHeidelberg[m]> right, the harbor usage :) thx!

14:23 <bentiss> well, you don't need to directly use harbor actually, the runners will pick it up

14:23 <bentiss> so just registry.freedesktop.org/okias/ci-templates/ci-fairy:sha256-ef74372a6713cd93228278bacd7d6c8f09de0de0e50cc2ad8c9f61d92a45d35c as taken from your container registry

14:28 <bentiss> mupuf: I have a selfish request: would you mind getting out a release of boot2container? I have added vm2c in the kernel for hid https://git.kernel.org/pub/scm/linux/kernel/git/hid/hid.git/commit/?h=for-next&id=507806e9fdf09774d390c5f22893ba4d87ce40d5 and now it fetches the latest released initramfs which is not compatible with the new b2c.run options

14:28 <mupuf> bentiss: already on it: https://gitlab.freedesktop.org/mupuf/boot2container/-/tags/v0.9.9

14:28 <bentiss> great :)

14:28 <mupuf> you may use these binaries already, if you need

14:28 <bentiss> that was also a way to tell you that I put vm2c in the kernel :)

14:28 <mupuf> gotta go

14:29 <mupuf> oh, right, wonderful!

14:29 <bentiss> k, no worries

14:29 Haaninjo has joined #freedesktop

14:29 <mupuf> congrats, and thanks!

14:29 <bentiss> next step would be to put it directly in the Makefile of selftest

14:29 <bentiss> :)

14:40 <bentiss> DavidHeidelberg[m]: looks like the server is happy with your requests, both single upload and multi

14:40 <DavidHeidelberg[m]> nice

14:42 <bentiss> well, I'm still witing for the POST at the end of the /artifacts/mesa/mesa/787944/mesa-arm64-asan.tar.zst upload

14:43 <DavidHeidelberg[m]> bentiss: that's not going to happen :( https://gitlab.freedesktop.org/mesa/mesa/-/jobs/35027800#L4200

14:44 <bentiss> \the uploaded file is still 151 MB, so why is it not complete

14:47 <bentiss> huh, the file seems fine.

14:53 <DavidHeidelberg[m]> session.post(dst, headers=headers, params={"uploadId": key}, data=complete) # this didn't worked, right?

14:54 <bentiss> oh, wait it might be my filter on the logs

14:56 <bentiss> that was it. the POST log in the server doesn't give me s3.fd.o and I was filtering on it

14:56 <bentiss> all good

14:56 <DavidHeidelberg[m]> uffff

14:57 <DavidHeidelberg[m]> I wonder, is any chance that session will decrease load on the servers? (there isn't new connection, but if it reuses existing, it could save a bit work).. just curious if it's relevant at this scale

14:57 <bentiss> not sure I'll be able to measure that

14:59 <bentiss> the thing is we have *a lot* of connections and they should use all 3 control planes, so hard to see a difference

14:59 <bentiss> it would matter only at he nginx level too, I am not sure it keeps the same connection between nginx and ceph

15:00 <bentiss> plus when you upload the file to ceph, it internally gets replicated over 3 nodes, so it might have some benefits, but I wouldn't bet anything on a reduce load of the cluster

15:02 <bentiss> DavidHeidelberg[m]: also just stating the obvious: you have to wait for https://gitlab.freedesktop.org/freedesktop/ci-templates/-/pipelines/787975 to finish before you can include the ci-templates sha in mesa

15:02 <bentiss> (the publish to quay happens when the pipeline is correct, at the very end)

15:03 <DavidHeidelberg[m]> oh, thanks for merging :)

15:09 <MrCooper> bentiss daniels: I wonder how many other projects use "ninja -j<...> test" in CI and expect it to limit the number of test processes spawned in parallel :/

15:10 <bentiss> MrCooper: actually I should try to see if podman is capable of limitting how many cpus are exposed to the container

15:10 <bentiss> I know it didn't work with docker

15:11 <MrCooper> would be nice if it was possible

15:11 <bentiss> IIRC there is gitlab-runner option that wasn't properly handled

15:14 <bentiss> well, we can limit the cpu usage, not how many are exposed

15:21 <MrCooper> taskset affects the nproc command, sadly not meson though

15:22 <bentiss> MrCooper: https://gitlab.freedesktop.org/bentiss/xserver/-/jobs/35031336 same with cpus=8

15:22 <bentiss> given that it's not using cpu, but just spawning too many tests, it doesn't work to induce that quota

15:23 <bentiss> (and the machine was loaded between 30 to 60 as some virglrenderer tests were also running)

15:24 <daniels> last time I looked at that, we'd need to make podman use a runtime like Kata which would spawn a VM that we could limit the number of CPUs on

15:29 MrCooper has quit [Quit: Leaving]

15:31 MrCooper has joined #freedesktop

15:39 MrCooper has quit [Quit: Leaving]

15:40 MrCooper has joined #freedesktop

15:47 bl4ckb0ne has quit [Remote host closed the connection]

15:47 emersion has quit [Remote host closed the connection]

15:47 bl4ckb0ne has joined #freedesktop

15:47 emersion has joined #freedesktop

15:50 ybogdano has joined #freedesktop

15:59 <MrCooper> daniels: BTW, LP_NUM_THREADS=1 still spawns one worker thread; LP_NUM_THREADS=0 was measurably lighter, so I went for that in https://gitlab.freedesktop.org/xorg/xserver/-/merge_requests/1041

16:07 * bentiss is rather confused... since https://gitlab.freedesktop.org/freedesktop/fdo-containers-usage/-/blob/master/images_2021-09-06.yaml the stats for the containers usage are definitely not working :(

16:07 <bentiss> so we were clearing all images all the time (expect those that are in the keep-list)

16:12 djrscally has quit [Ping timeout: 480 seconds]

16:31 djrscally has joined #freedesktop

16:43 eroux has quit [Ping timeout: 480 seconds]

16:48 djrscally has quit [Ping timeout: 480 seconds]

16:51 sbraz has quit [Quit: ZNC - https://znc.in]

16:52 sbraz has joined #freedesktop

16:52 eroux has joined #freedesktop

17:05 phasta has quit [Quit: Leaving]

17:17 miracolix has joined #freedesktop

17:23 jarthur has joined #freedesktop

17:44 ___nick___ has quit []

17:46 ___nick___ has joined #freedesktop

17:47 ___nick___ has quit []

17:49 ___nick___ has joined #freedesktop

18:09 Leopold has quit [Remote host closed the connection]

18:15 Leopold_ has joined #freedesktop

18:28 ybogdano has quit [Ping timeout: 480 seconds]

18:37 <eric_engestrom> any objection to me merging a bunch of timestamps in the ci to have them next time an issue happens?

18:37 <eric_engestrom> https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20792

18:42 ybogdano has joined #freedesktop

18:45 <DavidHeidelberg[m]> eric_engestrom: I object, your honor.

18:45 <DavidHeidelberg[m]> What about using timestamps in the sections? https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20272

18:47 <anholt> then you have to keep breaking down smaller and smaller sections to debug, when if you just had timestamps you could take your failing log and figure out where things stalled.

18:47 <eric_engestrom> DavidHeidelberg[m]: too coarse

18:48 <anholt> gitlab also managed to snatch failure from the jaws of victory by concluding that requests for timestamps were actually just requests for section timings.

18:48 <eric_engestrom> (also, please don't use the phrase "make X great again")

18:51 <DavidHeidelberg[m]> the quote is catchy, can't help it ;-) (for political reasons, I declare I don't care about politics)

18:56 <eric_engestrom> oh, I was about to marge it (thanks for the acks!), but I just realized I need to bump container tags; just KERNEL_ROOTFS_TAG, right?

18:56 <anholt> nothing in that MR affects containers

18:57 <anholt> the CI *.{sh,txt,yml,toml} files go through artifacts

18:57 <eric_engestrom> ah right

18:57 <eric_engestrom> thanks :)

18:58 <eric_engestrom> ok, marg'ing then

19:02 ybogdano has quit [Ping timeout: 480 seconds]

19:08 agd5f_ has quit []

19:08 agd5f has joined #freedesktop

19:27 Leopold___ has joined #freedesktop

19:28 Leopold_ has quit [Remote host closed the connection]

20:05 GNUmoon has quit [Remote host closed the connection]

20:05 GNUmoon has joined #freedesktop

20:24 Kayden has quit [Quit: to lunch and office for a bit]

20:30 ybogdano has joined #freedesktop

20:31 ybogdano has quit []

20:32 ybogdano has joined #freedesktop

20:53 ___nick___ has quit [Ping timeout: 480 seconds]

21:46 mvlad has quit [Remote host closed the connection]

22:05 ybogdano has quit [Ping timeout: 480 seconds]

22:09 danvet has quit [Ping timeout: 480 seconds]

22:17 AbleBacon has joined #freedesktop

22:36 djrscally has joined #freedesktop

22:44 ybogdano has joined #freedesktop

23:11 GNUmoon has quit [Remote host closed the connection]

23:12 GNUmoon has joined #freedesktop

23:53 Haaninjo has quit [Quit: Ex-Chat]