#freedesktop on 2023-03-14 — irc logs at oftc.irclog.whitequark.org

2022-12-21 00:45 ChanServ changed the topic of #freedesktop to: https://www.freedesktop.org infrastructure and online services || for questions about freedesktop.org projects, please see each project's contact || for discussions about specifications, please use https://gitlab.freedesktop.org/xdg or xdg@lists.freedesktop.org

00:05 busheling has left #freedesktop [Leaving]

00:28 Leopold_ has quit [Remote host closed the connection]

00:28 Guest6707 has quit [Read error: Connection reset by peer]

00:29 peelz has joined #freedesktop

00:29 peelz is now known as Guest7620

00:30 Leopold__ has joined #freedesktop

00:34 jarthur has quit [Ping timeout: 480 seconds]

01:06 Haaninjo has quit [Quit: Ex-Chat]

01:07 Haaninjo has joined #freedesktop

01:10 columbarius has joined #freedesktop

01:12 Haaninjo has quit [Quit: Ex-Chat]

01:12 co1umbarius has quit [Ping timeout: 480 seconds]

01:27 Kayden has joined #freedesktop

01:45 agd5f_ has joined #freedesktop

01:51 agd5f has quit [Ping timeout: 480 seconds]

02:01 rcampbell has quit []

02:02 BlkPoohba has joined #freedesktop

03:11 strugee has quit [Quit: ZNC - http://znc.in]

03:17 strugee has joined #freedesktop

03:19 ximion has quit [Quit: Detached from the Matrix]

04:11 K`den has joined #freedesktop

04:11 Kayden has quit [Read error: Connection reset by peer]

04:11 BlkPoohba has quit []

04:12 BlkPoohba has joined #freedesktop

04:16 K`den is now known as Kayden

04:27 BlkPoohba has quit []

04:27 BlkPoohba has joined #freedesktop

04:42 BlkPoohba has quit []

04:43 BlkPoohba has joined #freedesktop

04:57 BlkPoohba has quit []

04:58 BlkPoohba has joined #freedesktop

05:12 harishnkr has joined #freedesktop

05:13 BlkPoohba has quit []

05:13 harishnkr has quit []

05:13 BlkPoohba has joined #freedesktop

05:28 BlkPoohba has quit []

05:29 BlkPoohba has joined #freedesktop

05:43 BlkPoohba has quit []

05:44 BlkPoohba has joined #freedesktop

05:59 BlkPoohba has quit []

06:06 chip_x has quit [Ping timeout: 480 seconds]

06:09 <mupuf> bentiss: the wget command probably needs to be put in a loop

06:10 <mupuf> that being said, I fully agree with having a script like this

06:11 * mupuf just removed his, but I guess something equivalent could be reintroduced for test machines

06:15 BlkPoohba has joined #freedesktop

06:29 BlkPoohba has quit []

06:30 BlkPoohba has joined #freedesktop

06:41 robobub has joined #freedesktop

06:45 BlkPoohba has quit []

06:45 BlkPoohba has joined #freedesktop

07:00 BlkPoohba has quit []

07:01 BlkPoohba has joined #freedesktop

07:07 agd5f has joined #freedesktop

07:07 Haaninjo has joined #freedesktop

07:12 agd5f_ has quit [Ping timeout: 480 seconds]

07:15 danvet has joined #freedesktop

07:15 BlkPoohba has quit []

07:16 BlkPoohba has joined #freedesktop

07:29 <bentiss> mupuf: I need more context. I'm not following you

07:30 <mupuf> bentiss: in runner-gating.sh, `wget -q https://gitlab.freedesktop.org/freedesktop/test-ci/-/raw/main/users.txt` should be attempted a couple of times so that a short network outtage doesn't fail jobs

07:31 <mupuf> the best thing would be to cache it, and just try to update it in every job (and use the cache if it failed)

07:31 BlkPoohba has quit []

07:32 BlkPoohba has joined #freedesktop

07:32 <bentiss> mupuf: cache doesn't work because we don't have much control over the container that is spawned at this time

07:32 <bentiss> IIRC

07:32 <bentiss> agree on the loop

07:32 <bentiss> I think today I'll set this one in place

07:32 <mupuf> thanks a lot!

07:33 <mupuf> We don't deserve you...

07:33 <bentiss> daniels: I will probably rename some of the repos we are using: helm-gitlab-omnibus -> helm-gitlab-deployment for a start

07:46 BlkPoohba has quit []

07:47 BlkPoohba has joined #freedesktop

07:47 <daniels> bentiss: sure! go ahead

07:48 <bentiss> daniels: already done :)

07:48 <bentiss> daniels: I'm also renaming the branch master into main

07:49 <daniels> yeah nice

07:49 <bentiss> because it's painful to have either master or main depending off the project

07:49 <daniels> srsly

07:49 AbleBacon has quit [Read error: Connection reset by peer]

07:53 <bentiss> daniels: I'm now splitting helm-gitlab-config in 2 (3?) -> helm-infra-deployment and fdo-bots

07:53 <bentiss> I'll try to keep the history in both

07:55 <bentiss> ideally, I might just rename helm-gitlab-config into fdo-bots and respin helm-infra-deployment from a new project. This way if others want to look for marge config, they'll get redirected

07:59 <daniels> do many people go looking for Marge config? tbh I’ve probably linked gitlab-runner provisioning the most

08:00 <daniels> turns out it wasn’t a great example tho so maybe it is best to break those links :P

08:00 <bentiss> k, 2 new projects it is

08:01 BlkPoohba has quit []

08:02 BlkPoohba has joined #freedesktop

08:17 <bentiss> daniels: https://gitlab.freedesktop.org/freedesktop/fdo-bots is up now

08:17 BlkPoohba has quit []

08:18 BlkPoohba has joined #freedesktop

08:19 mvlad has joined #freedesktop

08:21 MajorBiscuit has joined #freedesktop

08:29 agd5f_ has joined #freedesktop

08:29 <daniels> awesome

08:30 <bentiss> and https://gitlab.freedesktop.org/freedesktop/helm-gitlab-infra is renamed from -config, and pruned from marge-bots

08:32 BlkPoohba has quit []

08:35 agd5f has quit [Ping timeout: 480 seconds]

08:49 BlkPoohba has joined #freedesktop

08:58 <bentiss> daniels, mupuf, anybody else: I have enabled runner-gating on ml-13. If it goes OK-ish, I'll enable it for the rest of the fleet

09:00 <bentiss> damn, it didn't triggered the pre-clone script

09:00 <bentiss> I'll check later, bbiab

09:01 <mupuf> bentiss: thanks for the notice!

09:03 BlkPoohba has quit []

09:04 Major_Biscuit has joined #freedesktop

09:05 Leopold__ has quit [Remote host closed the connection]

09:05 MajorBiscuit has quit [Ping timeout: 480 seconds]

09:09 Leopold_ has joined #freedesktop

09:19 BlkPoohba has joined #freedesktop

09:23 <bentiss> ouch, right now the script doesn't work if GIT_STRATEGY: none

09:24 <bentiss> because it's a "pre_get_sources_script", not "pre_anything" :(

09:31 <sergi> mupuf, bentiss: about the wget and the retries. We often replace wget by curl with parameters to retry like `alias curl="curl -L --retry 4 -f --retry-all-errors --retry-delay 60"` in https://gitlab.freedesktop.org/virgl/virglrenderer/-/blob/master/.gitlab-ci/container/debian/x86_test.sh#L5

09:31 <bentiss> sergi: when we are at this time, we are in a container image that doesn't have curl

09:33 <sergi> :(

09:33 <bentiss> yep :)

09:34 <mupuf> sergi: BTW, I removed the restriction for running jobs in the valve farms

09:34 <mupuf> I need to remove the restriction in Mesa

09:34 BlkPoohba has quit []

09:34 <mupuf> Will add it as part of https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21872

09:35 BlkPoohba has joined #freedesktop

09:36 <bentiss> haha! there is a `pre_build_script` I can use instead

09:37 <bentiss> though we might be in the container, so now wget and bash are probably not guaranteed

09:37 <sergi> mupuf, has this any relation with https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21193 about ^zink-radv-.*-valve jobs? ci-uprev is not yet triggering those jobs because they aren't created when it test a piglit uprev

09:38 <mupuf> bentiss: yep, that's what I have been using

09:39 <mupuf> I store the script outside of the container: https://gitlab.freedesktop.org/mupuf/valve-infra/-/blob/master/executor/server/src/valve_gfx_ci/executor/server/templates/gitlab_runner_config.toml.j2#L25

09:41 <mupuf> sergi: yes :)

09:41 <mupuf> sergi: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21872/diffs?commit_id=9a28ec71ae9c34c67bdb8746ce621c57f8b7928e <-- this is the commit that will solve your problem

09:43 <mupuf> sergi: review appreciated ;)

09:43 <mupuf> I would like to merge this series ASAP

09:46 <bentiss> ml-13 pre-build-script disabled for now, we need a curl/wget backup or some images will just fail

09:47 <mupuf> bentiss: so... we need a statically-compiled script?

09:47 <mupuf> maybe something written in go or rust?

09:47 <mupuf> this way we could inject that in any container, without having to care about dependencies?

09:47 <bentiss> mupuf: oh, you mean mount it as a voilume?

09:47 <mupuf> yes! That's what I do :)

09:48 * bentiss needs to think of it then

09:48 <mupuf> except it is still a shell script for me (but it has no dependencies)

09:49 <bentiss> I like the idea (using a mount volume), but I need to think at how I can automatically update it on the runner

09:49 BlkPoohba has quit []

09:50 BlkPoohba has joined #freedesktop

09:50 <sergi> mupuf, but sorry, is this restricting to have those jobs only in "mesa/mesa"? The uprevs are prepared in a branch in a fork. Before proposing an uprev by creating a merge request, it is testing the uprev and amending it if there are expectation changes. This way this will not be allowed. I'm thinking in change the flow in ci-uprev to adapt to that

09:51 <mupuf> sergi: that's how it *was*. I got rid of limitation, for you

09:51 <mupuf> (and other developers who were hellaconfused)

09:52 <mupuf> I will likely reintroduce it when bentiss and I figure out something that works for everyone

09:52 <sergi> mupuf agh sorry. I misinterpreted merge request. My mind read like those lines stay there, when in fact they are removed. Sorry

09:52 <mupuf> the current way I was doing it was a allowlist in every farm

09:53 <mupuf> ;)

09:53 <mupuf> be back in a couple of hours

10:00 <bentiss> mupuf: silly idea: I mount the host curl command in the container to a known path :)

10:10 Leopold___ has joined #freedesktop

10:17 Leopold_ has quit [Ping timeout: 480 seconds]

10:22 <bentiss> damn, I need a statically compiled curl :(

10:24 <__tim> There must be a little rust tool that can do the job somewhere surely

10:24 MajorBiscuit has joined #freedesktop

10:26 <bentiss> https://github.com/moparisthebest/static-curl seems to be working, and we can probably just rebuild curl from our .gitlab-ci

10:27 <bentiss> MIT :)

10:32 Major_Biscuit has quit [Ping timeout: 480 seconds]

10:32 Haaninjo has quit [Quit: Ex-Chat]

10:49 BlkPoohba has quit []

11:12 <mupuf> bentiss: yeah, this is why I was proposing rust or go

11:13 <bentiss> mupuf: this works: https://gitlab.freedesktop.org/freedesktop/helm-gitlab-infra/-/commit/956d28828745aa88975c6a467b3ede3d8f594e9f

11:32 <mupuf> Moparisthebest :D

11:43 mohamexiety has joined #freedesktop

11:49 <mupuf> bentiss: https://gitlab.freedesktop.org/mesa/mesa/-/jobs/38036309#L23 nice :)

11:51 <mupuf> daniels, DavidHeidelberg[m]: Seems like valve infra wasn't the only one hit by the XDG_RUNTIME issue: https://gitlab.freedesktop.org/mesa/mesa/-/jobs/38036309#L813

11:55 vbenes has quit [Ping timeout: 480 seconds]

12:02 <daniels> mupuf: yeah, so guess it would be the same fix there - either tell dEQP to not bother trying to use Wayland as a platform (not sure if there's an env variable or something?), or run it under Weston

12:03 <daniels> mupuf: if you've got some time that would be magnificent; I'm not going to have time today as I've got a lot of stuff to clear out before I go on holiday Thurs-Mon inclusive

12:03 <mupuf> or set the variable and make sure the folder exists

12:03 <mupuf> enjoy your holiday!

12:03 <daniels> thanks!

12:04 * mupuf will try to clear some time for that... but he is also preparing for 3 weeks of ... parental leave

12:04 <daniels> and yeah, init-stage2 could just do the export XDG_RUNTIME_DIR="$(mktemp -d -m 0700)" or whatever

12:04 <daniels> ahhh fun

12:04 <mupuf> init-stage2 could just do the export XDG_RUNTIME_DIR="$(mktemp -d -m 0700)" or whatever --> Yeah, done that already, but this job does not use init-stage2

12:04 <mupuf> same as valve infra, it was hardcoding the deps

12:05 <daniels> oh ofc, it's swrast

12:07 vbenes has joined #freedesktop

12:09 <mupuf> ;)

12:09 <mupuf> it can still use stage2 though, no harm done

12:10 <daniels> yeah, it would definitely be nice to keep on making as much stuff as we can common and predictable through there

12:11 <daniels> I spent a bunch of time doing the initial init unification to make LAVA and bare-metal coherent a while ago, but lost momentum (and/or the will to live) after that

12:16 <mupuf> I am not surprised

12:16 <mupuf> and it is bound to diverge... unless we start from the top, right?

12:17 <mupuf> Will be fun to try running b2c on lava :p

12:20 <mupuf> we are almost done adding support for arm64 to valve infra (it works in our developer environment where the infra is simulated in a VM, and I b2c boots on a RPI 4 using EFI)

12:21 <mupuf> but it is not plug-and-play juuuust yet

12:29 vkareh has joined #freedesktop

12:51 mohamexiety has quit [Remote host closed the connection]

13:04 <daniels> nice :)

13:05 <daniels> mupuf: ooi what do you mean by 'removing the need for the gitlab runner'?

13:06 Leopold has joined #freedesktop

13:06 Leopold___ has quit [Remote host closed the connection]

13:06 <mupuf> daniels: I mean that my "executor" service would be the one talking to gitlab directly

13:07 <daniels> right, so still a gitlab runner, but just not using gitlab-runner

13:07 <mupuf> yes

13:07 <mupuf> :)

13:07 <daniels> we're doing the same for LAVA and there's something decent coded up for it (in Rust even), but it's on the backburner whilst we're sorting out a few other things like artifact storage

13:07 <mupuf> and when we are done with that, we'll add support for other forges

13:08 <mupuf> yes, you opened my eyes to that :)

13:08 <mupuf> I was expecting it to be a disaster... but it didn't look that bad

13:09 <mupuf> what I think we'll gain from that: 1. never starting 2 jobs at the same time for the same runner; 2. faster execution time; 3. making running on hardware look exactly the same as running in the cloud

13:10 <mupuf> (sure, you can set some variables to specify the kernel, add kernel command line arguments, ..., but for simple jobs, everything would work well by default)

13:10 vbenes has quit [Quit: Leaving.]

13:10 vbenes has joined #freedesktop

13:11 <daniels> yeah exactly, removing the indirection sure would be nice, and definitely gives us a lot more scope to simplify things and make the three different DUT handlers look more similar

13:11 <mupuf> For instance, I stopped using dnsmasq in the past month and vendored pypxe in the executor: I can now serve the most appropriate kernel based on the DHCP request!

13:12 <mupuf> and that also means I can automatically reboot the machine if it did not issue any DHCP request within the expected time it usually takes to boot

13:12 <mupuf> and this value can be learnt during our training phase (what happens when you enroll a new machine, or when its tags changed)

13:13 <mupuf> but yeah, removing the indirection will be a great boost to bare metal CI adoption

13:14 <mupuf> I am planning on having a workshop in the next XDC on how to deploy one. I'll come with some hardware

13:19 <daniels> arm certainly makes it easier to bring hardware at least :P

13:22 <mupuf> exactly, and I can ask people to bring their boards too!

13:24 <mupuf> curPageLogUid=6oFiTb2JuZ8k

13:24 <mupuf> Aliexpress has freaking amazing x86-based routers too: https://www.aliexpress.com/item/1005004501531656.html?spm=a2g0o.productlist.main.7.7d546a48fuHbjC&algo_pvid=b2bdb4d8-f532-4660-9799-2f0dfeae7c3c&algo_exp_id=b2bdb4d8-f532-4660-9799-2f0dfeae7c3c-3&pdp_ext_f=%7B%22sku_id%22%3A%2212000029385108658%22%7D&pdp_npi=3%40dis%21EUR%21343.31%21192.25%21%21%21%21%21%40211be3cd16788001737685750d06e6%2112000029385108658%21sea%21FI%214121609751&

13:25 <mupuf> for ~250 euros, you can get 4x2.5G ports, wifi, nvme drive, and 8 GB of RAM (when on sale)

13:25 <mupuf> and this is fully fanless and consuming ~8W at idle

13:25 <mupuf> I bought one for my home router, and one for replacing my current gateway which consumes 8 times as much

13:27 agd5f has joined #freedesktop

13:32 agd5f_ has quit [Ping timeout: 480 seconds]

13:38 <daniels> yeah, the problem with arm-based routers is that they're often cripplingly limited for I/O, which is sort of not good

13:43 <bentiss> mupuf: sigh... https://gitlab.freedesktop.org/gfx-ci/mesa-performance-tracking/-/jobs/38038833 -> quay.io/freedesktop.org/ci-templates:container-build-base-2022-02-14.0 doesn't have /etc/ssl/certs/ca-certificates.crt installed

13:43 <bentiss> mupuf: do you have a quick copy/paste I could use to export that cert in the container and use it instead of the defaults?

13:43 <mupuf> bentiss: I do not...

13:44 * mupuf has it easier... since the script is auto-generated by ansible

13:44 <bentiss> yeah, well, I have to account for *any* image :(

13:45 <mupuf> exactly

13:45 <mupuf> let's check what I ship in b2c

13:45 <bentiss> curl --cacert [file]

13:45 <bentiss> let's see if that works

13:46 <bentiss> looks like it works :)

13:46 <mupuf> yeepee!

13:46 <mupuf> you are fast!

13:46 * bentiss changes the script/commands

13:46 * mupuf was taking the cacerts from alpine on b2c creation

13:48 MrCooper has quit [Ping timeout: 480 seconds]

14:01 MrCooper has joined #freedesktop

14:01 <bentiss> daniels: I am tempted to deploy the kill switch on all runner now. I do not see what is the next hiccups we'll have so we should probably target a wider audience

14:02 <bentiss> right now, it's just giving a notification at the beginning of the job: https://gitlab.freedesktop.org/btissoir/test-ci/-/jobs/38045524

14:03 <bentiss> it would fail if sh is not available in the target container... but not sure we can do at that time given that gitlab requires the entrypoint to be sh or bash

14:05 <daniels> bentiss: oh nice, yeah I think sh is fine for all but windows obviously - is there anything in particular to know to include outside of the host environment? we could do the same for LAVA/b2c/bare-metal farms too

14:06 <bentiss> daniels: so I included in the image through a volume a static curl and the ca-certificates.crt file. That seems to be it

14:13 <daniels> yeah, makes sense - the ca-certificates provided in the default runner-helper only has the cert for the service itself

14:14 abrotman has quit [Remote host closed the connection]

14:14 <bentiss> daniels: actually we are not in pre-clone script now

14:14 abrotman has joined #freedesktop

14:15 <bentiss> we are in pre-build, which executes just before the before_script, so in the target container

14:16 <bentiss> generate-cloud-init is wrong in that regard :)

14:17 * bentiss fixes it

14:20 <daniels> ahhh, nice!

14:27 <bentiss> daniels: what's the plan with m3l-18 and kata-1?

14:29 <daniels> bentiss: I have no plan with m3l-18, I thought you were going to burn -15 so I just provisioned 16+17+18 fresh

14:29 <daniels> kata-1 I might just destroy for now; a bunch of stuff came up over the past couple of days I couldn't ignore and I'm not going to be able to get Kata done before I go on holiday

14:30 <bentiss> daniels: k. well, -13 is still alive, so maybe we want to keep it while it has some cache for jobs

14:30 <bentiss> daniels: this is what I run on all runners: https://paste.centos.org/view/245fd464

14:31 <bentiss> well, assuming the arch is amd64 and that the pre_get_sources_script was already defined

14:31 <bentiss> so I'm going to kill kata-1 and m3l-18, just to keep the runners to a reasonable number

14:32 <daniels> sounds good!

14:32 <bentiss> daniels: k, thanks

14:45 <bentiss> OK, the kill switch is installed in all runners I have access to. It's a matter of commenting https://gitlab.freedesktop.org/freedesktop/helm-gitlab-infra/-/blob/main/runner-gating/runner-gating.sh#L39 to enable it

14:46 * bentiss waits a bit in case someone screams (though you have to scream a lot to change my mind)

14:49 <__tim> and then we wait for people to comment on the ticket to add their projects?

14:49 <bentiss> either comment on the ticket or sending MR

14:50 <__tim> ah, I didn't see the groups check, sorry

14:51 <bentiss> __tim: that script is a baseline. I would be glad to add a smarter check policy, but I personnally add myself as a vip, because I trust myself to not temper the runners, but others can eeasily add more logic

14:53 agd5f_ has joined #freedesktop

14:59 agd5f has quit [Ping timeout: 480 seconds]

15:07 agd5f_ has quit []

15:07 agd5f has joined #freedesktop

15:21 agd5f_ has joined #freedesktop

15:27 agd5f has quit [Ping timeout: 480 seconds]

15:29 agd5f_ has quit [Ping timeout: 480 seconds]

15:33 agd5f has joined #freedesktop

15:37 MajorBiscuit has quit [Ping timeout: 480 seconds]

15:38 MajorBiscuit has joined #freedesktop

15:40 agd5f_ has joined #freedesktop

15:46 agd5f has quit [Ping timeout: 480 seconds]

15:46 agd5f has joined #freedesktop

15:49 agd5f_ has quit [Ping timeout: 480 seconds]

15:50 agd5f_ has joined #freedesktop

15:52 DodoGTA has quit [Quit: DodoGTA]

15:53 AbleBacon has joined #freedesktop

15:53 DodoGTA has joined #freedesktop

15:56 agd5f has quit [Ping timeout: 480 seconds]

15:57 agd5f has joined #freedesktop

15:58 agd5f_ has quit [Ping timeout: 480 seconds]

15:59 agd5f_ has joined #freedesktop

16:05 agd5f has quit [Ping timeout: 480 seconds]

16:06 agd5f has joined #freedesktop

16:07 DodoGTA has quit [Quit: DodoGTA]

16:08 DodoGTA has joined #freedesktop

16:09 <bentiss> It's been ~1h30 that the script is deployed without a failure related to it. Enforcing it.

16:09 agd5f_ has quit [Ping timeout: 480 seconds]

16:10 agd5f_ has joined #freedesktop

16:11 DodoGTA has quit [Remote host closed the connection]

16:12 <jenatali> I assume there's going to be similar enforcement applied to the Windows runners at some point?

16:12 DodoGTA has joined #freedesktop

16:14 <kusma> I got a failure here that tells me I'm missing permissions, but the text at the link it pointed to doesn't really seem to match up with what happened: https://gitlab.freedesktop.org/kusma/mesa-demos/-/jobs/38052767

16:14 agd5f has quit [Ping timeout: 480 seconds]

16:15 <kusma> The text says "What it means for me, a maintainer of a project part of gitlab.freedesktop.org? Hopefully nothing. Contributors should still be able to run CI on the MRs, and you can still have your project being tested as previously."

16:15 agd5f has joined #freedesktop

16:15 <kusma> But that's not the case. The CI runs in the users namespace, which I guess is what gives the error...

16:16 <bentiss> kusma: please see the backlog for (a lot of) context

16:17 <bentiss> I have disabled shared runners for personal namespaces, so either you migrate your upstream official projects under a group, either you add yourself as a vip in https://gitlab.freedesktop.org/freedesktop/helm-gitlab-infra/-/blob/main/runner-gating/users.txt

16:17 <kusma> This is mesa/demos

16:17 <kusma> It's under a group already.

16:18 <kusma> It's this MR: https://gitlab.freedesktop.org/mesa/demos/-/merge_requests/143

16:18 <bentiss> kusma: MR pipelines are not enabled in this project

16:19 <kusma> What do you mean?

16:19 DodoGTA has left #freedesktop [#freedesktop]

16:19 <bentiss> https://gitlab.freedesktop.org/freedesktop/freedesktop/-/issues/438#what-it-means-for-me-a-maintainer-of-a-project-part-of-gitlabfreedesktoporg -> "The only thing you have to be aware of, is that you need to have detached MR pipelines. For that, just append the following snippet to your .gitlab-ci.yaml"

16:19 <kusma> We've been doing MRs with automatic pipelines for a long time... They trigger, but in the user's namespace.

16:19 DodoGTA has joined #freedesktop

16:19 agd5f_ has quit [Ping timeout: 480 seconds]

16:20 <bentiss> yes, but not using MR pipelines. The pipelines are run in the fork, not in the group namespace

16:20 <kusma> OK, I see. Well, I don't think I have the permissions needed to add that setting.

16:20 <bentiss> Just submit an MR :)

16:20 <daniels> bentiss: I think we might want to reword the notification, because 'hacked pretty badly' is going to make people think their data/etc was compromised, as opposed to just we had to deal with a ton of pain

16:21 <bentiss> daniels: sure, please do

16:21 DodoGTA has left #freedesktop [#freedesktop]

16:21 <bentiss> daniels: in the meantime I s/We/Our runners/

16:22 <kusma> bentiss: Ah, it's *just* the YAML change?

16:22 <bentiss> kusma: yes

16:22 <kusma> OK, thanks.

16:22 <kusma> That I can do :)

16:22 <daniels> bentiss: reworded it slightly so hopefully we get less 'HACKED???' panic

16:22 DodoGTA has joined #freedesktop

16:23 <bentiss> daniels: thanks :)

16:23 agd5f_ has joined #freedesktop

16:23 <daniels> np!

16:25 <bentiss> sigh, this one should not have failed: https://gitlab.freedesktop.org/boyzhang/mesa/-/jobs/38053559

16:27 <bentiss> actually I guess this is because the person is not part of the project

16:27 <bentiss> so the pipeline needs to be manually triggered by someone from the project

16:27 <alatiera> oh it wasn't a fork pipeline btw

16:28 <bentiss> oh, the MR was created after the pipline :)

16:28 <bentiss> I see...

16:28 DodoGTA has left #freedesktop [#freedesktop]

16:29 DodoGTA has joined #freedesktop

16:29 <bentiss> I guess it's going to be a bumpy ride :(

16:29 agd5f has quit [Ping timeout: 480 seconds]

16:30 <alatiera> the main issue I see is that for parent pipelines to work the person that opened the mr needs to also have developer permissions

16:30 <alatiera> its not enough for a maintainer to trigger the CI

16:31 agd5f has joined #freedesktop

16:31 <alatiera> it just ends up creating more fork pipelines that way https://gitlab.freedesktop.org/gstreamer/gstreamer/-/merge_requests/4155/pipelines#note_1819674

16:36 agd5f_ has quit [Ping timeout: 480 seconds]

16:36 agd5f_ has joined #freedesktop

16:37 <bentiss> alatiera: this MR is not concerned by the new runner policy...

16:37 <alatiera> bentiss I mean, atm that's why your script failed

16:38 <alatiera> CI_PROJECT_ROOT_NAMESPACE in a parent pipeline is mesa group, but in a fork pipeline is the user

16:38 <alatiera> but I think there's a CI_MR variable, let me check

16:38 <bentiss> alatiera: yes, I know that. But if someone from the project runs the pipeline, we are good IIRC

16:39 <bentiss> so that means that before running the workload, for external users we will have to actually review that it's not crypto

16:39 agd5f has quit [Ping timeout: 480 seconds]

16:41 <alatiera> why would we be good if the pipeline is triggered by a maintainer

16:41 agd5f has joined #freedesktop

16:42 <bentiss> because someone actually reviews the code????

16:42 <alatiera> yes but pipeline created will still trip over the root_namespace check no?

16:43 <bentiss> AFAIU no

16:43 <bentiss> gitlab creates a different kind of pipeline

16:44 <alatiera> will play with it in a couple hours

16:44 <bentiss> https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21905/pipelines -> see: the pipeline I triggered is not tagged as "fork"

16:44 <bentiss> though I have superpowers, so maybe this is biased

16:45 <__tim> I guess 'triggered' here means 'run a new pipeline' and not 'trigger a manual pipeline/job that was created by the user'?

16:46 <bentiss> __tim: yeah. in the MR panel: switch tab to pipelines -> run pipeline

16:46 <emersion> bentiss: hm so what does that mean for personal projects?

16:46 <alatiera> yea i think that's due to your superpowers

16:46 agd5f_ has quit [Ping timeout: 480 seconds]

16:46 <emersion> i have libdisplay-info in my user namespace and i'm unsure what to do

16:47 <alatiera> cause in the gst one I linked above neither the pipeline I or marge triggered were parent-pipelines but still fork pipelines

16:47 <bentiss> emersion: either add yourself as a VIP, either request a group or move the project under a group

16:47 agd5f_ has joined #freedesktop

16:47 <emersion> how do i do the former, for instance?

16:48 <emersion> oh there is a list…

16:48 <bentiss> emersion: yes, at https://gitlab.freedesktop.org/freedesktop/helm-gitlab-infra/-/blob/main/runner-gating/users.txt

16:49 ybogdano is now known as Guest7714

16:49 * bentiss really wonders why he does this kind of stupid things just at the end of the day, and not on a brancd new day :/

16:49 ybogdano has joined #freedesktop

16:50 agd5f has quit [Ping timeout: 480 seconds]

16:50 Leopold___ has joined #freedesktop

16:51 agd5f has joined #freedesktop

16:52 Leopold has quit [Ping timeout: 480 seconds]

16:56 agd5f_ has quit [Ping timeout: 480 seconds]

16:56 agd5f_ has joined #freedesktop

16:59 agd5f has quit [Ping timeout: 480 seconds]

16:59 <bentiss> kusma: looks like it worked for mesa-demos

17:00 <kusma> bentiss: Yep, worked fine! Thanks for the help!

17:01 agd5f has joined #freedesktop

17:01 <bentiss> kusma: no worries

17:02 <alanc> so does this mean I have to add the 4 lines from the https://gitlab.freedesktop.org/freedesktop/freedesktop/-/issues/438#what-it-means-for-me-a-maintainer-of-a-project-part-of-gitlabfreedesktoporg to each of the 249 xorg/**/.gitlab-ci.yml files to have CI run on merge requests? or is there a ci-templates rev I can bump them to use to do that for me?

17:04 <bentiss> alanc: no ci-templates rev, no. we can work on one if we need. But given the state of xorg, wouldn't it be wiser to just do the lzy approach and request submitters to do that for you?

17:06 agd5f_ has quit [Ping timeout: 480 seconds]

17:06 agd5f_ has joined #freedesktop

17:07 ybogdano has quit [Ping timeout: 480 seconds]

17:08 <bentiss> alanc: to add a little bit of context, initially the plan was to give a heads up to people for a month or so. But given the kind of person we have in front of us, we couldn't afford this luxury

17:10 agd5f has quit [Ping timeout: 480 seconds]

17:10 <alanc> request submitters modify the .gitlab-ci.yml as part of their MRs?

17:11 <bentiss> yeah, or submit another one

17:11 <alanc> given the status of Xorg, I'd probably update the configs in the handful of projects that get MR's from people who aren't me, and then just put me in the exception list for the rest

17:11 agd5f has joined #freedesktop

17:11 <bentiss> alanc: works too

17:13 <alanc> and yeah, I understand why you did this and have no complaints, just making sure I understand what's needed now

17:13 <bentiss> alanc: thanks. Glad to hear that. Honestly

17:15 <alanc> heh, looking at https://gitlab.freedesktop.org/groups/xorg/-/merge_requests I have only personally submitted 1/3 of the MR's in the xorg group over the years (it's what happens when you make the same changes across 249 repos)

17:16 agd5f_ has quit [Ping timeout: 480 seconds]

17:17 <bentiss> yeah, Xorg is definitely not the cool kid anymore :)

17:17 agd5f_ has joined #freedesktop

17:17 jarthur has joined #freedesktop

17:18 <alanc> and xserver (which includes XWayland) is itself a bit over 1/3 of the MR's across all the xorg projects

17:19 <alanc> so for now, that, libX11, and a few others should be enough to cover most MR's we get

17:20 agd5f has quit [Ping timeout: 480 seconds]

17:20 agd5f has joined #freedesktop

17:22 <bentiss> that's weird, https://gitlab.freedesktop.org/gstreamer/cerbero/-/merge_requests/638/pipelines Andoni is a developer and it's running in the fork

17:22 ylatuya[m] has joined #freedesktop

17:24 <ylatuya[m]> Hi! I am trying to start a job (https://gitlab.freedesktop.org/ylatuya/cerbero/-/pipelines/829615) on a MR to cerbero, an official project (https://gitlab.freedesktop.org/gstreamer/cerbero/-/merge_requests/638). I understand this is something that should still be possible after https://gitlab.freedesktop.org/freedesktop/freedesktop/-/issues/438 right?

17:25 <bentiss> __tim: would you mind hitting the "run pipeline" button on https://gitlab.freedesktop.org/gstreamer/cerbero/-/merge_requests/638/pipelines?

17:25 <bentiss> ylatuya[m]: yes, I am currently trying to understand why this is not working

17:26 agd5f_ has quit [Ping timeout: 480 seconds]

17:26 <ylatuya[m]> Ok, thanks, no worries :) I wasn't sure about how far the restriction was going.

17:27 alyssa has joined #freedesktop

17:27 <alyssa> bentiss: Just to clarify, with the new rules, do MRs against forks of projects in official project namespaces get CI?

17:27 <alyssa> asahi/mesa, nouveau/mesa, etc

17:27 <alyssa> I think they do but I'm not super clear on the gitlab details

17:28 <bentiss> alyssa: they are supposed to have CI yes. But of course we have hiccups

17:28 <alyssa> sounds good

17:28 <alyssa> thank you for your tireless work

17:28 <alyssa> or tired work

17:28 <bentiss> tiring :)

17:28 <alyssa> i don't know how tire you have

17:28 <alyssa> tiring

17:28 <alyssa> got it

17:28 <alyssa> thank you for your tiring work

17:29 <bentiss> been a ride since last Sunday, yes

17:29 <bentiss> but daniels is also someone to thank for

17:29 <alyssa> thank you daniels too then :)

17:30 <ylatuya[m]> thank you both bentiss and daniels 👍️

17:30 <alyssa> bentiss: i kinda hate suggesting this but maybe it would make sense for starting CI to be whitelist only

17:30 agd5f_ has joined #freedesktop

17:31 <alyssa> i.e. any random can open an MR but if they haven't been permissioned yet they can't trigger a manual pipeline

17:32 <bentiss> alyssa: the problem with that list is that it's public, and some will complain

17:32 <bentiss> thus the opt-in

17:32 <alyssa> i don't really understand the problem

17:32 <bentiss> I wouldn't mind having 300 vips

17:33 <alyssa> if you want the username of everybody active on fd.o gitlab that doesn't seem hard to scrape with the apis

17:33 <bentiss> alyssa: the hook I introduced in the runners have no special privileges, they just fetch unauthenticated data to our gitlab server

17:33 <alyssa> no, I understand that, I don't understand why people would complain

17:33 <alyssa> I mean I believe oyu

17:33 <bentiss> so the list of users needs to be publicly accessible

17:33 <alyssa> *you

17:34 <bentiss> yes, but we had some

17:34 <bentiss> IIRC

17:34 <alyssa> wild

17:34 <alyssa> contributing to an upstream project inherently reveals your username (which might be a pseudonym for all we care)

17:34 <bentiss> yep, agree

17:35 <alyssa> I mean

17:36 <bentiss> ylatuya[m]: I *think* this is because the wrokflow rules are different to what I set in the issue with the announce

17:36 <alyssa> the nuclear option is to require an invite to join fd.o gitlab at all -- and if not for the bug trackers i might even be tempted -- but that seems really bad for not biting newcomers

17:36 agd5f has quit [Ping timeout: 480 seconds]

17:37 <bentiss> yes. WE already had that discussion

17:37 <bentiss> we

17:38 <alyssa> Weeeeeeee

17:38 <alyssa> (I might have even been party to that discussion. Brain is a little slow right now :p)

17:39 <bentiss> ylatuya[m]: would you mind rebasing on top of main your MR, if possible and change the .gitlab-ci.yaml with the workflow block at https://gitlab.freedesktop.org/freedesktop/freedesktop/-/issues/438#what-it-means-for-me-a-maintainer-of-a-project-part-of-gitlabfreedesktoporg

17:39 <bentiss> To see if that changes something

17:39 <ylatuya[m]> bentiss: sure, let me try

17:40 agd5f_ has quit [Ping timeout: 480 seconds]

17:40 <eric_engestrom> just catching up with #freedesktop; for the "no signups except by invite" thing gitlab has a bug (that they literally call a feature, well a *missing* feature) where it will send invites even if signups are closed, but users get an error when accepting these invites

17:41 <eric_engestrom> alyssa: ^

17:41 <alyssa> delight

17:41 <ylatuya[m]> bentiss: should I override existing ones or append the 2 new ones?

17:41 <eric_engestrom> as for what's going on, should we add this pre_clone_script hook to our runners as well?

17:42 <bentiss> ylatuya[m]: I would overwrite it entirely for starter

17:42 <bentiss> eric_engestrom: it's up to you, but this way we can change the whole config from a single push to a repo, with is kind of nice I would say

17:43 <eric_engestrom> ok

17:43 <eric_engestrom> and also, those of us who run extended pipelines on our forks to compensate for partial coverage in merge pipelines, do we need to add ourselves to that allow-list, or can we still do that?

17:43 <bentiss> I would suggest to add yourself in the list

17:43 <bentiss> anything not in a group is not permitted

17:44 <ylatuya[m]> bentiss: it failed as well -> https://gitlab.freedesktop.org/ylatuya/cerbero/-/jobs/38058121

17:44 <bentiss> or we can have a "ci-runner-permitted" group where most people add projects (after a review). The gnome folks have a "World" group for that

17:45 <alyssa> oh, also to be clear -- clicking the little gears under an open MR to an official project still triggers CI? it's not just Marge pipelines?

17:45 <alyssa> (sometimes I like to do a partial run on an MR before assigning to marge because once I put my MR in the queue I'm emotionally invested in seeing it merged :~P)

17:46 <bentiss> alyssa: if you add yourself to the allow-list, you'll be allowed to do it. If not, then not

17:46 <bentiss> well, no, l;et me correct that

17:47 <eric_engestrom> sounds like all the regular contributors need to be on that list then?

17:47 <bentiss> eric_engestrom: that would be easier, this way your don't have to deal with the pain of the MR pipelines

17:48 <bentiss> alyssa: so... on mesa, it would work: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21907/pipelines -> the MR runs in the namespace of the project

17:48 <alyssa> sounds good

17:48 <bentiss> alyssa: on gst/cerbero -> https://gitlab.freedesktop.org/gstreamer/cerbero/-/merge_requests/638 it doesn't. There is a little "fork" which blocks the script

17:49 <bentiss> why is that project different, I have strictly no idea ATM

17:49 <alyssa> sounds like you're having fun /s

17:49 <bentiss> you have no idea...

17:50 <bentiss> and I wanted to do some sports, but I have to leave in a few minutes, so painful

17:50 <mupuf> bentiss: you have my permission to go

17:50 <mupuf> ;)

17:51 <mupuf> I would kick your butt to go, but I'm too far for that

17:51 ximion has joined #freedesktop

17:51 <jenatali> It'd be nice if someone took a first stab at building a list of well-known/trusted folks for CI, rather than asking everyone to individually contribute their own names?

17:52 <daniels> alyssa: please use ci_run_n_monitor with a regex filter rather than the web trigger, so it only runs the targeted leaf jobs rather than everything

17:52 <eric_engestrom> +1, and I think this should be done before closing the gate

17:52 <eric_engestrom> (replying to jenatali)

17:53 <alyssa> daniels: I don't know what that is. Could you point me to docs?

17:53 <alyssa> I assume this lets me just run panfrost jobs?

17:53 <daniels> bentiss: can we have a ci-ok group which has all of our top-level groups added as members? so if you have access to any ‘real’ project then you have access to CI

17:53 <eric_engestrom> alyssa: I don't think we have any docs for it 🙃

17:53 <anholt> ~/src/mesa/.gitlab-ci/bin/ci_run_n_monitor.py --target <regex> to run whatever is necessary, and no more, for the job regex.

17:53 <eric_engestrom> alyssa: https://gitlab.freedesktop.org/mesa/mesa/-/blob/main/bin/ci/ci_run_n_monitor.py

17:53 <bentiss> daniels: but can we publicly request the list of members of that group?

17:54 <daniels> alyssa: what anholt said :)

17:54 <anholt> --stress is also super useful for your overnight "is my job actually stable" runs

17:54 <alyssa> sounds good, will give that a try

17:54 <eric_engestrom> daniels: I like this idea of meta group

17:55 <alyssa> that seems a lot more convenient than trying to click both arm_build and x86_build given that the UI is racey lol

17:55 <daniels> bentiss: no but you can set group membership to include another group. so we can set that group to include all mesa/wayland/gst/… members, and then is_member(mesa_user, ci_ok_group) will be true

17:55 <daniels> alyssa: yeah it’s way better, and also CLI rather than web

17:55 <alyssa> I do like that yes

17:56 <daniels> (sorry am on phone so limited throughout)

17:56 <bentiss> daniels: curl -L https://gitlab.freedesktop.org/api/v4/groups/freedesktop/members | jq -> unauthorized

17:56 <daniels> bentiss: can we give it the API token of a user with no perms to do that?

17:56 <bentiss> not sure

17:57 <bentiss> daniels: maybe it works

17:58 <bentiss> at least it works with my "work" account that is not affiliated to any group

17:58 <bentiss> should I switch the script off for the night?

17:58 <bentiss> (the european night)

17:59 <bentiss> daniels, mupuf, emersion, alanc, __tim, eric_engestrom ^^

17:59 <eric_engestrom> bentiss: yeah I think it's best to turn it off until it's no longer disruptive to most legitimate users

18:00 <bentiss> k

18:00 <bentiss> and done

18:01 <bentiss> ylatuya[m]: you should be good now, it's disabled for now :)

18:01 <bentiss> now that I wont take thunder from anyone because it is back to the previous situation, I'm off for the day

18:02 <eric_engestrom> haha

18:02 <eric_engestrom> 👋

18:02 <dcbaker> I know CI is crazy right now, but are we expecting the venus-lavapipe jobs to timeout due to not finding a runner

18:02 <ylatuya[m]> bentiss: It's working now, thanks!

18:02 <anholt> dcbaker: they're disabled in main, yes.

18:03 <eric_engestrom> and hopefully the meta-group idea is usable, and we can turn it back on soon :)

18:03 <anholt> er, I guess my MR did land, so they're back on on the general shared runner pool

18:04 <dcbaker> this is what I'm seeing on the staging/23.0 branch: https://gitlab.freedesktop.org/mesa/mesa/-/jobs/38059461

18:04 MajorBiscuit has quit [Quit: WeeChat 3.6]

18:04 <dcbaker> And sorry for adding more CI questions :/

18:05 <dcbaker> oh, I think I see

18:05 <dcbaker> your patch isn't a straight revert, I'll pull that and see if it fixes my issue

18:05 <dcbaker> anholt: thanks!

18:07 <daniels> bentiss: thanks, g’night!

18:13 <alyssa> ok, trying this newfangled script

18:14 <alyssa> "⏲ for the pipeline to appear"

18:16 <eric_engestrom> bentiss: just to save you a bit of reading, you'll want `/all` to include transitive users (ie. all of them in a meta-group), and you can filter by user_id to avoid pulling everything and then having to filter yourself; end result is:

18:16 <eric_engestrom> https://gitlab.freedesktop.org/api/v4/groups/ci-ok/members/all?user_ids=$GITLAB_USER_ID

18:16 <alyssa> let's try with --force-manual

18:16 <DavidHeidelberg[m]> alyssa: did you made a `git push`? :D

18:16 <alyssa> Yes

18:16 <eric_engestrom> alyssa: this means there's no pipeline in your fork for the commit that's currently you HEAD (or --rev)

18:16 <DavidHeidelberg[m]> waiting for pipeline usually appears when pipeline is not there or you push just changes in the commit message (not the code)

18:17 <eric_engestrom> you can open https://gitlab.freedesktop.org/alyssa/mesa/-/pipelines to verify yourself

18:17 <alyssa> in the gitlab UI it says

18:17 <alyssa> Merge request pipeline #829678 waiting for manual action for 8e6d47e7

18:17 <alyssa> which is my HEAD

18:17 <alyssa> that's a mesa/mesa pipeline not an alyssa/mesa one

18:17 <alyssa> because it's in MR context

18:17 <alyssa> I guess

18:18 <eric_engestrom> I don't think that's supported

18:18 <eric_engestrom> it always looks in your fork

18:18 <eric_engestrom> iirc

18:18 <alyssa> that sounds like a bug if the pipelines can end up in mesa/mesa?

18:19 <alyssa> oh there it goes

18:19 <alyssa> it just took a few minutes

18:19 <eric_engestrom> yeah imo it should be possible to pass --pipeline instead of --rev and letting the script guess where the pipeline is

18:19 <alyssa> Pipeline: https://gitlab.freedesktop.org/alyssa/mesa/-/pipelines/829679

18:19 <alyssa> I mean I like the script guessing as long as it guesses right ;)

18:19 <eric_engestrom> I tried to hook that up but failed

18:22 <alyssa> so if ctrl-c out of the job it'll keep going I assume?

18:28 <alyssa> hmm. I wonder why G52 in CI is slower than the G52 on my desk

18:30 rsjw has joined #freedesktop

18:31 <rsjw> can I get fork permission on gitlab? the username is the same as here

18:32 <mupuf> alyssa: I have been experiencing the same on my desktop PC vs in CI. These issues are annoying to review

18:36 <eric_engestrom> alyssa: yeah, ^C will stop the script, but doing that doesn't auto-cancel the job(s)

19:10 BlkPoohba has joined #freedesktop

19:10 alanc has quit [Remote host closed the connection]

19:10 ___nick___ has joined #freedesktop

19:10 alanc has joined #freedesktop

19:35 agd5f has joined #freedesktop

19:37 mvlad has quit [Remote host closed the connection]

19:40 ___nick___ has quit []

19:43 ___nick___ has joined #freedesktop

19:43 ___nick___ has quit []

19:46 ___nick___ has joined #freedesktop

19:55 i-garrison has quit [Read error: Connection reset by peer]

20:00 pendingchaos has quit [Ping timeout: 480 seconds]

20:03 i-garrison has joined #freedesktop

20:12 pendingchaos has joined #freedesktop

20:19 Kayden has quit [Ping timeout: 480 seconds]

20:21 Kayden has joined #freedesktop

20:29 Haaninjo has joined #freedesktop

20:37 danvet has quit [Ping timeout: 480 seconds]

20:42 <bentiss> rsjw: please open a bug at https://gitlab.freedesktop.org/freedesktop/freedesktop/issues/new?issuable_template=User%20verification by following the template. We need to get a trace on who asked what, we have been abused not later than last week

20:43 ybogdano has joined #freedesktop

20:45 ybogdano is now known as Guest7728

20:45 Guest7714 is now known as ybogdano

20:48 alyssa has left #freedesktop [#freedesktop]

20:56 vkareh has quit [Quit: WeeChat 3.6]

20:59 pixelcluster_ has joined #freedesktop

21:02 pixelcluster has quit [Ping timeout: 480 seconds]

21:06 pixelcluster_ has quit []

21:06 pixelcluster has joined #freedesktop

21:08 ___nick___ has quit [Ping timeout: 480 seconds]

21:47 Guest7728 has quit [Ping timeout: 480 seconds]

22:16 trinitronx has joined #freedesktop

22:27 trinitronx has quit [Quit: leaving]

22:28 trinitronx has joined #freedesktop

23:04 MrCooper_ has joined #freedesktop

23:09 MrCooper has quit [Ping timeout: 480 seconds]

23:21 MrCooper_ has quit [Remote host closed the connection]

23:24 <anholt> hmm, cluster of jobs that all took an hour and timed out on this runner https://gitlab.freedesktop.org/admin/runners/3195#/jobs

23:24 <anholt> (across projects)

23:31 <anholt> https://gitlab.freedesktop.org/monado/monado/-/issues/243

23:40 MrCooper has joined #freedesktop

23:58 MrCooper has quit [Remote host closed the connection]

23:58 MrCooper has joined #freedesktop