#freedesktop on 2023-08-31 — irc logs at oftc.irclog.whitequark.org

2023-08-27 13:06 daniels changed the topic of #freedesktop to: GitLab is currently down for upgrade; will be a while before it's back || https://www.freedesktop.org infrastructure and online services || for questions about freedesktop.org projects, please see each project's contact || for discussions about specifications, please use https://gitlab.freedesktop.org/xdg or xdg@lists.freedesktop.org

00:12 nnm has quit []

00:12 nnm has joined #freedesktop

00:17 killpid_ has joined #freedesktop

00:30 FireBurn has quit [Quit: Konversation terminated!]

00:53 co1umbarius has joined #freedesktop

00:55 columbarius has quit [Ping timeout: 480 seconds]

01:00 DragoonAethis has quit [Quit: hej-hej!]

01:00 DragoonAethis has joined #freedesktop

01:53 PuercoPop has joined #freedesktop

01:56 killpid_ has quit [Quit: Quit.]

02:23 lsd|2 has joined #freedesktop

02:30 karolherbst_ has joined #freedesktop

02:37 karolherbst has quit [Ping timeout: 480 seconds]

03:16 PuercoPop has quit [Ping timeout: 480 seconds]

03:59 AbleBacon has quit [Read error: Connection reset by peer]

04:00 epony has quit [Remote host closed the connection]

04:00 epony has joined #freedesktop

04:07 dcunit3d_ has quit [Ping timeout: 480 seconds]

04:10 dcunit3d has joined #freedesktop

04:14 GNUmoon has quit [Remote host closed the connection]

04:14 GNUmoon has joined #freedesktop

04:40 ximion has quit [Quit: Detached from the Matrix]

05:29 bmodem has joined #freedesktop

05:35 tzimmermann has joined #freedesktop

05:43 i-garrison has quit []

05:44 i-garrison has joined #freedesktop

05:50 sima has joined #freedesktop

05:58 bmodem has quit [Quit: bmodem]

05:59 bmodem has joined #freedesktop

06:20 <alatiera> hmm, I've noticed that the build log keeps rebuilding the my image with the following in the

06:20 <alatiera> + skopeo inspect docker://registry.freedesktop.org/alatiera/gstreamer_test_test123/amd64/fedora:2023-08-29.10-f34-main

06:20 <alatiera> + jq '[.Digest, .Layers]'

06:20 <alatiera> time="2023-08-31T06:17:05Z" level=fatal msg="fetching blob: blob unknown to registry"

06:21 <alatiera> any ideas?

06:21 <alatiera> might be an one off post migration and hiccups hopefully

06:23 <alatiera> the tag does show up in the fork's registry, it's weird

06:32 <alatiera> bentiss for gst images, anything that's not gstreamer/gstreamer, gstreamer/cerbero andmaybe gstreamer/meson-ports/*, you can mass delete from the registry and user forks

06:33 <bentiss> alatiera: I am just re-fetching your fedora image...

06:34 <alatiera> bentiss oh it's fine if it was swept up by some hiccup or not, I am more worried about the template having a bug

06:34 <alatiera> like we had with the mesa rebuilds on windows recently

06:35 <bentiss> alatiera: no, it was a misconfiguration from my part where I was pointing the registry at the wrong data backend (still on google cloud)

06:35 <bentiss> so we had 24h of pushes to GCP that are "lost" and that need manual fetching

06:35 <alatiera> ah okay

06:36 <bentiss> alatiera: anyway, your image should be fixed now (hopefully)

06:36 <alatiera> bentiss awesome, thanks!

06:38 <alatiera> for expires after btw, I was thinking we could probably add it on the template as is

06:39 <alatiera> and default to 'if upstrea_repo image move on, else rebuild the one in the fork registry with expires-after by default'

06:44 <alatiera> hah, so now it does find the image indeed but tries to copy it to the upstream registry 🤦

06:44 <alatiera> wait no

06:47 <alatiera> https://gitlab.freedesktop.org/freedesktop/ci-templates/-/merge_requests/178/diffs#note_2065886

07:26 Ahuj has joined #freedesktop

07:42 thaller has quit [Remote host closed the connection]

07:44 thaller has joined #freedesktop

07:49 An0num0us has joined #freedesktop

08:06 kj has joined #freedesktop

08:23 mvlad has joined #freedesktop

08:42 <hakzsam> https://gitlab.freedesktop.org/mesa/mesa/-/jobs/48285498 damn

08:45 <hakzsam> ok, bumped to 90 min

09:13 <mupuf> hakzsam: don't, Marge will fail anyway if it takes longer than 60 minutes

09:13 <alatiera> mupuf no, the timeout is configurable per instance

09:13 <mupuf> alatiera: per Marge instance?

09:13 <alatiera> yes

09:14 <mupuf> ok, but do we really want that?

09:14 <alatiera> she has a different timeout set, and it's also taking into account the whole pipeline duration

09:14 <mupuf> 90 minutes is a looooong time

09:14 <hakzsam> well, if it requires more than 60 min, it should be bumped?

09:14 <alatiera> also marge doesn't know about queued or executed distinctions

09:14 <hakzsam> otherwise, how do I create that container?

09:14 <alatiera> she just sees the total number

09:16 lsd|2 has quit []

09:19 <MrCooper> hakzsam: you reassign to Marge once the container is built

09:19 <MrCooper> building containers is an exceptional case, tuning Marge's timeout for that means potentially wasting a lot of time when something goes wrong in a pipeline

09:27 <mupuf> +1

09:28 <mupuf> but then... the question is: why do we still create rootfses?

09:28 <mupuf> can't we just extract a container, add the kernel/initrd and be done?

09:28 <mupuf> why do we duplicate all of this work?

09:29 <cwabbott> seems like something's very wrong with CI atm

09:29 <cwabbott> https://gitlab.freedesktop.org/mesa/mesa/-/pipelines/974269

09:30 <cwabbott> it's failing to download artifacts (?)

09:31 <mupuf> bentiss: ^

09:31 <mupuf> https://gitlab.freedesktop.org/mesa/mesa/-/jobs/48289250#L26

09:33 <bentiss> mupuf: yeah, s3 is still on the failing cluster, not sure I'll have time to fix it this week

09:36 <mupuf> oh, so even the hdds are dying now, or is it just the network?

09:36 <bentiss> let me check

09:38 <bentiss> where is that file supposed to be uploaded?

09:39 <bentiss> https://gitlab.freedesktop.org/mesa/mesa/-/jobs/48288303 doesn't seem to do much

09:40 <mupuf> no idea... but here this download failed: https://gitlab.freedesktop.org/mupuf/ci-triage-service/-/jobs/48283570#L28

09:40 <mupuf> if that may help

09:41 <bentiss> I am seeing a lot of timeout errors on gitlab.fd.o, but this should be unrelated to s3.fd.o

09:42 <bentiss> mupuf: and honestly, having a failing curl without the address is pointless

09:42 <mupuf> bentiss: sorry about that, the url was above: https://gitlab.freedesktop.org/api/v4/projects/5761/packages/generic/cbuild/sha256-0562a018421b401cc1eea315c4fcfd4bc56673cc2a6ce49bc0f1d30e38ef9cf3/cbuild

09:43 <bentiss> mupuf: right, so I guess you are having a timeout error

09:43 <bentiss> not sure why gitlab is crumbling under the load

09:49 <bentiss> mupuf: so we have 2 types of errors: https://gitlab.freedesktop.org/mesa/mesa/-/jobs/48289250#L26 gives a 404, so my guess is that the file was not built or sent properly to s3.fd.o

09:49 <mupuf> makes sense, sorry for the distraction :s

09:49 <bentiss> mupuf: and https://gitlab.freedesktop.org/mesa/mesa/-/jobs/48289243 -> gitlab timed out and artifacts/lava/lava-submit.sh was not pulled (should be fixed by a retry on curl)

09:50 <mupuf> DavidHeidelberg[m]: ^

09:50 <bentiss> actually no, that last one is an artifact from a previous job, so we are missing it

09:53 <bentiss> mupuf DavidHeidelberg[m]: to me the problem comes from https://gitlab.freedesktop.org/mesa/mesa/-/jobs/48288303 which doesn't seem to be doing anything

09:58 <bentiss> also I don't have the faintest idea on how ./artifacts/lava/lava-submit.sh is generated. I can only see ./.gitlab-ci/lava/lava-submit.sh and no other reference. I wonder if it does work only because we are caching the volumes and it was in the repo at some time

10:12 <DavidHeidelberg[m]> bentiss: it's not generated, it's passed fron artifacts

10:12 <bentiss> DavidHeidelberg[m]: yeah, but there is no job that creates it or place it in the artifacts

10:17 <DavidHeidelberg[m]> The rootfs job should check if the container exist,which in this case does nothing (since the container is already in place)

10:18 <bentiss> DavidHeidelberg[m]: but we get a 404 after, so it's not there, no?

10:19 <DavidHeidelberg[m]> The artifacts are prepared by `debian-testing` or any other `debian-.*` jobs

10:19 <DavidHeidelberg[m]> Btw. Afk food, I'll be back in 40 minutes, then 1 meeting and then I'll look into it :)

10:20 <bentiss> DavidHeidelberg[m]: k, no worries and enjoy!

10:22 <cwabbott> bentiss: fwiw, seems like prepare-artifacts.sh does "cp -Rp .gitlab-ci/lava artifacts/"

10:24 <cwabbott> afaict this got changed recently and it's supposed to be produced by alpine/x86_64_lava_ssh_client

10:25 <bentiss> found it (I think) -> https://gitlab.freedesktop.org/mesa/mesa/-/jobs/48288325 the step script is doing nothing because we failed at pulling the gating script, and then given that the gitlab CI steps were not even run, it is considered a passing job, and there are no artifacts

10:25 <bentiss> testing my theory by restarting this job

10:29 <bentiss> yep, works now https://gitlab.freedesktop.org/mesa/mesa/-/jobs/48292938

10:29 <bentiss> those timeouts are annoying

10:31 <cwabbott> ok, so anyway, what do I have to do now? just reassign to marge?

10:33 <bentiss> cwabbott: probably, yes. I'll need to check on why we get those timeouts, but meanwhile you should just retry

10:39 <cwabbott> bentiss: ugh, I think because no other MRs got merged it didn't try to rerun the pipeline and just reassigned to me

10:40 <DavidHeidelberg[m]> cwabbott: if nothing is merged into main meanwhile, it reuses the pipeline

10:40 <DavidHeidelberg[m]> So best way is just manually retry the jobs or/and wait until jobs which are still running finish

10:41 <cwabbott> yeah, I manually retried everything and reassigned to marge

10:42 <DavidHeidelberg[m]> (in case of serious failure of whole pipeline, the you can retry whole pipeline (top right button on pipeline view)

10:55 <emery> how often are shared-mime-info releases made?

11:36 <karolherbst_> https://gitlab.freedesktop.org/xdg/shared-mime-info/-/tags

11:36 karolherbst_ is now known as karolherbst

11:40 bmodem has quit [Ping timeout: 480 seconds]

11:52 bmodem has joined #freedesktop

12:00 bmodem has quit [Ping timeout: 480 seconds]

12:01 psukys has joined #freedesktop

12:01 vkareh has joined #freedesktop

12:07 ximion has joined #freedesktop

12:08 <emery> ok, sometimes but not more than once year

12:12 psukys has quit [Ping timeout: 480 seconds]

12:17 killpid_ has joined #freedesktop

12:17 GNUmoon has quit [Quit: Leaving]

12:20 koike has joined #freedesktop

12:20 koike is now known as Guest1341

12:21 killpid_ has quit []

12:22 killpid_ has joined #freedesktop

12:27 ximion has quit [Quit: Detached from the Matrix]

12:28 GNUmoon has joined #freedesktop

12:29 <bentiss> regarding the "too many connections" on the db, it seems we are using maybe too many webservice workers, as they account for roughly 50% of the max available connections. The rest is used by sidekiq's jobs

12:30 <bentiss> diminishing the number of webserive pods from 16 to 10, we'll see if that changes something

12:50 Guest1341 is now known as koike

12:56 koike is now known as koike-lounge

12:57 koike has joined #freedesktop

13:02 sravn has left #freedesktop [WeeChat 3.5]

13:05 <DavidHeidelberg[m]> cwabbott: good catch, the new dependency remove dependency on the debian-.* artifacts

13:05 <DavidHeidelberg[m]> so it works, because artifacts are fast, but not all the time

13:06 <cwabbott> woah, I actually said something useful!

13:06 <cwabbott> when I look at CI stuff I'm mostly flailing around

13:09 <DavidHeidelberg[m]> ... maybe

13:09 <DavidHeidelberg[m]> I'm just looking into it deeper, maybe it's ok.. but I see large area where issue can be

13:12 dcunit3d has quit [Ping timeout: 480 seconds]

13:17 dcunit3d has joined #freedesktop

13:21 <DavidHeidelberg[m]> bentiss: needs:... (full message at <https://matrix.org/_matrix/media/v3/download/matrix.org/GFKlxgQmhBmXSvnmDtypBvLl>)

13:22 <DavidHeidelberg[m]> + dependencies:

13:22 <DavidHeidelberg[m]> - debian-arm64

13:22 koike-lounge has quit []

13:23 klounge has joined #freedesktop

13:30 <DavidHeidelberg[m]> What I recall this previously happen to me when I invoked some hack triggering pipeline with ci_run_n_monitor script and then re-enabling jobs, where gitlab lose dependencies and trigger the job, even when it misses artifacts from previous stages

13:32 <DavidHeidelberg[m]> if it happen now in regular pipeline, for this job everything should be in-place, so it could be some gitlab bug or it wrongly parses needs/dependencies keywords in this case

13:40 <bentiss> DavidHeidelberg[m]: I think in that particular case it wasn ´t the needs/dependencies the problem

13:40 <bentiss> the problem was that the job that was supposed to run and produce the artifacts did not even execute itself, and was marked as passed

13:41 <bentiss> because we had a timeout error while fetching the gating script

13:45 bmodem has joined #freedesktop

13:54 nuclearcat2 has joined #freedesktop

13:59 An0num0us has quit [Ping timeout: 480 seconds]

14:04 MrCooper has quit [Remote host closed the connection]

14:04 MrCooper has joined #freedesktop

14:12 Haaninjo has joined #freedesktop

14:42 Ahuj has quit [Ping timeout: 480 seconds]

14:48 rpavlik has joined #freedesktop

14:55 AbleBacon has joined #freedesktop

15:50 bmodem has quit [Ping timeout: 480 seconds]

16:18 <DavidHeidelberg[m]> bentiss: that would make sense. Is possible to catch the failure at that point and fail?

16:20 tzimmermann has quit [Quit: Leaving]

16:27 killpid_ has quit [Ping timeout: 480 seconds]

16:38 bmodem has joined #freedesktop

16:40 An0num0us has joined #freedesktop

16:50 <bentiss> DavidHeidelberg[m]: that's the weird part. This is supposed to fail if the script fails, as if you don't have enough privileges. But this time it just went through

17:15 i509vcb has quit [Quit: Connection closed for inactivity]

17:15 bmodem has quit [Ping timeout: 480 seconds]

17:15 bmodem has joined #freedesktop

17:15 bmodem has quit [Excess Flood]

17:16 bmodem has joined #freedesktop

17:51 tnt has left #freedesktop [#freedesktop]

18:14 i509vcb has joined #freedesktop

18:17 ximion has joined #freedesktop

18:19 mvlad has quit [Remote host closed the connection]

18:42 alanc has quit [Remote host closed the connection]

18:43 alanc has joined #freedesktop

18:58 Kayden has quit [Quit: -> lunch]

19:00 bmodem has quit [Ping timeout: 480 seconds]

19:14 flom84 has joined #freedesktop

19:29 jani has quit []

19:29 jani has joined #freedesktop

19:30 ximion has quit [Quit: Detached from the Matrix]

19:31 jani has quit []

19:32 jani has joined #freedesktop

19:42 jani has quit []

19:42 jani has joined #freedesktop

19:44 jani has quit []

19:49 mattst88 has joined #freedesktop

19:50 <mattst88> could someone point me at a hopefully-simple .gitlab-ci.yml I could copy from to enable arm/aarch64 CI builds for pixman?

19:53 <mattst88> (people keep submitting arm and aarch64 fixes, but in the process break the other, and I'm getting tired of it)

19:55 Haaninjo has quit [Quit: Ex-Chat]

20:05 jani has joined #freedesktop

20:06 jani has quit []

20:06 systwi_ has joined #freedesktop

20:06 systwi_ has quit [Remote host closed the connection]

20:07 jani has joined #freedesktop

20:08 systwi_ has joined #freedesktop

20:12 systwi has quit [Ping timeout: 480 seconds]

20:12 <mattst88> last example here is probably what I want? https://freedesktop.pages.freedesktop.org/ci-templates/templates.html

20:15 * mattst88 has no idea what he's doing

20:16 <anholt_> oh, pixman doesn't have much CI does it.

20:17 <anholt_> ci-templates would be useful if you want to cache all that dnf and pip setup so you don't have so long to set up the build

20:17 vkareh has quit [Quit: WeeChat 3.6]

20:19 <anholt_> the equivalent to what you have now would be to add like "arm-build: image: fedora:28:arm64 tag: arm64" with the same "script" -- use an f28 arm docker image from docker, run it on fd.o's arm64 runners.

20:23 <mattst88> okay, thanks

20:23 <mattst88> doesn't look like that template from the docs works... https://gitlab.freedesktop.org/mattst88/pixman/-/pipelines/974824

20:24 <mattst88> > build-x86: unknown keys in `extends` (.fdo.container-build@fedora)

20:28 sima has quit [Ping timeout: 480 seconds]

20:39 <mattst88> https://gitlab.freedesktop.org/mattst88/pixman/-/pipelines/974835 failed because "config contains unknown keys: tag"

20:39 <mattst88> dunno if I'm putting that in the wrong place or what

20:40 <mattst88> what architectures does fdo have runners for?

20:41 flom84 has quit [Quit: Leaving]

20:43 Kayden has joined #freedesktop

20:43 <mattst88> looks like the amd64-build failed in https://gitlab.freedesktop.org/mattst88/pixman/-/pipelines/974837 maybe because it tried to run an amd64 container on an aarch64 system...?

20:44 <mattst88> presumably as the result of not having a tag declaration

21:04 <anholt_> mattst88: amd64-build was on equinix-m3l, which is x86

21:04 <anholt_> your arm64-build does need a tag

21:04 <anholt_> amd64's fail looks like just intermittent fdo fail

21:10 <mattst88> ah, okay

21:37 epony has quit [Remote host closed the connection]

21:38 epony has joined #freedesktop

22:15 ximion has joined #freedesktop

22:36 An0num0us has quit [Ping timeout: 480 seconds]

22:36 <DavidHeidelberg[m]> mattst88: you must `tags: - aarch64`

22:36 <DavidHeidelberg[m]> and change container tag, since otherwise it's the x86 one

22:44 <mattst88> thanks, I'll give that a try

23:10 <DavidHeidelberg[m]> mattst88: btw. for the last fail, you need to install something like`python3-pip` (on Debian is named that way, on fedora probably slightly different)

23:46 Kayden has quit [Quit: -> home]