ChanServ changed the topic of #freedesktop to: https://www.freedesktop.org infrastructure and online services || for questions about freedesktop.org projects, please see each project's contact || for discussions about specifications, please use https://gitlab.freedesktop.org/xdg or xdg@lists.freedesktop.org
Seirdy_ has quit [Ping timeout: 480 seconds]
jstein has quit []
Haaninjo has quit [Quit: Ex-Chat]
ybogdano has quit [Ping timeout: 480 seconds]
Seirdy_ has joined #freedesktop
alanc has quit [Remote host closed the connection]
alanc has joined #freedesktop
ngcortes has quit [Ping timeout: 480 seconds]
ximion has quit []
eroux has joined #freedesktop
i-garrison has joined #freedesktop
GNUmoon has quit [Ping timeout: 480 seconds]
frytaped has joined #freedesktop
ngcortes has joined #freedesktop
ngcortes has quit [Remote host closed the connection]
infernix has quit [Remote host closed the connection]
___nick___ has quit []
___nick___ has joined #freedesktop
___nick___ has quit []
___nick___ has joined #freedesktop
ximion has joined #freedesktop
mvlad has joined #freedesktop
aleksander has joined #freedesktop
vbenes has quit [Remote host closed the connection]
vbenes has joined #freedesktop
infernix has joined #freedesktop
<emersion>
daniels, do you want help debugging this Go code?
kennylevinsen has joined #freedesktop
bittin has joined #freedesktop
<daniels>
emersion: if you had some time to help look into nginx+workhorse that would be good thanks - I was out for the latter half of last week, and going to be stuck catching back up with work for most of the next couple of days (great timing to have a personal thing I really can't move for most of the day today ...)
<daniels>
I can shoot you some instructions and get you sorted with access when I get back later tonight
<emersion>
yup, can do!
<daniels>
thanks!
<daniels>
really appreciate it
<kennylevinsen>
if needed I can also volunteer some time on this issue
pepp has joined #freedesktop
xingwozhonghua has quit []
bittin has quit [Ping timeout: 480 seconds]
bittin has joined #freedesktop
bittin_ has joined #freedesktop
bittin has quit [Read error: Connection reset by peer]
ximion has quit []
bittin_ has quit [Read error: Connection reset by peer]
bittin has joined #freedesktop
bittin has quit [Remote host closed the connection]
bittin has joined #freedesktop
jarthur has joined #freedesktop
bittin has quit [Remote host closed the connection]
bittin has joined #freedesktop
<bentiss>
daniels: so that's weird. I moved all gstreamer projects yesterday, and I have no 504 for them expect 6 for a non existant project "/gstreamer/desktop-file-utils.git" The weird part is why is it waiting for 1800 secs to answer a 404...
bittin has quit [Read error: No route to host]
bittin has joined #freedesktop
ximion has joined #freedesktop
MajorBiscuit has quit [Quit: WeeChat 3.3]
<bentiss>
FWIW, I am moving 10 more namesapaces from gitaly-3 to gitaly-1: pulseaudio, pipewire, libinput, mobile-broadband, libfprint, polkit, cairo, dbus, geoclue, upower. So far, gitaly-1 is holding, so if we start seeing failures in gstreamer, NetworkManager or any of these projects then there is one culprit among those 10 namespaces
<__tim>
fontconfig would be useful too fwiw, in case there isn't a fixed order (as our ci pulls it) :)
GNUmoon has quit [Ping timeout: 480 seconds]
GNUmoon has joined #freedesktop
GNUmoon has quit [Remote host closed the connection]
GNUmoon has joined #freedesktop
<nirbheek>
also libnice :)
Seirdy_ has quit []
bittin has quit [Read error: Connection reset by peer]
bittin has joined #freedesktop
ybogdano has joined #freedesktop
bittin has quit [Read error: No route to host]
bittin has joined #freedesktop
bittin has quit [Read error: No route to host]
<ndufresne>
not sure what this is about, getting very slow repo update, and 504 sometimes on fontconfig
<ndufresne>
I guess related to the previous discussion ;-D
<bentiss>
alright, it seems to hold for now. I am adding fontconfig and libnice to the gitaly pod that doesn't seem affected by the bug
<bentiss>
ndufresne: yeah, I think we have a blackship (or maybe more than one) in the infra and it completely messes up the gitaly servers
jstein has joined #freedesktop
<bentiss>
black sheep
<ndufresne>
We should bleach these sheep
<bentiss>
on that note, I should mention that our last successful backup was 13 days ago... this ins not good :(
<bentiss>
especially given that I think we prune backups after 7 days
<__tim>
and freetype too please :)
<__tim>
user forks are in their separate namespaces I presume?
<bentiss>
nope, when I move a namesapce, to ensure git dedup I have to take all the forks too
<bentiss>
FWIW, only freetype/freetype-demos is on gitaly-3, so not sure it'll help
<bentiss>
daniels: the backup was failing because goldsteal/xkeyboard-config was referring to a non existant pool. I manually copied the pool from gitaly-3 to gitaly-1 with the expected name and now git seems happier
<bentiss>
I wonder if we have other issues like that
<daniels>
bentiss: *blink*
<daniels>
that was one of the ones which was failing to migrate when I did gitaly-3 in the first place
<daniels>
so I wonder if it did actually complete and then that just never got recorded?
jstein has quit []
<bentiss>
daniels: well, that one was pretty busted, the db thinks it doesn't have a pool when the alternate file clearly mentioned one
jstein has joined #freedesktop
<daniels>
urgh
<bentiss>
anyway: for the analysis if gitaly-1 is happy, I use elastic and look for 504 errors, they are showing up there, so if 'GET /gstreamer' is showing up, that means gitaly-1 is now busted
Haaninjo has joined #freedesktop
___nick___ has quit [Ping timeout: 480 seconds]
GNUmoon has quit [Ping timeout: 480 seconds]
jstein has quit []
Seirdy has joined #freedesktop
mvlad has quit [Remote host closed the connection]
ngcortes has joined #freedesktop
<anholt>
oof. didn't even get the pipeline started in mesa marge's 1 hour timeout.
<daniels>
so part of it is that both NM and GSt have some exceptionally long-running integration test jobs running
<daniels>
and then most of the rest of the capacity went to failing to clone libqmi
<anholt>
some jobs at the time were backed up on git fetches, but some of it was just it looks like we only have 3 runners these days?
<daniels>
bentiss: do you still have the toolbox open & able to pull libqmi?
<daniels>
*over to gitaly-1
<daniels>
anholt: yeah, we can bring back the fourth if shared capacity is a consistent thing
<anholt>
it's certainly been an issue for the last week-ish, but that's mixed up with the git fail
<anholt>
though llvmpipe/virgl jobs have been the long pole in mesa ci for a while I think. more jobs than we have runners, even if a job should be <10 min.
<emersion>
daniels: thanks! will look at all that tomorrow
<daniels>
emersion: np, thanks to you :)
<daniels>
anholt: yeah, probably time to combine some of them?
<bentiss>
daniels: will try
<anholt>
I guess we could combine to reduce the overheads, but really we have more work we want to get done on testing sw rasterization than we have time to complete it
<anholt>
would sure be cool if we could get a relevant company to just sponsor some big dedicated Mesa runners. (sigh)
<bentiss>
daniels: libqmi is already on gitaly-2
<daniels>
bentiss: urgh, I just saw a bunch of jobs stuck on cloning that :(
<bentiss>
there are 6 504 errors on libqmi since the move
<daniels>
anholt: yeah ok, if it's a problem then we can ask Equinix if we can take more capacity; if they say no then Collabora can foot that bill
danvet has quit [Ping timeout: 480 seconds]
<bentiss>
anholt: FWIW, the fact that we are seeing timeouts right now on many projects and that this timeout is 30 min is heavily impacting the runners availability :/
<bentiss>
we are also still one runner down that we use for MinIO caching of the repos