ximion1 has quit [Remote host closed the connection]
ximion1 has joined #freedesktop
jcline has quit [Quit: Bye.]
shbrngdo has quit [Remote host closed the connection]
shbrngdo has joined #freedesktop
tanty has quit [Ping timeout: 480 seconds]
tanty has joined #freedesktop
MrCooper has quit [Read error: Connection reset by peer]
MrCooper has joined #freedesktop
<bentiss> alright, mc mirror --watch is taking way too much memory, and if I don't stop it now, we are going to lose an other node soon
<bentiss> I've been running it for the past ~24 h, so I'm going to switch the config now, and hopefully people will not notice
<bentiss> and that should hopefully solve the 500s when uploading artifacts
ximion1 has quit []
<bentiss> ok, new config applied. I'll sync the remaining logs tomorrow, but I can target them more easily now given that I don't have to run the mirror on the whole instance
* bentiss goes back to bed
agd5f_ has joined #freedesktop
agd5f has quit [Read error: Connection reset by peer]
agd5f_ has quit [Read error: Connection reset by peer]
agd5f_ has joined #freedesktop
chomwitt has joined #freedesktop
i-garrison has quit []
jarthur has quit [Ping timeout: 480 seconds]
jarthur has joined #freedesktop
jarthur has quit [Quit: Textual IRC Client: www.textualapp.com]
chomwitt has quit [Ping timeout: 480 seconds]
danvet has joined #freedesktop
chomwitt has joined #freedesktop
sunarch has joined #freedesktop
<daniels> bentiss: ooof
chomwitt has quit [Remote host closed the connection]
<bentiss> daniels: BTW, there is a security update pending, but I'd like to have a full backup done first
<bentiss> I enabled the pages bucket to be backed up yesterday, but it keeps failing to work. I think I manually managed to work around the errors, but I'd rather not touch the deployment now
<bentiss> daniels: in case you wonder too: minio-artifacts is now gone, so is fdo-k3s-large-4, in 2 hours I should have locally copied te 6 backup files, and we can killminio-backup too
<bentiss> and FWIW, not a single 500 since the switch to ceph
pendingchaos has quit [Read error: Connection reset by peer]
pendingchaos has joined #freedesktop
<daniels> bentiss: \o/ \o/ \o/
<daniels> and yeah, I did see the security update, but was thinking waiting might be better ...
<daniels> nothing in it looks _crushingly_ urgent, but definitely good to have
<daniels> I have some other things for the next couple of hours btw
<psychon> somehow GitLab seems to require a lot of maintenance...
<bentiss> psychon: that's a pipeline from last week, and we basically lost all of them
<bentiss> we are slowly recovering the data from 2021/02/01 to 2021/05/20, there is a hole and then we got data since 2021/05/28
<daniels> psychon: it's not GitLab itself fwiw, it's our underlying storage
<bentiss> sigh... today's backup was successful, but only 222GB instead of ~350GB -> the pod could not talk to the old cluster anymore :(
vmeson has quit [Read error: Connection reset by peer]
<daniels> task-runner on the new cluster?
pendingchaos has quit []
pendingchaos has joined #freedesktop
<bentiss> yep
<bentiss> the backups are running on the new cluster
<bentiss> daniels: my new favorit tool to use remote storage: https://rclone.org/ -> it managed to copy the backups when mc would just fail and s3cmd is soooo slow (because single threaded)
<psychon> okay, thanks for the info & no problem
<daniels> bentiss: noted!
shbrngdo has quit [Remote host closed the connection]
shbrngdo has joined #freedesktop
aleksander has quit []
vmeson has joined #freedesktop
<daniels> nice!
bl4ckb0ne has joined #freedesktop
<bl4ckb0ne> is the monado channel still on freenode?
bl4ckb0ne has quit [Remote host closed the connection]
emersion has quit [Remote host closed the connection]
<daniels> yeah for now, they're going to move it soon
<daniels> but realistically all the activity happens on Discord anyway
emersion has joined #freedesktop
bl4ckb0ne has joined #freedesktop
chomwitt has joined #freedesktop
jarthur has joined #freedesktop
chomwitt has quit [Ping timeout: 480 seconds]
<bentiss> daniels: couple of things:
<bentiss> daniels: 1. the transfer of all the artifacts and backups is now done (rclone was *way* faster)
<bentiss> daniels: 2. I understand why pages are not properly backing up: the files are uploaded from a git user with id 998, and we are running the task-runner pod as git/1000 -> the `s3cmd sync` called tries to change the owner, and it fails with persmission error
<bentiss> for 1. -> I'll clean up large-2 and large-3 tomnight
<bentiss> for 2. -> still thinking what is the best course of actions
<bentiss> maybe we should monkey patch /usr/lib/ruby/vendor_ruby/object_storage_backup.rb
ximion has joined #freedesktop
alanc has quit [Remote host closed the connection]
alanc has joined #freedesktop
chomwitt has joined #freedesktop
ximion1 has joined #freedesktop
ximion has quit [Remote host closed the connection]
chomwitt has quit [Ping timeout: 480 seconds]
chomwitt has joined #freedesktop
<bentiss> oh, well, this time the backup passed...
ngcortes has joined #freedesktop
ngcortes has quit [Remote host closed the connection]
<bentiss> daniels: large-2 and large-3 are now evicted from the cluster. We "just" need to release the servers
shbrngdo has quit [Remote host closed the connection]
shbrngdo has joined #freedesktop
danvet has quit [Ping timeout: 480 seconds]
ngcortes has joined #freedesktop
karolherbst_ is now known as karolherbst
chomwitt has quit [Ping timeout: 480 seconds]
ngcortes has quit [Remote host closed the connection]