ximion1 has quit [Remote host closed the connection]
ximion1 has joined #freedesktop
jcline has quit [Quit: Bye.]
shbrngdo has quit [Remote host closed the connection]
shbrngdo has joined #freedesktop
tanty has quit [Ping timeout: 480 seconds]
tanty has joined #freedesktop
MrCooper has quit [Read error: Connection reset by peer]
MrCooper has joined #freedesktop
<bentiss>
alright, mc mirror --watch is taking way too much memory, and if I don't stop it now, we are going to lose an other node soon
<bentiss>
I've been running it for the past ~24 h, so I'm going to switch the config now, and hopefully people will not notice
<bentiss>
and that should hopefully solve the 500s when uploading artifacts
ximion1 has quit []
<bentiss>
ok, new config applied. I'll sync the remaining logs tomorrow, but I can target them more easily now given that I don't have to run the mirror on the whole instance
* bentiss
goes back to bed
agd5f_ has joined #freedesktop
agd5f has quit [Read error: Connection reset by peer]
agd5f_ has quit [Read error: Connection reset by peer]
chomwitt has quit [Remote host closed the connection]
<bentiss>
daniels: BTW, there is a security update pending, but I'd like to have a full backup done first
<bentiss>
I enabled the pages bucket to be backed up yesterday, but it keeps failing to work. I think I manually managed to work around the errors, but I'd rather not touch the deployment now
<bentiss>
daniels: in case you wonder too: minio-artifacts is now gone, so is fdo-k3s-large-4, in 2 hours I should have locally copied te 6 backup files, and we can killminio-backup too
<bentiss>
and FWIW, not a single 500 since the switch to ceph
pendingchaos has quit [Read error: Connection reset by peer]
pendingchaos has joined #freedesktop
<daniels>
bentiss: \o/ \o/ \o/
<daniels>
and yeah, I did see the security update, but was thinking waiting might be better ...
<daniels>
nothing in it looks _crushingly_ urgent, but definitely good to have
<daniels>
I have some other things for the next couple of hours btw
<psychon>
somehow GitLab seems to require a lot of maintenance...
<bentiss>
sigh... today's backup was successful, but only 222GB instead of ~350GB -> the pod could not talk to the old cluster anymore :(
vmeson has quit [Read error: Connection reset by peer]
<daniels>
task-runner on the new cluster?
pendingchaos has quit []
pendingchaos has joined #freedesktop
<bentiss>
yep
<bentiss>
the backups are running on the new cluster
<bentiss>
daniels: my new favorit tool to use remote storage: https://rclone.org/ -> it managed to copy the backups when mc would just fail and s3cmd is soooo slow (because single threaded)
<psychon>
okay, thanks for the info & no problem
<daniels>
bentiss: noted!
shbrngdo has quit [Remote host closed the connection]
<bl4ckb0ne>
is the monado channel still on freenode?
bl4ckb0ne has quit [Remote host closed the connection]
emersion has quit [Remote host closed the connection]
<daniels>
yeah for now, they're going to move it soon
<daniels>
but realistically all the activity happens on Discord anyway
emersion has joined #freedesktop
bl4ckb0ne has joined #freedesktop
chomwitt has joined #freedesktop
jarthur has joined #freedesktop
chomwitt has quit [Ping timeout: 480 seconds]
<bentiss>
daniels: couple of things:
<bentiss>
daniels: 1. the transfer of all the artifacts and backups is now done (rclone was *way* faster)
<bentiss>
daniels: 2. I understand why pages are not properly backing up: the files are uploaded from a git user with id 998, and we are running the task-runner pod as git/1000 -> the `s3cmd sync` called tries to change the owner, and it fails with persmission error
<bentiss>
for 1. -> I'll clean up large-2 and large-3 tomnight
<bentiss>
for 2. -> still thinking what is the best course of actions
<bentiss>
maybe we should monkey patch /usr/lib/ruby/vendor_ruby/object_storage_backup.rb