ChanServ changed the topic of #freedesktop to: https://www.freedesktop.org infrastructure and online services || for questions about freedesktop.org projects, please see each project's contact || for discussions about specifications, please use https://gitlab.freedesktop.org/xdg or xdg@lists.freedesktop.org
scrumplex_ has joined #freedesktop
scrumplex has quit [Ping timeout: 480 seconds]
guludo has quit [Quit: WeeChat 4.5.1]
Kayden has joined #freedesktop
<mupuf> DragoonAethis, robclark: indeed, this is what we did before... but we could do it with kubernetes now rather than hosting services on individual servers
<mupuf> that being said, this would once again be a lot of work
<mupuf> and would require local people
eluks has quit [Remote host closed the connection]
eluks has joined #freedesktop
AbleBacon has quit [Read error: Connection reset by peer]
alan_ has joined #freedesktop
swatish2 has joined #freedesktop
alan_ has quit [Quit: Leaving]
swatish21 has joined #freedesktop
jsa1 has joined #freedesktop
swatish2 has quit [Ping timeout: 480 seconds]
sima has joined #freedesktop
swatish21 has quit [Ping timeout: 480 seconds]
<bentiss> mupuf, DragoonAethis, robclark: yeah lots of effort plus having a S3 service externally managed would save us a lot of admin time, so hosting this in a uni might not be the best choice.
<bentiss> that being said, Hertzner has a coloc option where we could fit 14 units for $111 per month + electricity
swatish2 has joined #freedesktop
ximion has quit [Remote host closed the connection]
noodlez1232 has quit [Remote host closed the connection]
sghuge has quit [Remote host closed the connection]
sghuge has joined #freedesktop
jsa1 has quit [Ping timeout: 480 seconds]
noodlez1232 has joined #freedesktop
jsa1 has joined #freedesktop
tzimmermann has joined #freedesktop
<ivyl> bentiss: BTW thanks for helping with estimating the costs and all the work you do on the infra <3
<ivyl> yeah, I'm a fan of trading some extra costs in favor of our admin time. The less we have to admin ourselves and instead use managed stuff, as long as the costs are sensible, the better :-)
<ivyl> so that's the angle I'm exploring
<mupuf> Ditto!
<mupuf> GitLab provides enough infra work, no need to add extra
jsa1 has quit [Ping timeout: 480 seconds]
swatish2 has quit [Ping timeout: 480 seconds]
jsa1 has joined #freedesktop
<bentiss> ivyl: do you mind if I tweak a little bit your numbers you sent on the email and put them in the gitlab issue? (and also thanks for that initial quote)
rvr has joined #freedesktop
<ivyl> go ahead
rvr has quit [Quit: leaving]
<bentiss> thx
rvr has joined #freedesktop
swatish2 has joined #freedesktop
tintou has joined #freedesktop
<tintou> Hi there, I'm getting remote: remote: fatal: unable to write loose object file: No space left on device when git pushing to a repository (not that everything still seems to work though)
swatish21 has joined #freedesktop
swatish2 has quit [Ping timeout: 480 seconds]
<bentiss> tintou: which repo please?
<tintou> virglrenderer
<bentiss> thx
<bentiss> we still have disk space available on the gitaly pods...
<bentiss> could be that kemper is out of disk
<bentiss> yep... kemper is out of disk, so the post-receive hook is failing
<bentiss> Mithrandir, daniels ^^
<daniels> fixed, thanks
<bentiss> thx
mvlad has joined #freedesktop
guludo has joined #freedesktop
kode548 has joined #freedesktop
kode54 has quit [Ping timeout: 480 seconds]
gkiagia has joined #freedesktop
kode548 has quit []
kode54 has joined #freedesktop
swatish21 has quit [Ping timeout: 480 seconds]
* bentiss looks at the aws costs... and just the db is more expensive than the whole hetzner package :/
<bentiss> (I know they have the free credits, but still)
* mupuf would feel better with hetzner
<mupuf> oh, and if we go with hetzner, I would vote for moving the infra out of the USA
<bentiss> yeah, last treasurer report said we started the year with $230000, so that's 4 years of Hetzner if we just burn cash without any income
<mupuf> London or Amsterdam would be great locations IMO
<pinchartl> mupuf: +1
<mupuf> B. Less political uncertainty
<mupuf> A. Most of our DUTs are at Collabora which IIRC means cambridge
<bentiss> I thought Hetzner we had the choice between Germany or Finland....
<ivyl> yep, Hetzner is either or one of those
<bentiss> well, they have USA and Singapore... but what's teh point of goign there :)
<ivyl> extra latency!
<xorgy> being near a large proportion of contributors, and the most interconnected backhaul in the world...
<mupuf> I see, then it would be germany then
<xorgy> (not Singapore lol)
<mupuf> for OVH, it could be london or AMS
<xorgy> Hetzner has for me always given superior performance to other European hosts, whether virtual and dedicated
<xorgy> OVH has been really challenging to work with, in my experience. Billing errors are a common problem I, and people I know, run into with them.
<pq> Someone repeatedly cutting those comms cables in the Baltic Sea might make people think twice of hosting things in Finland, sadly.
<mupuf> pq: hehe, yeah!
swatish2 has joined #freedesktop
<svuorela> I'm hoping that when/if fundraising is needed that we could trick downstream consumers like distributions and KDE/Gnome/... to also help at least boost & convey the messages...
<mupuf> svuorela: I would think that XDC sponsorship would be enough to cover for CI expenses
<ivyl> it might be, and we have people on the board working on securing donors that would explicitly help with the infra costs
<mupuf> <3
<svuorela> I btw showed the gitlab issue to the KDE gitlab people, and one of the comments from them was "yeah. I can point to that when people ask why I'm so hard on purging CI artifacts.."
<mupuf> hehe
<daniels> yeah, the Collabora machines are in Cambridge, so anything in western or central Europe is totally fine
<mupuf> The main valve farm is in Seattle... but I don't think it matters
<mupuf> given the current latency of gitlab, I don't think the network matters much. Hopefully improved S3 performance will more than compensate for the increased ping time
jsa1 has quit [Ping timeout: 480 seconds]
<bentiss> mupuf: my bet is currently on the db and ceph that kills our latency
<mupuf> the DB being on ceph is for sure not gonna win any race ;)
<bentiss> yeah. sorry for that bad choice
<mupuf> sorry? What else could we have done?
<mupuf> tying the DB to a single host would have made migrations even harder
<bentiss> have the db replicated at the db level, and each db talk to its own ssd (like what I think we should do if we migrate to Hetzner)
<bentiss> basically postgres already has all the redundancy in place (with a pgbouncer IIRC), so you don't need ceph to replicate the db 3 times, you just need 3 dbs
<mupuf> oh, yeah, that's better
jsa1 has joined #freedesktop
<__tim> GStreamer runners are all on Hetzner as well, but, as you say, runners are probably the least problematic piece of all of this
jsa1 has quit [Ping timeout: 480 seconds]
ximion has joined #freedesktop
ximion has quit [Remote host closed the connection]
<DragoonAethis> mupuf: I tested our smaller CBL Postgres on a beefy enough Ceph vs local NVMe, and the penalty was ~10x in throughput and ~25x latency
<DragoonAethis> This is on 100G NICs and PCIe 4.0 NVMes
<DragoonAethis> So +1 for moving to local storage, just keep in mind Hetzner's block storage is AFAIK also networked and not local - benchmark it as needed
<bentiss> DragoonAethis: thanks for the info. Honestly it can not be worse than what we have. And if we have to pay for 2 more bare metal servers at $66, that's probably something that we could afford (we can apparently put on the same network bare metal and vcpus)
<DragoonAethis> bentiss: it's not gonna be worse, just benchmark as needed :P It's also billed per hour, so quickly starting a test project and tearing it down a few hours later costs very little (holler if you want a test project with API access)
<DragoonAethis> If you're running on dedicated boxes, you can also request custom local NVMe drives to be installed (for a considerable price, but still cheaper than anything else you'll find in big clouds)
<bentiss> TBH, my problem with benchmarking the db is that I just don't know how to do it. I would be able to set up the db once/if I get Hetzner access, restore a backup onto those dbs, but bloody hell I have no ideas on how to test this without pulling the entire gitlab :)
<DragoonAethis> The way I did it was: Restore a backup onto the new server, monitor the production instance for longer queries, rerun these long queries a few times on the to warm caches and then rerun them with perf monitoring (exec time, EXPLAINs on production vs test setup) to figure out a good baseline
<bentiss> ... which is gibberish to me :)
<DragoonAethis> it doesn't get any easier when you're 20 tabs into EDB docs :)
<bentiss> (not that I don't understand all the terms, but I don't know how to apply them and probably don't want to)
<bentiss> and again, if anyone wants to help on the db side, I'll welcome any
swatish2 has quit [Ping timeout: 480 seconds]
haaninjo has joined #freedesktop
sewn has quit [Remote host closed the connection]
jsa1 has joined #freedesktop
sewn has joined #freedesktop
GNUmoon has quit [Read error: Connection reset by peer]
GNUmoon has joined #freedesktop
<xorgy> pq: I think that issue w/ Finland may be resolved relatively soon, so I wouldn't factor it in too much if it makes sense to host in Finland for cost or other reasons.
jsa1 has quit [Ping timeout: 480 seconds]
tzimmermann has quit [Quit: Leaving]
<bentiss> damnit, I mixed up PSU and OSUOSL... :)
<bentiss> brain fart... if arm donates runners, could we configure them so that they are tied to mesa (easy) and only used for marge-bot (with some rules magic). This should allow control of the egress because more of the same jobs will be running and we wouldn't have to wipe the cache all the time, plus this means more trusted runners as they are not free for all
<daniels> yeah, we can manage that
<bentiss> I mean mesa and or drm or any trusted project
<bentiss> daniels: do you agree on taking the immediate Hetzner solution for gitlab? And when we are less in emergency see the various options?
<bentiss> (I'm already looking at ways on how to deploy the db, and it seems https://cloudnative-pg.io will be the best)
<daniels> bentiss: heh, I was just trying to compose a reply
<daniels> tbh I think AWS + Fastly is better for now
<daniels> in that I do think Hetzner if we have the time and bandwidth to manage it is great, but then we need to make sure we have a steady stream of cash coming in - which is not something we've been used to
<daniels> so rather than having to run around with cap in hand, personally my preference would be to give the board a year or two to figure out how to do the whole sponsorship thing, and make sure we can actually get enough money and reproduce that annually
<bentiss> k, I'll do with whatever we are given in the end :)
<daniels> it's also not my decision - I'm just a dude on the internet with some opinions
<bentiss> same here :)
<bentiss> anyway, going afk for tonight. Have fun everybody
<daniels> me too - enjoy!
todi has quit []
todi has joined #freedesktop
<mupuf> Hmm, getting sponsorship with fastly and aws will take time, and xorg has enough cash to last us for multiple years, that's enough time to figure out what to do
<mupuf> I would maybe host some runners at AWS and everywhere we can get capacity for free, but keep the core of the infra on paid infra, for stability and reliability
<daniels> yeah, I can definitely see the argument
<mupuf> also, with infra we can rent by the hour, we can experiment more easily with new ways of doing things... unlike what happened with equinix where we had to take the infra down when doing major upgrade.
alanc has quit [Remote host closed the connection]
<DragoonAethis> Neat feature here: You can take a VM snapshot with a runner setup, pay all the time just for its storage (€0.01353/GB/month with 23% VAT) and quickly bring new runners up/down on demand from that
alanc has joined #freedesktop
Kayden has quit [Quit: -> JF]
mazze has joined #freedesktop
<mazze> Did something change in ftp.freedesktop.org? When I try to access it with FileZilla I get ECONNREFUSED error.
AbleBacon has joined #freedesktop
jsa1 has joined #freedesktop
swatish2 has joined #freedesktop
<bilboed> <xorgy> "OVH has been really challenging..." <- I'll up that. I've had (and sadly still have) servers on linode and OVH/scaleway before trying hetzner and it was day and night. People stress about the pricing difference, but I would focus on the "bang for the buck". Not only are they cheaper but they also offer *way* better cpu/iops. The only downside is... 1G only connectivity, which is a not really an issue if you switch to their (new)
<bilboed> object storage
swatish21 has joined #freedesktop
swatish2 has quit [Ping timeout: 480 seconds]
Kayden has joined #freedesktop
mazze has quit [Quit: Leaving]
sima has quit [Ping timeout: 480 seconds]
swatish21 has quit [Ping timeout: 480 seconds]
jsa1 has quit [Ping timeout: 480 seconds]
jsa1 has joined #freedesktop
ximion has joined #freedesktop
guludo has quit [Ping timeout: 480 seconds]
mvlad has quit [Remote host closed the connection]
jsa1 has quit [Ping timeout: 480 seconds]
haaninjo has quit [Quit: Ex-Chat]