#freedesktop on 2023-12-06 — irc logs at oftc.irclog.whitequark.org

2023-09-08 23:49 daniels changed the topic of #freedesktop to: https://www.freedesktop.org infrastructure and online services || for questions about freedesktop.org projects, please see each project's contact || for discussions about specifications, please use https://gitlab.freedesktop.org/xdg or xdg@lists.freedesktop.org

00:15 lsd|2 has quit [Remote host closed the connection]

01:04 alpernebbi has quit [Ping timeout: 480 seconds]

01:07 alpernebbi has joined #freedesktop

01:21 lsd|2 has joined #freedesktop

01:47 cisco87 has quit [Remote host closed the connection]

01:54 cisco87 has joined #freedesktop

02:15 damian has quit []

02:47 bilboed has quit [Ping timeout: 480 seconds]

02:51 bilboed has joined #freedesktop

04:02 itaipu has quit [Ping timeout: 480 seconds]

04:08 mripard has quit [Ping timeout: 480 seconds]

04:15 privacy has joined #freedesktop

04:18 alatiera has quit [Quit: Connection closed for inactivity]

04:19 mripard has joined #freedesktop

04:29 lsd|2 has quit [Quit: KVIrc 5.0.0 Aria http://www.kvirc.net/]

05:44 ximion has quit [Quit: Detached from the Matrix]

06:07 todi1 has joined #freedesktop

06:12 todi has quit [Ping timeout: 480 seconds]

06:21 nektro has quit [Remote host closed the connection]

06:21 nektro has joined #freedesktop

07:04 bmodem has joined #freedesktop

07:22 sima has joined #freedesktop

07:30 AbleBacon has quit [Read error: Connection reset by peer]

07:49 nektro has quit [Remote host closed the connection]

07:49 nektro has joined #freedesktop

07:50 mvlad has joined #freedesktop

08:26 ___nick___ has joined #freedesktop

08:42 i509vcb has quit [Quit: Connection closed for inactivity]

08:47 nektro has quit [Remote host closed the connection]

08:47 nektro has joined #freedesktop

09:11 ___nick___ has quit []

09:13 ___nick___ has joined #freedesktop

09:13 ___nick___ has quit []

09:16 ___nick___ has joined #freedesktop

09:23 blatant has joined #freedesktop

09:27 thaller is now known as Guest9395

09:27 thaller has joined #freedesktop

09:33 privacy has quit [Quit: Leaving]

09:33 Guest9395 has quit [Ping timeout: 480 seconds]

09:36 tzimmermann has joined #freedesktop

10:01 MrBonkers has quit [Quit: The Lounge - https://thelounge.chat]

10:05 MrBonkers has joined #freedesktop

11:02 ximion has joined #freedesktop

11:07 ximion has quit [Quit: Detached from the Matrix]

11:12 bmodem has quit [Ping timeout: 480 seconds]

11:30 <eric_engestrom> bentiss: is the marge configuration somewhere public?

11:31 <eric_engestrom> jenatali: another test-dozen-deqp where all the tests start going Missing: https://gitlab.freedesktop.org/mesa/mesa/-/jobs/52430475

11:32 <eric_engestrom> I killed it and retried it, hopefully it will have time to finish for that MR

12:02 <daniels> eric_engestrom: yep https://gitlab.freedesktop.org/freedesktop/fdo-bots/-/commit/c9fcb90a002f0b055fa2249a92ddd719cea24c94

12:14 <jenatali> eric_engestrom: :( unfortunately I'm still pretty sure this is a runner problem, not a dozen problem, which really means it's out of my hands to address

12:21 <eric_engestrom> daniels: thanks! `values/marge-bot/run_marge.sh` is what I was looking for

12:22 <eric_engestrom> jenatali: yeah I think you're right that it's a runner problem; I didn't realize you didn't maintain the runner though, sorry for the pings :]

12:24 bmodem has joined #freedesktop

12:24 Inline has joined #freedesktop

12:24 <daniels> eric_engestrom: alatiera (& the GSt project generally) maintain the Windows runner(s)

12:24 <daniels> currently it's singular, hence the long queue, and gst has also been a very noisy neighbour up until late, hence the massively variable runtimes

12:24 alatiera has joined #freedesktop

12:25 <eric_engestrom> ack, thanks!

12:25 <jenatali> Microsoft does provide the licenses for Windows on the runners though :)

12:25 <eric_engestrom> (I love how you summoned them into the channel :P)

12:26 <eric_engestrom> haha jenatali

12:26 <alatiera> it's magic

12:26 <jenatali> Which I mean to say, through partnership, not sales lol

12:26 <eric_engestrom> very off-topic, but I thought MS had dropped the idea of windows licenses now?

12:26 <alatiera> I saw the matrix ping and realized my irc had dced

12:27 <jenatali> Nah, it still has to be purchased once per machine, and then that machine can upgrade forever

12:27 <eric_engestrom> ack

12:27 * eric_engestrom hasn't installed windows anywhere in... oof, I'm getting old

12:28 <jenatali> Which pays my salary so I can't really complain about this business model too much

12:28 <eric_engestrom> hehe

12:32 <pinchartl> the last time I bought a machine with windows, I had to battle for a year to get reimbursed for the OS license that I ws forced to get

12:32 <pinchartl> jenatali: hopefully that didn't affect your salary :-)

12:32 <jenatali> Hah

12:33 <pinchartl> it was a loooooong time ago, before I was old and grumpy

12:33 <pinchartl> I was young and grumpy I suppose

12:35 <jenatali> If it was that long ago then it was probably before I was even here

12:36 <jenatali> Though I am coming up on 12 years here in a few months... Crazy how time flies

12:46 <karolherbst> is the label maker dead? :'(

12:46 <bentiss> karolherbst: it shouldn't

12:47 <bentiss> karolherbst: which MR?

12:47 <karolherbst> https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26541, but I already tagged it...

12:47 <karolherbst> maybe I didn't wait enough

12:47 <bentiss> indeed, crashloopback

12:47 <karolherbst> ahh

12:48 <bentiss> value: Error { kind: Io(Os { code: 24, kind: Uncategorized, message: "Too many open files" })

12:48 <bentiss> sigh...

12:49 <bentiss> ok, fixed now

12:50 <bentiss> that's the second time in ~a week that this node has that issue

12:50 <karolherbst> pain

12:51 <karolherbst> just increase the fd limits then... 🙃

12:51 <karolherbst> or is that like the hypervisor complaining?

12:51 <bentiss> there is no hypervisor here

12:52 <bentiss> I just wonder which process is consuming all of the fds

12:52 <karolherbst> I see

12:52 <emersion> the limit is per-process usually

12:53 <emersion> maybe dmesg logs that?

12:53 <bentiss> that process shouldn't use a lot, and it was actually in a boot loop, so just trying to get 1 fd

12:53 <emersion> hm, probably not

12:53 <bentiss> https://github.com/timsueberkrueb/webber/issues/62 seems to give a command to get that info

12:53 <emersion> this error usually indicates a FD leak

12:54 <emersion> inside the process

12:54 <pinchartl> I'm trying to run a job locally, with a container image from the registry. podman can run the container fine (with a naive 'podman container run -it $registry_url bash'), but the qemu I start in the container complains that it can't initialize KVM. indeed, there's no /dev/kvm in the container. I have a /dev/kvm, and qemu can use it fine on the host. what am I missing to use kvm inside the container

12:54 <pinchartl> ?

12:54 <bentiss> right now, on that runner the command I linked above gives me only 144 fds... so unlikely to be correct when journalctl -f still complains

12:55 <karolherbst> maybe some config being wrong?

12:55 <karolherbst> worst case, strace the bot and see what happens

12:55 <bentiss> karolherbst: it's not the bot, it's something else on the machine

12:55 <bentiss> the bot is just a side effect of not being able to get an fd

12:56 <karolherbst> could be some funky python bug, but yeah...

12:57 <karolherbst> bentiss: try `sysctl fs.file-nr`

12:57 <emersion> i don't think you'd see this error if another process was responsible

12:57 <bentiss> karolherbst: fs.file-nr = 1334409223372036854775807

12:57 <karolherbst> `lsof | wc -l` might also be worth a try

12:58 <karolherbst> bentiss: what...

12:58 <karolherbst> bentiss: it output three numbers, right?

12:58 <karolherbst> ohh wait...

12:58 tzimmermann has quit [Quit: Leaving]

12:58 <karolherbst> IRC being IRC I guess

12:58 <bentiss> fs.file-nr = 13344 0 9223372036854775807

12:58 <bentiss> yeah

12:58 <karolherbst> mhh

12:58 <karolherbst> check lsof then

12:58 <bentiss> on it

12:59 <bentiss> that's a lot of "lsof: no pwd entry for UID 65535"

12:59 <karolherbst> mhhh

13:00 <bentiss> well, not just 65535

13:00 <karolherbst> could be some container stuff

13:00 <bentiss> lsof -l -> 1403280

13:00 <karolherbst> that's a lot of entries

13:00 <emersion> ls -l /proc/$(pidof …)/fd

13:00 <emersion> coudl help

13:00 <karolherbst> though on my desktop I have like 721047

13:01 <karolherbst> ` ls -l /proc/*/fd | wc` :D

13:01 <karolherbst> but yeah...

13:02 <karolherbst> could help with figuring out if something uses a lot though

13:02 <bentiss> 37927 202864 1531104

13:02 blatant has quit [Quit: WeeChat 4.1.2]

13:03 <karolherbst> mhhh

13:03 <karolherbst> what's the fd limit of the system?

13:03 <bentiss> cat /proc/sys/fs/inotify/max_user_watches -> 1048576

13:03 <karolherbst> that's inotify tho

13:03 <karolherbst> there is `/proc/sys/fs/file-max` for file handles, but not sure if that's the same as fds

13:04 <bentiss> cat /proc/sys/fs/file-max

13:04 <bentiss> 9223372036854775807

13:04 <karolherbst> yeah and `ulimit` probably says "unlimited"

13:04 <bentiss> which seems a lot more :)

13:04 <bentiss> yep

13:04 <karolherbst> yeah, so it's unlikely you hit a global fd limit

13:04 <bentiss> if I were, I couldn't ssh to the box, no?

13:05 <karolherbst> yeah

13:05 <karolherbst> probably

13:05 <karolherbst> so yeah.. either a python bug or a bug in the bot is most likely here

13:07 tzimmermann has joined #freedesktop

13:07 <bentiss> well, there a re a lot of processes with a lot of opened sockets

13:08 <bentiss> not sure if they count in the inotify

13:09 <bentiss> I wonder if I should not just bump the max_user_watches

13:13 <bentiss> https://github.com/mikesart/inotify-info seems interesting :)

13:14 <karolherbst> bentiss: inotify is an API to track changes to filesystems, it's not related to any fd limits

13:15 <bentiss> so not inotify related?

13:15 <karolherbst> no

13:16 <karolherbst> max_user_watches is just the limit of how many watchers on events there can be in total

13:17 <karolherbst> however

13:17 <bentiss> k, and the tool above returns Total inotify Watches: 5559

13:17 <bentiss> Total inotify Instances: 238

13:17 <karolherbst> I don't know what python does which triggers the value: Error { kind: Io(Os { code: 24, kind: Uncategorized, message: "Too many open files" })

13:17 <bentiss> it's rust, not python

13:17 <karolherbst> ahhh

13:17 <karolherbst> then it's rust

13:17 <karolherbst> I'd check with strace and see what fails

13:17 <bentiss> well, journalctl -f also fails

13:18 <karolherbst> mhhh...

13:18 <karolherbst> there is definetly something funky going on

13:18 <bentiss> that was my point: the bot is gone, and I still have issues on the server

13:18 <karolherbst> yeah...

13:18 <karolherbst> check strace then

13:18 <karolherbst> this error can mean anything at this point

13:18 <bentiss> strace on what?

13:19 <karolherbst> whatever fails

13:19 <karolherbst> so maybe journalctl -f?

13:19 <bentiss> inotify_init1(IN_NONBLOCK|IN_CLOEXEC) = -1 EMFILE (Too many open files)

13:20 <karolherbst> uhhh... okay, so it is inotify related after all...

13:20 <bentiss> don't know why hookiedookie would use inotify though

13:20 <karolherbst> yeah.. me neither

13:20 <bentiss> it's a webserver, so a socket would do

13:20 <karolherbst> ohh

13:20 <karolherbst> maybe just listen for files changing on disk

13:21 <karolherbst> to update the RAM cache or something

13:21 <karolherbst> or regenerate files or whatever

13:22 <bentiss> my memory muscle started typing "ps -aef"... and ps -aef | grep gpg | grep defunct | wc -l

13:22 <karolherbst> mhhh

13:22 <bentiss> 6117

13:22 <karolherbst> I wonder which of the inotify limits you are hitting.. let's see...

13:22 <bentiss> I wonder why I have so many gpg defunct processes

13:22 <karolherbst> "The user limit on the total number of inotify instances has been reached. "

13:22 <karolherbst> mhhh

13:23 <karolherbst> that's EMFILE

13:23 <bentiss> yeah, that matches the numbers I gave earlier

13:23 <karolherbst> sysctl fs.inotify

13:23 <karolherbst> `fs.inotify.max_user_instances = 128` I guess?

13:23 <bentiss> oh, I know why I use inotify in hookiedookie: it looks for changes in the Settings.tmpl file to reload itself in case there is a change

13:23 <karolherbst> try bumping that

13:24 <bentiss> karolherbst: that helped for journalctl -f > no more errors

13:24 <karolherbst> cool

13:25 <bentiss> what would be a reasonable value?

13:25 <bentiss> 256? 512? 1024?

13:25 <karolherbst> maybe try 1024 and see if that's enough?

13:25 <bentiss> k, I'll bump it on all of the nodes

13:25 <karolherbst> I don't know what's reasonable here but 128 is apparently not enough :)

13:26 <bentiss> I just need to remember how to make that setting persistent

13:26 <karolherbst> /etc/sysctl/

13:26 <karolherbst> uhm..

13:26 <karolherbst> sysctl.d/

13:26 <bentiss> /etc/sysctl.conf and sysctl.d

13:26 <bentiss> yeah

13:31 <bentiss> k, bumped on every node, we'll see if that fails once again

13:31 <karolherbst> hopefully it won't :)

13:35 vkareh has joined #freedesktop

13:37 <bentiss> eric_engestrom: I don't understand your comment at https://gitlab.freedesktop.org/freedesktop/fdo-bots/-/merge_requests/15#note_2195867

13:37 <bentiss> well, I think I do

13:38 <bentiss> let me reply

13:38 bmodem has quit [Ping timeout: 480 seconds]

13:39 <DavidHeidelberg> jenatali: is this happening often or it's rare that dozen job takes ~ 45 min https://gitlab.freedesktop.org/mesa/mesa/-/jobs/52436612 ?

13:41 <jenatali> DavidHeidelberg: See discussion from 1hr ago, there's runner overload that's being addressed

13:41 <jenatali> When overload isn't happening, it's closer to 15min

13:43 <karolherbst> sadly the job continues to run if marge times out, so it would help to kill the jobs on the runner

13:44 <karolherbst> but I also kinda wished marge would do that when moving to the next MR

13:44 <karolherbst> like kill all still running jobs on the last MR if marge decides to move on

13:45 <DavidHeidelberg> kk thx

13:47 <bentiss> karolherbst: not sure if you saw, but now if you push changes to main of https://gitlab.freedesktop.org/freedesktop/marge-bot this will get automatically deployed, so it's just a MR away????

13:47 <karolherbst> I wished I had time this year for working on marge :')

13:47 <bentiss> though we probably want to keep it close to upstream

13:47 <bentiss> the marge we are using is 7 months old, not sure what happened upstream since

13:47 <karolherbst> ~~maybe once I'm on PTO~~

13:47 <karolherbst> fair...

13:47 <bentiss> PTO is for resting, not marging

13:47 <karolherbst> I know

13:48 <karolherbst> and I'm on PTO next week

13:48 <bentiss> nice

13:48 <karolherbst> sooo...

13:48 <karolherbst> not sure I find some time to work on marge. however if anybody feels like working on marge, I think implementing what I mentioned would help tremendously in situations like this

13:51 <bentiss> karolherbst: anyway, thanks heaps for the help of the max_user_thingy

13:52 <karolherbst> no problem. remember to use strace for cases like this as it is a power tool :D

13:52 <bentiss> heh, I'll do :)

13:58 blatant has joined #freedesktop

14:14 <eric_engestrom> karolherbst, bentiss: I added `--cancel-pipeline-on-timeout` to Marge: https://gitlab.com/marge-org/marge-bot/-/merge_requests/411 :)

14:15 <eric_engestrom> not tested yet though, and not really reviewed either

14:16 <karolherbst> cool!

14:17 <eric_engestrom> hopefully when I'm back in january I can test it and get upstream to merge it

14:19 <eric_engestrom> I also have a branch based on that one that adds `--job-failure={warn,abort,ignore}` so that we can either cancel the whole thing when a job fails, or at least post a message in the MR so that users can retry asap

14:20 <eric_engestrom> a future improvement will be to add `--job-failure-delay-abort N` to give some time between the job failure and the abort, eg. 10min, so that we don't waste too much resource but also give a chance to retry

14:21 <karolherbst> I think my idea was to only do that once marge actually moves to the next MR

14:21 <eric_engestrom> yeah that's what the MR I posted does

14:21 <eric_engestrom> but I was talking about other things we can also do

14:21 <karolherbst> right..

14:23 <karolherbst> yeah.. I don't really know, I think reducing the load on the CI system is more important than having jobs continue to run for a while which might pass anyway

14:24 <eric_engestrom> yeah, "reducing ci load" has been my focus for the last 2-3 months

14:25 <eric_engestrom> this morning I merge the MR that stops mesa from re-running all the test right after merging (since we just ran them to get to the point where the MR is merged)

14:25 <karolherbst> yeah.. that should help a lot

14:26 <karolherbst> it makes sense to do that if we wouldn't rebase, but as we do...

14:26 <eric_engestrom> hopefully with that 2x waste, the 50% reduction in ci load will result in noticable improvement in marge pipeline times

14:26 <karolherbst> yeah

14:26 <karolherbst> so the only jobs which run post MR are like gitlab pages stuff and such?

14:26 <eric_engestrom> exactly

14:26 <karolherbst> good

14:26 <eric_engestrom> no "and such", that's it

14:26 <eric_engestrom> just `pages`

14:26 <karolherbst> yeah.. 2x in capacity should get us going for a while :)

14:27 <eric_engestrom> well, it's only _actually_ 2x capacity increase in the case of back-to-back MRs like we've had the last couple of days

14:28 <eric_engestrom> when there's time in between MRs the difference isn't that high

14:28 <jenatali> Which have partially been my fault (kinda) :(

14:28 <karolherbst> eric_engestrom: well.. isn't that what capacity means?

14:29 <karolherbst> like how many MRs we could merge at most in a day :)

14:29 <eric_engestrom> yeah I guess

14:29 <eric_engestrom> ^^

14:29 <eric_engestrom> jenatali: not your fault since you're not the one maintaining that runner :P

14:30 privacy has joined #freedesktop

14:30 <jenatali> Yeah but I maintain the test pass and the code under test

14:41 lsd|2 has joined #freedesktop

14:53 AbleBacon has joined #freedesktop

15:17 blatant has quit [Ping timeout: 480 seconds]

15:22 bilboed has quit [Quit: The Lounge - https://thelounge.chat]

15:32 immibis_ is now known as immibis

15:32 immibis is now known as immibis_

15:33 <eric_engestrom> jenatali, alatiera: https://gitlab.freedesktop.org/mesa/mesa/-/jobs/52442924 -> the windows build job started almost half an hour into the pipeline (all the other test jobs have finished) and it hasn't started actually compiling yet, after 10+ minutes

15:33 <eric_engestrom> should we consider the windows farm offline?

15:34 <eric_engestrom> haha, it heard me I think, it just started compiling

15:34 immibis_ is now known as immibis

15:34 <alatiera> Queued: 8 minutes 16 seconds: Not that bad given that we are at half the runners atm

15:38 <eric_engestrom> that's on top of the 7+10+8 pending of the previous windows jobs in the chain :/

15:38 <eric_engestrom> (minutes)

15:39 jarthur has joined #freedesktop

15:41 <eric_engestrom> jenatali, alatiera: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26549 I know it's not ideal, but I think it's necessary :(

15:42 <eric_engestrom> I have to go for a bit, jenatali I'll let you assign it to Marge if you approve

15:44 <jenatali> I don't like it but yeah the single runner just can't handle gst and Mesa

15:44 <eric_engestrom> https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26360 test-dozen-deqp hasn't even started yet and there's 12 minutes left :(

15:45 <alatiera> @eric_engestrom go for it if you think it's necessary

15:46 <eric_engestrom> ack, I'll merge it now

15:47 <alatiera> the runner is at constant 100% cpu utilization as you'd expect

15:48 <eric_engestrom> :(

15:49 <jenatali> eric_engestrom: I would assign it, but I can't from my phone. When I click edit on the assigned to field, and the keyboard pops out, it closes the side bar

15:50 <eric_engestrom> yeah I have the same web ui bug on my phone

15:50 <eric_engestrom> I usually just post a comment with `/assign @marge-bot` instead

15:50 <eric_engestrom> but I just merged it so no need to do anything

15:56 <jenatali> :O That's a thing? That's cool

15:57 alyssa has joined #freedesktop

15:57 <alyssa> eric_engestrom: do we need to cancel pipeline of currently running MR since it won't merge now?

15:58 <alyssa> oh beat me to it

15:58 <alyssa> :p

16:00 mvlad has quit [Remote host closed the connection]

16:00 <eric_engestrom> :)

16:02 blatant has joined #freedesktop

16:18 tzimmermann has quit [Quit: Leaving]

16:29 Haaninjo has joined #freedesktop

16:52 <jenatali> alatiera: Please let me know as soon as you expect any kind of improvement, since I'd like to re-enable Windows CI as soon as it won't cause issues for folks

16:52 <jenatali> The longer it's off, the more people make changes that break my drivers or MSVC builds :)

16:53 <karolherbst> I think the plan was to disable post merge pipelines and after that I'm sure it's fine to reenable?

16:53 damian has joined #freedesktop

16:53 <karolherbst> what's blocking that anyway?

16:54 <alatiera> We have disabled post-merge in gst for a couple years now, and only do a nightly schedule just to make sure

16:54 <alatiera> and the schedule is very recent we were fine without it

16:55 <alatiera> jenatali: ack, currently wiped the runner and started again from scratch cause I couldn't make it to work, so dunno

16:55 <MrCooper> Mesa's post-merge pipeline is mostly empty now, only the pages job if needed

16:55 <jenatali> karolherbst: Windows jobs have been turned off (except the build I think?) for post-merge anyway

16:55 <karolherbst> ohh that already landed

16:56 <jenatali> The problem is the Dozen job is super CPU-intensive, and it fights with other stuff that's running on the system, and if there's only one runner it gets too busy to handle that job appropriately

16:58 <karolherbst> I see

17:06 <MrCooper> hmm, that sounds like a gitlab-runner misconfiguration? The same number of instances of the job can end up run concurrently regardless of which pipelines the job does (not) exist in

17:32 mvlad has joined #freedesktop

17:43 <alyssa> maybe dozen needs dedicated runners?

17:44 blatant has quit [Ping timeout: 480 seconds]

17:46 <eric_engestrom> oops, sorry about gitlab being a bit unresponsive, I think that's my fault for sending a bunch of requests at once from a script

17:47 <eric_engestrom> I imediately killed the script so it will come back soon

17:47 <eric_engestrom> yeah looks like it's back to normal :)

17:48 blatant has joined #freedesktop

17:50 <eric_engestrom> karolherbst:

17:50 <eric_engestrom> > I think the plan was to disable post merge pipelines and after that I'm sure it's fine to reenable?

17:50 <eric_engestrom> > what's blocking that anyway?

17:50 <eric_engestrom> if you're talking about mesa, https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26451 was merged this morning :)

17:52 ___nick___ has quit [Ping timeout: 480 seconds]

17:54 <DavidHeidelberg> eric_engestrom: are you sending more, because GL look pretty dead "D

17:54 <eric_engestrom> DavidHeidelberg: it works fine for me now

17:54 <DavidHeidelberg> lucky u

17:54 <DavidHeidelberg> ok, good recovered, but minute ago it got stuck on loading page

17:55 <eric_engestrom> it was unresponsive for 2-3 minutes but it's been fine for a while now

17:55 <eric_engestrom> well, "while" = 5+ minutes

17:56 <DavidHeidelberg> bentiss: Can we help somehow? If it would be meaningful I would provide some server or ask on conferences if any corp want to put some extra $ into FDO? :)

18:06 <bentiss> DavidHeidelberg: for the gitlab instance in itself, we would welcome any extra runner, but having more nodes for the cluster would require them to be hosted on Equinix datacenters

18:06 <bentiss> though I think if we get extra runners, we could trade some of the runners for k8s nodes

18:09 blatant has quit [Quit: WeeChat 4.1.2]

18:13 ximion has joined #freedesktop

18:34 <jenatali> alyssa: I wish I could make that happen...

18:44 <eric_engestrom> weren't we talking about all the money microsoft has earlier? :P

18:46 <jenatali> It's not strictly money, it's a skillset and time for managing a machine as well

18:48 alanc has quit [Remote host closed the connection]

18:48 alanc has joined #freedesktop

20:48 <daniels> ^

20:53 <airlied> just have 10 windows runner machines, with another machine that is cycling through and reinstalling them all from scratch :-P

20:55 <jenatali> We actually have an internal tech that sets up machines to dual-boot. You join a minimal OS environment as a runner, and one of the things you can tell it to do is to reboot and install an OS and join that as a runner

21:07 <daniels> yeah, having something like that would be great - surprisingly fd.o people are not natural Windows admins

21:08 <daniels> the last time I did it was NT4

21:09 <pinchartl> the last time I had to administer a windows machine, it required a keyboard and a mouse

21:10 <jenatali> Let me ask around a little bit and see if we can contribute more than just licenses. It'd be nice to get some proper Azure compute time on beefy machines for Windows CI

21:10 <jenatali> I wouldn't really expect an answer before the new year though, things shut down around here come December

21:21 <airlied> unless we can somehow tie it to OpenAI :-P

21:25 i509vcb has joined #freedesktop

21:31 <alyssa> jenatali: then maybe dozen fraction needs to be bumped

21:31 * alyssa shrugs

21:31 <jenatali> Wouldn't matter. That'd decrease the time, but when the machine is under heavy load, some test results start to go missing

21:32 <alyssa> maybe dozen shouldn't be ci'd upstream then yet

21:32 <alyssa> (i'm sympathetic to the challenges of running ci at mesa/mesa scale, this is a major reason why asahi ci upstream is not in the cards)

21:34 <jenatali> I dunno. Seems like even just the build jobs were having problems

21:35 alyssa has quit [Quit: alyssa]

21:46 vkareh has quit [Quit: WeeChat 4.1.1]

22:27 sima has quit [Ping timeout: 480 seconds]

22:27 Haaninjo has quit [Quit: Ex-Chat]

23:07 thelounge14738 has quit []

23:07 thelounge14738 has joined #freedesktop

23:10 privacy has quit [Quit: Leaving]

23:10 itaipu has joined #freedesktop

23:40 <jenatali> alatiera: I think I've asked this before, but what's the config of the machines that host the Windows CI? I'm going to make a run at asking for resources from our side for Mesa (at least) and would probably want something comparable

23:49 <alatiera> I will find and send you tmr the details

23:50 <jenatali> Thanks!