refi64 has quit [Remote host closed the connection]
refi64 has joined #asahi-dev
phire_ has joined #asahi-dev
phire is now known as Guest12
phire_ is now known as phire
Guest12 has quit [Ping timeout: 480 seconds]
axboe has joined #asahi-dev
<axboe>
kettenis: seriously? like 90s style?
<kettenis>
totally
<axboe>
wow
<axboe>
well I guess any kind of fs perf numbers on openbsd needs to come with that caveat ;)
<kettenis>
I don't think i've ever lost a filesystem though
<axboe>
I mean, all's well unless it crashes or you lose power
<axboe>
I used to use openbsd for a pppoatm router, when that was my inet connection
<milek7_>
are modern ssd really providing any guarantees on power loss? with opaque FTLs and hundreds of megabytes of cache
<axboe>
that's what the flush cache command is for
<axboe>
if previously acked writes aren't power loss stable after that, then the hw is defective
yuyichao has quit [Ping timeout: 480 seconds]
<alyssa>
defective hardware? impossible.
<axboe>
heh I know
<axboe>
but it's one of the 4 core primitives, really should get that wrong for any half-way decent device and above
<axboe>
I'm sure there's tons of shitty ones that just nop it :/
<axboe>
s/wrong/right, obviously
yuyichao has joined #asahi-dev
rkjnsn has quit [Quit: Reconnecting]
rkjnsn has joined #asahi-dev
PhilippvK has joined #asahi-dev
phiologe has quit [Ping timeout: 480 seconds]
axboe has quit [Quit: leaving]
kov has quit [Quit: Coyote finally caught me]
kov has joined #asahi-dev
<marcan>
< sven> we can always file a radar with apple and have it disappear and get no feedback for years! ;)
<marcan>
the question is what does macOS do
<marcan>
if it violates barrier guarantees in a way that can cause proper corruption when you yank the plug on the mac mini, we repro it then file a bug then I blog about it so it hits hacker news ;)
<marcan>
then apple listens :D
<marcan>
milek7_: yes, some SSDs (even some consumer lines) have enough capacitors to flush the cache on plug pulls
<marcan>
it's not that hard with how fast SSDs are
<marcan>
though these days they've mostly given up on that, but Micron/Crucial ones used to be like that
<marcan>
and enterprise lines certainly like to advertise that feature
<marcan>
I think at the lower end they only flush enough to make sure the FTL is consistent, but not necessarily all the data
<marcan>
the consumer line has enough caps to avoid corruption on power yank; the enterprise line has enough to save the entire cache
XeR has quit [Ping timeout: 480 seconds]
<VinDuv>
I may have missed some part of the discussion, but on macOS fsync does not flush the disk cache; it only writes the data to disk. You have to use fcntl(fd, F_FULLFSYNC) to flush the disk cache.
<VinDuv>
I thought fsync on Linux worked similarly but maybe not?
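(For reference, a minimal Python sketch of the F_FULLFSYNC call VinDuv describes; the file name is a placeholder, and fcntl.F_FULLFSYNC is only defined on macOS:)

    import fcntl, os

    fd = os.open("testfile", os.O_WRONLY | os.O_CREAT, 0o644)
    os.write(fd, b"very important data\n")
    os.fsync(fd)                          # macOS: pushes data to the drive, but does not flush the drive's cache
    fcntl.fcntl(fd, fcntl.F_FULLFSYNC)    # Darwin-only: ask the drive to flush its volatile cache to stable media
    os.close(fd)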
<marcan>
if that's how it works, maybe fio should be changed on macos so it stops lying... :p
<rkjnsn>
Sounds like it matches how kettenis was describing OpenBSD as working, then, which I suppose is not terribly surprising.
<marcan>
ok, can confirm macOS absolutely loses data if you do a plain fsync
<marcan>
simple python script writing to a file (using raw os. calls) and doing fsync() on it, then sleeping
<marcan>
I wait 5 seconds after the write/sync, then reboot via USB-PD command
<marcan>
write is gone
<marcan>
in fact I even managed to trigger an inconsistency in an Apple app by accident: I had GarageBand open, and closed it before the test, not saving data. on reboot, it tried to restore the now-nonexistent file, and threw up an error about corruption/invalid project
<marcan>
so I guess it deleted the "unsaved" project but didn't commit the "last open" state to restore on boot
<marcan>
F_FULLFSYNC indeed works
<marcan>
and indeed with a dumb repeated write test I get ~46 IOPS with that, vs. 40000 with plain fsync
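(A hedged sketch of the kind of dumb repeated-write test described here; the path, write size and iteration count are made up, and it only runs on macOS because of fcntl.F_FULLFSYNC:)

    import fcntl, os, time

    def timed_writes(path, full_fsync, iters=200):
        fd = os.open(path, os.O_WRONLY | os.O_CREAT, 0o644)
        start = time.monotonic()
        for _ in range(iters):
            os.pwrite(fd, b"x" * 4096, 0)
            os.fsync(fd)                            # plain fsync: data reaches the drive, not necessarily stable media
            if full_fsync:
                fcntl.fcntl(fd, fcntl.F_FULLFSYNC)  # additionally force the drive cache to be flushed
        ops = iters / (time.monotonic() - start)
        os.close(fd)
        return ops                                  # rough ops/s ("IOPS")

    print("plain fsync :", timed_writes("bench.dat", False))
    print("F_FULLFSYNC :", timed_writes("bench.dat", True))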
<sven>
hah
<marcan>
ooookay then, time to make some noise on twitter
<sven>
at least that explains all the weird behavior
<marcan>
yup
<ar>
so, fsync without f_fullfsync is basically a no-op on macos?
<marcan>
it's like fsync on Linux with the write cache set to write-through
<marcan>
(fake set)
<marcan>
so it's not a no-op but it's not good enough
<marcan>
given the default dirty writeback on Linux is 5 seconds and macOS is losing more than 5 seconds worth of data with this, it *effectively* is like doing nothing on Linux, modulo writeback pressure
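(The "write cache hack" mentioned below presumably means overriding the kernel's idea of the drive's cache type via sysfs so the block layer stops issuing flushes; a minimal sketch, assuming Linux, root privileges, and a placeholder device name — doing this deliberately makes fsync() unsafe across power loss, much like the macOS behaviour being discussed:)

    dev = "sda"  # placeholder device name
    with open(f"/sys/block/{dev}/queue/write_cache", "w") as f:
        f.write("write through")   # kernel stops sending flushes; writing "write back" restores the default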
<marcan>
so on a Samsung SSD 860 EVO mSATA on my laptop, I get 10K IOPS with the same write cache hack, and ~330 without (and fsync)
<marcan>
so even this mSATA SSD does better than Apple's NVMe
<marcan>
sigh
<marcan>
on my iMac with a WD SSD, I get more like 2000 IOPS with proper flushes
<marcan>
and 20000 without
the_lanetly_052__ has joined #asahi-dev
<rkjnsn>
Do we know how much of an issue it would be outside of apt and synthetic benchmarks? I'm curious if it would make sense just to make apt less aggressive with its syncs, given that it can still be a significant slowdown (albeit less of one) on other drives. Presumably most software isn't trying to sync more than 50 times a second…
<marcan>
it's a problem for e.g. databases
<rkjnsn>
Ah.
<rkjnsn>
Out of curiosity (since I don't know too much about this topic), if folks do want to ignore/postpone flushes on Linux, does the SSD support any other kind of ordering operation that could ensure data written after a flush doesn't hit the disk before data written prior to it?
<rkjnsn>
I know it could still eat data that was supposed to be safely stored, but it'd be nice to avoid broken internal consistency.
<VinDuv>
I’m pretty sure there isn’t a way to properly order the writes since basically the whole chain is allowed to reorder them
<marcan>
I'm not sure if NVMe itself provides barriers; axboe might know
<marcan>
and this might be a good reason to try to rescue that kind of feature...
<Glanzmann>
marcan: axboe said yesterday that there hasn't been any such thing as 'barriers' for 15 years. And that it is impossible due to multiqueue support IIUC.
<rkjnsn>
I think they were talking about barriers in Linux, as opposed to barriers in the drive, though?
<rkjnsn>
Even without support for write ordering in the kernel, it seems like (IIUC) translating fsyncs to write barriers (assuming the drive supports them) rather than flushes could at least help avoid corruption from the drive reordering things, even if it doesn't protect against writes that were supposedly committed being lost.
<sven>
it's not impossible to do with multiqueues. it's just very challenging.
<sven>
I don't remember seeing anything about write barriers in the nvme spec but I only read that one for the first time a few weeks ago
<marcan>
and yeah, translating fsync()s to write barriers would likely avoid corruption, if the drive can do that
<sven>
there's force unit access for write commands but that would probably only order writes where it's set (if it enforces the ordering at all)
<_jannau_>
marcan: I don't think dcp in its current state is ready to be integrated in asahi. preserved regions need coordinated m1n1/kernel changes with a not-fully-agreed-on dt-binding. My current code to remap in the locked dcp dart apparently only works when iboot doesn't initialize dcp (it works only on the mac mini)
<marcan>
ack, I'll defer then. I was on the fence about that
<marcan>
would it be fair to say that if I make a release within this month I should keep simpledrmfb for now?
<_jannau_>
it might be enough to transition dcp to hibernate in m1n1
<_jannau_>
yes, I think simpledrm should be preferred unless someone has time to work on dcp before that
<_jannau_>
I don't think there is a huge amount of work to be done to make dcp useful but I should concentrate on submitting spi-hid
MajorBiscuit has joined #asahi-dev
<marcan>
repro'd on the Mac Mini pulling the plug; there is definitely no (working) last-gasp mechanism
c10l4 has joined #asahi-dev
c10l has quit [Ping timeout: 480 seconds]
<ar>
interestingly, https://sqlite.org/atomiccommit.html#sect_9_2 > Setting fullfsync on a Mac will guarantee that data really does get pushed out to the disk platter on a flush. But the implementation of fullfsync involves resetting the disk controller. And so not only is it profoundly slow, it also slows down other unrelated disk I/O. So its use is not recommended.
<marcan>
well, this is #10 on hacker news now, so... who knows, maybe Apple will fix it :-)
<ar>
so it's not a new problem
<marcan>
ar: indeed, the database folks knew about this for a while
<marcan>
it's just amazingly nonobvious for the rest of us
<ar>
postgres also seems to have some references to `fcntl(fd, F_FULLFSYNC, 0)`
<dottedmag>
Is there a way to quickly detect power loss on mini, to cobble last-gasp signal purely in sw?
<marcan>
very good question. no idea.
<marcan>
though I imagine if there were Apple would be using it?
<marcan>
let me look at the schematics for a bit...
<marcan>
ah wait, I don't have the Mini ones, derp
<marcan>
though there might still be some info
<marcan>
dottedmag: not seeing anything in SMC, quick look at the MBA schematic (they often have hints as to other variants) doesn't show any place that signal would go
<marcan>
I suspect there is no such mechanism
<dottedmag>
Alas. I was thinking more about some kind of side effect that betrays the power loss.
<marcan>
dottedmag: polling the main primary rail voltage: it updates once a second, but I caught it updating immediately before shutdown after pulling the plug and it wasn't drooping
<marcan>
so I suspect the PSU just keeps it up then immediately kills power...
<marcan>
wonder if there's some PGOOD thing that could still work...
<dottedmag>
OK, so no easy way out.
<marcan>
hard to say without the real schematic
tanty has joined #asahi-dev
<marcan>
well this is interesting
<marcan>
while running the python loop, python says 46 ops/s, powermetrics reports 915 disk ops per second / 4.7MB/s (seems a bit much? APFS write amplification? and 20MB/s of ANS2 memory bandwidth in both directions)
<marcan>
doing the same thing, but with an artificial delay to make it run at the same speed sans the flush, I get 44 disk ops/s / 180 KB/s, and ~200 KB/s of ANS2 memory bandwidth
<marcan>
so that sounds like two problems here... FULLFSYNC does some horrible APFS amplification nonsense, *and* ANS2 blows it up even more
<marcan>
let me try another filesystem...
<marcan>
on FAT32 it's actually slower (34 IOPS), but no serious amplification: 78.37 ops/s 321.02 KBytes/s
<marcan>
and:
<marcan>
ANS2 RD : 3.042 MB/s
<marcan>
ANS2 WR : 8.836 MB/s
<marcan>
so yeah, ANS2 is definitely doing something somewhat dodgy if it's doing 9MB/s of memory write traffic and 3MB/s of memory read traffic to serve 320KB/s of data traffic
<marcan>
same test without the full fsync: write: 93.52 ops/s 397.77 KBytes/s
<marcan>
ANS2 DCS RD : 0.244 MB/s
<marcan>
ANS2 DCS WR : 0.411 MB/s
<marcan>
that's more like it
<marcan>
so yeah, that NVMe sync command is making ANS2 do a lot of work...
<marcan>
I wonder if it, like, linearly scans some huge cache hashtable?
<Dcow[m]>
does the amount of work depend on the storage size?
XeR has joined #asahi-dev
refi64 has quit [Read error: Connection reset by peer]
refi64 has joined #asahi-dev
<sven>
so that just leaves that weird issue where we sometimes miss an interrupt now
<rkjnsn>
I saw a reference to a F_BARRIERFSYNC. Not sure if that's a macOS thing or just an iOS thing, but if it's available on macOS, it might be worth seeing if it works as advertised and results in less performance loss.
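(If someone wants to probe that, a hedged sketch: Python's fcntl module doesn't export F_BARRIERFSYNC, so the raw value from Apple's <sys/fcntl.h> is used here — treat that constant, and the file name, as assumptions to verify against the SDK headers:)

    import fcntl, os

    F_BARRIERFSYNC = 85   # assumed value from Apple's <sys/fcntl.h>; verify before relying on it

    fd = os.open("testfile", os.O_WRONLY | os.O_CREAT, 0o644)
    os.write(fd, b"ordered write\n")
    try:
        fcntl.fcntl(fd, F_BARRIERFSYNC)   # request an I/O barrier instead of a full cache flush
    except OSError as err:
        print("F_BARRIERFSYNC not supported here:", err)
    os.close(fd)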
<maz>
sven: contrary to what I said, the AIC doesn't seem to have a configuration for edge/level, and "knows" which line in which. and the fasteoi flow doesn't distinguish them either.
<maz>
is*
<sven>
yeah, that's what I thought after spending some time with the interrupt code yesterday as well
<kettenis>
does the NVMe core code implement different code paths for MSI and non-MSI?
<maz>
kettenis: if it does, the latter probably isn't very well tested...
<kettenis>
sven: so on OpenBSD I still set the "number of openings" to 1 to keep the NVMe from locking up
<kettenis>
which effectively means command submission is serialized
<kettenis>
the issue I ran into sounds somewhat similar to what you're seeing
<kettenis>
at some point the completion interrupt for a command never happens
<kettenis>
although in my case the command actually didn't complete as far as I could tell
<kettenis>
not optimal, but still plenty fast and it has been rock solid since I added that hack
<sven>
that sounds very similar actually. some commands never generate the completion interrupt but they do appear in the completion queue and can be polled, but others don't ever seem to be triggered
<sven>
what exactly does "number of openings" do?
<sven>
macos uses INTMS and INTMC in its interrupt handler apparently, which linux doesn't because it relies on the interrupt controller itself
<kettenis>
I think "number of openings" is classic SCSI terminology for the number of commands that can be in flight simultaniously
MajorBiscuit has quit [Ping timeout: 480 seconds]
gladiac has quit [Quit: k thx bye]
gladiac has joined #asahi-dev
MajorBiscuit has joined #asahi-dev
Gaspare has joined #asahi-dev
<marcan>
for shits and giggles (and because HN commenters are tiring): full fsync on a shitty USB3 flash drive on macOS/M1: 223 IOPS. internal NVMe, 58 IOPS. Both FAT32.
<alyssa>
marcan: *blinks*
<marcan>
(of course, the flash drive has no cache at all, so it's equally slow with a vanilla fsync :p)
<marcan>
alyssa: yes, you get better database transaction latencies on a shitty flash drive than on internal NVMe on these machines :-)
<alyssa>
marcan: and if you wear out the flash drive you're not fsck()'d :V
<marcan>
:p
<phire>
wtf?
<phire>
did they just not try for decent fsync performance at all?
<marcan>
I suspect nobody noticed
<marcan>
since it doesn't matter on iOS
<sven>
aaand now I can't reproduce the missing interrupts at all anymore with the original code *sigh*
<povik>
filing bugs with Apple by getting on HN is an interesting method
<sven>
every team needs someone to do the mediawhoring ;)
<marcan>
the only reason I haven't deleted twitter is those 50k followers are *sometimes* useful :p
<as400[m]>
sven: are you suggesting that marcan is a new apple spokesperson?
<alyssa>
marcan: unfollowing everyone and setting all privacy settings to "following only" has made my twitter quiet! :-p
<alyssa>
`-$ echo "very important data" > file.txt` <-- it's right here
<sven>
:D
<j`ey>
marcan: lul nice
<marcan>
that said, I'm not even entirely sure if rsync tries to do a normal fsync() here
<marcan>
so it might just be unsafe on any system :p
<marcan>
but fsync() certainly wouldn't save macOS
yuyichao has joined #asahi-dev
hays has quit []
<milek7_>
rsync doesn't do fsync at all
hays has joined #asahi-dev
hays has quit []
hays has joined #asahi-dev
axboe has joined #asahi-dev
<axboe>
I already had 3 separate people send me marcan's flush rant this morning ;)
<axboe>
good to see some noise on this topic
Gaspare has quit [Ping timeout: 480 seconds]
<Jamie[m]1>
is radar priority decided directly based on HN rank, or is twitter likes the bigger factor? :P
<sven>
axboe: remind me again, is taking the anv->lock around the writel itself enough to prevent those timeouts or did the memcpy also have to be inside the lock?
<sven>
i can't seem to reproduce the issue at all even without the lock right now :/
<axboe>
sven: just around writel is enough, that's what I've been running and it's been rock solid
<axboe>
sven: reproduces trivially for me, was a bit harder with issues serialized, but could still hit it. consistently just doing a make -j8 kernel compile on it
<axboe>
would always hang
<axboe>
I'm running the three nvme patches now and it's all dandy
<axboe>
tcb clear cleanup, lock around writel, and the flush deferral
<sven>
okay, so from looking at the hypervisor logs macos seems to never interleave "ring cq doorbell + nvmmu invalidation" with "write new tag to sq"
<axboe>
ok
<sven>
so maybe if something like "start nvmmu invalidation, start new command, write cq doorbell" happens things break. no idea why though.
<axboe>
so that won't happen with that patch either then
<axboe>
as nvmmu invalidation is already under the anv lock
<sven>
yeah. i'm just trying to understand why we need that lock around the writel
<axboe>
I generally didn't feel comfortable without that lock to begin with, only part I'm a bit puzzled on is why the cpu freq changes makes this so much more likely to trigger
<axboe>
it'd be better if we didn't need to share this lock (particularly since it's irq disabling on the submit path), but at the rates of these drives, and given that it only has a single queue anyway, it's not going to be an iops monster and hence it doesn't really matter
<axboe>
so while that could be improved, it doesn't matter in practice imho
<sven>
makes sense
<sven>
if it's actually a race between nvmmu/cq_db and that writel maybe the higher cpufreq just makes it more likely to lose (or win, depending how we look at it :D) that one
<axboe>
definitely some correlation there, just not sure what!
<axboe>
I ran quite a bit with the cpus at max freq before that
<axboe>
and didn't see it
<axboe>
maybe differences in clock between simultaneous issuers? though I don't see how...
<sven>
quite a few people have been using it on the max/pro before cpufreq as well and never saw it either.
<sven>
yeah, i just hate when a mystery lock solves something that smells like some kind of race
<axboe>
yeah I know
<axboe>
your osx tracing sounds useful though
<axboe>
if it is indeed inval vs doorbell write
<axboe>
if only there was documentation ;)
<sven>
yup :D
<axboe>
I think we just need a good comment around why we _think_ it's needed
<axboe>
in the commit message too, but more importantly in the code
<sven>
yeah, absolutely
<axboe>
irq fix is queued up btw, but I guess you saw that already
<sven>
yup, that was pretty quick :)
<axboe>
one less to worry about :)
<axboe>
oh forgot, also running that "set aq depth to 2" patch
<axboe>
sven: though for the batching, we could optimize the nvmmu_inval by writing each tag, then one readl_relaxed at the end...
<sven>
macOS does a readl_relaxed after each inval fwiw. I’m not sure if that’s required ofc.
<sven>
could be worth a try
<axboe>
probably not worthwhile to pursue, but something to keep in mind
<axboe>
I've got this funky series for io_uring that allows registered buffers to retain dma and iommu mappings, instead of doing them for each io submit and complete
<alyssa>
"if only there was documentation ;)"
<alyssa>
mood
<axboe>
it'd bump the peak 17xK rand iops to substantially more
<sven>
nice!
<axboe>
was planning on adding support for apple nvme just to test out what it can actually do, if I do that, then I can try the nvmmu inval optimization too as it'd likely make a difference at that point
<axboe>
was part of the "lets see how many iops we can do on a core" experiments, 1 of 2 series that hasn't been posted anywhere yet as it's a bit of a hack
<sven>
I saw part of those optimization on twitter. Very impressive how much it improved
<axboe>
it was a fun project
<axboe>
there's no way to test apple-nvme in qemu yet, is there?
<sven>
nope
<axboe>
ah screw it, we'll do it live
<axboe>
let's find out
<sven>
:D
<sven>
what could go wrong ;)
<axboe>
right?
<axboe>
done, let's see if it works...
axboe has quit [Quit: reboot]
the_lanetly_052___ has joined #asahi-dev
skipwich has quit [Quit: DISCONNECT]
skipwich has joined #asahi-dev
<sven>
uh oh
the_lanetly_052__ has quit [Ping timeout: 480 seconds]
axboe has joined #asahi-dev
<axboe>
it worked
<sven>
nice!
<axboe>
not sure why I seem to be hitting the segments != 1 path in apple_nvme_map_data() though
<axboe>
let's debug...
axboe has quit [Quit: Lost terminal]
axboe has joined #asahi-dev
<axboe>
oh, it's the 16k page size
<axboe>
I should align my buffers better and not assume 4k page sizes :)
<j`ey>
:-)
<axboe>
and then apple-nvme needs to support ->queue_rqs too
bpye3 has joined #asahi-dev
al3xtjames2 has joined #asahi-dev
skipwich has quit [Ping timeout: 480 seconds]
<alyssa>
sven: so uh what does this mean for apple-nvme upstreaming?
al3xtjames has quit [Quit: Ping timeout (120 seconds)]
<alyssa>
i guess we're still super blocked on rtk?
al3xtjames2 is now known as al3xtjames
skipwich has joined #asahi-dev
bpye has quit [Ping timeout: 480 seconds]
bpye3 is now known as bpye
<axboe>
sven: ok queue_rqs is hard to support, since the sq doorbell is writing the specific tag...
<axboe>
oh well, guess we can't easily do that
<axboe>
but with the changes, we get around the same perf, but at about half the CPU usage
<axboe>
so guessing we're actually controller limited on iops anyway around that point
<rkjnsn>
marcan, would it be easy to test F_BARRIERFSYNC using your python script to see what it does and how it performs on these machines?
Major_Biscuit has quit [Ping timeout: 480 seconds]
axboe has quit [Quit: Lost terminal]
the_lanetly_052___ has quit [Ping timeout: 480 seconds]
<sven>
axboe: half the cpu usage already sounds great :)
<sven>
alyssa: yeah, I need to take a look at what marcan did to rtkit and we need to discuss how we will upstream that together with smc
<alyssa>
sven: right
axboe has joined #asahi-dev
<axboe>
sven: got queue_rqs set up, and it's just cpu reduction at this point, iops are capped / controller limited