#asahi on 2022-06-19 — irc logs at oftc.irclog.whitequark.org

2022-04-28 01:57 marcan changed the topic of #asahi to: Asahi Linux: porting Linux to Apple Silicon macs | https://asahilinux.org/2022/03/asahi-linux-alpha-release/ | General project discussion | GitHub: https://alx.sh/g | Wiki: https://alx.sh/w | Topics: #asahi-dev #asahi-re #asahi-gpu #asahi-alt #asahi-stream #asahi-offtopic | Keep things on topic | Logs: https://alx.sh/l/asahi

00:53 NickLu[m]1 has joined #asahi

00:57 <fionera[m]> <sven> "but that’s possible on these..." <- this was what I wanted to ask ^^ Was curious why parallels on macos doesnt support it

01:04 <Lucy[m]> Does macOS even allow that?

01:07 darkapex has joined #asahi

01:08 <steev> i don't have anything that's actually thunderbolt, but i thought it did fionera[m] - https://kb.parallels.com/en/124266 says you can use e.g. an eGPU

01:08 <steev> ahh directly

01:14 <fionera[m]> yeah i would like to have the raw pcie device in the vm :D

01:14 <fionera[m]> tho thats probably smth that macos blocks

01:24 chadmed has joined #asahi

01:29 nicolas17 has quit [Quit: Konversation terminated!]

01:29 nicolas17 has joined #asahi

01:54 nicolas17 has quit [Quit: Konversation terminated!]

01:54 nicolas17 has joined #asahi

02:33 PhilippvK has quit [Ping timeout: 480 seconds]

02:52 nicolas17 has quit [Ping timeout: 480 seconds]

03:26 ptudor_ is now known as ptudor

03:26 marvin24_ has joined #asahi

03:29 marvin24 has quit [Ping timeout: 480 seconds]

03:33 nico_32 has quit [Ping timeout: 480 seconds]

03:56 <MichaelMesser[m]> I thought eGPUs weren't likely to work. https://twitter.com/marcan42/status/1534825580801433600

03:57 <MichaelMesser[m]> I assume that would apply to VMs as well.

03:57 <marcan> that parallels link is for intel machines

03:58 Ry_Darcy has joined #asahi

03:58 nico_32 has joined #asahi

03:58 <marcan> M1 VMs on macOS do not support passthrough of anything and definitely not eGPUs

03:59 <MichaelMesser[m]> macOS on Intel doesn't support passthrough either

04:00 <marcan> ah, that link is only about paravirt graphics, not passthrough

04:02 <marcan> it's technically possible to make eGPUs work (with performance limitations) on M1 devices, but it's like a whole driver refactoring/development project to pull it off in a way that can work

04:03 <marcan> so you'd have to find someone well versed in graphics and they'd likely have to spend months working on it, for each GPU driver/vendor, and then everything needs to be upstreamed, and upstream might not like the (rather intrusive) changes it brings

04:03 <marcan> likely needs userspace mesa changes too

04:03 <marcan> and you'd still end up with pathologically bad performance for some workloads

04:06 <marcan> VMs don't change any of that (the guest still needs the same driver changes)

04:07 <marcan> technically with a VM you could hook all BAR accesses and make it "work" with no changes to the guest, but performance would be hilariously bad

04:07 <marcan> like 100x slower bandwidth than native VRAM accesses bad

04:09 <marcan> would be trivial to test that with the m1n1 hypervisor once thunderbolt/pcie stuff is in asahi if someone wants to find out just how bad it would be

04:09 <marcan> mostly just completing unaligned r/w support to make it work

04:43 Ry_Darcy has quit [Remote host closed the connection]

04:46 Ry_Darcy has joined #asahi

04:51 <MichaelMesser[m]> Does anything other than eGPUs use this feature?

05:05 <marcan> not to my knowledge

05:06 <marcan> potentially weird GPU-like things (like FPGA boards with lots of onboard RAM) could

05:06 <marcan> but the problem with GPUs is that making them work "properly" the sane way involved potentially modifying every app/game

05:07 <marcan> GPUs are really the only device where this is exposed to a huge variety of applications

05:07 <marcan> if you have something more niche you can just work around the problem in the driver/app much more easily

05:07 <marcan> *involves

05:10 <MichaelMesser[m]> So if Apple does not care about eGPUs, this is unlikely to be fixed in later hardware?

05:12 <Lucy[m]> Any way to know if it's fixed in the M2? Probably not yet, right?

05:41 <chadmed> i wonder if they even consider it a bug at all

05:42 <marcan> this isn't unique to the M1, it's the same on may other ARM systems

05:42 qeeg has quit [Remote host closed the connection]

05:43 <chadmed> yeah i doubt theyll fix it

05:43 <marcan> it's not a "bug", it's software and hardware assuming everything is like x86

05:43 <marcan> they *could* make it like x86 to make this work, but it not being like x86 isn't a bug

05:43 <chadmed> not having eGPU support also encourages software vendors to fix their applications to work better on AGX rather than direct customers to just go buy GPUs that

05:43 <chadmed> "work"

05:43 <marcan> that too

05:43 qeeg has joined #asahi

05:45 <Lucy[m]> Didn't you say that performance would be impacted though?

05:45 <marcan> the workaround you'd have to do to make it work impacts performance

05:45 <chadmed> and even if they did make device memory look like x86, i doubt many PC consumer grade cards would work out of the box because of the weird Intel-specific DMA address filtering which itself would require software workarounds that would nuke performance in any case

05:45 <chadmed> so theres no point fixing any of this

05:45 <Lucy[m]> Right. Makes sense then

05:46 <chadmed> this has all already been experienced on ppc64 workstations

05:46 <chadmed> very very few pcie cards work properly on those, theres like 3 consumer grade cards in total that work as intended on talos boards for example

05:47 <chadmed> its going to be an interesting tug of war once arm platforms start becoming commonplace on less "integrated" stuff than apple's

05:48 <chadmed> who will capitulate first? the add in cards or the soc vendors

05:49 <marcan> I think we actually have yet to still 100% validate that this is not supported for PCIe with any memory type other than Device-nGnRE

05:49 <marcan> I just looked up the ARM spec for PCIe integration, and it says Normal-NC is allowed (but not Normal)

05:49 <marcan> we've never tried Normal-NC, just Normal

05:50 <marcan> I'm not holding my breath but it's worth checking

05:50 <marcan> might do it a bit later

05:51 <marcan> ()

05:51 <marcan> (Normal-NC would make eGPUs work sanely)

05:53 <chadmed> are cards expected to gracefully handle partial writes on x86?

06:03 <marcan> yes

06:19 the_lanetly_052 has joined #asahi

06:41 <kettenis> as far as I know NVIDIA uses Device-GRE mappings instead of Normal-NC in their (proprietary) graphics stack

06:44 <marcan> that won't work with userland that tries to make unaligned accesses to VRAM

06:44 <marcan> which some userland does

06:46 <kettenis> it'd be slow since the kernel would emulate the misaligned loads and stores

06:47 <kettenis> and presumably NVIDIA's userland code doesn't do this

06:48 <kettenis> (I mean unaligned accesses to VRAM from userland)

06:48 <marcan> it's up to individual applications, not nvidia's userland code

06:48 <marcan> e.g. memcpy will often do unaligned accesses for performance

06:48 <marcan> that's why we can't just fix this in drivers

06:49 <marcan> there is no way unaligned emulation will perform well when literally memcpy hits it

06:50 <kettenis> well, does OpenGL/Vulkan/CUDA allow mapping VRAM directly into userland

06:50 <kettenis> or is that an Intel/AMD extension?

06:51 <marcan> I think every GPU driver allows that?

06:51 <kettenis> the usb ones certainly don't ;)

06:52 <marcan> do USB GPUs exist, besides that abomination Lina is working on?

06:52 <marcan> (DisplayLink is not a GPU, it's a display controller)

06:53 <kettenis> I was thinking of displaylink as an example of something that doesn't expose a VRAM framebuffer

06:54 <kettenis> but yeah, not really a GPU

06:55 <marcan> yeah, this isn't about framebuffers, those end up being driver-managed; it's about things like texture buffers, VBOs, etc.

06:58 <kettenis> I'm not familliar enough with the various graphics APIs, but at least some of the OpenGL functionality to directly map stuff like that from VRAM is an extension that drivers don't have to implement

06:58 <marcan> https://patchwork.kernel.org/project/linux-arm-kernel/patch/20210429162906.32742-2-sdonthineni@nvidia.com/ relevant discussion that agrees with what I'm saying :p

07:01 <marcan> which you're part of :-)

07:01 <kettenis> ah, well, KVM throws in a whole other complication

07:01 <kettenis> and usually these discussions happen in the context of the open-source (Mesa) graphics stack

07:02 <kettenis> which defenitely uses memcpy and assumes that non-aligned access works

07:04 <kettenis> it'd be interesting to see what an Apple Silicon Mac Pro looks like and whether Apple is going to support PCIe GPUs in those

07:05 <marcan> still need to test that normal-NC really does not work

07:09 bisko has joined #asahi

07:20 <marcan> nope, does not work, and in fact you get a nice AMCC panic from macOS

07:20 <marcan> panic(cpu 0 caller 0xfffffe0013443760): "AMCC PLANE3 PIO request with RO flag set error: INTSTS 0x0000000000400000 AFERRLOG0/1/2/3 0x101000/0x17f1406/0x2000000/0x40001 ADDR 0x600101000 CMD/SIZE/TYPE 0x14(CifNCWr)/0x7f/0x1 AID/TID 0x10/0" @AppleT8101PlatformErrorHandler.cpp:1323

07:20 <marcan> presumably RO=reorder

07:20 <marcan> nGnRnE and nGnRE both work for PCIe BARs

07:21 <marcan> not sure how nGnRnE is supposed to actually work since PCIe writes are posted, but at least the fabric does not complain

07:21 bisko has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

07:21 <kettenis> with nGnRnE you simply don't get the benefits of posting at the CPU level

07:22 <marcan> yeah, it's just still posting behind the scenes which seems a bit odd

07:22 <kettenis> compatible with x86 ;)

07:22 <marcan> heh

07:23 <marcan> I really need to reverse engineer that AMCC error handler stuff, it'd be very useful

07:23 <kettenis> wonder wether GRE or GnRE works

07:23 <marcan> (I only have some stub stuff in m1n1 experiments)

07:23 <kettenis> since G is what you really care about for mapping "prefetchable

07:23 <kettenis> " PCI bars

07:26 <kettenis> but even then, it is probably not worth it spending time on making eGPUs work

07:26 <marcan> kettenis: tested they do

07:26 <marcan> but yeah, won't fix the alignment problem

07:27 <marcan> all Device modes work for aPCIeC, but Normal modes do not

07:55 <kettenis> so that means that all pci drivers that use ioremap_wc() to map a prefetchable BAR won't work on these machines

08:04 mikoxyzzz has joined #asahi

08:14 mikoxyzzz is now known as miko

08:28 miko has quit [Quit: WeeChat 3.5]

08:29 miko has joined #asahi

08:30 miko has quit []

08:30 miko has joined #asahi

09:11 Ry_Darcy has quit [Remote host closed the connection]

09:19 the_lanetly_052 has quit [Ping timeout: 480 seconds]

09:29 c10l has quit [Quit: Bye o/]

09:33 c10l has joined #asahi

09:53 the_lanetly_052 has joined #asahi

10:20 the_lanetly_052 has quit [Ping timeout: 480 seconds]

10:38 miko has quit [Quit: WeeChat 3.5]

10:38 gabuscus has joined #asahi

10:47 <fionera[m]> Oh holy I really started a discussion about that :) I was interested because I was wondering if I can pass it to a VM. Either Windows or Linux and not only GPUs but also other PCIe Devices like Network Cards. (Yes I use my eGPU case to test enterprise NICs on my Laptop)

10:50 c10l has quit [Quit: Bye o/]

10:54 c10l has joined #asahi

11:09 <kettenis> the answer is pretty much no for linux (either passthrough or native)

12:07 miko has joined #asahi

12:53 <milek7_> GL_MAP_COHERENT_BIT is required from gl 4.4

12:54 <milek7_> before that I think it is possible to get away with intermediary buffer in the driver

13:17 chadmed has quit [Ping timeout: 480 seconds]

13:18 chadmed has joined #asahi

13:40 MajorBiscuit has joined #asahi

15:08 amarioguy has joined #asahi

15:09 <unrelentingtech> <chadmed> "and even if they did make device..." <- what is the intel specific dma address filtering thing? amdgpu generally just works on non-broken aarch64 with no workarounds

15:24 jakebot6 has quit [Quit: The Lounge - https://thelounge.chat]

15:30 jakebot6 has joined #asahi

15:38 ___nick___ has joined #asahi

16:07 MajorBiscuit has quit [Quit: WeeChat 3.5]

16:15 miko has quit [Quit: WeeChat 3.5]

16:18 miko has joined #asahi

18:05 Moprius has joined #asahi

18:33 osaka1990 has joined #asahi

18:34 osaka1990 has quit []

18:35 osaka1990 has joined #asahi

18:56 Ry_Darcy has joined #asahi

19:15 qeeg has quit [Ping timeout: 480 seconds]

19:18 qeeg has joined #asahi

20:08 ___nick___ has quit [Ping timeout: 480 seconds]

20:37 ptudor_ has joined #asahi

20:40 ptudor has quit [Ping timeout: 480 seconds]

20:49 <konradybcio> hey, I was trying to figure out PMGR stuff for A(n) SoCs and got a little confused.. are the powerstates (https://git.kernel.org/pub/scm/linux/kernel/git/next/linux-next.git/tree/arch/arm64/boot/dts/apple/t8103-pmgr.dtsi?h=next-20220531) a new representation of gate clocks? or are they a separate thing?

20:52 <sven> what we assumed to be clock gates are actually those power states

20:53 <konradybcio> oh so they're votable?

20:53 <konradybcio> as in, more than just on/off?

20:54 <sven> yeah, there are at least three states (off, clock gated(?) and on)

20:54 <sven> and they can do auto power management

20:54 <konradybcio> sounds convenient

20:55 <sven> they also usually don’t need explicit support in the drivers

20:55 <konradybcio> unlike some other SoC vendors' dvfs infra..

20:56 <sven> as long as a device is only attached to a single power domain the genpd core will handle everything automatically

20:59 <j`ey> which only seems to be nvme and mca

21:01 ptudor_ is now known as ptudor

21:02 <j`ey> wonder why sound/ is a separate subfolder and not in drivers/

21:02 <sven> huh, true. that’s weird

21:02 <sven> i bet it’s for some obscure historical reason :D

21:02 miko has quit [Quit: WeeChat 3.5]

21:42 yuyichao has joined #asahi

21:43 LunaFoxgirlVT has joined #asahi

21:51 qeeg has quit [Ping timeout: 480 seconds]

22:07 qeeg has joined #asahi

22:18 LunaFoxgirlVT has quit [Quit: Leaving]

22:31 manawyrm has quit [Quit: Read error: 2.99792458 x 10^8 meters/second (Excessive speed of light)]

22:31 manawyrm has joined #asahi

22:36 <MichaelMesser[m]> Does Apple include anything the might be useful for the eGPU issue here? https://developer.apple.com/documentation/kernel/hardware_families/pci/implementing_a_pcie_kext_for_a_thunderbolt_device

22:37 <MichaelMesser[m]> s/the/that/, s/hardware_families/hardware\_families/, s/implementing_a_pcie_kext_for_a_thunderbolt_device/implementing\_a\_pcie\_kext\_for\_a\_thunderbolt\_device/

22:42 Moprius has quit [Ping timeout: 480 seconds]

22:58 confusomu has joined #asahi

23:05 Ry_Darcy has quit [Remote host closed the connection]