#asahi-gpu on 2023-04-26 — irc logs at oftc.irclog.whitequark.org

2022-12-21 00:46 ChanServ changed the topic of #asahi-gpu to: Asahi Linux GPU development (no user support, NO binary reversing) | Keep things on topic | GitHub: https://alx.sh/g | Wiki: https://alx.sh/w | Logs: https://alx.sh/l/asahi-gpu

00:16 alyssa has joined #asahi-gpu

00:16 <alyssa> apple's compiler implements * 9 with iadd lsl

00:17 <alyssa> * 17 too

00:17 <alyssa> imad for * 33

00:17 <alyssa> imad for * 7

00:17 <alyssa> imad for * -7

00:18 <alyssa> oh, this is interesting, it uses iadd+lsl instead of bfi for small shifts

00:19 <alyssa> it does use isub lsl

00:20 <alyssa> but not in lieu of imad

00:20 <alyssa> (and will optimize isub lsl to imad, weirdly)
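A minimal sketch of the identity behind the `* 9` and `* 17` cases above, assuming `iadd` with an `lsl` modifier computes `a + (b << shift)` on 32-bit values. The helper names `iadd_lsl`, `mul9` and `mul17` are hypothetical, and the operand order is an assumption rather than the real encoding.

```python
MASK32 = 0xFFFFFFFF  # assumption: 32-bit wrapping integer arithmetic


def iadd_lsl(a, b, shift):
    """Model of an add whose second operand is shifted left: a + (b << shift)."""
    return (a + (b << shift)) & MASK32


def mul9(x):
    # x * 9 == x + (x << 3)
    return iadd_lsl(x, x, 3)


def mul17(x):
    # x * 17 == x + (x << 4)
    return iadd_lsl(x, x, 4)
```

Both multipliers have the form 2^k + 1 with k <= 4, which is why a single shifted add covers them; 33 = 2^5 + 1 would need a shift of 5, matching the switch to imad.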

00:21 <alyssa> up to lsl 4 for a single iadd

00:21 <alyssa> then, weirdly, <<5 is implemented by chaining iadd(lsl 1) with iadd(lsl 4)

00:21 <alyssa> i fail to see why that would be a win
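The chaining described above can be sketched as follows, assuming a single `iadd` can shift its second operand by at most 4 and that a plain shift is written as `0 + (x << s)`. The function names are hypothetical, and the split order (remainder first, then steps of 4) is only known from the log to match the `<< 5` case.

```python
MASK32 = 0xFFFFFFFF  # assumption: 32-bit wrapping integer arithmetic
MAX_LSL = 4          # largest shift a single iadd handles, per the log


def iadd_lsl(a, b, shift):
    """Model of an add whose second operand is shifted left: a + (b << shift)."""
    assert 0 <= shift <= MAX_LSL
    return (a + (b << shift)) & MASK32


def shift_steps(n):
    """Split a shift amount into per-iadd shifts, e.g. 5 -> [1, 4]."""
    steps = []
    if n % MAX_LSL:
        steps.append(n % MAX_LSL)
    steps.extend([MAX_LSL] * (n // MAX_LSL))
    return steps


def shl_via_iadds(x, n):
    """Compute x << n (32-bit) using only chained shifted adds with a zero addend."""
    for s in shift_steps(n):
        x = iadd_lsl(0, x, s)
    return x
```

For example, `shift_steps(5)` gives `[1, 4]`, so `x << 5` takes two chained adds in this model.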

00:23 <alyssa> although, if there are a lot more SCIB than IC units, then it makes sense

00:23 <alyssa> (those are functional units, in the perf counters)

00:23 <alyssa> (different ALUs)

00:26 <alyssa> I haven't done any benchmarking myself, but comparing Philip Turner's notes with dougall's, I guess there are 4 SCIB units per 1 IC unit on Apple M1

00:27 <alyssa> with int32 addition and basic bitwise ops on the SCIB unit, but the variable barrel shifters on the IC unit

00:27 <alyssa> meaning from a raw throughput perspective, using 3 adds (with lsl) to save a real shift would still be a win

00:28 <alyssa> that suggests x << 12 would be implemented with chained iadds, x << 13 is a toss-up, and x << 17 would be a real bitwise op
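The throughput argument can be worked through with a small sketch, assuming the guessed 4:1 ratio of SCIB to IC units, at most `lsl 4` per iadd, and that unit count is the only cost. It ignores latency and code size, and the names `iadds_needed` and `throughput_verdict` are hypothetical.

```python
import math

MAX_LSL = 4      # largest shift a single iadd handles, per the log
SCIB_PER_IC = 4  # guessed ratio of SCIB to IC units, per the log


def iadds_needed(n):
    """Number of chained iadds required to shift left by n."""
    return math.ceil(n / MAX_LSL)


def throughput_verdict(n):
    """Compare k iadds on SCIB against one real shift on IC, by unit count only."""
    k = iadds_needed(n)
    if k < SCIB_PER_IC:
        return "chained iadds"
    if k == SCIB_PER_IC:
        return "toss-up"
    return "real shift"
```

Under these assumptions, a shift by 12 needs 3 iadds (a win), 13 needs 4 (break-even), and 17 needs 5 (a loss).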

00:29 <alyssa> In reality, Apple's compiler chooses to use bfi for 9 and above, but x << 8 is chained iadds

00:29 <alyssa> That is, it will use 2 iadds to save a bfi, but will not use 3 iadds

00:29 <alyssa> Possibly there are battling latency and i-cache concerns here, this might be a heuristic it has
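For contrast with the pure throughput estimate, the observed behaviour fits a simpler rule: chain iadds only when at most two are needed. A sketch of that rule follows, with a hypothetical function name and with the threshold of 2 inferred only from the `<< 8` and `<< 9` observations in the log.

```python
import math

MAX_LSL = 4            # largest shift a single iadd handles, per the log
MAX_CHAINED_IADDS = 2  # inferred from the log: 2 iadds are used, 3 are not


def observed_lowering(n):
    """Which lowering the log reports for x << n with constant n >= 1."""
    iadds = math.ceil(n / MAX_LSL)
    return "iadd chain" if iadds <= MAX_CHAINED_IADDS else "bfi"
```

This gives an iadd chain for shifts of 1 through 8 and bfi from 9 upward.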

00:30 abd has joined #asahi-gpu

00:30 <alyssa> Performance of shifts by 10 is not of any importance whatsoever to me. But it's a nice test case for reasoning about the uarch.

00:35 <alyssa> I would like to have cycle count estimates in my shader-db stats, but meh, I don't think we know enough about the uarch to do that accurately yet.

00:37 <alyssa> ---

00:37 <alyssa> With no render targets, but depth/stencil

00:37 <alyssa> msaa 4x: sample count 1, sample stride 1, bytes per 4096, layout 32x16

00:37 nsklaus has quit [Ping timeout: 480 seconds]

00:38 <alyssa> msaa 2x: sample count 1, sample stride 1, bytes per 8192

00:38 <alyssa> ditto if no msaa

00:45 nsklaus has joined #asahi-gpu

00:53 nsklaus has quit [Ping timeout: 480 seconds]

01:02 nsklaus has joined #asahi-gpu

01:04 bluetail has quit [Quit: The Lounge - https://thelounge.chat]

01:10 nsklaus has quit [Ping timeout: 480 seconds]

01:18 i509vcb has joined #asahi-gpu

01:19 nsklaus has joined #asahi-gpu

01:27 nsklaus has quit [Ping timeout: 480 seconds]

01:36 nsklaus has joined #asahi-gpu

01:44 nsklaus has quit [Ping timeout: 480 seconds]

01:51 nsklaus has joined #asahi-gpu

02:01 nsklaus has quit [Ping timeout: 480 seconds]

02:10 nsklaus has joined #asahi-gpu

02:13 chadmed_ has joined #asahi-gpu

02:18 nsklaus has quit [Ping timeout: 480 seconds]

02:26 nsklaus has joined #asahi-gpu

02:34 nsklaus has quit [Ping timeout: 480 seconds]

02:42 nsklaus has joined #asahi-gpu

02:50 nsklaus has quit [Ping timeout: 480 seconds]

02:58 nsklaus has joined #asahi-gpu

03:08 nsklaus has quit [Ping timeout: 480 seconds]

03:12 nsklaus has joined #asahi-gpu

03:21 nsklaus has quit [Ping timeout: 480 seconds]

03:26 PyroPeter has joined #asahi-gpu

03:28 pyropeter3 has quit [Ping timeout: 480 seconds]

03:30 nsklaus has joined #asahi-gpu

03:38 nsklaus has quit [Ping timeout: 480 seconds]

03:40 bluetail has joined #asahi-gpu

03:46 nsklaus has joined #asahi-gpu

03:54 nsklaus has quit [Ping timeout: 480 seconds]

04:01 possiblemeatball has quit [Quit: Quit]

04:02 nsklaus has joined #asahi-gpu

04:10 nsklaus has quit [Ping timeout: 480 seconds]

04:19 nsklaus has joined #asahi-gpu

04:27 nsklaus has quit [Ping timeout: 480 seconds]

04:36 nsklaus has joined #asahi-gpu

04:44 nsklaus has quit [Ping timeout: 480 seconds]

04:52 nsklaus has joined #asahi-gpu

04:59 abd has quit [Remote host closed the connection]

05:01 nsklaus has quit [Ping timeout: 480 seconds]

05:08 nsklaus has joined #asahi-gpu

05:17 nsklaus has quit [Ping timeout: 480 seconds]

05:25 nsklaus has joined #asahi-gpu

05:34 nsklaus has quit [Ping timeout: 480 seconds]

05:42 nsklaus has joined #asahi-gpu

05:50 nsklaus has quit [Ping timeout: 480 seconds]

05:59 nsklaus has joined #asahi-gpu

06:07 nsklaus has quit [Ping timeout: 480 seconds]

06:16 pthariensflame has joined #asahi-gpu

06:16 pthariensflame has quit []

06:16 nsklaus has joined #asahi-gpu

06:25 nsklaus has quit [Ping timeout: 480 seconds]

06:32 chadmed_ has quit [Quit: Page closed]

06:36 nsklaus has joined #asahi-gpu

06:36 MajorBiscuit has quit [Ping timeout: 480 seconds]

06:44 nsklaus has quit [Ping timeout: 480 seconds]

06:52 nsklaus has joined #asahi-gpu

07:01 nsklaus has quit [Ping timeout: 480 seconds]

07:02 i509vcb has quit [Quit: Connection closed for inactivity]

07:09 nsklaus has joined #asahi-gpu

07:17 nsklaus has quit [Ping timeout: 480 seconds]

07:25 nsklaus has joined #asahi-gpu

07:33 nsklaus has quit [Ping timeout: 480 seconds]

07:44 nsklaus has joined #asahi-gpu

07:52 nsklaus has quit [Ping timeout: 480 seconds]

08:00 nsklaus has joined #asahi-gpu

08:08 nsklaus has quit [Ping timeout: 480 seconds]

08:08 MajorBiscuit has joined #asahi-gpu

08:17 nsklaus has joined #asahi-gpu

08:25 nsklaus has quit [Ping timeout: 480 seconds]

08:33 nsklaus has joined #asahi-gpu

08:38 MajorBiscuit has quit [Ping timeout: 480 seconds]

08:40 MajorBiscuit has joined #asahi-gpu

08:40 MajorBiscuit has quit []

08:42 nsklaus has quit [Ping timeout: 480 seconds]

08:52 nsklaus has joined #asahi-gpu

09:01 nsklaus has quit [Ping timeout: 480 seconds]

09:01 nyilas has joined #asahi-gpu

09:10 nsklaus has joined #asahi-gpu

09:18 nsklaus has quit [Ping timeout: 480 seconds]

09:26 nsklaus has joined #asahi-gpu

09:32 Jamie has joined #asahi-gpu

09:32 rhysmdnz has joined #asahi-gpu

09:33 Jamie is now known as Guest12222

09:34 nsklaus has quit [Ping timeout: 480 seconds]

09:37 Guest12142 has quit [Ping timeout: 480 seconds]

09:37 rhysmdnz1 has quit [Ping timeout: 480 seconds]

09:41 nsklaus has joined #asahi-gpu

11:59 cylm has joined #asahi-gpu

12:19 mkurz has quit [Ping timeout: 480 seconds]

12:24 possiblemeatball has joined #asahi-gpu

13:24 mkurz has joined #asahi-gpu

13:26 <alyssa> lina: dEQP-GLES31.functional.texture.multisample.samples_1.use_texture_depth_2d_array

13:29 <alyssa> e4cdbeab814bdc2b468ec2375a74fc961a423213

13:37 nyilas has quit [Remote host closed the connection]

13:39 nyilas has joined #asahi-gpu

13:40 nyilas has quit [Remote host closed the connection]

13:44 nyilas has joined #asahi-gpu

13:44 nyilas has quit [Remote host closed the connection]

14:02 hightower2 has joined #asahi-gpu

14:12 mkurz has quit [Ping timeout: 480 seconds]

14:44 LinuxM1 has joined #asahi-gpu

14:50 cylm has quit [Ping timeout: 480 seconds]

14:56 mkurz has joined #asahi-gpu

14:57 <alyssa> lina: https://developer.imaginationtech.com/open-source-gpu-driver/

15:05 <alyssa> lina: If you revert bf3027c3916 ("mesa/st: Normalize wrap modes for seamless cubes")

15:06 <alyssa> then a bunch of dEQP-GLES3 cases will fail (listed in the commit message)

15:06 <alyssa> if you can find a control register bit that you can toggle to make them pass it would make me very interesting

15:06 <alyssa> interested

15:06 <alyssa> would not change the level of interesting i am

15:09 LinuxM1 has quit [Quit: Leaving]

15:16 <alyssa> (which is to say, about 37% on a good day)

15:48 c10l has quit [Quit: Bye o/]

15:56 c10l has joined #asahi-gpu

16:06 stickytoffee has quit [Read error: Connection reset by peer]

16:08 stickytoffee has joined #asahi-gpu

16:49 Guest12222 has quit [Ping timeout: 480 seconds]

16:49 rhysmdnz has quit [Ping timeout: 480 seconds]

17:38 <lina> alyssa: I don't know if this is a good thing or a bad thing, but almost all the unknowns I exposed seem to do nothing to dEQP2/3... ^^;;

17:38 <alyssa> Sounds like a neutral thing ^^

17:39 <alyssa> Also I already had lunch, get some sleep %_%

17:39 <lina> The only interesting things I found are that if large_tib and the TVB isn't large enough I get a message and the job hangs (means I need to update the min_tvb_size for that case) and one of the registers has two bits which need to be set or a ton of things fault. I have a feeling it might be related to cache control/flushing?

17:39 <lina> I just ended the stream! I'll have a bit of dinner and sleep ^^

17:39 <alyssa> ^^

17:39 <lina> I opened a PR with the UAPI changes anyway ^^

17:39 <alyssa> 1. I don't know exactly how eMRT interacts with partial renders. I'm trying not to think about eMRT until I've finished up everything else in gles3.1

17:40 <alyssa> 2. Shrug

17:40 <lina> Speaking of, I thought we were passing all of GLES3 but there's still that one test that needs eMRT?

17:40 <lina> (the one with a too large TIB stride)

18:03 <alyssa> eh, yeah, right

18:03 <alyssa> that's in GLES3.0 in your CTS, it's in GLES3.1 in mine because I haven't updated in years :p

18:37 possiblemeatball has quit [Quit: Quit]

19:02 hightower2 has quit [Ping timeout: 480 seconds]

19:25 hightower2 has joined #asahi-gpu

19:55 Jamie has joined #asahi-gpu

19:56 rhysmdnz has joined #asahi-gpu

19:56 Jamie is now known as Guest12262

20:05 maximbaz has quit [Quit: bye]

20:07 maximbaz has joined #asahi-gpu

20:55 A_L_I_C_E has joined #asahi-gpu

21:03 A_L_I_C_E has quit [Quit: Quit]

21:16 maximbaz has quit [Quit: bye]

21:18 maximbaz has joined #asahi-gpu

21:20 zocker has quit [Ping timeout: 480 seconds]

21:38 zocker has joined #asahi-gpu

23:20 cylm has joined #asahi-gpu

23:33 cylm_ has joined #asahi-gpu

23:35 cylm has quit [Ping timeout: 480 seconds]