#panfrost on 2021-07-29 — irc logs at oftc.irclog.whitequark.org

2021-07-26 22:56 ChanServ changed the topic of #panfrost to: Panfrost - FLOSS Mali Midgard & Bifrost - Logs https://oftc.irclog.whitequark.org/panfrost - <macc24> i have been here before it was popular

00:43 <macc24> icecream95: do you have chromeos on your duet?

04:11 stano has quit [Read error: Connection reset by peer]

04:14 stano has joined #panfrost

04:40 <icecream95> macc24: yes...

05:28 Putti has joined #panfrost

07:13 rasterman has joined #panfrost

07:19 enunes has joined #panfrost

07:21 <bbrezillon> tomeu: can you try to add assert()s in pan_texture.c and pan_cs.c to make sure we're not passed X32_S8X24 or Z32_S8X24 formats

07:21 enunes has quit [Read error: Connection reset by peer]

07:22 enunes has joined #panfrost

07:23 <bbrezillon> those should be converted to dual-plane formats (Z32 + S8X24) by the gallium driver

07:24 <bbrezillon> tomeu: see https://elixir.bootlin.com/mesa/mesa-21.2.0-rc3/source/src/gallium/drivers/panfrost/pan_job.c
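
A minimal sketch of the kind of assert() bbrezillon is asking for, assuming a stand-in format enum. The `sketch_*` names are hypothetical and are not the real identifiers in pan_texture.c or pan_cs.c; the point is only that the common code refuses interleaved Z32/S8X24 formats, which the gallium driver is expected to have split into two planes already.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical stand-in for the driver's format enum; only the formats
 * relevant to the discussion are modelled. */
enum sketch_format {
    SKETCH_FORMAT_Z32_FLOAT,
    SKETCH_FORMAT_S8X24_UINT,
    SKETCH_FORMAT_Z32_FLOAT_S8X24_UINT,
    SKETCH_FORMAT_X32_S8X24_UINT,
    SKETCH_FORMAT_R8G8B8A8_UNORM,
};

/* True for the interleaved depth/stencil formats that should never reach
 * the common texture / command-stream code. */
static bool
sketch_format_is_interleaved_zs(enum sketch_format fmt)
{
    return fmt == SKETCH_FORMAT_Z32_FLOAT_S8X24_UINT ||
           fmt == SKETCH_FORMAT_X32_S8X24_UINT;
}

/* Hypothetical entry point: the assert catches a caller that forgot to
 * convert to the dual-plane (Z32 + S8X24) layout. */
static void
sketch_emit_texture(enum sketch_format fmt)
{
    assert(!sketch_format_is_interleaved_zs(fmt));
    /* ... descriptor packing would go here ... */
    (void)fmt;
}
```

With asserts enabled, passing one of the interleaved formats to `sketch_emit_texture` aborts immediately, which is what makes a missed conversion easy to spot in a debug build.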

07:40 megi has quit [Quit: WeeChat 3.2]

07:41 megi has joined #panfrost

07:46 enunes has quit [Quit: ZNC - https://znc.in]

07:48 enunes has joined #panfrost

08:08 <tomeu> bbrezillon: will do!

08:10 atler has quit [Ping timeout: 480 seconds]

08:51 atler has joined #panfrost

08:53 <robmur01> macc24: FWIW, compile-time pipeline scheduling is a fundamental of VLIW architectures which used to be popular for CPUs/DSPs, and was also one of the reasons for IA-64 being so successful...

08:53 wwilly has quit []

08:54 <urja> ... successful.

08:58 <robmur01> the extent of its success, yes ;)

09:27 wwilly has joined #panfrost

09:42 camus has joined #panfrost

09:54 camus has quit [Ping timeout: 480 seconds]

10:00 <macc24> robmur01: i also vaguely recall some nvidia cpu(denver?) being vliw

10:02 <robmur01> Quite possibly - I know Transmeta Crusoe was, and Denver's the same kind of deal

10:28 ids1024 has quit [Ping timeout: 480 seconds]

10:38 ids1024 has joined #panfrost

10:41 camus has joined #panfrost

10:50 davidlt has joined #panfrost

10:54 camus has quit [Ping timeout: 480 seconds]

11:04 camus has joined #panfrost

11:06 Net147 has quit [Quit: Quit]

11:07 Net147 has joined #panfrost

11:26 camus has quit [Ping timeout: 480 seconds]

11:30 robmur01 has quit [Read error: Connection reset by peer]

11:31 robmur01_ has joined #panfrost

11:41 robmur01_ is now known as robmur01

12:48 nlhowell has joined #panfrost

13:20 indy has joined #panfrost

13:22 ezequielg has quit []

13:22 ezequielg has joined #panfrost

13:49 camus has joined #panfrost

14:29 nlhowell has quit [Ping timeout: 480 seconds]

16:00 ezequielg has quit []

16:01 ezequielg has joined #panfrost

16:07 nlhowell has joined #panfrost

16:19 <HdkR> macc24: robmur01: Denver/Carmel is VLIW yes

16:21 <macc24> HdkR: is there linux port for that without any hardware translation?

16:26 <HdkR> You can't

16:27 <macc24> why

16:27 <HdkR> It's in violation of ARM's license to expose another ISA through the ARM path, so you just never get it exposed.

16:28 <macc24> D:

16:28 <HdkR> Also as itanium proved, you don't want VLIW in your CPU

16:28 <macc24> why?

16:29 <HdkR> Hand written VLIW is a major pita

16:29 <alyssa> HdkR: Unless you're Apple

16:29 <macc24> still, why vliw is bad

16:30 <HdkR> VLIW good. Just bad when you have any form of developer interaction with it :P

16:30 <macc24> i mean, there's a reason why everyone moves away from it lol

16:30 <macc24> terascale amd gpus were vliw iirc

16:31 <robclark> For general purpose CPUs, things are unpredictable enough that you don't really want vliw, you want more traditional superscalar and OoO.. the compiler doesn't have enough knowledge to dtrt with vliw

16:32 <robclark> for more niche cases where the compiler *can* know enough, vliw is nicer

16:33 <HdkR> You can also end up in a situation where, even if your VLIW ISA is backwards compatible, your hardware changes the scheduling model to make your previously written code now wildly inefficient or broken :D

16:33 <macc24> wtf

16:33 <alyssa> how about multi-issue in-order?

16:35 <HdkR> alyssa: Whatcha mean? Issuing multiple VLIW packets in order?

16:35 <macc24> alyssa: *whispers* midgard?

16:35 <alyssa> macc24: AGX, we think

16:36 <alyssa> HdkR: Encoding and assembly "looks" like a purely in-order scalar processor

16:36 <macc24> ah, agxgard

16:36 <alyssa> but if you have the same 'type' of instruction back to back, they can be executed faster

16:36 <HdkR> ooo, cute. Doing the nvidia thing

16:36 <alyssa> but the hw won't search out of order for instructions to execute

16:37 <robclark> in-order makes sense for GPUs where you care more about "wide" than "in-order-fast"

16:37 <alyssa> so the compiler /should/ reorder for optimal perf, but does not /have/ to for correctness

16:37 <alyssa> (and in particular, if you can't fill slots you don't waste i-cache space padding with nops)
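
A toy cost model of the "multi-issue in-order" idea alyssa describes, assuming a made-up rule: two adjacent instructions of the same type issue together in one cycle, everything else takes a cycle each, and the hardware never looks past the next instruction. This is a sketch under that assumption, not a description of AGX or any other real GPU; the unit types and `sketch_count_cycles` are hypothetical.

```c
#include <stddef.h>

/* Hypothetical instruction classes for the toy model. */
enum sketch_unit { SKETCH_UNIT_FLOAT, SKETCH_UNIT_INT, SKETCH_UNIT_MEM };

/* Count cycles for a strictly in-order stream. Assumed rule: if the next
 * instruction has the same type as the current one, both issue in the
 * same cycle; otherwise the current instruction issues alone. No search
 * beyond the immediate neighbour is done, so the order the compiler emits
 * decides how many pairs form. */
static unsigned
sketch_count_cycles(const enum sketch_unit *prog, size_t n)
{
    unsigned cycles = 0;
    size_t i = 0;

    while (i < n) {
        if (i + 1 < n && prog[i] == prog[i + 1])
            i += 2; /* pair issues together */
        else
            i += 1; /* issues alone */
        cycles++;
    }

    return cycles;
}
```

Under this model, four mutually independent instructions ordered float, int, float, int take 4 cycles, while the same four reordered as float, float, int, int take 2. Either order is valid in-order code, which is the "should reorder for perf, doesn't have to for correctness" point, and no nop padding is needed when a pair cannot be formed.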

16:38 <robclark> s/"in-order-fast"/"single-threaded-fast"/

16:39 <HdkR> Need those out of order GPUs to make compiler devs' jobs easier :P

16:40 <robclark> sure, just get one of those riscv gpu things.. or did they switch to power?

16:43 <HdkR> I think they are still planning riscv because of how easy it is to add instructions

17:24 wwilly has quit [Remote host closed the connection]

17:38 atler is now known as Guest2719

17:38 atler has joined #panfrost

17:40 Guest2719 has quit [Ping timeout: 480 seconds]

17:47 wwilly has joined #panfrost

17:47 <bbrezillon> alyssa: I got rid of all _packed objects in panvk's non per-gen files, but it looks like we have the same issue in the gallium driver

17:48 <bbrezillon> (pan_context.h)

17:48 <alyssa> uhh

17:49 <alyssa> i thought i fixed that

17:49 <alyssa> uhm. guess not. I'll deal with it

17:49 <alyssa> i assume you need review on the core bits in the mean time..?

17:50 <bbrezillon> I'd really like to go through one full conversion before posting the MR

17:51 <alyssa> Alright

17:52 <alyssa> In that case -

17:52 <alyssa> rasterizer, zsa, sampler -- all move to pan_cmdstream.c

17:52 <bbrezillon> ack

17:53 <alyssa> panfrost_sampler_view_destroy -- move to pan_cmdstream.c, and then sampler_view itself goes to pan_cmdstream.c

17:53 <alyssa> you'll need forward decls

17:55 <alyssa> panfrost_shader_state is the only tricky one, but given the RSD is partial anyway, probably easiest to just u32[] and STATIC_ASSERT and call it a day

17:55 <alyssa> (or just leave it as is -- RSD is the same size from t604 through g76... will need to change for valhall but valhall needs more invasive changes anyway)

17:55 <alyssa> that should be it for _packed

17:56 <alyssa> ack? 😃
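
A rough sketch of the "just u32[] and STATIC_ASSERT" suggestion for panfrost_shader_state, assuming a made-up descriptor length. `SKETCH_RSD_LENGTH`, `sketch_rsd_packed` and `sketch_shader_state` are hypothetical stand-ins; in the real tree the length and the packed type would come from the generated per-generation headers.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical descriptor size in bytes, standing in for the generated
 * per-gen constant. */
#define SKETCH_RSD_LENGTH 64

/* Stand-in for the per-gen packed descriptor type that common headers
 * should no longer mention. */
struct sketch_rsd_packed {
    uint32_t opaque[SKETCH_RSD_LENGTH / 4];
};

/* Gen-independent state: the partial RSD is stored as raw words, so this
 * header does not need the per-gen packed type at all. */
struct sketch_shader_state {
    uint32_t partial_rsd[SKETCH_RSD_LENGTH / 4];
};

/* In a per-gen file, where the packed type is visible, check that the raw
 * storage really has the same size as the descriptor it stands in for. */
_Static_assert(sizeof(((struct sketch_shader_state *)0)->partial_rsd) ==
               sizeof(struct sketch_rsd_packed),
               "raw RSD storage must match the packed descriptor size");
```

The static assert turns a size mismatch on a new GPU generation into a compile error rather than silent memory corruption.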

17:57 <macc24> alyssa: uh do you know if anyone used panfrost under android?

17:58 <alyssa> macc24: globallogic has done so (with glodroid), haven't built it myself though

17:58 <bbrezillon> alyssa: should be good

17:58 <bbrezillon> thx

17:58 <alyssa> 👍

17:58 <macc24> alyssa: may or may not be doing similar thing

17:58 <macc24> but with chromebooks

17:58 <alyssa> macc24: Collabora will likely do a demo at some point but linux is more interesting 😉

17:59 <macc24> alyssa: what demo

17:59 <alyssa> as for chromebooks, er

17:59 <macc24> ah

17:59 <alyssa> tomeu: I think you had a chromeos panfrost build at some point? maybe?

18:00 <macc24> alyssa: don't even think about demoing the dumpster fire i call cadmium xD

18:00 <alyssa> macc24: I meant an android panfrost demo at some point

18:00 <macc24> ah cool

18:01 <alyssa> i'm scared of android but you know

18:01 <alyssa> someone's gotta do it

18:02 <macc24> alyssa: well... you should be scared xD

18:03 <macc24> android games probably are doing stupid gles stuff xD

18:03 <alyssa> Not worried about that, just about compiling AOSP on a chromebook ;p

18:03 <robclark> alyssa: speaking of android, I found a way to get better fps/gfx on a bunch of android games ...

18:03 <robclark> https://www.irccloud.com/pastebin/c39QlkGy/

18:03 <robclark> :-P

18:04 <macc24> robclark: WHY

18:04 <alyssa> robclark: 🤔

18:04 <HdkR> lol

18:04 <robclark> (a bunch of games are putting GPUs in performance brackets and when they see a GPU they don't recognize they put it in lowest bracket, artificially limiting framerate and gfx features)

18:05 <alyssa> garbage :p

18:05 <robclark> I'm working on a driconf so we can override GL_VENDOR/GL_RENDERER on a per-game basis

18:05 <robclark> yes, it is

18:05 <macc24> robclark: thanks i hate it

18:05 <robclark> (I won't use mali, I'll use qc values, ofc.. that was just an experiment when I was comparing some game to krane)

18:06 <HdkR> When Tegra was on the market, there were a BUNCH of games that check for tegra and only enable some effects in that case :(

18:06 <urja> ha... i once hacked the GLESv2 driver on the jolla to report a different (less weird? don't remember if it was slightly up or down ...) Adreno for GTA Vice city to work ... had some bug workaround tied to just a couple specific adrenos ...

18:08 <robclark> The funny thing, at least one game was failing to differentiate G72MP3 from G72MP12 and putting krane in a performance bracket where it totally didn't belong

18:08 <HdkR> oops

18:08 <HdkR> That bit of software not understanding how Mali GPUs scale :D

18:09 <robclark> yeah

18:10 <HdkR> G78MP24 totally the same perf bracket as G78MP7...
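
A small sketch of how a game-side bracket lookup could produce both behaviours described above, assuming it matches GL_RENDERER by prefix. The table, the renderer strings and the bracket assignments are all made up for illustration; no real game's logic is known here.

```c
#include <string.h>

enum sketch_bracket {
    SKETCH_BRACKET_LOW,
    SKETCH_BRACKET_MID,
    SKETCH_BRACKET_HIGH,
};

struct sketch_rule {
    const char *prefix;
    enum sketch_bracket bracket;
};

/* Hypothetical table of known GPUs, keyed on a renderer-string prefix. */
static const struct sketch_rule sketch_rules[] = {
    { "Mali-G72", SKETCH_BRACKET_HIGH },
    { "Adreno (TM) 6", SKETCH_BRACKET_HIGH },
    { "Mali-T8", SKETCH_BRACKET_MID },
};

/* Prefix match against the table. Two flaws are deliberate: an unknown
 * renderer falls through to the lowest bracket, and anything after the
 * prefix (such as a core count) is ignored. */
static enum sketch_bracket
sketch_classify(const char *renderer)
{
    size_t count = sizeof(sketch_rules) / sizeof(sketch_rules[0]);

    for (size_t i = 0; i < count; i++) {
        const char *prefix = sketch_rules[i].prefix;

        if (strncmp(renderer, prefix, strlen(prefix)) == 0)
            return sketch_rules[i].bracket;
    }

    return SKETCH_BRACKET_LOW;
}
```

With this table a hypothetical "Mali-G72 MP3" and "Mali-G72 MP12" land in the same bracket, and a renderer string the game has never seen is capped at the lowest one, which is why overriding the reported string changes what the game enables.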

18:10 <alyssa> robclark: what i don't understand is why it doesn't put all malis in the lowest perf bracket 🙃

18:10 <macc24> alyssa: mali g72 mp2137 would be pretty fast

18:11 <alyssa> macc24: no, it would be tiler bottlenecked

18:11 <macc24> alyssa: but not slow

18:11 <alyssa> yes, slow

18:11 <alyssa> for higher geometry counts

18:11 <robclark> but it would be great at fullscreen quads :-P

18:11 <alyssa> that.. yes, it would be :-p

18:11 <alyssa> so i guess unironically ray tracing would be good

18:12 <macc24> robclark: or 2 fullscreen triangles

18:12 <HdkR> Just like Nvidia can't easily scale a design down to G57 levels, mobile GPUs can't easily scale up to Titan levels :>

18:12 <alyssa> though you'll quickly be mobile bw bound

18:12 <robclark> alyssa: the shadertoy edition ;-)

18:12 <alyssa> there we go

18:13 <alyssa> so i've been writing python to parse some xml and generate c code

18:13 <alyssa> i think i've hit the cliff where it's faster to just write the c code and skip the python and xml ;v

18:20 atler has quit [Ping timeout: 480 seconds]

18:26 atler has joined #panfrost

18:44 wwilly_ has joined #panfrost

18:51 wwilly has quit [Ping timeout: 480 seconds]

18:57 atler is now known as Guest2728

18:58 atler has joined #panfrost

19:04 Guest2728 has quit [Ping timeout: 480 seconds]

19:08 davidlt has quit [Ping timeout: 480 seconds]

19:40 <macc24> alyssa: how much do you think vr would kill mali g72?

19:55 <alyssa> yes.

19:55 <alyssa> meh. g72 with the ddk should be ok.

19:56 <alyssa> unless you're sensitive to motion sickness..

19:56 <macc24> hmmmmmmmmmmmmmm

19:56 <alyssa> I seem to recall doing google cardboard style AR with a mali t628 phone in 2015.

19:56 <alyssa> then again. do not recommend :-p

19:57 <macc24> i was doing cardboard vr with adreno 330 phone xD

19:57 <alyssa> actually maybe this was a330 as well. dunno

19:59 rasterman has quit [Quit: Gettin' stinky!]

20:26 atler is now known as Guest2738

20:26 atler has joined #panfrost

20:28 Guest2738 has quit [Ping timeout: 480 seconds]

20:30 rasterman has joined #panfrost

21:23 rasterman has quit [Quit: Gettin' stinky!]

21:42 kenzie has quit [Quit: The Lounge - https://thelounge.chat]

21:48 kenzie has joined #panfrost

21:48 <alyssa> $ bifrost_compile --gpu=G72 compile shader.frag

21:49 <alyssa> configurable targets now :]

21:49 <alyssa> (and yes I have a branch where --gpu=G78 works ;) )

21:51 <macc24> the best bifrost gpu - G78

21:53 <alyssa> valhall is what happens if you make bifrost useful.

21:53 <macc24> valhall is what bifrost should have been?

21:54 <alyssa> something like that :-p

21:54 <macc24> well then, the mali g57 in mt8192 better be fast ;)

22:17 <macc24> fun fact

22:17 <macc24> some mt8192 machine will have keyboard backlight

22:45 <alyssa> Heh, are those back in style?

22:46 <macc24> apparently

22:46 <macc24> alyssa: some lazors have it too

22:46 <alyssa> any reliable sources about when valhall chromebooks are coming, on the subject of vapid speculation ? :p

22:46 <macc24> unfortunately no

22:47 <alyssa> the Internet says Q2'21

22:47 <alyssa> but like

22:47 <alyssa> we're already a month into Q3 lol

22:47 <alyssa> are we blaming this on the pandemic too?

22:47 <macc24> also i'm going on holidays between 10 and end of august and won't be reachable from irc

22:47 <macc24> alyssa: yea i guess

22:48 <robclark> I don't think I can go back to a laptop that *didn't* have kb bl

22:48 <macc24> alyssa: all i can say is soon™ as chromeos kernel 5.4 /has/ support for asurada

22:49 <macc24> robclark: im planning on building a thinklight thing but for chromebooks xD

22:50 <robclark> all my lazors have kb bl already

22:56 atler has quit [Read error: Connection reset by peer]

23:37 camus has quit []

23:39 atler has joined #panfrost