#etnaviv on 2020-06-08 — irc logs at oftc.irclog.whitequark.org

2020-05-12 17:40 austriancoder changed the topic of #etnaviv to: #etnaviv - the home of the reverse-engineered Vivante GPU driver - Logs https://freenode.irclog.whitequark.org/etnaviv

01:36 Jookia has quit [Ping timeout: 240 seconds]

02:03 pcercuei has quit [Quit: dodo]

06:15 lynxeye has joined #etnaviv

06:22 lynxeye has quit [Quit: lynxeye]

07:59 lynxeye has joined #etnaviv

08:41 lynxeye has quit [Quit: lynxeye]

08:57 lynxeye has joined #etnaviv

09:51 juanrubio_ has joined #etnaviv

10:01 pcercuei has joined #etnaviv

10:36 <mntmn> what is the motivation for choosing a specific depth format? i'm comparing a blob trace and etnaviv trace, and in this example, the blob chooses D16 and etnaviv/mesa D24S8

10:41 <lynxeye> mntmn: The driver doesn't choose the format, it's the application. Normally it asks about EGL configs with a minimum depth/stencil precision, then picks one of the options provided by the driver.

10:41 <lynxeye> Obviously lower bit depth results in less bandwidth usage, thus higher performance.

10:42 <lynxeye> Some dumb applications just pick the first option, so actual chosen format may depend on driver ordering of the options

10:48 <mntmn> ah, that would explain it. i didn’t specify the depth resolution in the test.
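
The behaviour lynxeye describes can be modelled with a small sketch. This is not the EGL API: the `Config` type, the `pick_first` and `pick_min_depth` helpers, and the two driver orderings are all hypothetical, chosen only to show why an application that takes the first offered config ends up with whatever the driver happens to list first.

```python
# Hypothetical model of config selection; names and orderings are illustrative,
# not taken from any real driver or from the EGL API.
from typing import NamedTuple


class Config(NamedTuple):
    name: str
    depth_bits: int
    stencil_bits: int


D16 = Config("D16", 16, 0)
D24S8 = Config("D24S8", 24, 8)

# Two made-up drivers that offer the same configs in a different order.
driver_a = [D16, D24S8]
driver_b = [D24S8, D16]


def pick_first(configs):
    """An application that ignores depth precision and takes the first entry."""
    return configs[0]


def pick_min_depth(configs, min_depth_bits):
    """An application that states a minimum depth precision and then takes
    the smallest (cheapest in bandwidth) config that satisfies it."""
    suitable = [c for c in configs if c.depth_bits >= min_depth_bits]
    return min(suitable, key=lambda c: c.depth_bits + c.stencil_bits)


print(pick_first(driver_a).name, pick_first(driver_b).name)
print(pick_min_depth(driver_a, 16).name, pick_min_depth(driver_b, 16).name)
```

With these made-up lists, the first-entry strategy yields a different format per driver, while the explicit minimum-depth request lands on D16 for both.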

11:48 berton has joined #etnaviv

12:01 lynxeye has quit [Remote host closed the connection]

12:01 lynxeye has joined #etnaviv

12:22 rhyskidd has joined #etnaviv

12:34 shoragan has quit [Ping timeout: 252 seconds]

12:35 pH5 has quit [Ping timeout: 260 seconds]

12:37 shoragan has joined #etnaviv

12:54 Jookia has joined #etnaviv

13:09 shoragan has quit [Ping timeout: 256 seconds]

13:10 shoragan has joined #etnaviv

13:33 shoragan has quit [Ping timeout: 272 seconds]

13:34 shoragan has joined #etnaviv

14:14 <mntmn> bumped the bounty to 1000 EUR https://gitlab.freedesktop.org/mesa/mesa/-/issues/3090

14:16 sravn has quit [Ping timeout: 260 seconds]

14:17 sravn has joined #etnaviv

14:46 lfa has joined #etnaviv

14:47 <pcercuei> mntmn: need some work on non-GPU related tasks? I'm for hire :)

14:48 <flto> mntmn: have you tried playing with the VIVS_RA_EARLY_DEPTH value? (we have a comment saying blob uses 0x40000031 on GC7000, and your trace has 0x50000031, 0x10000000 could be related to early-z disable)

14:56 <Marex> flto: I did that before, it didn't lead anywhere, but not on GC7000L all right

14:57 <flto> these would be new GC7000L bits so not relevant to older gpus

14:57 <mntmn> flto: yeah, i tried that. so 0x10000000 bit in RA_EARLY_DEPTH seems to disable depth testing (or writing) completely

14:58 <mntmn> flto: 0x40000000 doesn't make a difference
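
The register values being compared here differ in a single bit, which a quick XOR makes visible. This is only arithmetic on the two numbers quoted above; the `set_bits` helper and the constant names are hypothetical, and the sketch says nothing about what the hardware does with each bit.

```python
def set_bits(value):
    """Return the positions of the bits set in a 32-bit register value."""
    return [bit for bit in range(32) if value & (1 << bit)]


# Values quoted in the discussion (constant names are illustrative).
BLOB_COMMENT_VALUE = 0x40000031  # value the driver comment attributes to the blob
TRACE_VALUE = 0x50000031         # value seen in mntmn's trace

# XOR leaves only the bits that differ between the two values.
diff = BLOB_COMMENT_VALUE ^ TRACE_VALUE

print(hex(diff), set_bits(diff))
print(set_bits(TRACE_VALUE))
```

The only differing bit is bit 28 (0x10000000), the one mntmn reports as switching depth off entirely.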

14:58 <mntmn> i'm wondering if the blob trace i got was correct. what it's doing seems kind of impossible.

14:59 <mntmn> pcercuei: the biggest pain i have is etnaviv bugs, so i currently only need (paid) support for that

15:00 <lynxeye> mntmn: so hey, you found the new disable_zs bit position

15:00 <flto> mntmn: I have an old branch from working on passing gles3 tests and it has RA_EARLY_DEPTH set to 0x15000030 and BIT(18) set in PE_DEPTH_CONFIG, so maybe try that

15:00 <mntmn> lynxeye: ah. :D

15:01 <mntmn> flto: oh strange, ok i'll try

15:01 <mntmn> this is why i think the trace might be not the right one:

15:01 <mntmn> blob             etna

15:01 <mntmn> DEPTH_FUNC=0x7   DEPTH_FUNC=0x1

15:01 <mntmn> DEPTH_MODE=NONE  DEPTH_MODE=Z

15:01 <mntmn> WRITE_ENABLE=0   WRITE_ENABLE=1

15:02 <mntmn> lynxeye: what exactly does "zs" mean btw?

15:02 <lynxeye> z as in depth, s as in stencil

15:02 <flto> mntmn: looks like the BIT(18) in DEPTH_CONFIG is already upstream as part of something else, so try just the RA_EARLY_DEPTH value

15:03 <mntmn> lynxeye: ah so it’s one bit for disabling both z and stencil? is z another way of saying depth?

15:05 <lynxeye> yep, the Z-buffer in graphics means the depth buffer. As depth and stencil usually share the same buffer storage (as with format Z24S8) you have a single bit to disable all access to both

15:05 <mntmn> ahh, thanks for explaining!
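
To illustrate the shared storage lynxeye mentions, here is a minimal sketch of a 24-bit depth value and an 8-bit stencil value living in one 32-bit word. The bit layout (depth in the upper 24 bits, stencil in the lower 8) is an assumption made for the example, not a statement about how Vivante hardware or Mesa orders the fields, and both helper names are hypothetical.

```python
DEPTH_MASK = 0xFFFFFF  # 24 bits of depth
STENCIL_MASK = 0xFF    # 8 bits of stencil


def pack_z24s8(depth, stencil):
    """Pack depth and stencil into one 32-bit word.

    Assumed layout for illustration only: depth in bits 31..8,
    stencil in bits 7..0.
    """
    return ((depth & DEPTH_MASK) << 8) | (stencil & STENCIL_MASK)


def unpack_z24s8(word):
    """Split a packed 32-bit word back into (depth, stencil)."""
    return (word >> 8) & DEPTH_MASK, word & STENCIL_MASK


word = pack_z24s8(0xABCDEF, 0x12)
print(hex(word), unpack_z24s8(word))
```

Because every pixel's depth and stencil sit in the same word, touching either one means accessing the same buffer, which fits a single disable bit covering both.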

15:07 <JohnnyonFlame> any tool/doc/guide I can read on getting useful stats to identify slowpaths and whatnot on etnaviv? or should I just be using oprof+apitrace

15:16 <pcercuei> JohnnyonFlame: know about GALLIUM_HUD?

15:17 <JohnnyonFlame> I was thinking more in terms of like, more granular data than frametime alone

15:17 <JohnnyonFlame> frametime & drawcalls/frame I guess

15:18 <mth> perfetto was mentioned here recently, but I don't know the details

15:18 <cphealy> GALLIUM_HUD will give you frametime and drawcalls/frame.

15:18 <JohnnyonFlame> more than ftime and drawcalls/frame*

15:18 <mth> from what I understand, that's the front-end, so it still needs a back-end that actually collects traces and counters

15:18 <cphealy> You may want to use perf to see if there are any hotspots in Mesa under various use cases.

15:18 <cphealy> That would be on the CPU though.

15:19 <JohnnyonFlame> data on cpu overhead is a good enough start here I guess

15:19 <mth> does apitrace log time stamps too?

15:25 <Marex> and ETNA_MESA_DEBUG=nir , because TGSI generates poor shader programs

15:32 <lynxeye> flto: ... which reminds me: do we have any blockers left for flipping the switch on NIR?

15:34 <lynxeye> IIRC there were only some very rare shaders, which generated better instruction scheduling for texture fetches by chance on TGSI. Which IMHO isn't really a good reason to keep the default on TGSI.

15:38 <mntmn> flto: 0x15000030 doesn't make a difference except for the 0x10000000 which, as mentioned before, completely switches off Z

15:39 <flto> lynxeye: apparently there is an assert failing during gles2 tests now

15:45 <Marex> flto: only on gc7000l though

15:45 <Marex> flto: and a lot of them

15:53 <austriancoder> lynxeye: nir is not ready for prime time.. just use a debug build and try to run piglit or deqp

15:55 <Marex> austriancoder: I do quite often run dEQP on anything which isn't GC7000L (that one is broken)

15:56 <austriancoder> Marex: I have seen asserts on non gc7000l with piglit

15:58 <Marex> austriancoder: isolated to gc7000l, right ?

15:58 <austriancoder> JohnnyonFlame: GL_AMD_performance_monitor is supported to get some gpu related data (and the information is also available in GALLIUM_HUD)

15:58 <austriancoder> Marex: have seen an assert on GC3000 with piglit

15:59 <Marex> austriancoder: so maybe it would make sense to enable NIR on halti < 2 ?

15:59 <austriancoder> Marex: all or nothing

16:00 <Marex> austriancoder: that way, NIR will get basically no testing unless you know about it and there will be seldom any bug reports ... seems like a loss

16:00 <Marex> gc3000/7000l can be fixed later

16:00 <mntmn> soon we'll have >100 testers for NIR on GC7000: because i enable that by default ;)

16:00 <austriancoder> let's fix the asserts and avoid the complexity in the driver

16:01 <mntmn> i wonder if purism will ship with NIR enabled

16:01 <mntmn> agx_: do you know?

16:02 <flto> austriancoder: I guess some validation was added to NIR which makes etna_lower_io lowering of nir_intrinsic_load_uniform/ubo fail the validation.. it should be an easy fix

16:03 <Marex> would it make sense to add etnaviv into the mesa CI ?

16:03 <austriancoder> flto: jup

16:03 <austriancoder> Marex: there is something in the works

16:05 <Marex> austriancoder: it's been like that for a year or so, no ?

16:07 <austriancoder> Marex: that's why I did step up and do it on my own... I have a proof of concept running and pushing out my changes (some of them already landed in master)

16:08 <Marex> austriancoder: but no CI on push yet, to trap these new breakages ?

16:09 <lynxeye> Marex: maintaining the test runners has a non-zero cost in terms of time, which is why I haven't jumped on-board the CI hypetrain.

16:09 <Marex> lynxeye: well it's not hype, it prevents these kinds of breakages

16:09 <austriancoder> Marex: not yet.. but soon

16:09 <austriancoder> Marex: and I will start with gc2000

16:10 <austriancoder> lynxeye: ci is needed at the stage where etnaviv is now.. we need to catch regressions early

16:12 <lynxeye> I'm not telling anyone how to spend their time, I'm just saying that I don't have the bandwidth to babysit a farm of test-runners

16:13 <Marex> lynxeye: agreed

16:19 <Marex> austriancoder: nice

16:28 dv_ has quit [Ping timeout: 256 seconds]

16:40 <Marex> mntmn: btw your example requires WL and Xwayland on top of that ?

16:40 <mntmn> Marex: i don't think so?

16:41 <mntmn> Marex: minetest yes but my test case should run without

16:41 <Marex> maybe it's missing some FREEGLUT_WAYLAND=ON somewhere ?

16:41 <mntmn> i don't know, GLUT is not important there, it's just 2 triangles

16:41 <mntmn> maybe i should use another boilerplate

16:42 dv_ has joined #etnaviv

16:42 <Marex> dont worry about it, I just wanted to try it on GC2000

16:42 <mntmn> yeah i didn't even think about that this requires xwayland through glut, sorry

16:43 <mntmn> what is good for wayland, glfw or so? probably SDL2

16:43 <mntmn> i'll port it to SDL2

16:44 <Marex> https://github.com/yuq/gfx.git has some "set up context, draw, write to jpeg"

16:45 <mntmn> ok maybe later then

16:45 <Marex> mntmn: dont worry about it, I just wanted to try it

16:45 <Marex> to see whether the older GPUs are also affected

16:45 <mntmn> i think they are not

16:45 <mntmn> i was playing minetest on GC3000 and it was fine

16:46 <Marex> so it's isolated to gc7000l, ok

16:46 <austriancoder> Marex: it is related to gc7000l

16:48 <daniels> mntmn: I might be able to get another ready-made sample which demonstrates the same bug

16:48 <Marex> austriancoder: all right

16:49 <Marex> austriancoder: btw that gfx above is quite useful

16:49 <austriancoder> Marex: I know.. I shared this link with you ;)

16:52 <austriancoder> daniels: mntmn: https://github.com/austriancoder/freedreno/commits/discard

16:54 <mntmn> daniels: i just read the collabora article on panfrost, seems like a similar problem was fixed there?

16:55 <mntmn> austriancoder: cool

16:56 <daniels> mntmn: hence why I have a reproducer :P

16:56 <daniels> have asked if it's clear to redistribute

16:57 <mntmn> daniels: nice

17:28 <daniels> http://www.rasterman.com/files/earlyz-bug.tgz

17:33 <mntmn> thanks daniels!

17:33 <daniels> np

17:38 <daniels> heh, though I guess it won't work on sway since it depends on wl_shell

17:38 <mntmn> weird, segfaults in wl_proxy_marshal_constructor() on my intel/debian/sway desktop

17:38 <mntmn> ha, good timing daniels

17:39 <mntmn> good that i use sway everywhere...

17:43 lynxeye has quit [Quit: lynxeye]

18:01 <daniels> mntmn: https://static.fooishbar.org/tmp/earlyz-bug-xdg-shell.tar.gz

18:03 <mntmn> daniels: almost! > xdg_wm_base@5: error 4: wrong configure serial: 1500426

18:05 <daniels> mntmn: damn, that's what I get for not testing on Weston - Mutter is rather permissive here. fixed, pull it again

18:08 <mntmn> daniels: works!

18:08 <daniels> \o/

18:08 <mntmn> that looks pretty cool, even. gonna try it on the imx8mq now

18:10 <mntmn> yep, that shows the problem nicely. thank you daniels!

18:10 <daniels> np! hope the added shininess hopes you attract someone to fix :P

18:10 <daniels> *helps

18:11 <mntmn> haha yeah! will update the issue

18:11 <mntmn> Marex: there's something you can try on GC2000 now ;)

18:19 JohnnyonFlame has quit [Read error: Connection reset by peer]

18:29 JohnnyonFlame has joined #etnaviv

18:31 JohnnyonF has joined #etnaviv

18:34 JohnnyonFlame has quit [Ping timeout: 264 seconds]

18:34 Johnny_ has joined #etnaviv

18:34 <Marex> mntmn: just a second, I need to install about a gigabyte of dependencies for the build system (meson, ninja) ...

18:38 JohnnyonF has quit [Read error: Connection reset by peer]

18:39 Johnny_ has quit [Read error: Connection reset by peer]

18:40 <mntmn> Marex: haha

18:40 <mntmn> i'm always extremely happy when i see a meson.build because it means it will build reasonably fast on the target

18:41 JohnnyonFlame has joined #etnaviv

18:42 <Marex> mntmn: faster than running gcc ... ? :)

18:44 JohnnyonF has joined #etnaviv

18:44 <Marex> mntmn: I rewrote it into that gfx thing, so let's see

18:44 <mntmn> Marex: faster than running make

18:44 JohnnyonFlame has quit [Read error: Connection reset by peer]

18:52 <daniels> Meson doesn't need anything outside Python core, ninja similarly (only libc and libstdc++); its package install size on Debian is 300kB which includes all the docs and changelogs ...

18:53 <daniels> (make is 1.5MB)

18:53 <mntmn> almost fits on a floppy disk

18:57 <Marex> except for the python :)

19:10 <daniels> the subset of Python that Meson depends on is a 640kB install :P

19:11 <Marex> mntmn: ok yep, I have the same test on this gfx thing, except it's much simpler

19:11 <mntmn> Marex: so, does it work on GC2000?

19:12 <Marex> mntmn: I think so

19:12 <Marex> mntmn: if I rewrote the test right

19:25 <mntmn> Marex: i posted screenshots in that issue ticket

21:14 berton has quit [Quit: Leaving]

22:29 nathanhi has quit [Ping timeout: 260 seconds]

22:32 nathanhi has joined #etnaviv

22:47 nathanhi has quit [Ping timeout: 272 seconds]

22:48 nathanhi has joined #etnaviv

23:09 adjtm_ has joined #etnaviv

23:10 adjtm has quit [Ping timeout: 256 seconds]

23:16 pcercuei has quit [Quit: Lost terminal]

23:22 pcercuei has joined #etnaviv

23:24 pcercuei has quit [Client Quit]

23:30 pcercuei has joined #etnaviv