ChanServ changed the topic of #panfrost to: Panfrost - FLOSS Mali Midgard & Bifrost - Logs https://oftc.irclog.whitequark.org/panfrost - <macc24> i have been here before it was popular
JulianGro has quit [Remote host closed the connection]
jambalaya has quit [Ping timeout: 480 seconds]
jernej has joined #panfrost
jambalaya has joined #panfrost
<alyssa>
cphealy: Doing a full glmark run
<alyssa>
Results are making my jaw drop
<alyssa>
uhhh except for -bjellyfish apparently
<alyssa>
which er went from 814fps to 4fps
<alyssa>
I should strongly consider investigating
<alyssa>
jellyfish is fine now ..
<alyssa>
I see faulting in dmesg but why won't it reproduce?!
<cphealy>
35% good? OMG, that's amazing!
<cphealy>
Do you mean the overall glmark2-es2 score is up by 35% or just a subset of tests?
<alyssa>
cphealy: overall glmark2 score was 976->1014
<alyssa>
but -bjellyfish regressed from 814->4 due to some regression
<alyssa>
^functional
<alyssa>
For some reason it doesn't reproduce when run in isolation >.<
<alyssa>
[ 2995.906068] panfrost ffe40000.gpu: Unhandled Page fault in AS0 at VA 0x00000000000701E0
<alyssa>
access type 0x2: READ
<alyssa>
source id 0x300
<alyssa>
...isn't that general purpose read?
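For anyone who wants to pull the same fields out of their own dmesg, here is a minimal sketch that parses the three lines pasted above. The regular expressions are assumptions based only on the text shown here, and `parse_fault` is a hypothetical helper, not part of any panfrost tooling. It extracts the raw values and does not try to interpret what source id 0x300 means.

```python
import re

# The three lines pasted above, joined as one dmesg excerpt.
SAMPLE = """\
[ 2995.906068] panfrost ffe40000.gpu: Unhandled Page fault in AS0 at VA 0x00000000000701E0
access type 0x2: READ
source id 0x300
"""

# Patterns inferred from the pasted text only (assumption, not a kernel spec).
FAULT_RE = re.compile(
    r"Unhandled Page fault in AS(?P<as_nr>\d+) at VA (?P<va>0x[0-9A-Fa-f]+)"
)
ACCESS_RE = re.compile(r"access type (?P<code>0x[0-9A-Fa-f]+): (?P<name>\w+)")
SOURCE_RE = re.compile(r"source id (?P<id>0x[0-9A-Fa-f]+)")


def parse_fault(text):
    """Return the fault fields as a dict, or None if any field is missing."""
    fault = FAULT_RE.search(text)
    access = ACCESS_RE.search(text)
    source = SOURCE_RE.search(text)
    if not (fault and access and source):
        return None
    return {
        "address_space": int(fault.group("as_nr")),
        "va": int(fault.group("va"), 16),
        "access_code": int(access.group("code"), 16),
        "access_name": access.group("name"),
        "source_id": int(source.group("id"), 16),
    }


fault = parse_fault(SAMPLE)
print(fault)
```

Because each pattern is searched independently, extra lines between the three fields in a real log should not break the parse.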
<cphealy>
interesting... You've definitely got me excited about doing some more benchmarking in my env.
<cphealy>
A couple months back, we went down the rabbit hole of identifying the cause of the regression with "[buffer] columns=200:interleave=false:update-dispersion=0.9:update-fraction=0.5:update-method=subdata" that took it from 50fps to 3. IIRC, it was CPU bound for some reason and you had a proposed fix. Can you remind me what happened with that?
<alyssa>
Okay, fixed the bug, was something stupid (off-by-one)
<alyssa>
Unfortunately, -bjellyfish fps is hurt regardless
<cphealy>
I assume not from 814->4 though?
<alyssa>
haha no
<alyssa>
829->775
<alyssa>
jellyfish.vert offers a clue why
<alyssa>
Both gl_Position and the varyings depend on a common variable `pos4`, which is expensive to calculate
<alyssa>
In theory the compiler could detect this case and bail but it's not obvious what heuristic would be best here
<alyssa>
Also in theory the intermediate result could be saved but this would end up significantly slower I think
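The cost alyssa describes can be sketched as a small dependency-graph calculation. This assumes the change computes gl_Position and the varyings in separate shader parts, so anything both depend on is computed twice; the chat implies this but does not spell it out. The graph, the value names other than `pos4` and `gl_Position`, and all costs are made up for illustration and are not taken from jellyfish.vert or from the Panfrost compiler.

```python
# Hypothetical dependency graph for a vertex shader shaped like the one
# described above. Each value maps to (cost, inputs). Illustrative only.
SHADER = {
    "pos4":        (20, []),        # expensive shared computation
    "gl_Position": (2,  ["pos4"]),
    "v_color":     (3,  ["pos4"]),  # hypothetical varying
    "v_texcoord":  (1,  []),        # hypothetical varying
}


def closure(graph, outputs):
    """Return every value needed to compute the given outputs."""
    needed = set()
    stack = list(outputs)
    while stack:
        name = stack.pop()
        if name in needed:
            continue
        needed.add(name)
        stack.extend(graph[name][1])
    return needed


def split_cost(graph, position_outputs, varying_outputs):
    """Compare the cost of one combined shader with two separate parts."""
    pos = closure(graph, position_outputs)
    var = closure(graph, varying_outputs)
    shared = pos & var

    def cost(names):
        return sum(graph[n][0] for n in names)

    return {
        "shared": shared,
        "unsplit": cost(pos | var),      # everything computed once
        "split": cost(pos) + cost(var),  # shared values computed twice
        "duplicated": cost(shared),
    }


report = split_cost(SHADER, ["gl_Position"], ["v_color", "v_texcoord"])
print(report)
```

A "detect and bail" heuristic could compare `duplicated` against some threshold, but as noted above it is not obvious what that threshold should be.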
<alyssa>
I think it makes sense to eat a 7% decrease in fps on a single scene when there are so many scenes that are up 30% etc
<cphealy>
Seems like a decent tradeoff if there is no better option.
<alyssa>
and actually I'm not so sure about those numbers,
<alyssa>
I just got 817 on this run and had 814 before so who knows
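To put the figures quoted so far side by side, a quick sketch of the relative changes. The labels are descriptive only and `percent_change` is a throwaway helper; the numbers are the ones from the messages above.

```python
def percent_change(before, after):
    """Relative change from before to after, in percent."""
    return (after - before) / before * 100.0


# (before, after) pairs as quoted in the chat.
runs = {
    "glmark2 overall score": (976, 1014),
    "-bjellyfish, after the fix": (829, 775),
    "-bjellyfish, later run": (814, 817),
}

changes = {name: percent_change(b, a) for name, (b, a) in runs.items()}
for name, pct in changes.items():
    print(f"{name}: {pct:+.1f}%")
```

The last pair differs by well under one percent, which fits the point that run-to-run noise may account for some of the jellyfish difference.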
<cphealy>
As a datapoint, when I compare jellyfish with vendor DDK numbers, I'm seeing that Mesa runs at 76% of the DDK's fps. (This is before your code change.)
<alyssa>
Alright
<alyssa>
I'd be curious what the DDK trace looks like
<alyssa>
You up for some clean r/e fun? :-p
<cphealy>
That's DDK vs Mesa 21.2, where DDK had 393 FPS and Mesa had 300 FPS.