<dt9>
danvet: regarding your question - no, no one rewrote gem_exec_schedule yet
<danvet>
dt9, I'll cc you and adixit on some patch
<danvet>
I think I'll just do a quick hack
<danvet>
for my problem
<danvet>
but if my understanding is correct, that test should be converted to softpin unconditionally
<danvet>
since relocations are getting in the way of the test logic
<danvet>
and there are some not-entirely-convincing hacks to avoid the issues
<danvet>
so softpin everywhere for that test will maybe make it a bit more reliable
<dt9>
danvet: relocations implicitly help keep the pipeline busy; with softpin it's not so easy to do that, because after closing/freeing an offset (in the allocator) we can stall the pipeline - a second bo will get the same offset (previously freed by the first bo), so it has to wait for vma reuse
<dt9>
we can use pseudo-allocations with incremented offsets (based on the sizes of previous allocations) to get behavior similar to relocations
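A minimal sketch of the pseudo-allocation idea described above, assuming nothing about the real IGT allocator API (pseudo_alloc and next_offset are made-up names): offsets only ever grow, so a fresh bo never lands on an address that a just-freed bo is still vacating.

```c
#include <stdint.h>

/* Hypothetical helper, not the IGT allocator API: hand out monotonically
 * increasing offsets sized to each allocation, so freed addresses are never
 * recycled and nothing has to wait for vma reuse. */
static uint64_t next_offset;

static uint64_t pseudo_alloc(uint64_t size, uint64_t alignment)
{
	/* alignment is assumed to be a power of two */
	uint64_t offset = (next_offset + alignment - 1) & ~(alignment - 1);

	next_offset = offset + size;	/* never hand out an earlier offset again */
	return offset;
}
```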
<danvet>
dt9, with this test it's the other way round
<danvet>
the relocations can cause stalls, so we need to assign fixed addresses for all buffers upfront
<danvet>
maybe the testcase even needs to be converted to use hardcoded addresses (it's kinda doing that right now)
<dt9>
danvet: which subtest do you mean?
<danvet>
anything that uses __store_dword() iirc
<dt9>
one of the tests I rewrote in my private branch uses a similar store_dword()
<dt9>
I can check how much effort is required to quickly change this to softpin
<danvet>
dt9, I don't think it's a case of "quickly"
<danvet>
and I think my change (need to recheck my analysis) is a really small change
<dt9>
danvet: yes, but the change to migrate to softpin is not straightforward and requires changes in the spinner (I still have this on a private branch)
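For reference, the softpin side of this discussion: instead of passing a relocation array, the test pins each object at a fixed GTT address with EXEC_OBJECT_PINNED, so the kernel never has to patch (and potentially stall on) the batch. A rough sketch against the upstream i915 uapi; the helper name and the choice of address are made up.

```c
#include <stdint.h>
#include <string.h>
#include <drm/i915_drm.h>

/* Sketch: describe a softpinned BO for execbuf.  gtt_addr is chosen by the
 * test up front; EXEC_OBJECT_PINNED tells the kernel it must place the
 * object exactly there, so no relocations are needed. */
static void exec_object_softpin(struct drm_i915_gem_exec_object2 *obj,
				uint32_t handle, uint64_t gtt_addr)
{
	memset(obj, 0, sizeof(*obj));
	obj->handle = handle;
	obj->offset = gtt_addr;			/* fixed address, known up front */
	obj->flags = EXEC_OBJECT_PINNED |
		     EXEC_OBJECT_SUPPORTS_48B_ADDRESS;
	/* relocation_count stays 0: nothing for the kernel to patch */
}
```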
<Sumera>
melissawen, danvet: what is a good way to debug memory errors?
<Sumera>
I have a feeling it's because kfree() is not being called somewhere, but I tried changing that, still no show :/
<danvet>
Sumera, hm I'd dump how big the allocation is
<danvet>
maybe we're trying a huge resolution
<danvet>
above what kzalloc can allocate
<danvet>
then compare which allocations work and which don't, if it's only the big ones that fail, that's probably the bug
<danvet>
if it's random, then there's another reason
<danvet>
but if that's all you get, you're probably over the kmalloc limit
<danvet>
since if we're actually running low on memory there's usually a big splat of additional information from the allocator
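A hedged sketch of the instrumentation suggested here: print the requested size right before the allocation so the failing cases can be compared against the successful ones. Function and parameter names are placeholders, not the actual driver code.

```c
#include <linux/slab.h>
#include <linux/printk.h>

/* Placeholder names, for illustration only: dump the size that ends up in
 * kzalloc() so it can be correlated with which resolutions fail. */
static void *alloc_fb_data(unsigned int width, unsigned int height,
			   unsigned int cpp)
{
	size_t size = (size_t)width * height * cpp;
	void *data;

	pr_info("allocating %zu bytes for %ux%u (cpp=%u)\n",
		size, width, height, cpp);

	data = kzalloc(size, GFP_KERNEL);
	if (!data)
		pr_err("kzalloc of %zu bytes failed\n", size);

	return data;
}
```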
<Sumera>
danvet: this happens only for the virtual_hw case tho, won't the memory being requested be the same for both the virtual and non-virtual cases?
<Sumera>
I will check the size being allocated in the meanwhile and get back to you in some time
<danvet>
Sumera, hm maybe, I'd check to be sure
<danvet>
it's just that usually when kmalloc fails, you get a few pages of allocator dumps in dmesg
<danvet>
hm maybe dmesg debug level is only showing critical stuff?
<danvet>
that could be another one
<danvet>
__GFP_NOWARN is the flag for "I know how to handle allocation errors here and even expect them, don't freak out when there's no memory"
<danvet>
and we don't set that in the case you're hitting
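The flag being described, shown in a minimal sketch (placeholder function name): a caller that expects large allocations to fail passes __GFP_NOWARN and handles the fallback itself, whereas the default behavior, without the flag, is what produces the allocator splat in dmesg.

```c
#include <linux/slab.h>
#include <linux/vmalloc.h>

/* Sketch: a caller that can cope with a failed kmalloc asks the allocator
 * not to warn, then falls back to vmalloc.  Without __GFP_NOWARN a failure
 * here would dump a warning plus allocator state to dmesg. */
static void *alloc_big_buffer(size_t size)
{
	void *buf = kzalloc(size, GFP_KERNEL | __GFP_NOWARN);

	if (!buf)
		buf = vzalloc(size);	/* expected fallback path */

	return buf;
}
```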
<Sumera>
danvet: yeah, could be, none of my printks (even after using KERN_CRIT) were showing up, so I changed the config and the tree is building rn.
<berylline>
i don't want to sound like a pest, but i asked a question yesterday and it was this
<berylline>
[18:49:46] <berylline> another question that i wanted to ask: is there any way to trace what's going on with GPUs on ARM without mmiotrace?
<berylline>
[18:50:00] <berylline> i know there's a method like what panwrap was made for
<berylline>
<berylline> but i was also wondering if there are any methods of tracing GPU hardware reads, writes and other things besides what i've already mentioned
<robclark>
berylline: you disconnected before anyone had a chance to answer.. but I don't believe mmiotrace works on arm (it certainly didn't many years back when I started poking at gpus on arm devices).. AFAIK everyone uses LD_PRELOAD shims to wrap ioctls
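For context, the LD_PRELOAD approach being referred to: a small shared object that interposes ioctl(), logs the call, and forwards it to the real libc implementation. This is a generic sketch, not panwrap itself; decoding of driver-specific ioctl payloads is left out.

```c
#define _GNU_SOURCE
#include <dlfcn.h>
#include <stdarg.h>
#include <stdio.h>

/* Minimal ioctl interposer: build as a shared object and run the userspace
 * driver with LD_PRELOAD pointing at it to log every ioctl it issues. */
int ioctl(int fd, unsigned long request, ...)
{
	static int (*real_ioctl)(int, unsigned long, ...);
	va_list ap;
	void *arg;
	int ret;

	if (!real_ioctl)
		real_ioctl = (int (*)(int, unsigned long, ...))
			     dlsym(RTLD_NEXT, "ioctl");

	va_start(ap, request);
	arg = va_arg(ap, void *);
	va_end(ap);

	ret = real_ioctl(fd, request, arg);
	fprintf(stderr, "ioctl(fd=%d, req=0x%lx, arg=%p) = %d\n",
		fd, request, arg, ret);
	return ret;
}
```

Built with something like cc -shared -fPIC -o wrap.so wrap.c -ldl and loaded via LD_PRELOAD=./wrap.so, this captures the traffic between the closed userspace driver and the kernel without needing mmiotrace.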
<berylline>
robclark: yeah, i understand that mmiotrace doesn't support ARM, which was why i asked. and i disconnected because i had to do something
<berylline>
it's a shame that mmiotrace doesn't support ARM, really :(
<robclark>
tbh, mmiotrace hasn't really been needed that much.. the only thing really directly touching hw is the kernel part, and android requires that to be open source
<berylline>
true. i also know that the SGX GPUs that i'm going to study have an open-source kernel module which contains service commands
<berylline>
which i think is going to be interesting to refer to for looking at the traffic that goes on between the closed-source components and the kernel module
<berylline>
although i don't think all of the communication coming from those components is going to involve the kernel module
<berylline>
but i can't really say unless i get my hands on a BeagleBone Black or something else like it
<bl4ckb0ne>
is it normal to experience gl3.3 failure when updating the vulkan spec?
<ccr>
I wonder if anyone has realized that pretty much all of the gallium debugging stuff works by pure chance? I mean, the drivers and auxiliary/driver_* stuff all wrap pipe_screen into one of their own structs, basically typecasting to pipe_screen, which is kind of a problem when more than one thing does the same.
<zmike>
shhhh we don't talk about that
<ccr>
I see :P too bad I ran into an issue where things explode because the driver messes with the trace component's data, corrupting a pointer -> kaboom
<ccr>
I guess I'll "fix" it locally with something like struct trace_screen { struct pipe_screen base; int bogus_padding[1024]; ...
<ccr>
I was already trying to figure out a better solution, but this looks like a rather deep-rooted issue and would require some major overhauls to fix properly
<zmike>
ccr: where are you encountering this?
<zmike>
padding won't help since the driver is then failing to update its own data
<zmike>
I fixed cases of this recently in iris and llvmpipe
<ccr>
with crocus, so it's kinda an out-of-mainline thing
<zmike>
ah
<zmike>
maybe the same as what I had in iris then?
<airlied>
ccr: point me at it and I'll port over the iris fix
<zmike>
anything in the driver accessing resource->screen will explode
<ccr>
could be, but I don't really see why this kind of issue wouldn't occur with any driver .. depending on what the struct wrapping pipe_screen has after "base"
<zmike>
ideally drivers don't trigger that behavior with wrapped pointers
<ccr>
struct crocus_screen {
<ccr>
uint32_t refcount;
<ccr>
struct pipe_screen base;
<ccr>
<zmike>
so yeah, same as the iris one it sounds like
<ccr>
mainline iris has the same, is your fix in one of your trees?
<zmike>
no, it's in iris
<airlied>
zmike: oh I see the fix you did recently, I'll pull it over
<ccr>
ah
<zmike>
👍
<ccr>
I assumed it would've been some kind of struct thing, ok
<zmike>
it is, you just have to approach it from the other direction
<ccr>
sounds .. scary
<zmike>
trace bugs are
<ccr>
well, I'd say this is more of a design failure overall, but shrug
<zmike>
cool that it's working with crocus!
<zmike>
haha
<ccr>
crocus is working super well, for me at least.
<ccr>
super well with my .. haswell *badadum tssh*
<airlied>
pushed the orig_screen fix to crocus
<ccr>
airlied, hooray
<airlied>
may even open an MR against main this week
<ccr>
../src/gallium/drivers/crocus/crocus_resource.c:320:23: error: implicit declaration of function ‘crocus_screen_ref’; did you mean ‘crocus_pscreen_ref’? [-Werror=implicit-function-declaration]
<zmike>
oops
<airlied>
dang snb is so slow to compile
<ccr>
works \:D/
<HdkR>
oop, I should double check if that llvmpipe patch works for me rather than claiming it'll work for me :P