#dri-devel on 2021-08-18 — irc logs at oftc.irclog.whitequark.org

2021-07-26 22:56 ChanServ changed the topic of #dri-devel to: <ajax> nothing involved with X should ever be unable to find a bar

00:12 Lucretia has quit []

00:17 sravn has joined #dri-devel

00:24 idr has quit [Quit: Leaving]

00:25 jhli has quit [Read error: Connection reset by peer]

00:30 chivay_ has joined #dri-devel

00:35 jhli has joined #dri-devel

00:37 columbarius has joined #dri-devel

00:38 co1umbarius has quit [Ping timeout: 480 seconds]

00:52 jhli has quit [Ping timeout: 480 seconds]

00:58 <agd5f> mdnavare, yes, works fine

01:07 Koniiiik has quit [Server closed connection]

01:07 Koniiiik has joined #dri-devel

01:19 tjaalton has quit [Server closed connection]

01:19 tjaalton has joined #dri-devel

01:26 nchery has quit [Ping timeout: 480 seconds]

01:26 ngcortes has quit [Ping timeout: 480 seconds]

01:29 Peste_Bubonica has joined #dri-devel

01:30 Peste_Bubonica has quit []

01:45 chivay_ has left #dri-devel [#dri-devel]

01:46 chivay_ has joined #dri-devel

01:48 pH5 has quit [Server closed connection]

01:49 pH5 has joined #dri-devel

01:49 chivay_ has quit []

01:49 chivay_ has joined #dri-devel

01:51 mbrost has joined #dri-devel

02:03 haasn has quit [Server closed connection]

02:03 haasn has joined #dri-devel

02:04 mbrost_ has joined #dri-devel

02:04 mwalle has quit [Server closed connection]

02:05 mwalle has joined #dri-devel

02:11 mbrost has quit [Ping timeout: 480 seconds]

02:17 slattann has joined #dri-devel

02:20 sagar_ has quit [Ping timeout: 480 seconds]

02:22 <alyssa> daniels: I checked my log and there's a few fails in the CTS run noooooooo

02:23 <alyssa> dEQP-GLES3.functional.fragment_ops.random.* with multisampling 4x

02:34 slattann has quit []

02:51 <macromorgan> getting this error on mainline master, not sure what it could be: UBSAN: shift-out-of-bounds in drivers/gpu/drm/panfrost/panfrost_mmu.c:70:35 \n shift exponent -1 is negative

02:52 <HdkR> oops, trying to shift by negative 1 :P

02:52 <HdkR> Someone didn't check a return for error?

02:55 <HdkR> region_width = 0. (1ul << (region_width - 11)) = (1ul << -1);

02:55 <HdkR> er, region_width=10

02:56 <icecream95> "fls returns: 1 .. 32". "Note fls(0) = 0, fls(1) = 1, fls(0x80000000) = 32"

02:56 <icecream95> Those two comments (one in panfrost_mmu.c, one in fls.h) seem to disagree...

02:57 <HdkR> Guess it should error when someone tries locking a zero sized region :)

02:58 <macromorgan> anything I can do to help debug it better?

03:02 <HdkR> I don't do kernel dev, no idea from me.

03:08 <mdnavare> agd5f: And it works with both X and Wayland compositors right? So the work is mostly in the kernel driver to have some kind of lmem shadow copies for the smem buffer from which the primary dispaly is scanning out of and then copying over to shadow buffer in lmem

03:09 <agd5f> mdnavare, yeah works with X and wayland. I think userspace generally deals with any extra copies that are required

03:09 jhli has joined #dri-devel

03:11 <mdnavare> agd5f: Yes but proper import should happen from igpu to dgpu should happen at the kernel level and then per frame copying by userspace 3D driver

03:11 <mdnavare> ?

03:19 <airlied> mdnavare: userspace should handle all the copies

03:20 <airlied> for igpu sharing to dgpu with X.org there should be a main igpu scanout buffer covering the two displays, a shared igpu/dgpu the size of the dgpu scanout, and a dgpu vram buffer

03:20 <airlied> call them a, b, c

03:21 <airlied> userspace should be ensuring copies from a->b and b->c happen

03:21 <airlied> scanout for igpu should be from a, scanout for dgpu should be from c

03:24 jessica_24 has quit [Quit: Connection closed for inactivity]

03:25 <airlied> mdnavare: though looking at the X server there is a check for the i915 kernel driver

03:26 * airlied should check xserver master

03:26 <airlied> mdnavare: yup still there, that might be a bug to fix for dgpus

03:27 <airlied> if (!strncmp("i915", version->name, version->name_len)) {

03:27 <airlied> ms->drmmode.reverse_prime_offload_mode = FALSE;

03:27 <airlied> }

03:50 YuGiOhJCJ has joined #dri-devel

04:07 reductum has quit [Quit: WeeChat 2.8]

04:07 reductum has joined #dri-devel

04:09 <mdnavare> airlied: I am back sorry was finishing up dinner

04:11 jhli has quit [Remote host closed the connection]

04:16 sagar_ has joined #dri-devel

04:18 YuGiOhJCJ has quit [Remote host closed the connection]

04:19 YuGiOhJCJ has joined #dri-devel

04:21 Adrinael has quit [Server closed connection]

04:21 Adrinael has joined #dri-devel

04:29 <mdnavare> airlied: So for i915, it actually doesnt do the reverse PRIME mode?

04:29 thellstrom has quit [Read error: Connection reset by peer]

04:30 thellstrom has joined #dri-devel

04:31 slattann has joined #dri-devel

04:33 <mdnavare> thellstrom: I have been looking at some of your code on gem object migrate and pinning, I am trying to understand what exactly happens when we say buffer migrate from lmem to smem and pinned to smem and how that differs from importing a buffer on igpu from dgpu?

04:35 <airlied> mdnavare: yes you'd have to test with an xserver that line removed

04:38 sagar_ has quit [Ping timeout: 480 seconds]

04:38 <mdnavare> airlied: Okay and with X server then supporting reverse prime, that will enable the a to b and b to c copies then

04:38 <mdnavare> Is that the expectation?

04:39 Duke`` has joined #dri-devel

04:39 <airlied> mdnavare: yes that should be the expectation

04:41 <mdnavare> since by default now render will be happening on igpu and so it would have the buffer allocated on smem, then that will need to be copied to lmem + smem shadow buffer and then dgpu vram buffer for scanout

04:42 <mdnavare> airlied: With reverse prime offload allowed, will that all happen seamlessly?

04:43 <airlied> mdnavare: should do

04:46 <mdnavare> airlied: Also if we set the DRI PRIME =1 globally then it should always be c -> b and b-> a copies right?

04:46 mattrope has quit [Remote host closed the connection]

04:47 <mdnavare> and that should light up both displays in extended mode without the Xserver change?

04:49 <airlied> mdnavare: DRI_PRIME shouldn't matter for this

04:51 <mdnavare> I thought that is what will decide whether it is prime or reverse prime offload mode

04:52 <airlied> it shouldn't be affecting that, unless it's by accident

04:52 <mdnavare> probably then DRI PRIME only gets used in 3D for deciding whether the buffer should be allocated in lmem or smem

04:53 <mdnavare> airlied: Also Dave I am trying to understand what exactly happens when we say buffer migrate from lmem to smem and pinned to smem and how that differs from importing a buffer on igpu from dgpu? Can you help me better understand this

04:55 <airlied> in these cases the buffers shouldn't be migrating at all

04:55 <airlied> maybe once at startup

04:56 <airlied> if you import a buffer on the igpu from the dgpu then it will have to be in smem, how it gets there is up to the kernel

05:00 <mdnavare> airlied: What does it mean by buffer migration?

05:01 <mdnavare> airlied: I mean what is the fundamental differnce between buffer import and buffer migrate

05:02 <airlied> mdnavare: buffer import is the dma-buf operation, buffer migration is an internal driver thing

05:03 <mdnavare> airlied: But what does the driver physically do to the buffer when it says migrate

05:04 <airlied> just copies the contents from vram to system ram

05:04 <airlied> or vice-versa

05:12 <mdnavare> airlied: Okay so Case 1: iGPU is render, it will allocate a buffer on SMEM and if dGPU is displaying or scanning that out, what is the import/migrate/copy flow?

05:13 camus1 has joined #dri-devel

05:15 danvet has joined #dri-devel

05:16 <airlied> mdnavare: X server will create a shared igpu/dgpu buffer, and a dgpu buffer for scanout

05:16 <airlied> and it will execute the copies

05:16 <airlied> the kernel might have to move the dgpu buffer to vram the first time it's set to scanout

05:16 <airlied> depending on how it was allocated

05:18 <mdnavare> airlied: where would it be typically allocated? Xserver will allocate it initially on smem only?

05:18 camus has quit [Ping timeout: 480 seconds]

05:19 <airlied> the X server would ask gbm/egl whatever so it should know it's scanout and put in the right place from teh start

05:19 <airlied> but I'm not guaranteeing the drivers doing the right thing

05:21 <mdnavare> airlied: So if egl knows scanout is on dGPU, it will actually just allocate dgpu on LMEM/vram, then for every frame it just copies from igpu/dgpu shared buffer to dgpu?

05:21 <mdnavare> and then it get scanned out?

05:21 <airlied> mdnavare: yes

05:21 Thymo has quit [Server closed connection]

05:21 Thymo has joined #dri-devel

05:29 <mdnavare> Thanks airlied, will try with the Xserver change if it fixes the extended mode issue

05:29 vivijim has quit [Read error: Connection reset by peer]

05:30 <mdnavare> danvet: I think if the Xserver change works then we probably wont need any gem/display kernel changes to handle the extended mode on displays connected across IGPU +DGPU (like I had asked in the email)

05:41 leandrohrb2 has quit [Server closed connection]

05:42 leandrohrb2 has joined #dri-devel

05:50 xxmitsu has quit [Server closed connection]

05:50 xxmitsu has joined #dri-devel

05:55 JohnnyonF has joined #dri-devel

05:56 jhli has joined #dri-devel

05:56 mbrost_ has quit [Remote host closed the connection]

05:56 Daanct12 has quit [Quit: Quitting]

06:01 Danct12 has joined #dri-devel

06:02 JohnnyonFlame has quit [Ping timeout: 480 seconds]

06:08 shadeslayer has quit [Server closed connection]

06:08 shadeslayer has joined #dri-devel

06:17 frieder has joined #dri-devel

06:32 tobiasjakobi has joined #dri-devel

06:37 tobiasjakobi has quit [Remote host closed the connection]

06:42 mlankhorst has joined #dri-devel

06:47 gouchi has joined #dri-devel

06:47 gouchi has quit [Remote host closed the connection]

07:08 jhli has quit [Ping timeout: 480 seconds]

07:10 <thellstrom> mdnavare: Did you get all the answers you needed from airlied, or is there something I can try to answer?

07:12 thellstrom1 has joined #dri-devel

07:12 rasterman has joined #dri-devel

07:17 Daanct12 has joined #dri-devel

07:18 thellstrom has quit [Ping timeout: 480 seconds]

07:23 Danct12 has quit [Ping timeout: 480 seconds]

07:24 thellstrom1 has quit []

07:27 thellstrom has joined #dri-devel

07:32 lynxeye has joined #dri-devel

07:33 <MrCooper> agd5f: compositing is not required for Xorg PRIME secondary GPU scanout; the issue described by mdnavare could be a bug in the Xorg or kernel driver for the secondary GPU (airlied might have found it)

07:41 V has quit [Server closed connection]

07:41 V has joined #dri-devel

07:43 jkrzyszt has joined #dri-devel

07:44 ppascher has quit [Ping timeout: 480 seconds]

07:51 pnowack has joined #dri-devel

07:51 shfil has joined #dri-devel

07:57 xexaxo_ has joined #dri-devel

07:57 jkrzyszt has quit [Remote host closed the connection]

08:08 Ahuj has joined #dri-devel

08:10 xexaxo_ has quit [Ping timeout: 480 seconds]

08:17 Lucretia has joined #dri-devel

08:19 i-garrison has quit [Server closed connection]

08:19 i-garrison has joined #dri-devel

08:22 pendingchaos has quit [Quit: No Ping reply in 180 seconds.]

08:23 pendingchaos has joined #dri-devel

08:33 i-garrison has quit []

08:34 i-garrison has joined #dri-devel

09:07 frieder has quit [Quit: Leaving]

09:07 frieder has joined #dri-devel

09:11 <MrCooper> jekstrand: or since the code won't actually run on Windows, just #define DRM_FORMAT_MOD_INVALID <something appropriate> for that

09:18 pcercuei has joined #dri-devel

09:32 <dj-death> jenatali: I'm struggling a bit to figure what package is providing the spirv-mesa3d-.spv.h file

09:58 <daniels> dj-death: none - it's generated from the .spv provided by libclc which is part of llvm

10:00 <daniels> it installs that into its libexecdir when you pass -DLIBCLC_TARGETS_TO_BUILD=spirv-mesa3d-

10:13 dllud has quit [Server closed connection]

10:13 dllud has joined #dri-devel

10:23 Daaanct12 has joined #dri-devel

10:24 <dj-death> daniels: there is no way to generate it from mesa's meson ?

10:29 Daanct12 has quit [Ping timeout: 480 seconds]

10:31 thellstrom has quit [Quit: thellstrom]

10:31 thellstrom has joined #dri-devel

10:32 thellstrom has quit []

10:32 thellstrom has joined #dri-devel

10:42 thellstrom has quit [Remote host closed the connection]

10:43 frieder has quit [Ping timeout: 480 seconds]

10:45 flacks has quit [Quit: Quitter]

10:46 thellstrom has joined #dri-devel

10:46 flacks has joined #dri-devel

10:48 <daniels> dj-death: to generate the .spv? no, you need to compile it from libclc

10:49 <daniels> I mean Mesa could vendor the whole of libclc, or distribute a pre-built SPIR-V binary, but I can't see that exactly being popular

10:50 xexaxo_ has joined #dri-devel

10:51 <dj-death> daniels: I just find it a bit odd that apparently we need the libclc source package to generate the file

10:52 frieder has joined #dri-devel

10:57 <daniels> dj-death: mm, no? you build libclc (using cmake) and install it to wherever, it installs the generated .spv and pkg-config file to the prefix, then Meson uses pkg-config to discover the install location and wrap the .spv in a .h using xxd

10:57 <daniels> but yeah, I mean at some point you do need to build libclc itself, given it's ... non-trivial

10:58 <daniels> it's packaged as libclc in Fedora 35 and libclc-13 in Debian experimental

11:00 <daniels> s/experimental/sid/

11:05 <dj-death> ah okay

11:09 <dj-death> I guess I'm unlucky that it's not shipping as part of libclc-13 from apt.llvm.org

11:12 <dj-death> hmmm

11:12 <daniels> you might want to upgrade? seems like jljusten flipped the switch last week ... https://salsa.debian.org/pkg-llvm-team/llvm-toolchain/-/commit/f572d3695eceeeaf897631c3c12679dc00190c57

11:13 <dj-death> but that file doesn't appear in experimental either

11:16 The_Company has quit [Server closed connection]

11:17 The_Company has joined #dri-devel

11:22 vivijim has joined #dri-devel

11:25 pochu has quit [Ping timeout: 480 seconds]

11:29 xexaxo_ has quit [Ping timeout: 480 seconds]

11:29 <daniels> ah, nice and timely: https://salsa.debian.org/pkg-llvm-team/llvm-toolchain/-/commit/c5b9334be225ad44bf05a8b3bef14265f7115266

11:35 JohnnyonF has quit [Ping timeout: 480 seconds]

11:39 <tjaalton> spirv-tools isn't available on the old releases, which is why apt.llvm.org builds were failing

11:43 <dj-death> arg :(

11:54 Daaanct12 has quit [Quit: Quitting]

11:55 Danct12 has joined #dri-devel

11:55 frieder_ has joined #dri-devel

11:56 frieder has quit [Ping timeout: 480 seconds]

12:01 iive has joined #dri-devel

12:16 <zmike> MrCooper: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12445

12:23 slattann has quit []

12:27 i-garrison has quit [Server closed connection]

12:27 i-garrison has joined #dri-devel

12:45 shfil has quit [Read error: Connection reset by peer]

12:52 heat has joined #dri-devel

13:05 <zmike> MrCooper: you found a hard bug :/

13:13 pochu has joined #dri-devel

13:15 mattrope has joined #dri-devel

13:19 <dj-death> is kmscube more or less of a joke than vkcube?

13:25 pochu has quit [Ping timeout: 480 seconds]

13:27 <MrCooper> zmike: sorry :)

13:27 <MrCooper> just happened to notice it

13:41 slattann has joined #dri-devel

14:11 slattann has quit []

14:23 <CounterPillow> What could cause a drm driver to probe but never call its bind?

14:29 <danvet> EPROBE_DEFERREED

14:29 <danvet> CounterPillow, if this is some random soc at least

14:30 <danvet> usually means you didn't enable enough stuff in your Kconfig for the driver to fully load

14:30 <danvet> or I'm confused about the problem

14:30 * danvet interpolating a lot

14:31 <CounterPillow> danvet: thanks, this is a random soc and we're currently trying to bludgeon a rough forward port of rockchip's vop2 into at least doing something

14:33 <CounterPillow> [ 1.253529] rockchip-vop2 fe040000.vop: deferred probe timeout, ignoring dependency

14:33 <CounterPillow> ^- is this possibly related? It does complain of a cyclic dependency with HDMI earlier on, but the probe function prints after this message

14:50 <ajax> i find it suspicious that if i take out glx_arb_sync_control i have exactly 100 glx tests in piglit

14:51 <danvet> CounterPillow, yeah that's EPROBE_DEFER

14:52 <CounterPillow> Alright, thanks

14:52 <danvet> it means not all parts of the driver are there

14:52 <danvet> I think at least

14:52 <danvet> I've never worked with this stuff for real, just watch others fail at it :-P

14:52 <danvet> x86 doesn't have these cobbled-together devices that much

14:53 <CounterPillow> Yeah :(

14:57 YuGiOhJCJ has quit [Quit: YuGiOhJCJ]

15:10 dongwonk has quit [Remote host closed the connection]

15:10 nchery has joined #dri-devel

15:18 frieder_ has quit [Remote host closed the connection]

15:21 slattann has joined #dri-devel

15:33 nirmoy has joined #dri-devel

15:33 pochu has joined #dri-devel

15:35 camus has joined #dri-devel

15:36 slattann has quit []

15:39 camus1 has quit [Ping timeout: 480 seconds]

15:44 thellstrom has quit [Ping timeout: 480 seconds]

15:55 <mdnavare> danvet: On the extended mode thread / secondary GPU scanout as per airlied X should be creating a complete scanout buffer on smem, igpu + dgpu shared buffer of the size of dgpu scanout and then dgpu buffer for the scanout and then ensuring copies between a to b to c

15:55 <mdnavare> danvet: So then do you suspect any more changes here on our kernel driver to handle this?

15:56 <mdnavare> or everything handled in the userspace?

16:00 <mdnavare> jekstrand : The concern you had had was that in this extended mode case, the smem buffer might be needed to be pinned in smem and lmem at the same time for scanout but I think if there are 3 buffers like what airlied said (smem buffer for the whole desktop content, igpu + dgpu shared buffer of size of dgpu scanout and then dgpu buffer and if userspace ensures copies between them we wont have this issue ) right ?

16:09 pochu has quit [Remote host closed the connection]

16:10 <alyssa> ...Wait

16:11 <alyssa> glClearColor is dithered only if glEnable(GL_DITHER) is set at the time isn't it.

16:12 slattann has joined #dri-devel

16:13 <alyssa> Ahhh

16:14 <danvet> mdnavare, for igpu+dgpu we should only need 2 buffers

16:14 <danvet> 3 are needed for dgpu + dgpu

16:15 <danvet> also the migration fix from jekstrand is still needed in DII I think

16:15 <danvet> maybe check with mike ruhl on that, not sure

16:17 <MrCooper> danvet: unless your dGPU can scan out from system memory, a separate shared buffer is required for iGPU rendering → dGPU scanout as well

16:18 <mdnavare> danvet: MrCooper: Yes our dgpu cannot scanout from smem so thats why I think yes we would be needing that separate buffer

16:18 <MrCooper> (or unless the dGPU happens to be able to consume the iGPU's main buffer, I guess)

16:18 Ahuj has quit [Ping timeout: 480 seconds]

16:19 <danvet> MrCooper, yeah the 2 buffers only works if the igpu doesn't insist the scanout must be in stolen or something

16:19 <danvet> which hasn't been the case for intel igpu ever since og i810

16:19 <danvet> but I know amd igpu only lifted that recently

16:19 <MrCooper> danvet: talking about *dGPU scanout*

16:19 <danvet> MrCooper, yes

16:19 <danvet> 1 idgpu buffer, pinned into ram

16:19 <danvet> 1 dgpu buffer, pinned into vram

16:19 <jekstrand> danvet: Yes, the migration fix is needed in DII. It still locksplats.

16:20 <MrCooper> why are you talkig about iGPU scanout then? :)

16:20 <danvet> dgpu does the blt job from the first to the 2nd

16:20 <danvet> MrCooper, extended desktop I thought we're talking about, where you want the same desktop on both

16:20 <jekstrand> danvet: The case that has me confused is what happens if a client renders on the dGPU, scans out on the dGPU, and then tries to import to an iGPU or a different dGPU.

16:20 <danvet> jekstrand, fails at fd2handle

16:21 <MrCooper> danvet: right, that's assuming the dGPU can read the iGPU's main buffer tiling

16:21 <danvet> if you try to share after you've set it up as scanout

16:21 <jekstrand> Ok, I guess that makes sense

16:21 <danvet> if you do it other way round it should fail at addfb time (which we probably don't)

16:21 <danvet> MrCooper, ah yes we also assume that

16:21 <jekstrand> danvet: Actually, I think we might. We probably don't have IGTs for it, though.

16:22 <danvet> ofc there's the slight problem of most compositors not having enabled drm_formats for intel, but oh welp

16:22 shfil has joined #dri-devel

16:22 <danvet> jekstrand, the thing I'm wonder is whether an addfb should block pinning to smem already or not

16:22 <jekstrand> MrCooper: dGPU reading iGPU's tiling will be a thing. :)

16:23 <alyssa> jekstrand: When will you support Arm tiling?

16:23 <jekstrand> alyssa: Not that iGPU. :P

16:23 <alyssa> For when I use my DG1 card on my Chromebook

16:23 <alyssa> :_p

16:23 <MrCooper> jekstrand: same joke about AMD tiling :P

16:23 <jekstrand> alyssa: I have considered teaching the mesa PRIME code about Intel X and Y-tiling, though.

16:24 <jekstrand> Instead of doing vkCmdCopyImageToBuffer to do the blit, set up a compute shader that X-tiles the output and run that.

16:25 <jenatali> We attempted to introduce a standard tiling layout into the industry so we could do better cross-adapter memory accesses

16:25 <jenatali> We failed :(

16:25 <jekstrand> Not sure how that'd work through the DRI layer but it'd be easy enough with the Vulkan WSI layer.

16:25 <jekstrand> jenatali: We were right with you on it! Too bad we implemented it wrong at least 3 times. (-:

16:25 <jenatali> Yeah I think you guys were the only ones who tried though

16:26 <jekstrand> AFAIK, yes.

16:26 <MrCooper> generic user space will probably need modifiers to determine if a special case with 2 buffers is possible, and fall back to 3 buffers if not

16:26 <jekstrand> But if you also had it in WARP, that's two implementations. It's a standard!

16:27 pochu has joined #dri-devel

16:27 rgallaispou has quit [Read error: Connection reset by peer]

16:31 <mdnavare> danvet: MrCooper: jekstrand: So in the extended case, by default when only igpu is rendering and we need to scanout on both igpu and dgpu display, then it renders on igpu + dgpu buffer in smem, copies that to dgpu buffer for scanout on dgpu, no issues there right?

16:32 <mdnavare> danvet: MrCooper:jekstrand: Problem is if we force dri prime for dgpu to render and want scanout on both igpu and dgpu then what happens?

16:34 <MrCooper> iGPU composites dGPU client buffer contents to its main buffer, after that same as always

16:36 <MrCooper> (that's the generic fallback case, fancy compositors might try to avoid the dGPU → iGPU → dGPU round-trip somehow)

16:36 <danvet> jekstrand, iirc intel adopted it, it's our 64k tile format

16:36 <danvet> not sure we use it for anything though :-)

16:36 <jekstrand> danvet: Nope

16:36 <jekstrand> danvet: We did. It was called Ys on SKL

16:36 <jekstrand> But that's totally different from tile64 on DG2

16:36 <mdnavare> MrCooper: so thatc omposited buffer will be in smem and be igpu + dgpu shared buffer and then copy again to dgpu buffer for scanout right?

16:37 <danvet> jekstrand, and neither of them was the ms format?

16:37 <jekstrand> danvet: Ys was the MS format, except we implemented miptails wrong.

16:37 <danvet> hah lol

16:37 <jekstrand> And then we implemented them differently wrong on CNL.

16:37 <danvet> classic intel "we need dx9.0c for us pls" or whatever it was

16:37 <bnieuwenhuizen> I think on the AMD side we actually support the MS format, I think it is our _S tiling

16:37 <jekstrand> I've not R/Ed ICL to know if we finally got them right.

16:38 <MrCooper> mdnavare: sounds right

16:38 <danvet> bnieuwenhuizen, jekstrand I guess if we type up an igt that verifies it scans out or something we could add it

16:38 <danvet> but also, probably way too much niche

16:39 <danvet> need to get modifiers in general going better first

16:39 <mdnavare> jekstrand: danvet: So atleast for getting the extended desktop case to work like it works on AMD, the existing X and Wayland compositors hsould handle it after removing the reverse prime = false for i915 in Xserver. No other driver work needed there?

16:40 <mdnavare> danvet: jekstrand: Well ofcourse the gem migration fixes that jekstrand made but no new changes I guess

16:40 <mdnavare> needed

16:40 <mdnavare> jekstrand: danvet: So no arch changes/discussion on that is that correct

16:45 gouchi has joined #dri-devel

16:46 mbrost has joined #dri-devel

16:51 JohnnyonFlame has joined #dri-devel

16:51 hikiko_ has joined #dri-devel

16:52 hikiko_ has quit []

16:52 mlankhorst has quit [Ping timeout: 480 seconds]

16:53 hikiko_bsd has joined #dri-devel

16:55 xexaxo_ has joined #dri-devel

16:58 jhli has joined #dri-devel

17:02 Company has joined #dri-devel

17:06 anujp has quit [Ping timeout: 480 seconds]

17:08 The_Company has quit [Ping timeout: 480 seconds]

17:10 pochu_ has joined #dri-devel

17:15 xexaxo_ has quit [Ping timeout: 480 seconds]

17:16 anujp has joined #dri-devel

17:17 pochu has quit [Ping timeout: 480 seconds]

17:31 <jljusten> dj-death, daniels: hopefully the revert should only be short term. the issue is that https://apt.llvm.org/ wants to support buster, bionic (18.04), but they don't have llvm-spirv. the dev that owns things needs to split things up a bit to deal with that.

17:31 <jekstrand> jljusten: What revert?

17:31 <jenatali> jekstrand: daniels pasted a link above: https://salsa.debian.org/pkg-llvm-team/llvm-toolchain/-/commit/c5b9334be225ad44bf05a8b3bef14265f7115266

17:32 <jljusten> jekstrand: almost have .spv files for debian/ubuntu...

17:32 pochu_ has quit [Ping timeout: 480 seconds]

17:32 <jekstrand> jljusten: Ah

17:33 idr has joined #dri-devel

17:36 slattann has quit []

17:41 shfil has quit [Quit: Konversation terminated!]

17:59 <alyssa> robclark: " nit, space before ( for iterator macros

17:59 <alyssa> "

17:59 slattann has joined #dri-devel

17:59 <alyssa> this is my preference as well (I think) but it's annoyingly different than kernel style :V

18:00 * robclark disagrees with kernel style in a few areas..

18:00 * alyssa nods

18:00 lynxeye has quit [Quit: Leaving.]

18:01 <jekstrand> I don't put in that space

18:01 <robclark> I kinda thing iterator macros should follow the same rule as `for`.. I think clang-format agrees with me on that one particular thing..

18:05 <zmike> 👎⛔🚫space🚫⛔👎

18:12 <jekstrand> robclark: Yeah... I get that argument.

18:13 <dj-death> jljusten: thanks for doing all that :)

18:20 <jljusten> dj-death: now I know just how long it takes to build llvm. :) and, I know how to try to build llvm but have it break 1h into the build with no useful error message. (repeatedly :)

18:28 <jekstrand> jljusten: Useful life skills!

18:28 <jekstrand> jljusten: Especially that second one. :P

18:34 dongwonk has joined #dri-devel

18:36 slattann has quit []

18:37 dongwonk has quit []

18:40 <jenatali> Oof, yeah

18:45 ngcortes has joined #dri-devel

18:45 <macromorgan> so in regards to the problems I was having last night: UBSAN: shift-out-of-bounds in drivers/gpu/drm/panfrost/panfrost_mmu.c:70:35 \n shift exponent -1 is negative, it looks like https://elixir.bootlin.com/linux/v5.14-rc6/source/drivers/gpu/drm/panfrost/panfrost_mmu.c#L118 and #L134 are the culprits

18:46 <macromorgan> specifically those lines are setting the value of size to ~0UL, which I'm seeing as a -1?

18:48 rasterman has quit [Quit: Gettin' stinky!]

18:52 JohnnyonFlame has quit [Ping timeout: 480 seconds]

19:12 soreau has quit [Read error: Connection reset by peer]

19:13 soreau has joined #dri-devel

19:16 <airlied> jenatali: coming up with new tiling is gpu memory c9ntroller designers job security

19:20 <dj-death> jljusten: I unfortunately know worse projects than llvm in terms of build time and ability to hang your machine just by the amount of memory required to compile some objects

19:21 <airlied> linking llvm is the killer

19:22 <airlied> esp doing ninja without setting then cmake opt to limit parallel links

19:28 gouchi has quit [Remote host closed the connection]

19:30 rasterman has joined #dri-devel

19:34 gouchi has joined #dri-devel

19:35 nirmoy has quit []

19:36 <jljusten> the path for nixos .spv files appears to have even more obstacles :/ currently stuck on a macos build failure blocking the llvm-spirv update

19:40 <alyssa> macromorgan: eyes

19:42 <alyssa> macromorgan: ooh

19:43 gouchi has quit [Remote host closed the connection]

19:46 <alyssa> writing a patch

19:47 <alyssa> log2 minus 1..ok

19:47 gouchi has joined #dri-devel

19:48 <macromorgan> cool... let me know if you want me to test it

19:49 <alyssa> incidentally it looks like this code is buggy

19:49 <alyssa> like, functionally buggy, not just theoretically

19:49 <alyssa> s/theoretically/UBSAN-ily/

19:49 <alyssa> at least on bifrost

19:52 <macromorgan> Yeah, I've been running it on an Odroid Go Advance I use for development, but yesterday was the first time I tried anything silly like launching a GUI on it in several months.

19:53 JohnnyonFlame has joined #dri-devel

20:00 <alyssa> doesn't apply to midgard

20:00 <alyssa> wonder if kbase hanldes this?

20:02 <alyssa> yep.

20:02 <alyssa> robher: ^

20:02 <alyssa> In bifrost-supporting kbase, there's the following define:

20:02 <alyssa> #define KBASE_LOCK_REGION_MIN_SIZE_LOG2 (15)

20:03 <alyssa> kbase's `lock_region` rounds up to said minimum size, expressing the fact that bifrost can only lock 32kb regions at a time (not 4k like midgard)

20:04 <alyssa> macromorgan: This is assuredly not the bug you were referring to 😅

20:04 <alyssa> Regardless, I just refreshed my brain on the Mali MMU. Will write some patches fixing both issues. Thank you for the detailed report ❤️

20:05 <alyssa> (but first I need to fix a compiler bug before dinner o'clock so I can get a CTS run overnight)

20:07 thellstrom has joined #dri-devel

20:10 pcercuei has quit [Quit: brb]

20:11 <robher> alyssa: s/size_t/u64/ and maybe should be ~0ULL that's passed.

20:11 pcercuei has joined #dri-devel

20:12 <macromorgan> thank you, appreciate it

20:13 <alyssa> robher: Ack.

20:13 <robher> Though 4GB may be enough as that was the max per AS (a should be enough(TM) max, not any h/w max).

20:13 <alyssa> The other thing still applies

20:13 <alyssa> Guess that just slipped through with all this code being written for midgard and then bifrost just getting bolted on and "it happens to work"?

20:14 * alyssa would not have noticed if she hadn't gone researching AS_LOCKADDR

20:23 jhli has quit [Remote host closed the connection]

20:23 Duke`` has quit [Ping timeout: 480 seconds]

20:24 shfil has joined #dri-devel

20:25 jhli has joined #dri-devel

20:47 xexaxo_ has joined #dri-devel

20:54 jewins has joined #dri-devel

20:57 <dcbaker> kusma: "llvmpipe: take intersection with bbox for non-legacy points" is nominated for stable, but it looks like it's causing regressions in the CI: https://gitlab.freedesktop.org/mesa/mesa/-/jobs/12915160

20:57 <dcbaker> I've dropped it from the staging branch for the moment

20:59 JohnnyonF has joined #dri-devel

20:59 <alyssa> dcbaker: I apologize in advance for all the stuff i'm cc'ing stable

20:59 <ngcortes> qq; where can I find the latest drm-next kernel?

20:59 JohnnyonFlame has quit [Ping timeout: 480 seconds]

20:59 <alyssa> I really shouldn't have waited to do conformance testing until /after/ a branch point ...

21:00 <dcbaker> alyssa: I'm still curious what you want to do about: "pan/bi: Use FABSNEG pseudo ops for modifier prop"

21:00 <dcbaker> I don't think I can backport that properly

21:00 <alyssa> "still" did we talk about this...?

21:00 <dcbaker> I swear I messaged you about it?

21:00 <alyssa> you might have I am very forgetful

21:00 <dcbaker> maybe it was when I had become unregestered? D:

21:01 <kusma> dcbaker: would you mind creating a ticket and assigning it to me about it? I'll take a look first thing in the morning...

21:01 <dcbaker> kusma: sure

21:01 <alyssa> dcbaker: For "FABSNEG", I would just drop that commit tbh

21:01 thellstrom1 has joined #dri-devel

21:01 <alyssa> It does fix a real bug (and is visible on GL workloads), but the GL spec doesn't actually care, it's only interesting for vulkan which tightens the numerical requirements

21:02 <alyssa> and we're not shipping panvk yet so no need to backport

21:03 <alyssa> "Use FCLAMP pseudo op" ... this /does/ fix a real bug visible on 21.2 unfortunately.

21:03 <dcbaker> I seem to remember that those were built on top of some non trivial changes

21:03 <dcbaker> I could pull those in as well

21:04 thellstrom has quit [Ping timeout: 480 seconds]

21:05 <dcbaker> jenatali: I'm see what I think are unexpected passes (the output is incredibly hard to read) on 21.2 for the d3d12 driver tests: https://gitlab.freedesktop.org/mesa/mesa/-/jobs/12915163

21:05 <dcbaker> I'm not sure what to do with that

21:06 <jenatali> dcbaker: Huh, guess that reinforces that these were somehow fixed externally... we updated the baseline on main recently

21:06 <jenatali> Let me find the change

21:06 <jenatali> https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12391

21:08 <alyssa> dcbaker: Oh... Hm.

21:09 <alyssa> when is the next bug fix release?

21:10 <dcbaker> well it's supposed to be today, but I 1. forgot to push the changes I staged yesterday, and 2. found some regressions

21:11 <dcbaker> jenatali: thanks! I've backported that

21:11 <alyssa> (My priority right now is passing CTS.)

21:13 <dcbaker> alyssa: then you have 2 weeks :)

21:13 <karolherbst> alyssa: talk to you in 3 years then

21:13 <karolherbst> :P

21:14 <alyssa> karolherbst: i'm on my last test case.

21:15 <karolherbst> alyssa: you didn't try the full run yet

21:15 <alyssa> i mean i've been on my last test case for 3 months but

21:15 <alyssa> I did, overnight

21:15 <karolherbst> ohh, nice

21:15 <karolherbst> GLES I guess?

21:15 <alyssa> gles3.1

21:15 <karolherbst> ahh

21:15 <alyssa> found 2 bugs that I didn't hit in my usual test runs

21:15 <karolherbst> :)

21:15 <karolherbst> nice

21:15 <alyssa> 1 i already fixed, 2 I'm fighting now

21:18 <alyssa> karolherbst: 13h on my board fwiw for ES3.1

21:18 <imirkin> at least you can repro it at will?

21:19 <imirkin> on nouveau, the issue only comes up on like hour 4 of the cts runner...

21:19 <alyssa> imirkin: thankfully yes 🙂

21:19 <alyssa> Oof

21:19 <karolherbst> yeah

21:19 <alyssa> last week I fixed an fd leak that only repro'd after 2 hours of CTS...

21:19 <karolherbst> imirkin: although we might know what the issue could have been

21:20 <imirkin> karolherbst: super race?

21:20 <karolherbst> yeah

21:20 <imirkin> let's hope!

21:20 <karolherbst> ben said it was also fixing random issues for him

21:20 <karolherbst> imirkin: if you want to try it: https://github.com/skeggsb/linux/commits/linux-5.14

21:26 <imirkin> karolherbst: yeah, dunno when i'll be able to mess around with ... anything, really

21:28 gouchi has quit [Remote host closed the connection]

21:29 pzanoni has quit [Quit: Coyote finally caught me]

21:29 <CounterPillow> [ 0.540111] Unable to handle kernel NULL pointer dereference at virtual address 0000000000000060

21:29 <CounterPillow> [ 0.539194] rockchip-vop2 fe040000.vop: [drm:vop2_bind] I'm a real driver!

21:29 <CounterPillow> Sometimes my debug prints make me laugh

21:30 <alyssa> st_tile what is wrong

21:32 <karolherbst> CounterPillow: ... oops

21:34 <imirkin> CounterPillow: a real driver should handle the null pointer deref!

21:37 <icecream95> alyssa: Stop distracting me, concentrating on school work during lockdown is hard enough even without you making me want to hack on Panfrost

21:40 <karolherbst> icecream95: I think you have to just work on panfrost then

21:42 pcercuei has quit [Quit: dodo]

21:45 xexaxo_ has quit [Ping timeout: 480 seconds]

21:48 jhli_ has joined #dri-devel

21:50 rasterman has quit [Quit: Gettin' stinky!]

21:50 pochu has joined #dri-devel

21:55 jhli has quit [Ping timeout: 480 seconds]

22:02 heat has quit [Remote host closed the connection]

22:09 jhli has joined #dri-devel

22:12 jhli_ has quit [Ping timeout: 480 seconds]

22:17 danvet has quit [Ping timeout: 480 seconds]

22:21 jessica_24 has joined #dri-devel

22:21 urja has quit [Read error: Connection reset by peer]

22:23 urja has joined #dri-devel

22:26 alanc has quit [Remote host closed the connection]

22:26 mbrost has quit [Quit: Leaving]

22:26 alanc has joined #dri-devel

22:28 shfil has quit [Ping timeout: 480 seconds]

22:33 mbrost has joined #dri-devel

22:34 <alyssa> dcbaker: Okay, I finished up the bug fixes I started in Mesa

22:34 <alyssa> I guess I can see about backporting things. idk. I have some big bug fixes queued on top of what's already there...

22:35 iive has quit []

22:38 pzanoni has joined #dri-devel

22:39 <karolherbst> alyssa: I let others do the backporting for me :O

22:39 <karolherbst> just apply Fixes tags and things get backported magically on their own :p

22:40 <alyssa> karolherbst: these are nontrivial backports.

22:41 <pinchartl> is git.fd.o down again, or is it just me ?

22:41 <karolherbst> works here

22:42 <pinchartl> sounds like the definition of "it's just me" then

22:42 <pinchartl> can you fetch from git://anongit.freedesktop.org/drm/drm ?

22:42 <imirkin> cgit (and thus anongit) appears broken

22:42 <imirkin> daniels: --^

22:42 Hello71 has joined #dri-devel

22:42 <karolherbst> pinchartl: ohh wait.. I tried the wrong thing

22:42 <karolherbst> nope, that's broken

22:43 <pinchartl> https://git.freedesktop.org/ also appears to be broken

22:44 <pinchartl> probably a good excuse to call it a day then

22:45 <alyssa> ...wait this test is passing when run by itself :V

22:45 <karolherbst> alyssa: classic

22:45 <karolherbst> happens all the time with nouveau

22:45 <karolherbst> sometimes even the order of tests matters

22:46 <karolherbst> you really want to have a truely random memory allocator for testing

22:46 <pinchartl> don't forget the phase of the moon

22:48 <airlied> cgit at this time of day often seems to hit a backup cycle or something

22:50 <alyssa> ok, here's a pretty minimal case list to trigger the bug...

22:56 <karolherbst> alyssa: but uhm... I guess it's not memory which is your problem but just stall state or something on the screen/context?

23:03 <alyssa> TBD

23:05 <icecream95> alyssa: So it still happens with all the obvious options like nocache and nocrc?

23:05 <alyssa> icecream95: Yes.

23:05 <alyssa> It's a state leak. It's not entirely clear if it's a Panfrost bug or mesa/st bug.

23:06 <alyssa> Maybe dithered clears aren't handled in any Gallium driver.

23:06 <alyssa> Kayden: maybe ^^?

23:09 <alyssa> Yeah. mesa/st doesn't handle it.

23:09 <alyssa> Not technically a spec violation but still seems wrong.

23:12 <alyssa> Meh. Will split off the change into a separate MR. I don't need it for conformance.

23:21 <alyssa> https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12459

23:22 <imirkin> alyssa: i don't think that ->clear() is meant to respect ->blend() settings

23:23 <imirkin> perhaps st/mesa should not be calling ->clear() if dithering is enabled. or perhaps it doesn't matter.

23:23 <alyssa> imirkin: spec wise, it doesn't matter

23:24 <alyssa> but I'd like the info passed to the driver -- if the driver wants to ignore it, it can make that decision itself

23:24 <alyssa> right now the driver has to assume dithering is disabled in the clear()

23:24 <imirkin> alyssa: then just look at the blend state. don't set a whole special blend state right before it.

23:25 <imirkin> if you want mesa to "fix" the blend state to reflect GL state, you can do so with an appropriate flush

23:25 <imirkin> and anger the performance crowd

23:25 <alyssa> hum?

23:25 <imirkin> your change to st_cb_clear

23:25 <imirkin> that should not be necessary.

23:26 <imirkin> if you want blend to be updated, just make sure that " st_validate_state(st, ST_PIPELINE_CLEAR);" takes care of it.

23:26 <airlied> just add a dither flag to clear :-P

23:26 <alyssa> Ah! thank you

23:26 <alyssa> airlied: or that I guess

23:26 <alyssa> seemed like way more work ...

23:26 <alyssa> imirkin: sorry I have like 1 mesa/st change ever and I think it was reverted :p

23:27 <imirkin> alyssa: no worries. gotta learn some time!

23:27 <alyssa> I'd rather not ;-p

23:29 <imirkin> alyssa: st_atom.h -- ST_PIPELINE_CLEAR_STATE_MASK -- add ST_NEW_BLEND to it

23:30 <imirkin> i still don't think "people" will be happy about that

23:30 <imirkin> ->clear() is supposed to ignore the various set state

23:30 <imirkin> and act fairly independently

23:31 <alyssa> st_atom.h -- yeah just found that

23:57 Lucretia has quit []