#dri-devel on 2023-08-03 — irc logs at oftc.irclog.whitequark.org

2022-12-21 00:45 ChanServ changed the topic of #dri-devel to: <ajax> nothing involved with X should ever be unable to find a bar

00:00 <zmike> if talking about 64bit io then it seems like it'd be good to add some alignment validation to nir_validate

00:02 kzd has quit [Quit: kzd]

00:05 <mareko> zmike: clip/cull indexing

00:05 <zmike> ah

00:05 <zmike> yes, well, there's tricks to work around it

00:05 <mareko> we have lots of piglit failures for 64-bit IO already

00:05 <zmike> just seems inconsistent that they're needed

00:05 <zmike> 64bit io is cancer

00:07 <HdkR> How soon until smooth interpolation is allowed for 64-bit IO

00:07 <mareko> never

00:08 <mareko> also it's possible today

00:08 <HdkR> Just do the interpolation manually using the exposed barycentrics? :)

00:09 <mareko> there is more to it, you also need explicit loads with a vertex index in FS

00:10 <mareko> and disable the vector subtract for P1 and P2 inputs

00:10 <HdkR> So you're saying we need a new extension to let the driver do it all for you right? :)

00:11 <mareko> no, Vulkan can do it already, hakzsam implemented custom interpolation not so long ago

00:13 heat has joined #dri-devel

00:13 <zmike> mareko: what's left with your linker thingy? do you plan to merge it in this release cycle?

00:14 <mareko> I could if I don't implement the remaining stuff

00:14 <zmike> ah

00:17 RSpliet has joined #dri-devel

00:23 JohnnyonFlame has quit [Ping timeout: 480 seconds]

00:23 RSpliet has quit [Read error: Connection reset by peer]

00:24 RSpliet has joined #dri-devel

00:28 kzd has joined #dri-devel

00:29 oneforall2 has quit [Remote host closed the connection]

00:30 oneforall2 has joined #dri-devel

00:31 heat has quit [Remote host closed the connection]

00:35 tristan has joined #dri-devel

00:36 tristan is now known as Guest7738

00:49 pcercuei_ has quit []

00:50 MoeIcenowy has quit [Quit: ZNC 1.8.2 - https://znc.in]

00:51 MoeIcenowy has joined #dri-devel

00:57 columbarius has joined #dri-devel

00:59 co1umbarius has quit [Ping timeout: 480 seconds]

01:09 yuq825 has joined #dri-devel

01:12 alyssa has joined #dri-devel

01:13 <alyssa> mareko: i'm also curious, is the linker meant to handle cases like `v_color = a_color;` where a_color turns out to be 3-channels? (and then in a monolithic pipeline, the linker is able to drop the w output and replace with a constant 1.0 in the FS?)

01:14 <alyssa> I *think* that will Just Work if the backend driver calls the i/o linker after lowering vertex inputs in NIR

01:14 <alyssa> (should be straightforward in RADV with monolithic pipelines, for ex)

01:15 <alyssa> but I notice the proposal has the linker called in the GLSL compiler and not the backend driver, so I wasn't sure if there's a benefit/requirement to call it early

01:15 <alyssa> (long before the vertex format key could be applied)

01:17 benjaminl has quit [Ping timeout: 480 seconds]

01:17 <alyssa> (I also don't know if radeonsi would use it this way... specializing to both the vertex formats AND the linked fragment shader seems undesirable if it's not already a monolithic pipeline bundled up nicely in VK.)

01:18 yyds has joined #dri-devel

01:19 Guest7738 has quit [Remote host closed the connection]

01:19 tristan_ has joined #dri-devel

01:44 heat has joined #dri-devel

01:49 TMM has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

01:49 TMM has joined #dri-devel

01:52 tristan_ has quit [Remote host closed the connection]

02:01 tristan_ has joined #dri-devel

02:12 YuGiOhJCJ has joined #dri-devel

02:19 JohnnyonFlame has joined #dri-devel

02:21 mbrost has joined #dri-devel

02:33 godvino has joined #dri-devel

02:36 krushia has quit [Ping timeout: 480 seconds]

02:42 tristan_ has quit [Remote host closed the connection]

02:49 tristan_ has joined #dri-devel

02:50 Danct12 has quit [Remote host closed the connection]

02:51 Danct12 has joined #dri-devel

03:13 JohnnyonFlame has quit [Read error: Connection reset by peer]

03:13 Danct12 is now known as Guest7752

03:13 Danct12 has joined #dri-devel

03:14 crabbedhaloablut has joined #dri-devel

03:26 tristan_ has quit [Remote host closed the connection]

03:33 tristan_ has joined #dri-devel

03:36 heat_ has joined #dri-devel

03:36 heat has quit [Read error: No route to host]

03:47 tristan_ has quit [Remote host closed the connection]

03:54 tristan_ has joined #dri-devel

03:57 Danct12 has quit [Quit: WeeChat 4.0.2]

03:57 Haaninjo has joined #dri-devel

03:57 Haaninjo has quit [Remote host closed the connection]

04:08 Danct12 has joined #dri-devel

04:17 mbrost has quit [Ping timeout: 480 seconds]

04:19 JohnnyonFlame has joined #dri-devel

04:33 tristan_ has quit [Remote host closed the connection]

04:34 tristan_ has joined #dri-devel

04:44 godvino has quit [Ping timeout: 480 seconds]

04:55 tristan_ has quit [Remote host closed the connection]

04:55 tristan_ has joined #dri-devel

04:56 aravind has joined #dri-devel

05:03 heat_ has quit [Ping timeout: 480 seconds]

05:17 tristan_ has quit [Remote host closed the connection]

05:17 tristan_ has joined #dri-devel

05:18 Duke`` has joined #dri-devel

05:19 bgs has joined #dri-devel

05:26 JohnnyonFlame has quit [Read error: Connection reset by peer]

05:35 pcercuei has joined #dri-devel

05:36 Duke`` has quit [Ping timeout: 480 seconds]

05:36 Company has quit [Quit: Leaving]

05:38 tristan_ has quit [Ping timeout: 480 seconds]

05:39 bmodem has joined #dri-devel

05:40 junaid has joined #dri-devel

05:54 itoral has joined #dri-devel

06:06 ayaka_ has joined #dri-devel

06:06 <ayaka_> Do we still use that master/client auth thing in drm?

06:08 bmodem has quit [Quit: bmodem]

06:08 bmodem has joined #dri-devel

06:13 rasterman has joined #dri-devel

06:13 sima has joined #dri-devel

06:14 bgs has quit [Remote host closed the connection]

06:22 <mripard> jani: hey, do you know who's in charge of maintaining dim these days? I've had a PR stuck for months on gitlab

06:24 junaid_ has joined #dri-devel

06:25 jkrzyszt_ has joined #dri-devel

06:26 Zopolis4 has joined #dri-devel

06:33 vliaskov_ has joined #dri-devel

06:36 junaid_ has quit [Remote host closed the connection]

06:36 junaid has quit [Remote host closed the connection]

06:38 aravind has quit [Ping timeout: 480 seconds]

06:38 aravind has joined #dri-devel

06:43 MoeIcenowy has quit [Quit: ZNC 1.8.2 - https://znc.in]

06:43 MoeIcenowy has joined #dri-devel

06:49 <rasterman> hmmm libdrm design question...

06:50 vliaskov__ has joined #dri-devel

06:50 <rasterman> __u64 user_data (used in several "public" facing structs)

06:50 <rasterman> the way it's actually used at least in various places is to stuff pointers into...

06:51 <rasterman> this is of course wrong... but it's done. was/is user_data actually meant to be able to carry a pointer?

06:51 <airlied> why is it wrong?

06:51 <rasterman> what if my pointers are > 64bit?

06:51 <rasterman> what if they are actually special types that are not just plain integers...

06:51 <airlied> then you get to define a whole new ABI

06:52 <rasterman> ie i have to use a void * or a uintptr_t

06:52 <airlied> it can't store "pointers" because they change size

06:52 <airlied> so it breaks all sorts of things

06:52 <airlied> like 32-bit apps on 64-bit kernels

06:52 <rasterman> well in effect i am defining a new abi...

06:52 <rasterman> as it's a new architecture...

06:52 <rasterman> so i'm wondering if the right solution is a bit of ifdeffing there so essentially

06:53 <airlied> if you never want to run any "different" pointer size or emulate another arch that might work

06:53 <rasterman> #if (UINTPTR_MAX > 0xffffffff)

06:53 <rasterman> uintptr_t user_data;

06:53 <rasterman> #else

06:53 <rasterman> ... current code

06:53 <airlied> you mean 0xffffffffffffffffULL

06:53 <rasterman> but really questioning if the INTENT was that user_data can hold a ptr

06:53 <rasterman> yeah - thus "essentially"

06:54 <airlied> most __u64 are ptrs, but not all

06:54 <rasterman> you get the idea :)

06:54 <rasterman> well depends... inside kernel - yes. at the kernel/user boundary - yeah (ioctls and so on)

06:54 <rasterman> but once you get a bit higher up ... it's questionable

06:54 <rasterman> thus i ask :)

06:55 <rasterman> what the *intent* is

06:55 <rasterman> to me the intent was to carry at least 64bits of int or a ptr

06:55 <airlied> some of the apis might have _ptr in them

06:55 <airlied> where it expects a user ptr

06:55 <airlied> some of them might not

06:55 <airlied> it would have to be a case by case trawl through include/uapi/drm/*

06:55 <rasterman> yeah

06:56 <rasterman> but this is libdrm...

06:56 <rasterman> so its fully up in userspace

06:56 <airlied> it's not really, libdrm just calls into the kernel ioctls

06:56 <rasterman> one step beyond the kernel boundary

06:56 <rasterman> sure

06:56 <airlied> you'd have to fix both

06:56 <airlied> and libdrm mostly just reproduces kernel interfaces

06:56 vliaskov_ has quit [Ping timeout: 482 seconds]

06:56 <airlied> any include like drm_mode.h etc are kernel interfaces really

06:57 <rasterman> but struct drm_event_vblank is what it exposes directly to userspace beyond

06:57 <rasterman> for example

06:57 <airlied> that's a kernel interface

06:57 sghuge has quit [Remote host closed the connection]

06:57 <airlied> libdrm reproduces a bunch of kernel interfaces, but most of the definitions will come from the kernel

06:58 sghuge has joined #dri-devel

06:58 <rasterman> so i'm wondering if i get to now redefine user_data to be what i think it should be (ie uintptr_t)

06:58 <airlied> no you can't use uintptr_t

06:58 <airlied> because it changes size

06:58 <rasterman> i get to redefine kernel abi too :)

06:58 <airlied> the kernel ioctl compat layer was how we dealt with 32-on-64 problems back in the old days

06:59 <rasterman> unfortunately i have to use an actual ptr type

06:59 <airlied> when a bunch of APIs had 32-bit ptrs in them

06:59 <rasterman> or something capable of storing a ptr

06:59 <rasterman> i can't just use any old integer of N bits

06:59 <airlied> then you likely want to define a new "maxsizedptrstorage"

06:59 <airlied> that covers all existing ptrs and your new ones

06:59 <airlied> then define brand new structs for every thing that takes a ptr

07:00 <rasterman> well i can always ifdef as above

07:00 <rasterman> same on the kernel side

07:00 <rasterman> as compat32 now is actually good old 64bit

07:00 <rasterman> and the new native abi is ... well new.

07:02 <airlied> why do you have to use a ptr type though? since the kernel should never dereference these directly

07:02 <airlied> they have to be cast to void __user * and use copy_to/from_user

07:02 <rasterman> it's a capability architecture

07:03 <airlied> it might be an idea just to find all the real ptrs and give them a new type that is __u64 on normal arches

07:03 <airlied> then just have a special case for that type

07:03 <rasterman> ptrs are actually capabilities. specially flagged with ecc data + cpu instructions to define them as such

07:04 <rasterman> they are 128bit capabilities + 1 bit of extra separate metadata to flag that as a capability - thus you can't just use any old XXX bits of data to transport pointers. even if you don't ref/deref. you have to "transport" them

07:04 apinheiro has joined #dri-devel

07:05 <airlied> I assume someone has already done a bunch of the kernel interfaces

07:05 <rasterman> yes. me :)

07:05 <rasterman> or well specifically ... i fixed the compat32 for now

07:05 <airlied> I remember some of the CHERI stuff was discussing this before

07:05 <rasterman> yeah - this is just that.

07:06 <rasterman> but now linux - not bsd.

07:06 <rasterman> so i fixed up the compat32 bits that all assumed compat abi == 32bit

07:06 <rasterman> all of that is working just fine now (dpu+gpu)

07:07 <airlied> so yeah I think define a new type like __userptr, make it __u64, hunt them all down, add an arch specific __userptr for your arch

07:07 <airlied> port igt-gpu-tools to get some test coverage at least

07:08 <airlied> I'd say about half the __u64 are userptrs in disguise
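airlied's suggestion of a dedicated user-pointer type could look roughly like the sketch below. It is only an illustration under stated assumptions: `drm_userptr_t` and `struct example_submit` are hypothetical names that do not exist in the kernel or libdrm, and `__CHERI_PURE_CAPABILITY__` is assumed to be the macro a CHERI toolchain defines for the capability ABI.

```c
#include <stdint.h>

/* Hypothetical name: drm_userptr_t is not an existing kernel or libdrm type. */
#if defined(__CHERI_PURE_CAPABILITY__)
/* Assumption: on a capability ABI, uintptr_t can carry a valid capability. */
typedef uintptr_t drm_userptr_t;
#else
/* Everywhere else the field keeps its fixed 64-bit layout. */
typedef uint64_t drm_userptr_t;
#endif

/* Hypothetical ioctl argument whose pointer field used to be a plain __u64. */
struct example_submit {
    drm_userptr_t cmds;  /* user pointer in disguise */
    uint32_t count;
    uint32_t flags;
};

#if !defined(__CHERI_PURE_CAPABILITY__)
/* On existing ABIs the layout must not change between 32- and 64-bit userspace. */
_Static_assert(sizeof(struct example_submit) == 16, "uAPI layout changed");
#endif
```

The point of the typedef is that existing architectures see no ABI change at all, while the new architecture only has to special-case one type instead of every struct.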

07:09 <rasterman> yeah - i'm sure they are

07:09 <airlied> we seem to use u64_to_user_ptr in a lot of places so might help point out some of them

07:09 <rasterman> thus really figuring out intent...

07:09 <rasterman> ie is this someone who just abused the abi realizing it could store a ptr

07:09 <rasterman> or is it that it was intended ...

07:09 <airlied> if there are any real ptrs they are errors or old legacy designs

07:10 sgruszka has joined #dri-devel

07:10 <airlied> u64_to_user_ptr seems to have pretty good coverage over what is a ptr

07:10 <rasterman> there's a bit of that floating about too... luckily i've managed to ignore the legacy stuff :)

07:11 mvchtz has quit [Ping timeout: 480 seconds]

07:11 <airlied> how does 64-bit userspace work on 129-bit ptrs kernel? just doesn't do magic ptrs?

07:12 flto_ has joined #dri-devel

07:12 frankbinns has quit [Remote host closed the connection]

07:12 <rasterman> yeah

07:13 <rasterman> the kernel just sees 64bit userspace ADDRESSES ... which fit in any kernel-side 129 bit capability (remember the 129th bit is sideband data so it's just 128bits in memory where the ptr would normally be)

07:13 <rasterman> so a lot of kernel code has moved to need uintptr_t for a lot of stuff

07:13 <rasterman> you have to hunt them down and follow the breadcrumbs

07:15 <airlied> I think once you find them all and fix u64_to_user_ptr to be whatever you need, things should be mostly ptr based inside drm

07:17 <rasterman> that's going to be fun. anyway - was just checking that things like user_data even tho they are 64bit types were intended to also hold ptrs... just was wondering who i should blame :)

07:17 <rasterman> ie what was the "contract"

07:18 flto has quit [Ping timeout: 480 seconds]

07:21 <airlied> yeah the important thing is it shouldn't change size across arches or when emulating one arch on another

07:21 MajorBiscuit has joined #dri-devel

07:21 <rasterman> yup

07:21 lemonzest has quit [Quit: WeeChat 3.6]

07:22 <airlied> https://docs.kernel.org/driver-api/ioctl.html for reference

07:22 <airlied> "The best workaround is to use __u64 in place of pointers, which requires a cast to uintptr_t in user space, and the use of u64_to_user_ptr() in the kernel to convert it back into a user pointer."
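The user-space half of the workaround quoted above can be sketched as follows. The helper names `ptr_to_u64` and `u64_to_ptr` are made up for illustration; in the kernel the reverse conversion is done by `u64_to_user_ptr()`, which this sketch does not reproduce.

```c
#include <stdint.h>

/* User space: widen a pointer into the fixed-size 64-bit ioctl field.
 * Going through uintptr_t first avoids a pointer-to-wider-integer cast
 * warning on 32-bit builds. */
static inline uint64_t ptr_to_u64(const void *ptr)
{
    return (uint64_t)(uintptr_t)ptr;
}

/* Illustrative inverse for user space, for values that came from ptr_to_u64()
 * in the same process. */
static inline void *u64_to_ptr(uint64_t value)
{
    return (void *)(uintptr_t)value;
}
```

Because the field is always 64 bits wide, a 32-bit process and a 64-bit process fill in the same struct layout. That is the property airlied is defending, and it is also the reason the scheme cannot carry a 128-bit capability.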

07:22 <rasterman> that's the "tricky" bit. i have to have all usual 64bit abi to stay as is with existing 64bit binaries. they have to "just work"™

07:23 <airlied> yeah for that it sounds like you'd need a compat ioctl layer

07:23 <rasterman> well we have one :)

07:23 <rasterman> compat32 ... :)

07:23 <airlied> initial 64-bit ioctl time was a horrible one

07:23 lemonzest has joined #dri-devel

07:23 <airlied> and there's a lot more abi now

07:23 <rasterman> well ok i've abused compat32 at the now compat64 layer

07:23 <rasterman> err as the compat64

07:25 MajorBiscuit has quit []

07:26 frieder has joined #dri-devel

07:31 MajorBiscuit has joined #dri-devel

07:37 <MrCooper> ayaka_: /dev/dri/card* do, /dev/dri/render* don't though

07:39 <ayaka_> MrCooper, yes, I know only a few drivers support render-only ioctl(). I mean the DRM_IOCTL_AUTH_MAGIC

07:40 <MrCooper> I know

07:40 <ayaka_> in current atomic request way, I didn't know how a client work here

07:41 <ayaka_> All I know is a wayland compositor like weston takes the ownership of the master node, then every access must through wayland protocol

07:43 <MrCooper> Wayland clients can also use /dev/dri/card*, but then they have to get their file description authenticated by the compositor

07:44 frankbinns has joined #dri-devel

07:46 <ayaka_> that means a wayland client could do something likes create gem object once that client gains the auth?

07:47 <MrCooper> yes, couldn't draw anything otherwise

07:49 <ayaka_> I have not seen an application do this. Why we need that, does any GPU userland driver use this mechanism?

07:50 <emersion> card+auth is legacy

07:50 <emersion> the render node should be used instead

07:50 mvlad has joined #dri-devel

07:52 <ayaka_> emersion, but render node can't even create a buffer in generic ioctl(), if an application want to do off screen render, its gpu can't even create a command buffer

07:52 <emersion> you can do off-screen rendering with render nods

07:52 <emersion> nodes*

07:52 <emersion> wlroots does it

07:53 <emersion> there is no generic API to allocate a buffer and render to it with the GPU

07:53 <emersion> you need to use GBM and GL/Vulkan

07:55 <ayaka_> maybe you are the right to ask. We are expanding v4l2 uAPI, someone suggest it could lead to V4L3. One thing we are trying to address is the memory allocation

07:56 <ayaka_> you see, drm could allocate memory from create_dumb with the help of a userspace library that could tell the size requirement or custom ioctl()

07:56 <ayaka_> also gbm could be a fine wrapper

07:57 <emersion> create_dumb is not a generic allocation ioctl

07:57 <ayaka_> but v4l2 must support allocate memory from a device's memory space(although it could be system memory)

07:57 <emersion> it's for a single purpose: allocating a buffer which can be software rendered into and scanned out by KMS

07:58 <ayaka_> it is widely used in embedded device

07:58 <emersion> dumb buffers are not for GPU rendering, not for video decoding, and not for usage outside of KMS

07:58 <emersion> yeah, it's widely *ab*used :)

07:59 <ayaka_> that is not what I could stop. We could still solve this for v4l2 or v4l3

07:59 tursulin has joined #dri-devel

07:59 <emersion> i don't know anything about v4l2, i only know about DRM

08:00 <emersion> also note, i'm not interested in downstream hacks done in embedded companies -- i only care about upstream

08:01 <ayaka_> I am thinking two things, 1. supporting import buffer with fb_id with an device id for v4l2 2. a command allocation hints between v4l2 and drm

08:01 <emersion> no, KMS FB ID is not suitable for cross device sharing

08:01 <emersion> it's for KMS only

08:01 <emersion> for cross device buffer sharing, there is DMA-BUF

08:01 <ayaka_> because more and more pixel formats are compressed, it is impossible for the other device to use

08:01 <emersion> the "command allocation hints" is something we've been talking for ages indeed

08:02 <ayaka_> I know a dma-buf could work but I wish a id for present a frame(all its planes)

08:02 <ayaka_> emersion, sorry, a common allocation hints

08:02 <emersion> yes

08:02 <emersion> the format modifiers does half the work

08:02 <ayaka_> likes which planes and planes would be CMA

08:02 donaldrobson has joined #dri-devel

08:03 <emersion> it allows you to describe tiling and compression and layout

08:03 <emersion> the other half (e.g. placement) is not solved, and needs something new

08:03 <ayaka_> well it is still not enough

08:04 <ayaka_> for example the same tile format, video device only support 64 alignment while display supports 64, 128 bytes alignment

08:04 swalker_ has joined #dri-devel

08:05 <emersion> yes, that's part of "the other half" indeed

08:05 swalker_ is now known as Guest7768

08:05 djbw has quit [Remote host closed the connection]

08:05 <emersion> ayaka_: the latest work on this was https://lpc.events/event/9/contributions/615/

08:05 <ayaka_> also modifiers are still not enough. For example, we have 2 planes of compressed graphics, 2 planes of compressed meta data. Where could we store the (un)compression options?

08:06 <emersion> you need to design your modifier bits properly

08:06 swalker__ has joined #dri-devel

08:06 RSpliet has quit [Ping timeout: 480 seconds]

08:06 <ayaka_> emersion, as I said, 4x64bits are not enough

08:07 <emersion> it's 1x64bits

08:07 <jani> mripard: that would still be me and sima *blush*

08:07 bmodem has quit [Ping timeout: 480 seconds]

08:07 <emersion> usually you start with lots of bits but in practice you're really interested in some combinations

08:08 <ayaka_> emersion, you could see the synaptics's modifier, they need 8x32bits for storing the compression options a plane

08:09 <emersion> if they need more bits, maybe they can use the metadata plane to store these

08:09 <ayaka_> besides the pixel format which is fixed

08:09 <ayaka_> meta data plane is for dma, while those options are written to registers directly

08:10 <emersion> are really all combinations of all options used in practice?

08:10 <ayaka_> we won't need cpu to access the meta data plane

08:10 <emersion> anyways, you should write your concerns to the dri-devel ML

08:11 <ayaka_> yes and it is a dynamic values

08:11 <emersion> drivers have claimed that 64 bits weren't enough before, and it turned out they were enough :P

08:12 <sima> jani, mripard I guess I'm missing context? what do I need to blush about?

08:12 Guest7768 has quit [Ping timeout: 480 seconds]

08:12 <emersion> and if you want to do cross-device buffer sharing in v4l2/3, you'll need to add an import IOCTL which takes a pixel format, a modifier, and multiple DMA-BUFs

08:13 <emersion> and if you want to fix the allocation problem, then we need to do a lot more work than that

08:13 <ayaka_> https://lore.kernel.org/linux-arm-kernel/202212011513.19kLY4e7-lkp@intel.com/T/#m9f2771fbb6a48ce8e2862e452a51b8b6c6484fae

08:13 <ayaka_> Also a 32 bytes parameters set would come with a compression meta data buffer.

08:13 <sima> ayaka_, yeah probably a dri-devel mail with all the info you need and your format description and everything that you'd ideally want is the best way to go

08:14 <ayaka_> sima, I did

08:14 <sima> thus far everyone who screamed that they need kilobytes of metadata eventually figured out that 56 bits are enough
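The "1x64bits" and "56 bits" figures refer to how a DRM format modifier is laid out: the top 8 bits identify the vendor and the remaining 56 bits are vendor-defined. The sketch below mirrors the arithmetic of the `fourcc_mod_code()` macro in `drm_fourcc.h`; the helper names themselves are illustrative and not part of any header.

```c
#include <stdint.h>

#define MOD_VENDOR_SHIFT 56
#define MOD_VALUE_MASK   0x00ffffffffffffffULL

/* Pack an 8-bit vendor ID and a 56-bit vendor-defined value into one modifier. */
static inline uint64_t mod_code(uint8_t vendor, uint64_t value)
{
    return ((uint64_t)vendor << MOD_VENDOR_SHIFT) | (value & MOD_VALUE_MASK);
}

/* Extract the vendor ID from the top byte. */
static inline uint8_t mod_vendor(uint64_t modifier)
{
    return (uint8_t)(modifier >> MOD_VENDOR_SHIFT);
}

/* Extract the 56 bits the vendor is free to define. */
static inline uint64_t mod_value(uint64_t modifier)
{
    return modifier & MOD_VALUE_MASK;
}
```

A single modifier applies to the buffer as a whole, so any per-plane compression options have to be folded into those 56 bits or carried elsewhere, which is what the rest of the discussion argues about.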

08:15 <ayaka_> for each of DRM-FORMAT-MOD-SYNA-V4H1-64L4-COMPRESSED frame, they would have a different compression options set

08:15 <emersion> +1 for an exhaustive documentation of all of the metadata you need and the reasons why you need it

08:15 <emersion> good, sima is already on the case :)

08:16 <sima> emersion, thx a lot :-)

08:16 <ayaka_> emersion, I don't know what those 8 bytes means actually, I just know they are dynamic compression values, are written to registers directly

08:16 <jani> sima: <mripard> jani: hey, do you know who's in charge of maintaining dim these days? I've had a PR stuck for months on gitlab

08:16 <sima> worst case we need a big table with the actually needed stuff, enumerated

08:16 <sima> jani, sounds like mripard volunteered :-)

08:16 <emersion> ayaka_: can you find out what they mean?

08:16 <jani> sima: :D

08:17 <emersion> where do they come from? who picks them?

08:18 <ayaka_> because they are 8 registers that you should read for a plane, the compression algorithm is not public

08:18 <emersion> but the values you write to the registers, where are they coming from?

08:18 <emersion> the algorithm is not public, but are the parameters for the algorithm public?

08:19 <ayaka_> from the video decoder hardware

08:19 <emersion> designing modifiers blindfolded doesn't sound like a very good idea to me

08:19 <emersion> i think we'll need more info here

08:20 <sima> javierm, I have vague recollections that we've finally added a generic "does this driver even support this format/modifier" test to addfb, but I can't find it?

08:20 <sima> am I dreaming?

08:20 <ayaka_> all I tell is if you miss one of that 8 set, the plane can't be uncompressed properly

08:20 <emersion> sima, that rings a bell

08:20 <mort_> robclark: drm is leaking like a sieve in 6.4.7 as well, here's an excerpt from the kmemleak output: https://p.mort.coffee/REk

08:21 <sima> emersion, iirc we've had to add it to the gem helpers because the generic code couldn't check things at the right place ...

08:21 <sima> but I seem to be extremely dense at git grep this morning

08:21 dos1 has quit [Ping timeout: 480 seconds]

08:21 <ayaka_> that is why I am talking about a buffer sharing mechanism for the devices from the same vendor

08:22 <emersion> sima, c91acda3a380bcaf41b67c8fbab668ef8ddf91c3

08:22 <sima> emersion, thanks a lot, you're awesome

08:22 <emersion> np <3

08:24 <sima> ayaka_, link to your mail because a quick search for modifier didn't yield much?

08:24 <emersion> ayaka_: what you're talking about seems a lot like what we had before explicit modifiers, and we're trying to move away from that

08:24 <sima> aside from I got sidetracked on other modifier stuff :-)

08:24 <ayaka_> emersion, maybe we could simplify this problem, for example, an compression pixel format with dynamic HDR metadata

08:24 <sima> yeah vendor specific hidden metadata is considered uncool these days

08:24 <emersion> i mean, your "same vendor import/export magic" stuff

08:25 <MrCooper> ayaka_: a modifier is a constant attribute of a buffer; something which changes every frame can't be part of the modifier but has to be in a plane instead

08:25 <sima> also this ^^

08:25 <ayaka_> MrCooper, yes, but it would lead to the performance issue

08:25 <MrCooper> well, there's no choice

08:27 <ayaka_> well, how to pass HDR metadata (it is not static, regard it as dolby vision), with a pixel format that has used 4 planes

08:28 <MrCooper> HDR metadata needs a separate channel I think, e.g. a Wayland protocol or KMS properties

08:28 <emersion> yeah

08:29 <emersion> note, the kernel only supports HDR static metadata so far

08:29 <emersion> but dynamic metadata would be somewhat similar

08:30 <ayaka_> kms properties may not be too bad, while it would be a problem to track which metadata belongs to which frame

08:30 dos1 has joined #dri-devel

08:31 <MrCooper> not with atomic KMS

08:31 <ayaka_> when you have lots of buffer instead of two

08:31 <MrCooper> just change the FB and metadata in the same atomic commit

08:32 mvchtz has joined #dri-devel

08:33 <ayaka_> well, I think the idea that sharing a whole frame between v4l2 and drm is not acceptable here. I would just implement it in the vendor kernel

08:34 <ayaka_> let the other vendors choose what they want

08:34 <emersion> why not do it the upstream way?

08:36 <ayaka_> I am not trying to hide any information here. KMS property is not too bad, unless you need to copy it from vdec hardware to kernel, kernel to userspace, then userspace to kernel, kernel to display hardware

08:36 <jani> mripard: merged now

08:36 <emersion> is the dynamic metadata that much data?

08:36 <ayaka_> in my plan, vdec to kernel(sharing drm frame structure) then kernel to display is enough

08:36 <emersion> static metadata is like a struct with 6 fields

08:37 <ayaka_> not that much, 64bytes for a 4 planes(2 graphics planes) frame

08:37 <emersion> that sounds very cheap

08:37 <emersion> trying to avoid the copies here sounds like a case of premature optimization

08:37 <ayaka_> while, you can't imagine how many buffers we would have

08:38 frieder has quit [Ping timeout: 480 seconds]

08:38 <emersion> you will have one new metadata blob per frame

08:38 <emersion> copying 64 bytes is trivial compared to all of the other stuff you need to do to display a frame

08:38 <ayaka_> in my ugly way, the userspace application didn't need to do any modifier

08:39 <ayaka_> do any modification

08:40 <emersion> your way may works well for your specific use-case with your specific hardware, but it doesn't work outside of this narrow scope

08:40 <emersion> may work*

08:41 <ayaka_> like GStreamer (although it doesn't support drm modifiers now) would work fine without setting the properties for a pixel format

08:41 <ayaka_> well you see, in this tile and compression time, pixel format can't be used outside the same vendor

08:42 <emersion> as i said, "for your specific use-case with your specific hardware"

08:42 <ayaka_> also userspace access to it is not necessary even it is not a secure buffer

08:42 <ayaka_> I believe many vendor would do the same things

08:42 <emersion> i said before that i'm not interested in helping downstream vendor hacks

08:43 <ayaka_> I am trying not to do so

08:45 <emersion> do you understand that we can't add a new uAPI which only works for your specific case?

08:45 <ayaka_> Android's GKI won't affect the DRM interface, because drm always could have a userspace which only that should follow a standard

08:45 <mripard> jani: awesome, thanks :)

08:46 <ayaka_> we don't need drm uAPI to fix for GKI

08:46 <ayaka_> we could leave that frame buffer sharing thing aside

08:46 <ayaka_> I think I could still work on that Allocation Constraints

08:47 <emersion> yeah, we do need to work on that

08:47 RSpliet has joined #dri-devel

08:47 rgallaispou has joined #dri-devel

08:49 frieder has joined #dri-devel

08:51 <karolherbst> if I want to get a commit from (without cc stable or fixes or anything, just mentioning it fixes something in the commit description) drm-misc-next into kernel stable trees as fast as possible, what would be the proper steps?

08:53 <emersion> why not cc stable?

08:54 <karolherbst> because apparently some devs rely on the stable bot script to be smart enough

08:55 <karolherbst> anyway, it already got pushed to drm-misc-next, just wondering what's the proper path from there

08:57 <ayaka_> emersion, my case is little different, it is more about iommu. And for a frame buffer, the graphics plane could use iommu while the metadata could never do that

08:58 <_jannau__> karolherbst: https://www.kernel.org/doc/html/v4.10/process/stable-kernel-rules.html#option-2

08:58 <ayaka_> while plane 0 and plane 1 could be contiguous, plane 1 and plane 2 could be not

08:58 <karolherbst> well.. it hasn't been merged into Linus' tree yet

08:58 <emersion> karolherbst: https://www.kernel.org/doc/html/v4.10/process/stable-kernel-rules.html#option-2

08:58 RSpliet has quit [Ping timeout: 480 seconds]

08:58 <karolherbst> and I kinda don't want to wait those 6 weeks

08:59 <sima> karolherbst, cherry-pick to drm-misc-fixes with a sha1 reference to the -next commit (so that people aren't too confused why a commit shows up twice) and add the cc: stable there

08:59 <emersion> sounds like it should've been pushed to drm-misc-fixes instead

08:59 <karolherbst> sima: thanks

08:59 <_jannau__> karolherbst: that's not compatible with the stable tree

08:59 <sima> emersion, time machines still don't exist yet, hindsight and all that :-)

08:59 <emersion> lol

08:59 <sima> karolherbst, dim cite $sha1 or you'll piss off some checkers for the sha1 reference

09:00 <karolherbst> yeah.. I have my own `git fixes` alias which I think is doing exactly the same thing or something

09:00 <sima> karolherbst, it's a bit a fallout from the linux kernel's funky process of making the release branches the main tree :-/

09:00 <sima> karolherbst, with sha1 reference I meant the sha1 of the commit in -next

09:01 <sima> so that people realize the duplicated commit was intentional, not a mistake

09:01 <ayaka_> but which DMA-heap it should allocate from may not be the part of api

09:01 <karolherbst> yeah, I know

09:01 <karolherbst> ohhh

09:01 <karolherbst> but yeah

09:01 <sima> gregkh will still get grumpy, but we do this often enough in drm that it's not a big deal

09:01 <sima> karolherbst, for the other thing you have dim fixes $broken_sha1

09:01 <karolherbst> but you can also just use cherry-pick -x, no?

09:01 * ccr . o O ( is gregkh ever un-grumpy? )

09:01 <sima> which also tries to guesstimate whether you need cc: stable

09:02 <sima> karolherbst, yeah but that only adds the sha1, not with the proper lkml approved commit citation format

09:02 <karolherbst> ahh

09:02 <sima> should abbrev the sha1 and add the commit title, so that it's less ugly and easier for random downstream trees to find what commit sha1 that patch is in their tree

09:07 cmichael has joined #dri-devel

09:08 <daniels> ayaka_: 'unimaginably large buffers' still isn't enough reason to avoid 64b copies; if you have 100 buffers in flight, then the 100x64-byte cost really does vanish into the line noise of the 100*1920*1080*1.5 you're already moving around per frame
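daniels' comparison can be checked with a few lines of arithmetic. The sketch assumes the 1.5 factor means 1.5 bytes per pixel, as in an 8-bit 4:2:0 format such as NV12; the function names are illustrative.

```c
#include <stdint.h>

/* Bytes in one 8-bit 4:2:0 frame: a full-size luma plane plus
 * half as much chroma, i.e. 1.5 bytes per pixel. */
static inline uint64_t yuv420_frame_bytes(uint64_t width, uint64_t height)
{
    return width * height * 3 / 2;
}

/* Total per-buffer metadata that would have to be copied. */
static inline uint64_t metadata_bytes(uint64_t buffers, uint64_t bytes_per_buffer)
{
    return buffers * bytes_per_buffer;
}
```

With 100 buffers in flight this gives 6,400 bytes of metadata against 311,040,000 bytes of 1080p pixel data, so the metadata copy is one part in 48,600 of the data already being moved.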

09:09 <karolherbst> sima: seems like most commits just have a (cherry picked from commit $full_hash) thing... would be a bit confusing to do it differently than anybody else. How did you mean to integrate the `dim cite` output into the commit message?

09:10 <sima> karolherbst, just replace the full length sha1 with the output of dim cite

09:10 <sima> and yeah maybe dim cherry-pick needs some fixing?

09:10 <sima> jani, rodrigovivi tursulin dolphin ^^ since that's just used by drm-intel

09:10 <karolherbst> my point was rather, if you do `git log | grep "cherry picked"` you almost only see the full length one :D

09:15 <karolherbst> but yeah.. if we want the layout to be different I guess dim should be updated there

09:19 RSpliet has joined #dri-devel

09:23 <ayaka_> daniels, I would leave this alone. You could regard this as application don't want to add a vendor specific branch

09:23 <ayaka_> that is not the intel case, that they are hiding modifier in the kernel, while intel would tell you you are using nv12 pixel format

09:28 <daniels> intel don't hide modifiers

09:28 <ayaka_> besides, it won't be against GKI, as long as I add the whole buffer allocation and attach api to the v4l2 or possible v4l3. When we get rid of M-variant pixel format we need a method to allocate multiple planes buffer in a call

09:29 <ayaka_> From my experience with its gen 9 gpu, the modifier is not visible between vaapi and drm

09:29 <ayaka_> until gpu gen 11th, it changes

09:32 <sima> karolherbst, they're pretty much all drm-intel-fixes cherry-picks, which are done with dim

09:33 <sima> everyone else carefully rebases trees so that fixes never show up in -next

09:33 <emersion> oh. i finally managed to do a rst cross-document section reference

09:33 <sima> ayaka_, intel libva was seriously asleep at the wheel wrt modifier support

09:33 tristan has joined #dri-devel

09:33 <sima> they only realized that they have to fix things asap when intel dgpu support happened

09:33 <emersion> libva doesn't fully support modifiers yet even

09:34 <sima> and when they tried to add some more modifiers to the old legacy implicit thing

09:34 <sima> emersion, yeah it's a dumpster fire :-/

09:34 tristan is now known as Guest7771

09:34 <emersion> just use vulkan ^^

09:34 <sima> ayaka_, we've also fully sunset that implicit path on latest gpus (finally, took way too long!)

09:34 <sima> emersion, yeah probably the answer

09:34 <sima> otoh thinking of intel's libva team contributing to mesa vk ...

09:35 <ayaka_> sima, there are more vendor drivers you don't know, realtek is one

09:35 <ayaka_> if it is not GKI, I won't need to develop that v4l2

09:35 <emersion> aha

09:36 <emersion> who would be the maintainer even? airlied?

09:36 <karolherbst> sima: I see

09:36 <karolherbst> still a bit hesitant of doing things differently than all the others though :D

09:40 <daniels> ayaka_: yes, there are a lot of drivers who will need to make the effort to do it properly if they want to become part of upstream

09:42 <daniels> we've been here before with graphics - ADF was an Android alternative to KMS which tried to bypass uAPI requirements and let vendors just freeform stuff anything they wanted to in there - it's dead now

09:43 <jani> karolherbst: are there any non-intel "(cherry picked from ...)" in the logs though?

09:44 <karolherbst> yeah

09:44 <karolherbst> but it feels like 95% are from intel

09:44 <jani> karolherbst: we only use that for drm-intel-next -> drm-intel-fixes/drm-intel-next-fixes

09:45 <jani> it's an artefact of us always applying all the patches to drm-intel-next (or drm-intel-gt-next), and cherry-picking the fixes from there

09:47 <jani> gregkh has been grumpy about it, but nobody's ever outright told us not to do it either. it just helps a lot with the committer model

09:47 <karolherbst> is greg grumpy about doing that or about the format used?

09:48 <karolherbst> anyway, if the format should be changed we kinda should do it in dim I guess

09:48 <karolherbst> and I'd rather not add another style scripts might have to deal with if we aren't going to change dim as well

09:50 <jani> I don't think there were ever any complaints about the format git cherry-pick -x produces. it was mostly about the fact that the referenced sha1's don't exist in upstream kernel yet, only in our branches and linux-next. they'll only make it to upstream kernel after the merge window

09:51 <karolherbst> yeah... I can see that causing some confusions

09:51 <karolherbst> some throw in the branch from where that commit comes from

09:51 <karolherbst> so I think that would make sense to include

09:52 <karolherbst> e.g. `(cherry picked from commit 1f682dc9fb3790aa7ec27d3d122ff32b1eda1365 in wireless-next)`

09:52 <jani> right

09:53 <jani> idk a lot of the time it just feels like flying below the radar is the best option, not make a fuss about it :p

09:53 <karolherbst> yeah.. so using a different format all of a sudden feels like doing the exact opposite :P

09:53 <jani> heh

09:54 <jani> there's also the never ending debates about Link: etc

09:54 <karolherbst> I wished all that stuff would be way more consistent across subsystems

09:54 <ayaka_> daniels, you can see many vendor drivers don't use drm at all

09:54 <karolherbst> what is there to debate about Link:?

09:54 <karolherbst> :D

09:55 <karolherbst> shouldn't have called it Link if you don't want random Links

09:55 <jani> well, some say it's just wrong to use it like dim does, i.e. adding a Link: back to the patch

09:55 <jani> some add Link: liberally to just about anything that looks like a link, but *also* to non-URLs

09:56 <karolherbst> as long as Linus merges it it can't be wrong (or something) :P

09:56 <jani> see the bit about staying below the radar ;)

09:58 <daniels> ayaka_: sure, they don't have to use DRM/KMS, but then they don't get in mainline kernels, so they're not our problem

09:59 <ayaka_> I just want to say I am not the worst guy as a developer for the vendor. The only barrier is the Android's GKI

10:00 <ayaka_> all I could do is offering a not too vendor specified method that could draw them back to use drm

10:01 <ayaka_> if there is not better option, I just leave it alone. We could work on what we could make progress

10:08 <sima> ayaka_, does android's gki allow you to shovel drm modifiers through at least, or not even that?

10:09 <ayaka_> sima, I could say even what intel does is allow

10:09 <sima> intel doesn't do the implicit thing anymore

10:10 <sima> and we're pretty much removing it everywhere else too for new platforms for existing drivers

10:10 <ayaka_> why that we cause a problem? it didn't break the code that all drm drivers are sharing

10:10 <sima> and new drivers in general don't get it

10:10 <sima> ayaka_, the goal of upstream is to build an ecosystem

10:10 <ayaka_> what would cause problem.

10:10 <sima> every vendor doing their own thing hurts that pretty fundamentally, so step-by-step we're replacing these vendor tricks

10:11 <sima> and you could argue that the drm design for formats/modifiers is bad and a pain, but it's kinda 10 years too late for that argument

10:11 <sima> so unless it's a case of "it cannot work" we'll keep with the current thing

10:11 <ayaka_> yes, I know. also we could only have 4 planes

10:11 <sima> because rolling out these ecosystem changes takes decades

10:12 <sima> allowing more planes is a fairly minor change, I /think/ all the vk/gl extensions would allow that already

10:13 <emersion> sima, i assume this patch needs at least one ack from a drm person? if so, any chance you could ack? https://patchwork.freedesktop.org/patch/547819/

10:13 <ayaka_> well, that may could help, we could hide those cpu cache data in plane 4 and plane 5

10:14 <ayaka_> but I don't think this option would be available until next EGL spec update

10:14 <emersion> also, if anyone is up for doc review, this one still needs attention: https://patchwork.freedesktop.org/patch/547783/

10:16 <sima> emersion, a-b: me on both

10:16 <emersion> ty

10:16 <sima> ayaka_, yeah if we have to rev a bunch of extensions then that's a bit annoying

10:16 <emersion> lol patchwork added a literal "Acked-by: me" line

10:16 <sima> :-)

10:19 <ayaka_> I believe I would still need to deal with those old code in next 5 years. Besides, we still need to solve the allocation problem at least

10:20 <sima> ayaka_, another trick that we iirc used for afbc is that sometimes the planes have a fixed layout

10:20 <sima> like nv12

10:20 <sima> and so logically it's multiple planes, but you only need one plane slot to describe the buffer

10:20 <sima> since I think afbc had the "we need more than 4 planes" issue too

10:20 <sima> ayaka_, which allocation problem?

10:20 <ayaka_> unfortunately, we can't

10:21 <emersion> sima, the unix allocator

10:21 <ayaka_> two planes need to allocate from the shm not dma

10:21 <emersion> placement, alignment, etc

10:22 <ayaka_> sima, even for a NV12 vendor tiled here, there is a padding line between y and uv plane

10:23 Guest7771 has quit [Read error: Connection reset by peer]

10:23 <ayaka_> also the address must be page aligned

10:26 heat_ has joined #dri-devel

10:26 <sima> yeah that's not nv12 anymore ...

10:26 <ayaka_> I forgot to mention the secure session id property, although it is not a part of pixel format like the (un)compression options

10:27 <sima> uh those are a mess, we only have I think 2 drivers that support secure buffers

10:27 <sima> and only very limited use-cases

10:27 <sima> (in upstream)

10:27 <sima> so that stuff is handled as part of the per-vendor gem render uapi right now

10:27 <ayaka_> it is the thing we use to encrypt and decrypt the memory content to prevent memory freezing (cold boot) attacks

10:28 <sima> yeah i915.ko and amdgpu.ko support memory encryption like that too

10:28 <ayaka_> but you could still use the generic render api

10:28 <sima> but those buffers aren't shareable

10:28 <sima> there's no generic drm render api

10:29 <sima> mripard, most of the basic plumbing is (drm) infoframes helpers already

10:29 <ayaka_> why secure buffer is not shareable, it is ok for the user to know its fb_id

10:29 <ayaka_> that would be useful for audio and video sync

10:29 <sima> so not sure how much more you'd want to extract from i915, at least without 1-2 more drivers to show what's actually common and what not

10:30 <sima> ayaka_, shareable across drm drivers I mean

10:30 <ayaka_> so if we could solve this cross sharing fb_id, you could make a function generic

10:30 <ayaka_> into the sunlight

10:30 <sima> fb_id ... what do you mean with that one?

10:30 <ayaka_> well, both intel and amd are vaapi

10:31 <ayaka_> the fb_id is what drm present a whole frame buffer(with all its plane)

10:31 <sima> afaik the secure buffer stuff only works with egl extensions (unless you grab some proprietary vaapi with some vendor extensions)

10:31 <sima> fb_id aren't shareable at all

10:31 <ayaka_> nope, it could work with v4l2 stateful api, also stateless if they listen to my design first

10:31 <emersion> fb_id is tied to KMS, it's not a good basis for cross device sharing

10:32 <sima> emersion, I'd be impressed if you manage to :-)

10:32 <ayaka_> I know your point here. what I am trying to share is the per-vendor struct that drm uses to present a framebuffer

10:32 <emersion> IOW, one may want to do cross-device sharing without KMS involved

10:32 <sima> ayaka_, ah yeah that makes sense, since these drm fb metadata pieces are also used by vk/gl extensions

10:32 <emersion> sima, i mean, it would always be _possible_ to add a KMS FB file descriptor

10:32 <ayaka_> I know fb_id is only unique to a device not cross the drm device

10:33 <sima> unfortunately neither vaapi nor v4l understand them fully, and the patches to fix that have been stuck for years :-(

10:33 <sima> emersion, could just share the drm kms fd and yolo ...

10:33 <emersion> aha

10:34 <emersion> who cares about races

10:35 tristan has joined #dri-devel

10:36 <sima> but more seriously, drm fb is meant to be a pure metadata container, there's really no point in sharing that object itself :-)

10:36 tristan is now known as Guest7774

10:36 <sima> ayaka_, another extension idea would be to add properties to drm_fb, so that you could add all kinds of extensions

10:36 <emersion> i think ayaka's point is that there would be value in a share-able metadata container, that way it's easier to add new metadata fields without plumbing the world

10:37 <sima> like entire drm blobs for big amounts of metadata

10:37 <daniels> (unless your goal is to create a side-channel you can stuff tons of opaque data into - a design we've consistently rejected in the past)

10:37 <daniels> heh

10:37 <sima> doesn't solve the issue of how to pass it around in userspace

10:37 <ayaka_> sima, yes, we have talked about this, too much times of copying

10:37 <emersion> but i don't know if it's a good or bad idea

10:37 <sima> ayaka_, I don't buy that, unless you can show the overhead

10:37 <ayaka_> and that lead to vendor branch code as that

10:37 <sima> like atomic ioctl is extremely non-optimized, and thus far no one cared

10:37 <emersion> :)

10:38 <emersion> hm, well…

10:38 <emersion> i do care ;_;

10:38 <emersion> the core DRM part seems fine

10:38 <emersion> the driver-specific part causes issues

10:38 <sima> emersion, which parts?

10:38 <emersion> like, i miss frames on amdgpu if i try to do a few test commits

10:39 <sima> uh yeah that's a bit much overhead

10:39 <ayaka_> blob id is an option, but that can't be shared

10:39 <sima> emersion, for simple updates iirc i915 gets a few k commits/s or so

10:39 <emersion> they bw computation code takes ages in some cases

10:39 <emersion> their*

10:40 <sima> emersion, hwentlan_ ^^ I guess you know?

10:40 <emersion> iirc vsyrjala had a benchmark patch for i915

10:40 <ayaka_> and I don't like the idea that we need to call ioctl() first to get its size then let the kernel fill the buffer in second ioctl()

10:40 <emersion> hm i'm pretty sure i had an issue for this, but can't find it back

10:41 <ayaka_> let's back to the secure buffer case, that may be a good case

10:41 <sima> emersion, amdgpu is also pretty bad because they still have a big split between drm and dc data structures

10:41 <sima> so for a _lot_ of things they need to grab all the states and recompute everything

10:42 <emersion> oh yeah

10:42 <sima> which is not going to be great, but also just a bit a result of their currently still too big impedance mismatch between drm and dc

10:42 <ayaka_> I allocate a framebuffer it could contains several planes, but people can't access a plane independently, besides I don't want people know its physics address(It doesn't)

10:42 <emersion> clearly we need more abstraction layers to make the thing easier to understand :P

10:42 <emersion> i mean, i understand why it's like this but…

10:43 <sima> emersion, nah just moving more of the dc state into drm states

10:43 <ayaka_> but for a secure buffer, it could be a buffer id that sharing between REE and TEE, that only TEE know where the buffer is

10:43 <sima> like was done more on the object side of things

10:43 <sima> essentially demidlayer it, so that the dependent state recomputation can be partial

10:43 <emersion> maybe if dc had more of a collection of helpers design, rather than midlayer… it would blend better

10:43 <sima> yeah

10:43 <sima> but it's a ton of work to get there

10:43 <emersion> indeed

10:44 <sima> also I think i915 does a lot of tricks of only validating against current fifo settings

10:44 <sima> and recompute optimal ones only later or when really needed

10:45 <sima> that keeps cascading computations in atomic_check at bay

10:48 frieder has quit [Ping timeout: 480 seconds]

10:50 <ayaka_> btw, I didn't get what those dma fence or sync object are used for in drm, that is because drm doesn't have a queue(ping-pong is not a queue) and gpu(display) could scan out a buffer while the render is also updating it?

10:52 <emersion> it's for explicit sync

10:52 <emersion> once again removing implicit stuff

10:53 <emersion> explicit sync is a requirement for Vulkan and Android

10:54 <ayaka_> yes, I know android has this before vulkan in egl. I just don't get it why we need a sync point here that our v4l2 driver didn't really care

10:55 <ayaka_> I know what sync event is used for opencl because parallel group working

10:56 <mort_> https://p.mort.coffee/npO.diff this is one way to fix a kernel memory leak...

10:57 frieder has joined #dri-devel

10:59 bmodem has joined #dri-devel

11:00 <emersion> ayaka_: GPU rendering is asynchronous

11:00 <emersion> i don't know about video en/decoding

11:02 <ayaka_> video have the similar idea like video slice or tiles

11:03 <ayaka_> we could only decode like only few top lines to damage an area

11:03 <ayaka_> that is maybe useful for high frequent rate display and video

11:03 <emersion> by "asynchronous", i mean that submitting a GPU command buffer does not wait for completion

11:04 <emersion> so, if you draw, then read, the draw might not be complete by the time you read

11:04 <ayaka_> yes, I know, if we wait until it finishes, userspace needs to do a flush

11:05 <emersion> no, a flush is not enough

11:05 <emersion> a flush only ensures that the GPU command buffer has been submitted to the hw

11:05 <ayaka_> there are front and back surface in a EGL surface

11:05 <emersion> glFinish() will wait for completion

11:05 <emersion> glFlush() will not

11:05 <ayaka_> yes glFinish() cost a lot
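The flush versus finish distinction emersion draws can be modelled with a toy queue. This is only a sketch of the idea in Python, not how any GL driver is implemented: `ToyGpu` and its methods are hypothetical names, a worker thread stands in for the hardware, and the sleep stands in for rendering time.

```python
import queue
import threading
import time


class ToyGpu:
    """Toy model: flush() submits work, finish() also waits for completion."""

    def __init__(self):
        self._pending = []               # commands recorded but not yet submitted
        self._hw_queue = queue.Queue()   # commands handed to the "hardware"
        self.completed = []              # commands the "hardware" has finished
        threading.Thread(target=self._run, daemon=True).start()

    def _run(self):
        while True:
            cmd = self._hw_queue.get()
            time.sleep(0.05)             # pretend the GPU is busy rendering
            self.completed.append(cmd)
            self._hw_queue.task_done()

    def draw(self, name):
        self._pending.append(name)

    def flush(self):
        # Hands everything to the hardware and returns immediately.
        for cmd in self._pending:
            self._hw_queue.put(cmd)
        self._pending.clear()

    def finish(self):
        # Submits, then blocks until the hardware has completed all of it.
        self.flush()
        self._hw_queue.join()


gpu = ToyGpu()
gpu.draw("triangle")
gpu.flush()
print("after flush: ", gpu.completed)   # the draw may well not be done yet
gpu.finish()
print("after finish:", gpu.completed)   # the draw is guaranteed to be done
```

Reading the result right after `flush()` races with the worker thread, which is the situation explicit sync objects are meant to handle without paying for a full `finish()`.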

11:06 Net147 has quit [Quit: Quit]

11:07 Net147 has joined #dri-devel

11:08 <ayaka_> I may need to look into its api. I was wondering when we get a front surface, then commit it to kms, if that front surface is completion, what the driver should do?

11:08 <ayaka_> using the fb_id in the previous atomic commit?

11:08 Guest7774 has quit [Ping timeout: 480 seconds]

11:10 <emersion> if the buffer submitted to KMS is not ready at vblank time, the previous frame is re-used, yes

11:11 f11f12 has joined #dri-devel

11:15 flto_ has quit []

11:15 flto has joined #dri-devel

11:17 pochu_ has joined #dri-devel

11:19 pochu has quit [Ping timeout: 480 seconds]

11:24 donaldrobson has quit [Ping timeout: 480 seconds]

11:25 donaldrobson has joined #dri-devel

11:30 <mripard> sima: it looks like I can't find a way to express my point on that either, so I'll just stop. I don't want to create any unnecessary frustration or tension there, and the discussion is way out of scope for that particular patch series

11:32 <sima> mripard, I think I get your point, I just don't agree

11:32 <sima> getting hdmi is a big dumpster fire and took years for i915

11:33 <sima> but throwing in the towel and declaring the entire issue a userspace problem doesn't help, because userspace is in an even worse position to handle things correctly

11:33 <mripard> that's not what I was saying

11:34 dviola has quit [Quit: WeeChat 4.0.2]

11:34 <Venemo> anholt: how do I tell deqp-runner how many threads I want it to use?

11:34 <sima> mripard, I thought this was specifically about i915 having hdmi infoframe code that no one else does?

11:35 <mripard> I guess what I was saying is that "well, i915 is fixed so we should just ignore it" is kind of throwing the towel as well

11:35 <pendingchaos> Venemo: "-j n" or "--jobs n"

11:35 <Venemo> thanks

11:36 <mripard> anyway, yes, it's a dumpster fire, it will probably take a long time to fix for all the other drivers as well, and you made it clear that margins are not the right solution and we should fix every thing else

11:36 <mripard> so I guess we agree on the most important things there

11:37 <mripard> the rest are technicalities

11:39 <sima> mripard, definitely didn't want to get "i915 is done, it's all good" across

11:40 <sima> just wanted to highlight an example of where hdmi drivers probably would need to be, since the hardest part was figuring out what's all needed

11:40 <sima> replicating and extracting more helpers should be much easier

11:40 kts has joined #dri-devel

11:41 <sima> mripard, I was honestly surprised that aside of i915 no one else seems to use these infoframe helpers fully

11:42 yyds has quit [Remote host closed the connection]

11:43 itoral has quit [Quit: Leaving]

11:45 JohnnyonFlame has joined #dri-devel

11:51 JohnnyonFlame has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

11:51 JohnnyonFlame has joined #dri-devel

11:59 kts has quit [Quit: Leaving]

12:09 digetx has quit [Ping timeout: 480 seconds]

12:10 digetx has joined #dri-devel

12:11 <dliviu> hello, drm-misc-next maintainership question: I have a patch that did not Cc dri-devel, only sima and airlied directly. Noticed when trying to apply with dim. What is the protocol? Should I ask to resend?

12:15 Ahuj has joined #dri-devel

12:16 Danct12 has quit [Quit: WeeChat 4.0.2]

12:22 YuGiOhJCJ has quit [Ping timeout: 480 seconds]

12:23 tristan has joined #dri-devel

12:24 tristan is now known as Guest7783

12:27 <alyssa> jenatali: windows runners were down yesterday (IDK if they still are) so couldn't test windows with nir 2.0 MR, you might want to give that a smoke test

12:28 <jenatali> alyssa: Pretty sure I clicked play on the Windows jobs on at least one pipeline

12:28 <alyssa> :+1:

12:31 YuGiOhJCJ has joined #dri-devel

12:36 Company has joined #dri-devel

12:41 <alyssa> https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24432 really earns its "all jobs" CI pipelines :P

12:51 <jenatali> Ugh, the D3D jobs depend on clang-format now?

12:51 <jenatali> That's a huge pain

12:51 Armada has quit [Remote host closed the connection]

12:51 Armada has joined #dri-devel

12:52 <jenatali> I need to undo that, there's no good reason to trigger a Linux container just to run Windows jobs and it means I can't run just the jobs I want with one click from the UI anymore

12:52 <daniels> jenatali: fwiw, .gitlab-ci/bin/ci_run_n_monitor.py --target dozen-deqp is what you want

12:53 <jenatali> Yeah but as I've said, Windows doesn't have a way of storing tokens for that script which makes it a huge hassle every time I want to use it

12:53 <jenatali> Which I guess I could try to fix that instead, but also I don't think there's value in depending on clang-format for the Windows jobs

12:54 * alyssa regrets putting clang-format in CI

12:54 <jenatali> Unless there was a separate clang-format job that ran on the Windows runner, which I also don't think is valuable

12:56 godvino has joined #dri-devel

13:13 frankbinns has quit [Remote host closed the connection]

13:16 godvino has quit [Quit: WeeChat 3.6]

13:18 vliaskov__ has quit [Ping timeout: 480 seconds]

13:29 Guest7783 has quit [Ping timeout: 480 seconds]

13:35 yuq825 has left #dri-devel [#dri-devel]

13:35 YuGiOhJCJ has quit [Quit: YuGiOhJCJ]

13:36 <javierm> sima: sorry, I missed your message before because I was on PTO, but I see that emersion already answered

13:41 <hwentlan_> sima, emersion, yes, I'm aware

13:42 heat_ has quit [Read error: Connection reset by peer]

13:42 heat has joined #dri-devel

13:43 jkrzyszt_ has quit [Ping timeout: 480 seconds]

13:45 tristan__ has joined #dri-devel

13:47 Haaninjo has joined #dri-devel

13:51 <MrCooper> emersion sima hwentlan_: there's https://gitlab.freedesktop.org/drm/amd/-/issues/1740 but it was timeout-closed by Mario

13:51 <emersion> ah yes that one

13:51 <hwentlan_> yeah, that ticket is still valid

13:52 <emersion> i really don't like that auto-close policy

13:52 <emersion> i spent a lot of time collecting info about bugs and then it all goes to the void

13:52 <hwentlan_> but the reason most things take long is because they go to our DML (display mode lib) for bandwidth computations which is massive and therefore slow

13:53 <hwentlan_> it's not an easy problem to solve

13:53 <emersion> yeah…

13:53 <sima> yeah 2ms for 5 test_only atomic calls is a bit much ...

13:54 <sima> well it just seems to be one really that's bad

13:54 <sima> *really bad

13:55 <emersion> yeah, really depends what you test for

13:56 <hwentlan_> is this mostly about enabling planes?

13:56 donaldrobson_ has joined #dri-devel

13:57 donaldrobson has quit [Ping timeout: 480 seconds]

13:57 heat has quit [Remote host closed the connection]

13:57 <hwentlan_> one could probably pre-compute certain common scenarios and cache them, but it wouldn't be pretty and would be sub-optimal since you can't pre-compute everything

13:58 heat has joined #dri-devel

14:00 <sima> hwentlan_, yeah a bit of grepping in the attached dmesg and it's only plane changes nothing else

14:01 <sima> also only one crtc

14:02 <sima> so should be a substantial subset of the overall atomic state and I have no idea how you managed to burn down over a ms on computing stuff with that

14:02 <MrCooper> https://gitlab.freedesktop.org/drm/amd/-/issues/2186 seems like higher priority though; can make moving the mouse cursor very painful on my gaming rig

14:02 <sima> but iirc amdgpu dc is pretty aggressive in escalating to "grab all states, recompute everything"

14:04 <sima> the 0.1ms is more in line with "atomic ioctl is just not very fast" stuff I think

14:04 f11f12 has quit [Quit: Leaving]

14:06 TMM has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

14:06 TMM has joined #dri-devel

14:06 <sima> for that I'd expect that profiling is needed to make sure you knock out the right inefficiencies, since they're absolutely everywhere

14:07 <sima> maybe with vkms with some planes and just pushing through test_only commits as fast as possible

14:10 fxkamd has joined #dri-devel

14:10 <robclark> mort_: hmm, well apq8016 has legacy cursor.. and is using disp/mdp instead of disp/dpu.. could be something related to one of those two things? Maybe try sw cursor to rule out something cursor related?

14:15 <hwentlan_> MrCooper, I agree

14:18 heat_ has joined #dri-devel

14:18 heat has quit [Remote host closed the connection]

14:25 sgruszka has quit [Ping timeout: 480 seconds]

14:27 angerctl has joined #dri-devel

14:29 <mort_> robclark: I don't know if I'm configuring X correctly .. but I tried to enable swcursor, and it doesn't seem to have stopped the leak

14:29 <mort_> I created a /usr/share/X11/xorg.conf.d/20-swcursor.conf with: Section "Device" \n\t Identifier "Card0" \n\t Option "SWCursor" "true" \n EndSection, which should turn on software cursor from what I can tell

14:30 Namarrgon has quit [Ping timeout: 480 seconds]

14:32 <robclark> if you enable some drm.debug traces, you should see mouse cursor (while nothing else is changing on screen) generate drm traces if hw cursor is used, but not otherwise.. or otherwise maybe check /proc/interrupts (hw cursor updates will enable vblank irq)

14:32 <MrCooper> I'd check the Xorg log file for whether the option is actually taking effect

14:32 <robclark> ahh, yeah, that might be easier

14:37 <mort_> the log does contain '(**) modeset(0): Option "SWcursor" "true"' so yeah, that's taking effect

14:38 rasterman has quit [Quit: Gettin' stinky!]

14:40 cmichael has quit [Ping timeout: 480 seconds]

14:41 fxkamd has quit []

14:41 fxkamd has joined #dri-devel

14:42 <robclark> mort_: ok.. then in theory you would see this with kmscube? That might be an easier/simpler thing to debug.. plus it can use either legacy pageflip like xf86-video-modesetting and atomic ioctl

14:45 <mort_> robclark: kmscube says "failed to set mode: permission denied", or, if I run it with -A, it says "failed to commit: permission denied"

14:46 <mort_> that's running without either a window manager or a compositor, if that matters

14:46 <robclark> kill xorg first, if you haven't

14:46 <mort_> ohh I assumed this was an X application

14:46 <robclark> nope

14:46 <javierm> mort_: no it's just a kms app. https://gitlab.freedesktop.org/daniels/kms-quads is another nice KMS test app

14:47 <mort_> it's leaking both by default and with -A

14:47 <robclark> hmm, ok

14:50 <mort_> as a hack, could I increment the crtc refcount in the alloc and then just plain kfree it in atomic_state_default_release, or is it meant to still be used after the release

14:51 frieder has quit [Remote host closed the connection]

14:53 <robclark> it is needed after the release if someone is waiting on one of the completions

14:54 tristan__ has quit [Remote host closed the connection]

14:55 tristan__ has joined #dri-devel

14:59 <robclark> looking at the kmemleak hexdumps.. it is leaking a single reference (the kref starts at the 9th byte)

14:59 <mort_> I have also found with my prints that the leaked commits end up with a refcount of 1

15:00 yyds has joined #dri-devel

15:00 <mort_> robclark: I am happy to help debug this if you want, and help debugging it would be appreciated. However, until and unless we figure it out, I will work around it by having a pool of allocated commit objects which I cycle through and re-use, so I don't think I will work on this myself outside of getting information for you

15:01 <sima> ... catching up: are we leaking struct drm_atomic_state?

15:02 <mort_> yeah

15:02 <robclark> sima: https://p.mort.coffee/REk fwiw

15:02 <mort_> with the msm drm driver, on my hardware (but not robclark's), on recent kernels (both 6.1.34 and 6.4.7 are tested), a drm_crtc_commit structure is leaked from drm_atomic_helper_setup_commit every flip

15:02 MrCooper has quit [Remote host closed the connection]

15:03 tristan__ has quit [Ping timeout: 480 seconds]

15:03 <sima> hm yeah that's leaking like a sieve :-/

15:03 <robclark> I'm not seeing anything else leaked, so it isn't like we are leaking crtc state or something

15:03 <mort_> yeah

15:03 <sima> well anything in there is still referenced I assume

15:04 <robclark> could be mdp5 vs dpu.. but driver itself doesn't touch the commit obj

15:04 <sima> it's a bit of work, but might be worth it to wire up ref_tracker.h support

15:05 <sima> ofc the thing is gloriously undocumented :-(

15:05 Duke`` has joined #dri-devel

15:06 MrCooper has joined #dri-devel

15:06 ndufresne has joined #dri-devel

15:06 <mort_> sima: see https://p.mort.coffee/DVX, I manually traced inc/dec :p

15:08 <dliviu> asking again: is it OK to submit a patch in drm-misc-next that doesn't have a link to patchwork because dri-devel was never Cc-ed?

15:08 <sima> oh it's struct drm_crtc_commit

15:13 <mripard> we had a similar issue for vc4 at some point, but I can't retell how we fixed it up https://github.com/raspberrypi/linux/issues/4474

15:14 <mripard> hopefully it will help :)

15:15 <mripard> looking at the offending patch, I think we were doing a get in setup_commit and one in destroy_state

15:15 <mripard> but we were also doing a get in duplicate_state

15:15 <mripard> so the refcounting was off

15:22 bmodem has quit [Ping timeout: 480 seconds]

15:28 MajorBiscuit has quit [Ping timeout: 480 seconds]

15:29 kzd has quit [Ping timeout: 480 seconds]

15:30 <sima> mort_, robclark I think I got it

15:31 MrCooper has quit [Quit: Leaving]

15:32 <mort_> sima: ooo, interesting, do tell

15:32 <sima> https://paste.debian.net/hidden/8a28a3f8/

15:32 <sima> broken since years it seems

15:33 <sima> 2017 is when we added refcount for the plane_state->commit pointer

15:33 jfalempe has quit [Quit: Leaving]

15:33 <sima> mort_, testing would be much appreciated, if it checks out I'll bake it into a proper patch

15:34 <sima> took me a while to load all the stuff into my brain again, since for a simple flip we have like 5 references floating around ...

15:36 <mort_> fwiw, this patch also "fixes" (or, well, works around) it: https://p.mort.coffee/P4n.diff, I'm running that right now and it's not leaking, yet graphics is working

15:36 <mort_> but I will check out that patch, it's a much more proper fix

15:38 <sima> mort_, yeah that's just very dangerous duct-tape

15:38 <mort_> indeed

15:39 <sima> I'm pretty sure I've found it, because reconstructing the refcount leak with your log pointed me at exactly the code that was buggy

15:39 <sima> mdp5_plane_destroy_state wasn't dropping the refcount for plane_state->commit like it should

15:39 <sima> and that's been broken since 21a01abbe32a roughly

15:40 MrCooper has joined #dri-devel

15:40 <mort_> the fix looks very logical

15:41 FireBurn has quit [Quit: Konversation terminated!]

15:45 yyds has quit [Remote host closed the connection]

15:47 benjaminl has joined #dri-devel

15:49 <robclark> sima: oh, good catch, looks like mdp5 missed conversion as more stuff got added to plane state

15:53 <daniels> emersion: so apparently I don't know how to work git-send-email anymore, but thanks for the necromancy https://lists.freedesktop.org/archives/dri-devel/2023-August/417333.html (cc sima, anyone else interested in dmabuf/uapi)

15:53 <emersion> ohhhhhhhhh

15:53 rgallaispou has left #dri-devel [#dri-devel]

15:53 <emersion> <3

15:53 <emersion> i've linked that doc patch *so many times*

15:53 <daniels> I'd like to say two years is a new low, but it's probably not :\

15:54 <sima> emersion, you'll do the honors of applying it?

15:55 <emersion> i'll read it up and, yeah, apply it

15:55 <javierm> daniels: the glossary section is very useful!

15:55 <emersion> ah the missing cc's

15:56 Ahuj has quit [Ping timeout: 480 seconds]

15:56 <daniels> javierm: you can thank pq for that! I just copy & pasted and added like two things

15:56 <javierm> daniels: cool, thanks pq :)

15:57 <daniels> emersion: yeah, turns out if you use git-format-patch so you can write revision notes, then use git-send-email to send those, the Cc in your cover letter only applies to the 0/n, not to the individual patches

15:57 <daniels> it also turns out that it won't cc everyone on the thread if you didn't bother telling it to

15:57 <emersion> git send-email --annotate

15:58 <emersion> in DRM also we put the CC list in the commit message

15:58 <javierm> daniels: I've moved to patman almost a decade ago and never looked back

15:58 <emersion> git send-email will add everybody automatically

15:59 <emersion> but yeah, should really write a git-send-email alternative which doesn't suck

15:59 <sima> apparently it's called b4 or something

16:00 <javierm> emersion, sima: did you ever try patman? https://u-boot.readthedocs.io/en/latest/develop/patman.html

16:00 <_jannau__> another one? patman and b4 are already much better

16:00 <emersion> b4 seemed really tied to the kernel workflow last time i looked

16:01 <sima> emersion, I think the only thing we might want to add in a follow-up is that for non-linear format everyone needs to use the stride computation like drm has for that format/modifier combo, or things break

16:01 <sima> that's one thing that has led to really long threads in the past

16:01 <emersion> i just want a stateful tool which remembers version/cc/etc depending on the branch i'm in

16:01 <emersion> ah, yeah, good point

16:03 <daniels> javierm: I've moved to GitLab

16:04 <daniels> emersion: from what I've seen of patman, it's the closest thing indeed to that

16:04 <daniels> emersion: TIL --annotate!

16:04 <_jannau__> b4 depends on email based workflow since it keeps the metadata in an empty commit

16:04 <sima> s/impliied/implied/

16:04 <sima> emersion, ^^

16:05 <emersion> i'll need to check these tools out again

16:05 <javierm> daniels: I'm part of the patman evangelism strike force :P

16:05 <javierm> having all the patch metadata in the commit messages and just running patman is amazing. And you even have a --dry-run option

16:06 <daniels> yeah, patman is definitely the right thing for email workflows

16:07 djbw has joined #dri-devel

16:07 <sima> emersion, a-b: me on both, just finished reading

16:07 <emersion> cool

16:08 <sima> Care and attention should be taken to ensure that

16:08 <sima> + zero as a default uninitialized value signals no modifier.

16:08 <sima> ^^ maybe add ", and must not be accidentally mixed up with DRM_FORMAT_MOD_LINEAR, which equals zero"

16:09 <sima> but fine either way

16:09 <emersion> hm, zero does not mean "no modifier" then?

16:10 <emersion> also zero is not "uninitialized"

16:13 <sima> I guess it depends, some interfaces guarantee that no modifier is signalled with MOD_INVALID

16:14 <sima> some have an out-of-band flag (like addfb2/getfb2)

16:14 <emersion> yeah

16:14 <sima> I thought the note is just to make sure that you don't accidentally mix things up since it's confusing

16:17 <ayaka> I watched XDC 2022 | Explicit Synchronization for Linux Display Servers

16:17 <sima> maybe we should clarify this in a follow up

16:18 bgs has joined #dri-devel

16:19 <daniels> that's ... not what I meant to write

16:19 <ayaka> I think the sync point is only used to signal an event such as a frame having been rendered (by the GPU); can't it be used to signal that part of the rendering is done?

16:19 <daniels> missing some words I think

16:20 <daniels> something like 'Care and attention should be taken to ensure that zero (as a default uninitialized value or boolean comparison) is **not** confused with no modifier.'

16:20 <daniels> total sense inversion ffs

16:20 <ayaka> besides, why do we use IN_FENCE_FD? shouldn't userspace send the framebuffer in an atomic request only after it has been notified?

16:20 <emersion> i'd use some other word than "uninitialized" i think

16:20 <daniels> ('don't be confused! do the opposite of what you should!')

16:20 <daniels> emersion: just 'default' or?

16:21 <emersion> "uninitialized" in C means that the contents can be anything

16:21 <daniels> ayaka: in gfx, fences are only ever used to signal full completion of a frame. there would be no point in giving a Wayland server a fence which signaled when half the frame was complete

16:22 <ayaka> daniels, then why is amd trying to implement async page flip

16:23 <daniels> ayaka: they're totally different things

16:23 <emersion> daniels, do you mean this? https://en.cppreference.com/w/c/language/initialization#Empty_initialization

16:23 <ayaka> async page flip scans out the remaining part of the screen from a new frame

16:23 <daniels> emersion: I guess I meant 'explicitly initialised but to all-zero as a default value that's totally safe as a "nothing here" sentinel for everything except FDs and modifiers'

16:24 <daniels> emersion: but that seems overly wordy

16:24 <daniels> ayaka: yes, which is different to beginning to scan out something before anything's been rendered to it

16:24 <emersion> "zero as a default value when omitted"?

16:24 <ayaka> then why can't I flush the top part with a new frame? for example, I want to display a large image while its bottom part is not decoded yet

16:25 <emersion> or just "as a default value"?

16:25 tobiasjakobi has joined #dri-devel

16:25 <emersion> but yeah, i'm nitpicking here :P

16:26 donaldrobson_ has quit [Remote host closed the connection]

16:27 <daniels> emersion: yeah, I think just 'as a default value' is probably easiest?

16:27 <emersion> wfm

16:27 <daniels> or 'default or initial'

16:27 <daniels> wanna just fix up locally or should I resend?

16:27 <emersion> yeah, initial sounds good to me

16:27 <emersion> i can fix up locally

16:27 <daniels> ayaka: you can if you want, it's just that very few people want that

16:28 <daniels> emersion: thanks!

16:28 tobiasjakobi has quit []

16:29 <ayaka> daniels, here is the case: a platform is designed for 4K decoding and display, but for 8K display its performance may not be enough. So you can't wait for a frame to be finished before displaying it

16:30 junaid has joined #dri-devel

16:30 <daniels> sure, in that case don't bother fencing, just display whatever's around at the time, and then the user will see half and half

16:30 <ayaka> you must start scanning it out when, for example, half of it is done

16:30 <daniels> ok, so then use fences to fence when half is done, or a third is done, or whatever

16:31 <daniels> that's something you'd have to put in your driver, but I don't imagine 'signal fence when you've done half the work' would get accepted as uAPI upstream, so it's just downstream hacks in which case you can do whatever you want to

16:33 <ayaka> well, that is what opencl can do. I am thinking about allowing multiple fences to be created in video4linux, because I didn't find any video4linux driver that uses fences

16:34 <ayaka> its stateless decoders can decode a single slice, but they hold on to the buffer until the whole frame is done

16:35 idr has quit [Ping timeout: 480 seconds]

16:36 swalker__ has quit [Remote host closed the connection]

16:38 <ayaka> also I don't get why the kms driver itself consumes IN_FENCE, rather than userspace waiting for the notification and then submitting the commit

16:44 <ayaka> if the fence never signals, that commit would not be applied; if the next commit arrives, would the previous commit be discarded?

16:47 <daniels> no, you can't queue multiple submits

16:47 Zopolis4 has quit [Quit: Connection closed for inactivity]

16:49 rasterman has joined #dri-devel

16:52 <emersion> hm, sorry, no time to finish this up tonight

16:53 <daniels> emersion: I'm pretty sure it can survive another day :P

16:54 <ayaka> daniels, yes, I forgot that I should wait for the event or the out fence

16:54 vliaskov has joined #dri-devel

16:55 <ayaka> but why do we let the kernel wait for the fence rather than userspace?

16:57 <daniels> because it avoids spurious wakeups and unnecessary queues with relatively deep/complex chains of operations

16:58 aravind has quit [Ping timeout: 480 seconds]

17:04 <ayaka> daniels, I think I need some documentation to understand how these fences (in and out) work with egl

17:05 <ayaka> in kmscube, it creates an in fence for each cube frame and an out fence for each scanout

17:05 <daniels> in-fences are waited for before rendering begins; out-fences are signaled when rendering completes

17:06 <daniels> if you search around, there are a few presentations on how explicit synchronisation works

17:07 <ayaka> I can understand what the in fence and out fence are used for from the drm docs

17:08 <ayaka> but I'm just wondering why we can't reuse those fences; we just need the out fence to make the gpu unlock the front buffer

17:08 <ayaka> while the in fence is for the display to wait until the gpu has completed its front buffer

17:09 <daniels> how would you reuse fences? fences refer to one specific point in time

17:09 <daniels> the in-fence kmscube passes to KMS, refers to the completion of exactly one drawing command made by the GPU

17:09 <ayaka> because creating a fence has a cost

17:09 <daniels> it doesn't change in time to be 'the completion of whatever the latest rendering is'

17:10 <ayaka> I am thinking the long time pending kmssink in gstreamer

17:10 <daniels> yes, so in its render callback it would have to accept a dma-fence with each frame

17:11 <ayaka> we don't need an in fence here, but could we re-use the out fence here

17:11 <ayaka> we would just create two out fences here

17:15 <daniels> no, because fences always refer to one specific point in time

17:16 apinheiro has quit [Quit: Leaving]

17:16 <sima> robclark, ah right, the vma tracking locking fun was in the context of rpm I think ...

17:17 <sima> mort_, have a testing verdict on the patch already?

17:18 <ayaka> daniels, then should we drop the only poll drm event way and switch to the out fence way? Creating and destroying something frequently doesn't sound like a good idea

17:19 <robclark> vma lock was completely uninvolved with that since it isn't used for reclaim.. only to give userspace an error if it tried to change the VA of an in-use vma... the rpm/qos hell is still there

17:19 <robclark> since that is about the obj resv / reclaim vs rpm/qos locking

17:20 <ayaka> s/only/old/

17:20 <daniels> ayaka: this is something that happens once every 16ms and is a very lightweight operation. I have no idea why you think it's a bottleneck.

17:24 alyssa has left #dri-devel [#dri-devel]

17:27 <zmike> eric_engestrom: any idea what's going on with https://gitlab.freedesktop.org/zmike/mesa/-/jobs/46637627

17:27 <zmike> I retried a couple times and it seems broken

17:31 <eric_engestrom> zmike: I've seen a bunch of 500 & 503 on the gitlab registry this afternoon, I'm guessing this is why it's failing

17:31 <eric_engestrom> I don't know any more than that though, ask the admins on #freedesktop

17:32 <ayaka> in 60fps mode it is true

17:32 <eric_engestrom> also, it might be related to the windows runners being overwhelmed, possibly someone doing something heavy

17:37 <daniels> looking at that job log, it's trying to pull x86_64-test_base and failing

17:37 <daniels> looking at the x86_64-test_base job log, the job claimed to succeed, but the container push failed https://gitlab.freedesktop.org/zmike/mesa/-/jobs/46634148

17:37 <daniels> so, retry that, then retry the other one

17:37 <zmike> k

17:41 sukrutb has joined #dri-devel

17:41 Kayden has quit [Read error: Connection reset by peer]

17:41 K`den has joined #dri-devel

17:42 K`den is now known as Kayden

17:42 <ayaka> daniels, now I am thinking the software fence is not fast enough, and I wonder whether anyone offers a hardware fence. Besides, ping-pong and page flip may not be enough; at such high fps we had better prepare a queue that the display can scan out in order

17:43 <ayaka> cpu is not that real time for such task

17:44 <daniels> I haven't seen a system with a combination of such a high frame rate and such a low-end CPU that it couldn't service the IRQs quickly enough to do pageflips

17:44 <daniels> but if that's a problem you're seeing, that's something you'll be solving I guess

17:45 <ayaka> the cpu is not that low end, it is quad arm a55 cores

17:47 <ayaka> but cpu has more work to do, also we don't use irq but message box now

17:49 <daniels> A55s can do pageflips

17:50 <ayaka> as I said, cpu is occupied by the audio

17:50 <daniels> well, if you manage to get yourself into a situation where you can't schedule enough time for one ioctl every 16ms (or 8ms or whatever), then that sounds quite bad, but also not something upstream's going to design for

17:50 <daniels> tbh it sounds like a case of trying to optimise problems which don't exist; you're much better off doing actual real-world measurements before guessing

17:51 <ayaka> Currently, it doesn't run drm. But we have to use a queue (msg box) here to achieve that refresh rate

17:52 greenjustin_ has joined #dri-devel

17:53 <ayaka> the decoder can work at exactly that speed in 4k mode, so there is not much time for us to wait. Also we have to count the cost of REE and TEE context switching

17:56 <ayaka> the point is whether the upstream method should do such a thing frequently. Otherwise why would we have invented poll(), which yields the thread

17:58 greenjustin has quit [Ping timeout: 480 seconds]

18:14 _whitelogger has joined #dri-devel

18:25 sukrutb has quit [Remote host closed the connection]

18:33 benjamin1 has joined #dri-devel

18:34 alanc has quit [Remote host closed the connection]

18:34 alanc has joined #dri-devel

18:34 pochu_ has quit [Ping timeout: 480 seconds]

18:39 idr has joined #dri-devel

18:39 benjaminl has quit [Ping timeout: 480 seconds]

18:49 ngcortes has joined #dri-devel

18:52 mvlad has quit [Remote host closed the connection]

18:53 Kayden has quit [Quit: -> JF]

18:58 <mort_> sima: I just installed a kernel with the patch, and so far it looks good! kmalloc-256 has been at 1380K for a while now, nothing else looks suspicious

18:58 <mort_> I'll run a few memory logging tools I have overnight to see if I encounter anything weird but it's definitely not leaking a commit per flip anymore

18:59 tursulin has quit [Ping timeout: 480 seconds]

19:05 <sima> mort_, can I have some email for reported/tested-by credits?

19:18 <mort_> sima: the best one to use is probably dorum@noisolation.com

19:22 oneforall2 has quit [Remote host closed the connection]

19:22 sghuge has quit [Remote host closed the connection]

19:22 idr has quit [Remote host closed the connection]

19:23 sghuge has joined #dri-devel

19:23 idr has joined #dri-devel

19:26 oneforall2 has joined #dri-devel

19:26 Kayden has joined #dri-devel

19:26 <mort_> hardware is a snapdragon 410 if that's relevant

19:38 fee1dead has joined #dri-devel

19:40 ngcortes has quit [Ping timeout: 480 seconds]

19:43 benjaminl has joined #dri-devel

19:45 <fee1dead> Hi all. I'm having some trouble with a Turing card and the new NVK vulkan driver. My small vulkan triangle example is crashing Sway. I know it is experimental and not even merged, so maybe I should just wait? Wanted to know if it is a normal outcome or worth reporting and also asking for help if it is the latter. Thanks in advance!

19:50 benjamin1 has quit [Ping timeout: 480 seconds]

20:01 crabbedhaloablut has quit []

20:03 bgs has quit [Remote host closed the connection]

20:09 <airlied> fee1dead: probably at the just wait stage

20:30 <fee1dead> Will do.

20:37 danylo has quit [Ping timeout: 480 seconds]

20:38 rasterman has quit [Quit: Gettin' stinky!]

20:40 ngcortes has joined #dri-devel

20:43 junaid has quit [Remote host closed the connection]

20:45 <sima> robclark, just sent out the mdp5 leak fix, I'm assuming you'll apply somewhere?

20:46 <robclark> yeah.. it shows up in freedreno patchworks so one of us will grab it

20:56 <airlied> 90/92 sessions passed, conformance test FAILED so close, gl4.6 llvmpipe run, one ballot test in two configs

20:56 alyssa has joined #dri-devel

20:56 <alyssa> Anyone know of test coverage for perspective correct interpolateAtOffset?

20:57 <pendingchaos> the vulkan coverage probably isn't very good?

20:57 <pendingchaos> IIRC RADV's is at least somewhat incorrect

20:58 <alyssa> I'm not seeing anything in either gles or vk cts

20:59 <alyssa> I also don't have a VK driver that's able to run the vk tests regardless, so really hoping for GL CTS or piglit coverage :p

20:59 <alyssa> unfortunately the gles cts tests all pass even if I do perspective-incorrect interpolation

20:59 <alyssa> Oh I know how to test this, I can just forcibly lower all interpolation and then see what breaks

21:14 rgallaispou has joined #dri-devel

21:15 ced117 has quit [Ping timeout: 480 seconds]

21:16 <zmike> piglit?

21:20 ohmadcs^ has quit [Remote host closed the connection]

21:20 ohmadcs^ has joined #dri-devel

21:22 <alyssa> worth a shot but I don't see coverage

21:23 <zmike> I know there's interpolateatoffset coverage

21:24 Duke`` has quit [Ping timeout: 480 seconds]

21:24 <alyssa> yes, but nothing that specifically tests for perspective correction

21:24 <alyssa> i.e. with gl_Position.w set to anything other than 1

21:25 <zmike> shader_runner to the rescue

21:28 ced117 has joined #dri-devel

21:31 sima has quit [Ping timeout: 480 seconds]

21:32 benjamin1 has joined #dri-devel

21:34 vliaskov has quit [Remote host closed the connection]

21:37 heat_ has quit [Remote host closed the connection]

21:38 benjaminl has quit [Ping timeout: 480 seconds]

21:40 Haaninjo has quit [Quit: Ex-Chat]

21:56 rgallaispou has quit [Quit: WeeChat 4.0.2]

22:00 jmondi has quit [Read error: Connection reset by peer]

22:08 pcercuei has quit [Quit: dodo]

22:14 paulk-bis has joined #dri-devel

22:14 paulk has quit [Read error: Connection reset by peer]

22:14 fee1dead has quit [Remote host closed the connection]

22:22 JohnnyonFlame has quit [Ping timeout: 480 seconds]

22:27 kzd has joined #dri-devel

22:27 fxkamd has quit []

22:29 danylo has joined #dri-devel

22:41 <glennk> alyssa, i vaguely remember the ulp error between affine and true perspective being less for the affine than trying to correct using a float division when doing this on evergreen

22:42 <alyssa> hum?

22:51 djbw has quit [Read error: Connection reset by peer]

22:52 <glennk> see tgsi_interp_egcm in r600_shader.c

22:53 <glennk> it uses affine interpolation for interpolateAtOffset

23:24 glennk has quit [Ping timeout: 480 seconds]

23:39 rsripada has quit []

23:41 rauji___ has joined #dri-devel

23:45 JohnnyonFlame has joined #dri-devel