#dri-devel on 2021-10-01 — irc logs at oftc.irclog.whitequark.org

2021-07-26 22:56 ChanServ changed the topic of #dri-devel to: <ajax> nothing involved with X should ever be unable to find a bar

00:07 co1umbarius has joined #dri-devel

00:09 columbarius has quit [Ping timeout: 480 seconds]

00:12 swick_ has joined #dri-devel

00:12 tursulin has quit [Read error: Connection reset by peer]

00:14 swick_ has quit []

00:14 swick has joined #dri-devel

00:28 swick has quit [Quit: WeeChat 1.6]

00:28 swick has joined #dri-devel

00:31 swick has quit []

00:31 swick has joined #dri-devel

00:35 leandrohrb has joined #dri-devel

00:45 nchery has quit [Quit: Leaving]

00:59 shashank1202 has joined #dri-devel

01:00 <imirkin> jenatali: i think this is in your wheelhouse, but not sure if that's the d3d12 backend or what? https://gitlab.freedesktop.org/mesa/mesa/-/issues/5438

01:01 <imirkin> ah no, nevermind. lavapipe.

01:02 <jenatali> Yeah, was going to say, they say WSL 1, which is software

01:04 <imirkin> i just saw "windows" :)

01:04 <jenatali> Heh, fair enough

01:05 columbarius has joined #dri-devel

01:05 <jenatali> If it was WSL 2 and Ubuntu 21.04, then yeah that's most likely the d3d12 backend

01:05 <zmike> Kayden: seems like a pretty reasonable plan to me

01:05 pushqrdx has quit [Remote host closed the connection]

01:06 co1umbarius has quit [Ping timeout: 480 seconds]

01:08 <imirkin> jenatali: is WSL1 vs WSL2 a windows version thing? is there any reason someone would be using WSL1 and not WSL2?

01:09 <HdkR> yes and yes :)

01:09 <jenatali> imirkin: WSL1 is Windows emulating a Linux kernel

01:09 <jenatali> WSL2 is running a Linux kernel in a VM

01:09 <jenatali> If you can't do virtualization for whatever reason (e.g. already virtualized) then WSL1 is still the thing to use

01:09 <imirkin> ahh ok

01:10 <HdkR> WSL1 is cute, WSL2 caused me headaches until Qualcomm fixed the issue with their hypervisor where VMs wouldn't clock the CPU speeds up :<

01:10 <jenatali> But we don't have graphics support for WSL1 at the moment. We prototyped it but never shipped it

01:11 <imirkin> right, ok. will try to remember that for the future

01:15 <jenatali> No worries either way

02:11 <anarsoul> jenatali: you should have named it wELK :)

02:11 <anarsoul> I mean wsl1

02:12 <jenatali> Alas, not my call

02:12 <jenatali> It was named way before I got involved with any of it

02:31 samuelig has quit [Remote host closed the connection]

02:36 thellstrom1 has joined #dri-devel

02:36 thellstrom has quit [Read error: Connection reset by peer]

02:55 Bennett has quit [Remote host closed the connection]

03:08 shashank1202 has quit [Quit: Connection closed for inactivity]

03:55 slattann has joined #dri-devel

04:11 slattann has quit []

04:29 slattann has joined #dri-devel

04:35 Duke`` has joined #dri-devel

04:37 <jekstrand> anarsoul: That's entirely possible. There are multiple pieces of varying packing code these days. :-/

04:39 <anarsoul> jekstrand: this one liner fixes it for me: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13132

04:40 <anarsoul> I don't see any rationale behind allocating vec3 at the very end

04:45 <jekstrand> Yeah, seems silly to me too

05:24 <Kayden> Seems fine to me

05:25 <Kayden> unless that causes vec2 to be double-parked

05:25 <Kayden> double-parking vec2s would be bad

05:25 <anarsoul> how is it different with double-parking vec3s?

05:26 <Kayden> if you had a vec4, a vec3, and 2 vec2's, if you got.... xyzw | xyz x | y xy _

05:26 <Kayden> instead of xyzw | xy xy | xyz _

05:27 <Kayden> but I'm not sure it actually does that

05:27 <Kayden> I think it tries to avoid double-parking if it can

05:27 <anarsoul> hold on

05:27 <Kayden> if it came up with xyzw | xyz _ | xy xy that would be just as good :)

05:27 <anarsoul> wouldn't it be xyzw | xyz- | xy xy?

05:27 <Kayden> I think so?

05:28 itoral has joined #dri-devel

05:29 <Kayden> oh hmm it advances slots if the packing order is different

05:30 <Kayden> so...it doesn't pack <vec2, scalar, scalar> into the same slot?

05:31 <anarsoul> let me try

05:31 <Kayden> it doesn't look like it would, but I thought it did, and it probably ought to

05:32 * Kayden hasn't seriously read that code in 5 years

05:32 kem has quit [Ping timeout: 480 seconds]

05:33 <Kayden> anarsoul: I think your change is good and you're right, there's no particular reason to have vec3 at the end

05:33 <Kayden> it seems like it could do a better job packing some things

05:33 <Kayden> err....the pass could do a better job than it is (regardless of your change)

05:35 <Kayden> hmm....the comment does mention that vec3 is explicitly at the end so that others aren't at risk of being double parked
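The "double-parking" layouts Kayden and anarsoul compare above can be reproduced with a toy packer. This is only a sketch under a simplifying assumption: varyings are placed back to back in vec4 slots in the order given. It is not Mesa's actual packing pass, and `pack` is a hypothetical helper.

```python
def pack(sizes):
    """Place varyings back to back in vec4 slots, in the given order.

    Returns (layout, straddlers). layout holds one
    (size, first_slot, first_component) tuple per varying; straddlers
    holds the indices of varyings that spill into a second slot
    ("double-parked").
    """
    layout, straddlers = [], []
    cursor = 0  # running component index across all slots
    for i, size in enumerate(sizes):
        slot, comp = divmod(cursor, 4)
        layout.append((size, slot, comp))
        if comp + size > 4:  # does not fit in what is left of this slot
            straddlers.append(i)
        cursor += size
    return layout, straddlers


# vec4, vec3, vec2, vec2 gives xyzw | xyz x | y xy _ (first vec2 is split)
_, split_order = pack([4, 3, 2, 2])
# vec4, vec2, vec2, vec3 gives xyzw | xy xy | xyz _ (nothing is split)
_, clean_order = pack([4, 2, 2, 3])
print(split_order, clean_order)
```

With the vec3 placed last, no vec2 is split across slots, which appears to be the concern the code comment Kayden mentions is guarding against.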

05:36 javierm has quit [Quit: leaving]

05:37 <anarsoul> Kayden: I'll look into the code tomorrow night

05:37 javierm has joined #dri-devel

05:37 <anarsoul> off to bed now :)

05:40 mattrope has quit [Read error: Connection reset by peer]

05:41 <Kayden> have a good night!

05:42 kem has joined #dri-devel

05:42 Duke`` has quit [Ping timeout: 480 seconds]

05:44 slattann has quit []

05:47 Company has quit [Read error: Connection reset by peer]

05:53 slattann has joined #dri-devel

05:55 Hi-Angel has joined #dri-devel

05:58 flto has quit [Ping timeout: 480 seconds]

06:00 flto has joined #dri-devel

06:03 flto_ has joined #dri-devel

06:03 flto has quit [Remote host closed the connection]

06:24 pnowack has joined #dri-devel

06:25 tomeu has joined #dri-devel

06:30 tzimmermann has joined #dri-devel

06:43 slattann has quit []

06:44 slattann has joined #dri-devel

06:47 slattann1 has joined #dri-devel

06:49 danvet has joined #dri-devel

06:52 slattann has quit [Ping timeout: 480 seconds]

06:57 slattann has joined #dri-devel

07:03 slattann1 has quit [Ping timeout: 480 seconds]

07:03 <demarchi> I'm seeing a ton of warnings on amdgpu in drm-tip... is it just me?

07:05 slattann1 has joined #dri-devel

07:09 slattann has quit [Ping timeout: 480 seconds]

07:15 samuelig has joined #dri-devel

07:22 rasterman has joined #dri-devel

07:30 slattann1 has quit [Ping timeout: 480 seconds]

07:31 <MrCooper> Kayden: "dmabuf export on buffers that aren't marked SHARED" isn't expected to work, that's what SHARED is about

07:32 <MrCooper> emersion: mutter still disables modifiers by default on amdgpu as well, probably until dma-buf hints

07:33 <Kayden> MrCooper: I wish that were true. It seems that SHARED is flagged for buffers which we know or expect will be shared, but the API lets you spontaneously export whatever you want.

07:33 slattann has joined #dri-devel

07:33 <Kayden> MrCooper: In particular, Piglit's bin/ext_image_dma_buf_import-export-tex hits this case

07:33 <MrCooper> sounds like bugs that should be fixed

07:34 <Kayden> and radeonsi has code to handle it: https://gitlab.freedesktop.org/mesa/mesa/-/blob/master/src/gallium/drivers/radeonsi/si_texture.c#L674

07:37 <Kayden> I'm not really sure how you think that should be fixed

07:38 <Kayden> the sequence of events is: (1) glTexStorage2D(), which creates the texture and allocates storage; (2) eglCreateImageKHR (I suppose this could mark it shared and convert it there?); (3) eglExportDMABUFImageMESA

07:39 <Kayden> we can't know it's shared at step 1, because we're just allocating an ordinary texture

07:39 <Kayden> step 2, it's already allocated wrongly for sharing

07:39 <Kayden> step 3, it needs to be different

07:39 <MrCooper> right, similar issue with other bind flags

07:39 <MrCooper> some kind of transition at 2 maybe indeed

07:39 <Kayden> hmm. yeah, that would definitely be nicer

07:40 <Kayden> when something is made into an EGLImage it's almost certainly going to be shared

07:41 slattann has quit []

07:41 aissen_ has quit []

07:46 tursulin has joined #dri-devel

07:47 <Kayden> It looks like that eglCreateImageKHR ends up in dri2_create_from_texture(), which checks if EGL_MESA_image_dma_buf_export is possible, and calls pipe_context::flush_resource()

07:48 <Kayden> so flush_resource() could flag PIPE_BIND_SHARED if it isn't already and transition out of non-exportable forms

07:48 <Kayden> that seems like a much nicer solution
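The idea Kayden floats here, doing the transition when the texture is flushed for EGLImage creation rather than at export time, can be modelled roughly as follows. Everything in this sketch is hypothetical: the `Texture` class, its `layout` strings, and the flag value are stand-ins for illustration and do not reflect any driver's real data structures.

```python
PIPE_BIND_SHARED = 1 << 0  # stand-in value for this sketch, not Mesa's definition


class Texture:
    """Toy model of a texture allocated without knowing it will be shared."""

    def __init__(self):
        self.bind = 0                    # allocated as an ordinary texture
        self.layout = "driver-private"   # e.g. some non-exportable form
        self.conversions = 0

    def flush_resource(self):
        """Called on the EGLImage-creation path (step 2 in the sequence above)."""
        if not self.bind & PIPE_BIND_SHARED:
            # One-time transition out of the non-exportable form.
            self.layout = "exportable"
            self.bind |= PIPE_BIND_SHARED
            self.conversions += 1

    def export_dmabuf(self):
        """Step 3: by now the texture is expected to be shareable already."""
        if not self.bind & PIPE_BIND_SHARED:
            raise RuntimeError("export of a resource that was never marked shared")
        return self.layout


tex = Texture()
tex.flush_resource()
tex.flush_resource()  # a second flush does not convert again
print(tex.export_dmabuf(), tex.conversions)
```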

07:48 <Kayden> I wonder why radeonsi doesn't do that..

07:49 <MrCooper> not sure offhand

07:51 gawin has joined #dri-devel

07:52 idr has quit [Ping timeout: 480 seconds]

07:53 rasterman has quit [Quit: Gettin' stinky!]

07:53 <emersion> MrCooper: yea josh told me

07:53 <emersion> i don't understand why

07:54 <emersion> it's not like disabling modifiers will make direct scanout work

07:54 <emersion> if anything, it'd be the contrary: only advertise scanout-capable modifiers as supported by the compositor

07:55 <emersion> (what we do in gamescope)

07:55 <emersion> cc jadahl

07:56 <Kayden> if I recall correctly, on...Intel DG1?...if compositors don't do modifiers, then Mesa starts throwing them linear buffers

07:56 <emersion> that's a good thing :)

07:57 <emersion> but it would also be nice if the display hw wouldn't just freak out when y-tiling is used :P

07:57 <Kayden> but even if there are problems with some of the crazy ones, it would be nice to at least have modifier support but restricted to say, I915_FORMAT_MOD_X_TILED at least

07:57 <Kayden> yeah, absolutely :(

07:58 <emersion> y-tiling is the only reason why we have to tell users to disable modifiers

07:59 pcercuei has joined #dri-devel

07:59 lynxeye has joined #dri-devel

07:59 <Kayden> presumably that includes Y_TILED_CCS, Y_TILED_GEN12_RC_CCS, Y_TILED_GEN12_RC_CCS_CC too.

07:59 <jadahl> emersion: disabling modifiers makes it work with xwayland :P (unredirect)

08:00 <emersion> jadahl: shouldn't be the case

08:00 <jadahl> really need to get to implementing that brute force intel thing to enable it for intel

08:00 <emersion> xwayland used to allocate with SCANOUT, but doesn't anymore

08:00 <jadahl> maybe it doesn't anymore then

08:01 <emersion> and also that's a bad excuse for disabling modifiers :)

08:01 <jadahl> heh, I don't disagree :P

08:01 <jadahl> the best excuse is breaking multi head

08:01 <emersion> yeah

08:01 <jadahl> such head ache :|

08:01 <emersion> that one i can get behind

08:01 <emersion> yes ;_;

08:02 <Kayden> that does sound painful :/

08:02 <emersion> downgrading from y-tiled to x-tiled isn't too hard when a single CRTC is involved

08:03 <emersion> but on hotplug if you need to downgrade *other* CRTCs then it's just a huge mess

08:04 <emersion> i wish we had a DRM_CAP_INTEL_PLEASE_NO_BLACK_SCREENS

08:04 <jadahl> my plan has been and still is to allocate all the things on hotplug, TEST_ONLY to see, goto 1 with another modifier for CRTC 1, goto 1 with CRTC 2, or etc

08:05 <emersion> yeah, that's the only way to do it…

08:05 <jadahl> the nice thing is that monitors take forever to turn on/off anyway, so if it takes some milliseconds so be it
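The brute-force approach jadahl describes (try a combination, TEST_ONLY it, fall back to another modifier on one CRTC, repeat) amounts to a search over per-CRTC modifier choices. A minimal sketch, assuming a hypothetical `test_only_commit` callback that stands in for an atomic TEST_ONLY commit; the "at most one Y-tiled CRTC" rule in the example is invented purely to exercise the search.

```python
from itertools import product


def pick_modifiers(candidates, test_only_commit):
    """Return the first modifier assignment the (hypothetical) test accepts.

    candidates: {crtc: [modifier, ...]}, each list in order of preference.
    test_only_commit: callable taking {crtc: modifier} and returning True
    if that configuration would be accepted.
    """
    crtcs = list(candidates)
    # product() starts with every CRTC on its preferred modifier and
    # downgrades the last CRTC first.
    for combo in product(*(candidates[c] for c in crtcs)):
        assignment = dict(zip(crtcs, combo))
        if test_only_commit(assignment):
            return assignment
    return None


def fake_test(assignment):
    # Invented constraint for the example: only one CRTC may be Y-tiled.
    return sum(m == "Y_TILED" for m in assignment.values()) <= 1


prefs = ["Y_TILED", "X_TILED", "LINEAR"]
result = pick_modifiers({"crtc-1": prefs, "crtc-2": prefs}, fake_test)
print(result)
```

The search is exponential in the number of CRTCs, which is tolerable here only because there are few CRTCs and hotplug is already slow, as jadahl notes.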

08:06 slattann has joined #dri-devel

08:10 <MrCooper> emersion: enabling modifiers breaks direct scanout for DRI3 clients (which allocate buffers themselves, not Xwayland)

08:11 <emersion> oh, DRI3

08:11 <emersion> well why don't you just only advertise scanout modifiers if you want it so badly?

08:11 rbrune has joined #dri-devel

08:12 <MrCooper> because that would anger daniels ;) I mean would hurt some embedded platforms

08:13 yoslin_ has quit []

08:13 vivijim has joined #dri-devel

08:14 yoslin has joined #dri-devel

08:15 <MrCooper> dma-buf hints to the rescue

08:19 <lynxeye> MrCooper: Not using direct scanout will also hurt some embedded platforms. ;) But yea, dma-buf hints is the only way to solve things generically, without a truckload of assumptions and heuristics.

08:20 slattann has quit [Ping timeout: 480 seconds]

08:20 slattann has joined #dri-devel

08:20 frieder has joined #dri-devel

08:21 <pq> anholt_, FWIW, Weston should be importing YUV to EGL, provided you can find a Wayland client producing the kind of dmabufs you want to test with. But Weston will also fall back to hand-rolled import+conversion if direct EGL import fails.

08:27 rasterman has joined #dri-devel

08:27 Ahuj has joined #dri-devel

08:28 thellstrom has joined #dri-devel

08:28 dliviu has quit [Ping timeout: 480 seconds]

08:31 <pq> Kayden, I thought allocating with GL and dmabuf-exporting that was totally a "doctor, it hurts" scenario?

08:32 <Kayden> it certainly doesn't seem common, we have exactly 1 test case in all of piglit and the CTS that hits this

08:34 hansg has joined #dri-devel

08:34 thellstrom1 has quit [Ping timeout: 480 seconds]

08:35 dliviu has joined #dri-devel

08:36 jessica_24 has quit [Quit: Connection closed for inactivity]

08:39 JohnnyonF has joined #dri-devel

08:45 oneforall2 has quit [Quit: Leaving]

08:47 JohnnyonFlame has quit [Ping timeout: 480 seconds]

08:53 <tzimmermann> who needs drm_fbdev_overalloc?

09:06 <danvet> tzimmermann, page flipping on fbdev

09:07 <tzimmermann> danvet: indeed, but it's optional AFAIK

09:07 <MrCooper> also non-primary display with larger vertical resolution than primary one

09:07 <danvet> yeah we don't want to waste memory

09:07 <danvet> at least not by default

09:07 <danvet> MrCooper, only if you plug it in later on

09:08 <MrCooper> which happens by default with my external monitor connected via USB-C (DisplayPort alt mode)

09:08 <danvet> plus in theory we could fix that by dynamically allocating

09:09 <danvet> for drivers that use the generic fbdev stuff it should be pretty easy, since there we also intercept mmap

09:09 <danvet> hm fbcon, right

09:09 <tzimmermann> danvet, some context: i try to fix overalloc with devices that don't support large resolutions: https://lore.kernel.org/dri-devel/5186020a-192f-4e04-adc2-25a34305fea6@www.fastmail.com/#t

09:10 <tzimmermann> people trigger this bug with simpledrm. the allocated BO cannot be larger than the screen size. so with overalloc, the height check in drm_internal_framebuffer_create() fails

09:11 <tzimmermann> long story short: no console

09:14 aissen has joined #dri-devel

09:18 slattann has quit [Ping timeout: 480 seconds]

09:21 frieder has quit [Remote host closed the connection]

09:23 oneforall2 has joined #dri-devel

09:47 slattann has joined #dri-devel

09:51 <tzimmermann> danvet, what is the actual meaning of mode_config.max_height? https://elixir.bootlin.com/linux/latest/source/include/drm/drm_mode_config.h#L347

09:51 <tzimmermann> all drivers treat it like the maximum programmable number of scanlines

09:52 <danvet> tzimmermann, why do you have overalloc set if you're not using it?

09:52 <danvet> tzimmermann, mripard: mlankhorst doesn't seem around, but drm-misc-fixes is in jeopardy

09:52 <tzimmermann> but what it means to the core is 'the number of addressable scanlines'

09:52 <tzimmermann> which is limited by vram

09:52 <danvet> https://lore.kernel.org/dri-devel/YVbZ%2FP3FA4KuL2%2Fw@phenom.ffwll.local/T/#m2df491acd1d49013e17f5b6d6693ace12be99d69

09:52 <danvet> ^^ would be great if one of you two can fix this

09:53 <danvet> I gtg now for a bit, I'll try and do the drm-fixes pull this evening

09:53 <tzimmermann> danvet, some people configure overalloc and see the console fail

09:53 rgallaispou has joined #dri-devel

09:53 <danvet> tzimmermann, yeah, don't do that?

09:54 <danvet> or maybe we can hack up simpledrm to not overalloc, dunno

09:54 <danvet> also I thought the overalloc code should fall back to not-overallocated maybe?

09:54 <tzimmermann> but it should work. simpledrm uses shmem. there's no good reason why overalloc would break

09:55 <tzimmermann> i added the not-overalloc workaround while testing, but i think the problem is in the semantics of max_height

09:55 <tzimmermann> which brings me to my question

09:55 <danvet> maybe, but I really gtg now

09:55 <tzimmermann> what are the semantics of mode_config.max_height

09:56 <tzimmermann> no problem

09:56 tzimmermann has quit [Remote host closed the connection]

09:56 tzimmermann has joined #dri-devel

09:56 tzimmermann has quit [Remote host closed the connection]

09:57 tzimmermann has joined #dri-devel

09:59 slattann1 has joined #dri-devel

10:02 slattann has quit [Ping timeout: 480 seconds]

10:03 <MrCooper> tzimmermann: sounds like maybe the overalloc should be clamped to the maximum possible instead of failing (as well as possibly fixing the maximum)?

10:08 thellstrom has quit [Quit: thellstrom]

10:15 mlankhorst has joined #dri-devel

10:27 hansg has quit [Remote host closed the connection]

10:28 <tzimmermann> MrCooper, i thought the same. but the maximum is not clearly defined. the core/helpers expect virtual resolutions, while drivers seem to be setting physical resolutions. i guess we should clarify the docs

10:28 <tzimmermann> well, at least i now have an idea of what to do about this

10:30 flacks has quit [Quit: Quitter]

10:32 flacks has joined #dri-devel

10:38 <vsyrjala> mode_config.max_{width,height} is the max fb size like the docs say. if drivers want to limit the max display mode dimensions they can do it in .mode_valid/etc.

10:49 dviola has joined #dri-devel

10:57 kts has joined #dri-devel

11:06 <kts> Will crocus fix https://gitlab.freedesktop.org/drm/intel/-/issues/2024? Haswell Pentium G3250.

11:07 <daniels> Kayden: the answer to all your eglExportDMABUFImageMESA problems is just to not use it tbqh

11:11 adjtm has quit [Ping timeout: 480 seconds]

11:17 kts_ has joined #dri-devel

11:21 kts has quit [Ping timeout: 480 seconds]

11:30 elongbug has joined #dri-devel

11:32 Company has joined #dri-devel

11:33 thellstrom has joined #dri-devel

11:36 kts_ has quit []

11:37 kts has joined #dri-devel

11:37 slattann1 has quit []

11:41 adjtm has joined #dri-devel

11:48 Viciouss7 has quit []

11:49 Viciouss has joined #dri-devel

11:59 muhomor has quit [Ping timeout: 480 seconds]

12:02 kts has quit [Quit: Konversation terminated!]

12:13 itoral has quit []

12:24 Hi-Angel has quit [Ping timeout: 480 seconds]

12:27 thellstrom has quit [Quit: thellstrom]

12:28 Hi-Angel has joined #dri-devel

12:29 thellstrom has joined #dri-devel

12:35 rbrune has quit [Ping timeout: 480 seconds]

12:37 Hi-Angel has quit [Ping timeout: 480 seconds]

12:43 Hi-Angel has joined #dri-devel

13:00 hansg has joined #dri-devel

13:02 Peste_Bubonica has joined #dri-devel

13:06 muhomor has joined #dri-devel

13:10 <tzimmermann> vsyrjala, so it's the maximum size of the virtual screen

13:10 muhomor has quit [Remote host closed the connection]

13:11 <tzimmermann> most drivers seem to treat it like the physical size

13:11 <vsyrjala> should fix them then i guess :) this was discussed a while back iirc, and we updated the docs a bit at the time

13:12 <vsyrjala> i think someone suggested adding another set of things for the other limits, but imo that's a bit pointless as you can do that in the driver .mode_valid hook

13:13 <vsyrjala> also often the timings have much more complicated limits than just two simple max values

13:13 <tzimmermann> vsyrjala, that makes some sense

13:14 <vsyrjala> eg. intel_mode_valid() checks for a lot of other limits

13:26 muhomor has joined #dri-devel

13:36 Peste_Bubonica has quit [Remote host closed the connection]

13:36 Peste_Bubonica has joined #dri-devel

13:45 <zmike> dcbaker: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13141

13:47 thellstrom1 has joined #dri-devel

13:48 flto_ has quit []

13:49 flto has joined #dri-devel

13:53 thellstrom has quit [Ping timeout: 480 seconds]

13:53 Hi-Angel has quit [Quit: Konversation terminated!]

13:53 Hi-Angel has joined #dri-devel

13:54 Peste_Bubonica has quit [Quit: Leaving]

14:03 JohnnyonF has quit [Ping timeout: 480 seconds]

14:15 sarnex has quit [Read error: Connection reset by peer]

14:17 sarnex has joined #dri-devel

14:30 <pinchartl> has anyone given a thought about how the move to Bazel in Android could affect (in a positive way) building Mesa for AOSP ?

14:31 <pinchartl> (my hopes are close to none based on previous experience with AOSP, I'd like to have a good surprise for once)

14:31 <bnieuwenhuizen> wait, it is Bazel now?

14:33 <ddevault> if you value your sanity you will steer well clear of bazel

14:33 <pinchartl> they're switching from Soong with blueprint to Bazel with starlark as far as I can tell

14:33 <ddevault> the day mesa is built with bazel is the day mesa makes an enemy out of every linux distro

14:33 <pinchartl> ddevault: is it worse than Soong ?

14:33 <ddevault> I am not familiar with soong

14:33 <ddevault> but I am familiar with bazel

14:34 <pinchartl> the concepts are similar as far as I can tell

14:34 <ddevault> frightening.

14:34 <vsyrjala> the faq doesn't address meson vs bazel. i guess not enough people asked that question

14:35 <ddevault> the effort required to maintain a functional bazel system is on par with the effort expended for the rest of the project it builds

14:35 <pinchartl> it should be meson + bazel

14:35 <pinchartl> as in being able to integrate a meson-based project into the overall build system of AOSP

14:36 <vsyrjala> well they address make/ninja vs. bazel. not make/ninja+bazel either

14:37 <bnieuwenhuizen> I found bazel quite nice to use actually as long as you don't have too many config needs. That said we're not in charge of making build decisions for Android here

14:39 <pinchartl> if we were, the end result would be much saner ;-)

14:39 <pinchartl> the conclusion, when looking at integrating with their current build system, was along the lines of "no way". I was wondering if the new one could be better

14:40 <danvet> tzimmermann, MrCooper yeah I agree with clamping

14:40 <danvet> mode_config.max_h/w are enforced hints for kms users, as in your drm_fb shouldn't be bigger

14:41 <danvet> but also it's ofc a bit silly, since you can always hide a much bigger buffer and then trim it with stride and offset

14:41 <danvet> but since fbdev emulation is the kms users here, clamping the overallocation to the mode_config limits sounds like the right thing to do
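The clamping that MrCooper and danvet agree on can be written down in a few lines. This is a sketch, not the kernel's code: it assumes overallocation is expressed as a percentage of the visible height (100 meaning no extra scanlines), and `fbdev_surface_height` is a hypothetical helper name.

```python
def fbdev_surface_height(display_height, overalloc_percent, max_height):
    """Scale the fbdev surface height, but never past the framebuffer limit.

    display_height: visible scanlines.
    overalloc_percent: assumed to be a percentage, 100 = no overallocation.
    max_height: stand-in for mode_config.max_height.
    """
    wanted = display_height * overalloc_percent // 100
    return min(wanted, max_height)


# A device whose limit equals the screen size just gets no overallocation
# instead of failing framebuffer creation.
print(fbdev_surface_height(1080, 200, 1080))
# A device with headroom gets the full doubled surface.
print(fbdev_surface_height(1080, 200, 4096))
```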

14:42 <danvet> tzimmermann, mripard, mlankhorst I guess I don't get a respinned drm-misc-fixes?

14:43 <danvet> some of the patches in there are almost a month old by now, that's not good for committed -fixes

14:45 rgallaispou has quit [Read error: Connection reset by peer]

14:45 mattrope has joined #dri-devel

14:47 adjtm has quit [Remote host closed the connection]

14:48 adjtm has joined #dri-devel

14:49 gawin has quit [Ping timeout: 480 seconds]

14:52 Ahuj has quit [Ping timeout: 480 seconds]

15:03 Duke`` has joined #dri-devel

15:40 <robclark> pinchartl: jstultz might know something about android/AOSP build plans?

15:44 <pinchartl> when we discussed the topic last week during LPC, I think he shared my despair :-)

15:44 mlankhorst has quit [Ping timeout: 480 seconds]

15:44 gawin has joined #dri-devel

15:56 jessica_24 has joined #dri-devel

16:02 nchery has joined #dri-devel

16:12 nchery has quit [Remote host closed the connection]

16:13 kmn has quit [Quit: Leaving.]

16:14 pushqrdx has joined #dri-devel

16:16 <pushqrdx> something weird suddenly happened and i don't have enough graphics programming knowledge to explain it, for some reason even though i don't have any compositor running and i am on modesetting/mesa driver which usually tears like crazy in firefox

16:16 <pushqrdx> suddenly firefox stopped tearing lol

16:17 nchery has joined #dri-devel

16:17 <pushqrdx> been like that for several hours already no tearing at all

16:17 <pushqrdx> usually tearing might stop for a few moments because by luck scrolling is aligned with vblank but not for hours

16:18 <vsyrjala> is firefox fullscreen?

16:19 <pushqrdx> no

16:22 gawin has quit [Ping timeout: 480 seconds]

16:24 tzimmermann has quit [Quit: Leaving]

16:34 kts has joined #dri-devel

16:34 kts has quit []

16:34 kts has joined #dri-devel

16:43 anujp has quit [Ping timeout: 480 seconds]

16:47 idr has joined #dri-devel

16:47 tobiasjakobi has joined #dri-devel

16:48 tobiasjakobi has quit [Remote host closed the connection]

16:52 <FLHerne> pinchartl: At least people have *heard* of Bazel

16:52 <FLHerne> that's got to help somehow

16:53 <pinchartl> depends, if hearing of Bazel makes people flee, it may not help :-)

16:54 <dcbaker> pinchartl: I know I joke about Google inventing a new build system every week, but...

16:54 <dcbaker> Bazel is another one of those "Designed to solve only Google's problems" build systems

16:54 <dcbaker> options are hard

16:54 <dcbaker> but if you happen to have a cluster of 1000000000 cpus, it can make use of them :)

16:56 <pinchartl> :-)

16:57 <pinchartl> I expect both Soong and Bazel to have similar issues, as they're designed to address similar use cases

16:57 macromorgan has quit [Read error: Connection reset by peer]

16:57 <pinchartl> but I was wondering if, by any chance, Bazel could also happen to help our problems. not by design of course, just by chance :-)

16:57 slattann has joined #dri-devel

16:57 macromorgan has joined #dri-devel

16:57 <dcbaker> yeah, but Bazel is written in Java :)

16:58 <dcbaker> I think it leaves us exactly where we were before, we either need to teach Bazel to understand meson, or meson to output bazel

16:58 gawin has joined #dri-devel

16:58 <robclark> danvet: curious if you've run into this issue w/ scheduler.. IIUC amdgpu is doing something similar, and most of the other drivers have a single sched-entity per drm_file (other than I think lima.. which might be broken with chromium).. https://gitlab.freedesktop.org/robclark/msm/-/commit/b2f2d283d60b9dfbabeb1b2df2cef662776313d3

16:59 <dcbaker> pinchartl: and I feel even less motivated this go around than I did last time :/

17:00 <danvet> robclark, huh that sounds very busted

17:00 <robclark> danvet: thanks to finch experiment (basically a type of A/B experiment) I was testing with SkiaRenderer (which doesn't have this issue) instead of legacy GLRenderer (which does).. otherwise I would have noticed the problem sooner

17:00 <danvet> robclark, I'm pretty sure iris uses one hw context per gl context, and we do let them free-float

17:01 <pinchartl> dcbaker: I can't blame you. I thought bazel had support for python extensions though. maybe I'm mistaken

17:01 <danvet> robclark, imo this is a gl spec question

17:01 <danvet> or maybe a mesa-should-quirk-stuff question

17:01 <robclark> then it is probably broken w/ GLRenderer if implicit sync is disabled ;-)

17:01 <robclark> and I agree, it is a bad assumption by GLRenderer, but I don't think that is going to get fixed at this point

17:01 <danvet> Kayden, ^^

17:02 <dcbaker> pinchartl: It might. But Soong had support for Go extensions, and when I asked about that they said "Oh, only Google gets to use those", essentially

17:02 <danvet> robclark, yeah still, hurting vk and everyone because of a single gl app seems wrong

17:02 kokoz has quit []

17:02 gitautas has joined #dri-devel

17:03 <danvet> robclark, or is the hilarious problem that drm/sched conversion now breaks userspace?

17:03 <robclark> I'm not really completely convinced that having a single "ring" per priority per process level hurts vk or anyone..

17:03 <robclark> right

17:05 <gitautas> Hi! I'm trying to develop an ultra-low-latency program that captures the framebuffer and encodes it in the hardware level, without copying the frames over to system memory. My questions are: is there some way to capture the framebuffer on AMD cards? I know that there is an API called (RapidFire) but that seems to be windows only. Another one is whether there's some kind of abstraction for NvFBC and NvENC? I have an MVP program that works but debugging

17:05 <gitautas> maintaining it is a complete pain.

17:06 gouchi has joined #dri-devel

17:06 <ajax> robclark: by "some apps" you mean opengl explicitly says using MakeCurrent like that is sufficient for synchronization

17:07 <ajax> (unless you turn it off, and by all means please do, but MakeCurrent implies glFlush)

17:08 <pinchartl> dcbaker: right, the same probably applies to bazel extensions. android is such a joke...

17:09 <pinchartl> maybe they should attend some "build your community the right way" workshops, there's lots of them these days

17:09 <dcbaker> I see the same problem around Intel, people who don't understand F/OSS and are only solving their own problems

17:09 <pinchartl> or maybe it's just me who has missed the "disregard your community, you're way better than them" workshops

17:10 <dcbaker> It's a very corporate mentality

17:10 <pinchartl> I think it's particularly present in the Android team

17:11 <pinchartl> there's a lack of humility

17:11 ybogdano has joined #dri-devel

17:11 alyssa has joined #dri-devel

17:11 <dcbaker> :/ yeah

17:11 <alyssa> cwabbott: Super happy to see the preamble stuff reusable across drivers

17:12 <alyssa> jekstrand: How reusable is NIR constant folding?

17:12 <jekstrand> alyssa: What do you mean?

17:12 <alyssa> I know anholt_ has talked about doing a whole NIR interpreter, which would be Work™, but what about just the subset needed for uniform-constant ALU?

17:13 <dcbaker> pinchartl: on the awesome side though, scipy is moving to meson :)

17:13 <robclark> ajax: well, not 100% sure what GL says, but if we have multiple sched-entities per priority level per process, then doing kernel ioctl in the correct order is not sufficient to enforce execution on the GPU in that same order (unless you rely on implicit sync)

17:13 <alyssa> Once nir_opt_preamble lands, can we have a code path using nir_opt_preamble for only ALU and load_uniform, and then evaluate the preamble in CPU?

17:13 <alyssa> and maybe do that in a driver-generic way?

17:13 <robclark> pinchartl: curious if android is planning to build kernel with bazel? :-P

17:14 <alyssa> I guess that can't really be driver-generic due to load_preamble/store_preamble being needed.

17:14 <alyssa> For AGX, I don't care, Apple is happy to spawn a preamble shader to save a single ALU op

17:14 <jekstrand> alyssa: Should be doable.

17:14 <alyssa> For Mali, there's some fixed overhead to doing a preamble so for simple cases it's probably faster to just do it on the CPU

17:14 <jekstrand> alyssa: The core of constant folding is nir_constant_expressions.c/h which has a helper that takes an op, a set of nir_const_value arrays, bit size, and number of destination components.

17:15 * alyssa nods

17:15 <jekstrand> nir_opt_constant_folding just calls that
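The shape of what alyssa is asking about, evaluating a small ALU-only preamble on the CPU from uniform values, can be sketched with a per-op helper in the spirit of the one jekstrand describes (an op, constant source arrays, a component count). This is a toy model, not NIR's API: the instruction tuples, `eval_const_expr`, and `eval_preamble` are hypothetical, bit sizes are ignored, and only three ops are covered.

```python
import operator

# Toy subset of ALU ops; real constant folding handles far more cases
# (bit sizes, integer wrapping, float corner cases).
OPS = {"fadd": operator.add, "fmul": operator.mul, "fneg": operator.neg}


def eval_const_expr(op, num_components, srcs):
    """Apply `op` component-wise to constant source vectors."""
    fn = OPS[op]
    return [fn(*(src[i] for src in srcs)) for i in range(num_components)]


def eval_preamble(instrs, uniforms):
    """Evaluate a straight-line preamble on the CPU.

    instrs: list of (dest, op, [source names]); a source names either a
    uniform or an earlier dest.
    uniforms: {name: [component values]}.
    """
    values = dict(uniforms)
    for dest, op, src_names in instrs:
        srcs = [values[name] for name in src_names]
        values[dest] = eval_const_expr(op, len(srcs[0]), srcs)
    return values


uniforms = {"u_scale": [2.0, 2.0], "u_bias": [0.5, 1.0]}
preamble = [
    ("t0", "fmul", ["u_scale", "u_bias"]),
    ("t1", "fneg", ["t0"]),
]
out = eval_preamble(preamble, uniforms)
print(out["t0"], out["t1"])
```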

17:15 <alyssa> (For Mali, optimal is probably "small amounts of ALU, do it on the CPU. large amounts of ALU, or a texture load or something goofy, punt to hw")

17:15 <jekstrand> Or, for that matter, you could replace all the load_uniform with load_const and fold the shader.

17:15 <alyssa> too expensive

17:16 <pinchartl> robclark: I'd love to see them trying :-)

17:16 <jekstrand> alyssa: Sure

17:16 <alyssa> (I mean. Is it? Hm. Maybe not. Doing NIR ops at draw time seems like a bad idea but..)

17:16 <pinchartl> dcbaker: we're trying to move v4l-utils from autoconf to meson. there's lots of resistance from the V4L2 maintainer, but on the technical side, it's so much better

17:16 iive has joined #dri-devel

17:17 <jekstrand> alyssa: When anholt_ was talking about an interpreter, the idea was to "compile" the NIR down to something a bit faster.

17:17 <jekstrand> I don't remember exactly what we decided was reasonable

17:17 <dcbaker> pinchartl: that will be awesome when it's done :)

17:18 <alyssa> jekstrand: Yeah, for sure...

17:18 <jekstrand> But, even if nir_constant_expressions.c doesn't do what you want, you've got the expressions in nir_opcodes.py. You can code-gen something else.

17:18 * alyssa nods

17:18 <jekstrand> Though I don't recommend that because, seriously, it's complicated.

17:18 <jekstrand> To get all the corners right, in any case.

17:18 <alyssa> It's a shame there's not an "easy" way to JIT nir to something reasonable.

17:19 <alyssa> I guess abusing llvmpipe's compute support is an option but...

17:19 <alyssa> that seems, err, heavy-handed.

17:19 <alyssa> (and doesn't help anholt_ )

17:19 <jekstrand> I don't know what the current best practices are for non-JIT interpreters

17:20 <alyssa> (...actually now I'm wondering what "execute preamble shaders with llvmpipe" would look like. I am, horrifyingly, not disgusted by the idea.)

17:20 <alyssa> I mean we have a whole software rasterizer infrastructure right there ....

17:20 <jekstrand> heh

17:20 <jekstrand> But then you'd have to load up LLVM. :P

17:21 <jekstrand> Your chromebook might not have enough disk for that

17:21 <alyssa> Ruuuude :-p

17:21 <pinchartl> :-)

17:25 <zmike> dcbaker: I thought cpp_args was a thing? is it not a thing?

17:25 <jekstrand> Hrm.... I bet we could amortize the cost of "pure" C evaluation if we did it wide....

17:25 <robclark> alyssa: I guess more of the CPU overhead would be just having to do CPU readback of constbuf.. the things you would actually be evaluating at draw time should generally be pretty straightforward expressions

17:26 <dcbaker> Meson uses "cpp" for "C++"

17:26 <jekstrand> *sigh*

17:26 <alyssa> robclark: that's only high overhead for UBOs or tc

17:26 <dcbaker> zmike: don't ask me why

17:26 heat has joined #dri-devel

17:26 <zmike> dcbaker: right, right, I had forgotten this detail

17:27 <jekstrand> dcbaker: Why? (You didn't say *I* couldn't ask. :P)

17:27 <dcbaker> historical happenstance

17:27 <robclark> alyssa: but you probably want TC ;-)

17:27 <dcbaker> lol

17:27 <pinchartl> dcbaker: while you're here, why does meson not support local variables in meson.build files ?

17:27 <alyssa> robclark: eventually

17:27 <zmike> dcbaker: while you're here, I don't suppose you've had a chance to fix that asan thing?

17:27 <zmike> hahahah

17:28 <zmike> quick everyone ask your meson questions!

17:28 <pinchartl> zmike: :-)

17:28 <dcbaker> lol

17:28 <jekstrand> dcbaker: While you're here.... Nah, I've got nothin'. :)

17:28 <pinchartl> I have multiple variables with the same name in different files, it works fine, but I'm concerned that one day I'll use one of those in a file before it's (re)defined and will introduce a bug

17:28 <jekstrand> pinchartl: Yeah, I have that fear sometimes too

17:29 <dcbaker> pinchartl: I think mostly because no one's ever written the code

17:29 <dcbaker> there's been plenty of requests for it

17:29 <pinchartl> it's not a *big* deal, just a nice to have

17:29 <jekstrand> Even if it were something as simple as _foo is local

17:29 <jekstrand> or a keyword "local foo = blarg"

17:29 <dcbaker> but meson's architecture kinda assumes all variables have global scope, so it would be pretty non-trivial

17:29 <dcbaker> zmike: not yet

17:30 <dcbaker> it looks reasonable, I just need to figure out how long the argument has existed so we don't inject it in invalid cases

17:30 <zmike> makes sense

17:31 <dcbaker> pinchartl: I could solve it more easily with Meson++ because that uses a flat IR so file boundaries are easy to see and local variables could be folded away really quickly :)

17:32 <HdkR> Interpreters you say? I know about interpreters. Why need a NIR interpreter?

17:33 <alyssa> HdkR: silly shaders doing arithmetic on uniform expressions,

17:33 nsneck has joined #dri-devel

17:33 <alyssa> can hoist it to a dedicated preamble shader / preshader / pilot shader,

17:33 <alyssa> but might be faster to just evaluate on the CPU in some cases

17:33 <jekstrand> HdkR: Also, so we can convert softpipe to NIR and finally delete TGSI.

17:33 <jekstrand> nirpipe. :D

17:33 <alyssa> (Things like `varying * sqrt(uniform)`)

17:33 <jekstrand> But, also, CPU folding of uniforms.
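A rough illustration of the "CPU folding of uniforms" idea discussed here, using alyssa's `varying * sqrt(uniform)` example. This is only a sketch: the tuple-based mini-IR, the `OPS` table, and the function names are hypothetical and are not NIR or any Mesa API. It finds every maximal subexpression that depends only on uniforms and constants, evaluates it on the CPU, and substitutes a constant.

```python
import math

# Hypothetical mini-IR: nested tuples of (op, *operands). This is NOT NIR.
OPS = {
    "add": lambda a, b: a + b,
    "mul": lambda a, b: a * b,
    "sqrt": lambda a: math.sqrt(a),
}


def is_uniform_only(expr):
    """True if the expression depends only on uniforms and constants."""
    kind = expr[0]
    if kind in ("uniform", "const"):
        return True
    if kind == "varying":
        return False
    return all(is_uniform_only(e) for e in expr[1:])


def eval_uniform(expr, uniforms):
    """Evaluate a uniform-only expression on the CPU."""
    kind = expr[0]
    if kind == "const":
        return expr[1]
    if kind == "uniform":
        return uniforms[expr[1]]
    return OPS[kind](*(eval_uniform(e, uniforms) for e in expr[1:]))


def fold_uniforms(expr, uniforms):
    """Replace every maximal uniform-only subtree with a constant."""
    if is_uniform_only(expr):
        return ("const", eval_uniform(expr, uniforms))
    if expr[0] == "varying":
        return expr
    return (expr[0],) + tuple(fold_uniforms(e, uniforms) for e in expr[1:])


# varying * sqrt(uniform), with the uniform known at draw time.
shader = ("mul", ("varying", "v"), ("sqrt", ("uniform", "u")))
folded = fold_uniforms(shader, {"u": 16.0})
print(folded)
```

Whether doing this per draw beats a preamble shader depends on the hardware and on how the uniforms are read back, which is exactly the trade-off being debated in the channel.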

17:33 <alyssa> jekstrand: not to be confused with nerdpipe

17:34 <jekstrand> So, when I get distracted by NIR refactoring, does that mean I've been NIRsniped?

17:34 <robclark> we can't completely delete TGSI.. but maybe we can move it into virgl

17:34 <robclark> :-P

17:34 <dcbaker> alyssa: is it really that different?

17:34 <jekstrand> I'd be ok with that resolution. :)

17:34 <alyssa> jekstrand: Yes

17:34 <HdkR> I can definitely understand hoisting to a pilot shader. But what hardware would it actually be faster to interpret on?

17:35 <alyssa> HdkR: possibly Mali in certain circumstances but haven't benchmarked

17:35 <ajax> robclark: i mean... internally mesa _isn't_ issuing fences around context binds... but it certainly could

17:35 <HdkR> alyssa: Interesting...

17:35 <alyssa> since spawning a compute shader has some overhead

17:35 <jekstrand> HdkR: The problem isn't typically that it's faster to do it on the CPU but that doing it on the GPU is tricky and can introduce stalls.

17:36 <alyssa> jekstrand: The opposite is also true though :-p

17:36 <alyssa> hence, both :-p

17:36 <robclark> ajax: true.. but also kernel broke userspace

17:36 <jekstrand> Oh, doing it on the CPU can definitely introduce stalls. :)

17:37 <HdkR> So the CPU needs to be fast enough, and also the GPU needs to be bad at pilot shaders? :P

17:38 <jekstrand> Or not have pilot shaders

17:38 <alyssa> HdkR: Literally describing MT8183 right there :-p

17:38 <HdkR> jekstrand: That's the nvidia approach. Just make uniform math happen in the same shader :)

17:39 <robclark> for adreno, it's a preamble, so really same shader

17:39 <HdkR> 32x speedup baby~

17:39 <HdkR> I like the Adreno approach

17:42 <HdkR> Sounds like there are multiple needs for a lightweight JIT with basic passes though. Something slightly smarter than just code emission

17:42 <ajax> in fact

17:43 <ajax> now that i've said that i want to make zink do it

17:46 <HdkR> Poor zmike :D

17:46 <zmike> what

17:47 <zmike> ajax: let's focus up here buddy, one thing at a time

17:48 * zmike starts rewriting lavapipe as soon as he says that

17:50 anujp has joined #dri-devel

17:51 <ajax> i literally started typing "focus, adam" into irc, then switched machines so i could take the laptop out on the porch, and came back to being told to focus.

17:52 <zmike> hahahah

17:52 <zmike> watches synchronized

17:52 <ajax> i want a lot of things though. i want a gles2 driver for the verité, if that's any indicator of the relative position of my head and the clouds

17:52 * zmike takes a long blink

17:57 <alyssa> ajax: I dunno if it makes sense to do preamble stuff in zink + gl drivers or vk drivers + gl drivers - zink or everywherez

17:59 <ajax> i meant the "soften makecurrent's implicit flush into fencing" thing, not preambling

18:00 <alyssa> oh

18:00 <alyssa> also worth discussing preambling though

18:00 <zmike> no

18:00 <zmike> stop nerdsniping.

18:01 <alyssa> zmike: discussing as in like

18:01 <HdkR> I can amble on for a while, probably good to get some pre-amble stretches

18:01 <alyssa> "does it make sense to make it the vk drivers problem so zmike doesn't have to" :P

18:02 <robclark> it's already the vk driver's problem ;-)

18:03 <robclark> so zmike is off the hook

18:12 pushqrdx has quit [Remote host closed the connection]

18:12 <Kayden> robclark: i965 and iris have always used separate HW contexts per GL context...since Mesa was back at GL 3.0. Contexts aren't synchronized with one another beyond MakeCurrent implying glFlush. That's the app's job with fences.

18:13 <Kayden> iris is weird in that it uses two HW contexts per GL context - one for render, one for compute, and lets work go out-of-order a bit there, but implicitly synchronized for data dependencies, so it should be transparent to the app

18:13 <robclark> Kayden: the question is where drm_sched_entity fits in on the kernel side.. if you have multiple sched-entity there is nothing ensuring that execution on gpu is same order as flush to kernel

18:13 <Kayden> (eventually when we finally get the fabled compute command streamer we've been promised they'll be able to run in *parallel* which should be really fun...)

18:14 <Kayden> I'm pretty sure drm_sched_entity is tied to HW context in i915, but I haven't honestly read that code

18:14 ybogdano has quit [Ping timeout: 480 seconds]

18:14 <robclark> initially on kernel side, sched-entity was mapped 1:1 with submitqueue (kernel side counterpart to userspace context)

18:15 <Kayden> so yeah, it sounds like that would break on i965/iris

18:15 ngcortes has joined #dri-devel

18:15 <robclark> fwiw, https://patchwork.freedesktop.org/series/95349/ is what I just sent to fix that use-case

18:22 slattann has quit []

18:23 <Kayden> that really is a flawed assumption and a broken app

18:23 <Kayden> codifying that into kernel behavior seems like it will come back to bite you

18:23 <Kayden> as far as I know anyway...

18:24 <Kayden> if anything it seems like the kind of thing that might be better as a mesa driconf

18:24 <Kayden> (which sounds a bit awful)

18:24 <robclark> I think ajax's assessment was gl says MakeCurrent() is a sufficient barrier for multi-ctx rendering on a single thread.. possibly mesa should be creating fences for this.. but OTOH kernel broke userspace

18:26 <Kayden> that...may be true

18:26 <Kayden> your patch says "one sched entity per *process* per priority" though?

18:27 <Kayden> so you're not allowing scheduling between contexts even if they're always running in separate threads?

18:27 <Kayden> that would definitely be oversynchronizing

18:28 <robclark> there isn't really a good way to differentiate between multiple threads vs single thread.. OTOH things get serialized when they are written into the ring

18:29 slattann has joined #dri-devel

18:31 slattann has quit []

18:33 <ajax> actually i'm not completely convinced of what i said

18:34 <ajax> MakeCurrent is Flush but it's not Finish

18:35 <robclark> the question is, whether you can assume any ordering of what happens after Flush

18:36 X-Scale has joined #dri-devel

18:36 <ajax> so... command submission might be ordered between the two ctxs, but there's no requirement that they complete in order, or be externally visible after flushing

18:36 <robclark> that would lean towards "app bug"

18:37 <ajax> if they went down the same command queue it'd have the effect of a finish, because the readback command would issue after the draws flushed.
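The ordering question can be modeled with a toy scheduler. This is only a sketch under simplifying assumptions: `run`, the entity names, and the job strings are hypothetical and do not correspond to the real drm_sched code. Each entity is a FIFO, so order is preserved within an entity, but the scheduler is free to service different entities in any order.

```python
from collections import deque


def run(entities, pick_order):
    """Execute jobs in the order the scheduler services the entities.

    entities:   dict mapping an entity name to a deque of jobs, already in
                submission (flush) order for that entity.
    pick_order: sequence of entity names chosen by the scheduler.
    """
    done = []
    for name in pick_order:
        if entities[name]:
            done.append(entities[name].popleft())
    return done


# Context A flushes a draw, then context B flushes a readback.
# With one entity per context, nothing forces the draw to run first.
separate = {"A": deque(["draw"]), "B": deque(["readback"])}
out_of_order = run(separate, ["B", "A"])

# With a single shared entity, submission order is execution order.
shared = {"P": deque(["draw", "readback"])}
in_order = run(shared, ["P", "P"])

print(out_of_order, in_order)
```

In the shared case the readback can only run after the draw, which is the "effect of a finish" ajax describes; in the separate case only an explicit fence would guarantee it.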

18:37 danvet has quit [Ping timeout: 480 seconds]

18:37 <ajax> but that's implementation detail, i think

18:37 <robclark> at any rate.. I think it is a kernel-broke-userspace.. if we want to re-allow multiple ctx's in same process to complete out-of-order, I think we need to introduce a new flag in submitqueue-create ioctl so new-userspace can opt-in

18:37 pushqrdx has joined #dri-devel

18:38 pushqrdx has quit []

18:39 <ajax> i mean... i kind of want to enable that, but i also much prefer everyone turning the implicit flush off and using real sync apis like a grown up

18:39 <ajax> but agreed, kbu

18:39 pushqrdx has joined #dri-devel

18:41 X-Scale` has quit [Ping timeout: 480 seconds]

18:43 slattann has joined #dri-devel

18:43 <zmike> jekstrand: I think https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13141 should be good to go, but I'm getting a weird lavapipe crash https://gitlab.freedesktop.org/mesa/mesa/-/jobs/14255678

18:43 <zmike> seems like it's from the generated code and not actually anything related to lavapipe though?

18:46 <pushqrdx> what are the odds that by luck firefox is still somehow synchronized with vblank and doesn't tear for literally hours without using a compositor and on modesetting/intel driver

18:48 shashank_sharma has quit [Ping timeout: 480 seconds]

18:49 gawin has quit [Quit: Konversation terminated!]

18:50 heat has quit [Ping timeout: 480 seconds]

18:52 <Company> how expensive is eglCreateWindowSurface()?

18:52 <Company> will I be unhappy if I call it every frame? every few seconds? more than once per window?

18:53 elongbug has quit [Ping timeout: 480 seconds]

18:54 <ajax> approximately the cost of allocating the default framebuffer attachments

18:55 <Company> context: We're wondering about switching from U8 to FP16 if somebody suddenly opens a HDR wide gamut image in eog or whatever

18:56 <Company> so that's doable, we could just live without application-level API and dynamically switch as needed

18:58 <ajax> it's a one-time thing though, really. you do it when the underlying winsys window changes, and the pixel format is immutable for a given window

18:59 <Company> oh

19:00 <Company> so we have to decide on U8 vs FP16 once and then stick to it?

19:00 mctom has joined #dri-devel

19:01 <ajax> for a given EGLSurface, yes. if you want to change then you need to work out with your presentation manager how to convey the idea that the fp16 window replaces the u8 one

19:01 <ajax> not egl's job

19:02 <Company> I'd just eglDestroySurface() the old one

19:02 <ajax> or: would only be if someone decided to reflect the wayland api up as like eglHigherDefsPlzWL(dpy, oldsurf, newsurf)

19:02 <Company> and eglCreateWindowSurface() a new one

19:03 <ajax> yeah

19:04 <Company> in my mind it's just the buffer manager for the wl_surface

19:04 <Company> so it's like creating a new shm_pool or whatever

19:04 <jekstrand> zmike: Uh.... weird.

19:05 <jekstrand> zmike: Why is that test even running in CI? I wouldn't think that extension was exposed in a CI container.

19:05 <zmike> no idea, but it should be notsupported I think

19:05 <zmike> and yet

19:06 danvet has joined #dri-devel

19:10 <clever> where can i find more information on the TFU hw and UIF format used on at least the bcm2711?

19:13 <alyssa> robclark: giggle, fair enough

19:13 <alyssa> I would've hoped vk apps were less dumb about this

19:14 mctom has quit []

19:15 <ajax> jekstrand: given we're +1 thread for wsi already, is there any reason not to use two? it's really ugly for that one thread to manage two queues because there's two unrelated ways it can block so there's no good way to shut it down

19:16 <jekstrand> ajax: I see no reason why not, assuming we have a good reason for two.

19:16 <ajax> but you have to, because it allocates, because xcb is an unrelenting fount of joy

19:16 <jekstrand> ajax: We're already at "WSI is hard; let's spawn threads" so meh

19:16 <ajax> no no. it's the right approach.

19:16 <jekstrand> Also, I don't think you need the "cb" in that statement. :P

19:17 <ajax> fair

19:17 * ajax cracks knuckles, red bull

19:17 <jekstrand> But, yeah, if we need two threads, that's fine. We're already spawning behind the app's back so I see no reason why 2 is worse than 1.

19:17 * jekstrand has the nagging feeling that he's going to be asked to review something before too long

19:21 <ajax> i should be so lucky

19:22 danvet has quit [Ping timeout: 480 seconds]

19:27 <jekstrand> Better idea: Burn a few of my now infinite "owe you one" points to make daniels review it. :P

19:27 <txenoo> clever, TFU is the V3D Texture Formatting Unit, it is mainly used to convert different formats into UIF formatted textures.

19:27 <clever> txenoo: and what exactly is UIF? google cant find much

19:28 <txenoo> Is the tiled format used by V3D GPU.

19:28 <clever> is it similar to what the vc4 v3d used?

19:29 <txenoo> It is only used in V3D.

19:29 <clever> txenoo: https://docs.broadcom.com/doc/12358545 page 105, has the tiled format from the old v3d (pi0 to pi3)

19:29 <txenoo> TFU main usage in V3D is to convert linear textures to UIF format, so the GPU can sample from them.

19:30 <clever> ive not seen it called UIF in the old docs, the new docs are entirely absent, and the old v3d lacked a TFU

19:31 <clever> just trying to confirm, is UIF the same as what is described on page 105?

19:33 <txenoo> No it is a different format. I think that the layout is more complex, at least the code that handles it. There is code in mesa to upload linear textures using this UIF format.

19:33 <clever> ah

19:33 <clever> so there are 2 different tiled formats at play

19:34 <clever> old v3d (pi0 to pi3) uses whats in this pdf

19:34 <jenatali> Does hardware care about a distinction between shadow samplers and non-shadow samplers? Looks like vtn never labels samplers as shadow samplers. I'm wondering if there's already a pass to add that info into sampler types

19:34 <clever> and new v3d (bcm2711) uses a different one called UIF

19:34 <clever> and new v3d, has a dedicated TFU unit, to handle conversion, including yuv inputs

19:34 <alyssa> jenatali: isn't shadow <===> used with a comparison?

19:34 mlankhorst has joined #dri-devel

19:35 <anholt_> clever: there's C code in Mesa for it.

19:35 <jenatali> alyssa: Yeah

19:35 <jenatali> DXIL requires the sampler variable declaration to match the usage

19:35 <clever> src/broadcom/vulkan/v3d_tiling.c: * UIF is the general VC5 tiling layout shared across 3D, media, and scanout.

19:35 <txenoo> clever, here you have a pointer for that code https://gitlab.freedesktop.org/mesa/mesa/-/blob/main/src/broadcom/common/v3d_tiling.c

19:35 <clever> anholt_: ah, i can just read a file like this for more info then

19:35 <alyssa> jenatali: so no, hardware wouldn't care since it'd be embedded in the sampler instruction, there aren't declarations.

19:36 <jenatali> alyssa: Then why is it in the sampler properties in the glsl_type at all?

19:36 <clever> anholt_: i'm also having some trouble getting the vec online with bcm2711, i can feed it the needed 108mhz clock, and get it generating a valid ntsc signal

19:37 <clever> anholt_: but the muxing around the hvs/pv isnt entirely clear to me, and it only ever generates a solid black image

19:37 <anholt_> all of my knowledge of rpi clocks is in the linux kernel clock driver.

19:37 <clever> anholt_: i think the clock problem is entirely solved

19:38 <clever> i can measure the hsync/vsync rate with a scope, and confirm that it is clocked correctly

19:38 <clever> its just the mux to route things thru the hvs/pv/vec pipeline, that i havent solved

19:38 <anholt_> similarly, all my knowledge of tv output on rpi is in the vc4 vec code in the linux kernel.

19:39 <clever> anholt_: there are some things entirely absent from the linux source, like the arm core being unable to even touch HVS registers until something happens

19:39 <clever> it reminds me of how the DSI drivers cant touch DSI regs, and have to cheat via the dma or mailbox

19:39 <alyssa> by logical deduction, anholt_ has no knowledge of this.

19:40 <clever> or isnt at liberty to say

19:40 <anholt_> given that linux touches those regs, if you find you can't touch those regs, I would suspect that you haven't powered the block on.

19:40 <txenoo> clever, UIF cannot be used for scanout.

19:40 <clever> and only what is in the source, has been approved for release

19:40 <clever> anholt_: that is what every engineer has said, and that is totally wrong

19:41 <anholt_> I really don't have any secrets here.

19:41 <clever> anholt_: given that i have full xfce running on ntsc out, its definitely bloody on

19:41 <anholt_> maybe there's some axi bridge flag somewhere for hvs from arm, I dunno.

19:41 <clever> yeah

19:41 <clever> thats my theory

19:42 <clever> and every RPF guy i have talked to, said power domains, and then fell silent

19:42 <clever> anholt_: https://www.youtube.com/watch?v=BQyyVtmmVg8 2d animation, 3d animation, and full linux console, implies its definitely not power domains

19:43 <clever> but the arm cant touch HVS, so no pageflips, just a dumb static framebuffer

19:46 <clever> txenoo: does the bcm2711 v3d output uif or linear for the final render?

19:46 <clever> vc4 v3d had: linear raster, t-format, and lt-format

19:50 slattann has quit []

19:52 jekstrand has quit [Ping timeout: 480 seconds]

19:53 <jstultz> robclark: pinchartl: I unfortunately don't know about plans. When I heard about the bazel changes (not knowing anything about bazel) I wondered aloud if it might help with the external integration issues, but the response from google devs was approximately "not likely"

19:53 <jstultz> robclark: pinchartl: here dug up the twitter conversation: https://twitter.com/johnstultz_work/status/1327019403708284929

19:54 <agd5f> clever, on older AMD GPUs, interlaced timings didn't work with tiled surfaces. You had to use linear. maybe you have a similar limitation?

19:55 <clever> agd5f: the ntsc problem is present when using linear images, i'm just also researching the v3d in parallel

19:55 <agd5f> ok

19:56 <clever> agd5f: https://www.youtube.com/watch?v=u7DzPvkzEGA this would be an interlaced scanout, with 20 moving framebuffers (all being downscaled), plus one full-screen bg layer

19:56 <clever> the glitching happens when there is too much input data on the same scanline, and it cant generate data in time

19:57 <agd5f> sounds like a watermark or line buffer issue

19:57 <clever> agd5f: https://www.youtube.com/watch?v=JFmCin3EJIs 13 framebuffers, with no scaling, is the limit for a progressive scanout 1280x1024 display

19:58 <clever> yeah, i believe the glitching at 20 is a memory bandwidth issue

19:58 <clever> it's down-scaling on the fly, so it has to fetch a lot more image data than you're seeing

19:58 <clever> and there is no cache, so it has to re-fetch it for every sprite on the scanline

19:59 <clever> pre-scaling in ram would make it perform better, but my main goal is to push it over the edge, to learn where the edge is

20:00 <jstultz> pinchartl: and while I don't have any love for the android build system, i'd also not paint a whole team with the big "lack of humility" brush, as that's a common problem i see with individuals everywhere. :)

20:01 jekstrand has joined #dri-devel

20:11 <txenoo> clever, the display driver needs to be fed linear raster; v3d can render to UIF or linear, but if it goes to display, linear needs to be used.

20:11 <clever> ah, same as old v3d

20:11 <clever> i assume rendering to a tiled format, is only for when you want to re-use it as a texture in a second pass

20:12 <clever> a cheap way to implement a security camera in a video game, for ex

20:13 <txenoo> clever, yes

20:14 <clever> ive also seen cases of game console emulators not handling this well

20:14 <clever> one n64 game (Mario Kart i think) had a jumbotron on the course, that showed the main camera render

20:15 <clever> but for cases like the v3d, that means you have to render the scene twice, once to linear, and once to uif

20:15 <clever> so some emulators have a flag to disable that feature

20:17 <clever> i'll have to re-read all of the hvs code, and find the answer to my muxing problem

20:21 gawin has joined #dri-devel

20:26 hansg has quit [Quit: Leaving]

20:41 <daniels> jekstrand: I got your notification but the message isn’t showing - what am I reviewing for you after I’m done with zmike?

20:46 haagch has quit [Quit: http://quassel-irc.org - Chat comfortably. Anywhere.]

20:47 haagch has joined #dri-devel

20:52 <pinchartl> jstultz: fair point, I accept the criticism ;-)

21:02 markus has quit [Ping timeout: 480 seconds]

21:02 markus has joined #dri-devel

21:03 <pushqrdx> if someone wanted to implement a tearfree option (outside the compositor) at a lower level, where would be the best place for that, i am trying to understand the graphic stack on linux

21:04 <pushqrdx> and i wonder why doesn't modesetting and/or mesa provide something like triple buffering or vsync

21:04 Bennett has joined #dri-devel

21:06 <pushqrdx> tbh i am a bit confused as to where mesa stands in the stack, is it the same thing as modesetting

21:06 <ajax> no

21:07 <pushqrdx> modesetting is part of x then? or is the kernel module

21:08 <ajax> there's a generic driver for X named modesetting, which more or less works with every kernel driver that implements a feature set that is also named modesetting.

21:08 <ajax> 'kms' tends to mean the kernel part specifically

21:10 <pushqrdx> so modesetting can be thought of as an interface, and the modesetting driver for X is just an implementation of that for xorg, modesetting the interface is what talks to the kernel side of things then

21:16 <clever> pushqrdx: my understanding is that modesetting and kms, is an api to do a collection of tasks

21:16 <clever> 1: configure the resolution of a video output port

21:16 <clever> 2: allocate memory for framebuffers

21:16 <clever> 3: configure what xy coord a framebuffer is rendered at

21:16 <clever> 4: perform atomic flips between framebuffers

21:17 <clever> opengl drivers are sometimes implemented as a kms device, that can only do 3d render, and spits out a framebuffer when done

21:17 <clever> and the framebuffer handles can be exchanged between devices, so you can display it on a given output

21:18 <clever> but that only holds true for full screen rendering, when you have complete control of the video output

21:18 <clever> X11/wayland complicates matters

21:21 <jekstrand> daniels: ajax rewriting the Vulkan X11 WSI code. :)

21:22 Duke`` has quit [Ping timeout: 480 seconds]

21:22 <pushqrdx> so if we think about the route a draw call takes from an xclient to hardware is this somewhat accurate? xclient -> xserver -> xserver-driver(xf86-xxx) -> mesa -> drm -> kernel -> hardware

21:24 gawin has quit [Ping timeout: 480 seconds]

21:25 ybogdano has joined #dri-devel

21:27 <bnieuwenhuizen> pushqrdx: draw calls themselves go xclient -> mesa -> libdrm (for some drivers) -> kernel -> HW

21:27 <bnieuwenhuizen> assuming you don't use indirect rendering (and you mostly shouldn't)

21:28 <bnieuwenhuizen> then to display the output you're mostly right

21:29 <pushqrdx> bnieuwenhuizen oh so mesa cuts down on the trip to xorg?

21:30 <robclark> https://en.wikipedia.org/wiki/Direct_Rendering_Infrastructure

21:30 <daniels> jekstrand: I think that might push you into negative credit tbh

21:31 <bnieuwenhuizen> basically assuming you're using GL/vulkan only the final buffer from e.g. glXSwapBuffers gets communicated to X

21:31 <jekstrand> daniels: I fear we may be experiencing some pretty serious credit inflation then...

21:31 <jekstrand> Back in my day, credit was worth something... You could get a whole patch series review for half a credit.

21:31 <jekstrand> </old man voice>

21:33 pushqrdx_ has joined #dri-devel

21:33 rasterman has quit [Quit: Gettin' stinky!]

21:37 <ngcortes> heads up folks, mesa ci will be down this afternoon while we replace a failing disk. Things should be back online by tomorrow. we'll keep everybody posted.

21:37 pushqrdx has quit [Ping timeout: 480 seconds]

21:38 gawin has joined #dri-devel

21:40 <robclark> all of mesa-CI or just certain runners? In latter case you should push an MR to disable the necessary jobs

21:43 vivijim has quit [Ping timeout: 480 seconds]

21:44 <anholt_> robclark: ngcortes's thing is intel's private CI.

21:44 <robclark> ahh, ok

21:45 pnowack has quit [Quit: pnowack]

21:45 <daniels> phew …

21:45 <daniels> ngcortes: would be helpful if you said ‘intel CI’ in future!

21:46 <ngcortes> anholt_, daniels my bad! I did mean the intel mesa ci

21:46 <daniels> jekstrand: back in my day people were thrilled when their winsys gained the ability to understand hotplug …

21:46 <daniels> ngcortes: no problem, good luck with the firefighting!

21:46 <ngcortes> daniels, thanks!

21:48 Hi-Angel has quit [Ping timeout: 480 seconds]

21:52 pushqrdx has joined #dri-devel

21:53 gouchi has quit [Remote host closed the connection]

21:57 pushqrdx_ has quit [Ping timeout: 480 seconds]

22:00 <ajax> i think i might hate xcb

22:00 <Venemo> do you hate it less than xlib?

22:01 <ajax> not sure

22:02 <ajax> a few years ago the answer would have been "no" quite quickly though

22:09 <Venemo> really? I thought xcb was made to be saner than xlib

22:10 <HdkR> I've still yet to find the documentation on the xcb extensions. I have no idea where distros build their libraries from

22:10 <HdkR> Definitely isn't in the primary libxcb repo :|

22:11 gitautas has quit [Remote host closed the connection]

22:11 mlankhorst has quit [Ping timeout: 480 seconds]

22:15 nchery is now known as Guest1538

22:15 nchery has joined #dri-devel

22:15 Guest1538 has quit [Read error: Connection reset by peer]

22:23 ybogdano has quit [Ping timeout: 480 seconds]

22:32 angerctl has quit [Quit: reboot]

22:35 Namarrgon has joined #dri-devel

22:36 iive has quit []

22:47 Namarrgon has quit [Remote host closed the connection]

22:47 Namarrgon has joined #dri-devel

22:48 ybogdano has joined #dri-devel

23:02 <ngcortes> anyway, intel mesa ci should be back online by tomorrow. I've taken it offline while the new disk syncs (ci is super slow during this process so I took it offline to prevent it from bottlenecking)

23:19 <jekstrand> dcbaker: I just posted https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13156 and assigned you

23:19 <jekstrand> dcbaker: It'd be nice if you could give it a look. I think it's mostly fine except a bunch of the generators are still in the util/ directory but run from the runtime/ directory.

23:19 <jekstrand> I don't remember what the BKM for cross-directory python deps is or I'd have moved the generators too.

23:20 <jekstrand> Once we've got that, then I can move things like anv_shader_compile_to_nir into common code (with a better name, of course)

23:20 JohnnyonFlame has joined #dri-devel

23:21 aswar002 has quit [Quit: No Ping reply in 180 seconds.]

23:21 mdnavare has quit [Remote host closed the connection]

23:22 aswar002 has joined #dri-devel

23:22 <jekstrand> I'm contemplating whether or not it's practical to have a fully shared vk_pipeline_cache implementation. That seems like it'd be a good idea. Just have to figure out how it'd work.

23:22 mdnavare has joined #dri-devel

23:22 <jekstrand> Probably something that caches blobs?

23:23 unerlige has quit [Remote host closed the connection]

23:23 unerlige has joined #dri-devel

23:23 <jekstrand> RADV has a hand-rolled hash table

23:24 X-Scale` has joined #dri-devel

23:25 <jekstrand> airlied: Why'd you hand-roll the hash table in radv_pipeline_cache? It seems everyone has copied+pasted your hand-rolled hash table instead of using struct hash_table like ANV does. :-/

23:25 <jekstrand> If there's a good reason for it, I'm all ears.

23:27 <HdkR> Mesa uses xxhash for its hash table now right?

23:27 <jekstrand> Ugh... There's so much pipeline cache copy+pasta... :-(

23:27 <jekstrand> HdkR: I think so.

23:27 <jekstrand> HdkR: Yes, we do

23:28 <jekstrand> But not for the hash set. ¯\_(ツ)_/¯

23:28 <HdkR> Could have been before Anthony replaced the hash with xxhash, so it wasn't performing as well as it could? :P

23:28 X-Scale has quit [Ping timeout: 480 seconds]

23:29 <HdkR> Looks like it was only a year ago now

23:29 <jekstrand> HdkR: Or it could be that ANV originally had that hash table krh hand-rolled and then radv copied+pasted it before we switched to struct hash_table. :P

23:29 <HdkR> That too

23:29 <jekstrand> And everyone else copied+pasted from RADV

23:30 <HdkR> That's definitely the simpler answer

23:31 <bnieuwenhuizen> jekstrand: wondering if that had anything to do with trying to use Vulkan allocators?

23:31 <bnieuwenhuizen> I mean that is a typical NIH cause

23:31 <bnieuwenhuizen> then again it seems to use plain malloc in radeonsi so maybe not :P

23:31 <bnieuwenhuizen> in radv*

23:31 <jekstrand> According to 10f9901bcef7724cb72fb2fe7e3dd8d6660d2f34, it was because krh wanted to store the shader binaries themselves in the pipeline cache like we did in the i965 shader cache.

23:32 <bnieuwenhuizen> wdym store the binaries themselves?

23:32 <jekstrand> I mean the pipeline cache contains a BO full of shader binaries and you execute directly from the cache.

23:32 <jekstrand> But then we realized that you can't actually do that because clients are allowed to delete the pipeline cache after calling vkCreateGraphicsPipelines().
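The lifetime problem can be illustrated with a small model. This is a sketch only: `PipelineCache`, `Pipeline`, `ShaderBlob`, and `fake_compile` are hypothetical names, not the Mesa or Vulkan API. The cache maps a hash of the shader source to a blob, and each pipeline keeps its own reference to the blob, so destroying the cache does not invalidate pipelines created from it. In C that own reference would typically be a refcount; here Python's object references play that role.

```python
import hashlib


class ShaderBlob:
    """Compiled shader binary (hypothetical stand-in)."""

    def __init__(self, data: bytes):
        self.data = data


class PipelineCache:
    """Blob cache keyed by a hash of the shader source."""

    def __init__(self):
        self._entries = {}

    def lookup_or_compile(self, source: bytes, compile_fn):
        key = hashlib.sha1(source).hexdigest()
        blob = self._entries.get(key)
        if blob is None:
            blob = ShaderBlob(compile_fn(source))
            self._entries[key] = blob
        return blob

    def destroy(self):
        # The client may do this right after creating its pipelines.
        self._entries.clear()


class Pipeline:
    def __init__(self, cache, source, compile_fn):
        # The pipeline holds its own reference; it never reaches back
        # into the cache when it needs the binary.
        self.shader = cache.lookup_or_compile(source, compile_fn)


compile_count = 0


def fake_compile(source: bytes) -> bytes:
    """Stand-in for a real compiler; counts how often it runs."""
    global compile_count
    compile_count += 1
    return b"BIN:" + source


cache = PipelineCache()
p1 = Pipeline(cache, b"void main() {}", fake_compile)
p2 = Pipeline(cache, b"void main() {}", fake_compile)
cache.destroy()
print(compile_count, p1.shader.data)
```

The second pipeline hits the cache, and both pipelines still have a valid binary after the cache is gone.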

23:33 <bnieuwenhuizen> ah

23:33 <jekstrand> That patch has a bugzilla link.... Tells you how old it is. :D

23:33 <bnieuwenhuizen> want me to write a patch to convert radv or is a mass patch incoming?

23:34 <jekstrand> bnieuwenhuizen: Once !13156 lands, I'm thinking of making a common vk_pipeline_cache implementation and then switching all the drivers over to it.

23:36 <jekstrand> If I do it right, everyone will also get NIR caching and disk caching for free.

23:37 <jekstrand> And the pipeline_cache_control extension, maybe? Whichever one it is that lets you control locking.

23:38 ybogdano has quit [Ping timeout: 480 seconds]

23:38 ngcortes has quit [Remote host closed the connection]

23:43 pcercuei has quit [Quit: dodo]

23:48 tursulin has quit [Read error: Connection reset by peer]

23:52 ybogdano has joined #dri-devel

23:55 rsripada_ has joined #dri-devel

23:58 dolphin has quit [Ping timeout: 480 seconds]

23:59 Ryback_ has quit [Ping timeout: 480 seconds]

23:59 rsripada has quit [Ping timeout: 480 seconds]

23:59 dolphin has joined #dri-devel