#dri-devel on 2022-02-01 — irc logs at oftc.irclog.whitequark.org

2021-07-26 22:56 ChanServ changed the topic of #dri-devel to: <ajax> nothing involved with X should ever be unable to find a bar

00:01 <anholt> gawin: we already convert unsupported operations in nir.

00:04 <gawin> do you remember in which file? i'd add a few more

00:05 <gawin> it'd be superb to move "transform_r300_vertex_CMP" into NIR

00:09 mlankhorst has quit [Ping timeout: 480 seconds]

00:09 <anholt> I guess you could make an r300-specific cmp, but I think most of our wins would come from sorting out has_fused_comp_and_csel for ntt

00:11 <anholt> or once we land the tgsi ra series, then we do some greedy csel fusing in ntt.

00:12 <anholt> note that "make an r300-specific cmp" would involve both a nir and tgsi opcode, and we don't really make new tgsi opcodes these days.

00:12 <anholt> so at that point you're probably actually talking about making yourself a copy of ntt as the frontend for r300's compiler.

00:13 pcercuei has quit [Quit: dodo]

00:18 ybogdano has joined #dri-devel

00:23 <zmike> airlied: well I filed the issue and got a pretty quick reply, so it looks like my compiler patch is correct and this is a longstanding bug that nobody else has managed to hit

00:34 <gawin> anholt: I was thinking about getting rid of all "transforms", which are introducing new temps (for example opcodes which are implemented by 3 or more opcodes). except moving into NIR, we could either create some garbage collector (implementing SSA) or manually cleanup after each "transform" (maybe it'd require rewriting allocator). I guess first option is simplest and easiest to do(?)

00:35 <gawin> iirc float_const_write_dynamic_loop_read_vertex is failing because "transform_r300_vertex_CMP" is grabbing too many temps

00:36 <anholt> register allocation should make an arbitrary number of cmps take just 1 more (at worst) total temp.

00:36 <anholt> if that's not the case, then you need to fix your RA.

00:37 <anholt> (spoilers: radeon does, in fact, need to fix its ra)
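
A minimal sketch of anholt's point about register allocation, assuming each lowered CMP needs one scratch temp that is written by one instruction and read for the last time by the next. The `allocate` helper and the live-range representation are hypothetical illustrations, not r300, NIR, or ntt code; the technique is a plain greedy linear scan over live ranges.

```python
def allocate(live_ranges):
    """Greedy linear-scan allocation.

    live_ranges: list of (first_def, last_use) instruction indices, inclusive.
    Returns (assignment, register_count).
    """
    order = sorted(range(len(live_ranges)), key=lambda i: live_ranges[i][0])
    active = []      # (last_use, register) for values that are still live
    free = []        # registers whose value is dead and can be reused
    assignment = {}
    register_count = 0
    for i in order:
        first_def, last_use = live_ranges[i]
        # Release registers whose last use is before this definition.
        still_live = []
        for end, reg in active:
            if end < first_def:
                free.append(reg)
            else:
                still_live.append((end, reg))
        active = still_live
        if free:
            reg = free.pop()
        else:
            reg = register_count
            register_count += 1
        assignment[i] = reg
        active.append((last_use, reg))
    return assignment, register_count


# Hypothetical shader: 100 lowered CMPs, each with a scratch temp written at
# instruction 2*i and read for the last time at instruction 2*i + 1.
cmp_temps = [(2 * i, 2 * i + 1) for i in range(100)]
assignment, register_count = allocate(cmp_temps)
print(register_count)  # 1: every scratch temp lands in the same register
```

Because the scratch live ranges never overlap, the allocator keeps handing back the same register, which is the "1 more (at worst) total temp" behaviour described above.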

00:44 <gawin> for now storing index and reusing all over is good idea?

01:07 ybogdano has quit [Ping timeout: 480 seconds]

01:16 italove31 has quit []

01:16 italove31 has joined #dri-devel

01:16 <airlied> zmike: I disagree with his assessment, since the GLSL spec doesn't apply, it's the GLSL ES spec that needs the language

01:17 Ristovski has joined #dri-devel

01:17 <zmike> airlied: I guess we'll see what he says

01:18 <airlied> I don't see that sort of language anywhere in the GLSL ES spec

01:19 Lucretia-backup has quit [Remote host closed the connection]

01:19 <airlied> adding an "if targeting OpenGL" to that spec would clarify it alright

01:19 Lucretia-backup has joined #dri-devel

01:20 ced117 has quit [Remote host closed the connection]

01:20 ced117 has joined #dri-devel

01:30 fxkamd has quit []

01:32 <maxzor> FLHerne, please don't sue me for plagiarism https://en.wikipedia.org/wiki/AMDgpu_(Linux_kernel_module)

01:32 <maxzor> one could argue that someone could have done it during these years.

01:33 <maxzor> probably won't last long due to lack of reliable sources...

01:49 <airlied> maxzor: amdgpu isn't developed under ROCm

01:50 <airlied> amdkfd is the rocm component of the amdgpu driver

01:50 <airlied> the driver is developed on the amd-gfx mailing list, and internally at AMD by whatever teams have requirements on it I suppose

01:50 <maxzor> do you still maintain this hard distinction within amdgpu.ko?

01:51 <airlied> amdkfd is now just part of amdgpu

01:51 <airlied> but there is still a large part of kfd specific code in there

01:51 columbarius has joined #dri-devel

01:53 <airlied> and you can turn off KFD completely still with CONFIG_HSA_AMD

01:53 <maxzor> airlied, do you know a place where one can read a summary of the difference between the designs of amdkfd and other graphics drivers? Read on the CRIU thread that amdkfd was subject to mid-term change/redesign also.

01:53 co1umbarius has quit [Ping timeout: 480 seconds]

01:53 <airlied> nope never seen it written down in a neat explanation

01:53 <airlied> amdkfd isn't a graphics driver

01:53 <maxzor> right

01:54 <airlied> it exists solely to provide userspace command submission to firmware provided compute queues

01:54 <maxzor> why is courbet not doing it ':)

01:54 <airlied> amdkfd was probably a bad idea in hindsight

01:54 <airlied> since they tried to move away from the fd model to a process vm model

01:55 <airlied> so multiple GPUs could share a process VM and have the kernel manage it all

01:59 <maxzor> knowing virtually nothing about the graphics pipeline I won't ask relevant questions on this part :<

01:59 <maxzor> thank you for inputs!

02:00 <airlied> also I don't think amdgpu is explicitly tested on fd.o gitlab

02:00 <airlied> fd.o gitlab is used for mesa driver testing, don't think anyone has graduated it to testing kernel drivers yet

02:01 <maxzor> have you been approached for rocm mesa interop yet?

02:01 <maxzor> hip has patches mentioning vulkan, not sure if it is reserved to amdvlk/pal

02:04 ngcortes has quit [Remote host closed the connection]

02:09 <maxzor> I started assembling the understanding yesterday that the design differences that you just mentioned could be a hindrance to this interop, and that they had not been worked through yet

02:12 V_ has quit [Remote host closed the connection]

02:15 <airlied> maxzor: I think they only care about pal

02:15 <maxzor> -.-

02:15 <airlied> zmike: btw do you have a gles build of the gtf test suite? I wonder if those tests get executed there as well

02:15 <airlied> granted you don't need GTF for latest gles conform, but older ones all used it

02:18 <zmike> airlied: no, I only build for gl

02:18 <zmike> I think you'd have to check the mustpass list anyway to find out?

02:19 <airlied> oh mustpass good plan, I'll find an old one

02:20 <airlied> yes those are in the gles3.2 mustpass

02:28 lplc has joined #dri-devel

02:33 gawin has quit [Ping timeout: 480 seconds]

02:47 <zmike> according to jekstrand the atan tests can't pass as fp16, so it seems we're at an impasse

02:51 <graphitemaster> speaking about barriers, if you have a draw call which writes to a texture in an FBO and you read that texture in a compute shader (say with texelFetch), what barrier is actually necessary here?

02:51 <graphitemaster> The glMemoryBarrier spec is very clear about it only being valid for *writes from shaders* not writes via the framebuffer

02:52 <graphitemaster> Right now I just have FETCH barrier but that doesn't seem correct to me

02:53 <graphitemaster> FRAMEBUFFER barrier is when you have a draw call that will write to a texture (via FBO attachments) that was previously written via a shader (imageStore)

02:53 <imirkin> when you say "draw call which writes to a texture", you just mean you're drawing a tri onto some fb, right?

02:53 <imirkin> not like imageStore or whatever?

02:53 <graphitemaster> Yeah

02:53 <imirkin> iirc you don't need to do anything

02:53 <graphitemaster> imageStore specifically you'd use SHADER_IMAGE_ACCESS barrier for

02:53 <jekstrand> zmike: Sure... Blame it all on me. :-P

02:54 <graphitemaster> So writes from a draw call (to framebuffer attachments) are implicitly synchronized with compute dispatches that read those attachments?

02:54 <imirkin> graphitemaster: and later draws, yes

02:54 <graphitemaster> That's not true in Vulkan

02:54 <imirkin> forget compute for a second

02:54 <graphitemaster> What is the purpose of glTextureBarrier then

02:54 <imirkin> let's say you want to do a 2-pass render

02:54 <zmike> jekstrand: whoa whoa calm down buddy I'm just citing your expert assessment

02:54 <imirkin> that's when you're reading the framebuffer from the same shader as which is writing it

02:55 <graphitemaster> imirkin, https://bugs.freedesktop.org/show_bug.cgi?id=101572#c1

02:55 <zmike> I just wanna get this failboat uncapsized

02:55 <graphitemaster> This says something different so I'm really confused

02:55 <imirkin> the first part agrees with me :)

02:56 <imirkin> anyways... nha tends to know this stuff. he did at least part of the images bringup in mesa iirc

02:56 <graphitemaster> Wait how does it agree with you, am I reading it wrong

02:56 <imirkin> he says that glMemoryBarrier is not for you :)

02:56 <graphitemaster> Yeah that I know

02:56 <imirkin> that part agrees with me too :)

02:57 <graphitemaster> Okay well we're in agreement there

02:57 <graphitemaster> Do I need texture barrier though :D

02:57 <imirkin> ARB_texture_barrier has this text:

02:57 <imirkin> This extension relaxes the restrictions on rendering to a currently

02:57 <imirkin> bound texture and provides a mechanism to avoid read-after-write

02:57 <imirkin> hazards.

02:57 <imirkin> so this is about reading and writing to the same texture in the same shader

02:58 <graphitemaster> Oh

02:58 <imirkin> i.e. having a texture bound both for sampling and framebuffer

02:58 <imirkin> basically this is only allowed if you're only reading the "current" location

02:58 <graphitemaster> I didn't even think that was possible without something like fragment interlock

02:58 <imirkin> and glTextureBarrier() is a way to let you flush the texture cache since you just updated everything

02:58 <graphitemaster> Or framebuffer fetch

02:59 <imirkin> and this only works if there's no over-draw

02:59 <imirkin> i.e. it's for a very very very limited use-case

03:00 V has joined #dri-devel

03:01 <graphitemaster> So then in OpenGL every compute dispatch needs to complete before a draw can even begin

03:01 <graphitemaster> There's literally no room for overlap

03:01 <imirkin> no

03:01 <imirkin> other way. and only if the compute shader is sampling from the fbo

03:01 <graphitemaster> Or writing to it

03:01 <imirkin> no

03:01 <imirkin> compute shader can't "write" to the fbo in the same natural way

03:01 <graphitemaster> Oh but if it's writing to it you need FRAMEBUFFER barrier before the draw

03:01 <imirkin> it'd have to use imageStore/etc

03:01 <imirkin> exactly.

03:02 <graphitemaster> Makes sense

03:02 <imirkin> anyways ... this is my understanding.

03:07 <HdkR> Time for Nouveau to add support for Maxwell PSI :P

03:07 <imirkin> PSI?

03:07 <HdkR> Pixel Shader Interlock

03:07 <imirkin> yeah. that'll happen.

03:07 <imirkin> NEVER

03:08 <graphitemaster> fragment interlock on NV is really slow

03:08 <HdkR> What, you don't want to RE the hundreds of instructions it takes to do PSI? :D

03:08 <graphitemaster> So you couldn't possibly do worse than the official driver

03:08 <imirkin> i'm going to do better

03:08 <imirkin> i'm going to give people the benefit of not providing that ext

03:09 <imirkin> and then they won't be tempted into doing stupid shit

03:09 <jekstrand> :)

03:09 <graphitemaster> In fact, interlock on the official driver turns my whole pass into the speed of nouveau on a modern NV GPU XD

03:09 <HdkR> haha

03:09 <jekstrand> It supposedly doesn't suck too bad on Intel

03:09 <airlied> zmike: I think they pass on llvmpipe :-P

03:09 <jekstrand> Once again, optimizing the things that don't matter...

03:09 <HdkR> ^ I've heard that as well

03:09 <airlied> if I turn off the constbuf stuff, but I haven't had time to check fully, maybe in a few hours I'll dig in

03:09 <graphitemaster> I think Intel has it for framebuffer fetch reasons

03:10 <imirkin> jekstrand: intel just serializes everything, so it's a no-op right? :p

03:10 <graphitemaster> And Intel has framebuffer fetch because they need to software blend the advanced blend equations

03:10 <graphitemaster> Since it's spec now right?

03:10 <imirkin> everyone has to do advanced blend in software

03:10 <graphitemaster> NV supports KHR_blend_equation_advanced in hardware

03:10 <HdkR> Everyone hates the photoshop modes

03:10 <zmike> airlied: I shouldn't have to disable that, and it still doesn't fix the atan tests

03:11 <imirkin> graphitemaster: huh? since when?

03:11 <graphitemaster> 3000?

03:11 <imirkin> ampere? ok, i can believe that.

03:11 <imirkin> or did you mean the year 3000? :p

03:11 <graphitemaster> haha

03:11 <graphitemaster> Oh god please tell me we're not using OpenGL in the year 3000

03:11 <jenatali> Is there any good lowering available already for those advanced blend modes?

03:11 <imirkin> jenatali: yea

03:12 <imirkin> you just have to implement a "fbfetch" op

03:12 <zmike> as long as you can do fbfetch

03:12 <jekstrand> jenatali: Yup. Just implement framebuffer_fetch_non_coherent and you get it "for free"

03:12 <jenatali> Excellent. In terms of framebuffer fetch or something else?

03:12 <imirkin> which you can lower into a texture load if you want

03:12 <jekstrand> If you can do coherent fbfetch, we automatically enable the coherent blend advanced extension

03:12 <imirkin> there isn't even support in mesa for a non-lowered impl :)

03:13 <graphitemaster> You can emulate fbfetch with another implicit R32UI and atomic increment the pixels on that then do a nice spinny boy on it

03:13 <imirkin> [would be addable, obviously, but it hasn't come up...]

03:13 <graphitemaster> That's what I have as a fallback :|

03:13 <graphitemaster> Spin locks in a fragment shader are always ... fun

03:13 <graphitemaster> I bet there is a less stupid way

03:13 <jekstrand> imirkin: I'm sure zmike will add it so he can hook it into VK_EXT_blend_equation_advanced to get at that sweet NV hardware... :-P

03:13 <imirkin> heheh

03:14 <graphitemaster> But I don't get to write drivers, just GL :|

03:14 <jenatali> I think we could do it with ROVs maybe. I'll get to that eventually

03:14 <imirkin> yeah, that nvidia does it in hw is completely news to me

03:14 <imirkin> but i haven't kept up with the latest

03:14 <zmike> jekstrand: I had actually forgotten about that, but it's on my list

03:14 <imirkin> insufficiently interested, tbh

03:14 <zmike> and I think gallium does support a non lowered impl

03:15 <zmike> it's got all the modes

03:15 <imirkin> it gets the info, but i thought it got the lowered thing. maybe someone made an option to pass through the originals ... svga maybe?

03:16 <graphitemaster> I didn't even see there is a BlendBarrierKHR, shrug

03:16 <airlied> zmike: not lowering precision pretty much disables that cap anyways

03:16 <imirkin> graphitemaster: theres a _coherent variant

03:16 <imirkin> which is supported by intel, naturally

03:16 <imirkin> (and all the arm drivers i think)

03:16 <graphitemaster> LOL

03:16 <graphitemaster> Intel is so weird

03:16 <imirkin> and then you don't have to do barriers

03:17 <jekstrand> And you can draw more than one triangle at a time.

03:17 <zmike> airlied: only for desktop

03:18 <imirkin> you can draw as many triangles as you want without _coherent

03:18 <zmike> and I don't have to pass atan in es

03:18 <imirkin> as long as there's no overdraw :)

03:18 <zmike> so I can figure out the mat ones and disable lowering for desktop gl and be fine

03:18 <graphitemaster> Would be nice if you could get access to the hardware blend units in compute shaders

03:18 <jekstrand> Let's start by figuring out the mat ones. Those look like they might be legit fails.

03:19 <jekstrand> zmike: ^^

03:19 <zmike> that's my plan during my meeting block tomorrow

03:19 <jekstrand> zmike: It's possible whatever bug is breaking those is affecting atan()

03:19 <airlied> https://gitlab.freedesktop.org/mesa/mesa/-/issues/5477#note_1099092

03:19 <airlied> is the problem I found in llvmpipe before with getuniform

03:20 <airlied> I was going to look into the matrix ones to see if it was similar

03:20 <HdkR> graphitemaster: That's just called a fragment shader :)

03:20 <jekstrand> zmike: But I still suspect atan() is something with fp16

03:20 <airlied> well we do lower atan in glsl to some stuff with 32-bit stuff in it

03:20 <graphitemaster> HdkR, okay but I need to do 3D voxel stuff and fragment shaders don't support 4096 layers, and using a geometry shader to emit those is sadge

03:21 <imirkin> graphitemaster: NV_viewport_array2?

03:21 * airlied would rather we at least have a root cause before we just go axing code paths

03:21 <jekstrand> sure

03:21 <zmike> I'm ok with this

03:22 <jekstrand> I'm generally a fan of figuring out why bugs exist rather than hacky workarounds.

03:23 <graphitemaster> imirkin, meh, also neh

03:23 * airlied assumes gles doesn't have glGetUniform

03:23 <imirkin> graphitemaster: or NV_geometry_shader_passthrough?

03:23 <airlied> oh it does

03:24 <graphitemaster> Can't set that many viewports

03:24 <imirkin> graphitemaster: oh yeah, good point. 16 viewports

03:24 <airlied> I suspect the fp16 constbuf stuff is buggy wrt GetUniform, and we just haven't tested it

03:24 <jekstrand> Yeah, when I looked at the mat test, the shaders looked fine. I suspect we've got a mismatch somewhere with uniform upload.

03:24 <jekstrand> But I don't know that code well at all.

03:25 * jekstrand doesn't do GL :-P

03:25 * imirkin definitely won't mention ARB_copy_image...

03:26 <airlied> yeah fixing the uniform upload to now lower to 16-bit fixes the mat tests

03:26 <jenatali> Oh there's a blend barrier. Okay that helps

03:27 <jenatali> I hadn't actually read that spec yet, just the brief and got scared

03:27 <zmike> airlied: nice!

03:27 <jekstrand> imirkin: My ARB_copy_image implementation no longer exists. :P

03:27 <imirkin> hehe

03:27 <imirkin> you piped it through core

03:27 <imirkin> that's still there

03:27 <jekstrand> jenatali: Yes, there's a barrier so you can do your flushes etc.

03:27 <jekstrand> imirkin: Yeah... There are a few lines in src/mesa/ that still blame to me. :P

03:28 <jenatali> Aka copy back or something

03:28 <graphitemaster> How much do you want to bet BlendBarrierKHR has the same function address as TextureBarrier on NV

03:28 <imirkin> graphitemaster: definitely does as i've implemented!

03:28 <graphitemaster> :D

03:28 <imirkin> just a texture flush

03:31 <imirkin> since the advanced blend thing is precisely the scenario that ARB_texture_barrier talks about

03:32 <graphitemaster> I wish over-draw was allowed and it was coherent

03:32 <graphitemaster> I'd have a sweet use for that

03:33 <jekstrand> Then you need Intel hardware!

03:33 <imirkin> with texture barrier?

03:33 <imirkin> or with advanced blend?

03:33 <graphitemaster> texture barrier

03:33 <imirkin> you can do that with ...

03:33 <imirkin> GL_EXT_shader_framebuffer_fetch

03:33 <imirkin> largely supported on arm

03:33 <graphitemaster> NV does not support it :P

03:33 <imirkin> right.

03:34 <jekstrand> Intel does!

03:34 <jekstrand> On Gen9+

03:34 <imirkin> like i said ... arm

03:34 <imirkin> :p

03:34 <imirkin> embedded

03:34 <imirkin> (j/k. mostly.)

03:34 <graphitemaster> The coverage is real bad, a whole 9.64%

03:37 <graphitemaster> Interlock is too slow too

03:37 <jekstrand> Not on Intel. :P

03:38 <graphitemaster> Okay but can Intel do NV_path_rendering

03:38 <graphitemaster> If we're talking weird shit for a moment

03:38 <jekstrand> CAN is an interesting word...

03:38 <HdkR> CAN? Probably. Want? Probably not.

03:38 <graphitemaster> Imagine putting a potscript and SVG parser in mesa

03:39 <graphitemaster> *postscript

03:39 <graphitemaster> potscript is funny tho

03:39 <Sachiel> displayport printers

03:40 <jekstrand> VK_KHR_printer_surface

03:41 <graphitemaster> You know, when NV_path_rendering came out it actually had something really cool, the paths rendered could be conservative - in the sense that even partial pixel overlap with a fragment could produce a fragment, ala conservative rasterization - before we had conservative rasterization extensions - so using it for shadow volumes would've been pretty cool

03:41 <jekstrand> I wonder how hard that'd be to implement... Can't be that hard to send an image to /dev/lp0

03:42 * airlied asked about fp16 glGetUniform interactions

03:42 <graphitemaster> My bad :| sorry - muddying up the IRC, I should get back to work.

03:42 <jekstrand> graphitemaster: No worries. This channel is 2/3 BS anyway. :)

03:43 <jekstrand> airlied: What interactions?

03:43 <jenatali> WGL had GDI metafile surfaces so that you could render to printable surfaces IIRC

03:43 <imirkin> NV_path_rendering supports SVG parsing in the GL. what could go wrong.

03:43 <graphitemaster> jekstrand, fine one last BS since it's topical https://github.com/graphitemaster/printer-display

03:43 <airlied> jekstrand: if we have 16-bit constant lowering, and you set a 32-bit uniform and read it back with glGetUniform we currently would lose precision
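
A minimal sketch of the precision loss airlied describes, assuming the value is stored as IEEE 754 half precision somewhere between the set and the readback. It only models the float conversions with Python's `struct` module; the helper names are hypothetical and nothing here calls Mesa or GL.

```python
import struct


def through_fp32(value):
    """Round-trip a Python float through 32-bit float storage."""
    return struct.unpack("<f", struct.pack("<f", value))[0]


def through_fp16(value):
    """Round-trip a Python float through 16-bit (half) float storage."""
    return struct.unpack("<e", struct.pack("<e", value))[0]


set_value = through_fp32(0.1)     # what a 32-bit uniform upload would carry
stored = through_fp16(set_value)  # what a 16-bit constant buffer could hold
print(set_value, stored, set_value == stored)
```

Values that happen to be exactly representable in half precision (1.0, 0.5, small integers) survive the trip unchanged, so a test would only notice with inputs such as 0.1.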

03:44 * airlied thinks PIPE_SHADER_CAP_FP16_CONST_BUFFERS might not be compliant as we have it implemented

03:44 <zmike> sure seems that way

03:44 <jekstrand> graphitemaster: I love that you list "latency" as a reason not to use a printer. :D

03:45 <HdkR> Need some io_uring for that printer to remove some copies and lower latency </s>

03:45 <jekstrand> airlied: Ugh... I suspect you should be able to read it back exact...

03:45 <imirkin> airlied: if it's set as mediump in the shader? dunno

03:45 <jekstrand> HdkR: Blit via photocopier, anyone?

03:45 <imirkin> airlied: or if it's auto-lowered?

03:45 <graphitemaster> jekstrand, those 6 lines of C exist in every printer driver, fun fact.

03:46 <airlied> imirkin: if mediump

03:46 <imirkin> airlied: just like using GL_FLOAT uploads of texture data with RGBA8. you don't get the exact same thing back out ... dunno

03:47 <imirkin> airlied: what happens if you use glUniform1i with a float uniform? is that legal?

03:47 <airlied> imirkin: not sure actually

03:48 <imirkin> if it's legal, does it convert to float? and if it does, what happens if you stick something greater than 1<<23?

03:48 <imirkin> (you probably see where i'm going with this line of reasoning)
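
A small sketch of the integer-to-float line of reasoning, assuming the integer is converted to a 32-bit float on upload. A 32-bit float has a 24-bit significand, so integers up to 1<<24 convert exactly and odd integers above that do not; the helper name is hypothetical and no GL call is involved.

```python
import struct


def to_float32(n):
    """Convert an integer to the nearest 32-bit float, returned as a Python float."""
    return struct.unpack("<f", struct.pack("<f", float(n)))[0]


exact_limit = 1 << 24
print(to_float32(exact_limit) == exact_limit)          # True
print(to_float32(exact_limit + 1) == exact_limit + 1)  # False
```

So an int-to-float conversion on upload is already lossy for large values, independent of any later 16-bit lowering.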

03:48 <airlied> imirkin: the spec is again vague :-P

03:49 <airlied> imirkin: though the interface for setting a float uniform takes a 32-bit value

03:49 <airlied> which is sorta different than setting an int and getting a float

03:50 <airlied> converting it to garbage on readback is different than converting it internally

03:50 <jekstrand> Do we actually implement GetShaderPrecisionFormat?

03:51 <airlied> yes from limits set in the state tracker

03:51 <jekstrand> So we do

03:52 <jekstrand> airlied: Yeah, I've got no idea what the spec means for glGetUniform()

03:52 <jekstrand> It says incredibly little

03:54 <imirkin> "get uniform" - how much more spec do you really need? it's in the function name!

03:59 <jekstrand> I expect that it should return the same value set through glUniform()

04:00 <jekstrand> And not lose precision

04:00 <jekstrand> But what do I know?

04:00 <imirkin> jekstrand: would you expert glGetTextureImage to return the same data you fed in via glTexImage?

04:00 <imirkin> expect*

04:00 <imirkin> coz you'd be sorely disappointed...

04:01 <jekstrand> No, but that's different, IMO

04:01 <jekstrand> For one thing, textures have an internal format that gives some idea of the precision

04:01 <jekstrand> I guess uniforms sort-of do

04:01 <jekstrand> It could go either way

04:01 <imirkin> yea

04:02 <imirkin> but think of it this way ... what's the benefit of glGetUniform? i.e. why would one ever do that?

04:02 <imirkin> other than writing a GTF test

04:02 <imirkin> tbh i can't think of any great use-cases. but ultimately you'd want to know the value that the shader was going to use

04:03 <jekstrand> imirkin: apitrace probably uses it

04:03 <imirkin> jekstrand: what for?

04:03 <jekstrand> To get uniforms, obviously. :P

04:03 <imirkin> it already captured the glUniform's as they were being "sent in"

04:04 <jekstrand> Yeah.

04:04 <imirkin> shouldn't need to "get" them, except maybe one of the "inspect" views

04:04 <jekstrand> So maybe not apitrace

04:04 <jekstrand> I do

04:04 <imirkin> in which case i think seeing the values the shader was using might be preferable?

04:04 <jekstrand> I don't know. I'm sure there's some app out there that found a "good" reason. :P

04:06 <jekstrand> ¯\_(ツ)_/¯

04:06 <imirkin> anyways, i'd argue for "return the real value"

04:06 <imirkin> the only counterargument is glGetCompressedTexture

04:06 <imirkin> where you're supposed to return the original data

04:07 <imirkin> if you e.g. decompressed it internally, you can't just recompress it (for a variety of reasons)

04:09 <jekstrand> ¯\_(ツ)_/¯

04:37 maxzor has quit [Ping timeout: 480 seconds]

05:00 mattrope has quit [Read error: Connection reset by peer]

05:33 nchery has quit [Read error: Connection reset by peer]

05:36 maxzor has joined #dri-devel

05:47 Duke`` has joined #dri-devel

05:52 jewins1 has joined #dri-devel

05:52 jewins has quit [Read error: Connection reset by peer]

05:56 <airlied> jekstrand, zmike : another data point is the float test passes, the vec2 one fails

05:56 <airlied> and I've built old gles conformance with gtf, it also fails

05:57 <airlied> and it's actually the reference shader failing

06:03 LexSfX has joined #dri-devel

06:04 mszyprow_ has joined #dri-devel

06:14 <airlied> jekstrand, zmike : submitted a fix for the atan gtf tests to kc-cts internally gitlab

06:15 LexSfX has quit []

06:19 mszyprow_ has quit [Ping timeout: 480 seconds]

06:22 LexSfX has joined #dri-devel

06:27 jewins1 has quit [Ping timeout: 480 seconds]

06:31 itoral has joined #dri-devel

06:44 sdutt has quit [Ping timeout: 480 seconds]

06:47 <airlied> jekstrand, zmike : the matrix fail seems to be lowering f16mat

06:49 <airlied> actually not sure how the matrix is going to be packed here

07:02 <airlied> seems to be a bug somewhere in mesa with lowering fmat ubo loads, vec3 vs vec4

07:05 Duke`` has quit [Ping timeout: 480 seconds]

07:12 tursulin has quit [Read error: Connection reset by peer]

07:12 mlankhorst has joined #dri-devel

07:23 danvet has joined #dri-devel

07:25 MajorBiscuit has joined #dri-devel

07:30 pnowack has joined #dri-devel

07:36 mvlad has joined #dri-devel

07:48 frieder has joined #dri-devel

07:49 gouchi has joined #dri-devel

07:49 frieder has quit [Remote host closed the connection]

07:50 frieder has joined #dri-devel

07:50 frieder has quit []

07:50 frieder has joined #dri-devel

07:51 gouchi has quit []

07:53 pnowack has left #dri-devel [#dri-devel]

07:56 pnowack has joined #dri-devel

08:01 tzimmermann has joined #dri-devel

08:07 Major_Biscuit has joined #dri-devel

08:14 MajorBiscuit has quit [Ping timeout: 480 seconds]

08:17 <mlankhorst> airlied: looks like just some DP_HELPER selects were missing

08:21 tursulin has joined #dri-devel

08:41 <danvet> tzimmermann, javierm I finally sent out my fbcon series including fbdev core maintainers entry patch

08:41 <danvet> acks welcome and all that

08:41 <javierm> danvet: yes, it's on my TODO to look at this morning

08:41 <tzimmermann> ok

08:42 <javierm> there are some easy ones that I can ack right away like your revert of the efifb hack. But for others I'll need to dig into the fbcon code

08:42 <javierm> danvet: btw, on the discussion about Rx formats. If we advertise that we will still need to convert to invert mono right ?

08:43 <danvet> javierm, invert mono?

08:43 * danvet lost

08:43 <danvet> javierm, well the revert I thought needs more work, or did I misunderstand the discussion with tzimmermann and you yesterday?

08:44 <danvet> I thought there's still a corner where sysfb can slip through and be resurrected

08:44 <danvet> that's why I put the revert and final cleanup at the very end

08:44 <javierm> danvet: yes, it needs more work but I'm happy to ack if we have a path to fix it correctly

08:45 <javierm> as long as that doesn't go to stable, it's OK to fix it before a release I think

08:45 <danvet> javierm, yeah but please include that

08:45 <danvet> so I don't go ahead and merge it, breaking stuff :-)

08:45 <javierm> danvet: Ok, gotcha

08:45 <javierm> let's wait to ack that then

08:45 <danvet> the other thing that bummed me out while doing these patches is that I think fbcon locking is unfixable

08:45 <danvet> without the threaded printk support

08:45 <danvet> and making fbcon always threaded

08:45 <danvet> you just can't take any kind of lock from random printk contexts

08:46 <danvet> this is the entire kgdb/oops printing path, once more

08:46 <javierm> right. But we don't know when that threaded printk support is going to land, right ?

08:46 <danvet> to be correct it'd need to be trylock&bail-out all the way down into every driver

08:46 <danvet> Soon(tm)

08:46 <danvet> it's more that -rt folks decided to not make everything threaded by default, because regression

08:47 <danvet> I think we don't get much of a choice really, it's just the correct thing

08:48 <javierm> danvet: yeah

08:48 <javierm> danvet: re: formats - https://patchwork.kernel.org/project/dri-devel/patch/20220131201225.2324984-3-javierm@redhat.com/

08:49 <danvet> ah

08:49 <danvet> so we need a pile of drm_fourcc with inverted bits?

08:49 <danvet> or just invertedR1?

08:49 <javierm> danvet: if we want to expose the correct format as supported by the panel, yes

08:50 <danvet> maybe call them Dx for darkness

08:50 * danvet bad at naming

08:50 <javierm> danvet: on the other hand is kind of nice that all user-space works with the advertised format

08:50 <javierm> would that be the case for {R,D}x ?

08:50 <javierm> I was able to run plymouth, gdm, etc on the monochromatic display

08:51 <danvet> javierm, oh we pretty much need the rgbx8888 fallback if you care about modern distro userspace

08:51 <danvet> it's more about the discussion to allow the efficient path

08:51 <danvet> especially the fbdev side of it

08:51 <javierm> danvet: got it. Then preferred reversed mono but rgbx8888 as fallback. Makes sense

08:52 <javierm> danvet: I think though that this could be done as a follow-up. There's no need to block drivers for this kind of HW due the missing bits

08:52 <emersion> user-space has no way to figure out that it should prefer to use reversed mono instead of xrgb

08:52 <danvet> yeah I think what we should do is {native formats in preference order} + rgbx8888 as sw fallback

08:52 <emersion> it would be nice to have a way to know

08:52 <danvet> emersion, the format list is sorted

08:52 <javierm> specially since we need to type that up in user-space anyways

08:52 <emersion> oh

08:52 <danvet> start at the top, pick first you like

08:52 <danvet> emersion, maybe another doc patch?

08:52 <emersion> :D

08:52 <danvet> so at least in my reviews for format lists I tried to make sure the sw fallback is the very last one
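
A minimal sketch of the "start at the top, pick first you like" convention, assuming the driver lists native formats in preference order with the software fallback last. The format names are plain strings used for illustration and `pick_format` is a hypothetical helper, not a libdrm call.

```python
def pick_format(advertised, supported):
    """Return the first advertised format that the client can render, or None."""
    for fmt in advertised:
        if fmt in supported:
            return fmt
    return None


# Hypothetical plane format list: native formats first, software fallback last.
plane_formats = ["R1", "R8", "XRGB8888"]

print(pick_format(plane_formats, {"XRGB8888"}))        # XRGB8888
print(pick_format(plane_formats, {"R8", "XRGB8888"}))  # R8
```

A client that only knows XRGB8888 still works, while one that learns the native formats gets the cheaper path automatically without any extra "preferred" flag.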

08:53 <danvet> heck I even started a patch series to make sure the first is really the preferred

08:53 <danvet> and fbdev emulation would pickt hat u

08:53 <danvet> *pick that up automatically

08:53 <danvet> so that we could get rid of the terrible preferred_depth confusion we have

08:53 <javierm> emersion: look how the (only) format supported is set as DRM_MODE_TYPE_PREFERRED

08:53 <danvet> but alas I got lost

08:53 <javierm> https://www.spinics.net/lists/dri-devel/msg331406.html

08:54 <emersion> javierm: that's just the mode

08:54 <danvet> javierm, that's modes, not formats

08:54 <danvet> formats you only have xrgb8888

08:54 <danvet> so ideally there'd be an inverted mono first

08:54 <danvet> and then xrgb8888

08:55 <javierm> err, right. Sorry conflated format with mode. Still didn't have coffee :)

08:55 <imirkin> danvet: there's the somewhat upsetting situation that right now there's no good way to represent non-32-bit formats on BE. in practice the drivers for that hw don't support AddFB2 so it mostly works out.

08:55 <danvet> emersion, oh I just realized that I never sent out that patch set :-/

08:55 <javierm> danvet: so user-space should pick the first one listed as preferred then ?

08:55 <danvet> imirkin, be is sooooo dead :-P

08:55 <emersion> imirkin: DRM_FORMAT

08:55 <emersion> err

08:55 <emersion> imirkin: DRM_FORMAT_BIG_ENDIAN?

08:55 <imirkin> emersion: don't tell me about the BIG_ENDIAN flag

08:55 <imirkin> it doesn't work

08:56 <imirkin> at all :)

08:56 <emersion> :P

08:56 <javierm> danvet, emersion: or maybe have a DRM_MODE_FORMAT_PREFERRED ?

08:56 <emersion> javierm: not sure how that'd work

08:56 <danvet> javierm, don't like that as much

08:56 <emersion> also a sorted list is better

08:56 <danvet> we have the same problem with modifiers

08:56 <javierm> danvet, emersion: Ok

08:56 <imirkin> the drivers work with depth, and userspace works with depth, so it all works out

08:57 <danvet> also what emersion said, it's just binary, but maybe you have more preference

08:57 <emersion> so you can have R8 in-between if that's not as bad as XRGB8888 but not as good as R1

08:57 <javierm> emersion: yeah, agreed. Better to make it a convention that the first one is the preferred
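The "sorted list, pick the first you like" convention can be sketched as below. This is only an illustration: the format names are placeholder strings (R1 in particular is an assumption taken from the discussion, not an existing fourcc), and `pick_format` is a hypothetical helper, not a libdrm or kernel API.

```python
# Hypothetical plane format list as a driver might advertise it: most
# preferred first, with XRGB8888 as the software-converted fallback last.
DRIVER_FORMATS = ["R1", "R8", "XRGB8888"]


def pick_format(driver_formats, client_supported):
    """Return the first driver format the client can render, or None."""
    for fmt in driver_formats:  # list order encodes the driver's preference
        if fmt in client_supported:
            return fmt
    return None
```

A client that only knows XRGB8888 still gets a working format, while a client that also renders R8 lands on the cheaper intermediate choice without needing a separate "preferred" flag.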

08:57 <imirkin> but if you remove the "depth confusion" then that might no longer hold true

08:57 <danvet> like bochs wants C8 first, then rgb565, then xrgb8888 (because just not enough ram in the fake card)

08:57 <emersion> what is bochs again?

08:57 <javierm> emersion: right, because R8 would just need one conversion while XRGB8888 would need two -> greyscale -> reversed mono

08:57 <imirkin> qemu, but much older

08:57 <emersion> eh

08:58 <airlied> it's also the qemu vga adapter

08:59 <airlied> mlankhorst: ++ will stick that in top of the merge

09:00 <javierm> emersion: is there user-space that supports Rx formats ?

09:00 <emersion> javierm: not right now, but i can type that up if you want

09:00 <emersion> inverted would be more work

09:00 <javierm> emersion: no need for now, was just curious. Because I think we need XRGB8888 anyways for the current versions

09:01 <danvet> mlankhorst, maybe backmerge that merge to get it into drm-misc-next then?

09:01 <emersion> yes, XRGB8888 is good to have anyways to run user-space without special Rx support

09:01 <javierm> emersion, danvet: but thanks for the explanations, I understand now why Rx and Dx (or whatever will be called) would be nice to have

09:01 <javierm> and make it Dx, Rx, XRGB8888 in that order

09:02 <emersion> yea

09:02 <danvet> +1

09:02 <danvet> also maybe I should resurrect my patch set to clean up this confusion

09:02 <emersion> D1, R1, R8, XRGB8888 even if you want

09:02 <javierm> right

09:02 <emersion> pixman supports R1 and R8 so should be easy to plumb to wlroots and/or weston

09:06 <javierm> libdrm also supports R8 AFAICT, didn't find R1 there

09:12 pcercuei has joined #dri-devel

09:14 <danvet> I guess I should resurrect that half-done preferred format series I've started

09:14 <danvet> but it's really messy :-(

09:14 <danvet> javierm, the idea was to sort all the format lists and then use that in drm_fbdev_generic_setup to compute the preferred depth

09:14 <danvet> and also to compute the preferred bpp getcap

09:15 <danvet> and also pave the way for moving fbdev over to fourcc format codes

09:15 <danvet> maybe was a bit too ambitious

09:15 <danvet> but thoughts on this direction?

09:15 <danvet> tzimmermann, ^^ anyone else who cares about smaller drivers and this stuff?

09:16 <danvet> 14c1e12ba605d8770cae3e8078e520365daca921 is essentially the motivation

09:16 soreau has quit [Read error: Connection reset by peer]

09:16 <tzimmermann> that sounds good

09:16 <danvet> but maybe also good prep work for adding more formats to fbdev emulation

09:16 soreau has joined #dri-devel

09:17 <tzimmermann> danvet, javierm, btw, i've been able to mmap gem-shmem buffers for fbcon without the extra shadow buffer for fbdefio. this will save memory and reduce latency

09:17 <danvet> nice

09:18 <tzimmermann> i also investigated the slowness of fbcon

09:18 <tzimmermann> and the reason why drm drivers have a slow console is...

09:18 <tzimmermann> fbdev!

09:18 <tzimmermann> *tadaa.wav*

09:19 <danvet> how?

09:19 <tzimmermann> the code in sys_fillrect is much slower than the same code in cfb_fillrect

09:20 <tzimmermann> the compiler doesn't do a good job

09:20 <tzimmermann> sys_fillrect take ~3500 cycles to fill a single line with a pattern

09:20 <tzimmermann> cfb_fillrect takes ~700

09:21 <tzimmermann> all the native fbdev drivers use cfb_ functions, because they operate on I/O memory

09:21 <danvet> lolz

09:21 <danvet> that's pretty bad

09:21 <tzimmermann> drm uses the sys_ functions because we have a shadow buffer

09:21 <tzimmermann> yep: first lol, then facepalm. that was my reaction too

09:22 <tzimmermann> i have a simple patch that brings sys_fillrect down to ~300 cycles for filling a single line with a pattern

09:22 <tzimmermann> so it takes half of the time of cfb_ (as i would expect)

09:23 <javierm> danvet: the idea makes sense to me as well

09:23 <tzimmermann> i'll post a patchset after i looked at sys_copyarea() and sys_imageblit(). but the issue is the same there

09:24 <danvet> tzimmermann, if your microbenchmark rewrites the same line over and over it should be much faster

09:24 <danvet> as long as the line fits in l2

09:24 <danvet> but yeah faster than cfb_ sounds good already

09:24 <javierm> tzimmermann: plot twist :)

09:25 Major_Biscuit has quit []

09:25 <tzimmermann> danvet, i directly measure the performance of fbcon while i use it: something like 'time find /usr/share/ -type '

09:25 <tzimmermann> with rdtsc

09:26 <tzimmermann> javierm, indeed. everyone's been blaming drm, then the problem is in fbdev helpers. the rest of the code paths make no difference AFAICT

09:27 <javierm> tzimmermann: cool

09:27 <tzimmermann> and drives home danvet's argument about how hard it is to write a fast 2d blitter

09:27 MajorBiscuit has joined #dri-devel

09:28 <tzimmermann> it might take a bit to get this finished. i'm having quite a bit of work to do ATM

09:28 <javierm> tzimmermann, danvet: a funny thing is that the original ssd1307fb driver didn't pass all the fbtests from geert but the emulated ssd1307 DRM driver did

09:28 <tzimmermann> :D

09:29 <javierm> also rmmod ssd1307fb caused a NULL pointer deref. Thought about digging more into these two issues but then decided that wasn't worth it

09:30 Company has joined #dri-devel

09:34 <MrCooper> tzimmermann: nice find!

09:37 <danvet> javierm, can we port the fbtest to igt perhaps?

09:38 <danvet> or are you not that bored :-)

09:43 <javierm> danvet :) I think it's a good idea, I can add it to my TODO in case I get some free time at some point

09:43 <javierm> especially if we plan to replace all the fbdev drivers, should make sure that the emulated fbdev path does not regress

09:48 <pq> javierm, what's the problem with doing xrgb8888 -> reversed mono conversion in a single pass?

09:48 <emersion> more code to type?

09:49 <javierm> pq: you could optimize that but unsure if it will be worth it since we would like to advertise greyscale (Rx) too

09:49 <pq> I also don't think we are going to need Dx since we have Cx formats, but I'll elaborate that on the mailing list. Too much for irc.

09:49 <javierm> so you will need the Rx -> Dx code anyways
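The two-step path (XRGB8888 -> greyscale -> mono) can be sketched roughly as follows. The BT.601-style integer weights, the fixed threshold, and the MSB-first bit packing are all assumptions for illustration; the actual driver helpers and the panel's bit order or polarity may differ, and inverted mono would simply flip the comparison.

```python
def xrgb8888_to_gray8(pixels):
    """Convert 32-bit XRGB8888 values to 8-bit luma (integer BT.601-style weights)."""
    out = []
    for p in pixels:
        r, g, b = (p >> 16) & 0xFF, (p >> 8) & 0xFF, p & 0xFF
        out.append((77 * r + 150 * g + 29 * b) >> 8)  # weights sum to 256
    return out


def gray8_to_mono(gray, threshold=128):
    """Threshold 8-bit gray to 1 bpp, packed 8 pixels per byte, first pixel in the MSB."""
    packed = bytearray((len(gray) + 7) // 8)
    for i, v in enumerate(gray):
        if v >= threshold:
            packed[i // 8] |= 0x80 >> (i % 8)
    return bytes(packed)


def xrgb8888_to_mono(pixels, threshold=128):
    """Chain both steps; the gray -> mono half is reusable for an Rx source."""
    return gray8_to_mono(xrgb8888_to_gray8(pixels), threshold)
```

Fusing the two loops into a single pass is possible, as pq suggests, but keeping `gray8_to_mono` separate means the same code serves userspace that submits greyscale directly.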

09:50 <javierm> pq: agreed. I've summed up the format discussions here in the ML

09:51 <javierm> danvet, tzimmermann, emersion, sven: I've two meta discussions 1) what to do with the DT binding and whether we should keep it compatible or use the latest and greatest DT conventions

09:52 <javierm> and 2) if the driver should be named ssd130x (more accurate) or ssd1307 (less accurate but consistent with ssd1307fb name)

09:52 <sven> wrong sven i think :)

09:53 <javierm> sven: sorry, I meant sravn I think

09:54 <danvet> javierm, ping robher, but I thought the idea is to go with latest&greatest and have the others maybe as fallback?

09:54 <danvet> or perhaps pinchartl has an opinion too

09:54 <danvet> also who cares about driver names

09:54 <danvet> i915 supports i915

09:54 <danvet> and also down to i830

09:54 <javierm> danvet: Ok, thanks

09:54 <danvet> and also like 10 more generations later

09:54 oneforall2 has quit [Quit: Leaving]

09:55 <danvet> javierm, I'm definitely very far from an authority wrt dt things

09:55 <pq> javierm, tzimmermann, I'm really happy to see what you're doing here :-D Of course danvet too.

10:01 rasterman has joined #dri-devel

10:02 <emersion> yes, thanks a lot javierm :)

10:03 <javierm> pq, emersion :)

10:11 <tzimmermann> MrCooper, pq, glad you like it :)

10:13 oneforall2 has joined #dri-devel

10:17 bnieuwenhuizen_ has joined #dri-devel

10:20 bnieuwenhuizen has quit [Ping timeout: 480 seconds]

10:37 <danvet> tzimmermann, gl has moved away from I/Y formats, they're officially deprecated

10:37 <danvet> so a lot of userspace moved towards R as the greyscale format

10:38 <tzimmermann> danvet, no big deal

10:38 <danvet> (reason is some technicality in the shaders, which is only available in legacy gl context with backwards compat cruft enabled)

10:39 <danvet> essentially R loads a (x, 0, 0, 0) in the shader

10:40 <tzimmermann> then i propose 'I' as in 'index' :)

10:40 <danvet> while I loads as (x, x, x, 1) iirc and Y as (x, x, x, x) or something like that

10:40 <pq> daniels, didn't you disagree with using R for grayscale?

10:41 <pq> ..in DRM pixel formats

10:42 xyene_ has joined #dri-devel

10:43 <pq> I just sent my Cx format proposition, FWIW.

10:44 <daniels> yeah I definitely prefer C to R

10:44 <emersion> a read-only palette would be… quite messy to handle from user-space

10:45 xyene has quit [Ping timeout: 480 seconds]

10:45 <emersion> especially if it's just grayscale…

10:45 <danvet> yeah read-only palette to confer meaning to Cx that "oh btw it's linear greyscale" seems like a funky interface

10:45 <danvet> like we also have yuv formats

10:45 <danvet> and some implied color space attached to them

10:45 <danvet> mixing up paletted with linear modes seems like a funky idea

10:45 <pq> I'd agree if the read-only palette was *only* for grayscale. But it's not.

10:46 <pq> emersion, what's messy about it?

10:46 <emersion> user-space rendering becomes a hell more complicated with a palette

10:47 <danvet> pq, also the main motivator for these seems to be "more efficient fbcon rendering"

10:47 <pq> emersion, I don't see it that way. Either you use the colors you can display, or you render your usual 8-bit sRGB and then convert with your preference, either trying to match color or intensity.

10:47 <danvet> so unless we rewrite fbdev/fbcon, whatever that thing is doing wins

10:48 <pq> emersion, so it wouldn't be changing rendering, but you just add a quantization step at the end.

10:48 JoshuaAs- has joined #dri-devel

10:48 <emersion> pq, what can i do to figure out that i can just use the pixman grayscale format if the kernel gives me C8 + a read-only LUT?

10:49 <pq> danvet, sure. My proposal was to make those panels more useful in general, not really for fbcon.

10:49 <emersion> i need to have an heuristic to guess that the LUT is linear
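The kind of heuristic emersion is describing might look like the sketch below: given a read-only palette, guess whether it is just a linear grey ramp. The 8-bit `(r, g, b)` tuple representation and the `lut_is_linear_gray` name are assumptions for illustration, not how KMS exposes a LUT.

```python
def lut_is_linear_gray(lut, tolerance=1):
    """Return True if every entry is gray (r == g == b) and the ramp is linear."""
    n = len(lut)
    if n < 2:
        return False
    for i, (r, g, b) in enumerate(lut):
        expected = round(i * 255 / (n - 1))  # ideal linear ramp value for index i
        if not (r == g == b) or abs(r - expected) > tolerance:
            return False
    return True
```

That user-space would need this sort of guesswork at all is the objection being raised: an Rx format states the same thing without a palette to inspect.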

10:49 <ishitatsuyuki> sorry for interrupting, but do you know if vaapi is suitable for very low bitrate streams? I'm currently investigating a case where hwdec has much worse seeking performance than sw

10:49 <ishitatsuyuki> it's a h264 zoom recording of lecture, with very long I-frame interval (~15s) and full of P-frames in between. The bitrate is also very low at ~200kbps, and at that rate it looks like sending commands to my GPU is becoming a greater overhead

10:49 <ishitatsuyuki> afaik vaapi requires a flush for every frame (since flush is bound to render target), so there probably isn't much that can be done on the application side?

10:50 <pq> emersion, I don't think you'd do that. Either you use the palette explicitly (pixman has palettes), or you render to argb8888 and then quantize to the palette.

10:50 <emersion> i don't want to do that

10:51 <pq> oh...kay...

10:51 <emersion> that's wasting a lot of CPU time doing unnecessary stuff

10:51 <emersion> why should i need a palette to just do grayscale?

10:51 <pq> that depends on what your source material is in

10:51 <emersion> why should i render to argb8888 to just display some grayscale?

10:51 <pq> the display may not be grayscale

10:52 <emersion> if it prefers R8, it is

10:52 <pq> then you do R8

10:52 <emersion> then why do you want to use C8?

10:52 <pq> my proposal is for Cx formats, particularly for displays that are NOT grayscale

10:52 <emersion> it's just complicating things for no good reason

10:52 <emersion> oh, but that's not the case here

10:53 <pq> Did I not mention white/blue LED displays?

10:53 <emersion> why is R1 bad?

10:53 <pq> ask daniels

10:53 <pq> this is my answer to "R1 is bad" but I do not think R1 is bad

10:54 <emersion> in the special case of C1, maybe that's okay

10:54 <emersion> but C8/C16/etc will be a pain to deal with

10:54 <emersion> (with a read-only LUT, that is)

10:54 <pq> anyway, I do not expect anyone to actually follow my proposal, I just wanted it out.

10:55 <danvet> pq, for me the difference is that with Cx you have to use paletted rendering/quantization/color mapping

10:55 <danvet> with Rx you can pretend it's just gamma ramp as usual and works out

10:55 <danvet> and for most cases that's good enough

10:56 <danvet> because at that color depth not many people care about accurate color grading

10:56 <danvet> and yeah C1 is a bit of a special case since if you don't care there's really nothing you can do better with R1

10:56 <pq> xrgb8888 is also quantized and mapped

10:56 <pq> and could benefit from dithering

10:56 <danvet> yeah but most people don't care about the difference

10:57 <danvet> there's a lot of people who are perfectly happy with output that's "not too shitty"

10:57 <danvet> ofc sliding scale

10:57 <danvet> so forcing everyone to go through a full quantization/color mapping pipeline just because it's a nice model feels like serious overkill

10:58 <pq> danvet, emersion, you know, I was in your shoes exactly when talking R vs. C for grayscale with daniels recently. So I blame daniels. :-)

10:58 <danvet> sure if they'd do, the result would be a lot better

10:58 <danvet> but reality is that a lot of the stuff running on top of drm drivers is very far from that ideal

10:59 <danvet> like all the converting we do in the kernel to give you xrgb8888 is a very bad thing from that pov

10:59 <danvet> perfect world userspace would do this

10:59 <emersion> i mean, i'm not against allowing user-space to do fancy color stuff with a palette

10:59 <danvet> imperfect world that stance would result in a lot of black screens and disappointed users

10:59 <emersion> but please don't make it mandatory

10:59 <danvet> yup

10:59 <danvet> pretty much my point

11:00 <pq> emersion, I can't make it mandatory. It's the driver who decides which DRM formats it accepts.

11:00 <danvet> like with Rx you can also attach a color profile for the screen

11:00 <danvet> (do they support paletted screens?)

11:00 <pq> I presented a plan to make Cx formats useful, that's all.

11:01 <javierm> pq: thanks, it's appreciated

11:01 <danvet> pq, I think for stuff like vga16 your Cx proposal is solid

11:01 <emersion> pq, i was speaking from the driver PoV here :)

11:01 <danvet> since vga16 mode is very much not R4

11:01 <emersion> "please drivers, expose R1/R8, so that my user-space can be simple"

11:01 <pq> danvet, an ICC profile can have a LUT, but I think it is intended to be interpolatable, so not really.

11:01 <danvet> so C4 + fixed palette in srgb or so sounds good

11:01 <javierm> pq: my take is that these panels are really shitty, so I'm not sure that level of color accuracy is worth it

11:02 <javierm> pq: you can't even play doom because its minimum resolution supported is 320x200 :P

11:02 <danvet> anything can run doom assuming sufficient c3 have passed I thought

11:03 <daniels> my only objection to Rx is that r means red, not monochrome

11:03 Akari` has joined #dri-devel

11:03 <daniels> maybe Gx or Wx or whatever?

11:04 <danvet> daniels, yeah I think if we go with Rx then at least a doc patch would be good

11:04 <emersion> r means "coloR"

11:04 <pq> daniels, and I think pixel formats should *not* specify colorimetry at all. So I'm with danvet and emersion on R.

11:04 <danvet> but really we could also say they're for roughly a single color with some kind of probably gamma mapped intensity scale

11:04 Akari has quit [Remote host closed the connection]

11:04 <javierm> emersion: where color here is grey, right ?

11:04 <emersion> yeah, but could be any color

11:05 <danvet> whereas Cx is "you get all kinds of random things, and in many cases you can specify the palette through GAMMA_LUT"

11:05 <emersion> yea

11:05 <javierm> emersion: yes, I got that but wanted to say that Rx could match a greyscale based on your definition

11:05 <emersion> ideally a new prop would be better as mentioned before

11:05 <danvet> so vga16 is C4 with fixed palette

11:05 <danvet> but a green lcd with 4 bits roughly monotonic scale is R4

11:06 <emersion> yea

11:06 <pq> danvet, I very much agree with that.

11:06 <danvet> and yeah R1 probably makes no sense in that world and we should have C1 only

11:10 itoral has quit [Remote host closed the connection]

11:11 itoral has joined #dri-devel

11:17 <daniels> pq: I understand that colorimetry is interesting, I just think that given we have no extant userspace for this, that ‘hey so red is actually grey sometimes!’ is too much of a cute sleight of hand, and that a new format token would be easier and more explicit

11:17 <danvet> I guess a FIXED_PALETTE_XXXX where XXXX is the fourcc for that palette could work out

11:18 itoral has quit [Remote host closed the connection]

11:18 <danvet> daniels, there's fbdev/fbcon, which I think is the main motivator for these

11:18 itoral has joined #dri-devel

11:18 <danvet> so making it too hard to support fbdev defeats the point

11:19 <daniels> fbdev could do G1 just as well as it could R1?

11:19 <daniels> hmm no, not G. that’s already taken isn’t it :)

11:21 <javierm> daniels, danvet: tbh for this particular driver/device the fake DRM_FORMAT_XRGB8888 is enough really

11:22 <javierm> the I2C bus is so slow and the panel only 128x64, that you could do a gazillion copies and transformations in RAM that wouldn't be a bottleneck

11:24 <javierm> but understand that's important for monochromatic panels that have high speed busses and are used on slow machines

11:24 <emersion> my main motivation for a grayscale pixel format is to let user-space figure out that colors won't be displayed

11:25 <javierm> emersion: yes, agreed

11:29 <danvet> daniels, I'm also thinking this from the gl side, which can do Rx already

11:29 <danvet> and really does not want to import Ix or Yx because of all the historical baggage

11:29 <danvet> and at that point we're just bikeshedding a letter

11:30 itoral has quit [Remote host closed the connection]

11:30 itoral has joined #dri-devel

11:30 <danvet> so R is as much a lie as any of the other letters, but it's a lie that at least is convenient for one of our most important userspace

11:31 <danvet> daniels, the other thing and I guess that was a misunderstanding, but I thought you or pq argued for only Cx for these

11:31 <danvet> and that would make it more tricky to distinguish the "does this thing have a palette or is it more monotonic single color" meaning

11:31 <danvet> which at least fbdev cares about and I think makes at least some sense

11:31 <daniels> not really, because you then have to manually broadcast the R channel out to the other colour channels - unless you do really want it to be red-only

11:32 <daniels> I agree that Cx + fixed palette is a bad idea

11:32 <danvet> well yeah but compositor needs to have these shaders anyway

11:32 <daniels> pq and emersion nicely illustrated that

11:32 <daniels> danvet: not really - for every other format, unless we need to do YUV conversion, we just assume that the sampled colour is not a lie

11:33 <danvet> hm

11:33 <danvet> but I don't expect gl folks to be happy if we inflict Ix on them

11:33 <daniels> It’s completely trivial driver-side to make Mx (or whatever) produce (x,x,x,1)

11:34 <danvet> but I guess it's all there already

11:34 <danvet> and yeah on the Cx side I wonder whether we need distinct fourcc for fixed vs user-controlled palette

11:35 <daniels> I’m just going on the general principle that explicit >>> surprising

11:35 <danvet> yeah

11:35 <danvet> daniels, so importing a yuv drm_fourcc is specc'ed to work like oes_image_external, or whatever that was again?

11:36 <daniels> yes if the driver accepts it

11:36 <daniels> if not, you open-code the conversation

11:36 <daniels> *conversion

11:36 <daniels> stupid phone

11:47 <danvet> yeah but with Ix I expect you'll have to do the Rx mapping anyway in the compositor, so why bother

11:47 flacks has quit [Quit: Quitter]

11:47 <danvet> and on the display I don't think we'll ever have a need for a real Rx format

11:47 <danvet> since rewriting luts and ctms to fit whatever the hw does is easy

11:48 flacks has joined #dri-devel

11:52 JohnnyonFlame has joined #dri-devel

11:54 itoral has quit [Remote host closed the connection]

11:54 <pq> daniels, if you think that a channel labeled "red" should also produce something you look at and think "it looks red", then that is colorimetry. If we start encoding anything about colorimetry in pixel formats, we'll just get an explosion of formats with no benefit that I can see. If you want to express that a single-channel panel is grayscale, then let's do that, but not with a pixel format.

11:55 itoral has joined #dri-devel

11:58 <pq> Otherwise you're going to need GRAY8 for grayscale, G8 for green shaded panels, O8 for orange shaded panels... there is no end to that

11:58 JohnnyonF has quit [Ping timeout: 480 seconds]

11:59 <daniels> shouldn’t we just start calling our channels numbers then, rather than something like ‘red’?

11:59 <pq> "red", "green" and "blue" are convenient channel names, because we also have the same named primaries in displays, and they match. When you have a display that has less than three primaries, the labelling is no longer intuitive but it also doesn't matter.

12:00 <pq> sure, that would work

12:00 <pq> or any arbitrary names

12:00 itoral has quit [Remote host closed the connection]

12:01 <daniels> I mean, as written, we can’t expect that XRGB [1.0, 1.0, 0.0, 0.0] to produce red - it could equally be a light blue

12:01 itoral has joined #dri-devel

12:01 <daniels> but that seems absurd tbh

12:02 <pq> right, because that display has all three channels, which are enough to represent a color volume.

12:03 <pq> if you have a two-channel display, what do you do then? and now we're talking about a single-channel display.

12:04 <pq> There is absolutely no difference between R8 and GRAY8 pixel formats: they are laid out in memory the same way, they encode the same single-channel values, they do sub-sampling the same way (i.e. don't)

12:05 <daniels> that’s a good argument that ‘red’ only is the wrong representation for greyscale, no?

12:06 <pq> if you want to rename R8 to GRAY8, that's fine with me. As long as we don't have both.

12:12 itoral has quit [Remote host closed the connection]

12:12 itoral has joined #dri-devel

12:17 kts has joined #dri-devel

12:21 fxkamd has joined #dri-devel

12:34 devilhorns has joined #dri-devel

12:38 <FLHerne> pq: What you say about why G8, O8 etc. is why I find your palette suggestion confusing

12:38 <FLHerne> If you tell an application its display is a white/blue LCD, what use is it supposed to make of that information?

12:40 <FLHerne> I find it hard to imagine any application that could treat "white/blue monochrome" differently from "grayscale"

12:44 <pq> FLHerne, are you confident saying that no-one will ever need to know what the display is like? I wouldn't.

12:44 <pq> of course, it doesn't have to be KMS to deliver that information, it could be app settings too that the end user has to fiddle with

12:45 <pq> but, *if* we want KMS to be able to describe such displays, then this is my proposal for now.

12:47 <pq> otherwise, you probably use R8 or even XRGB8888 on that white/blue LCD, and hack your app colors so that they truncate to white/blue good enough to make sense on the display. Also taking into account how the driver happens to mangle XRGB8888 into 1-bit.

12:47 itoral has quit [Remote host closed the connection]

12:48 itoral has joined #dri-devel

12:50 <pq> maybe that embedded device has a web UI for arranging things on screen, and the preview would be nice to use the actual colors.

12:51 <pq> it's a question of how automatically and where is the system configured

12:53 <FLHerne> pq: I just can't see what action an application could make based on that. If the display isn't the right shade of blue, or it wants to render in orange, then...tough luck, it can only render in that shade of blue

12:56 <FLHerne> and if it isn't designed to be monochrome, trying to quantize other colours based on their blue content will be less legible than simply using intensity as if it's greyscale

12:56 <pq> FLHerne, that's true. It might have different color themes to pick from.

12:58 <pq> there is no reason to pick just one channel of RGB even if the display is blue, you can always weight all components. That may result in gray or some other scale, whatever you prefer.

12:58 <pq> *weigh

13:02 shsharma has joined #dri-devel

13:03 itoral has quit [Remote host closed the connection]

13:04 itoral has joined #dri-devel

13:13 itoral has quit [Remote host closed the connection]

13:14 itoral has joined #dri-devel

13:17 itoral has quit [Remote host closed the connection]

13:18 itoral has joined #dri-devel

13:25 Daanct12 has joined #dri-devel

13:30 Danct12 has quit [Ping timeout: 480 seconds]

13:31 kts has quit [Quit: Konversation terminated!]

13:32 itoral has quit [Remote host closed the connection]

13:35 Danct12 has joined #dri-devel

13:40 Daanct12 has quit [Ping timeout: 480 seconds]

13:43 off^ has joined #dri-devel

13:55 <zmike> airlied: nice, those fix everything except GTF-GL46.gtf21.GL2Tests.glGetUniform.glGetUniform

13:56 nchery has joined #dri-devel

13:58 <zmike> though as you noted in another ticket, that's just loss of precision

14:01 sdutt has joined #dri-devel

14:02 sdutt has quit []

14:02 sdutt has joined #dri-devel

14:07 JohnnyonF has joined #dri-devel

14:13 JohnnyonFlame has quit [Ping timeout: 480 seconds]

14:29 pcercuei has quit [Ping timeout: 480 seconds]

14:59 jewins has joined #dri-devel

15:02 APic has joined #dri-devel

15:08 mattrope has joined #dri-devel

15:18 <glennk> what would a drm fourcc be for a 4 bit packed single channel pixel format? "R4" or some other pet bikeshed color? asking for a friend...

15:18 <daniels> oh god

15:18 <pq> R4

15:19 <pq> or C4 if you need a palette :-p

15:20 <pq> I mean, I always assumed Rx and Cx with x < 8 are tightly packed.

15:21 <pq> but if they are not, then call it e.g. XR44 for one byte per pixel

15:22 <pq> DRM_FORMAT_XC62 etc. for a C2 pixel in a byte.
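A tightly packed 4-bit single-channel layout, as pq assumes for Rx and Cx with x < 8, could be sketched like this. The nibble order (first pixel in the high nibble) is an assumption made for the example; the discussion does not settle it, and `pack_r4`/`unpack_r4` are hypothetical helper names.

```python
def pack_r4(pixels):
    """Pack 4-bit pixel values two per byte, first pixel in the high nibble."""
    if any(not 0 <= p <= 0xF for p in pixels):
        raise ValueError("R4 pixel values must fit in 4 bits")
    padded = list(pixels) + [0] * (len(pixels) % 2)  # pad odd-length rows
    return bytes((padded[i] << 4) | padded[i + 1] for i in range(0, len(padded), 2))


def unpack_r4(data, count):
    """Inverse of pack_r4: return the first `count` 4-bit pixels."""
    out = []
    for byte in data:
        out.extend((byte >> 4, byte & 0xF))
    return out[:count]
```

The unpacked alternative pq mentions (one pixel per byte with padding bits) would skip the nibble arithmetic entirely at the cost of doubling the buffer size.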

15:23 pcercuei has joined #dri-devel

15:25 <glennk> C4 sounds volatile

15:29 <pq> YC88_420 would be fun: full resolution 8-bit luminance plane with 8-bit paletted color on 2x2 sub-sampled chroma plane. Or something.

15:30 <glennk> at least its not a bit plane interleaved tile format

15:31 <pq> Could also do GC88, where green is given as-is, but red-blue are given via palette

15:32 <glennk> will probably go the route R8 with alternate RGBA8888 and just convert in the driver, to keep userspace a bit more sane

15:32 <pq> D should probably be reserved for depth, because XR is hot

16:03 kts has joined #dri-devel

16:18 Duke`` has joined #dri-devel

16:21 <danvet> glennk, pq I wonder whether we should solve the complete bonkers stuff with modifiers

16:21 <danvet> but I guess more fourcc can't hurt

16:21 shsharma has quit [Remote host closed the connection]

16:21 <danvet> complete bonkers = funny bit interleaved nonsense like banked vga

16:23 <glennk> i think "complete bonkers" already got used up by x14rgb666

16:25 <danvet> yeah maybe

16:25 <danvet> glennk, well we don't have that yet in drm_fourcc.h, maybe there is still hope left

16:26 <glennk> or GBRG3553 (byteswapped rgb565)

16:28 mbrost has joined #dri-devel

16:29 <zmike> anholt: have you tried the mold linker at all yet? seems noticeably faster for mesa and cts at least

16:29 <danvet> glennk, we have a big/little endian flag that everyone ignores

16:30 shsharma has joined #dri-devel

16:30 <danvet> so that should be RGB565 | BIG_ENDIAN

16:30 <danvet> yes fourcc have flags, it's entertaining

16:31 <glennk> its not really a big endian format, its just byteswapped relative to the host
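The "GBRG3553" joke can be made concrete with a small sketch: take a 16-bit RGB565 value, swap its two bytes, and look at where the channel bits end up. The helper names are illustrative only.

```python
import struct


def pack_rgb565(r, g, b):
    """Pack 5/6/5-bit components: R in bits 15:11, G in 10:5, B in 4:0."""
    return (r << 11) | (g << 5) | b


def byteswap16(v):
    """Swap the two bytes of a 16-bit value."""
    return ((v & 0xFF) << 8) | (v >> 8)


# Full-intensity green is 0x07E0; after the swap its low 3 bits sit in
# bits 15:13 and its high 3 bits in bits 2:0, i.e. G3 B5 R5 G3.
SWAPPED_GREEN = byteswap16(pack_rgb565(0, 0x3F, 0))

# The same 16-bit value serialised little-endian and big-endian.
RED_LE = struct.pack("<H", pack_rgb565(0x1F, 0, 0))
RED_BE = struct.pack(">H", pack_rgb565(0x1F, 0, 0))
```

Whether one calls the result "big endian" or "byteswapped relative to the host" depends on the host, which is the distinction glennk is drawing.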

16:34 kts_ has joined #dri-devel

16:35 <imirkin> danvet: the BIG_ENDIAN flag doesn't work btw

16:36 <imirkin> it's not that it's ignored -- it will just trigger failures

16:36 <imirkin> adding support in the driver isn't enough, all the core bits need help too

16:37 <danvet> imirkin, oh I know

16:37 <imirkin> vmwgfx (iirc) limps along by only supporting rgba8888, which just gets flipped into another format very early on in the pipeline

16:37 <danvet> yeah there isn't really anyone who handles this correctly

16:38 <imirkin> but like the format lookup logic in drm_fourcc will fail if that flag is set, etc

16:38 <danvet> and userspace definitely won't render byteswapped rgb formats

16:38 <imirkin> (at least that's my recollection last i looked)

16:38 kts has quit [Ping timeout: 480 seconds]

16:38 <danvet> imirkin, you'd need to include it as a format with that flag set

16:38 <danvet> since it's strictly speaking a different one (in most cases at least)

16:39 <imirkin> i was going to make nvidia hw "respect" it properly, since they can support both irrespective of host endian

16:39 <danvet> and ideally we'd have a canonicalize stage or something

16:39 <imirkin> but the core just had no support for piping that info through

16:39 <danvet> so if you have byteswap hw, double your format list

16:39 <imirkin> there is no format for RGB565|BIG_ENDIAN

16:39 <imirkin> if you just do it like that, ka-boom
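Why OR-ing the flag into a fourcc goes "ka-boom" can be illustrated with a sketch. The `fourcc_code` packing and the bit-31 flag mirror the definitions in `drm_fourcc.h`; the `FORMAT_INFO` table and `format_info` lookup are hypothetical stand-ins for an exact-match format table, not the kernel's actual code.

```python
def fourcc_code(a, b, c, d):
    """Pack four ASCII characters into a 32-bit little-endian fourcc value."""
    return ord(a) | (ord(b) << 8) | (ord(c) << 16) | (ord(d) << 24)


DRM_FORMAT_BIG_ENDIAN = 1 << 31
DRM_FORMAT_RGB565 = fourcc_code("R", "G", "1", "6")
DRM_FORMAT_XRGB8888 = fourcc_code("X", "R", "2", "4")

# Hypothetical format-info table keyed by the exact fourcc value.
FORMAT_INFO = {
    DRM_FORMAT_RGB565: {"cpp": 2},
    DRM_FORMAT_XRGB8888: {"cpp": 4},
}


def format_info(fourcc):
    """Exact-match lookup; a fourcc with the flag bit set is a different key."""
    return FORMAT_INFO.get(fourcc)
```

With an exact-match table, `RGB565 | BIG_ENDIAN` is simply an unknown format unless it gets its own entry or the lookup masks the flag first, which are the two options danvet weighs in the following lines.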

16:39 <danvet> ah right the format info probably fails

16:39 <danvet> I guess should filter it out there

16:39 <danvet> instead of making the table twice the size

16:40 ybogdano has joined #dri-devel

16:40 <imirkin> the pre-nv50 hw can support both

16:40 <danvet> otoh if we just do it everywhere then format canonicalization becomes a mess

16:40 <imirkin> it's just a bit somewhere

16:40 <imirkin> whether to byteswap or not

16:40 <danvet> so maybe actually adding rgb565|BIG_ENDIAN makes more sense

16:40 <imirkin> yeah, it makes sense. but the format info is all off as a result

16:40 <danvet> yeah

16:41 <imirkin> not an infeasible task to fix, but ... hard to care

16:41 <imirkin> esp when "depth" works totally fine :)

16:41 <danvet> like with 8888 formats the ship kinda sailed, so there we should canonicalize

16:41 <danvet> but for the others I don't think allowing | BIG_ENDIAN blindly is a good idea

16:41 <danvet> e.g. all the ones which are just bytestreams like all the yuv stuff

16:41 <imirkin> actually the 8888 formats is where we already canonicalize -- there's logic which flips from RGBA8888 to ABGR8888

16:42 <danvet> only on the addfb1 -> addfb2 compat path

16:42 <danvet> not anywhere else

16:42 <imirkin> (based on a driver quirk)

16:42 <imirkin> ah yeah, probably

16:42 <danvet> so yeah we give you a canonical fourcc code

16:42 <danvet> but if userspace gives you an alias in addfb2, we don't do anything

16:44 <imirkin> in nouveau's case, iirc we disallow addfb2 on such configurations, and xf86-video-nouveau uses addfb1 anyways

17:04 iive has joined #dri-devel

17:07 <sravn> javierm: As long as the Kconfig entries are logical then the driver name is less important - but I like that it is not explicit

17:09 <sravn> javierm: For the binding the target group is small and they likely have to do some adaptations anyway - so here I suggest to go with a binding that describes the actual HW the best. And the backlight is conceptually separate - so describe it so.

17:09 gawin has joined #dri-devel

17:10 <sravn> The way to do so could be to deprecate the current pwms property and add the backlight node - so the name etc. is kept but backlight is described in a new way. And the drm driver only supports the non-deprecated way to specify backlight

17:11 <sravn> This models everything in the best and most logical way. Even in the fbdev world few drivers have built-in backlight support.

17:23 pcercuei has quit [Quit: Lost terminal]

17:24 pcercuei has joined #dri-devel

17:27 <glennk> backlight on an oled panel... is a concept...

17:30 <imirkin> well, brightness on a panel is a concept

17:30 <imirkin> (one which does nothing about the intelligence level of the content being viewed, unfortunately)

17:30 gawin has quit [Ping timeout: 480 seconds]

17:33 <javierm> sven: thanks for your review. Could you please mention in the ML ?

17:34 <javierm> sven: I don't really have a strong opinion. I do wonder what's the value of the backlight DT node since it won't have any properties...

17:34 <javierm> sven: but Geert and Andy seem to have stronger opinions on the topic so would be good if you comment there and see what they think

17:35 heat has joined #dri-devel

17:35 <sven> wrong sven again ;)

17:35 <javierm> sven: arg, sorry again! I meant sravn ^

17:37 mszyprow_ has joined #dri-devel

17:37 <javierm> dianders: feel free to add my r-b for the revert patch if you want to push ASAP

17:38 <dianders> javierm: OK, done. I'll push it now.

17:38 <javierm> dianders: I didn't get that this was only for a test and had no value otherwise

17:38 <javierm> dianders: btw, this is a patch I wrote to add a debugfs entry also for a test, commit ("28af109a57d1 driver core: add a debugfs entry to show deferred devices")

17:38 <javierm> dianders: if you want to use as a reference, shouldn't be more code than what you typed for sysfs

17:39 <dianders> javierm: OK, thanks! I'll take a look at it.

17:39 <javierm> cwabbott: yw

17:39 <javierm> err, dianders ^ obviously I can't type today

17:45 gouchi has joined #dri-devel

17:45 mszyprow_ has quit [Ping timeout: 480 seconds]

17:45 <maxzor> airlied, Yes they do focus on pal only... http://ix.io/3Okb

17:46 <dianders> javierm: So adding something basic to debugfs isn't hard, the problem I ran into was figuring out where to put it and how to manage the hierarchy. I could put a top-level "edp-panel" in debugfs and then list panels underneath, but that felt slightly ugly. It also would need to get deleted not based on driver remove but on _module_ remove (so if someone rmmod's edp-panel then the top level dir goes away).

17:47 <dianders> javierm: sysfs had the nice property that everything was already organized by device, which is why I moved my code there. There did seem to be DRM stuff in debugfs but it was more about exposing general DRM properties. This isn't _really_ a general drm property but is just a quirk about this particular panel driver I wanted to expose...

17:54 ngcortes has joined #dri-devel

17:57 ngcortes has quit [Remote host closed the connection]

17:59 gawin has joined #dri-devel

18:03 MajorBiscuit has quit [Ping timeout: 480 seconds]

18:03 mbrost has quit [Read error: Connection reset by peer]

18:03 haagch has quit [Quit: http://quassel-irc.org - Chat comfortably. Anywhere.]

18:05 haagch has joined #dri-devel

18:05 ybogdano has quit [Ping timeout: 480 seconds]

18:06 mbrost has joined #dri-devel

18:08 MajorBiscuit has joined #dri-devel

18:08 <maxzor> what is this? https://github.com/GPUOpen-Drivers/pal/blob/dev/src/core/hw/gfxip/rpm/g_rpmGfxPipelineBinaries.h

18:09 karolherbst has quit [Remote host closed the connection]

18:11 <pendingchaos> compiled shaders, mostly to copy, clear or decompress images

18:11 karolherbst has joined #dri-devel

18:14 <jenatali> Ugh, using softfp64 is going to like quadruple the CI time for the d3d12 driver :(

18:14 <imirkin> you should see what adding support for 64-bit vertex attribs does

18:14 karolherbst has quit [Remote host closed the connection]

18:14 <imirkin> there's 1GB of shader tests.

18:14 <jenatali> Ridiculous

18:14 <imirkin> i think the CI containers normally delete them

18:14 karolherbst has joined #dri-devel

18:15 <imirkin> (the idea of the tests was to cover lots of cases, but the end result is that _no_ cases are covered)

18:15 <jenatali> I think I might see about adding some lowering for the double instructions that DXIL is missing, so we can at least get mul, div, add, sub, and fma without having to use the softfp64

18:15 <jenatali> I think it's just floor/ceil/frac and rounding

18:16 <imirkin> are you _sure_ you don't have those?

18:16 <imirkin> those are _pretty_ basic

18:16 <jenatali> Yes

18:16 <jenatali> Unfortunately

18:16 <imirkin> all hw (which does fp64) definitely supports that

18:17 <imirkin> pretty sure even G200 supported that (the tesla-era DX10 GPU which had fp64 support)

18:17 <jenatali> Yeah. I figured. But for whatever reason they were never added to shader model 5, and fp64 just hasn't changed since then...

18:18 <imirkin> i'd encourage you to take another look

18:18 <jenatali> I've looked

18:18 <imirkin> in case it's called something funny

18:18 fxkamd has quit [Remote host closed the connection]

18:18 <imirkin> do you have a link to the SM5 ref pages?

18:18 <imirkin> it's not that i don't believe you, but ... i don't believe you ;)

18:18 fxkamd has joined #dri-devel

18:18 <jenatali> https://github.com/microsoft/DirectXShaderCompiler/blob/master/lib/DXIL/DxilOperations.cpp#L71 - see the table with "false" in the doubles column

18:20 <imirkin> looking at https://docs.microsoft.com/en-us/windows/win32/direct3dhlsl/dmov--sm5---asm- now ... unbelievable.

18:21 <jenatali> Yeah. I know. I've complained to our compiler folks and they agree it's kind of ridiculous. Hopefully they'll get it added for a future shader model, but also, fp64 just isn't a priority so who knows

18:22 <imirkin> i see. they don't specify a new "round", it's inherited from SM4

18:22 <imirkin> and SM4 doesn't have doubles

18:22 <jenatali> Yeah, exactly

18:22 <imirkin> much sad.

18:23 <jenatali> In SM5, double instructions need to explicitly be different from floats. In SM6 they're just another overload, but the set of valid overloads was just inherited from which instructions got added to SM5

18:25 MajorBiscuit has quit [Ping timeout: 480 seconds]

18:29 <cwabbott> jenatali: you might want to check out nir_lower_double_ops.c

18:29 <jenatali> cwabbott: It lowers fract and floor to each other :(

18:29 <cwabbott> uhh, no it doesn't or else intel would be really busted

18:30 <cwabbott> the list of double ops supported sounds awfully like the list of ops Intel supported before gfx12...

18:30 <jenatali> Oh, wait

18:30 <jenatali> Oh DXIL doesn't have ftrunc either, that's what was missing

18:30 <jenatali> Maybe that's all I need to add then?

18:30 <cwabbott> and that pass was all we needed to get intel working

18:31 <cwabbott> the pass lowers ftrunc too

18:32 <jenatali> Oh, then it just needs to be run in a loop instead of relying on the GL frontend doing it for me, I see

18:32 <cwabbott> nope

18:32 <cwabbott> nir_function_impl_lower_instructions is smart

18:32 <jenatali> Then maybe I just missed setting a bit...

18:32 <jenatali> Let me try again

18:32 <cwabbott> it will re-run the callback on lowered instructions

18:32 <cwabbott> so you never need to call it in a loop

18:33 <jenatali> Ack. Guess I just saw the unsupported instruction, saw it came from the lowering, and assumed

18:33 <jenatali> Thanks for making me look again

18:33 <cwabbott> you're basically in exactly the same boat as iris, btw

18:34 <cwabbott> either intel inherited the list of double instructions from DX or vice-versa

18:34 <jenatali> Good to know

18:34 <cwabbott> until one of the gens when they decided to nuke 'em all

18:41 heat has quit [Ping timeout: 480 seconds]

18:48 <jenatali> cwabbott: Yeah I just somehow missed adding nir_lower_dfract into my bitmask :(

18:49 <cwabbott> that'll do it :)

18:51 <jenatali> Pretty embarassing

18:53 frieder has quit [Remote host closed the connection]

18:59 <javierm> dianders: I wasn't in front of my laptop and didn't want to keep typing from my phone and look silly :)

19:00 ybogdano has joined #dri-devel

19:01 <javierm> dianders: I see what you meant. There seems to be though a drm_debugfs_create_files(), maybe you could use that ?

19:04 <airlied> robclark: so why have we all the code to upload consts in 16-bit if the hw can do it?

19:08 <javierm> dianders: looking at other DRM drivers, that's what they use and expose the debugfs entries in a dir using the struct drm_minor of the struct drm_device .primary

19:11 <javierm> dianders: maybe you can follow that convention for the edp-panel ? Looking at the entries added by drivers, many are specific and not general DRM properties

19:12 <dianders> javierm: Thanks for the pointer! I'm pretty sure I missed that function when looking before. I'll look in detail after lunch.

19:12 <javierm> dianders: Ok!

19:12 <robclark> airlied: hmm, where? Somehow I guess we aren't hitting that for freedreno because we very much expect our const bufs to not be packed to fp16..

19:14 agjohnston has joined #dri-devel

19:15 <airlied> robclark: _mesa_uniform does it

19:16 <airlied> copy_uniforms_to_storage has copy_to_float16

19:16 <airlied> robclark: ah you don't enable the fp16 const cap

19:16 <airlied> weird for some reason I thought you wrote that support

19:17 <robclark> I think mareko did

19:17 <airlied> must have been mareko

19:18 <zmike> hm radeonsi also exports PIPE_SHADER_CAP_GLSL_16BIT_CONSTS

19:19 <zmike> maybe this is the magical key

19:19 <airlied> mareko: so can you comment on how you see fp16 float mat3 packed?

19:23 alanc has quit [Remote host closed the connection]

19:23 alanc has joined #dri-devel

19:24 shsharma has quit [Remote host closed the connection]

19:31 devilhorns has quit []

19:33 agjohnston has left #dri-devel [#dri-devel]

19:35 shsharma has joined #dri-devel

19:40 pnowack has quit [Quit: pnowack]

19:53 tzimmermann has quit [Quit: Leaving]

20:00 lemonzest has quit [Quit: WeeChat 3.4]

20:08 callen92 has joined #dri-devel

20:11 <airlied> mareko: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14817 and linked issue, you might have some idea how you saw it working

20:13 pnowack has joined #dri-devel

20:19 ybogdano has quit [Ping timeout: 480 seconds]

20:19 <graphitemaster> O_o: Really surprised if (any(...) || any(...)) does not compile on NV, says there's no matching overload for "bor"

20:23 Haaninjo has joined #dri-devel

20:35 gouchi has quit [Remote host closed the connection]

20:39 ybogdano has joined #dri-devel

20:40 <sravn> javierm: Looked a bit more on the ssd130x stuff - typed my reply on ML.

20:41 <sravn> javierm: Forget about my uneducated rambling on irc, it helped reading a bit in the datasheets

20:42 ngcortes has joined #dri-devel

20:46 cworth has quit [Ping timeout: 480 seconds]

20:53 callen92 has quit []

20:53 mvlad has quit [Remote host closed the connection]

20:54 <javierm> sravn: thanks. Yes, I also was leaning towards fixing the DT but then Geert's comment and reading the datasheets opened my eyes

20:56 <demarchi> javierm: did you face any issue regarding merging https://patchwork.freedesktop.org/patch/470882/?series=99030&rev=2 in drm-misc-next?

20:56 <cheako> For vkCreateImageView is levelCount: 4294967295 a problem? This app, No Man's Sky, creates a bunch of smallish (960x120) images, creating two views for each.

20:59 <cheako> This is related to the video I've been sharing: https://youtu.be/QMBp0B9BCFQ
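
For reference on the levelCount question: 4294967295 is 0xFFFFFFFF, which is the value of `VK_REMAINING_MIP_LEVELS` (defined as `~0U` in the Vulkan headers), meaning "all levels from baseMipLevel onward". A rough sketch of how that sentinel resolves to a real count, with a locally defined constant and a hypothetical helper name so that it compiles without the Vulkan headers:

```c
#include <stdint.h>

/* Same value as VK_REMAINING_MIP_LEVELS; redefined here to stay standalone. */
#define REMAINING_MIP_LEVELS (~0U)

/* Hypothetical helper: resolve the sentinel against the image's level count. */
static uint32_t effective_level_count(uint32_t image_mip_levels,
                                      uint32_t base_mip_level,
                                      uint32_t level_count)
{
    if (level_count == REMAINING_MIP_LEVELS)
        return image_mip_levels - base_mip_level; /* everything from base onward */
    return level_count;                           /* explicit count, used as-is */
}
```

So a dump showing levelCount: 4294967295 on an image with a single mip level describes a view of exactly one level.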

21:01 <javierm> demarchi: yes I did and mentioned to you here in the channel. Sorry, I thought you saw it

21:03 <javierm> 16:34 < javierm> | demarchi: sorry, got distracted and now looked at your patch

21:03 <javierm> 16:34 < javierm> | demarchi: doesn't apply cleanly because drm-misc-next is still based on v5.16-rc5 and there are changes in include/linux/string_helpers.h landed in v5.17-rc1

21:03 <demarchi> javierm: oh... I lost that message

21:04 <javierm> 16:35 < javierm> | I could easily resolve the merge conflict but my worry is that could cause issues down the road

21:04 <javierm> 16:35 < javierm> | probably better to wait until drm-misc-next is rebased on top of v5.17-rcx ?

21:04 <demarchi> thanks... Yeah, it needs to be on 5.17-rc1 because there were other patches touching that file

21:04 mszyprow_ has joined #dri-devel

21:04 <demarchi> but yes, may be better to wait for the backmerge

21:04 <demarchi> javierm: thanks for looking into that

21:05 <javierm> demarchi: no worries. I'll try to remember pushing once drm-misc-next moves to 5.17-rc1, but please ping me in case I forget

21:05 <demarchi> will do

21:05 <javierm> demarchi: cool, thanks

21:10 ybogdano has quit [Ping timeout: 480 seconds]

21:12 mlankhorst has quit [Ping timeout: 480 seconds]

21:20 <airlied> anholt: in that pipeline in 13779 for the assert, where is the assert triggered?

21:28 gouchi has joined #dri-devel

21:28 gouchi has quit []

21:31 ngcortes has quit [Ping timeout: 480 seconds]

21:34 <dcbaker> Anyone interested in running mesa inside the build tree instead of installing it first might be interested in: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14826

21:35 <Sachiel> oh, that's interesting

21:35 <HdkR> Oh, that's cute. I love it.

21:36 mszyprow_ has quit [Ping timeout: 480 seconds]

21:48 shsharma has quit [Ping timeout: 480 seconds]

21:50 <jenatali> dcbaker: Ooh, cool. That sounds handy for running unit tests on Windows via meson devenv meson test

21:50 <jenatali> Don't have to copy binaries around then

21:50 <dcbaker> There's probably some more work to make it useful on windows

21:51 <jenatali> Oh I'm not talking about the Mesa-specific changes, just the feature in general :)

21:51 <jenatali> It adds all DLL-producing directories to PATH

21:53 <dcbaker> yup

21:54 <dcbaker> although I think we try to add DLL producing paths when running tests as well? Or maybe we've only talked about it but never done it

21:54 <dcbaker> If we haven't done that, we should

21:55 <jenatali> Pretty sure it hasn't been done. At least last I checked

21:55 <jenatali> Maybe there's just an old version of meson running in CI though

22:21 MajorBiscuit has joined #dri-devel

22:22 ngcortes has joined #dri-devel

22:23 ybogdano has joined #dri-devel

22:24 <cheako> I've been looking at this app, NMS, I've dumped vulkan api seeing that it's de-allocating and re-allocating a lot after receiving a VK_SUBOPTIMAL_KHR. I'm assuming I can write a layer that caches vulkan primitives and skips over sending these calls? I'm terrible at explaining.

22:25 <cheako> I wonder if this may illuminate the "re-create" cycle causing my FPS randomization.

22:26 <cheako> Such a layer could be useful in other debugging situations?

22:28 Haaninjo has quit [Quit: Ex-Chat]

22:30 Duke`` has quit [Ping timeout: 480 seconds]

22:32 MajorBiscuit has quit [Ping timeout: 480 seconds]

22:32 ngcortes has quit [Ping timeout: 480 seconds]

22:41 MajorBiscuit has joined #dri-devel

22:44 ybogdano has quit [Ping timeout: 480 seconds]

22:52 gawin has quit [Ping timeout: 480 seconds]

22:53 <cheako> At a glance it seems there is a leak, can the delta number of AllocateMemory and FreeMemory diverge? I assume something like the awk https://pastebin.com/65fhiHGR would show an even number of allocs-free, but instead this value increases.

22:55 <cheako> Validation layers don't show it.
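
The kind of bookkeeping cheako describes (counting allocations against frees in an API dump) can be sketched as below. The pastebin script itself is not reproduced here; this is an independent C sketch that assumes the dump is plain text in which each call appears as the function name followed by `(`. Both helper names are hypothetical.

```c
#include <string.h>

/* Count non-overlapping occurrences of needle in text. */
static long count_occurrences(const char *text, const char *needle)
{
    size_t len = strlen(needle);
    long n = 0;

    if (len == 0)
        return 0;
    for (const char *p = strstr(text, needle); p != NULL; p = strstr(p + len, needle))
        n++;
    return n;
}

/* Allocations minus frees seen so far in the dump (assumed text format). */
static long outstanding_allocations(const char *api_dump)
{
    return count_occurrences(api_dump, "vkAllocateMemory(")
         - count_occurrences(api_dump, "vkFreeMemory(");
}
```

A nonzero result at any single point only reflects allocations that are still live; it is the trend across successive samples of the dump that would hint at a leak.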

23:04 pzanoni has quit [Quit: Coyote finally caught me]

23:05 tjaalton_ has joined #dri-devel

23:07 tjaalton has quit [Ping timeout: 480 seconds]

23:09 danvet has quit [Ping timeout: 480 seconds]

23:15 MajorBiscuit has quit [Ping timeout: 480 seconds]

23:19 ybogdano has joined #dri-devel

23:19 gawin has joined #dri-devel

23:21 maxzor has quit [Quit: Leaving]

23:31 pzanoni has joined #dri-devel

23:32 pzanoni has quit [Remote host closed the connection]

23:34 pzanoni has joined #dri-devel

23:40 <dianders> javierm: probably not your working hours anymore, but I did spend some time looking at this. Unfortunately, drm_panel is _really_ quite disconnected from the rest of drm. There is no drm_device anywhere near the panel these days. Even back when there used to be a drm_device owned by the panel it wasn't really a good fit...

23:42 nchery has quit [Ping timeout: 480 seconds]

23:43 <dianders> javierm: Ah, though maybe this is one of those cases where I need to move edp_panel to use a panel bridge or something? I'll poke there...

23:57 <dianders> ...no, that's not right. panel bridge is for the other direction (the client of a panel). Also not clear that would help me get into the DRM's debugfs...

23:58 ybogdano has quit [Ping timeout: 480 seconds]

23:59 pcercuei has quit [Quit: dodo]

23:59 <airlied> anholt: just acked nuking the assert