u-amarsh04 has quit [Quit: Konversation terminated!]
u-amarsh04 has joined #dri-devel
cheako has quit [Quit: Connection closed for inactivity]
yyds has joined #dri-devel
yyds has quit [Read error: Connection reset by peer]
Leopold_ has quit [Remote host closed the connection]
yyds has joined #dri-devel
Leopold_ has joined #dri-devel
u-amarsh04 has quit []
u-amarsh04 has joined #dri-devel
u-amarsh04 has quit []
u-amarsh04 has joined #dri-devel
adarshgm has quit [Read error: Connection reset by peer]
<u-amarsh04>
git bisecting
<u-amarsh04>
git bisect skip 18 times in a row was tedious
bmodem has joined #dri-devel
sarthakbhatt has quit [Quit: Leaving.]
anujp has joined #dri-devel
Company has quit [Quit: Leaving]
heat has quit [Ping timeout: 480 seconds]
sukrutb has quit [Ping timeout: 480 seconds]
Duke`` has joined #dri-devel
sima has joined #dri-devel
anujp has quit [Ping timeout: 480 seconds]
fab has joined #dri-devel
yyds has quit [Ping timeout: 480 seconds]
<mareko>
zmike: the vertex shader input is an untyped 32-bit or 16-bit number. The input type doesn't matter; only the number of bits does. R8_UINT fully determines the contents of those bits, so in this case R8_UINT is always zero-extended to 32 or 16 bits for the shader input.
kts has joined #dri-devel
<mareko>
zmike: while the shader input type doesn't matter for how the input is initialized, it does matter for algebraic instructions, for example, signed and unsigned integer comparison instructions behave differently if the top bit is 1
<mareko>
zmike: so the full input type is really for the shading language itself, not for the input initialization
<mareko>
zmike: so yes, it's legal to have mismatching vertex formats and shader input types
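A minimal GL-side sketch of the point above, assuming a core-profile context with a loader such as libepoxy; the attribute location, buffer name, and shader variable are illustrative, not from the discussion:
    /* The vertex format (an R8_UINT-style attribute here) only fixes the bits
     * handed to the shader; whether the shader declares the input as uint or
     * int only affects how algebraic instructions (e.g. signed vs. unsigned
     * compares) treat those bits. */
    #include <epoxy/gl.h>

    /* GLSL side, for reference: layout(location = 0) in uint small_index; */
    static void setup_r8_uint_attrib(GLuint vbo)
    {
        glBindBuffer(GL_ARRAY_BUFFER, vbo);
        /* One unsigned byte per vertex, zero-extended to the 32-bit shader input. */
        glVertexAttribIPointer(0, 1, GL_UNSIGNED_BYTE, 1 /* stride */, (const void *)0);
        glEnableVertexAttribArray(0);
    }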
sukrutb has joined #dri-devel
GreaseMonkey has quit [Remote host closed the connection]
u-amarsh04 has quit [Quit: Konversation terminated!]
u-amarsh04 has joined #dri-devel
u-amarsh04 has quit []
damxo has joined #dri-devel
damxo has quit []
vliaskov has joined #dri-devel
u-amarsh04 has joined #dri-devel
YuGiOhJCJ has quit [Quit: YuGiOhJCJ]
<u-amarsh04>
still bisecting
benjaminl has joined #dri-devel
bolson has quit [Remote host closed the connection]
f11f12 has joined #dri-devel
pcercuei has joined #dri-devel
heat has joined #dri-devel
cmichael has quit [Quit: Leaving]
cmichael has joined #dri-devel
fab has quit [Quit: fab]
glennk has quit [Ping timeout: 480 seconds]
<countrysergei>
but the intset of bitsets uses some division to work out how to scan nodes from intervalfrom to intervalto.
<countrysergei>
but it might only be for debugging, not sure
kts has joined #dri-devel
kts has quit [Remote host closed the connection]
robmur01 has quit [Ping timeout: 480 seconds]
kts has joined #dri-devel
rasterman has joined #dri-devel
Company has joined #dri-devel
heat has quit [Read error: Connection reset by peer]
heat has joined #dri-devel
bmodem has quit [Ping timeout: 480 seconds]
Jeremy_Rand_Talos has quit [Remote host closed the connection]
Jeremy_Rand_Talos has joined #dri-devel
<countrysergei>
it does seem interesting, however: the last definition in the bitset suggests an encoding built from a base value of 64, so 64+1 is the first bit; I don't know the next ones yet, likely 64+2 etc., but the bytes are handled separately. min/max has some while loops, but those can be rewritten once the encoding is understood.
<countrysergei>
This comes very close to what I had been suggesting, so I think the code is usable
<zmike>
mareko: I was afraid you were going to say that 🤕
<zmike>
what was your ask the other day? whether rgba8 with stride=1 was legal?
kts has quit [Ping timeout: 480 seconds]
<countrysergei>
the docs suggest that some primitive checkpointing is supported; I haven't inspected what it means, but technically one would want to get rid of the SPARQL or triplestore parsing overhead for OpenMath dictionaries.
heat is now known as Guest3003
Guest3003 has quit [Read error: Connection reset by peer]
heat has joined #dri-devel
Jeremy_Rand_Talos has quit [Remote host closed the connection]
Jeremy_Rand_Talos has joined #dri-devel
<countrysergei>
also, the docs mention some limitations that don't seem very relevant to the use case on 64-bit architectures; the biggest limitation is that arbitrary precision is not supported, but a refinement id of 2 to the power of 31 already seems incredibly large
<countrysergei>
fully bounded SPARQL queries do not support this splay cache (I looked up on Wikipedia what that means), so those are not needed either.
countrysergei was kicked from #dri-devel by ChanServ [You are not permitted on this channel]
yyds_ has joined #dri-devel
minecrell has quit [Quit: Ping timeout (120 seconds)]
MrCooper has quit [Remote host closed the connection]
MrCooper has joined #dri-devel
kts has joined #dri-devel
kts has quit []
aravind has quit [Read error: Connection reset by peer]
yyds_ has quit [Read error: Connection reset by peer]
macromorgan_ has quit []
macromorgan has joined #dri-devel
yyds has joined #dri-devel
<Hazematman>
Hey, does anyone here happen to know the status of Mesa on Android? I'm messing around with the latest AOSP main and trying to build a Cuttlefish emulator image that includes the latest Mesa master (not the Mesa main that's in AOSP, but the latest on the fdo GitLab), and when I try to build I get the error "FAILED: ninja: unknown target 'MODULES-IN-external-mesa3d'"
tzimmermann has quit [Remote host closed the connection]
f11f12 has quit [Quit: Leaving]
mbrost has joined #dri-devel
mripard has quit [Quit: mripard]
yyds has joined #dri-devel
mbrost has quit [Remote host closed the connection]
mbrost has joined #dri-devel
anujp has joined #dri-devel
kts has quit [Ping timeout: 480 seconds]
mbrost has quit [Remote host closed the connection]
mbrost has joined #dri-devel
mbrost_ has joined #dri-devel
heat has quit [Remote host closed the connection]
heat has joined #dri-devel
mbrost has quit [Ping timeout: 480 seconds]
yyds has quit [Remote host closed the connection]
cmichael has quit [Quit: Leaving]
sukrutb has joined #dri-devel
<sima>
more people should know about drm_vblank_work
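For context, a minimal sketch of the drm_vblank_work API sima is referring to; the struct, callback, and function names are made up for illustration:
    #include <drm/drm_crtc.h>
    #include <drm/drm_vblank_work.h>

    struct my_crtc {
        struct drm_crtc base;
        struct drm_vblank_work flip_work;
    };

    static void my_flip_work_fn(struct kthread_work *base)
    {
        struct my_crtc *mc = container_of(to_drm_vblank_work(base),
                                          struct my_crtc, flip_work);

        /* Runs on the per-CRTC vblank worker once the target vblank count is
         * reached; do the vblank-synchronized update for mc here. */
    }

    static void my_crtc_init_work(struct my_crtc *mc)
    {
        drm_vblank_work_init(&mc->flip_work, &mc->base, my_flip_work_fn);
    }

    static void my_crtc_queue_work(struct my_crtc *mc, u64 target_vblank)
    {
        /* nextonmiss=false: run right away if the target count already passed. */
        drm_vblank_work_schedule(&mc->flip_work, target_vblank, false);
    }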
tobiasjakobi has joined #dri-devel
mbrost_ has quit [Ping timeout: 480 seconds]
tobiasjakobi has quit [Remote host closed the connection]
rasterman has joined #dri-devel
noord has left #dri-devel [...]
sarthakbhatt has joined #dri-devel
alyssa has quit [Quit: alyssa]
rasterman has quit [Quit: Gettin' stinky!]
glennk has quit [Ping timeout: 480 seconds]
<pepp>
sima: thx for your comments on the trace events series. Did you get a chance to look at v3?
<pepp>
sima: because it could answer your chained fences question with the addition made to dma_fence_chain_init
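For reference, a minimal sketch of how a dma_fence_chain link is built with the existing kernel API (this is not the tracepoint addition pepp is describing):
    #include <linux/dma-fence-chain.h>

    /* Appends `fence` to an existing chain; the new link becomes the head.
     * dma_fence_chain_init() takes over the prev and fence references. */
    static struct dma_fence *append_to_chain(struct dma_fence *prev,
                                             struct dma_fence *fence, u64 seqno)
    {
        struct dma_fence_chain *link = dma_fence_chain_alloc();

        if (!link)
            return NULL;

        dma_fence_chain_init(link, prev, fence, seqno);
        return &link->base;
    }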
<sima>
pepp, oops missed that, was a bit chaos this week
<sima>
looking now
jkrzyszt has quit [Ping timeout: 480 seconds]
<sima>
pepp, doesn't really answer any of the big design questions, since I still don't see how exactly you're going to tie it all together
<sima>
like what if you have a pile of apps and compositors rendering
<sima>
since I'm guessing you're guessing the actual dependencies through the processes that do stuff?
<sima>
or it's _extremely_ amdgpu specific, and that doesn't sound very useful
<sima>
or at least a quite suboptimal design point, since both the atomic commit machinery and drm/sched are very generic by now and know what's going on entirely in driver-independent code
<sima>
pepp, or put another way: if the generic events are only of use with the amdgpu specific stuff, they're not really generic
<sima>
(including existing amdgpu specific trace events imo)
<pepp>
sima: they shouldn't be amdgpu-specific. But it's also possible that I baked amdgpu-specific assumptions because that's the only hardware I can test on
<sima>
pepp, I mean if you can do the gpuvis dependency tracing with all amdgpu trace points disabled, then I think it's solid
<sima>
if you need any amdgpu specific events, then it doesn't look like a solid design yet
<sima>
that seems needed, and it definitely won't exist on other drivers which also use drm/sched, and so _do_ have the dependency information fully available in generic data structures
<sima>
and so the generic trace events should be able to get that out to userspace
<pepp>
no it's not needed; it also works fine without this. But the application doing the parsing needs to know how to transform a series of individual events into a list of jobs (= with a begin and an end)
<sima>
yeah, that's the part which doesn't really work
<sima>
and why I think we need a clear fence->fence trace event or it's just a mess
<sima>
and we kinda have that
<pepp>
even with a fence->fence event, the parsing app would have to determine which N-events form a job
<sima>
won't cover i915-display because, even though it's atomic, it still hand-rolls this stuff
<sima>
but not your problem imo
<sima>
also we have really annoying tracepoints because they're not even close to consistent with dumping stuff like fences or crtc
<sima>
but I'm not sure whether we can break them all or whether we need _v2 versions that are consistent
<sima>
so it's a pretty solid mess, but zero guessing needed for which dependencies belong to which work, that info is all there
<pepp>
sima: interesting, thanks!
<sima>
pepp, also I /think/ but not entirely sure that on the drm renderD side of things (not amdkfd) all the ttm memory management also goes through drm_sched_job
<sima>
so you might instead want to annotate those better with "what is this" rather than adding driver-specific events
<sima>
pepp, I think what would be really great is uapi docs about how you need to use those, and how to assemble them back into meaningful stuff, in the drm uapi section
<sima>
so we can make this official as "fully uapi tracepoints that we'll promise to never break"
<sima>
I think that would also help a lot in reviewing the overall design and whether it is something drm can sign up to support for a close approximation of "forever"
<pepp>
sima: alright, makes sense. Getting feedback from gpuvis dev would probably be useful too
<sima>
oh absolutely, if we make this forever uapi we need userspace that's reviewed by the userspace project
<sima>
so if they're not happy about the bazillion different ways we dump dma_fence into tracepoints, that would be something we need to fix
<sima>
pepp, oh a really fun testcase would be a multi-gpu system
<sima>
like amd apu+discrete or so
<sima>
can't be i915 because it's not using drm/sched and it also hand-rolls too much of the atomic code
<pepp>
sima: I've tested multi-GPU a couple of times, was working fine
<sima>
but xe.ko should be on board with all this
<sima>
pepp, like rendering on discrete and displaying on integrated?
<pepp>
yes
<sima>
that's the fun one ...
<sima>
nice :-)
<sima>
pepp, oh if it's not clear how to get the atomic commit dependencies - drm_atomic_helper_wait_for_fences has the authoritative answer
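A rough sketch of where that helper sits for a driver hand-rolling its commit path; drivers using the stock atomic helpers get this call for free, so this is only to show where the commit's fence dependencies are resolved:
    #include <drm/drm_atomic_helper.h>

    static void example_commit_tail(struct drm_atomic_state *state)
    {
        struct drm_device *dev = state->dev;

        /* Blocks on every plane in-fence attached to the commit, which is the
         * authoritative set of atomic-commit dependencies mentioned above. */
        drm_atomic_helper_wait_for_fences(dev, state, false);

        drm_atomic_helper_commit_modeset_disables(dev, state);
        drm_atomic_helper_commit_planes(dev, state, 0);
        drm_atomic_helper_commit_modeset_enables(dev, state);

        drm_atomic_helper_commit_hw_done(state);
        drm_atomic_helper_wait_for_vblanks(dev, state);
        drm_atomic_helper_cleanup_planes(dev, state);
    }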
gallo[m] has joined #dri-devel
<pepp>
sima: noted, thanks
<sima>
or should have, and I /think/ all drivers except should be covered including any memory management fences
glennk has joined #dri-devel
<sima>
*except i915
<DemiMarie>
sima: is it reasonable for native contexts to `mlock()` all buffers passed to the GPU?
<sima>
if not that would be a good reason to fix these drivers by moving them to standard functions
<sima>
DemiMarie, it won't work for drm buffer objects
<DemiMarie>
sima: are those pageable?
<sima>
yeah
<DemiMarie>
can that be disabled?
<sima>
where's the fun in that :-P
<sima>
especially for discrete you'd probably break the world since this breaks vram swapping ...
<sima>
so I'd expect any gpu buffer object mlock to be somewhat driver specific :-/
<sima>
DemiMarie, my brain fails me and I can't remember why you need mlocked gpu memory?
<sima>
we chatted about how hard preemption is, but I forgot why mlock matters
<DemiMarie>
sima: First, it avoids taking a bunch of complex code paths in the driver that are likely to have bugs. Second, any memory that ever gets mapped into a Xen VM must be pinned. Third, any page _provided_ by a Xen VM will be (inherently) pinned.
<DemiMarie>
I expect most users to have iGPUs primarily, simply because those are the majority of GPUs on the market.
<DemiMarie>
So far, my understanding is that the two biggest concerns about virtio-GPU native contexts for Qubes OS are the rate of memory unsafety vulnerabilities in drivers and the lack of a mechanism to notify the security team when such a vulnerability is discovered.
<sima>
hm so not sure how many bugs you avoid, because we're still going to run all the code to prepare&map the memory
<sima>
maybe a few less corner cases
<sima>
DemiMarie, wrt security team, airlied&me aren't on that list, but we do get pulled in as needed for any gpu security issue
<sima>
so security@kernel.org should work for these too
<DemiMarie>
sima: so the context is that Qubes OS issues a Qubes Security Bulletin whenever a vulnerability is discovered that affects the security of Qubes OS. This includes vulnerabilities in Qubes OS’s dependencies.
<sima>
and aside from some really old hw horrors we do expect that if a drm driver has renderD nodes, each open file of that is isolated from the others
<sima>
ah yeah that one you won't get, simply because there's way too many of these and analyzing them all is hard work
<DemiMarie>
how many (give or take a factor of 2) do you mean?
<DemiMarie>
Obviously the total number of kernel vulns is huge, but only a small subset of those are relevant here.
<sima>
well I mean including all gotchas in drm code with uaf, locking fun that looks exploitable, input validation lolz and bugs you can probably break
<sima>
I'd expect a substantial part of cc: stable patches are exploitable for drm
<DemiMarie>
how many of those per year do you have roughly?
<sima>
assuming they are for a driver you care about
<Hazematman>
I'm trying to get llvmpipe&lavapipe building, so I also had to make some changes to include an LLVM build that made them happy
<Hazematman>
Right now waiting for the image to build
<sima>
DemiMarie, ok, from a rough and inaccurate git log, the 6.1 LTS kernel has ~65 fixes to shared drm core
<sima>
about every 5th looks real scary, most of the others are hw quirks and stuff like that
<Hazematman>
gallo: also thanks for the compliment :)
<sima>
real scary = might be exploitable, but I'm definitely not going to make a fool of myself and guess
<sima>
DemiMarie, 6.1 is a bit older than a year
<sima>
I didn't look at drivers because a) that's much harder to assess and b) there's a lot more noise that's probably just fixing display issues but can't be exploited in any meaningful way
<sima>
so 10 per year for drm core code sounds about right, plus whatever is for the drivers you're using
<DemiMarie>
sima: 10 per year is about what we have for everything else in Qubes OS, plus the number of stuff in drivers which is about the same IIRC
<sima>
(bit of an aside, but that's what I expect the firehose of CVEs will also match once the new kernel CNA gets going)
<DemiMarie>
hopefully that gets big companies (Google?) to throw more people at increasing overall code quality
<sima>
I think the expectation is that there'll be on the order of hundreds of CVEs for each kernel release
<sima>
ofc a really big amount only apply to specific hw support, but there's a _lot_
<sima>
DemiMarie, there's also the issue that with all the kernel hardening enabled, a lot of these are a lot less exploitable
<sima>
but since those are all Kconfig knobs, they all count
<sima>
plus you probably still want to patch, just in case someone figures out how to knock out the hardening
<DemiMarie>
sima: I wonder if companies will start pushing to rip out old code, simplify stuff, etc
<sima>
DemiMarie, old code is generally dead code
<sima>
and I think in practice the issue is a lot more that upgrading breaks too much, so I expect that users who care hopefully are a lot more motivated to build up _really_ good CI
<sima>
so that they can validate new upstream release faster
<DemiMarie>
which will in turn help everybody
<sima>
a leisurely year or so that even the good android vendors take to upgrade is just not going to work
<sima>
yeah
<sima>
and hopefully also catch issues faster and before they hit a release
<sima>
DemiMarie, for actual fundamental improvement I think weeding out the stupid "undefined behaviour lolz" in the linux kernel C flavor is going to help a lot more
<DemiMarie>
I also suspect enterprise distros will start trimming their Kconfigs.
<sima>
it's really hard, but a lot has been achieved already
<sima>
oh yeah
<sima>
plus probably enable a lot more hardening, even if it costs
<sima>
since it doesn't help with the flood, but it helps with the severity
<DemiMarie>
It is extremely obvious that a CVE does not affect a distro if the fix is to code that is not included in the build
<sima>
so you have a bit more time since it's not obvious stuff that even fools can exploit
<sima>
yeah
<sima>
so maybe also some build tools that generate the actual list of CVEs impacting your build
<DemiMarie>
For Qubes OS turning on GEM_BUG_ON() might be a good idea
<DemiMarie>
looks like it would catch at least one OOB write
<sima>
also I figure that stuff like CONFIG_VT=n will hopefully accelerate
<sima>
there's some really horrible stuff there
<DemiMarie>
syzbot is also going to start auto-assigning CVEs, IIUC
<sima>
yeah maybe
<sima>
although that one kinda boils down to "who pays for the work to fix stuff"
<DemiMarie>
enterprise distro vendors
<DemiMarie>
they will have no choice due to compliance requirements
<sima>
yeah maybe someone will find some budget hopefully
<sima>
plus disabling a lot of the old horrors like CONFIG_VT
konstantin_ has joined #dri-devel
konstantin is now known as Guest3039
konstantin_ is now known as konstantin
<DemiMarie>
at what point will non-enterprise distros be able to turn that off?
<sima>
entirely depends upon how hard they care about the kernel console
<DemiMarie>
the problem is that client hardware has no OOB management and no serial port
<sima>
I think most desktop distros are actually leading enterprise distros, since there's some infra work missing still that needs really recent kernels
<sima>
android/cros have it disabled since years
<DemiMarie>
So until userspace comes up you are booting up blind
<sima>
oh do not rely on drm for emergency logging
<sima>
it's entirely busted
<sima>
but we'll get a new drm panic handler to fix this properly for real, which is one of the infra issues
simon-perretta-img has quit [Ping timeout: 480 seconds]
<DemiMarie>
My long-term hope is for much of DRM and the GPU drivers to be rewritten in Rust
<sima>
DemiMarie, also, de facto, desktop distros load the gpu drivers from initrd, and at that point you can have a small userspace logger running too
<DemiMarie>
Heck, even a way to make use-after-free and OOB memory access defined at the cost of a 4x slowdown and serializing everything in the kernel might be a win for some setups.
<sima>
because of the entire fw issues
<DemiMarie>
sima: I think we will see stuff moving to a dm-verity volume for fw
<sima>
DemiMarie, all the integer math stuff is getting fully defined at least
<sima>
and I think range-validated arrays are coming, yay
<DemiMarie>
sima: if there was a way to have overflow trap in release builds that would be awesome
<sima>
compiler-checked range-validated arrays I mean
<sima>
DemiMarie, it's coming
<sima>
the issue is that there's a lot of integer math that intentionally overflows
<sima>
to check userspace input
junaid has joined #dri-devel
<sima>
so you can't oops on that or you just made an exploit :-)
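A small sketch of the pattern being described, using the kernel's overflow helpers so hostile input is rejected rather than trapped on; the function and parameter names are illustrative:
    #include <linux/errno.h>
    #include <linux/overflow.h>
    #include <linux/types.h>

    /* Typical ioctl-style validation: the multiplication is *expected* to
     * overflow on bad userspace input, and the helper reports that instead of
     * the kernel oopsing or silently wrapping. */
    static int validate_user_array(u32 count, u32 elem_size, size_t *total_out)
    {
        size_t total;

        if (check_mul_overflow((size_t)count, (size_t)elem_size, &total))
            return -EINVAL;

        *total_out = total;
        return 0;
    }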
<DemiMarie>
that still leaves UAFs and locking problems, though
<sima>
yeah
<sima>
although there's scope based automatic cleanup now
<DemiMarie>
you can solve those with fat pointers and a global lock but it means a 4x or more slowdown last I checked
<sima>
that should at least help a lot with bugs in error paths
<sima>
but ofc, huge amounts of work
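A minimal sketch of the scope-based cleanup mentioned above (linux/cleanup.h); lock and buffer names are illustrative:
    #include <linux/cleanup.h>
    #include <linux/mutex.h>
    #include <linux/slab.h>

    static int example_update(struct mutex *lock, size_t len)
    {
        /* Freed automatically on every return path, including the error ones. */
        char *buf __free(kfree) = kzalloc(len, GFP_KERNEL);

        if (!buf)
            return -ENOMEM;

        guard(mutex)(lock);     /* mutex_unlock() runs when the scope exits */

        /* ... fill buf and update state under the lock ... */
        return 0;
    }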
Guest3039 has quit [Ping timeout: 480 seconds]
<DemiMarie>
ultimately, though, I think C needs to be replaced
<sima>
DemiMarie, yeah given that rcu use is growing steadily I don't think that'll work
<DemiMarie>
C is just not a viable language in the 2024 threat environment
<sima>
outside of some very niche cases
<sima>
yeah, but the issue is a bit that there's too much C
<sima>
so I think both improving C as much as possible and working on replacing it is needed
<sima>
there's some good stuff coming in C standards discussions afaik too
<DemiMarie>
And also deprivileging it by moving it to userspace
<DemiMarie>
Unfortunately for DRM/GPU stuff that fails miserably, because (at least in the Qubes OS threat model) the GPU driver is privileged by definition!
<sima>
oh yeah absolutely
<sima>
like config_vt=n is just a must
<sima>
DemiMarie, well if you look at the entire gpu stack we already have like 90% in userspace
<DemiMarie>
sima: I also mean stuff like networking, USB, Bluetooth, Wi-Fi, filesystems, etc
<sima>
yeah those are all fairly tricky
<sima>
DemiMarie, I wonder whether per-gpu-ctx processes on the host side for virtio-gpu would be doable
<DemiMarie>
sima: Maybe? How would it help?
<sima>
that way if you can exploit the userspace part of the hw driver, it should be a lot more limited
<DemiMarie>
sima: the current plan is native contexts, so the userspace part runs in the guest
simon-perretta-img has joined #dri-devel
<sima>
yeah that's another one, also should be faster
<DemiMarie>
The old virgl/venus stuff is never going to be supported in Qubes OS
<DemiMarie>
In fact one major reason that work for GPU accel in Qubes OS has not started yet is that native contexts are still not upstream except for Qualcomm last I checked
gouchi has joined #dri-devel
gouchi has quit [Remote host closed the connection]
rah has joined #dri-devel
simon-perretta-img has quit [Ping timeout: 480 seconds]
Lyude has quit [Quit: Bouncer restarting]
simon-perretta-img has joined #dri-devel
Lyude has joined #dri-devel
junaid has quit [Remote host closed the connection]
simon-perretta-img has quit [Ping timeout: 480 seconds]
simon-perretta-img has joined #dri-devel
jsa has joined #dri-devel
Duke`` has quit [Ping timeout: 480 seconds]
Duke`` has joined #dri-devel
<DemiMarie>
Google said (at XDC 2023) that 80% of KMD vulnerabilities are not exploitable in the native context case. So that reduces the flood from 20 VM escapes per year down to something much smaller.
tursulin has quit [Ping timeout: 480 seconds]
jsa has quit []
countrysergei has quit [Remote host closed the connection]
mvlad has quit [Remote host closed the connection]
mbrost has joined #dri-devel
sravn has quit []
sravn has joined #dri-devel
heat is now known as Guest3066
Guest3066 has quit [Read error: Connection reset by peer]
heat has joined #dri-devel
rgallaispou1 has joined #dri-devel
rgallaispou has quit [Read error: Connection reset by peer]
rgallaispou1 has quit [Read error: Connection reset by peer]
mbrost has quit [Remote host closed the connection]
rgallaispou has joined #dri-devel
mbrost has joined #dri-devel
heat is now known as Guest3067
Guest3067 has quit [Read error: Connection reset by peer]
heat has joined #dri-devel
flom84 has quit [Ping timeout: 480 seconds]
vliaskov has quit []
sukrutb has joined #dri-devel
sima has quit [Ping timeout: 480 seconds]
<karolherbst>
jenatali: sooo.. we need to support opencl-c.h with llvm-15+ after all, because support for some extensions is missing if not using it (e.g. cl_intel_subgroups). Do you want a compile-time switch to include it in the binary, or should I just unconditionally embed it with static llvm? It's like 800kb
<karolherbst>
ehh wait..
<karolherbst>
it's probably less after compression
<karolherbst>
though we only compress libclc?
<karolherbst>
anyway...
<karolherbst>
do you want a flag for the file? Though given that some AI/ML stuff just depends on that intel subgroup ext, we might as well already ship it...
<jenatali>
karolherbst: I don't care one way or another. It's a drop in the bucket compared to actual clang
<karolherbst>
true
<karolherbst>
makes it easier for me to always include it :)
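Purely as a sketch of the compile-time-switch idea being discussed; the macro and symbol names below are invented for illustration and are not Mesa's actual build options:
    /* If the build embeds opencl-c.h (e.g. dumped into a byte array by a build
     * script, optionally compressed next to libclc), hand it to the compiler;
     * otherwise fall back to clang's builtin declarations. */
    #ifdef HAVE_EMBEDDED_OPENCL_C_H
    extern const char embedded_opencl_c_h[];
    extern const unsigned embedded_opencl_c_h_size;

    static const char *get_opencl_c_header(unsigned *size)
    {
        *size = embedded_opencl_c_h_size;
        return embedded_opencl_c_h;
    }
    #else
    static const char *get_opencl_c_header(unsigned *size)
    {
        *size = 0;
        return NULL;
    }
    #endif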
mbrost has quit [Remote host closed the connection]
mbrost has joined #dri-devel
mbrost has quit [Remote host closed the connection]
mbrost has joined #dri-devel
glennk has quit [Ping timeout: 480 seconds]
jeeeun841351908 has quit [Remote host closed the connection]