ChanServ changed the topic of #dri-devel to: <ajax> nothing involved with X should ever be unable to find a bar
oneforall2 has quit [Remote host closed the connection]
fedora has quit [Ping timeout: 480 seconds]
minecrell has quit [Ping timeout: 480 seconds]
minecrell has joined #dri-devel
knurd_ has joined #dri-devel
oneforall2 has joined #dri-devel
JRepin has quit []
JRepin has joined #dri-devel
apinheiro has quit [Quit: Leaving]
knurd has quit [Ping timeout: 480 seconds]
amarsh04 has quit []
vliaskov has quit [Ping timeout: 480 seconds]
glennk has quit [Ping timeout: 480 seconds]
u-amarsh04 has joined #dri-devel
<daniels> mareko: ask ajax and MrCooper, but I think they only really need llvmpipe and spice
<daniels> jenatali: thankyou!
<jenatali> I'd been putting it off. I really hate building LLVM
<daniels> me too buddy
<jenatali> Apparently the Vulkan runtime no longer installs unattended with /S but now the SDK includes it?
JRepin has quit []
JRepin has joined #dri-devel
mbrost has joined #dri-devel
nerdopolis has quit [Read error: Connection reset by peer]
nerdopolis has joined #dri-devel
<zmike> tarceri: actually I assigned for you to make sure it goes in since it's blocking another MR from landing
zsoltiv__ has quit [Ping timeout: 480 seconds]
guludo has quit [Quit: WeeChat 4.5.1]
mbrost has quit [Ping timeout: 480 seconds]
The_Company has joined #dri-devel
Kayden has joined #dri-devel
Company has quit [Ping timeout: 480 seconds]
heat has quit [Ping timeout: 480 seconds]
<jenatali> Ugh how do I see which LLVM module is needed but missing?
<airlied> usually grep
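airlied's "usually grep" can be made concrete. A self-contained sketch, with an invented file and module list standing in for Mesa's real meson.build (the actual variable name and contents differ across branches):

```shell
# Invented stand-in for the build file that lists required LLVM modules;
# Mesa's meson.build carries a similar list, but names may differ.
cat > /tmp/fake_meson.build <<'EOF'
llvm_modules = ['bitwriter', 'engine', 'mcdisassembler', 'mcjit']
EOF

# Step 1: grep out which modules the build asks for.
grep -o "'[a-z]*'" /tmp/fake_meson.build | tr -d "'"

# Step 2 (on systems that have llvm-config): compare against what the
# installed LLVM actually provides, e.g.:
#   llvm-config --components | tr ' ' '\n' | sort > have.txt
```

Diffing the two lists points at the component that is needed but missing.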
alane has quit []
alane has joined #dri-devel
The_Company has quit []
Turkish-Men has quit [Remote host closed the connection]
TMM has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]
TMM has joined #dri-devel
nerdopolis has quit [Ping timeout: 480 seconds]
JRepin has quit []
JRepin has joined #dri-devel
JRepin has quit []
JRepin has joined #dri-devel
robmur01 has quit [Ping timeout: 480 seconds]
lemonzest has quit [Quit: WeeChat 4.5.1]
lemonzest has joined #dri-devel
azerov has joined #dri-devel
fab has joined #dri-devel
robmur01 has joined #dri-devel
YuGiOhJCJ has quit [Quit: YuGiOhJCJ]
NiGaR has quit [Remote host closed the connection]
davispuh has quit [Ping timeout: 480 seconds]
kts has joined #dri-devel
NiGaR has joined #dri-devel
valpackett has joined #dri-devel
lplc_ has quit []
lplc has joined #dri-devel
itoral has joined #dri-devel
JamesidowuToyin[m] has joined #dri-devel
jsa1 has joined #dri-devel
fab has quit [Quit: fab]
dolphin has joined #dri-devel
sima has joined #dri-devel
glennk has joined #dri-devel
kts has quit [Ping timeout: 480 seconds]
fab has joined #dri-devel
<MrCooper> mareko: technically I'm in a different team now (focus on mutter & Xwayland), as is ajax, so you rather need to ask airlied or José Exposito; AFAIR we do support amdgpu with acceleration on ppc64el in RHEL in principle though, so not having any CI coverage isn't great
kts has joined #dri-devel
JamesidowuToyin[m] has quit [autokilled: This host violated network policy and has been banned. Mail support@oftc.net if you think this is in error. (2025-01-30 07:45:39)]
vliaskov has joined #dri-devel
rasterman has joined #dri-devel
sghuge has quit [Remote host closed the connection]
sghuge has joined #dri-devel
jsa1 has quit [Ping timeout: 480 seconds]
jsa1 has joined #dri-devel
tzimmermann has joined #dri-devel
phasta has joined #dri-devel
jkrzyszt has joined #dri-devel
kaiwenjon_ has joined #dri-devel
kaiwenjon has quit [Read error: Connection reset by peer]
<sima> dakr, good mail, thanks for doing the wrestling
<sima> also chatted with airlied and we're at 15+ years of dma-api maintainers randomly nacking stuff gpu drivers want/need
dsimic is now known as Guest7449
dsimic has joined #dri-devel
Guest7449 has quit [Ping timeout: 480 seconds]
jsa1 has quit [Ping timeout: 480 seconds]
apinheiro has joined #dri-devel
jsa1 has joined #dri-devel
lynxeye has joined #dri-devel
kaiwenjon_ has left #dri-devel [#dri-devel]
kaiwenjon has joined #dri-devel
mehdi-djait3397165695212282475 has joined #dri-devel
kts has quit [Ping timeout: 480 seconds]
kts has joined #dri-devel
oppagangnam has joined #dri-devel
oppagangnam has quit [Remote host closed the connection]
heuristicsman has joined #dri-devel
kts has quit [Ping timeout: 480 seconds]
heuristicsman has quit [Remote host closed the connection]
feaneron has joined #dri-devel
mvlad has joined #dri-devel
feaneron has quit [Remote host closed the connection]
<sima> DemiMarie, on the amd/virtio discussion, all these issues you point out is why I think there's either pup(FOLL_LONGTERM) or real hw support so that the iommu/gpu handles page faults/invalidations at the hw level
<sima> and the mmu notifiers just pass tlb flush commands forward as needed
feaneron has joined #dri-devel
<sima> anything else indeed just falls apart everywhere at the seams
distrohumiliation has joined #dri-devel
distrohumiliation has quit [Remote host closed the connection]
tangoentanglement has joined #dri-devel
guludo has joined #dri-devel
Company has joined #dri-devel
tangoentanglement has quit [Remote host closed the connection]
kode548 has joined #dri-devel
kode54 has quit [Ping timeout: 480 seconds]
feaneron has quit [Remote host closed the connection]
kode548 has quit []
kode54 has joined #dri-devel
jinglearoundstars has joined #dri-devel
rsalvaterra_ has joined #dri-devel
rsalvaterra_ is now known as rsalvaterra
Omax has quit [Ping timeout: 480 seconds]
nerdopolis has joined #dri-devel
JRepin has quit []
JRepin has joined #dri-devel
Omax has joined #dri-devel
feaneron has joined #dri-devel
kzd has quit [Quit: kzd]
sguddati has joined #dri-devel
<zmike> mareko: is LINEAR really not supported for RGBA32F formats?
apinheiro has quit [Quit: Leaving]
<zmike> cuz it seems to work...
itoral has quit [Quit: Leaving]
kzd has joined #dri-devel
vliaskov has quit [Read error: No route to host]
pcercuei has joined #dri-devel
sarnex has quit [Read error: No route to host]
jsa1 has quit [Ping timeout: 480 seconds]
sarnex has joined #dri-devel
jinglearoundstars has quit [Remote host closed the connection]
sguddati has quit [Ping timeout: 480 seconds]
jsa1 has joined #dri-devel
nerdopolis has quit [Ping timeout: 480 seconds]
sandiorboiko has joined #dri-devel
nashpa has quit []
dliviu has joined #dri-devel
hexa- has quit [Quit: WeeChat 4.4.3]
hexa- has joined #dri-devel
sandiorboiko has quit [Remote host closed the connection]
jsa1 has quit [Ping timeout: 480 seconds]
Omax has quit [Ping timeout: 480 seconds]
Omax has joined #dri-devel
rgallaispou has quit [Read error: Connection reset by peer]
rgallaispou has joined #dri-devel
heat has joined #dri-devel
<DemiMarie> sima: In this case pup(FOLL_LONGTERM) is even more attractive because device memory is just virtual memory.
<DemiMarie> sima: Can the forced migration to device memory be done reliably?
<DemiMarie> Also, time to bypass the DMA API maintainers and send something directly to Linus?
<phasta> You should think long-term. Are fixes and reworks then also going to be sent directly to him 3 years down the road?
dolphin has quit [Quit: Leaving]
<sima> DemiMarie, I didn't really follow that part since it was about virtio specific things
<sima> the kernel really can't, because if you do this like hmm you again need hw support for pagefaults
<sima> plus hmm cannot guarantee migration to device memory
<DemiMarie> sima: the idea I had is to move the pages to device memory and leave them there
<sima> anon memory probably freaks out to no end if it's suddenly device memory without a struct page
<DemiMarie> If you don't have HW support for pagefaults then it's up to the host kernel to fail the operation.
<DemiMarie> What about device memory with a struct page?
<sima> you could do it as coherent device memory, then anon memory works in your device memory (unlike device private memory that hmm uses)
<sima> but you're again stuck on the core mm's inability to guarantee migration
<sima> migration is all best effort
<DemiMarie> stop_machine()? Only half joking.
<sima> not enough
<DemiMarie> Why can't migration be reliable?
<sima> linux core mm does a lot of randomly grabbing a page/folio reference, and those all block migration
<sima> with enough whacking it mostly works for stuff like cma or memory hotunplug with zone_moveable, but it's brittle
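The refcount race sima describes can be sketched in pseudocode (a simplification; the real migration path in mm/migrate.c has many more cases):

```text
migrate_folio(folio):
    # every mapping plus the migration code itself accounts for one ref
    expected = mapcount(folio) + 1
    if refcount(folio) != expected:
        return -EAGAIN   # someone holds a transient (or pinned) ref; blocked
    unmap folio, copy contents to the destination, remap
```

Since any core-mm path may briefly take a reference at any time, the check can fail indefinitely, which is why migration is only ever best effort.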
<DemiMarie> What about make_device_exclusive_range() or similar, but without the exclusive part?
fab has quit [Quit: fab]
<sima> pup(FOLL_LONGTERM) is one of the pieces to make it less brittle, so that you know whether an elevated refcount is temporary and more retrying should help
<sima> or a permanent pin, and more retrying is only going to heat the world
<sima> DemiMarie, that doesn't move anything
* heat the world
<sima> DemiMarie, I guess you could try with coherent device memory and just migrating really, really hard
<sima> then you're at the same peril like cma or memory hotunplug
<DemiMarie> sima: could there be a way to lock out anyone who tries to grab a reference?
<sima> but for per critical stuff like hmm migration it's fundamentally fallible
<sima> DemiMarie, disable all the cool features like transparent hugepages
<sima> numa load balancing
<sima> ksm
<sima> writeback too iirc
<sima> constantly more getting added
<sima> defo direct i/o
<DemiMarie> sima: I meant "grab a mutex so they block"
<sima> no
TMM has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]
TMM has joined #dri-devel
<heat> in theory you could do that, but you'd just move the "heating the world" to the opposite, ref-grabbing direction
<DemiMarie> Why is that?
<sima> see link but tldr is the linux core mm is designed on the principle that quicksand is awesome
<heat> because if there was a refcount lock-out you'd spin on folio_get
<heat> because there isn't, you spin on page migration (or fail)
<heat> it's way easier to fail page migration than failing a normal-ass refcount
<sima> it's also that core mm is lockless to the max
<DemiMarie> For performance reasons?
<heat> yes
<sima> so even if you hold a reference and the lock for something, it's really surprising how little guarantees that often gives you
<sima> like the entire pte walking is just pure yolo, and it happens absolutely everywhere all the time
<DemiMarie> Why does it not crash? RCU?
<heat> hey it's not pure yolo it's homebred RCU
<sima> some of the best people in the world banging their heads at it for decades
<heat> gup_fast generally just disables interrupts and doesn't use RCU
<sima> heat, oh yeah it's a work of art
<heat> to free a page table you need to do a TLB shootdown thus IPI thus if your IRQs are disabled it's safe to traverse
<heat> it is in effect homebred RCU
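A pseudocode sketch of the gup_fast trick heat describes (a simplification of mm/gup.c; the IPI argument only holds on architectures where freeing a page table waits for a TLB shootdown IPI to every CPU):

```text
gup_fast(addr):
    local_irq_save(flags)      # this CPU can no longer service the shootdown IPI
    walk page tables for addr  # so the tables cannot be freed under us meanwhile
    try to grab a folio reference; fall back to the slow path if it races
    local_irq_restore(flags)
```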
<sima> there's also so much fun due to locking inversions
<sima> where you lookup a thing, grab the locks and then recheck whether you got the right one
<sima> and there's fundamentally no way to just take a lock to make things stable
<sima> and it's getting worse every year, like with lockless vma traversals and page faults
<DemiMarie> I wonder at what point it would actually have been faster (dev time wise) to formally prove the whole thing correct and not have to do the debugging.
<sima> DemiMarie, open random file in mm/ and stand back in awe at the if ladders
<sima> especially anything handling pagetable entries
<sima> but yeah formal proof probably good idea
<sima> but the issue is also, what do you even want to prove
<DemiMarie> "no memory corruption"
<sima> because some things look very, very fishy from a "will it livelock" pov
<sima> not even close to enough
<DemiMarie> no deadlocks, no livelocks, etc
<sima> the livelocks are real pain
<sima> and often stochastic stuff
<sima> like the race windows align such that you win often enough to never pile up, but if you'd have consistently bad luck you'd pile up
* DemiMarie wonders if past a certain point people should just be using multiple machines, rather than trying to make mm scale to huge machines
<sima> yes
<sima> cloud didn't happen just for fun
<heat> this is not just about making mm scale to huge machines
<heat> small machines are also heavily impacted
<heat> big locks suck
<sima> yeah small CrOS devices tend to really thrash mm
<heat> the per-vma locking patches address problems <checks notes> in android when apps create like 80 threads at startup
<DemiMarie> Big locks suck unless you care about reliability and security way more than performance. I suspect that is why OpenBSD is so full of them.
<heat> OpenBSD is full of them because it's a hobby kernel
<sima> yup
<heat> they would like to get rid of them and are slowly doing so
<sima> that too
<sima> like I think core mm is probably one place where rust wont help
<sima> like some of the memory barrier comments in there are just pure nightmare fodder
<DemiMarie> ATS might, though. That's full dependent & linear types.
<sima> since it's not just about your cpu code, but also about stuff like how tlb fetches actually walk pagetables on your machine
<heat> like, yes big locks make for simpler code, which is nice for security and reliability. but they also make you prone to suffer terrible choking on those huge locks, thus a reliability problem (and in effect, probably a security one, depending on what you're running)
<sima> DemiMarie, I think more formal proofing would be good, afaik only rcu in upstream linux is fully formally proved
<DemiMarie> sima: I was thinking of extracting core mm from F* or Coq.
JRepin has quit []
JRepin has joined #dri-devel
<DemiMarie> heat: I think safety critical systems prefer to use multiple components that are individually single-threaded. They can scale by having many cores that don't share memory.
<sima> device-exclusive was added, but not everywhere, boom in way too many places
<DemiMarie> Honestly I think userptr is rather cursed.
<DemiMarie> Can migration be reliable enough to make uAPI depend on it?
<DemiMarie> I also wonder if this could be dealt with using hypervisor magic: "hey, that page of mine is a blob object now"
riteo has joined #dri-devel
<mareko> zmike: why wouldn't it be supported?
<zmike> mareko: I have an MR to fix
fab has joined #dri-devel
rasterman has quit [Quit: Gettin' stinky!]
JRepin has quit []
JRepin has joined #dri-devel
davispuh has joined #dri-devel
sguddati has joined #dri-devel
sguddati has quit [Ping timeout: 480 seconds]
bolson has joined #dri-devel
vanjasaroda has joined #dri-devel
vanjasaroda has quit [Remote host closed the connection]
haaninjo has joined #dri-devel
traditionalwiki has joined #dri-devel
traditionalwiki has quit [Remote host closed the connection]
mbrost has joined #dri-devel
neverthelessmaniac has joined #dri-devel
Duke`` has joined #dri-devel
jsa1 has joined #dri-devel
mbrost has quit [Ping timeout: 480 seconds]
phasta has quit [Ping timeout: 480 seconds]
mbrost has joined #dri-devel
anholt has joined #dri-devel
jsa1 has quit [Ping timeout: 480 seconds]
tzimmermann has quit [Quit: Leaving]
lynxeye has quit [Quit: Leaving.]
JRepin has quit []
JRepin has joined #dri-devel
<jenatali> Ugh. Meson 1.5.1 can't use CMake to find LLVM 19
<jenatali> What a mess
<daniels> jenatali: ...
<jenatali> Means I need to rebuild the primary Windows container too to get a new meson apparently
* daniels twitches
<daniels> that was a deeply unpleasant time of my life
mbrost has quit [Ping timeout: 480 seconds]
<daniels> the bit where I broke up with my long-term girlfriend was probably way less damaging than Windows + Meson + LLVM + CMake + CI
<jenatali> Yeah... I got the build working locally with llvm19 so at least I'm pretty confident that just bumping meson should work
<daniels> heading out now, fingers crossed for you tho :)
<dj-death> daniels: and you do this for work...
mehdi-djait3397165695212282475 has quit []
alanc has quit [Remote host closed the connection]
alanc has joined #dri-devel
Kayden has quit [Quit: -> JF]
neverthelessmaniac has quit [Ping timeout: 480 seconds]
rz_ has quit [Ping timeout: 480 seconds]
jsa1 has joined #dri-devel
simplestofsuch has joined #dri-devel
<jenatali> Aaaand new meson doesn't install without long paths enabled
<jenatali> I hate dependency updates
<mareko> wouldn't it be nice if LLVM wasn't required by Mesa
<jenatali> Mhmm
<jenatali> LLVM as a runtime dependency is terrible
<kisak> mareko: hypothetically, how would you feel about delaying pulling llvm<18 support until after mesa 25.0-branchpoint and hopefully radeonsi/ACO is good to go for the newer AMD gfx generations by the time 25.1 rolls around? ~non sequitur~ If the mesa build sees llvm 15 is around, but not usable with radeonsi/llvm, will it automatically build radeonsi/ACO or will it fail the build as requirements not met
<kisak> for radeonsi/llvm?
<kisak> jenatali: llvm being too new for meson autodetect is a chronic issue. Over in Debian land, the build system adds in the equivalent to
<kisak> export PATH:=/usr/lib/llvm-15/bin/:$(PATH)
<jenatali> Yeah but Windows doesn't do llvm-config :(
<kisak> well, that's dandy
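An alternative to the PATH export kisak quotes is a Meson machine file that pins a specific llvm-config; a hypothetical sketch (the path is illustrative, and this doesn't help on Windows, where there is no llvm-config at all):

```ini
# native.ini -- select a versioned llvm-config for dependency('llvm');
# use with: meson setup builddir --native-file native.ini
[binaries]
llvm-config = '/usr/lib/llvm-15/bin/llvm-config'
```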
<jenatali> Fun, LLVM 19 requires /Zc:preprocessor for MSVC to be able to compile its headers
<jenatali> Hopefully Mesa likes that too
<jenatali> Looks like yes, phew
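What "Mesa likes that too" amounts to could be sketched as a meson.build fragment like the following (illustrative only, not Mesa's actual logic):

```meson
# Enable MSVC's conformant preprocessor, which LLVM 19's headers require.
cpp = meson.get_compiler('cpp')
if cpp.get_id() == 'msvc' and cpp.has_argument('/Zc:preprocessor')
  add_project_arguments('/Zc:preprocessor', language : 'cpp')
endif
```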
<dcbaker> jenatali: we shouldn’t require long paths in meson. That sounds like a bug on our end
<jenatali> dcbaker: It was a test that got run during chocolatey install that was too long
<jenatali> I'll grab the log, one sec
<alyssa> mareko: llvmpipe's existence makes that kind of a nonstarter..
<jenatali> dcbaker: Ah pip, not choco. Log: https://gitlab.freedesktop.org/mesa/mesa/-/jobs/70278327#L391
<jenatali> And I was wrong it's not meson it's numpy :(
<jenatali> Oh it's meson's tests running as part of numpy's install. Gross
<dcbaker> jenatali: of course it’s cmake… and of course it’s in numpy which has a vendored copy of meson while we get some of their stuff upstream…
<dcbaker> I wonder if I can ask the numpy folks to not run our tests on install
<jenatali> Seems like the right call
<dcbaker> Although that’s also an old version of numpy and numpy >=2.0 should work
<mareko> kisak: I can delay that. LLVM isn't required by AMD drivers and ACO is used when LLVM is disabled at build time, but it's also not a tested or optimized configuration on RDNA 1-4. It's possible that when you enable llvmpipe, it also enables LLVM for radeonsi.
<mareko> radeonsi+ACO likely won't be ready by 25.1
<jenatali> dcbaker: There's an issue with some of Mesa's scripts that prevent it from working with >= 2.0
ohogb has joined #dri-devel
<dcbaker> Sigh. I guess I only fixed piglit. I should probably fix that
<jenatali> Oh maybe it was piglit, I don't remember. That same container gets used to build both
<jenatali> Should've checked if that constraint could be removed. Oh well
Kayden has joined #dri-devel
feaneron has quit [Remote host closed the connection]
jkrzyszt has quit [Ping timeout: 480 seconds]
feaneron has joined #dri-devel
tobiasjakobi has joined #dri-devel
mbrost has joined #dri-devel
sima has quit [Ping timeout: 480 seconds]
Duke`` has quit [Ping timeout: 480 seconds]
phasta has joined #dri-devel
rsalvaterra_ has joined #dri-devel
rsalvaterra_ is now known as rsalvaterra
Duke`` has joined #dri-devel
ohogb has quit []
ohogb has joined #dri-devel
ohogb has quit []
ohogb has joined #dri-devel
phasta has quit [Ping timeout: 480 seconds]
dviola has joined #dri-devel
jsa1 has quit [Ping timeout: 480 seconds]
simplestofsuch has quit [Remote host closed the connection]
paulk-bis has joined #dri-devel
paulk has quit [Ping timeout: 480 seconds]
diacibenuci has joined #dri-devel
JRepin has quit []
JRepin has joined #dri-devel
jsa1 has joined #dri-devel
bolson has quit []
ryanneph has joined #dri-devel
<jenatali> Uh... glsl compiler warnings test is failing with access violation (segfault) and I don't repro it :(
<DemiMarie> sima: Actually, there is another option: try to migrate the pages, and if that is not possible, either return an error to userspace or leave the pages on the CPU and try again later.
mbrost has quit [Ping timeout: 480 seconds]
<jenatali> Uh... and passed on re-run. That's not good
diacibenuci has quit [Remote host closed the connection]
ledookyn has joined #dri-devel
ledookyn has quit [Remote host closed the connection]
Duke`` has quit [Ping timeout: 480 seconds]
anholt has quit [Ping timeout: 480 seconds]
guludo has quit [Ping timeout: 480 seconds]
anholt has joined #dri-devel
mvlad has quit [Remote host closed the connection]
Calandracas has quit [Remote host closed the connection]
Calandracas has joined #dri-devel
Calandracas_ has joined #dri-devel
mbrost has joined #dri-devel
jsa1 has quit [Ping timeout: 480 seconds]
Calandracas has quit [Ping timeout: 480 seconds]
haaninjo has quit [Quit: Ex-Chat]
Calandracas has joined #dri-devel
mbrost has quit [Ping timeout: 480 seconds]
Calandracas_ has quit [Ping timeout: 480 seconds]
ohogb has quit [Ping timeout: 480 seconds]