#dri-devel on 2023-06-12 — irc logs at oftc.irclog.whitequark.org

2022-12-21 00:45 ChanServ changed the topic of #dri-devel to: <ajax> nothing involved with X should ever be unable to find a bar

00:04 fxkamd has quit []

00:05 fxkamd has joined #dri-devel

00:23 fxkamd has quit []

00:25 sassefa has quit [Quit: sassefa]

00:25 sassefa has joined #dri-devel

00:28 benjaminl has quit [Ping timeout: 480 seconds]

00:38 co1umbarius has joined #dri-devel

00:39 columbarius has quit [Ping timeout: 480 seconds]

00:47 benjaminl has joined #dri-devel

00:55 Danct12 is now known as Guest2769

00:55 Danct12 has joined #dri-devel

01:09 YuGiOhJCJ has joined #dri-devel

01:14 <kode54> btw

01:14 <kode54> that issue I reported for flickering water layer in Borderlands 3?

01:14 <kode54> I replicated it in Borderlands 2

01:14 <kode54> I'll post as such in the issue

01:14 <kode54> easier to get to it there, you don't have to watch any benchmark

01:15 <kode54> you can just left-click and pan the view around to look at the lake in the valley below

01:47 heat has joined #dri-devel

01:47 heat_ has quit [Read error: Connection reset by peer]

02:01 camus has quit [Ping timeout: 480 seconds]

02:03 benjaminl has quit [Ping timeout: 480 seconds]

02:05 sassefa has quit [Remote host closed the connection]

02:20 camus has joined #dri-devel

02:22 camus1 has joined #dri-devel

02:28 camus has quit [Ping timeout: 480 seconds]

03:19 Company has quit [Quit: Leaving]

03:19 heat has quit [Ping timeout: 480 seconds]

03:28 bmodem has joined #dri-devel

03:30 TMM has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

03:30 TMM has joined #dri-devel

04:00 Danct12 has quit [Quit: WeeChat 3.8]

04:03 guru_ has quit []

04:04 oneforall2 has joined #dri-devel

04:06 jewins has quit [Ping timeout: 480 seconds]

04:21 benjaminl has joined #dri-devel

04:37 kzd has quit [Ping timeout: 480 seconds]

05:13 tzimmermann has joined #dri-devel

05:19 benjaminl has quit [Ping timeout: 480 seconds]

05:34 fab has joined #dri-devel

05:42 bgs has joined #dri-devel

05:46 JohnnyonFlame has joined #dri-devel

05:56 jkrzyszt has joined #dri-devel

06:08 Leopold__ has joined #dri-devel

06:08 sima has joined #dri-devel

06:11 Leopold_ has quit [Ping timeout: 480 seconds]

06:26 lyudess has joined #dri-devel

06:30 alanc has quit [Remote host closed the connection]

06:30 alanc has joined #dri-devel

06:32 Lyude has quit [Ping timeout: 480 seconds]

06:36 fab has quit [Quit: fab]

06:58 sghuge has quit [Remote host closed the connection]

06:58 sghuge has joined #dri-devel

06:58 benjaminl has joined #dri-devel

06:58 jfalempe has joined #dri-devel

07:05 benjaminl has quit [Quit: WeeChat 3.8]

07:07 tursulin has joined #dri-devel

07:13 fab has joined #dri-devel

07:16 frankbinns1 has joined #dri-devel

07:18 smiles_ has quit [Ping timeout: 480 seconds]

07:18 frankbinns has joined #dri-devel

07:19 frankbinns2 has joined #dri-devel

07:24 frankbinns1 has quit [Ping timeout: 480 seconds]

07:26 frankbinns has quit [Ping timeout: 480 seconds]

07:32 <pq> emersion, I read your reply on the KMS color pipeline thread, and I agree with everything your wrote.

07:33 pochu has joined #dri-devel

07:36 camus1 has quit [Read error: Connection reset by peer]

07:36 camus has joined #dri-devel

07:57 JohnnyonFlame has quit [Ping timeout: 480 seconds]

07:57 <emersion> pq, sweet!

07:59 lynxeye has joined #dri-devel

08:05 smiles_ has joined #dri-devel

08:09 swalker_ has joined #dri-devel

08:09 swalker_ is now known as Guest2793

08:11 swalker__ has joined #dri-devel

08:17 Guest2793 has quit [Ping timeout: 480 seconds]

08:19 anarsoul|2 has joined #dri-devel

08:19 anarsoul has quit [Read error: No route to host]

08:24 xroumegue has joined #dri-devel

08:26 camus1 has joined #dri-devel

08:26 camus has quit [Read error: Connection reset by peer]

08:35 camus has joined #dri-devel

08:35 camus1 has quit [Remote host closed the connection]

08:36 jewins has joined #dri-devel

08:44 jewins has quit [Ping timeout: 480 seconds]

08:45 oneforall2 has quit [Remote host closed the connection]

08:45 oneforall2 has joined #dri-devel

08:55 pochu has quit [Quit: leaving]

10:04 rsalvaterra has quit []

10:05 rsalvaterra has joined #dri-devel

10:23 Danct12 has joined #dri-devel

10:33 pochu has joined #dri-devel

10:44 Danct12 has quit [Ping timeout: 480 seconds]

10:56 rasterman has joined #dri-devel

11:04 Leopold__ has quit [Remote host closed the connection]

11:08 Leopold_ has joined #dri-devel

11:10 robmur01 has quit [Remote host closed the connection]

11:13 JohnnyonFlame has joined #dri-devel

11:15 robmur01 has joined #dri-devel

11:27 pochu has quit [Read error: Connection reset by peer]

11:28 pochu has joined #dri-devel

11:35 bmodem1 has joined #dri-devel

11:40 bmodem has quit [Ping timeout: 480 seconds]

11:59 Piraty has quit [Remote host closed the connection]

11:59 Piraty has joined #dri-devel

12:00 Piraty has quit []

12:02 Piraty has joined #dri-devel

12:16 vliaskov has joined #dri-devel

12:39 djbw_ has quit [Read error: Connection reset by peer]

12:49 kasper93_ is now known as kasper93

13:02 FireBurn has joined #dri-devel

13:03 heat has joined #dri-devel

13:05 <FireBurn> Would someone mind reverting 58e67bb3c131da5ee14e4842b08e53f4888dce0a I'm hoping to avoid it getting sent to airlied and onto linus

13:13 <zamundaaa[m]> Is there a way to import an EGL fence?

13:15 <zamundaaa[m]> I'm trying to blit a texture from one GPU to another, and with NVIdia that causes artifacts because of the lack of synchronization. Ideally I'd create an EGL fence on the source GPU, and have the other GPU wait before doing the blit with eglWaitSync, but I haven't found a way to actually get a fence for this on the destination GPU

13:20 <emersion> look at weston maybe

13:21 FireBurn has quit [Ping timeout: 480 seconds]

13:21 <pq> EGL_ANDROID_native_fence_sync might be the key

13:24 <zamundaaa[m]> ah, so the fd is passed in as an attribute. Thanks!

13:25 <ickle> win 16

13:36 jewins has joined #dri-devel

13:47 <emersion> sad that there's no drm_syncobj love

13:58 f11f12 has joined #dri-devel

13:58 alyssa has joined #dri-devel

13:59 <alyssa> gfxstrand: ok, I have something typed out to kill off abs/neg/fsat modifiers without requiring any nontrivial changes to backends

13:59 <alyssa> (in particular, it does not require the backend to have working copyprop or dead code elimination)

13:59 <alyssa> I hate it, but more than that I hate that we have backends that don't have DCE

13:59 <alyssa> and, it means we actually have a chance of killing them off

14:00 <gfxstrand> :sob

14:00 <alyssa> so, probably worth the stupid

14:00 <alyssa> the usual strategy--

14:00 <alyssa> ahead-of-time trivialize pass that inserts copies to ensure fabs/fneg/fsat are folded 100% of the time,

14:00 <alyssa> helpers to chase through fabs/fneg/fsat at backend isel time,

14:01 <alyssa> and a gaurantee to backends that fabs/fneg/fsat will be chased 100% of the time so they just need to Not emit any code for them

14:02 <gfxstrand> Running HSW now

14:03 <gfxstrand> Let's see how bad the damage is.

14:08 fab has quit [Remote host closed the connection]

14:10 <alyssa> from the nir_register changes?

14:10 <alyssa> (Intel doesn't use lower_to_source_mods anymore so thankfully it's spared of this particular abomination)

14:10 <alyssa> only uses are ntt, etnaviv, a2xx, lima, and r600/sfn

14:11 <alyssa> I am not volunteering to rewrite people's compilers

14:11 <alyssa> so.. this the consolation prize

14:12 <gfxstrand> I'm more worried about vec-to-reg

14:13 <alyssa> nod

14:13 <alyssa> midgard seems happy with it

14:13 <gfxstrand> Okay, ptn bug fixed.

14:21 swalker__ has quit [Remote host closed the connection]

14:23 kzd has joined #dri-devel

14:27 <gfxstrand> alyssa: https://paste.centos.org/view/89c5ba29

14:27 <gfxstrand> I've not done any analysis on why

14:28 <gfxstrand> Also, that's vec4-only. I filtered out FS/CS.

14:28 zehortigoza has quit [Quit: Coyote finally caught me]

14:28 zehortigoza has joined #dri-devel

14:28 <alyssa> gfxstrand: :| disappointing

14:28 <alyssa> I mean. I would still rip off the bandaid personally, but

14:29 <alyssa> midgard was total instructions in shared programs: 1518573 -> 1514188 (-0.29%)

14:29 <gfxstrand> I'll look into it this afternoon

14:29 <alyssa> I guess what we're seeing here is that Intel has significantly better vec4 copyprop than Midgard and we're getting a regression to the mean

14:31 <alyssa> gfxstrand: what's your personal threshold for acceptable shaderdb hit?

14:36 nehsou^ has quit [Remote host closed the connection]

14:45 hramrach has joined #dri-devel

14:47 <hramrach> hello, what card are supported?

14:48 FireBurn has joined #dri-devel

14:49 <hramrach> RADV page https://docs.mesa3d.org/drivers/radv.html is a stub that point to https://www.x.org/wiki/RadeonFeature/ which has a nice feature table which ends with Arctic Islands. So I suppose for Navi I should turn to Windows?

14:50 <pendingchaos> https://www.x.org/wiki/RadeonFeature isn't useful for determining hardware support

14:50 <pendingchaos> that feature table isn't about radv

14:51 <pendingchaos> apparently it documents radeonsi, but clearly outdated

14:51 <hramrach> so what is useful documentation for radv?

14:52 <pendingchaos> https://docs.mesa3d.org/envvars.html#radv-driver-environment-variables

14:52 <pendingchaos> besides those, it's just a Vulkan driver

14:52 <alyssa> gfxstrand: also, pushed nir/legacy-mods, it has your pushed fix squashed in though not the unpushed ptn fix

14:53 <hramrach> but that documents driver some diver settings, not what hardware it supports

14:54 <pendingchaos> RADV should support all AMD GPUs supporting Vulkan

14:55 <hramrach> the moment they are released?

14:56 <pendingchaos> there might be some delay (both because of release schedules and development effort) depending on how different the new GPU is from predecessors

14:57 <pendingchaos> gfx1100 and gfx1101 for example, should be basically the same

14:57 <pendingchaos> gfx1030 and gfx1100 had significant differences

14:57 <hramrach> so how do I tell when a GPU has aged enough to be supported?

14:57 <llyyr> rdna3 was supported pre-release already

14:58 <llyyr> generally stuff should work on release but they might be buggy and that gets sorted out over time

14:59 <hramrach> That would be nice improvement since the times that table is from

14:59 <pendingchaos> I don't think there's any official list of RADV hardware support, so you can't easily tell

14:59 <llyyr> radv supports all GCN/RDNA cards

14:59 <pendingchaos> I think usually phoronix and such will release an article when a generation of gpus is supported

15:00 <llyyr> so from hd 7000 series up to rx 7xxx

15:00 fab has joined #dri-devel

15:01 <hramrach> yes, phoronix would probably have that

15:04 <pendingchaos> ah, release notes also have new hardware support

15:04 <pendingchaos> like https://docs.mesa3d.org/relnotes/22.3.0.html has "Mali T620 on panfrost" and "initial GFX11/RDNA3 support on RADV"

15:05 <hramrach> but they are split by version, not by hardware

15:12 fxkamd has joined #dri-devel

15:16 tzimmermann has quit [Quit: Leaving]

15:19 alyssa has left #dri-devel [#dri-devel]

15:20 pochu has quit [Quit: leaving]

15:20 djbw_ has joined #dri-devel

15:22 <mupuf> hramrach: assume everything to work, unless the hardware is really exotic

15:22 <mupuf> If it doesn't: file a bug

15:23 <mupuf> Generally, the faster GPUs are better supported

15:23 <mupuf> Unless they are really expensive

15:23 <mupuf> (CDNA cards for example)

15:26 <pendingchaos> I don't think RADV works at all on CDNA

15:26 <pendingchaos> at least, I don't think the newer ones can support Vulkan

15:29 <mupuf> Right :D

15:29 tursulin has quit [Ping timeout: 480 seconds]

15:30 jkrzyszt has quit [Ping timeout: 480 seconds]

15:37 idr has joined #dri-devel

15:42 Duke`` has joined #dri-devel

15:42 bmodem1 has quit [Ping timeout: 480 seconds]

15:43 mbrost has joined #dri-devel

15:47 benjaminl has joined #dri-devel

16:00 fab has quit [Remote host closed the connection]

16:10 <hramrach> so let's say that consumer cards should work, server cards not

16:10 <hramrach> thanks

16:11 jessica_24 has joined #dri-devel

16:12 pcercuei has joined #dri-devel

16:21 JohnnyonF has joined #dri-devel

16:23 JohnnyonFlame has quit [Ping timeout: 480 seconds]

16:27 alyssa has joined #dri-devel

16:27 <alyssa> Why is the LLVM IR generated by gallivm so chunky T_T

16:27 <alyssa> I guess that includes a big chunk of rasterizer in there too?

16:29 caseif has quit [Quit: ZNC 1.8.2+deb2+b1 - https://znc.in]

16:29 caseif has joined #dri-devel

16:41 YuGiOhJCJ has quit [Quit: YuGiOhJCJ]

16:44 caseif has quit [Quit: ZNC 1.8.2+deb3.1 - https://znc.in]

16:44 caseif has joined #dri-devel

16:45 <DavidHeidelberg[m]> anholt: btw. are the `swrast` runners definitely lost or at some point there is chance in future?

16:45 <anholt> they are gone. not going to be standing anything up at least until we have kata.

16:49 <agd5f> pendingchaos, you could support vulkan on CDNA cards, but they would only have transfer/compute/media queues, no GFX.

16:50 <pendingchaos> didn't the recent CDNA remove texture filtering? I think Vulkan requires that

16:55 <pendingchaos> I guess it probably could be emulated with a lot of effort

16:55 <pendingchaos> but maybe there's more mandatory Vulkan features that are missing

16:57 Duke`` has quit [Ping timeout: 480 seconds]

16:58 <agd5f> pendingchaos, should still be supported, at least according to the MI200 ISA document

16:58 <agd5f> supports everything needed for OCL 2.x

17:00 <pendingchaos> seems to have image_sample

17:00 <pendingchaos> no mipmaps though?

17:03 <pendingchaos> or gather

17:03 <alyssa> agd5f: then why did Marek need to do a software texturing pipeline for CDNA?

17:05 Duke`` has joined #dri-devel

17:10 <jenatali> gfxstrand, alyssa : Ping on !23173, there's still a few nir patches in there in need of ack/review

17:14 smiles_ has quit [Ping timeout: 480 seconds]

17:24 <alyssa> jenatali: trade you? https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests?scope=all&state=opened&author_username=alyssa&label_name[]=NIR&milestone_title=Needs%20review

17:24 <jenatali> :P

17:24 <alyssa> the scoped barriers one is only 3 patches left

17:24 <alyssa> https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23191#note_1946902

17:24 <jenatali> I suppose that's only fair, I'll take a look

17:32 kts has joined #dri-devel

17:33 <agd5f> alyssa, maybe it is then? Not sure.

17:45 lynxeye has quit [Quit: Leaving.]

17:55 stuarts has joined #dri-devel

17:55 flibitijibibo has joined #dri-devel

18:24 pcercuei has quit [Quit: leaving]

18:29 ngcortes has joined #dri-devel

18:44 <alyssa> jenatali: thoughts on https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23351/diffs?commit_id=f6e9ab6b547af1b0eb241e5afa133ddc2b04e4c8 ?

18:45 <alyssa> the whole SM5 shift mess is, as usual, a mess

18:46 <alyssa> The obviously alternative is changing ubfe_imm to only produce ubfe if lower_bitfield_extract is set, otherwise, ubitfield_extract is produced

18:46 <alyssa> s/obviously/obvious/

18:46 <alyssa> It's unclear to me if that's better or worse

18:47 <alyssa> For the _imm case, ubfe and ubitfield_extract are interchangeable (since we can just mask the immediate at build time)

18:47 <alyssa> (or better yet, assert the immediate < 32)

18:47 <alyssa> Hmm.. maybe I should do that actually

18:47 <alyssa> pendingchaos: thoughts on ^^?

18:53 <alyssa> I once again wonder if the default really should be khronos behaviour and _sm5 suffixed ops do the masked thing... meh

18:53 <alyssa> would like to kick that can down the road again though.. I just want ubfe_imm or equivalent for agx

18:57 Kayden has quit [Quit: -> JF]

19:02 benjaminl has quit [Ping timeout: 480 seconds]

19:07 Haaninjo has joined #dri-devel

19:15 <jenatali> 🤷‍♂️ I looked at it but I don't really have any strong opinions on the matter

19:17 digetx is now known as Guest2834

19:17 digetx has joined #dri-devel

19:19 <alyssa> valid

19:19 <alyssa> maybe pendingchaos does

19:19 benjaminl has joined #dri-devel

19:20 Guest2834 has quit [Ping timeout: 480 seconds]

19:20 Kayden has joined #dri-devel

19:24 <pendingchaos> I think building ubfe/ubitfield_extract depending on lower_bitfield_extract and using a unified helper makes sense, but having two helpers doesn't sound like a real problem

19:25 <alyssa> sure

19:25 <alyssa> let me know the preferred bikeshed colour and I'll paint it

19:29 kts has quit [Quit: Konversation terminated!]

19:29 rasterman has quit [Quit: Gettin' stinky!]

19:35 benjaminl has quit [Ping timeout: 480 seconds]

19:37 kts has joined #dri-devel

19:37 <alyssa> airlied: when you have a few minutes could I pick your brain about AoS/SoA gallivm?

19:37 vliaskov has quit []

19:37 <alyssa> usually don't like to "ask to ask" but I don't yet have a coherent question formulated

19:38 gio_ has joined #dri-devel

19:38 <airlied> alyssa: my brain has defeated you by purge AoS/SoA knowledge right down to knowing which one is which

19:38 <airlied> but yes ask and pick away

19:39 gio has quit [Read error: Connection reset by peer]

19:40 <airlied> alyssa: also sampling AoS/SoA is slightly different to the AoS/SoA execution model

19:40 <airlied> by default we use soa execution and mostly soa sampling but sometimes sampling goes to aos mode

19:41 <airlied> for one narrow use case we use aos execution

19:42 kts has quit [Quit: Konversation terminated!]

19:43 <alyssa> oh boy

19:43 <alyssa> airlied: The basic question I have is that load_reg/store_reg take arrays of LLVMValueRefs

19:44 <alyssa> instead of just a single LLVMValueRef for the whole vector

19:44 kts has joined #dri-devel

19:44 benjaminl has joined #dri-devel

19:44 <alyssa> it seems in the AoS path only the [0] component is used

19:44 <alyssa> but in the SoA path every component is used separately

19:45 <alyssa> I guess "AoS" is like vec4 gpus and "SoA" is like scalar GPUs?

19:45 <alyssa> in that case, why would gallivm even see vectorized NIR in the first place?

19:45 <alyssa> why not scalarize completely in NIR, so we only need the single LLVMValueRef (corresponding to either the one component or the whole vector)?

19:46 <airlied> probably because the core code was originally TGSI designed and TGSI is vec4

19:46 <airlied> so it just kept doing that when I ported it to NIR, and handled vecotrs

19:46 <airlied> but it doesn't really correspond to GPUs that well

19:47 <alyssa> OK

19:47 <airlied> SoA mode is it stores 4/8-wide scalars

19:47 <airlied> so a vector in SoA mode is just a set of vec-len scalars each of which is 4/8 channels wide

19:48 <airlied> depending on avx etc

19:48 <alyssa> yes, that's how scalar GPUs work

19:48 <airlied> oh my scalar gpus have uniform regs which llvmpipe doesn't :-P

19:48 <alyssa> mine don't

19:49 <airlied> AoS is a special case for storing 16-wide chars

19:49 <airlied> so that you can process 4 8-bit RGBA pixels in one go

19:49 <airlied> it's very limited in scope in what you can do

19:49 <airlied> it's just to provide a fast path for blits and copies

19:49 <alyssa> Right, ok

19:50 <airlied> so yes we could probably scalarize completely in NIR for the aos case, but the TGSI code still exists

19:50 <alyssa> OK

19:50 <alyssa> mostly i'm trying to understand why assign_dest (for example) takes an array of valuerefs instead of just one

19:50 <alyssa> but you're saying that's just TGSI legacy?

19:51 <airlied> what one value ref would it take?

19:51 <airlied> if dest has 4 components

19:51 <airlied> you can't do vectors of arrays of values in llvm IR

19:51 <alyssa> why would you ever have that, though?

19:51 <airlied> because we haven't scalarised 4 component stores

19:51 <alyssa> ooh

19:51 <airlied> though maybe in practice we have

19:52 <alyssa> like, store_ssbo?

19:52 <airlied> I think the main uses caess are the vec4 type constructors

19:52 <airlied> nir_vec4 etc

19:53 <alyssa> right..

19:53 <airlied> where you have one ssa value that is a vector of scalars but the scalars are 8-wide arrays

19:53 <alyssa> maybe I'm objecting to the "Loop for R,G,B,A channels" in the SoA case in visit_alu

19:53 ngcortes has quit [Ping timeout: 480 seconds]

19:53 <alyssa> not really interested in reworking this. just trying to figure out what to do for my NIR rework

19:53 <alyssa> and today is llvmpipe

19:53 <alyssa> day

19:55 fab has joined #dri-devel

19:55 tobiasjakobi has joined #dri-devel

19:55 benjaminl has quit [Ping timeout: 480 seconds]

19:55 <airlied> yeah so we do all the operations once on each component of the vector, then collect the results, then store them back as an array

19:55 tobiasjakobi has quit []

19:56 <airlied> I just didn't see the value for register stores of sticking them into an LLVM array

19:56 <airlied> just to pull them back out again

19:56 <airlied> since register stores actually go to memory, as opposed to just hide inside the ssa value hash table

19:56 <alyssa> why are there multiple components on the vector?

19:56 <alyssa> aren't we calling lower_alu_to_scalar in the SoA case?

19:57 <alyssa> I guess we aren't

19:57 <alyssa> we should be, I guess

19:57 <airlied> lavapipe does, not sure llvmpipe does

19:57 <airlied> probably a cleanup possible there

19:57 <alyssa> doesn't look like it does

19:57 <alyssa> yeah.. not today's cleanup though

19:57 <alyssa> currently defeaturing nir_register from llvmpipe

19:58 <airlied> there's probably quite a lot of llvm side stuff that could be moved to NIR side

19:58 <alyssa> Yeah

19:58 <airlied> it's mostly a legacy of TGSI and whatever state nir was in when I wrote it

19:58 <alyssa> piles of the graphics pipeline emulation code could be common NIR passes too I think

19:58 <alyssa> llvmpipe using nir_lower_blend anyone? ;-D

19:58 <airlied> oh that stuff is so finely hand written

19:59 <alyssa> D=

20:01 <airlied> I fear to tread in the blending pipeline, so many hand coded swizzle calls that I don't really understand

20:03 <gfxstrand> That sounds like a good argument for NIRifying

20:03 <alyssa> :crab_fire:

20:07 <airlied> it would be, but I doubt it would get as fast

20:07 <airlied> since it's mostly hand writing LLVM IR to optimise thing

20:07 <airlied> not sure translating NIR would achieve the same level, since NIR doesn't have a view into the LLVM 4-8 wide fun

20:10 <airlied> alyssa: I think the other reason we don't scalarise in NIR is for the soa/aos decision point there might not be a simple point to do it

20:10 <alyssa> hum

20:10 zf has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

20:11 zf has joined #dri-devel

20:11 zf has quit []

20:12 <mareko> alyssa: the CDNA thing is going to be answered in due time

20:13 <airlied> mareko: do you know if anyone runs a cdna card in a workstation? :-)

20:14 <mareko> airlied: I don't know much about where CDNA is used

20:14 <airlied> seems to be server gpus, but who wants a rack in their home :-P

20:15 <alyssa> mareko: mysterious ^_^

20:16 <mareko> other than what's publicly knows, such as El Capitan

20:16 <mareko> *known

20:18 <HdkR> airlied: I have a rack in my home :P

20:19 <HdkR> Might get a second one if I feel like experimenting a bit more

20:21 <airlied> HdkR: do you have soundproofing :-)

20:21 <airlied> the only locations I could put a rack would be near me or outside in the sun

20:21 <mareko> and cooling

20:22 <HdkR> Nah, I'm only rocking 4U chassis with slow spinning fans in it. Loudest thing is one of the 10gbit network switches

20:22 <alyssa> airlied: Do you hate this https://rosenzweig.io/0001-gallivm-Switch-to-reg-intrinsics.patch

20:23 * airlied has to wear noise cancelling headphones to compile llvm or use the preprod navi33 card I have

20:23 <alyssa> Lool

20:23 <HdkR> Sounds about right for most rack-mount things :D

20:23 <karolherbst> airlied: that reminds me...

20:23 ngcortes has joined #dri-devel

20:24 <karolherbst> btw, how did you end up connecting that one to your system?

20:25 sima has quit [Ping timeout: 480 seconds]

20:25 <airlied> alyssa: seems to be about what I'd expect

20:26 <airlied> karolherbst: I ended up turning my machine on it's side, putting it on a cardboard box, and when I put that card in I stick another piece of cardboard box between it and the PSU to ensure it is supported

20:26 <alyssa> airlied: =D

20:26 <karolherbst> oof

20:26 <airlied> I really should get an PCIE extender so I can put it flat on something

20:27 <karolherbst> yeah.. I planend to use a PCIe extender as well...

20:27 <karolherbst> forgot about it

20:28 <alyssa> airlied: welp, 4 backends down, 11 to go.. ugh... https://gitlab.freedesktop.org/mesa/mesa/-/issues/9051

20:29 <airlied> alyssa: would be interested to see what CI says

20:29 <alyssa> me too

20:30 <jenatali> alyssa: Are there that many backends that consume registers?

20:30 <alyssa> jenatali: Sadly, yes

20:30 <DemiMarie> anholt: I take it you mean “kata containers”? Is that to prevent any more sandbox escapes?

20:30 <jenatali> Oof

20:31 <alyssa> though DXIL isn't one of them so you're off the hook I Guess

20:31 kts has quit [Remote host closed the connection]

20:32 <jenatali> Yep

20:32 <alyssa> I need to update the MR description

20:33 <alyssa> 'cause killing off abs/neg/sat modifiers is also in scope for this now :~)

20:33 zf has joined #dri-devel

20:33 <DemiMarie> Why is CDNA still considered a GPU? It can’t even do graphics, so I imagine it would belong under drivers/accel instead.

20:33 <alyssa> that's a lot less onerous though, since no mature backends use them

20:33 <alyssa> ntt, etnaviv, a2xx, lima, and r600

20:33 <alyssa> and I did ntt

20:34 <alyssa> IDK who's going to do the other 4

20:34 <alyssa> I typed up a lot of helpers to make it painless as possible to migrate, but even so

20:35 <alyssa> I'm not volunteering myself to work on those 4

20:36 fab has quit [Quit: fab]

20:37 benjaminl has joined #dri-devel

20:38 rasterman has joined #dri-devel

20:42 ngcortes has quit [Read error: Connection reset by peer]

20:43 rasterman has quit [Quit: Gettin' stinky!]

20:44 <alyssa> gfxstrand: ooh I hit the prog-to-nir thing in CI, fun

20:44 <alyssa> pushed to the MR the source/dest modifier stuff and the llvmpipe conversion at any rate

20:46 <alyssa> zink, you're up

21:12 mbrost has quit [Ping timeout: 480 seconds]

21:14 Duke`` has quit [Ping timeout: 480 seconds]

21:31 frankbinns1 has joined #dri-devel

21:32 Andrew-R has joined #dri-devel

21:32 HerrSpliet has joined #dri-devel

21:32 gcarlos57 has joined #dri-devel

21:32 jkhsjdhjs_ has joined #dri-devel

21:32 lcn_ has joined #dri-devel

21:33 enunes- has joined #dri-devel

21:33 phryk_ has joined #dri-devel

21:33 tonyk5 has joined #dri-devel

21:33 calebccff_ has joined #dri-devel

21:33 dwlsalmeida4 has joined #dri-devel

21:33 larunbe has joined #dri-devel

21:33 JoshuaAs- has joined #dri-devel

21:33 _jannau__ has joined #dri-devel

21:33 tales-aparecida8 has joined #dri-devel

21:34 <airlied> do I report spam or get unlimited vbucks, big choices

21:34 jeeeun8413519 has joined #dri-devel

21:34 macc24_ has joined #dri-devel

21:34 uajain_ has joined #dri-devel

21:34 milek7_ has joined #dri-devel

21:34 grillo_03 has joined #dri-devel

21:34 bnieuwenhuizen_ has joined #dri-devel

21:35 Plagman has joined #dri-devel

21:35 aleasto- has joined #dri-devel

21:35 Arsen_ has joined #dri-devel

21:35 gpiccoli_ has joined #dri-devel

21:35 V_ has joined #dri-devel

21:35 pH5_ has joined #dri-devel

21:35 mavchatz has joined #dri-devel

21:35 mlankhor1t has joined #dri-devel

21:36 xroumegue has quit [charon.oftc.net helix.oftc.net]

21:36 frankbinns2 has quit [charon.oftc.net helix.oftc.net]

21:36 RSpliet has quit [charon.oftc.net helix.oftc.net]

21:36 AndrewR has quit [charon.oftc.net helix.oftc.net]

21:36 alarumbe has quit [charon.oftc.net helix.oftc.net]

21:36 i-garrison has quit [charon.oftc.net helix.oftc.net]

21:36 mvchtz has quit [charon.oftc.net helix.oftc.net]

21:36 milek7 has quit [charon.oftc.net helix.oftc.net]

21:36 Hi-Angel has quit [charon.oftc.net helix.oftc.net]

21:36 bbrezillon has quit [charon.oftc.net helix.oftc.net]

21:36 jeeeun841351 has quit [charon.oftc.net helix.oftc.net]

21:36 mwk[m] has quit [charon.oftc.net helix.oftc.net]

21:36 bnieuwenhuizen has quit [charon.oftc.net helix.oftc.net]

21:36 mauld has quit [charon.oftc.net helix.oftc.net]

21:36 fdu has quit [charon.oftc.net helix.oftc.net]

21:36 kusma has quit [charon.oftc.net helix.oftc.net]

21:36 urja has quit [charon.oftc.net helix.oftc.net]

21:36 calebccff has quit [charon.oftc.net helix.oftc.net]

21:36 JosExpsito[m] has quit [charon.oftc.net helix.oftc.net]

21:36 enunes[m] has quit [charon.oftc.net helix.oftc.net]

21:36 nicofee[m] has quit [charon.oftc.net helix.oftc.net]

21:36 YaLTeR[m] has quit [charon.oftc.net helix.oftc.net]

21:36 ram15[m] has quit [charon.oftc.net helix.oftc.net]

21:36 halfline[m] has quit [charon.oftc.net helix.oftc.net]

21:36 APic has quit [charon.oftc.net helix.oftc.net]

21:36 LinuxHackerman has quit [charon.oftc.net helix.oftc.net]

21:36 gallo[m] has quit [charon.oftc.net helix.oftc.net]

21:36 bubblethink[m] has quit [charon.oftc.net helix.oftc.net]

21:36 danylo has quit [charon.oftc.net helix.oftc.net]

21:36 aradhya7[m] has quit [charon.oftc.net helix.oftc.net]

21:36 fkassabri[m] has quit [charon.oftc.net helix.oftc.net]

21:36 hch12907 has quit [charon.oftc.net helix.oftc.net]

21:36 Sumera[m] has quit [charon.oftc.net helix.oftc.net]

21:36 doras has quit [charon.oftc.net helix.oftc.net]

21:36 Eighth_Doctor has quit [charon.oftc.net helix.oftc.net]

21:36 T_UNIX has quit [charon.oftc.net helix.oftc.net]

21:36 go4godvin has quit [charon.oftc.net helix.oftc.net]

21:36 Mis012[m] has quit [charon.oftc.net helix.oftc.net]

21:36 Tooniis[m] has quit [charon.oftc.net helix.oftc.net]

21:36 talcohen[m] has quit [charon.oftc.net helix.oftc.net]

21:36 x512[m] has quit [charon.oftc.net helix.oftc.net]

21:36 tintou has quit [charon.oftc.net helix.oftc.net]

21:36 gdevi has quit [charon.oftc.net helix.oftc.net]

21:36 AlexisHernndezGuzmn[m] has quit [charon.oftc.net helix.oftc.net]

21:36 kunal10710[m] has quit [charon.oftc.net helix.oftc.net]

21:36 pinchart1 has joined #dri-devel

21:36 fdu_ has joined #dri-devel

21:36 vidal72[m] has quit [charon.oftc.net helix.oftc.net]

21:36 tuxayo has quit [charon.oftc.net helix.oftc.net]

21:36 heftig has quit [charon.oftc.net helix.oftc.net]

21:36 Soroush has quit [charon.oftc.net helix.oftc.net]

21:36 shoragan has quit [charon.oftc.net helix.oftc.net]

21:36 pH5 has quit [charon.oftc.net helix.oftc.net]

21:36 pinchartl has quit [charon.oftc.net helix.oftc.net]

21:36 siddh has quit [charon.oftc.net helix.oftc.net]

21:36 jkhsjdhjs has quit [charon.oftc.net helix.oftc.net]

21:36 tales-aparecida has quit [charon.oftc.net helix.oftc.net]

21:36 dwlsalmeida has quit [charon.oftc.net helix.oftc.net]

21:36 _jannau_ has quit [charon.oftc.net helix.oftc.net]

21:36 phryk has quit [charon.oftc.net helix.oftc.net]

21:36 gcarlos5 has quit [charon.oftc.net helix.oftc.net]

21:36 mlankhorst has quit [charon.oftc.net helix.oftc.net]

21:36 aleasto has quit [charon.oftc.net helix.oftc.net]

21:36 lcn has quit [charon.oftc.net helix.oftc.net]

21:36 tonyk has quit [charon.oftc.net helix.oftc.net]

21:36 macc24 has quit [charon.oftc.net helix.oftc.net]

21:36 Arsen has quit [charon.oftc.net helix.oftc.net]

21:36 grillo_0 has quit [charon.oftc.net helix.oftc.net]

21:36 uajain has quit [charon.oftc.net helix.oftc.net]

21:36 enunes has quit [charon.oftc.net helix.oftc.net]

21:36 V has quit [charon.oftc.net helix.oftc.net]

21:36 Plagman_ has quit [charon.oftc.net helix.oftc.net]

21:36 JoshuaAshton has quit [charon.oftc.net helix.oftc.net]

21:36 gpiccoli has quit [charon.oftc.net helix.oftc.net]

21:36 jeeeun8413519 is now known as jeeeun841351

21:36 lcn_ is now known as lcn

21:36 jkhsjdhjs_ is now known as jkhsjdhjs

21:36 lyudess has quit []

21:36 dwlsalmeida4 is now known as dwlsalmeida

21:36 grillo_03 is now known as grillo_0

21:36 Lyude has joined #dri-devel

21:36 mauld has joined #dri-devel

21:38 apinheiro has joined #dri-devel

21:38 APic has joined #dri-devel

21:39 urja has joined #dri-devel

21:39 xroumegue has joined #dri-devel

21:39 i-garrison has joined #dri-devel

21:39 bbrezillon has joined #dri-devel

21:41 <alyssa> what happened with https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23191 ?

21:41 <alyssa> the CI pipeline I mean

21:41 <alyssa> I am very confused

21:41 <jenatali> There was a failed job

21:45 <alyssa> and that failed the whole pipeline?

21:45 <alyssa> neat. that's new

21:45 <jenatali> Hm? Is it new?

21:46 rcf has quit [Ping timeout: 480 seconds]

21:46 <alyssa> maybe?

21:47 <alyssa> well, in that case I need help since IIRC iris doesn't build on arm

21:48 * alyssa tries anyway in case that was fixed

21:48 tintou has joined #dri-devel

21:48 go4godvin has joined #dri-devel

21:49 go4godvin is now known as Guest2915

21:49 pinchart1 is now known as pinchartl

21:50 <alyssa> oh, it does, cool

21:50 <alyssa> where's my drm-shim though

21:51 <alyssa> iris doesn't have drm-shim? :(

21:51 <alyssa> intel_stub_gpu. right

21:52 LinuxHackerman has joined #dri-devel

21:53 <alyssa> OK, reproduced

21:57 <alyssa> Ohhhh

21:57 <alyssa> Lol

21:57 <alyssa> OK

21:57 <alyssa> I see what happened

21:57 fkassabri[m] has joined #dri-devel

21:57 <alyssa> whoopsies

21:57 <alyssa> today's edition of "stupid spot the bug"

21:58 <alyssa> and fixed

21:58 <alyssa> well test still crashes for me because of arb_fragment_shader_interlock-image-load-store: ../src/intel/isl/isl_tiled_memcpy.c:609: choose_copy_function: Assertion `!"" "ISL_MEMCOPY_STREAMING_LOAD requires sse4.1"' failed.

21:59 <HdkR> A bit difficult to get SSE4.1 on your Macbook

21:59 <alyssa> Little bit yeah

21:59 <gfxstrand> hehe... Yeah....

22:00 <gfxstrand> I thought we had a non-SSE path

22:00 <alyssa> gfxstrand: you do, but iris was specifically asking for streaming

22:00 <gfxstrand> Ah, yes...

22:00 <gfxstrand> Because it can

22:00 <gfxstrand> Because it only runs BDW+ which is always paired with a GPU that supports SSE4.1

22:00 <gfxstrand> Unless that GPU is an Arc in which case it could be plugged into raspberry pi for all you know.

22:01 <alyssa> Yep

22:01 <alyssa> Well, not a raspberry pi I don't think

22:01 <alyssa> low to mid-tier arm doesn't work with dGPUs usually

22:01 <HdkR> Probably more a SolidRun Honeycomb or Ampere eMAG

22:01 <alyssa> yeah

22:01 <alyssa> server grade arm64 + dGPU

22:05 YaLTeR[m] has joined #dri-devel

22:09 <DemiMarie> Are there any mid-grade Arm64 chips?

22:09 <jenatali> Oof, that's a fun bug. Glad there was a test that caught it, though I'm surprised there was only one failure

22:09 <DemiMarie> mid-grade = desktop PC class

22:09 <gfxstrand> Apple

22:10 <gfxstrand> Otherwise, not that I'm aware of.

22:10 halfline[m] has joined #dri-devel

22:10 Piraty has quit [Remote host closed the connection]

22:11 <DemiMarie> Is that likely to ever change?

22:11 Piraty has joined #dri-devel

22:11 <jenatali> Some of QC's higher end chips are approaching that IMO

22:12 Hi-Angel has joined #dri-devel

22:16 <DemiMarie> gfxstrand: I’m a little bit salty about Apple having so many non-standard SMMUs. Means that Xen support for Apple Silicon is unlikely to ever happen.

22:16 hch12907 has joined #dri-devel

22:20 calebccff_ is now known as calebccff

22:25 <alyssa> gfxstrand: Hmm?

22:25 <alyssa> oh I see

22:28 <alyssa> ok, zink is converted too

22:28 <alyssa> I think that's enough backends for proving the design is sensible

22:28 Haaninjo has quit [Quit: Ex-Chat]

22:28 <alyssa> i'm off for the night then

22:29 T_UNIX has joined #dri-devel

22:29 <alyssa> pretty steady progress though

22:30 <alyssa> getting close to taking the Draft status off, so that's exciting

22:30 <alyssa> for me

22:30 <alyssa> not exciting if you were ignoring it and will soon have to convert your backends :~P

22:32 x512[m] has joined #dri-devel

22:37 apinheiro has quit [Quit: Leaving]

22:39 bgs has quit [Remote host closed the connection]

22:45 Sumera[m] has joined #dri-devel

22:48 rcf has joined #dri-devel

22:58 <Lynne> who works on vulkaninfo? khronos?

23:04 TMM has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

23:04 TMM has joined #dri-devel

23:10 smiles_ has joined #dri-devel

23:12 <gfxstrand> alyssa: Okay, I think I found at least some of the HSW regressions

23:12 <gfxstrand> lower_vec_to_movs is being a tiny bit more clever about placement of register stores.

23:12 <gfxstrand> But in a pretty niche edge-case

23:14 <bnieuwenhuizen_> Lynne: lunarg

23:14 <Lynne> do they have an issue tracker?

23:14 <gfxstrand> Lynne: https://github.com/KhronosGroup/Vulkan-Tools

23:16 <Lynne> thanks, I thought about writing an equivalent to vainfo/vdpauinfo/nv-video-info, but thought it would be better off being a part of vulkaninfo

23:18 Kayden has quit [Quit: -> leaving office]

23:18 <gfxstrand> alyssa: Basically, you're missing try_coalesce

23:19 <gfxstrand> Or rather your coalesce_swizzle thing isn't quite as good for some reason.

23:22 <alyssa> gfxstrand: I'd believe it

23:22 JohnnyonF has quit [Ping timeout: 480 seconds]

23:35 <gfxstrand> alyssa: So the big difference as far as I can tell is that try_coalesce in lower_vec_to_movs puts the register write directly in the ALU op that generates the swizzle source. In a store_reg world, that would mean placing a store_reg immediately after.

23:35 <gfxstrand> alyssa: Whereas in lower_vec_to_regs, you insert the store_reg at the vec location and then eliminate the swizzling mov, leaving the store_reg as-is.

23:35 <gfxstrand> So the store_reg ends up living at the vec location.

23:35 <alyssa> => extra moves because that store isn't trivial

23:35 <alyssa> ?

23:36 <alyssa> s/isn't/may not be/

23:36 <gfxstrand> I'm not following

23:36 <alyssa> I may not be either

23:37 <alyssa> The reason the placement matters is presumably because putting the store_reg too late will cause nir_trivialize_registers to insert a move that won't be coalesced?

23:37 <gfxstrand> No

23:37 <gfxstrand> It's because, thanks to SSA, the coalescing that happens in try_coalesce works across blocks.

23:38 <gfxstrand> It doesn't matter if the fdp4 or whatever it is happens to be 17 blocks away, if the vec is the only user, we can re-swizzle it and write the register as part of the fdp4.

23:38 <alyssa> haswell supports control flow????????

23:38 <gfxstrand> Yes, sadly.

23:38 <gfxstrand> :P

23:39 <alyssa> we dont talk about broadwell, no no no

23:39 <gfxstrand> By contrast, when you emit the store_reg at the location of the vec and then try to coalesce later, the problem is much harder because you're moving a store_reg with insufficient information.

23:39 <gfxstrand> Well, you have enough information

23:39 <gfxstrand> It's possible

23:39 <gfxstrand> Each component is written exactly once

23:40 <gfxstrand> But it's a lot harder than when we're doing it in try_coalesce and the value we're dealing with is SSA.

23:40 * alyssa is trying to page in enough details of the passes for this to make sense

23:44 <alyssa> gfxstrand: I'm still not following why it matters where the store_reg instruction is placed

23:45 <alyssa> except I guess because trivialize_registers inserting extra moves because it doesn't see across bblock boundaries

23:46 <gfxstrand> It matters because back-end vec4 copy-prop and register coalesce suck

23:47 <alyssa> Oh, well, yes

23:47 <idr> Understatement of the year...

23:48 <alyssa> gfxstrand: I can try to reintroduce try_coalesce instead of the 2 pass thing

23:48 <alyssa> tomorrow, I mean. it's past working hours now I just saw an interesting problem

23:48 <gfxstrand> Yeah

23:48 <gfxstrand> That's fine

23:48 <alyssa> would ppreciate if you can send me a small affected shader that I can play with

23:49 <alyssa> but if not I can probably construct smoething

23:49 <alyssa> I don't really remember why I did the 2 pass thing

23:51 <gfxstrand> alyssa: It's sitting in your e-mail

23:51 <alyssa> thanks!

23:51 <alyssa> it appears I may have texted you the reason weeks ago but had disappearing messages on

23:51 <alyssa> couldn't have been that important (-:

23:54 paulk has joined #dri-devel

23:55 paulk-bis has quit [Read error: Connection reset by peer]

23:58 heat has quit [Remote host closed the connection]