Remix.run Logo
duvenaud 4 days ago

Author here! What a surprise. This was an abandoned project from 2019, that we never linked or advertised anywhere as far as I know. Anyways, happy to answer questions.

mugivarra69 4 days ago | parent | next [-]

why (if) was this not picked for further research? i know that oatml did quite amount of work on this front as well and it seems the direction is still being worked on. want to get ur 2 cent on this approach.

duvenaud 3 days ago | parent [-]

BNNs certainly have their uses, but I think people in general found that it's a better use of compute to fit a larger model on more data than to try to squeeze more juice from a given small dataset + model. Usually there is more data available, it's just somewhat tangentially related. LLMs are the ultimate example of how training on tons of tangentially-related data can ultimately be worthwhile for almost any task.

idontknowmuch 4 days ago | parent | prev | next [-]

Somewhat related — I’d love to hear your thoughts on dex-Lang and Haskell for array programming?

duvenaud 3 days ago | parent [-]

I still am excited by Dex (https://github.com/google-research/dex-lang/) and still write code in it! I have a bunch of demos and fixes written, and am just waiting for Dougal to finish his latest re-write before I can merge them.

timeinput 4 days ago | parent | prev | next [-]

What did you use to produce the article? I really really like the formatting.

duvenaud 3 days ago | parent [-]

I think we used a distill.pub template. Also Jerry wrote some custom BNN fitting code in javascript. I'll ask my co-authors to open-source it.

duvenaud 3 days ago | parent [-]

Update: the code is here:

https://github.com/jerryqhyu/distill_bayes_net

esafak 4 days ago | parent | prev [-]

just a little typo, but it's Kullback-Leibler.

duvenaud 3 days ago | parent [-]

Thanks for pointing that out!