Remix clone Hacker News

Author here! What a surprise. This was an abandoned project from 2019, that we never linked or advertised anywhere as far as I know. Anyways, happy to answer questions.

▲

mugivarra69 4 days ago | parent | next [-]

why (if) was this not picked for further research? i know that oatml did quite amount of work on this front as well and it seems the direction is still being worked on. want to get ur 2 cent on this approach.

	▲	duvenaud 3 days ago \| parent [-]
		BNNs certainly have their uses, but I think people in general found that it's a better use of compute to fit a larger model on more data than to try to squeeze more juice from a given small dataset + model. Usually there is more data available, it's just somewhat tangentially related. LLMs are the ultimate example of how training on tons of tangentially-related data can ultimately be worthwhile for almost any task.

▲

idontknowmuch 4 days ago | parent | prev | next [-]

Somewhat related — I’d love to hear your thoughts on dex-Lang and Haskell for array programming?

	▲	duvenaud 3 days ago \| parent [-]
		I still am excited by Dex (https://github.com/google-research/dex-lang/) and still write code in it! I have a bunch of demos and fixes written, and am just waiting for Dougal to finish his latest re-write before I can merge them.

▲

timeinput 4 days ago | parent | prev | next [-]

What did you use to produce the article? I really really like the formatting.

▲

duvenaud 3 days ago | parent [-]

I think we used a distill.pub template. Also Jerry wrote some custom BNN fitting code in javascript. I'll ask my co-authors to open-source it.

	▲	duvenaud 3 days ago \| parent [-]
		Update: the code is here: https://github.com/jerryqhyu/distill_bayes_net

▲

esafak 4 days ago | parent | prev [-]

just a little typo, but it's Kullback-Leibler.

	▲	duvenaud 3 days ago \| parent [-]
		Thanks for pointing that out!