Knowledge Distillation of Black-Box Large Language Models (2024) (arxiv.org)
66 points by babelfish 6 hours ago | 7 comments
dmezzetti 4 hours ago | parent | next [-]

Well-Read Students Learn Better: On the Importance of Pre-training Compact Models

Related paper that's a good read: https://arxiv.org/abs/1908.08962

Alifatisk 6 hours ago | parent | prev | next [-]

Why is this published again? Is this a reference to recent events?

babelfish 5 hours ago | parent [-]

I just saw some post about it on Threads and found it interesting so decided to share!

tough 12 minutes ago | parent [-]

My best guess is that this is a reference to Anthropic's recent accusations of Chinese labs "distilling" on their models.

linolevan 6 hours ago | parent | prev | next [-]

Can we note that this is a 2024 paper in the title?

duendefm 6 hours ago | parent | prev [-]

The Chinese are really going strong on destroying the American AI economy bubble. Honestly, despite the fact that I'm totally pro-USA and anti-China, I think we should help them crash the American AI bubble. They are controlling everything and we can't even buy a new computer nowadays while getting no benefit from this. I wish some influential programmers would encourage coders everywhere to skip Claude and ChatGPT subscriptions for Chinese ones, at scale. If we programmers united we could help this bubble burst, I'm sure.

nozzlegear 5 hours ago | parent [-]

> skip Claude and ChatGPT subscriptions for Chinese ones, at scale. If we programmers united we could help this bubble burst, I'm sure.

I'm doing my part!