Knowledge Distillation of Black-Box Large Language Models (2024)

dmezzetti 4 hours ago | parent | next [-]

Related paper that's a good read: "Well-Read Students Learn Better: On the Importance of Pre-training Compact Models" (https://arxiv.org/abs/1908.08962)


Alifatisk 6 hours ago | parent | prev | next [-]

Why is this published again? Is this a reference to recent events?


babelfish 5 hours ago | parent [-]

I just saw some post about it on Threads and found it interesting so decided to share!

	tough 12 minutes ago | parent [-]
	My best guess is that this is a reference to Anthropic's recent accusations that Chinese labs are "distilling" its models.


linolevan 6 hours ago | parent | prev | next [-]

Can we note that this is a 2024 paper in the title?


duendefm 6 hours ago | parent | prev [-]

The Chinese are really going strong on destroying the American AI economy bubble. Honestly, despite the fact that I'm totally pro USA and anti China, I think we should help them crash the American AI bubble. They are controlling everything and we can't even buy a new computer nowadays, while getting no benefit from this. I wish some influential programmers would encourage coders everywhere to skip Claude and ChatGPT subscriptions for Chinese ones, at scale. If we programmers united we could help this bubble burst, I'm sure.

	nozzlegear 5 hours ago | parent [-]
	> skip Claude and ChatGPT subscriptions for Chinese ones, at scale. If we programmers united we could help this bubble burst, I'm sure.

	I'm doing my part!