Remix.run Logo
Iv 4 days ago

"Starting from a single base LLM"

Ok, zero data, except the data used in the teacher model.

nickpsecurity 4 days ago | parent [-]

Only 1-15TB of data processed at $10k-$100m depending on model size. Then, this shaves off a few hundred to a few grand on fine-tuning. I mean, we're still saving money at least.

markmoscov 4 days ago | parent [-]

[dead]