Remix clone Hacker News

new | show | ask | jobs Github

	▲	evilduck 8 hours ago
		To be fair to your field, that advancement seems expected, no? We can do things to LLMs that we can't ethically or practically do to humans.
	▲	AlphaAndOmega0 5 hours ago \| parent [-]
		I'm still impressed by the progress in interpretability, I remember being quite pessimistic that we'd achieve even what we have today (and I recall that being the consensus in ML researchers at the time). In other words, while capabilities have advanced at about the pace I expected from the GPT-2/3 days, mechanistic interpretability has advanced even faster than I'd hoped for (in some ways, we are very far from completely understanding the ways LLMs work).