Remix clone Hacker News

new | show | ask | jobs Github

	▲	nkzd 12 hours ago
		Why are you 'worried' about it? Shouldn't we strive for better technology even if it means some will 'lose'?
	▲	yorwba 11 hours ago \| parent \| next [-]
		"Better" isn't just about increasing benchmark numbers. Often, it's more important that a system fails safely than how often it fails. Automatic speech recognition that guesses when the input is unclear will occasionally be right and therefore have a lower word error rate, but if it's important that the output be correct, it might be better to insert "[unintelligible]" and have a human double-check.
	▲	ks2048 10 hours ago \| parent \| prev \| next [-]
		Ideally, you'd be able to specify exactly what you want - do you want to write-out filled pauses ("aaah", "umm")? Do you want to get a transcription of the the disfluencies - re-starts, etc. or just get out a cleaned up version?
	▲	IshKebab 10 hours ago \| parent \| prev [-]
		It's better in terms of WER. It's not better in terms of not making shit up that sounds plausible. Probably the answer is simply to tweak the metric so it's a bit more smart than WER - allow "unclear" output which is penalised less than actually incorrect answers. I'd be surprised if nobody has done that.