Remix clone Hacker News

new | show | ask | jobs Github

	▲	deepsquirrelnet 5 hours ago
		If you want to go deeper on language models, try these project ideas: - Zero-shot encoders like tasksource or GliNER - Natural language inference: https://huggingface.co/blog/dleemiller/nli-xenc-ways-to-use - GRPO training - GEPA prompt tuning Qwen 0.6B (or GEPA, then GRPO) - Use an embedding model and train a classifier (MLP, logistic, svm) - Use a larger LLM to generate a synthetic dataset (beware of lack of diversity, mine "seed text" from real sources first) - Synthetically generate "hard examples" where more than one category may be valid and DPO tune your preferred responses
	▲	throwaw12 26 minutes ago \| parent [-]
		may I ask where did you get the list? I am looking for ways to get involved in going little more deeper on LLMs (I have very high level understanding, but my direct work doesn't involve them, hence I am not familiar with deeper details)