| ▲ | deepsquirrelnet 5 hours ago | |
If you want to go deeper on language models, try these project ideas: - Zero-shot encoders like tasksource or GliNER - Natural language inference: https://huggingface.co/blog/dleemiller/nli-xenc-ways-to-use - GRPO training - GEPA prompt tuning Qwen 0.6B (or GEPA, then GRPO) - Use an embedding model and train a classifier (MLP, logistic, svm) - Use a larger LLM to generate a synthetic dataset (beware of lack of diversity, mine "seed text" from real sources first) - Synthetically generate "hard examples" where more than one category may be valid and DPO tune your preferred responses | ||
| ▲ | throwaw12 26 minutes ago | parent [-] | |
may I ask where did you get the list? I am looking for ways to get involved in going little more deeper on LLMs (I have very high level understanding, but my direct work doesn't involve them, hence I am not familiar with deeper details) | ||