| ▲ | vunderba 11 hours ago | ||||||||||||||||
Smaller models might not make the best agentic coding assistants, but I have a 128GB RAM headless machine serving llama.cpp with a number of local models that handles various tasks on a daily basis and works great. - Qwen3-VL:30b > A file watcher on my NAS sends new images to it, which autocaptions and adds the text descriptions as a hidden EXIF layer into the image along with an entry into a Qdrant vector database for lossy searching and organization. - Gemma3:27b > Used for personal translation work (mostly English and Chinese). Haven't had a chance to try out the Gemma4 models yet. - Llama3.1:8b > Performs sentiment analysis on texts / comments / etc. | |||||||||||||||||
| ▲ | verdverm 10 hours ago | parent [-] | ||||||||||||||||
Look into updating to Gemma4 and Qwen3.6, they are good at agentic things. qwen36moe with unsloth's 8bit quant is my daily driver now. | |||||||||||||||||
| |||||||||||||||||