chabes 5 hours ago

The small models are getting really impressive.

I recently realized that Qwen3.5:4B is way more capable than I thought a model that size could be.

Combine that with the work Liquid puts into RL and fine tuning, and you get models that perform extremely well on minimal hardware.

Combine that with your own fine tuning, and you get a specialized tool that is fast, private, and doesn’t require an internet connection.

r0b05 5 hours ago | parent [-]

What did you use Qwen3.5 4B for?

steve_adams_86 3 hours ago | parent | next [-]

I use it for triaging my messages and emails and reminding me how all of it ties together. It uses Obsidian to know where to put stuff and how to connect information. It isn't perfect. It's very slow (using a 32GB M2 Max) but fast enough for my needs.

A good example of how it's helpful is that it makes certain things relatively frictionless. Like, I need to pay property taxes. I hate this stuff. I got the email reminder from my municipality and it made an entry in my TODOs which points to a page with instructions to pay the taxes, including my folio and access numbers for when I log in. That was taken from the email and a document which contains past property tax information. I have it all there, but it compiles relevant data into dedicated TODO pages.
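For anyone curious what a TODO page like that amounts to on disk, here is a rough sketch of the writing side only. It is not the actual harness. The `TodoNote` class, its field names, and the `TODO/` folder are all made up for illustration, and the model's extraction step is assumed to have already produced the values. The only real conventions it leans on are Obsidian's `- [ ]` task checkboxes and `[[wikilinks]]`.

```python
from dataclasses import dataclass, field
from pathlib import Path


@dataclass
class TodoNote:
    """Hypothetical TODO page; every name here is illustrative."""
    title: str
    source: str  # e.g. the email that triggered the TODO
    links: list[str] = field(default_factory=list)  # related vault pages
    details: dict[str, str] = field(default_factory=dict)  # extracted facts

    def render(self) -> str:
        # An Obsidian task checkbox, then the context needed to act on it.
        lines = [f"- [ ] {self.title}", "", f"Source: {self.source}"]
        if self.details:
            lines.append("")
            lines += [f"- {key}: {value}" for key, value in self.details.items()]
        if self.links:
            lines.append("")
            lines.append("Related: " + ", ".join(f"[[{name}]]" for name in self.links))
        return "\n".join(lines) + "\n"

    def write(self, vault: Path) -> Path:
        # Assumed layout: one Markdown file per TODO under <vault>/TODO/.
        path = vault / "TODO" / f"{self.title}.md"
        path.parent.mkdir(parents=True, exist_ok=True)
        path.write_text(self.render(), encoding="utf-8")
        return path


note = TodoNote(
    title="Pay property taxes",
    source="Reminder email from the municipality",
    links=["Property tax instructions"],
    details={"Folio number": "(from past tax document)"},
)
print(note.render())
```

Calling `note.write(Path("/path/to/vault"))` would drop the page into the vault as a plain Markdown file, which is all Obsidian needs to pick it up.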

I'm so bad at doing all of this myself. I really don't enjoy it. Send me to buy a carrot at the store and I'll happily walk 30 minutes there and back to do it. It isn't the effort so to speak; it's how unrewarding, inefficient, and bureaucratic it all is. I'm allergic to it. Why isn't it baked into my income taxes? Why are we still doing this?

Sometimes it does a really bad job of making TODOs. Like my wife messaged me about what our dinner plan was, so Qwen went ahead and made a plan for chicken meatball soup based on messages from a week earlier. It totally fabricated the recipe. Yet, I don't know, it was still helpful to be reminded that I'm in charge of dinner.

It's probably best at scaffolding responses to emails I don't want to send. I will write it, but I appreciate basic information being fleshed out so I can write it without jumping around looking for files or numbers or whatever constantly.

I use it with a custom harness. It could be a lot better. Everything about it could be better. The model is remarkably good for its size and price, though.

Letting Sonnet 4.6 do it instead always yields much better results, much faster, but it's kind of like using a new phone vs a super old one. They can both get you there. The sound quality and camera might be worse, it doesn't look as fancy, but the new one is $1200 and the old one is free on marketplace if you're handy with a screwdriver and a fresh battery. Sounds great to me.

Worth noting: this was all vibe-coded using Opus 4.6 and 4.7. It's the only project I've built that is strictly vibe-coded. It's simultaneously exciting and disgusting. I'm not sure if I'll ever 'software engineer' it, or I'll just let it be slop. It works.

cjtrowbridge 4 hours ago | parent | prev | next [-]

It's really good at agentic tasks.

sroussey 4 hours ago | parent | prev [-]

I find it works well in the browser.