▲ | simonw 6 days ago | |
Fine-tuning for specific tasks. I'm hoping to see some good examples of that soon - the blog entry mentions things like structured text extraction, so maybe something like "turn this text about an event into an iCal document" might work? | ||
▲ | turnsout 6 days ago | parent | next [-] | |
Google helpfully made some docs on how to fine-tune this model [0]. I'm looking forward to giving it a try!
| ||
▲ | CuriouslyC 6 days ago | parent | prev | next [-] | |
Fine tuning messes with instruction following and RL'd behavior. I think this is mostly going to be useful for high volume pipelines doing some sort of mundane extraction or transformation. | ||
▲ | iib 6 days ago | parent | prev [-] | |
This is exactly the fine-tuning I am hoping for, or I would do if I had the skills. I tried it with gemma3 270M and vanilla it fails spectacularly. Basically it would be the quickadd[1] event from google calendar, but calendar agnostic. [1] https://developers.google.com/workspace/calendar/api/v3/refe... |