simonw 6 days ago

Fine-tuning for specific tasks. I'm hoping to see some good examples of that soon - the blog entry mentions things like structured text extraction, so maybe something like "turn this text about an event into an iCal document" might work?
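
To make the "event text in, iCal out" idea concrete, here is a rough sketch of what a single training pair for such a fine-tune might look like. Everything in it is an assumption for illustration: the `event_to_ical` and `make_training_pair` helpers are hypothetical, the prompt/completion JSONL layout is just one common convention (not necessarily what the Gemma docs use), and the iCal output is a stripped-down subset that skips the `UID` and `DTSTAMP` fields a strict RFC 5545 parser would expect.

```python
import json
from datetime import datetime


def event_to_ical(summary, start, end, location=None):
    """Render one event as a minimal iCalendar document.

    Assumption: a simplified subset of RFC 5545. UID and DTSTAMP are
    omitted and text values are not escaped, to keep the target short.
    """
    fmt = "%Y%m%dT%H%M%S"  # floating local time, no time zone
    lines = [
        "BEGIN:VCALENDAR",
        "VERSION:2.0",
        "PRODID:-//example//event-extraction-sketch//EN",
        "BEGIN:VEVENT",
        f"SUMMARY:{summary}",
        f"DTSTART:{start.strftime(fmt)}",
        f"DTEND:{end.strftime(fmt)}",
    ]
    if location:
        lines.append(f"LOCATION:{location}")
    lines += ["END:VEVENT", "END:VCALENDAR"]
    # iCalendar content lines are terminated with CRLF.
    return "\r\n".join(lines) + "\r\n"


def make_training_pair(text, summary, start, end, location=None):
    """Build one hypothetical prompt/completion record for fine-tuning."""
    return {
        "prompt": f"Turn this text about an event into an iCal document:\n{text}",
        "completion": event_to_ical(summary, start, end, location),
    }


pair = make_training_pair(
    "Team lunch on 5 March 2025, 12:30 to 13:30 at Cafe Roma",
    summary="Team lunch",
    start=datetime(2025, 3, 5, 12, 30),
    end=datetime(2025, 3, 5, 13, 30),
    location="Cafe Roma",
)

# One JSON object per line is the usual JSONL training-file layout.
print(json.dumps(pair))
```

A few hundred to a few thousand such pairs, with varied phrasing on the input side, would presumably be the dataset; how many are actually needed is something to find out by trying.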

turnsout 6 days ago

Google helpfully made some docs on how to fine-tune this model [0]. I'm looking forward to giving it a try!

  [0]: https://ai.google.dev/gemma/docs/core/huggingface_text_full_finetune

CuriouslyC 6 days ago

Fine-tuning messes with instruction following and RL'd behavior. I think this is mostly going to be useful for high-volume pipelines doing some sort of mundane extraction or transformation.

iib 6 days ago

This is exactly the fine-tuning I am hoping for, or would do myself if I had the skills. I tried it with vanilla gemma3 270M and it fails spectacularly.

Basically it would be the quickAdd[1] event from Google Calendar, but calendar-agnostic.

[1] https://developers.google.com/workspace/calendar/api/v3/refe...
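
For anyone comparing the vanilla model against a fine-tuned one on this task, a cheap structural check on the output is a reasonable first filter. This is only a sketch: `looks_like_ical_event` is a hypothetical helper, and it tests the rough shape of a single-event iCal document rather than validating it against the spec.

```python
def looks_like_ical_event(text):
    """Return True if text has the rough shape of a one-event iCal document.

    Assumption: this is a heuristic sanity check, not an RFC 5545 validator.
    """
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    if not lines:
        return False
    # The calendar wrapper must open and close the output, with no chatter.
    if lines[0] != "BEGIN:VCALENDAR" or lines[-1] != "END:VCALENDAR":
        return False
    # Exactly one event, opened before it is closed.
    if lines.count("BEGIN:VEVENT") != 1 or lines.count("END:VEVENT") != 1:
        return False
    if lines.index("BEGIN:VEVENT") > lines.index("END:VEVENT"):
        return False
    # Property names come before ':' and before any ';' parameters.
    names = {ln.split(":", 1)[0].split(";", 1)[0] for ln in lines}
    return {"SUMMARY", "DTSTART"} <= names


good = (
    "BEGIN:VCALENDAR\r\nVERSION:2.0\r\nBEGIN:VEVENT\r\n"
    "SUMMARY:Team lunch\r\nDTSTART;TZID=Europe/Rome:20250305T123000\r\n"
    "END:VEVENT\r\nEND:VCALENDAR\r\n"
)
bad = "Sure! Here is your event:\nBEGIN:VCALENDAR\nEND:VCALENDAR"

print(looks_like_ical_event(good), looks_like_ical_event(bad))
```

Counting how often each model's output passes this check over a held-out set of event descriptions gives a crude pass rate before looking at whether the dates and titles are actually right.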