| ▲ | jeswin 2 hours ago | |
I would not trust AI on images. But I once had ChatGPT tell me that an MRI report was very likely to be incorrect based on the text, and offered a different diagnosis. Since it was semi insisting, I visited another doctor who made me do a retest. Long story short, ChatGPT was correct. Again, this is just one single person's experience. So not worth much. | ||
| ▲ | nostrebored an hour ago | parent [-] | |
I think that much of the visual gap is because what to attend to in images is less structured. Anecdotally small qwen finetunes (ie less than 10B) take task accuracy from sub 30% on FMs to 90%. We have sold some of these for outcome based back office tasks. I think we’ll see a lot of specialized VLMs that provide real value. | ||