Gemini is better than either at multi modal, google also has their tensor processor stuff with ridiculously high T/s output they need for acceptable UX