The fact that they do this isn't very bullish for them achieving whatever they define as AGI.
You don't expect AGI to be multi-modal?
What is AGI?
Artificial general intelligence