it seems to be able to do native speech-speech
It does for sure. I did some more digging and it does real-time too. That's fascinating.