15 t/s way too slow for anything but chatting, call and response, and you don't need a 3T parameter model for that
Wake me up when the situation improves
Just wait for the M5-Ultra with a terabyte of RAM.