My guess is that new architectures will be about doing more with less compute. For example, are there architectures that can operate at lower bit precision, or that are better at switching components on and off as the task requires?
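To make those two ideas concrete, here is a minimal sketch in plain Python. It is an illustration under my own assumptions, not a description of any particular architecture: `quantize_int8` shows one simple way to store values at lower bit precision (symmetric 8-bit quantization), and `sparse_forward` shows one simple way to run only the components a given input needs (top-k gating over a list of "experts"). All function names and the choice of int8 and top-k are hypothetical.

```python
import math


def quantize_int8(values):
    """Symmetric quantization: map floats to integers in [-127, 127].

    Returns the integer codes and the scale needed to recover approximate floats.
    """
    max_abs = max(abs(v) for v in values)
    scale = max_abs / 127 if max_abs > 0 else 1.0
    codes = [max(-127, min(127, round(v / scale))) for v in values]
    return codes, scale


def dequantize(codes, scale):
    """Recover approximate float values from integer codes."""
    return [c * scale for c in codes]


def top_k_gate(scores, k):
    """Pick the k highest-scoring components and softmax-normalize their scores."""
    top = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    peak = max(scores[i] for i in top)
    exps = [math.exp(scores[i] - peak) for i in top]
    total = sum(exps)
    return top, [e / total for e in exps]


def sparse_forward(x, experts, gate_scores, k=1):
    """Evaluate only the k selected experts; the others are never called."""
    active, weights = top_k_gate(gate_scores, k)
    return sum(w * experts[i](x) for i, w in zip(active, weights))
```

In the first function the saving comes from storing and moving 8-bit integers instead of 32-bit floats, at the cost of a rounding error of roughly half a quantization step per value. In the second, the saving comes from skipping work: with `k=1` and three experts, only one of the three functions runs for a given input.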