Don’t measure model cost by token price. Measure based on tokens used to achieve a task.
Others report in this thread that it’s about 2x more expensive due to outputs: https://news.ycombinator.com/item?id=48312774