| ▲ | ilia-a 10 days ago | |
This seems like an odd limit, especially since it is highly dependent on the token provider used. With Opus this is not much and could easily be burnt in a week or less, but with something like DeepSeek the 1500 could literally be an annual budget. That being said, I do have to wonder why someone as big as, say, Uber does not simply roll out an OSS model in the cloud for their team. I'd imagine that would be the cheapest and most flexible option, while also keeping all the data shared with the LLM private. | ||
| ▲ | iceman28 10 days ago | parent | next [-] | |
It’s not just about the model but also about setting up the system to provision and share compute (GPUs), which is quite complicated on its own. Uber’s primary business focus isn’t infrastructure. | ||