Cerebras had a good IPO today. Their technology is very fast, but in terms of throughput, Nvidia NVL72 is 7x cheaper. FYI, Huawei's current and last generation are even cheaper than Nvidia per token. One advantage Cerebras has, though, is that production is not dependent on HBM3/HBM4 memory. A big disadvantage is that their software is difficult, and they don't offer any models newer than about a year old, as they are slow to implement bleeding-edge optimizations. Long contexts are also relatively slow on Cerebras.
Their 2nd customer is OpenAI, under a $20B, 3-year lease for up to 750MW of compute (equal to six years of 250MW blocks). The most optimistic cost per token possible for OpenAI is $10.53/M at 100% utilization. Realistic optimism is 20% of capacity, which means over $50/M. OpenAI is initially using the cluster to run Codex-Spark 5.3, which it charges customers $14/M tokens for. OpenAI also has the privilege of paying all OPEX: power alone, at just 7c/kWh, adds 50c/M tokens in the ideal case.
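A quick sanity check on those figures. The dollar amounts are from the post; the fleet-wide throughput is not published anywhere, it is just what the $10.53/M figure implies, so treat it as a derived assumption:

```python
# Back-of-envelope check of the lease arithmetic.
# Dollar figures are from the post; throughput is implied, not published.

LEASE_TOTAL = 20e9        # $20B total lease
LEASE_YEARS = 3
BEST_COST_PER_M = 10.53   # $/M tokens at 100% utilization (from the post)

annual_lease = LEASE_TOTAL / LEASE_YEARS  # ~$6.67B/yr

# Implied annual output at 100% utilization, in millions of tokens,
# then converted to a fleet-wide tokens/sec rate:
implied_m_tokens_per_year = annual_lease / BEST_COST_PER_M
implied_tokens_per_sec = implied_m_tokens_per_year * 1e6 / (365 * 24 * 3600)
print(f"implied output: {implied_tokens_per_sec / 1e6:.1f}M tokens/sec fleet-wide")

# At 20% utilization, the same lease cost spreads over 1/5 as many tokens:
for util in (1.0, 0.2):
    print(f"utilization {util:.0%}: ${BEST_COST_PER_M / util:.2f}/M tokens")
# 20% utilization works out to $52.65/M, i.e. "over $50/M"
```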
Their first customer was G42, a group owned by the UAE monarchy. Even if the UAE has permission to buy Nvidia, Cerebras offers quicker delivery, and the UAE helped develop (and controls) the software. Apparently Arabic has advantages on the chip, but they are still planning Nvidia-dominated expansions, with Patriot air defense guarding the systems.
I made a math mistake. The theoretical minimum cost to OpenAI is $3.15/M tokens ($3.30/M with electricity), as Cerebras has fixed context windows per user, and Codex-Spark allows 3.33 concurrent users per node. That is still a $16.50/M optimistic cost (at 20% of theoretical capacity) against $14/M revenue.
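Re-running the numbers with that concurrency correction, using only figures stated in the post (note the post rounds $3.16 to $3.15 and $16.56 to $16.50):

```python
# Corrected cost arithmetic: the $10.53/M figure assumed one user per node,
# but fixed context windows let 3.33 Codex-Spark users share a node.

single_user_cost = 10.53   # $/M tokens at 100% utilization, one user per node
concurrency = 3.33         # concurrent users per node (the post's claim)
power_adder = 0.50         # $/M tokens at 7c/kWh, single-user basis

per_user = single_user_cost / concurrency    # ~$3.16/M theoretical minimum
power_per_user = power_adder / concurrency   # ~$0.15/M
with_power = per_user + power_per_user       # ~$3.31/M

realistic = with_power / 0.20                # at 20% of theoretical capacity
print(f"theoretical: ${per_user:.2f}/M, with power: ${with_power:.2f}/M")
print(f"at 20% utilization: ${realistic:.2f}/M vs $14/M revenue")
```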
I guess there is a market for very-fast-response tasks. OpenAI does have a routing system that charges a high cost per token but gets most of the work done by its smaller/cheaper models behind the scenes.
But this turns out not to be ultra-stupid if OpenAI has enough internal training/improvement token workload to completely saturate the datacenter for its own use. Cerebras does have a training advantage over Nvidia; its immature software stack is only a problem for cutting-edge inference techniques.

