

I made a math mistake. Theoretical minimum cost to openAI is $3.15/m ($3.30/m with electricity) tokens, as cerebras has fixed context windows per user, and codex spark allows 3.33 concurrent users per node. That is still $16.50/m optimistic (20% of theoretical capacity) cost for $14/m revenue.
I guess there is a market for very fast response tasks. OpenAI does have a routing system that charges a high cost per token, but gets most of the work done by their smaller/cheaper models behind the scenes.
But, this turns out not to be ultra stupid if OpenAI has the internal training/improvement token workload to completely saturate the datacenter for its own use. Cerebras does have a training advantage over nvidia. It’s immature software stack only applies to cutting edge inference techniques.








Fortunately, it is unlikely the US will wage war on China because it is already so far behind. The US is directly collapsing, and it explains its political desperation, but the war is entirely the oligarchy against the people, and protecting their pillage. Certainly massive spending for war is hard to stop, but the appetite to lose yet another war is not going to go up after Iran.