Discussion about this post

User's avatar
Scenarica's avatar

The reason every forecast including Anthropic's own keeps being wrong in the same direction is that the existing frameworks are built for single-loop technologies. Electricity had demand-side network effects, more people on the grid meant more appliances got built which drove more demand. AI tokens have something with no precedent: simultaneous demand-side AND supply-side network effects. More usage creates more use cases (demand loop). But more usage also generates training signal and fine-tuning data that makes the model better (supply loop). The consumption of the product is itself the R&D that improves the product.

No previous technology had both loops running at once. Electricity doesnt improve when you consume more of it. But every token consumed makes the next token more valuable because the model underneath has improved from the interaction. Thats why the 80x number keeps surprising even the people building it, theyre forecasting with single-loop frameworks against a double-loop system and the error compounds in every cycle. The electricity analogy is directionally right but it actually understates the dynamic, because electricity never had the property where the act of consumption was simultaneously an act of production.

ToxSec's avatar

Really nice read. super interesting that grok was only using 11% of Colossus. The deal ended up working out well for them, but i still didn't realize how low the usage was.

11 more comments...

No posts

Ready for more?