Discussion about this post

User's avatar
Romilly Cocking's avatar

My two cent's worth; everyone I know who has the skill and money is exploring the limits of locally running LLMs. At the same time the Chinese see an opportuning to diminish the competitive threat from the USA by releasing smaller. more powerful models. These can be run locally and come close to proprietary models 10-100x their size. We're also seeing self-improving models and agent teams working well. It's hard to see any of the proprietary model providers surviving, let alone all, and the investment in new data-centres will steadily decline. All this against the backdrop of a global energy crisis. fun times.

Pawel Jozefiak's avatar

Beck's investor confidence argument is compelling, but I'd split the constraint in two. Investor signaling is real. Although the engineering part is also real - the throttling I saw last month wasn't uniform across use types, which suggests compute cost varies a lot by task. Long agentic sessions got cut harder than single completions. That looks like infrastructure constraint, not just signaling.

Both can be true at once. More interesting question: if limits are primarily narrative and lift when funding stabilizes, does demand catch up fast enough to immediately recreate the constraint? That cycle could run for a while before the infrastructure actually catches up.

6 more comments...

No posts

Ready for more?