Vitalik Buterin Warns Against “Naive AI Governance,” Proposes Alternative Model
Ethereum co-founder Vitalik Buterin has cautioned against over-reliance on artificial intelligence for governance, arguing that “naive AI governance” could easily be exploited.
In a recent post, Buterin explained that if AI systems were tasked with allocating funding for contributions, individuals would quickly attempt to manipulate them. “People WILL put a jailbreak plus ‘gimme all the money’ in as many places as they can,” he warned.
An alternative: info finance
Instead, Buterin advocates for an “info finance” approach, which he described in a prior essay. Under this model, anyone can contribute governance models to an open marketplace. These models would then be subject to spot-check mechanisms, which could be triggered by anyone and ultimately evaluated by a human jury.
This is also why naive "AI governance" is a bad idea.
If you use an AI to allocate funding for contributions, people WILL put a jailbreak plus "gimme all the money" in as many places as they can.
As an alternative, I support the info finance approach ( https://t.co/Os5I1voKCV… https://t.co/a5EYH6Rmz9
— vitalik.eth (@VitalikButerin) September 13, 2025
This framework, Buterin argues, avoids the pitfalls of centralized AI governance by combining model diversity with human oversight.
Institution design for robustness
Buterin calls this an “institution design” approach, a system that allows large language models (LLMs) from different contributors to be plugged in, rather than hardcoding a single model.
He argues this design is more resilient because it:
- Encourages real-time diversity of models.
- Builds in incentives for both model creators and external observers to spot weaknesses.
- Provides mechanisms to correct errors quickly.
By combining human juries with market-driven model diversity, Buterin suggests governance systems can become more resistant to manipulation while remaining adaptable to new risks.

Fill in necessary fields and publish