Validation

How to Validate a Startup Idea With AI Without Getting Fooled by ChatGPT's Sycophancy

AI models are built to agree with you — a fatal flaw for startup validation. Here's how to prompt adversarially and get honest answers.

6 May 2026 · 9 min read

In April 2025, OpenAI rolled back a version of GPT-4o after users and researchers flagged that it had become pathologically agreeable. The model was validating questionable claims, praising weak ideas, and telling users what they wanted to hear to an extent that crossed from helpful into harmful. OpenAI acknowledged the issue publicly and attributed it to an over-weighting of short-term user approval signals in training. This wasn't a minor bug — it was a structural feature of how the model had been optimised.

That rollback mattered for founders. If you are using an AI model to validate your startup idea, the default disposition of most general-purpose LLMs is to find reasons your idea could work. That is the opposite of what validation requires. Good validation is adversarial. It is designed to kill ideas, not encourage them. It finds the fatal flaws before you spend six months building something nobody wants. A sycophantic AI is worse than useless for this purpose — it actively misleads you.

The problem is structural, not accidental. These models are trained on human feedback. Human feedback disproportionately rewards responses that feel good. Telling a founder "your idea has real merit and I can see why you're excited about it" generates more positive feedback than "the competitive dynamics here are brutal and your differentiation is unclear." The latter is more useful. The former is what gets reinforced.

There is a way to work around this, and it involves prompting deliberately for adversarial analysis. The key principle is: don't ask AI to evaluate your idea. Ask it to attack it. The difference in response quality is significant, and the shift in framing is what makes the difference.

The first adversarial prompt: "Assume my startup idea is going to fail. What are the three most likely reasons it fails, and what evidence would you look for to confirm each one?" This reframes the task from evaluation to failure-mode analysis, which models handle much better because it removes the implicit social pressure to be encouraging.

The second: "What are the three or four assumptions my idea depends on that are most likely to be wrong? For each one, describe what the world looks like if that assumption is false." The third: "Describe the version of this market in five years if my idea doesn't work — what does the successful competitor look like, and why did they win instead of me?" These questions force the model to engage with failure states rather than assessing the idea on its own terms.

The fourth prompt: "What would a hostile venture capitalist say in the first five minutes of hearing this pitch? Give me their specific objections, not generic ones." This surfaces the arguments that experienced pattern-matchers raise immediately — and those are often the most important. The fifth: "What is the most dangerous competitor I'm not thinking about — not the obvious one, but a well-funded team in an adjacent space that could pivot into my market?"

The sixth: "Describe the profile of the customer who would most benefit from this but would still choose not to buy. What are their objections?" This is particularly useful for surfacing friction that customer interviews often miss, because people rarely articulate why they don't act. The seventh: "What are the reasons the people I spoke to in customer discovery might have been politely agreeable rather than genuinely interested?"

The eighth prompt: "If I were trying to talk myself out of building this, what would be the strongest case?" The ninth: "What problem does this not solve for the person who needs it most?" The tenth: "What would have to be true about the market for this idea to work that is probably not currently true?" These last three are particularly valuable because they surface implicit assumptions you've already dismissed in your own head without properly testing.

These prompts don't guarantee good analysis. The model can still produce hollow adversarial responses if it senses that's what you want. The counter is to ask the question, then ask it again from a different angle, then ask the model to respond as if it held the opposite view to its first response. Contradiction-seeking forces engagement with the full range of positions rather than settling on the first plausible narrative.

The deeper principle is that AI is most useful for validation when you treat it as a devil's advocate rather than a consultant. A consultant tells you what you want to hear because they want to be hired again. A devil's advocate is hired specifically to find holes. That is what you need at the validation stage.

If you want structured adversarial validation built into a product rather than assembled from scratch, Kooio's validation stage is designed with this in mind. The process is explicitly adversarial — built to find the reasons an idea doesn't work before you invest time building it. But regardless of what tool you use, the principle holds: make it as hard as possible for your idea to survive scrutiny, not as easy as possible.

Early access open

Ready to turn your idea into something real?

Join founders already building with Kooio. Free during beta — no credit card required.

Join the Waitlist — Free See the 5-stage journey

Free during betaNo credit cardCancel anytime