ML//code generation//CriticGPT
- OpenAI model trained to spot bugs in LLM-generated code.
OpenAI model trained to spot bugs in LLM-generated code.
LLMs produce code with subtle security issues, logic errors, and silent failures that pass basic tests.
CriticGPT catches errors human reviewers miss, especially in complex codebases.
Anthropic's parallel: use SAEs (mechanistic interpretability) to identify features that cause buggy generation.