AI Coding Agent
Problem for agent is Language Model Context Size
The advent of large language models could potentially reduce software development costs to nothing, sparking a rapid and diverse growth in software akin to the content boom or Cambrian explosion.

LLMs should be used in conjunction with other tools to prevent the human review process from becoming a bottleneck.
One approach to reinforcement learning involves generative and discriminative models, such as GAN. Typical high-level AI development follows this approach and requires automation. While images can be compared visually, it's much harder to evaluate text, code, and audio. Therefore, a good AI coding assistant should not just provide results, but should help by breaking tasks down into smaller, easily verifiable steps. In other words, the importance of verifiability aligns with Verifiable Reward, suggesting that larger units like code blocks or video clips should be gradually incorporated.
Design Arena
Current limitations
- Stop Digging; Know Your Limits
- Mise en Place
- Scientific Debugging
- The tail wagging the dog
- Consistent formatting
- Read the Docs
- Use Static Types
Leaderboard
PR workflow integration Git Flow
Designing tools for developers means designing for LLMs too
Most large language models (LLMs) aren't great at using less popular frameworks.