BOAD: Discovering Hierarchical Software Engineering Agents via Bandit Optimization
Original article: https://arxiv.org/abs/2512.23631v2
Large language models (LLMs) have shown strong reasoning and coding capabilities, yet they struggle to generalize to real-world software engineering (SWE) problems that are long-horizon and out of distribution. Existing...

This entry is part of the Top 50 AI Agent Articles curation.