ai-agents-2-2
Trustworthy agents in practice
Safety and trust considerations for real deployments where agents take consequential actions.
Nina Reed · 9 Jun 2026 · 1 min read

ai-agents-2-2
What Do Evolutionary Coding Agents Evolve?
Recent work pairs LLMs with evolutionary search to iteratively generate, modify, and select code using task-specific feedback. These systems have produced strong results in mathematical discovery and algorithm design, ye...
Nina Reed · 19 May 2026 · 1 min read

ai-agents-2-2
Debug2Fix: Can Interactive Debugging Help Coding Agents Fix More Bugs?
While significant progress has been made in automating various aspects of software development through coding agents, there is still significant room for improvement in their bug fixing capabilities. Debugging and invest...
Nina Reed · 20 Feb 2026 · 1 min read

ai-agents-2-2
RedCodeAgent: Automatic Red-teaming Agent against Diverse Code Agents
Code agents have gained widespread adoption due to their strong code generation capabilities and integration with code interpreters, enabling dynamic execution, debugging, and interactive programming capabilities. While...
Nina Reed · 2 Oct 2025 · 1 min read

ai-agents-2-2
LLM Agents Making Agent Tools
Tool use has turned large language models (LLMs) into powerful agents that can perform complex multi-step tasks by dynamically utilising external software components. However, these tools must be implemented in advance b...
Nina Reed · 17 Feb 2025 · 1 min read