News

Microsoft Paper Intros Fully Autonomous AI Framework, Turning Devs into Supervisors

GitHub Copilot is one thing, AutoDev is another, completely automating software development with autonomous AI agents that do all the work themselves, turning developers into supervisors.

A recently published paper from five Microsoft researchers, titled, AutoDev: Automated AI-Driven Development, explains the concept:

We present AutoDev, a fully automated AI-driven software development framework, designed for autonomous planning and execution of intricate software engineering tasks. AutoDev enables users to define complex software engineering objectives, which are assigned to AutoDev's autonomous AI Agents to achieve. These AI agents can perform diverse operations on a codebase, including file editing, retrieval, build processes, execution, testing, and git operations. They also have access to files, compiler output, build and testing logs, static analysis tools, and more. This enables the AI Agents to execute tasks in a fully automated manner with a comprehensive understanding of the contextual information required.

As might be expected, that presentation has stirred up a lot of developer angst on Hacker News and elsewhere.

That might be because of snippets in the paper like: "The developer's role within the AutoDev framework transforms from manual actions and validation of AI suggestions to a supervisor overseeing multi-agent collaboration on tasks, with the option to provide feedback. Developers can monitor AutoDev's progress toward goals by observing the ongoing conversation used for communication among agents and the repository."

AutoDev Framework
[Click on image for larger view.] AutoDev Framework (source: Microsoft).

Notions like that prompted HN comments like: "Maybe ignorant, but if AI can get to a point of fully automating SWEs, hardly any white-collar knowledge based job is safe."

As far as the nuts and bolts of the framework, the figure below illustrates how AutoDev workflow enables an AI Agent to achieve an objective by performing actions in a repository. "The Eval Environment executes the suggested operations, providing the AI Agent with the resulting outcome. In the conversation, purple messages are from the AI agent, while blue messages are responses from the Eval Environment."

AutoDev Workflow
[Click on image for larger view.] AutoDev Workflow (source: MIcrosoft).

The framework improves on tools like GitHub Copilot by enabling autonomous AI agents to execute actions like those listed above, with key features listed as:

  • The ability to track and manage user and AI agents conversations through a Conversation Manager
  • A library of customized Tools to accomplish a variety of code and SE related objectives
  • The ability to schedule various AI agents to work collaboratively towards a common objective through an Agent Scheduler
  • The ability to execute code and run tests through an Evaluation Environment

"We've shifted the responsibility of extracting relevant context for software engineering tasks and validating AI-generated code from users (mainly developers) to the AI agents themselves," the paper said. "Agents are now empowered to retrieve context through Retrieval actions and validate their code generation through Build, Execution, Testing, and Validation actions."

The researchers published impressive benchmark numbers and foreshadowed further work "to integrate AutoDev into IDEs as a chatbot experience and incorporate it into CI/CD pipelines and PR review platforms."

About the Author

David Ramel is an editor and writer at Converge 360.

comments powered by Disqus

Featured

  • Hands On: New VS Code Insiders Build Creates Web Page from Image in Seconds

    New Vision support with GitHub Copilot in the latest Visual Studio Code Insiders build takes a user-supplied mockup image and creates a web page from it in seconds, handling all the HTML and CSS.

  • Naive Bayes Regression Using C#

    Dr. James McCaffrey from Microsoft Research presents a complete end-to-end demonstration of the naive Bayes regression technique, where the goal is to predict a single numeric value. Compared to other machine learning regression techniques, naive Bayes regression is usually less accurate, but is simple, easy to implement and customize, works on both large and small datasets, is highly interpretable, and doesn't require tuning any hyperparameters.

  • VS Code Copilot Previews New GPT-4o AI Code Completion Model

    The 4o upgrade includes additional training on more than 275,000 high-quality public repositories in over 30 popular programming languages, said Microsoft-owned GitHub, which created the original "AI pair programmer" years ago.

  • Microsoft's Rust Embrace Continues with Azure SDK Beta

    "Rust's strong type system and ownership model help prevent common programming errors such as null pointer dereferencing and buffer overflows, leading to more secure and stable code."

  • Xcode IDE from Microsoft Archrival Apple Gets Copilot AI

    Just after expanding the reach of its Copilot AI coding assistant to the open-source Eclipse IDE, Microsoft showcased how it's going even further, providing details about a preview version for the Xcode IDE from archrival Apple.

Subscribe on YouTube

Upcoming Training Events