AI-assisted programming is undergoing a fundamental shift. From Tab autocomplete to synchronous agents to Cloud Agents running on isolated cloud VMs, developers no longer guide AI line by line. Instead, they launch multiple autonomous agents that independently complete entire development workflows, from coding and testing to building and demoing. Over 35% of merged PRs at Cursor are now created by agents, and agent usage has grown 15x year over year. This is not an incremental improvement; it is a paradigm-level leap.

Key Takeaways

  • Three eras: Tab autocomplete (Era 1) → Synchronous agents (Era 2) → Cloud Agents (Era 3), with each transition accelerating
  • Cloud Agent essence: Agents run in isolated cloud VMs with full dev environments (browsers, terminals, test runners), enabling developers to launch multiple agents in parallel
  • Output revolution: From diffs to artifacts — Cloud Agent outputs include logs, video recordings, live previews, and merge-ready PRs
  • Data-backed: 35% of merged PRs at Cursor created by agents; agent users now outnumber Tab users 2:1
  • Role shift: Developers move from writing code to decomposing problems → launching agents → reviewing artifacts → providing feedback

Want to quickly browse and compare AI coding tools and agent frameworks? Check out our tool directories:

👉 AI Tool Directory · Agent Directory

Three Eras of AI Coding: From Completion to Autonomy

To understand the revolutionary significance of Cloud Agents, we need to trace the full arc of AI coding evolution. Cursor CEO Michael Truell frames it as three distinct eras, each fundamentally redefining human-AI collaboration.

Era 1: Tab Autocomplete (2022–2024)

The first era was defined by GitHub Copilot and Cursor Tab. AI identified low-entropy, repetitive code patterns and developers accepted suggestions with a Tab press. This was the origin of "AI-assisted" coding — AI handled mechanical coding while developers retained full control.

```python
# Era 1: Tab Autocomplete
# Developer writes function signature, AI completes the body
def calculate_fibonacci(n: int) -> int:
    # AI autocomplete ↓
    if n <= 1:
        return n
    return calculate_fibonacci(n - 1) + calculate_fibonacci(n - 2)
```

This era lasted nearly two years. Developers still wrote the majority of logic manually, with Tab autocomplete mainly removing friction at the margins.

Era 2: Synchronous Agents (2024–2025)

When LLMs became capable enough to function as conversational agents, the second era arrived. Developers shifted from typing code to directing agents through prompt-response loops. Cursor Composer, Copilot Chat, and TRAE's IDE mode are representative examples.

```typescript
// Era 2: Synchronous Agent Dialogue
// Developer: "Add pagination to the user list, 20 per page, with prev/next"
// Agent generates full implementation ↓

// Minimal User type so the snippet is self-contained (illustrative)
interface User {
  id: number;
  name: string;
}

interface PaginationParams {
  page: number;
  pageSize: number;
}

function paginateUsers(users: User[], params: PaginationParams) {
  const { page, pageSize } = params;
  const startIndex = (page - 1) * pageSize;
  return {
    data: users.slice(startIndex, startIndex + pageSize),
    total: users.length,
    currentPage: page,
    totalPages: Math.ceil(users.length / pageSize),
  };
}
```

The limitations of synchronous agents were clear: they competed for local machine resources, required developer presence at every decision point, and could only handle one task at a time.

Era 3: Cloud Agents (2026–)

The third era removes these constraints. Cloud Agents run in isolated cloud VMs, work autonomously for hours after receiving a task, and return merge-ready PRs with full artifacts (logs, video recordings, live previews). The developer's role shifts from "writing code" to "defining problems and evaluating outputs."

```yaml
# Era 3: Cloud Agent task description
# Developer launches a Cloud Agent from Slack
task: |
  Users reported a clipboard data exfiltration vulnerability.
  Please:
  1. Reproduce the issue and build a PoC demo page
  2. Locate the root cause and implement a fix
  3. Add regression tests
  4. Record before/after comparison video
  5. Create a merge-ready PR

# Agent autonomously executes in VM:
# → Start backend → Build HTML demo → Run tests → Record video → Submit PR
# Developer reviews artifacts when notified
```

```mermaid
graph TD
    A["Era 1 - Tab Autocomplete"] -->|"LLM reasoning improves"| B["Era 2 - Synchronous Agents"]
    B -->|"Cloud VMs + async execution"| C["Era 3 - Cloud Agents"]
    A1["Developer writes code, AI completes snippets"] --> A
    B1["Developer guides agent, step-by-step dialogue"] --> B
    C1["Developer defines problems, Agent delivers autonomously"] --> C
    style A fill:#e8f5e9,stroke:#4caf50
    style B fill:#e3f2fd,stroke:#2196f3
    style C fill:#fce4ec,stroke:#e91e63
```

The pace of evolution is accelerating: the Tab era lasted nearly two years, while the synchronous agent era may not survive a full year before Cloud Agents become the new baseline.

Core Architecture: VM Isolation and Artifact Output

Cloud Agents achieve true autonomous execution through their underlying VM isolation architecture and a fundamentally new artifact output model.

Isolated VM Architecture

Each Cloud Agent runs in a fully isolated cloud VM with:

  • Complete development environment: editor, terminal, build tools, test runners
  • Browser instance: can start servers, navigate web pages, verify UI behavior
  • File system access: read/write the codebase, operate Git branches
  • Network capabilities: access APIs, download dependencies, interact with external services

This means agents no longer compete with the developer's local machine for CPU, memory, and GPU. Developers can launch 5, 10, or more agents simultaneously, each working independently in its own VM.

```javascript
// Conceptual model of a Cloud Agent runtime
const cloudAgent = {
  vm: {
    os: "Linux",
    cpu: "dedicated vCPUs",
    memory: "dedicated RAM",
    storage: "ephemeral SSD",
    network: "isolated VPC",
  },
  tools: {
    editor: "headless VS Code / Cursor",
    terminal: "full bash shell",
    browser: "headless Chromium",
    git: "full Git CLI with repo access",
    testRunner: "jest / pytest / go test",
  },
  capabilities: [
    "read and write files",
    "run shell commands",
    "start servers",
    "navigate web pages",
    "take screenshots",
    "record video demos",
    "create pull requests",
  ],
};
```

From Diffs to Artifacts: A Fundamental Shift in Output

In the synchronous agent era, AI output was primarily code diffs — developers had to review every change line by line, mentally reconstructing the full context. In the Cloud Agent era, output becomes rich artifacts:

| Output Type | Synchronous Agent | Cloud Agent |
| --- | --- | --- |
| Code changes | diff/patch | Merge-ready PR |
| Execution process | Real-time terminal output | Structured logs |
| Verification evidence | None | Test reports + screenshots |
| Demo materials | None | Video recording + live preview |
| Review approach | Line-by-line code review | Watch demo first, then review code |

This shift enables a critical advantage: parallel review becomes practical. When 5 agents complete tasks simultaneously, developers can first watch each agent's demo video and screenshots, quickly judge directional correctness, then drill into code details only for PRs that warrant attention.
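To make the artifact-first review queue concrete, here is a minimal Python sketch. The `AgentRun` fields and the triage heuristic are illustrative assumptions, not part of any real product API:

```python
from dataclasses import dataclass

@dataclass
class AgentRun:
    """A completed Cloud Agent run with its artifacts (fields are illustrative)."""
    pr_id: str
    tests_passed: bool
    has_demo_video: bool
    lines_changed: int

def triage(runs: list[AgentRun]) -> tuple[list[AgentRun], list[AgentRun]]:
    """Split finished runs into 'review now' and 'send back' piles.

    Runs with failing tests go straight back to their agents; the rest are
    ordered demo-first and smallest-diff-first so quick wins land early.
    """
    review_now = sorted(
        (r for r in runs if r.tests_passed),
        key=lambda r: (not r.has_demo_video, r.lines_changed),
    )
    send_back = [r for r in runs if not r.tests_passed]
    return review_now, send_back
```

Real criteria would also include security-scan results and coverage deltas; the point is that artifacts, not diffs, drive the ordering.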

The Data: Agents Are Taking Over Code Production

Data from industry leaders clearly demonstrates the speed and scale of this transition.

Cursor's Internal Numbers

Cursor CEO Michael Truell shared several key data points in February 2026:

  • 35% of merged PRs created by agents running autonomously in cloud VMs
  • 15x year-over-year growth in agent usage: from experimental feature to core workflow
  • User ratio inversion: In March 2025, Tab users outnumbered agent users 2.5:1. By early 2026, agent users outnumber Tab users 2:1
  • Cursor's team believes the synchronous agent era may not last a full year before being superseded — faster than the Tab era's two years

This data comes from Cursor's own production development process — not lab benchmarks, but real merged code. When an AI coding company uses agents to produce one-third of its merged PRs, it signals that Cloud Agents have crossed the viability threshold.

Agent Capability Evolution

On February 24, 2026, Cursor shipped "Agents can now control their own computers." This wasn't just a feature — it was a qualitative leap in capability:

  • Agents can start servers, open browsers, navigate web pages
  • Agents can manipulate spreadsheets, run test suites
  • Agents can interact with the software they create
  • Developers can take over the agent's remote desktop to directly use modified software
  • All operations run in isolated VMs, with zero impact on local environments

This means agent task scope expanded from "generating code" to "building and demoing working software."

Leading Cloud Agent Implementations: Cursor, TRAE, and GitHub

The Cloud Agent paradigm isn't a single product's innovation — it's an industry-wide trend. Here are the three most representative implementations.

Cursor Background Agent

Cursor's Background Agent is currently the most mature Cloud Agent implementation:

  1. Developer describes a task in Cursor (desktop/web/mobile/Slack/GitHub)
  2. Agent auto-onboards onto the codebase in a cloud VM
  3. Agent autonomously executes coding, testing, and debugging cycles
  4. Agent generates merge-ready PR with logs, video recordings, and live preview
  5. Developer reviews artifacts and can take over the remote desktop to experience modified software

```bash
# Launch a Background Agent via Cursor CLI
cursor agent start \
  --task "Refactor auth module, migrate session-based auth to JWT" \
  --branch "feature/jwt-auth-migration" \
  --notify slack:#engineering \
  --artifacts video,preview,logs

# Agent autonomously executes in cloud VM:
# 1. Clone repo → Analyze code structure
# 2. Implement JWT auth logic
# 3. Migrate all route middleware
# 4. Run test suite → Fix failing tests
# 5. Record demo video → Generate live preview link
# 6. Create PR → Notify Slack
```

TRAE SOLO: The Responsive Coding Agent

TRAE (The Responsive Coding Agent) takes a different path, offering two complementary modes:

  • IDE mode: Preserves traditional development workflow with AI assisting alongside (similar to Era 2 synchronous agents)
  • SOLO mode: AI autonomously drives the full development flow, from requirement understanding to code delivery

SOLO Coder's distinctive feature is its multi-agent architecture: rather than a single agent executing serially, multiple specialized agents collaborate in parallel. One handles architecture, one handles implementation, one writes tests, and one performs code review.

For more foundational concepts on AI Agents and architecture design, check out our deep-dive guide: How to Build AI Agents: Architecture Design & Code Guide.

GitHub Agentic Workflows

GitHub approaches the Cloud Agent paradigm from another angle — embedding agents into CI/CD pipelines. GitHub Agentic Workflows, which entered Technical Preview in early 2026, has several core features:

  • Markdown replaces YAML: Describe workflow intent in natural language Markdown files instead of complex YAML configurations
  • Runs on GitHub Actions: Agents execute in Actions runners, leveraging existing CI/CD infrastructure
  • Scales across millions of repos: Integrates with the GitHub Actions ecosystem for deployment across millions of repositories
markdown
<!-- .github/workflows/auto-triage.md -->
# Automatic Issue Triage and Fix

When a new issue is created:
1. Analyze the issue description — determine if it's a bug report or feature request
2. If bug: attempt to reproduce, locate root cause, create fix PR
3. If feature request: assess scope, generate technical proposal draft
4. Auto-add appropriate labels and assignees
5. Reply with analysis results in the issue

The significance of this Agentic Workflow is that agents are no longer one-shot tools triggered manually by developers. They become continuously running "digital workers" in the repository — handling issue triage, PR reviews, CI failure analysis, and dependency maintenance automatically.

For practical LLM integration in CI/CD, see: LLM-Powered CI/CD: Automated Code Review in Practice.

The Self-Driving Codebase: From Vision to Roadmap

All these Cloud Agent implementations point toward the same ultimate vision — the Self-Driving Codebase. This analogy borrows from autonomous driving's level system, describing a gradual progression from human-maintained to self-evolving codebases.

Three-Phase Evolution

Phase 1: Establish Agent Primitives

Introduce foundational Cloud Agent capabilities — automated testing, automated code review, automated dependency updates. This is "L2 assisted driving," where agents handle specific, well-defined tasks.

Phase 2: Discover System Bottlenecks

As agents take on more tasks, systemic bottlenecks emerge: insufficient test coverage, unclear requirement documents, missing Context Engineering practices. The core work at this phase is optimizing the "agent's working environment."

Phase 3: Scale the Software Factory

Multiple agents run in parallel, each focused on different responsibilities. Developers shift from being in-the-loop (deeply involved in each step) to on-the-loop (supervising and guiding overall direction).

```mermaid
graph LR
    subgraph "Phase 1: Agent Primitives"
        A1[Automated Testing] --> A2[Auto Code Review]
        A2 --> A3[Dependency Updates]
    end
    subgraph "Phase 2: Bottleneck Optimization"
        B1[Test Coverage Boost] --> B2[Documentation Standards]
        B2 --> B3[Context Engineering]
    end
    subgraph "Phase 3: Software Factory"
        C1["Agent Pool, Multi-Agent Parallel"] --> C2["Human on-the-loop, Supervise & Direct"]
    end
    A3 --> B1
    B3 --> C1
```

Self-Driving in Practice

For a typical mid-size SaaS project, a self-driving codebase might include these continuously running agents:

```yaml
# self-driving-codebase.yaml
agents:
  - name: triage-agent
    trigger: "new issue created"
    action: "Analyze issue → Classify → Assign → Attempt auto-fix"

  - name: review-agent
    trigger: "new PR created"
    action: "Code review → Security scan → Perf analysis → Feedback"

  - name: dependency-agent
    trigger: "weekly schedule"
    action: "Scan deps → Update minor versions → Run tests → Create PR"

  - name: docs-agent
    trigger: "API schema changed"
    action: "Sync API docs → Update SDK → Notify downstream"

  - name: test-agent
    trigger: "coverage drops below threshold"
    action: "Analyze uncovered paths → Generate test cases → Create PR"
```

For a deeper dive into multi-agent system architecture, see: Multi-Agent Systems: Architecture Deep Dive.

The Developer Role Transformation

The Cloud Agent era's impact goes far beyond tools — it's redefining the core skill requirements of the "software engineer" profession.

From Code Author to Agent Orchestrator

| Dimension | Traditional Developer | Cloud Agent Era Developer |
| --- | --- | --- |
| Core skill | Writing high-quality code | Problem decomposition + artifact review |
| Time allocation | 70% coding, 30% review | 20% coding, 40% review, 40% planning |
| Tool relationship | Use IDE to write code | Orchestrate multiple agents in parallel |
| Quality assurance | Manual testing + code review | Define review criteria + evaluate agent output |
| Work mode | Synchronous, serial | Asynchronous, parallel |

The essence of this shift can be captured by Cursor's factory metaphor: developers no longer build the product directly — they build and oversee the system that builds it.

Essential Skills for the New Era

Problem decomposition: Breaking ambiguous requirements into clearly executable agent tasks demands deeper engineering thinking than writing code.

Context engineering: Preparing sufficient, precise context for agents — project specs, architecture docs, testing strategies — directly determines agent output quality. For more on this critical topic, see the Context Engineering concept overview.

Artifact review: Rapidly evaluating agent output quality — not reading code line by line, but first examining demo videos, test reports, and performance metrics, then deciding whether to drill into code details.

Feedback precision: When agent output doesn't meet expectations, providing precise feedback (rather than rewriting) to guide iterative improvement.
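As a concrete illustration of problem decomposition, a vague requirement like "migrate to JWT auth" can be broken into independently executable agent tasks, each with its own context and objective acceptance criteria. The schema and the breakdown below are hypothetical:

```python
from dataclasses import dataclass

@dataclass
class AgentTask:
    """One clearly scoped unit of work for a single agent (illustrative schema)."""
    goal: str
    context: list[str]    # docs and paths the agent should read first
    done_when: list[str]  # objective acceptance criteria

def decompose_auth_migration() -> list[AgentTask]:
    """Example decomposition of 'migrate to JWT auth' into parallelizable tasks."""
    return [
        AgentTask("Implement JWT issue/verify helpers",
                  ["docs/auth.md"], ["unit tests for token round-trip pass"]),
        AgentTask("Swap session middleware for JWT middleware",
                  ["docs/auth.md", "src/middleware/"], ["all route tests pass"]),
        AgentTask("Add regression tests for expired/invalid tokens",
                  ["docs/auth.md"], ["auth module coverage >= 90%"]),
    ]
```

The discipline is in `done_when`: criteria a reviewer (or CI) can verify against artifacts, without re-reading every line the agent wrote.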

Synchronous Agent vs Cloud Agent: Comprehensive Comparison

To help teams evaluate whether and how to migrate to Cloud Agent workflows, here's a systematic comparison:

| Dimension | Synchronous Agent | Cloud Agent |
| --- | --- | --- |
| Runtime | Developer's local machine | Isolated cloud VM |
| Resource contention | Competes with local IDE/browser | Fully isolated, zero contention |
| Task duration | Minutes (requires continuous attention) | Hours (async execution) |
| Parallelism | Single task, serial | Multi-agent parallel |
| Developer presence | Required at every decision point | Only reviews final artifacts |
| Output format | Code diffs | PR + logs + video + preview |
| Review method | Line-by-line code review | Watch demo first, then review code |
| Best for | Small edits, exploratory coding | Cross-file refactoring, features, bug fixes |
| Cost model | Per-token billing | VM time + token billing |
| Maturity | Widely adopted | Rapidly maturing (mainstream in 2026) |
| Representative products | Cursor Composer, Copilot Chat | Cursor Background Agent, TRAE SOLO |

These modes are complementary, not competing. Synchronous agents suit fast exploratory coding and small changes, while Cloud Agents handle clearly defined medium-to-large tasks. Vibe Coding principles apply to both — for more on effective AI collaboration, see: Vibe Coding in Practice: The Art and Science of AI Programming.
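A team's routing decision between the two modes can be captured as a simple heuristic. This Python sketch encodes the rule of thumb from the comparison: synchronous for small or exploratory work, cloud for well-specified multi-file tasks. The thresholds are illustrative assumptions, not a recommendation:

```python
def pick_agent_mode(files_touched: int, spec_is_clear: bool,
                    needs_exploration: bool) -> str:
    """Toy router: return which agent mode fits a task (thresholds illustrative)."""
    if needs_exploration or files_touched <= 2:
        return "synchronous"   # fast interactive loop for small/unclear work
    if spec_is_clear:
        return "cloud"         # well-defined multi-file work runs async in a VM
    return "synchronous"       # unclear specs need an interactive loop first
```

In practice the inputs would come from the ticket itself (estimated scope, whether acceptance criteria exist), and ambiguous tasks often start synchronous to sharpen the spec before being handed to a Cloud Agent.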

Security and Trust: Cloud Agent's Achilles Heel

When agents have full repository access, can autonomously execute commands, and create PRs, security is no longer a theoretical concern.

Risk Matrix

| Risk Type | Description | Mitigation |
| --- | --- | --- |
| Code injection | Agents may introduce vulnerable code | Mandatory CI security scans |
| Data leakage | Agents accessing files with sensitive info | Least privilege + file-level access control |
| Supply chain attacks | Agents may introduce malicious deps | Lock dependency versions + audit new deps |
| Prompt injection | Malicious instructions in Issues/PRs | Input sanitization + agent behavior boundaries |
| Over-trust | Auto-merging insufficiently reviewed PRs | Human approval gates |

Security Best Practices

```yaml
# agent-security-policy.yaml
security:
  sandbox:
    type: "isolated-vm"
    network: "restricted"
    allowed_domains:
      - "github.com"
      - "registry.npmjs.org"
    blocked_operations:
      - "modify CI/CD config"
      - "access production secrets"
      - "delete branches"

  review_gates:
    - name: "ci-must-pass"
      required: true
    - name: "security-scan"
      required: true
    - name: "human-approval"
      required_for:
        - "changes to auth modules"
        - "infrastructure config changes"
        - "dependency additions"

  audit:
    log_all_commands: true
    log_file_access: true
    retention: "90 days"
```
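A human-approval gate of this kind ultimately reduces to a path check before merge. This Python sketch shows one way such a check could work; the path prefixes and function name are illustrative, not from any real policy engine:

```python
# Paths that always require human approval, mirroring the policy categories
# (auth modules, infrastructure config, dependency manifests). Illustrative only.
SENSITIVE_PREFIXES = ("src/auth/", "infra/", "package.json")

def requires_human_approval(changed_files: list[str]) -> bool:
    """Return True when any changed file falls under a gated area."""
    return any(f.startswith(SENSITIVE_PREFIXES) for f in changed_files)
```

A CI step would run this against the PR's file list and block auto-merge when it returns `True`, regardless of how confident the agent's own self-assessment is.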

The core security challenge: we're granting increasing autonomy to AI systems, which demands more mature engineering practices to maintain trust boundaries. For agent security and guardrail design, see the MCP (Model Context Protocol) approach to tool-calling security, and explore security-related integrations via the MCP Server Directory.

Looking Ahead: When Codebases Learn to Drive Themselves

The Cloud Agent era is just beginning. As of early 2026: Cursor Background Agents have proven viability, TRAE SOLO is exploring new frontiers in responsive coding, and GitHub Agentic Workflows is embedding agent capabilities into the world's largest code hosting platform.

Key trends for the next 12–18 months:

  1. Agent protocol standardization: Multiple agents need standardized communication protocols (MCP has started this work at the tool-calling layer) for cross-platform agent collaboration
  2. From code agents to product agents: Agent capabilities will expand from code generation to product design, user testing, and A/B experimentation — covering the full product delivery chain
  3. Agent evaluation maturity: Measuring agent output quality, reliability, and security will drive new evaluation frameworks and benchmarks
  4. Developer education transformation: Programming education will shift from "how to write code" to "how to decompose problems, orchestrate agents, and review outputs"

This isn't a debate about whether AI will replace developers — it's about how developers redefine their value in an agent-driven world. The best developers will be those who can build and manage self-driving codebases.


This is part 9 of the AI Agent Engineering series. In the previous article, we explored Claude Code's full-stack practice from terminal to CI/CD. Next, we'll continue exploring the cutting edge of agent engineering.