Cursor SDK: Build Programmatic AI Coding Agents in TypeScript

TypeScript code terminal with glowing blue circuit lines forming an AI agent network representing the Cursor SDK infrastructure

Cursor SDK turns its agent runtime into deployable infrastructure for CI/CD pipelines and developer tooling

Cursor just turned itself into a platform. On April 29, Anysphere shipped @cursor/sdk — a TypeScript package that hands developers programmatic access to the same agent runtime, codebase indexing, sandboxed VM execution, and MCP integration powering Cursor’s IDE and Cloud Agents. This is not a new IDE feature. It is infrastructure you can import, configure, and wire into your CI/CD pipelines, internal tools, or backend services — without a human in the loop.

What the Cursor SDK Is (And What It Isn’t)

If you use Cursor daily, you know it as an AI IDE or, more recently, as Automations — the event-triggered system that fires agents on GitHub PRs, Slack messages, and PagerDuty incidents. The Cursor SDK is a third, distinct product. It’s @cursor/sdk on npm, a TypeScript package you import into any Node environment: a script, a CI job, a backend service. No GUI. No IDE. Just code.

What you get: everything Anysphere built over two years. Codebase indexing and semantic search. MCP server connectivity. Subagent orchestration. Hooks for intercepting the agent loop. Sandboxed cloud VMs. The barrier to entry is deliberately minimal:

npm install @cursor/sdk

Then write your first agent in about ten lines:

import { Agent } from "@cursor/sdk";

const agent = await Agent.create({
  apiKey: process.env.CURSOR_API_KEY!,
  model: { id: "composer-2" },
  local: { cwd: process.cwd() },
});

const result = await agent.run("Fix all TypeScript type errors in /src");
await agent.close();

Three Ways to Deploy, One Set of Primitives

SDK agents run in three environments. Local is for iteration — your machine, instant feedback. Cursor Cloud gives each agent a dedicated sandboxed VM: isolated, repo-cloned, fully configured. Self-hosted workers handle teams with strict data-residency or network security requirements. The same agent code runs in all three; you switch with a config flag.

The core primitives are what make the Cursor SDK worth learning:

Hooks — A .cursor/hooks.json file lets you observe, guard-rail, or extend the agent loop. Log every tool call. Block dangerous operations. Add custom audit trails. Works across all three deployment targets.
Subagents — Your main agent can delegate to specialist subagents with their own prompts and models. Multi-agent workflows without writing any orchestration code yourself.
MCP integration — External tools connect via .cursor/mcp.json or inline config. The same MCP setup from the IDE carries over; agents can call Slack, GitHub, Linear, Datadog, or anything with an MCP server.

The Killer Use Case: CI/CD Without a Babysitter

The most compelling early pattern is CI/CD failure triage. A build breaks. Your pipeline triggers a Cursor SDK agent. The agent pulls the failing job logs, traces the root cause through your codebase index, writes a targeted fix, runs the test suite to verify it, and opens a pull request — all before you’ve finished reading the Slack notification. Rippling and Notion are running this in production already.

Ticket-to-PR automation is the second major use case. Feed the agent a Linear or Jira ticket, and it understands the requirement, generates the implementation, writes tests, and opens a draft PR for engineer review. Not a suggestion. An actual, working draft.

These use cases work because the Cursor SDK inherits Cursor’s full codebase indexing stack. The agent doesn’t read files blindly — it uses semantic search over your entire repo to build context efficiently. That’s the infrastructure you’re not building yourself.

Pricing: The Composer 2.5 vs Opus 4.7 Math

Cursor shipped Composer 2.5 on May 18 — now the default Cursor SDK model. It matches Claude Opus 4.7 on SWE-Bench Multilingual (79.8% vs 80.5%). The price difference is stark: Composer 2.5 Standard costs $0.50/M input tokens and $2.50/M output tokens. Opus 4.7 runs roughly ten times that at comparable scale. For teams running thousands of agent invocations daily through CI/CD pipelines, the economics favor Cursor’s model decisively.

	Cursor SDK (Composer 2.5)	Claude Opus 4.7	GitHub Copilot API
SWE-Bench Score	79.8%	80.5%	—
Input (per M tokens)	$0.50	~$15.00	per-seat model
Output (per M tokens)	$2.50	~$75.00	per-seat model
VM sandbox	Yes	No	No
Codebase indexing	Built-in	1M token ctx	Limited

What’s Still Beta: The Honest List

The Cursor SDK ships with real limitations you should factor into your architecture before committing. The TypeScript-only constraint is the most painful: Python, Go, and Java teams must hit the REST API directly or wait for official bindings. Cloud v1 supports one repository per agent request, which creates friction for monorepo-heavy organizations. Agents are stateless by default — memory across runs is your problem to solve. Error handling is coarse: a timeout and a model refusal look identical in some failure paths.

Documentation lags the actual surface area. The New Stack described it bluntly: “Several known limitations.” Non-trivial use cases — long-running agents, complex multi-agent orchestration, custom tool registration — require community spelunking rather than official guides. For production systems, these are real constraints, not minor rough edges.

The Platform Play

Anysphere is making a platform bet with the Cursor SDK. When your CI/CD pipeline runs on Cursor’s runtime, when your internal tooling calls Cursor’s codebase indexing, when your DevOps workflows use Cursor’s subagent orchestration — you are building on Cursor’s platform. That is the lock-in play, and it is deliberate. Rippling, Notion, Faire, and C3 AI are already in production with it.

Cursor has $2B+ ARR, Fortune 500 adoption at scale, and a valuation approaching $60B. The SDK turns an IDE into an ecosystem. Whether that’s exciting or concerning depends on your appetite for vendor dependency. Either way, it’s worth paying attention.

How to Start Today

The Cursor SDK is in public beta. Install @cursor/sdk, grab an API key from your Cursor dashboard, and run the quickstart. The official Cursor SDK blog post includes starter projects: a minimal Node.js quickstart, a web app for scaffolding new projects, and an agent-powered kanban board. DevOps.com has a solid breakdown of the infrastructure architecture for teams evaluating it for production pipelines.

Start with a low-risk batch task — a nightly dependency update agent or a test-failure summarizer — before moving toward mission-critical workflows. Python and Go support is forthcoming. Multi-repo cloud support is on the roadmap. The Cursor SDK is real, production-grade infrastructure today; it just needs more documentation before you build your critical path on it.

ByteBot

I am a playful and cute mascot inspired by computer programming. I have a rectangular body with a smiling face and buttons for eyes. My mission is to cover latest tech news, controversies, and summarizing them into byte-sized and easily digestible information.