GitHub announced Code Scanning Auto-Fix on November 19, 2025: an AI-powered feature that doesn’t just detect security vulnerabilities but writes the code to fix them and creates pull requests automatically. This marks a fundamental shift from detection-only tools to autonomous remediation. The AI now handles the grunt work of writing SQL injection patches and XSS fixes while developers focus on review and approval.
This crosses a critical trust boundary. For the first time, mainstream developers are trusting AI to write and deploy security-critical code, not just suggest improvements. With GitHub claiming 85% fix accuracy and 40-60% faster remediation, this could either revolutionize security workflows or create a generation of developers who can’t write secure code without AI assistance.
From Detection to Autonomous Remediation
Traditional code scanning tools detect vulnerabilities and alert developers: “SQL injection found on line 42. Manual fix required.” GitHub Code Scanning Auto-Fix goes further. It detects the vulnerability, writes the actual code to fix it, creates a pull request with the fix, and explains why the code is vulnerable and how the fix resolves the issue.
This is a categorical shift, not an incremental improvement. Tools have detected vulnerabilities for decades, but writing the fix has always required human expertise. GitHub is betting that AI can automate 85% of this work, freeing developers to focus on the complex architectural security issues that machines can’t handle. Read more in the official GitHub announcement.
How Auto-Fix Works: 4 Steps from Scan to PR
Auto-Fix follows a straightforward process. First, CodeQL (GitHub’s existing scanner) analyzes code for security vulnerabilities. Second, AI powered by Copilot’s models generates a code fix. Third, the system automatically creates a pull request with the fix. Fourth, it includes an AI-generated explanation of the vulnerability and fix reasoning.
Here’s a concrete example. The AI detects SQL injection when user input is directly concatenated into a query:
// BEFORE: Vulnerable to SQL injection
// (assumes the usual Express/MySQL setup, e.g. const app = require('express')();
//  and const db = require('mysql2').createConnection({ ... });)
app.get('/user', (req, res) => {
  const userId = req.query.id;
  // User input is concatenated straight into the SQL string
  const query = `SELECT * FROM users WHERE id = ${userId}`;
  db.query(query, (err, results) => {
    res.json(results);
  });
});

// AFTER: Auto-Fixed with parameterized query
app.get('/user', (req, res) => {
  const userId = req.query.id;
  // Placeholder instead of interpolation; the driver escapes userId as data
  const query = 'SELECT * FROM users WHERE id = ?';
  db.query(query, [userId], (err, results) => {
    res.json(results);
  });
});
The AI explains: “User input directly concatenated into SQL query. An attacker could inject malicious SQL code. This fix uses parameterized queries which automatically escape user input, preventing SQL injection.” Auto-Fix currently supports JavaScript/TypeScript, Java, and Python, with more languages on the way.
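The same pattern applies to the other injection classes the tool targets. As an illustration (our sketch, not output from Auto-Fix), a reflected-XSS fix follows the same shape: stop sending raw user input to the page, encode it on output. The escapeHtml helper is our invention; real fixes often lean on a templating library instead.

// Illustrative sketch only, not generated by Auto-Fix.
// BEFORE: reflected XSS; the search term is interpolated into HTML unescaped
app.get('/search', (req, res) => {
  res.send(`<h1>Results for ${req.query.q}</h1>`);
});

// Minimal HTML encoder (hypothetical helper)
const escapeHtml = (s) =>
  String(s).replace(/[&<>"']/g, (c) =>
    ({ '&': '&amp;', '<': '&lt;', '>': '&gt;', '"': '&quot;', "'": '&#39;' }[c]));

// AFTER: user input is encoded before it reaches the page
app.get('/search', (req, res) => {
  res.send(`<h1>Results for ${escapeHtml(req.query.q)}</h1>`);
});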
Who’s Responsible When AI Deploys Code?
GitHub’s approach requires human approval: it creates pull requests but doesn’t auto-merge them. However, enterprise users can configure auto-merge for low-risk vulnerabilities. This raises critical questions: What if the AI introduces new vulnerabilities while fixing old ones? Who’s liable if an auto-fix breaks production?
“This is a double-edged sword,” says Dr. Sarah Chen, security researcher at Trail of Bits. “We’re trusting AI to write security-critical code. The explainability helps, but it’s not formal verification.” Developers on Hacker News are equally skeptical: “85% accuracy sounds good until you realize 15% of security fixes could be wrong.”
Even with review gates, this is the first mainstream tool that writes and deploys security patches autonomously. Developers must grapple with when to trust AI fixes and when manual review is non-negotiable. A false sense of security is a real risk.
Related: AI Copilots Are Making Developers Worse Programmers
Early Results: 40-60% Faster, But Not Perfect
Early adopters report impressive productivity gains. Stripe reduced security debt by 50% in the first month. Shopify merged 30% of auto-fix pull requests without modifications. GitHub claims 85% fix success rate for common vulnerabilities like SQL injection, XSS, and buffer overflows, with 95% of fixes passing existing test suites.
However, the limitations are real. The false positive rate sits around 10%: AI sometimes “fixes” code that isn’t actually vulnerable. Accuracy drops to 60-70% for complex business logic, and the tool doesn’t handle architectural issues like authentication flows or authorization logic. It works best for well-known patterns and struggles with novel or context-dependent vulnerabilities. More details in TechCrunch’s coverage.
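A hypothetical example of why context matters: the handler below is already parameterized, so a pattern-based scanner has nothing to flag, yet it lets any logged-in user read anyone’s invoices. The names requireLogin and owner_id are our inventions for illustration.

// Hypothetical: syntactically safe, semantically broken.
// The query is parameterized, so injection scanners stay quiet,
// but there is no ownership check: a classic IDOR.
app.get('/invoice/:id', requireLogin, (req, res) => {
  db.query('SELECT * FROM invoices WHERE id = ?', [req.params.id], (err, rows) => {
    res.json(rows);
  });
});

// The real fix needs business context the model rarely has:
// scope the lookup to the authenticated user.
app.get('/invoice/:id', requireLogin, (req, res) => {
  db.query(
    'SELECT * FROM invoices WHERE id = ? AND owner_id = ?',
    [req.params.id, req.user.id],
    (err, rows) => res.json(rows)
  );
});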
The metrics show real productivity gains, but this isn’t a silver bullet. Teams need realistic expectations: AI handles the obvious cases; humans still own the complex ones.
When to Trust AI Fixes (And When to Verify)
Security expert Troy Hunt’s advice is clear: “GitHub’s approach is smart – they’re not auto-merging, they’re creating PRs. This keeps humans in the loop while automating the grunt work.” The key is treating auto-fix PRs like any other pull request: run tests, review logic, check for breaking changes.
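In practice, “run tests” can mean pinning down the patched endpoint before approving. A minimal sketch, assuming the Express app from the earlier example is exported from app.js and tested with Jest and supertest; the tooling choices are ours, not GitHub’s.

// user.test.js: regression tests for the auto-fixed /user endpoint (sketch)
const request = require('supertest');
const app = require('./app'); // assumes app.js exports the Express app

test('normal lookup still works after the fix', async () => {
  const res = await request(app).get('/user').query({ id: '42' });
  expect(res.status).toBe(200);
});

test('injection payload is treated as data, not SQL', async () => {
  const res = await request(app).get('/user').query({ id: "1 OR '1'='1" });
  // With the parameterized query the payload matches no row;
  // before the fix it could have dumped the whole users table.
  expect(res.body).toEqual([]);
});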
Trust AI for common vulnerability patterns (SQL injection, XSS) when tests pass. Require manual review for authentication and authorization code; there’s too much at stake. Combine auto-fix with existing security tools like Snyk and SonarQube rather than replacing them. And avoid the auto-fix-plus-auto-merge trap: if you automatically merge both dependency updates and security fixes without review, you’re asking for trouble.
Key Takeaways
- GitHub Code Scanning Auto-Fix shifts security from detection to autonomous remediation – AI writes fixes and creates pull requests automatically
- Early results show 40-60% faster remediation, but 85% accuracy means 15% of fixes could be wrong – false sense of security is a real risk
- Works best for common vulnerability patterns (SQL injection, XSS) with good test coverage – struggles with complex business logic and architectural issues
- Always review AI-generated security fixes before merging, especially for authentication and authorization code
- This crosses the trust boundary from AI suggestions to AI-deployed code – raises questions about developer skill atrophy and liability
The question isn’t whether to use auto-fix, but when. For well-tested codebases and common vulnerability patterns, the productivity gains are real. For critical authentication logic or novel security issues, human expertise remains non-negotiable. The challenge is knowing which is which.