Billing Security Risks in Anthropic’s Claude Code

Why This Billing Bug Matters Now

How the HERMES.md Bug Works

The bug’s trigger is both simple and insidious: if the recent commit history in your git repository contains the exact, case-sensitive string HERMES.md in a commit message, every Claude Code API request will be silently routed to “extra usage” billing. This means that, even if you have plenty of quota left on your paid plan (such as Anthropic’s $200/month Max 20x plan), requests are charged against extra credits, at steep, pay-as-you-go rates.

This bug is not triggered by having a file named HERMES.md in your repository. Only the presence of the string in a commit message matters. Lowercase variants like hermes.md do not trigger the bug, nor do other common documentation files like README.md or AGENTS.md.

Real-World Reproduction Steps

Developers have provided minimal reproducible steps that demonstrate the bug in action. This is not a hypothetical or edge case: it can be triggered with a few lines in the terminal:

# This FAILS (routes to extra usage billing):
mkdir /tmp/test-fail && cd /tmp/test-fail
git init && echo test > test.txt && git add . && git commit -m "add HERMES.md"
claude -p "say hello" --model "claude-opus-4-6[1m]"
# => API Error: 400 "You're out of extra usage..."

# This WORKS (uses plan quota):
mkdir /tmp/test-pass && cd /tmp/test-pass
git init && echo test > test.txt && git add . && git commit -m "add hermes.md"
claude -p "say hello" --model "claude-opus-4-6[1m]"
# => "Hello!"

# Note: In production, always sanitize inputs and verify billing responses for every API call.

Key facts:

Only the string HERMES.md (all uppercase) in a commit message triggers the bug.
File presence does not matter; commit message content does.
The error message returned when extra usage is depleted does not indicate the real cause, making this bug extremely hard to diagnose.

Anthropic’s Response and Customer Backlash

The financial impact was immediate and severe. One developer lost $200 in a single day, with their usage dashboard showing just 13% of their Max plan quota consumed while extra usage credits were silently drained (source). The error message (“You’re out of extra usage…”) gave no hint as to the real trigger.

When users reported the issue, Anthropic initially admitted the bug but refused refunds for the extra charges (external coverage). The company’s support responses were widely criticized as scripted and unhelpful. Community threads on Hacker News and social media quickly amplified the controversy, with developers accusing Anthropic of undermining trust at a critical moment in the AI cloud “arms race” (Hacker News discussion).

This backlash is particularly damaging given the competitive climate for AI infrastructure, as highlighted in our recent coverage of Google’s $40B Anthropic investment. Enterprise buyers and technical leads are watching closely to see how vendors handle not just technical innovation, but also customer support, transparency, and accountability.

Comparison Table: Anthropic vs. Competitors on Billing Transparency

AI infrastructure buyers often compare providers based on reliability, transparency, and support. The HERMES.md incident spotlights how seemingly minor bugs in billing logic can escalate into major trust issues. Below is a comparison of Anthropic and two other AI infrastructure approaches (based on our previous and current coverage):

Provider	Billing Transparency	Bug/Incident Disclosure	Refund Policy for Billing Errors	Source
Anthropic Claude Code	Error messages do not reveal usage routing logic; silent triggers based on commit message content	Bug confirmed only after user escalation; initially no proactive disclosure	Initial refusal to refund extra charges (see news reports)	GitHub Issue
Google Cloud AI (Vertex AI, Anthropic integration)	Enterprise dashboards, quota alerts, compliance tooling	Security and compliance bugs disclosed via status pages and support advisories	Refund or credit typically offered for confirmed billing platform errors (per standard cloud policy)	Internal analysis
Amazon Bedrock (OpenAI integration)	Detailed API usage logs, billing alerts, and cost management tools	Billing errors surfaced via AWS Health Dashboard	Case-by-case refunds; documented process for billing disputes	Internal analysis

Security Lessons for Developers and Infrastructure Teams

The HERMES.md billing bug is not a memory safety or classic logic flaw, it is a real-world example of how business logic bugs can have direct financial and operational consequences. It also demonstrates that even platforms with strong reputations for AI safety and reliability can be tripped up by unexpected input at the system boundary, a lesson echoed in our coverage of Rust bug limitations.

Key lessons:

Test all user inputs (including metadata like commit messages) against billing and quota logic before deploying to production.
Monitor usage and cost dashboards daily. Do not rely on error messages alone; set up independent usage alerts where possible.
Build in detection for anomalous billing patterns. If a single request triggers unexpected billing, flag it for manual review.
Demand clear, actionable error messages from your vendors. If the root cause of a quota or billing denial is not obvious, escalate until you get a technical answer.

Actionable checklist for developers and security teams:

Review all sources of user-controlled input (commit messages, file names, prompt content) that could influence API system prompts or billing logic.
Automate fuzz testing for billing-related code paths with edge-case values and strings.
Establish cost anomaly detection, alert when usage deviates from historical patterns, even if billing dashboard appears normal.
Document and regularly audit all vendor refund and dispute processes for SaaS infrastructure.
Push for incident postmortems and public disclosure of all billing-impacting bugs.

Key Takeaways:

Photo via Pexels

Seemingly minor inputs (like a commit message) can have major billing impact in AI infrastructure.

The HERMES.md bug cost users hundreds of dollars and eroded trust in Anthropic’s billing transparency.

Anthropic’s initial response (acknowledging the bug but refusing refunds) drew widespread customer and community criticism.

Developers and security teams must treat business logic, metadata, and system-boundary handling with the same rigor as low-level code auditing.

What to Watch Next

The fallout from the HERMES.md incident is not just a warning for Anthropic, but for every AI and cloud infrastructure provider. As enterprise spending on AI accelerates and complex developer workflows become the norm, silent bugs in usage accounting can create seven-figure exposure for vendors and customers alike.

For more on the operational and business risks of AI infrastructure, see:

Expect more scrutiny of billing transparency, automated cost anomaly detection, and public incident response as the next competitive battleground for AI infrastructure vendors.

—

For additional context on AI platform bug handling, see our deep-dive on Claude Code quality and operational lessons.