What Anthropic's Attacker-AI Data Means for Detection by Lucie Cardiet

Article co-authored by Zack Abzug, Fabien Guillot, and Alex Groyz.

---

On 3 June 2026, Anthropic published the LLM ATT&CK Navigator, a year of real attacker activity from 832 accounts it banned for malicious use, mapped to MITRE ATT&CK. It is the clearest public account so far of what attackers actually ask an AI model to do. We read it closely, compared notes, and two things stood out that matter for detection.

First, attacker AI use is still concentrated where you cannot see it: on the attacker's own machines, building the malware and tooling that gets them through your defenses.
Second, when AI is used to support post-compromise behaviors, they are the same behaviors that ordinary attackers are performing, and the same behaviors that network and identity detection is built to catch.

The rest of this post walks through three takeaways from the data, and where detection holds up once an attack reaches your environment.

Takeaway 1: AI is mostly used to get in, and it is working

The single most common use of AI in the dataset was developing capabilities, mainly writing malware: 69% of the actors studied. Close behind were obfuscating code (64.7%), pulling data from the attacker's own systems (55.9%), and impairing defenses (54.9%). Defense evasion was the largest tactic overall, present for 84.4% of actors.

Put those together and a picture forms. Most attacker AI use today is aimed at one thing: building malware that gets past endpoint defenses and into an environment. It is preparation, and it happens on infrastructure you do not own. It does not cross your network, touch your identity provider, or land in your logs. You cannot detect a model writing a payload on a machine you do not control, and you do not need to.

But there is a consequence you do need to plan for. If AI makes attackers better and faster at building what beats EDR, more of them get in. The realistic posture is to assume compromise. The question stops being whether something will get through and becomes what you can see once it does. That is the case for a detection layer that works inside the environment, after initial access, on behavior rather than on signatures.

Takeaway 2: AI is moving further down the kill chain

The early data is a snapshot, not a destination. Comparing the first half of the year with the second, the report shows attackers reaching for AI later in the operation, in the hands-on work that happens after they are in. Account discovery and automated exfiltration both rose in the second half.

This is the part that matters for a SOC, because the rare techniques are the dangerous ones. Using AI for lateral movement was the single strongest marker of a high-risk actor: the 54 actors who did it carried an average risk score of 56.4, nearly ten points above the mean of 46.8. At the technique level, the highest-risk actors leaned on remote services like SSH and SMB, valid accounts, credential dumping, and staging data to exfiltrate. Each was three to five times more common among them than across the rest.

One case makes it concrete. Anthropic describes GTG-1002, the operator behind an AI-run espionage campaign it disrupted in November 2025, which hit government and critical-infrastructure targets. Its technique list was unremarkable. What set it apart was orchestration: the operator ran Claude Code on a Kali Linux machine, wired penetration-testing tools in as MCP servers, and let the model scan, exploit a flaw to reach the internal network, harvest credentials, and move laterally, while a human only set direction. The reconnaissance and the path to a foothold were AI-driven from the start.

Active Directory reconnaissance, account discovery, lateral movement, exfiltration: this is the territory network detection specializes in. As more actors push AI into these later stages, and the trend is pointing that way, network and identity detection becomes more relevant, not less.

The common thread: identity

There is a connective theme across both halves, and it is identity. Cloud and AI workflows run through users, service principals, and managed identities. The Anthropic data shows valid accounts as one of the techniques most associated with high-risk actors, and an agent acting inside your environment still has to authenticate, reach services, and move like an account.

That matters because it does not depend on any one product surface. Whether an attacker is human or an agent, whether the workload is a cloud console or an AI service, the giveaway is the same: an identity used from the wrong place, reaching something it has no history with, behaving unlike itself. Detecting suspicious identity and behavior is where this whole story converges.

Where the coverage lands

Because Anthropic mapped its findings to MITRE technique IDs, and Vectra tags every detection with the same IDs, the two line up. On the post-compromise techniques that mark the dangerous actors, coverage is strong. The AI-heavy early stages are not yours to see, but the moment an AI-enabled operation starts operating inside your network, it produces behavior Vectra is built to detect.

High-risk technique	ATT&CK ID	What it looks like in your environment	Where it surfaces
Remote Services (SSH/SMB)	T1021	An account uses admin protocols to reach systems it has no history with	Suspicious Remote Execution, Suspicious Remote Desktop, Suspicious Admin
Valid Accounts	T1078	A real account is used from an unusual host, location, or against an unusual service	Privilege Anomaly (Unusual Host, Unusual Account on Host, Unusual Service, Unusual Trio), Azure AD Suspicious Sign-on, Azure AD Suspected Compromised Access
OS Credential Dumping	T1003	Credential material is harvested from hosts	Privilege Anomaly detections
Archive Collected Data	T1560	Data is staged and compressed before it leaves	Data Smuggler, Smash and Grab
Exploitation of Remote Services	T1210	An internal host pushes an exploit to another and pulls down a follow-on stage	Automated Replication, Stage Loader
Account Discovery	T1087	Rapid enumeration of accounts via Kerberos, SMB, LDAP, or RPC	Kerberos Account Scan, SMB Account Scan, Suspicious LDAP Query, RPC Recon
Automated Exfiltration	T1020	Large or scripted outbound transfer to an untrusted destination	Smash and Grab, Data Smuggler

One technique needs a footnote. Web shell deployment (T1505.003) does not have a dedicated Vectra detection: the install is often endpoint-side, though the command-and-control channel it creates surfaces in Hidden HTTPS Tunnel or External Remote Access. The domain-replication attacks DCSync and DCShadow surface in Suspicious Active Directory Operations, which Vectra maps to T1207 and the Credential Access tactic.

GTG-1002 fits the pattern. SSH remote services, exploitation of remote services, credential harvesting, and archive-and-stage are behaviors these detections are built to surface, whether the operator is a person or a model acting through an MCP server.

AI-enabled attacker techniques, and where they become detectable

How a year of attacker AI use maps to behavior inside a network.

Detected as in-network behavior Partial or adjacent coverage Attacker-side, not observable in your environment

Detected as in-network behavior 7 techniques

T1021

Remote Services (SSH/SMB)high-risk

Lateral Movement · strongest high-risk marker

T1078

Valid Accountshigh-risk

Access

T1003

OS Credential Dumpinghigh-risk

Credential Access

T1560

Archive Collected Datahigh-risk

Collection

T1210

Exploitation of Remote Services

Lateral Movement

T1087

Account Discoveryrising

Discovery · up in second half of the year

T1020

Automated Exfiltrationrising

Exfiltration · up in second half of the year

Partial or adjacent coverage 2 techniques

T1505.003

Web Shellhigh-risk

Persistence

No dedicated detection; the command-and-control channel it creates is still visible in the network.

T1207

Rogue Domain Control (DCSync / DCShadow)

Credential Access

Adjacent credential-access coverage.

AI-heavy, but attacker-side and not observable in your environment 5 techniques

T1587

Develop Capabilities

Resource Development · 69% of actors

Includes malware development (T1587.001). Happens on the attacker's own machine.

T1027

Obfuscated Files or Information

Defense Evasion · 64.7%

Endpoint / attacker-side.

T1005

Data from Local System

Collection · 55.9%

Endpoint / attacker-side.

T1562

Impair Defenses

Defense Evasion · 54.9%

Largely endpoint / EDR territory.

T1055

Process Injection

Defense Evasion · 30.3%

Endpoint / EDR territory.

Source: Anthropic, LLM ATT&CK Navigator (3 June 2026), 832 accounts mapped to MITRE ATT&CK v18. See the table for the Vectra detections behind each technique.

Takeaway 3: the model behind the attack is about to get harder to see

There is a reason to expect the early-stage blind spot to widen. Providers like Anthropic are investing heavily in safeguards, monitoring, and detection to stop their models being used for offensive work, and they are getting better at it. The likely response is that some actors move to open-source models they can run themselves, with the guardrails removed and no provider watching. Reporting like this Navigator exists because Anthropic can see its own systems. Self-hosted models offer no such window, and visibility into that shift is poor today.

This is the strongest argument for anchoring detection to behavior rather than the tool. A report tied to one provider's data cannot be your detection strategy, because the next operator may not use that provider at all. What does not change is what the attacker has to do inside your environment. An account that signs in from the wrong place and reaches a service it has never touched looks the same whether a human, a frontier model, or a self-hosted one is driving it. Anthropic is candid that ATT&CK does not yet capture what made GTG-1002 exceptional, the autonomous orchestration that chained techniques together at machine speed, and says it is in conversation with MITRE about adding categories for agentic behavior. Until that vocabulary exists, the detection that holds up is the one that never depended on naming the tool.

What doesn’t change

Attackers are using AI to get in faster and, increasingly, to do the hands-on work once inside. That makes two things true at once. You will not see the part that happens on the attacker's machines, and you do not need to. You can see the part that happens in yours, if you are watching behavior across network and identity rather than chasing whichever tool produced it. Assume compromise, watch behavior, and AI on the attacker's side changes the speed of the problem, not the shape of the answer.

What Anthropic's Attacker-AI Data Means for Detection