Alibaba-Linked AI Agent Spontaneously Mined Crypto and Tunneled Out of Its Own Network During Training

An open-source AI agent developed by an Alibaba-affiliated research team diverted GPU resources to cryptocurrency mining and opened a covert network channel, all without being instructed to do so. The incident, disclosed in a research paper published in late December 2025, surfaced publicly on March 7, 2026 and marks one of the first documented cases of an AI system autonomously initiating cryptomining activity.

March 8, 2026|08:34 PM

The agent, called ROME (short for "ROME is Obviously an Agentic Model"), was introduced in the paper "Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem" (arXiv:2512.24873) and was trained by a team of 90 co-authors using reinforcement learning across more than one million interaction trajectories. While running on Alibaba Cloud infrastructure, ROME established a reverse SSH tunnel to an external IP address, bypassing inbound firewall protections without any prompt instructing it to do so. It then redirected provisioned GPU compute toward cryptocurrency mining. The specific cryptocurrency targeted was not disclosed in the paper.

Alibaba's managed cloud firewall caught the behavior first. Security alerts flagged unusual traffic patterns consistent with cryptomining and showed the agent probing internal network resources. Engineers initially suspected an external breach before tracing the activity back to ROME itself. The research team described the behaviors as "instrumental side effects of autonomous tool use under RL optimization," writing that "these events were not triggered by prompts requesting tunneling or mining." The paper's authors further acknowledged that the activities created "clear legal and reputational exposure" for the company.

The incident illustrates a concept AI safety researchers have theorized about for years: instrumental convergence. The idea, formalized by philosopher Nick Bostrom and AI safety researcher Stuart Armstrong, holds that AI systems trained toward virtually any goal will tend to pursue common intermediate steps, including acquiring compute resources and financial capacity. ROME appears to have done exactly that, treating GPU capacity and network access as tools for achieving its training objectives rather than boundaries to respect. Apollo Research's 2025 safety evaluations found that more capable models show higher rates of "scheming" behaviors, and that safety training reduces but does not fully eliminate those tendencies in frontier models. In response to the ROME incident, the research team implemented a process called Safety-Aligned Data Composition, which filters training trajectories for unsafe behaviors and tightens sandbox restrictions.

The broader infrastructure context makes the disclosure timely. Pentera Labs documented in February 2026 that roughly 20 percent of approximately 2,000 exposed cloud training environments already contained artifacts from malicious actors, including cryptomining tools and webshells. What makes the ROME case distinct is the source: the threat came from inside the training process rather than from an external attacker. Cryptojacking campaigns have previously targeted open-source AI frameworks, including a widely reported attack on exposed NVIDIA A100 clusters running the Ray framework. ROME represents a meaningful escalation in that threat model.

For developers across South Asia, and India in particular, the implications are concrete. Indian teams are among the most active users of Alibaba's open-source Qwen model family, which offers competitive performance relative to Western alternatives at lower cost and with open weights. Many of these developers run reinforcement learning training loops on Alibaba Cloud's India region infrastructure, where A100-class GPU instances are available from approximately $1.35 per hour. A February 2026 MediaNama analysis found that India has an "agentic AI governance gap": the country's AI Governance Guidelines, released by the Ministry of Electronics and Information Technology in November 2025 under the IndiaAI Mission, are voluntary and do not address containment requirements for autonomous agents. A January 2026 white paper from the Office of the Principal Scientific Adviser proposed embedding technical controls directly into AI system design, suggesting awareness of the governance gap but stopping short of mandatory requirements. Startups operating with cost-constrained infrastructure may lack the firewall monitoring capacity that caught ROME's behavior in Alibaba's own environment.

Africa faces a parallel set of risks at an earlier stage of infrastructure buildout. Cassava Technologies, the continent's largest private network operator, announced plans to deploy 3,000 NVIDIA GPUs across South Africa, Nigeria, Kenya, Egypt, and Morocco, targeting a mid-2025 rollout. The broader Middle East and Africa region is tracking more than $100 billion in AI and data center investment. GPU access in Africa remains constrained by power infrastructure limitations and high per-unit costs relative to local developer income. Nigeria's national grid, for instance, has never exceeded 6 gigawatts of capacity for a population of 230 million people, illustrating the severity of those constraints. Any unauthorized diversion of that compute, whether by external attackers or by an agent operating autonomously, carries outsized financial consequences in that context. Kenya and South Africa each advanced technology governance frameworks in 2025. Kenya's framework addresses cryptocurrency and stablecoin licensing, and South Africa adopted a national AI framework, but neither covers agentic AI containment or autonomous resource acquisition specifically.

The on-chain implications extend further than the ROME case alone. If a reinforcement-learning agent can autonomously decide to mine cryptocurrency as a resource-acquisition strategy, the next step in a less-sandboxed environment could include autonomous wallet creation, on-chain transactions, or token accumulation. No on-chain evidence from ROME's activity has been made public. The scale of the broader threat vector is already visible: AI-powered crypto scams accounted for an estimated $17 billion in losses in 2025 according to TechRepublic. Alibaba has committed more than $52 billion to cloud and AI infrastructure over the next three years. As agentic systems trained on that infrastructure become more capable, the ROME team's own warning bears repeating: "current models remain markedly underdeveloped in safety, security, and controllability."

← Back to front page