Conceptual image illustrating cybersecurity threats, focusing on hackers exploiting auth flows and data protection vulnerabilities.

The revelation that Chinese state-sponsored hackers maintained unauthorized access to critical infrastructure for an entire decade represents one of the most significant intelligence failures in recent cybersecurity history. Operation Highland, as researchers have dubbed it, demonstrates a fundamental breakdown in the assumptions organizations make about network isolation and security monitoring. (Source: BleepingComputer)

Consider what ten years of undetected access actually means for an organization. Every strategic decision, every operational change, every security upgrade during that period was potentially visible to foreign adversaries. The Velvet Ant threat group didn't just steal data—they embedded themselves so deeply into authentication systems that they could observe administrative activity in real-time, watching password changes, security updates, and incident response activities as they happened.

The attack began in 2016, targeting vulnerable internet-facing systems before executing what security professionals considered nearly impossible: pivoting into an air-gapped network with no direct external connection. This wasn't a smash-and-grab operation. The attackers methodically built a complex infrastructure that survived through multiple technology refresh cycles, staff turnovers, and presumably numerous security audits.

What makes this breach particularly alarming is the target: critical infrastructure networks that organizations specifically isolate to prevent exactly this type of compromise. Air-gapping represents the ultimate security measure—physically separating sensitive systems from any internet connection. Organizations invest millions in maintaining these isolated environments, accepting significant operational friction in exchange for what they believe is absolute protection.

Key Insight: What makes this breach particularly alarming is the target: critical infrastructure networks that organizations specifically isolate to prevent exactly this type of compromise.

The business implications extend far beyond data theft. For a decade, competitive advantages disappeared as intellectual property flowed to adversaries. Strategic plans became transparent to foreign governments. Operational technology controlling physical infrastructure remained under potential hostile control. Every merger discussion, every expansion plan, every technological innovation during those ten years potentially informed competitor strategies and state-level economic planning.

From a regulatory perspective, this represents a catastrophic compliance failure. Critical infrastructure operators face stringent reporting requirements under frameworks like NERC CIP for energy sectors or TSA directives for transportation. A decade-long breach means ten years of potentially false compliance attestations, each carrying significant legal liability. Board members who signed off on security postures now face questions about their oversight effectiveness.

The persistence demonstrated here challenges fundamental security assumptions. Organizations typically plan for breaches lasting days or weeks, not decades. Incident response playbooks assume attackers will eventually make mistakes, trigger alerts, or lose access during system updates. Velvet Ant proved these assumptions wrong by replacing core authentication components with trojanized versions, ensuring their access survived password rotations, system patches, and even security tool deployments.

Perhaps most concerning is what this breach reveals about detection capabilities. The targeted organization wasn't some small entity with minimal security resources—maintaining air-gapped critical infrastructure requires substantial investment and expertise. Yet for ten years, routine security operations, vulnerability assessments, and compliance audits failed to detect an active adversary with complete visibility into administrative functions. This suggests that current detection methodologies fundamentally misunderstand how sophisticated actors operate within compromised environments.

The Authentication Hijacking Technique: Breaking Into Systems That Should Be Unreachable

The sophistication of Operation Highland lies not in its initial compromise, but in how attackers transformed ordinary authentication components into surveillance infrastructure. The attack chain reveals a methodical approach to subverting trust at every layer of the authentication stack.

GS-Netcat served as the foundation for initial persistence. This modified reverse shell masqueraded as a legitimate system component, connecting to hardcoded relay domains that provided encrypted remote shell access. The tool achieved persistence through either malicious systemd services or startup script modifications, ensuring the attackers maintained access even after system reboots. This wasn't just a backdoor—it was an encrypted command channel that blended seamlessly with normal system operations.

The attackers then deployed custom SOCKS5 proxies to create internal pivot points. These proxies ran as daemons disguised as 'smbd -D' processes, using different filenames and ports on each compromised host. This variation prevented pattern-based detection while enabling traffic tunneling to internal systems that had no direct internet connectivity. Every compromised server became a potential gateway deeper into the network.

The most ingenious aspect involved chaining together legitimate infrastructure components to reach the isolated network. Attackers modified Nginx configurations on internet-facing servers to proxy specially crafted HTTP requests to backend servers. These backend servers forwarded requests to fcgiwrap processes listening on separate ports. The FastCGI wrapper acted as an execution bridge, processing HTTP POST requests and launching a custom binary named 'uptime'.

This 'uptime' tool wasn't for monitoring system availability—it established SSH connections to systems within the isolated critical infrastructure network using parameters supplied in the HTTP requests. Through this chain of modified configurations and custom tools, attackers created a remote execution path into supposedly air-gapped environments without ever establishing direct connections.

Once inside the isolated network, the focus shifted to long-term credential harvesting through PAM module manipulation. The attackers replaced legitimate 'pam_unix.so' modules with backdoored versions that accepted hardcoded passwords while harvesting user credentials. Nine distinct variants of these malicious PAM modules were discovered, each compiled in separate build environments—indicating significant resources and planning.

Two PAM module variants stood out for their specialized functions: one acted purely as a backdoor accepting hardcoded authentication, while another focused exclusively on credential collection. This division of labor suggests operational discipline and awareness of detection risks.

The attackers also trojanized OpenSSH components including ssh, sshd, and scp. These modified versions captured credentials, logged commands entered during SSH sessions, and stored collected data locally for future retrieval. Every administrative action became observable—every login attempt, every command executed across compromised hosts.

By controlling both PAM and OpenSSH components, the attackers achieved something remarkable: persistence that survived password changes and session terminations. Traditional containment measures became ineffective because the authentication process itself was compromised. Even if administrators changed passwords or terminated suspicious sessions, the attackers retained access through their hardcoded backdoors while continuing to harvest new credentials as they were used.

This wasn't just network compromise—it was the systematic subversion of trust mechanisms that organizations rely on for basic security. The attackers didn't break authentication; they became part of it.

Detection Gaps That Enabled a Decade of Espionage

The decade-long breach reveals fundamental blind spots in how organizations monitor authentication infrastructure, particularly in environments considered "safe" due to network isolation. Security teams hunting for similar compromises should immediately examine specific system behaviors that indicate authentication hijacking rather than traditional malware signatures.

The most glaring detection opportunity centers on fcgiwrap processes running on unexpected ports. FastCGI wrappers typically operate on standard ports for web application processing. When Velvet Ant configured fcgiwrap to listen on separate ports as an execution bridge, this created an anomalous pattern that should trigger alerts. Security teams need to baseline their fcgiwrap deployments and flag any instances running outside expected parameters, especially those receiving HTTP POST requests containing SSH connection parameters.

The masquerading of malicious processes presents another detection vector. The SOCKS5 proxy running as smbd -D with varying filenames across hosts creates a pattern mismatch between process names and actual behavior. Real SMB daemons generate specific network traffic patterns and file access behaviors. A process claiming to be smbd while establishing SOCKS5 connections to external domains represents a clear anomaly that behavioral monitoring should catch.

Nine distinct PAM module variants deployed across the environment should have triggered file integrity monitoring alerts. Each variant represented a separate compilation in different build environments, creating unique file hashes. Organizations rarely update PAM modules, and when they do, it happens through package management systems that leave audit trails. Direct replacement of pam_unix.so files outside the package management workflow indicates compromise.

The binary named "uptime" functioning as an SSH relay tool represents another missed detection opportunity. The legitimate uptime command displays system running time and never establishes SSH connections. Process monitoring that correlates binary names with network behavior would immediately flag this discrepancy. Additionally, SSH connections originating from web server processes through Nginx configurations modified to proxy requests internally violate standard architectural patterns.

Why did these indicators go unnoticed in an isolated network? The assumption of safety through air-gapping created a monitoring vacuum. Organizations often deploy minimal logging and detection capabilities in supposedly isolated environments, believing physical separation provides adequate protection. This creates perfect conditions for persistent threats that bridge the gap through legitimate administrative channels.

Limited behavioral baselines in isolated networks compound the problem. Without understanding normal authentication patterns, security teams cannot identify anomalies. The modified OpenSSH components capturing credentials and logging commands appeared legitimate because no baseline existed to compare against. Organizations must establish authentication behavior profiles even—especially—in isolated environments.

The storage of harvested credentials locally for future retrieval created file system artifacts that routine forensic sweeps should detect. Unexpected data files in system directories, particularly those growing over time with structured credential data, indicate active collection operations. Regular integrity checks comparing system directories against known-good states would reveal these collection points.

Security teams should immediately hunt for HTTP requests to backend servers that trigger unexpected process execution, systemd services with generic names running reverse shells, and authentication modules accepting hardcoded passwords. These patterns transcend specific malware families and indicate fundamental authentication compromise regardless of the threat actor involved.

Immediate Response Actions for Critical Infrastructure Operators

Critical infrastructure operators discovering potential Velvet Ant compromise need immediate action plans that address the unique challenges of authentication hijacking. The complexity of removing deeply embedded authentication modifications requires careful sequencing to avoid locking out legitimate administrators while eliminating attacker access.

Immediate Actions (0-24 Hours)

Your first priority involves auditing authentication components without alerting potential attackers still monitoring your systems. Begin by creating offline copies of /lib/security/pam_unix.so and /usr/lib/x86_64-linux-gnu/security/pam_unix.so from all Linux systems. Compare these binaries against known-good versions from vendor repositories using cryptographic hashes—compromised versions will show different signatures even if file sizes match.

Simultaneously, inventory all FastCGI wrapper processes across your infrastructure. Run ps aux | grep fcgiwrap and document which ports each instance uses. Velvet Ant configured fcgiwrap to listen on non-standard ports as execution bridges into isolated networks. Any fcgiwrap process not associated with legitimate web applications requires immediate investigation.

Check for modified Nginx configurations that proxy requests to unexpected backend servers. Review /etc/nginx/sites-enabled/ for proxy_pass directives pointing to internal FastCGI processes. Document any configuration that forwards HTTP requests to fcgiwrap instances, particularly those accepting POST data with SSH connection parameters.

Short-Term Response (1-7 Days)

Deploy file integrity monitoring specifically targeting OpenSSH binaries. Create baseline checksums for /usr/bin/ssh, /usr/sbin/sshd, and /usr/bin/scp across all systems. Velvet Ant replaced these components with trojanized versions that captured credentials and logged SSH session commands. Any deviation from baseline checksums indicates potential compromise.

Enable verbose logging for PAM authentication events by adding debug to PAM configuration lines in /etc/pam.d/ files. Monitor these logs for authentication attempts using hardcoded passwords—a signature behavior of Velvet Ant's backdoored PAM modules. Set up automated alerts for successful authentications that bypass normal password verification flows.

Search historical logs for connections to relay domains used by GS-Netcat shells. While specific domains weren't disclosed, look for persistent outbound connections from system processes that shouldn't generate network traffic. Focus on connections disguised as legitimate system components, particularly those establishing encrypted channels on non-standard ports.

Long-Term Remediation (7-30 Days)

Establish a parallel authentication infrastructure before removing compromised components. Build clean authentication servers in an isolated environment, test all authentication flows, and prepare detailed rollback procedures. This prevents the operational outages that occur when removing backdoored authentication components breaks legitimate access.

Implement behavioral analytics specifically for system administration tools. Monitor for unusual patterns in SSH usage, unexpected systemd service creation, and modifications to startup scripts. Velvet Ant achieved persistence through both systemd services and startup script modifications—behavioral monitoring catches these techniques regardless of specific implementation.

Redesign network segmentation to eliminate HTTP-based bridges into isolated environments. The attack chain relied on chaining Nginx proxy configurations to reach air-gapped networks through HTTP requests. Implement strict egress filtering that blocks any HTTP/HTTPS traffic from isolated segments, forcing all administrative access through dedicated jump servers with enhanced monitoring.

Why Air-Gapped Networks Aren't Enough: Rethinking Isolation Strategy

The breach of Velvet Ant's supposedly isolated target reveals a harsh reality: network isolation creates dangerous blind spots when organizations assume physical separation equals security. The ten-year compromise demonstrates that determined adversaries view air gaps not as barriers but as challenges requiring creative solutions.

Consider what "isolated" actually meant in this case. The victim organization believed their critical infrastructure network had no direct external path, yet attackers maintained persistent access through a carefully orchestrated chain of compromised systems. The modified Nginx configurations created an HTTP-based execution bridge, transforming web servers into command relays that reached across network boundaries without traditional connectivity.

Key Insight: The victim organization believed their critical infrastructure network had no direct external path, yet attackers maintained persistent access through a carefully orchestrated chain of compromised systems.

This architectural subversion exposes three uncomfortable truths about isolation strategies. First, any system that processes data from less-trusted networks becomes a potential bridge—whether through shared storage, administrative jump boxes, or even USB-based data transfers. Second, the complexity of modern infrastructure means true isolation rarely exists; somewhere, somehow, data flows between networks, and attackers only need to find one path. Third, isolated networks often receive less security scrutiny precisely because they're considered "safe," creating perfect conditions for long-term persistence.

The implications extend beyond technical architecture. When you assume network isolation provides security, you make strategic decisions that amplify risk. Security teams allocate fewer resources to monitoring "protected" segments. Patch management becomes less urgent when systems aren't "internet-facing." Authentication mechanisms receive minimal hardening because "only trusted users can reach these systems." Each assumption becomes an exploitable weakness.

Critical infrastructure operators must fundamentally rethink their approach. Instead of trusting isolation, implement zero-trust principles within supposedly secure networks. Every authentication request should be verified as if it originated from an untrusted source. Every process should be monitored for anomalous behavior regardless of network segment. This means deploying endpoint detection capabilities on isolated systems, implementing privileged access management for administrative functions, and establishing behavioral baselines for all authentication components.

The detection challenge becomes particularly acute in isolated environments that can't receive real-time threat intelligence feeds. Without external updates about emerging threats or known indicators of compromise, security teams must rely entirely on behavioral analysis and anomaly detection. This requires building comprehensive activity profiles for every critical system—understanding not just what processes should run, but when, how often, and with what parameters.

Organizations should establish "canary" authentication modules—specially configured PAM components or SSH binaries that generate alerts when accessed or modified. These act as tripwires for authentication hijacking attempts, providing early warning even when attackers successfully compromise other components. Deploy file integrity monitoring specifically tuned for authentication libraries, with immediate alerts for any modifications to critical paths.

Most importantly, assume breach even within isolated networks. Regular threat hunting exercises should specifically target authentication flows, looking for evidence of credential harvesting or authentication bypass. Security teams need offline analysis capabilities for authentication logs, comparing patterns across time to identify subtle changes that indicate compromise. The goal isn't preventing initial access—it's detecting and responding before attackers achieve the deep persistence that enabled a decade of surveillance.

Attribution and Threat Actor Capabilities: What Velvet Ant's Success Reveals

The nine distinct PAM module variants discovered by Sygnia researchers point to something more troubling than a single compromise—they indicate an industrial-scale operation with dedicated development infrastructure. Each variant compiled in separate build environments suggests Velvet Ant maintains multiple development teams, testing facilities, and quality assurance processes specifically for creating authentication backdoors.

This level of resource commitment aligns with patterns observed in other Chinese state-sponsored campaigns targeting critical infrastructure globally. The operational discipline required to maintain undetected presence for a decade while continuously updating malware variants demonstrates capabilities typically associated with nation-state intelligence services rather than criminal groups or hacktivist collectives.

The targeting of critical infrastructure through authentication hijacking reveals strategic intelligence collection priorities. Rather than pursuing quick data theft or ransomware deployment, Velvet Ant invested years establishing persistent visibility into administrative operations. This patience suggests collection requirements focused on understanding operational procedures, maintenance schedules, emergency response protocols, and system interdependencies—intelligence valuable for both economic espionage and potential future disruption operations.

Consider what administrative visibility actually provides to a foreign intelligence service. Every password reset reveals security practices. Every system upgrade exposes technology refresh cycles. Every emergency response demonstrates crisis management capabilities. The attackers didn't just steal data; they studied how the organization operates under normal and stressed conditions.

The sophistication of the HTTP-based execution bridge into isolated networks demonstrates advanced understanding of enterprise architecture assumptions. Most organizations design security controls assuming attackers need direct network paths. Velvet Ant's approach—chaining Nginx proxy modifications with FastCGI wrappers—exploits the trust relationships between internet-facing and internal systems that organizations consider separate security zones.

This technique likely extends beyond this single victim. Critical infrastructure sectors sharing similar architectural patterns—energy grids with SCADA networks, water treatment facilities with operational technology segments, transportation systems with safety-critical networks—face comparable risks. The attack methodology suggests Velvet Ant possesses detailed knowledge of how different critical infrastructure sectors implement network segmentation.

The timeline itself reveals concerning intelligence capabilities. Beginning operations in 2016 and maintaining them through 2026 spans multiple technology refresh cycles, security audits, and presumably penetration tests. Either the attackers possessed insider knowledge of security assessment schedules, or their operational security exceeded the detection capabilities of professional security evaluations.

The connection to previous Velvet Ant operations against F5 BIG-IP devices and Cisco NX-OS systems suggests a broader campaign targeting network infrastructure control points across multiple organizations. These aren't opportunistic compromises—they represent deliberate selection of technologies that provide maximum visibility with minimal detection risk.

The geopolitical context matters here. Chinese intelligence services historically prioritize long-term intelligence collection over immediate exploitation. The decade-long presence aligns with strategic patience demonstrated in other campaigns attributed to Chinese actors, where maintaining access takes precedence over short-term gains.

For critical infrastructure operators, this changes risk calculations fundamentally. Traditional threat modeling assumes attackers want data or disruption. Velvet Ant's approach suggests adversaries may already be inside, content to watch and learn, building comprehensive operational intelligence for purposes we may not yet understand.

Table of contents

Top hits