The attack's success hinged on a devastating combination of social engineering and algorithmic manipulation that fooled a quarter-million developers in just 18 hours. The malicious repository didn't just impersonate OpenAI's legitimate Privacy Filter model—it weaponized the very trust signals developers rely on when selecting AI tools. (Source: The Hacker News)
Developers believed they were downloading OpenAI's official Privacy Filter, a highly anticipated open-weight model designed to detect and redact personally identifiable information from unstructured text. The legitimate tool, released in April 2026, promised to help organizations build AI applications with robust privacy protections—a critical capability as regulatory scrutiny intensifies around AI data handling.
Instead, victims received a sophisticated Rust-based information stealer wrapped in convincing packaging. The attackers copied OpenAI's model description verbatim, creating an almost perfect duplicate under the name Open-OSS/privacy-filter that appeared indistinguishable from the authentic openai/privacy-filter repository.
The #1 trending position on Hugging Face transformed this deception into a supply chain catastrophe. Platform algorithms interpret high download counts and likes as quality signals, creating a feedback loop where popularity breeds more popularity. The repository accumulated approximately 244,000 downloads and 667 likes within its 18-hour lifespan—numbers suspected to be artificially inflated through automated processes to game the trending algorithm.
This algorithmic trust exploitation represents a fundamental vulnerability in how modern developers discover and adopt AI models. When a repository hits the trending list, it gains implicit platform endorsement. Developers racing to implement the latest AI capabilities often prioritize speed over verification, especially when a model appears to come from a trusted source like OpenAI.
Key Insight: This algorithmic trust exploitation represents a fundamental vulnerability in how modern developers discover and adopt AI models.
The malware's deployment mechanism targeted the standard workflow of AI developers. Users were instructed to clone the repository and run either a batch script for Windows or a Python script for Linux and macOS systems—a completely normal process for configuring model dependencies. The Python loader disabled SSL verification, decoded Base64-encoded URLs from JSON Keeper, and executed PowerShell commands that downloaded additional payloads from attacker-controlled infrastructure.
The immediate business impact extends far beyond individual compromised machines. Every infected developer represents a potential breach point into their organization's AI and machine learning pipelines. The stealer harvested Discord tokens, cryptocurrency wallets, browser credentials, FileZilla configurations, and wallet seed phrases—the exact credentials attackers need to pivot into production environments.
"Despite using a scheduled task, this stage establishes no persistence: the task is destroyed before any reboot. It is being used as a one-shot SYSTEM-context launcher."
Organizations now face the nightmare scenario of potentially compromised AI development environments. Stolen credentials could grant attackers access to training datasets, model weights, API keys for cloud AI services, and proprietary algorithms. The stealer's ability to capture screenshots and system metadata means attackers potentially observed sensitive development work, internal documentation, and architectural diagrams.
The attack's connection to broader campaigns compounds the risk. The same infrastructure served ValleyRAT (also known as Winos 4.0), a modular remote access trojan attributed to the Chinese hacking group Silver Fox. This suggests victims aren't just dealing with credential theft—they may have persistent backdoors enabling long-term espionage or sabotage of AI systems.
Supply Chain Attack: Repository Impersonation Flow
Attack Chain: From Fake Repository to ValleyRAT Execution
The attack began with careful repository construction that mimicked OpenAI's legitimate project down to the smallest detail. Attackers registered under the username "Open-OSS" and created their malicious repository with the nearly identical path "Open-OSS/privacy-filter" to exploit users searching for OpenAI's official release. The threat actors copied the entire model card verbatim from the legitimate repository, ensuring that anyone comparing descriptions would see identical text about detecting and redacting personally identifiable information.
Key Insight: Attackers registered under the username "Open-OSS" and created their malicious repository with the nearly identical path "Open-OSS/privacy-filter" to exploit users searching for OpenAI's official release.
The social engineering hook proved devastatingly effective because developers were actively seeking this specific tool. OpenAI had announced Privacy Filter in April 2026 as a critical component for building compliant AI applications, creating immediate demand among organizations racing to implement privacy protections. The malicious repository appeared at the perfect moment when developers were searching for this exact capability.
Installation instructions directed Windows users to run "start.bat" while Linux and macOS users executed "loader.py" - both appearing as standard dependency configuration scripts. The Python loader immediately disabled SSL verification and decoded a Base64-encoded URL pointing to JSON Keeper, a public paste service. This dead drop resolver architecture allowed attackers to modify payloads without touching the repository itself, maintaining the appearance of legitimacy while adapting their attack.
The PowerShell command retrieved from JSON Keeper initiated contact with "api.eth-fastscan[.]org" to download a batch script. This second-stage downloader elevated privileges through a User Account Control prompt - a request users typically approve when installing legitimate software. The script then configured Microsoft Defender exclusions, effectively blinding the primary security tool on Windows systems.
The scheduled task mechanism launched the final payload with SYSTEM privileges, executing the information stealer before immediately destroying the task. This one-shot approach avoided persistent artifacts that might trigger security scans. The stealer component targeted Discord tokens, cryptocurrency wallet configurations, FileZilla settings containing FTP credentials, and seed phrases for digital wallets. Browser data extraction focused on both Chromium and Gecko-based browsers, harvesting saved passwords, cookies, and autofill data.
Anti-analysis features checked for debuggers, sandboxes, and virtual machine indicators before proceeding with data theft. The malware disabled Windows Antimalware Scan Interface and Event Tracing for Windows, eliminating two critical behavioral detection mechanisms. Screenshots captured active desktop content while the stealer systematically collected system metadata including hardware specifications, installed software, and network configurations.
The same infrastructure served "o0q2l47f.exe," which established communication with "welovechinatown[.]info" - a command-and-control server associated with ValleyRAT campaigns. This connection revealed the broader operation: the npm package "trevlo" published by user "titaniumg" on April 4, 2026, delivered the same ValleyRAT variant through a postinstall hook. The JavaScript loader spawned obfuscated PowerShell that fetched and executed "CodeRun102.exe," the Winos 4.0 stager binary.
CodeRun102.exe employed hidden window execution and Zone Identifier removal to bypass SmartScreen warnings. Process detachment ensured the malware continued running even if the parent process terminated. ValleyRAT's modular architecture enabled attackers to deploy additional capabilities based on victim value - keyloggers for capturing credentials, screen recorders for monitoring activity, and file transfer modules for selective data theft.
The shared infrastructure between Hugging Face and npm attacks suggests coordinated supply chain targeting by the Silver Fox group, known for combining phishing and SEO poisoning with developer-focused attacks.
Immediate Detection & Response: What to Do in the Next 24 Hours
Organizations that downloaded from Hugging Face between April and May 2026 face a critical window for containing potential compromise. The malware's lack of persistence mechanisms means rapid action can prevent long-term damage, but the stealer has already harvested credentials and sensitive data from infected systems.
Immediate Actions (Next 4 Hours)
Search your environment for these specific executable files that indicate active infection: CodeRun102.exe and o0q2l47f.exe. These binaries represent the Winos 4.0 stager and secondary payload respectively. Check scheduled tasks for any PowerShell scripts created recently that execute from temporary directories or contain Base64-encoded commands.
Query your proxy logs and DNS resolution records for connections to these command-and-control domains: api.eth-fastscan.org, recargapopular.com, and welovechinatown.info. Any hits indicate active or attempted data exfiltration. The stealer transmits harvested data in JSON format to recargapopular.com, making this domain particularly critical for identifying compromised systems.
Examine npm package installation logs for the trevlo package installed by user "titaniumg" after April 4, 2026. This malicious Node.js library served as an alternative infection vector for the same threat actors. Check for postinstall hooks that spawn Base64-encoded PowerShell commands in your development environments.
Critical File System Indicators (Next 12 Hours)
The malware targets specific data stores that require immediate credential rotation. Search for unauthorized access to FileZilla configuration files, which contain FTP credentials in plaintext. Examine Discord token storage locations and browser profile directories for signs of tampering—the stealer harvests saved passwords from both Chromium and Gecko-based browsers.
Cryptocurrency wallet extensions and seed phrase storage locations need immediate inspection. The stealer specifically targets wallet configurations and recovery phrases, potentially enabling complete wallet compromise. Any developer who stored wallet information on infected systems should transfer funds to new wallets immediately.
Review Microsoft Defender exclusion lists for recently added paths. The malware configures antivirus exclusions through elevated PowerShell commands to avoid detection. Remove any suspicious exclusions and run full system scans with updated definitions.
Network Segmentation Review (This Week)
AI and machine learning development environments require immediate network isolation from production systems. The attack demonstrates how threat actors target developer workstations as entry points to broader infrastructure. Implement jump boxes for accessing Hugging Face and similar repositories, preventing direct downloads to development machines.
Rotate all credentials for developers who accessed Hugging Face repositories during the infection window. The stealer captures system metadata alongside credentials, enabling attackers to craft targeted follow-up attacks using legitimate developer identities. Include API keys, SSH keys, and cloud service credentials in your rotation scope.
Detection Engineering Requirements
Configure SIEM rules to alert on PowerShell commands that disable AMSI or ETW—the malware attempts both to evade behavioral detection. Monitor for User Account Control elevation requests followed immediately by scheduled task creation, a pattern unique to this attack chain.
Deploy YARA rules to scan for the specific Base64 encoding patterns used in loader.py and start.bat files. The malware's reliance on JSON Keeper as a dead drop resolver creates a detectable pattern of external JSON fetches followed by PowerShell execution.
Why Hugging Face Ranking Enabled This Attack (And How to Verify Future Dependencies)
The Hugging Face platform's algorithmic ranking system created the perfect storm for this attack's success. The platform automatically promotes repositories based on download counts and engagement metrics—a trust signal that developers have learned to rely on when evaluating AI models. This automated promotion mechanism became the attack's primary amplification vector.
When the malicious Open-OSS/privacy-filter repository artificially inflated its metrics to approximately 244,000 downloads and 667 likes within 18 hours, Hugging Face's algorithms responded exactly as designed. The repository climbed to the #1 trending position, placing it prominently on the platform's homepage where millions of developers discover new models daily.
This algorithmic vulnerability exploits a fundamental assumption in open-source ecosystems: that popularity equals legitimacy. Developers routinely use download statistics as a proxy for trustworthiness, reasoning that thousands of peers wouldn't download malicious code. The attackers understood this psychology perfectly, creating a self-reinforcing cycle where fake popularity generated real downloads, which further boosted rankings.
The timing amplified the attack's effectiveness. OpenAI had announced Privacy Filter in April 2026 as a solution for detecting and redacting personally identifiable information—a capability developers urgently needed for compliance with evolving AI regulations. This created a perfect storm: high demand for a specific tool, a convincing impersonation, and algorithmic promotion that made the fake appear more popular than the legitimate version.
Repository verification requires examining multiple trust signals beyond download counts. Start by checking the repository creation date against the official announcement. The malicious Open-OSS/privacy-filter appeared shortly after OpenAI's legitimate release, a common pattern in typosquatting attacks. Legitimate projects from major organizations typically have months or years of commit history, not days.
Contributor analysis reveals critical discrepancies. The fake repository showed minimal contributor activity beyond initial uploads, while legitimate OpenAI projects display diverse contribution patterns from verified employees. Check the account age and contribution history of repository owners—newly created accounts pushing trending models warrant extreme skepticism.
Cross-reference against official channels before downloading any model claiming association with major AI companies. OpenAI maintains an official Hugging Face organization at "openai/" not variations like "Open-OSS/". Their blog, GitHub, and social media accounts announce new releases with direct links. The legitimate Privacy Filter resided at openai/privacy-filter, while the malicious version used Open-OSS/privacy-filter—a subtle but critical difference.
GPG signature verification provides cryptographic proof of authenticity when available. Major AI organizations increasingly sign their releases, allowing developers to verify that code hasn't been tampered with since publication. The absence of signatures on a supposedly official release from OpenAI, Google, or Meta should trigger immediate suspicion.
The attack revealed specific red flags that transcend this single incident. Watch for repositories that copy descriptions verbatim from legitimate projects, especially when combined with similar but not identical usernames. Be suspicious of models requiring custom installation scripts like start.bat or loader.py rather than standard pip or conda commands. Question why a model focused on privacy filtering would need system-level permissions or scheduled task creation.
Platform-level verification gaps enabled this attack's success. Hugging Face lacks namespace protection that would prevent users from creating organization names resembling established companies. Unlike npm's verified publisher badges or GitHub's verification checkmarks, Hugging Face provides no visual indicators distinguishing official organizational accounts from imposters.
Algorithmic Amplification Attack Chain
Affected Organizations: AI/ML Teams and Open-Source Contributors at Highest Risk
The attack's victim profile extends far beyond individual developers who downloaded the malicious repository. Organizations building privacy-focused AI applications face the most severe exposure, particularly those racing to implement PII detection capabilities in response to evolving data protection regulations.
AI/ML teams represent prime targets because their development environments contain the crown jewels of modern enterprises: training datasets, model weights, API keys for cloud compute resources, and credentials for model registries. A single compromised developer workstation provides attackers with access to proprietary algorithms worth millions in research investment, customer datasets subject to regulatory protection, and the computational resources needed to exfiltrate or manipulate these assets at scale.
The stealer's specific targeting of cryptocurrency wallets and browser-stored credentials amplifies risk for blockchain-focused AI projects. Many organizations developing decentralized AI systems maintain development wallets for testing smart contracts and paying for distributed compute resources. These wallets often contain substantial funds for gas fees and liquidity testing.
Secondary contamination through dependency chains poses an even greater threat. If developers incorporated the malicious loader.py into automated pipelines, Docker containers, or published packages, the infection could propagate silently through CI/CD systems. Organizations that forked the repository for internal modifications unknowingly preserved the malicious code in their private Git repositories, where it remains dormant until activated during model deployment.
Open-source maintainers who integrated what they believed was OpenAI's Privacy Filter into their projects created downstream vulnerabilities affecting every user of their libraries. The npm ecosystem already demonstrated this cascade effect when the trevlo package delivered ValleyRAT to over 2,300 systems. Similar propagation through PyPI, Conda, or corporate artifact repositories could expose thousands of additional organizations.
Research institutions and universities face unique exposure due to their collaborative development practices. Academic teams frequently share model repositories across institutions, and the prestige associated with using OpenAI's latest tools would have made the fake Privacy Filter particularly attractive for grant-funded privacy research projects.
Organizations must answer critical questions to assess their exposure:
- Did any team members access Hugging Face repositories named "Open-OSS/privacy-filter" or the six identified anthfu repositories between April and May 2026?
- Do any internal forks, mirrors, or cached copies of Hugging Face models contain loader.py files with Base64-encoded URLs pointing to JSON Keeper?
- Which developer machines have access to production model registries, training data storage, or cloud compute credentials?
- Have any automated model deployment pipelines pulled dependencies from Hugging Face without hash verification?
- Do containerized ML workflows include any models downloaded during the 18-hour window when the malicious repository ranked #1?
Financial services firms developing AI for fraud detection and healthcare organizations building patient data analysis tools face regulatory scrutiny if compromised systems processed sensitive data. The stealer's screenshot capabilities and file harvesting functions could have captured protected health information, payment card data, or personally identifiable information displayed during model testing.
The infrastructure overlap between this campaign and the Silver Fox group's previous operations suggests ongoing targeting of AI/ML development environments. Organizations that escaped this specific attack remain vulnerable to future campaigns exploiting the same trust mechanisms in model sharing platforms.
Attribution Context: HiddenLayer Research & Silver Fox Pattern
The discovery of this sophisticated supply chain attack came from HiddenLayer Research Team, a specialized AI security research group that focuses on threats targeting machine learning ecosystems. Their investigation revealed not just an isolated incident, but a coordinated campaign that connects to broader Chinese threat actor operations targeting Western AI development infrastructure.
The attribution to Silver Fox, a Chinese hacking group, carries significant weight given their established pattern of supply chain compromises. HiddenLayer's analysis linked the attack infrastructure through the domain "welovechinatown[.]info," which served as a command-and-control server for ValleyRAT (Winos 4.0) deployments across multiple campaigns.
What makes this attribution particularly concerning is Silver Fox's strategic pivot toward AI and machine learning targets. The group has historically focused on traditional phishing and SEO poisoning campaigns, but their expansion into poisoning AI model repositories signals a calculated shift in priorities. They recognize that AI development environments contain far more valuable assets than typical corporate networks—model weights representing years of research, training datasets worth millions, and API keys that control vast computational resources.
The connection between the Hugging Face attack and the npm package "trevlo" campaign demonstrates Silver Fox's multi-vector approach to compromising AI supply chains. Both campaigns shared infrastructure, with the "api.eth-fastscan[.]org" domain serving payloads across different package ecosystems. This infrastructure overlap suggests a centralized operation rather than opportunistic attacks.
HiddenLayer identified six additional malicious repositories under the "anthfu" username, all following the same loader pattern but targeting different AI model types. These repositories—including fake versions of DeepSeek, Qwen, and Gemma models—show Silver Fox systematically poisoning popular model architectures across the AI community. The group appears to be casting a wide net, knowing that different organizations prefer different model families for their specific use cases.
The timing of these attacks reveals strategic intelligence gathering priorities. Silver Fox launched this campaign just as Western companies rushed to implement privacy-preserving AI capabilities in response to regulatory pressure. They understood that security teams would be under pressure to quickly adopt tools like Privacy Filter, creating conditions where normal verification procedures might be skipped.
This targeting pattern suggests Chinese threat actors view AI development pipelines as strategic collection priorities, not just targets of opportunity. The stolen data—including Discord tokens, cryptocurrency wallets, browser credentials, and FileZilla configurations—provides long-term access to developer communications, code repositories, and cloud infrastructure where models are trained and deployed.
The sophistication of the evasion techniques, including AMSI bypass, ETW disabling, and anti-VM checks, indicates Silver Fox has invested significant resources in understanding AI developer environments. They know these systems often run with elevated privileges and reduced security monitoring to accommodate resource-intensive model training operations.
Understanding Silver Fox's focus on AI infrastructure helps explain why this attack succeeded so spectacularly. The group recognizes that compromising a single AI developer's workstation provides access to intellectual property that would take years and millions of dollars to recreate—a far higher return than traditional corporate espionage targets.