Reverse CAPTCHA: Evaluating LLM Susceptibility to Invisible Unicode Instruction Injection

LLMs like GPT-5.2 and Claude Opus/Sonnet/Haiku require immediate input sanitization to prevent Unicode injection attacks. ProxMox, Docker, Linux, Nginx users should update their security protocols accordingly.

A study tested five LLMs against invisible Unicode instruction injection through zero-width characters and tags hidden within trivia questions. The research found that models with code execution capabilities can decode these instructions, highlighting a new security vulnerability in LLM APIs. This raises concerns about the integrity of responses from AI systems when exposed to such encoded commands. Sysadmins and developers need to be aware of this potential threat vector to protect their applications.

This matters for sysadmins managing proxmox/docker/linux/nginx environments as it opens a new vector of attack through invisible instructions that could manipulate or mislead automated systems. Homelab operators must also be vigilant to prevent unauthorized access via these subtle methods.

{'point': 'Models with code execution are particularly vulnerable', 'explanation': 'LLMs capable of executing code can write scripts to decode hidden Unicode instructions, turning a seemingly benign input into an executable command.'}
{'point': 'Different models have varying susceptibilities', 'explanation': 'OpenAI and Anthropic models react differently to specific encoding schemes, necessitating that attackers tailor their methods based on the targeted model.'}
{'point': 'Standard Unicode normalization is ineffective against this attack', 'explanation': 'Normalization formats like NFC or NFKC do not remove zero-width characters used in these attacks, meaning standard security measures may fail to prevent exploitation.'}
{'point': 'Simple detection hints can mitigate the risk', 'explanation': 'Adding a line of code to check for hidden Unicode can trigger the decoding process and prevent unintended instruction execution.'}
{'point': 'Input sanitization is critical for defense', 'explanation': 'Removing specific ranges of characters at the input boundary, such as U+200B-200F, U+2060-2064, and U+E0000-E007F, can effectively block these attacks.'}

Stack Impact

ProxMox version N/A; Docker version N/A; Linux kernel versions generally affected if running services with LLM integration; Nginx version N/A. Homelab users must ensure their environments are updated and properly configured to avoid exploitation.

Action Items

{'command': 'Add a script or middleware layer to strip zero-width characters from incoming text data before processing by the LLM API.'}
{'configuration_change': 'Update security policies to include checks for hidden Unicode instructions, especially in environments interfacing with external text inputs.'}

Source →