How is Rabbit Hole different from ChatGPT Deep Research or Perplexity?

Three differences. First, Rabbit Hole uses 10 specialist agents searching in parallel vs one model doing sequential queries -- so it's faster and deeper. Second, a contrarian agent stress-tests every finding before synthesis, catching hidden assumptions and gaps. Third, the output is a downloadable report with embedded diagrams and verified citations, not a chat response. Stanford found Perplexity fabricates 26% of references and ChatGPT 40%. Rabbit Hole verifies every citation before you see it.

What is adversarial verification?

Before you see any report, a contrarian researcher agent reviews all findings. It looks for hidden assumptions, unstated dependencies, what would falsify the thesis, and steel-mans the opposition. Then a separate citation verification hook checks that every factual claim has a real, linked source. This two-layer approach catches the blind spots and hallucinations that single-model research tools miss.

How does pricing work?

Pricing is per month based on how many research reports you need. Free gives you 3 reports to try it out. Basic is $39/month for 15 reports, Plus is $99/month for 40, and Team is $499/month for 100 reports. Every plan includes all 10 specialist agents and adversarial verification. No per-seat fees, no surprises.

What sources does Rabbit Hole search?

10 specialist agents search different source types: arXiv and Semantic Scholar for academic papers, Reddit and Hacker News for community sentiment, X/Twitter and LinkedIn for social signals, SEC EDGAR for financial filings, GitHub and Stack Overflow for technical content, plus news and company sites. Each agent is optimized for its domain -- the academic researcher follows citation graphs differently than the community researcher analyzes Reddit sentiment.

Can I use this for professional work?

That's exactly what it's built for. Consultants use it for competitive landscapes and client deliverables. VCs use it for due diligence. Grad students use it for literature reviews with BibTeX export. The verified citations and confidence ratings mean you can actually cite the output in professional documents -- something you can't safely do with tools that fabricate references.

Why not just use Claude Code or ChatGPT to do this myself?

You could. It would take about 50+ hours. You'd need to set up MCP servers for arXiv, Reddit, SEC EDGAR, Hacker News, and finance APIs. Then build a multi-agent orchestrator with parallel delegation. Then design a contrarian review pipeline. Then wire up citation verification. Then build report formatting with SVG diagram generation. Then tune prompts for each specialist. Then keep it all working as APIs change. Rabbit Hole is that entire stack, already built and tested. At $39/month, it's cheaper than the API tokens you'd burn debugging it.

The OpenClaw Security Wake-Up Call: What AI Agent Users Need to Know About Prompt Injection

On March 14, 2026, China's National Computer Network Emergency Response Technical Team (CNCERT) issued a security warning that should make anyone using AI agents pause: OpenClaw's "inherently weak default security configurations" could allow attackers to seize control of endpoints. The advisory highlighted risks including prompt injection, data exfiltration, malicious skill installations, and irreversible data deletion.

This wasn't theoretical. Just weeks earlier, researchers at PromptArmor had demonstrated how link previews in messaging apps could become data exfiltration pathways when communicating with OpenClaw agents. The attack was elegant in its simplicity: trick the AI into generating an attacker-controlled URL with sensitive data encoded in query parameters, and the messaging app's link preview would automatically transmit that data to the attacker's server—no user click required.

The CNCERT warning and PromptArmor research represent something larger than one platform's security gaps. They mark the moment when prompt injection stopped being a chatbot annoyance and became an operational security problem. Once an agent has permissions, memory, and execution power, a bad instruction is no longer just a bad answer. It can become file access, shell execution, message sending, or data exfiltration. If you're using AI agents for research, content creation, or any knowledge work, you need to understand what's actually happening—and how to protect yourself.

Quick verdict: what the OpenClaw security warning actually means

If you only have 20 seconds, here's the takeaway. The OpenClaw security warning matters because prompt injection becomes materially more dangerous the moment an AI agent can browse, execute, persist, and act with your credentials. The practical response is not to stop using agents. It is to isolate risky workflows, minimize permissions, and verify outputs before trust.

1 clickless leak

PromptArmor showed link previews can exfiltrate data without a user click

5 boundaries

Identity, execution, instruction, persistence, and supply chain determine blast radius

Least privilege

The safest default is to give every agent less access than feels convenient

What Prompt Injection Actually Is (And Why It's Different Now)

Prompt injection isn't new. Security researchers have warned about it since 2022. But the threat has evolved significantly, and the March 2026 advisories capture that evolution perfectly.

At its core, prompt injection exploits a fundamental architectural issue: large language models process instructions and data in the same channel. There's no technical boundary between "do this task" and "here's content to analyze." When an attacker embeds instructions in content the AI will process—what's called indirect prompt injection—the model can't reliably distinguish legitimate data from malicious commands.

OpenAI's March 11, 2026 security guidance captures the shift: early prompt injection attacks were simple ("ignore all previous instructions"), but modern attacks increasingly resemble social engineering. Attackers don't just give commands—they manipulate context, frame requests as policy updates, and exploit the model's tendency to follow seemingly authoritative instructions. In OpenAI's testing, one sophisticated attack worked 50% of the time by embedding instructions in what appeared to be a routine compliance validation workflow.

The real problem isn't that AI agents can be tricked. It's what they can do once tricked.

Why Agents Make Injection Dangerous

The shortest way to understand the risk shift is this: chatbot compromise is usually embarrassing, while agent compromise is operational. The difference is not the model. It is the permissions wrapped around it.

Prompt injection impact: chatbot failure vs agent compromise

Bad answer / summary drift

Chatbot-level

File access or modification

Agent-level

Shell / tool execution

Agent-level

Persistent memory poisoning

Agent-level

Outbound exfiltration to attacker infra

Agent-level

The same prompt-injection primitive gets more dangerous as agents gain tools, memory, and credentials.

An AI agent compromised by prompt injection might:

Exfiltrate your conversation history, documents, or connected account data
Execute shell commands on your system
Make unauthorized API calls to connected services
Modify files, delete data, or change system configurations
Send messages or emails on your behalf
Plant persistent instructions in memory that survive across sessions

Palo Alto Networks' Unit 42 research, published March 3, 2026, documented the first confirmed real-world indirect prompt injection attacks against production AI systems. The attacks weren't proof-of-concepts—they were operational. One case involved 24 simultaneous injection attempts using multiple concealment methods to bypass an AI-based advertisement review system. Another case targeted database destruction. The researchers identified 22 distinct payload construction techniques, from CSS text hiding to Base64 encoding to invisible Unicode characters.

The attack surface is enormous. Any AI agent that browses the web, reads documents, processes emails, or consumes external content is vulnerable. And unlike traditional software vulnerabilities, prompt injection can't be patched with a code update—it exploits the fundamental way LLMs process information.

The Five Boundaries That Matter

If you use AI agents for research or knowledge work, you don't need to become a security engineer. But you should understand the five boundaries that determine your risk exposure:

Identity

Which accounts, tokens, and sessions the agent can touch

Execution

Whether the agent can write files, run code, or call tools

Instruction

Which untrusted pages, docs, and messages can steer behavior

Persistence

What survives across sessions, including memory and saved state

Supply chain

Which plugins, skills, and extensions run with the agent's permissions

1. Identity Boundary: What Can the Agent Access?

AI agents act using credentials—API keys, OAuth tokens, session cookies. When an attacker compromises an agent, they don't need your password. They just need the agent to use its existing permissions. The CNCERT warning specifically noted that OpenClaw's "privileged access to the system to facilitate autonomous task execution capabilities" amplifies the risk.

2. Execution Boundary: What Can the Agent Do?

Does your agent have shell access? Can it write files? Send HTTP requests? Execute code? Each capability becomes a potential exfiltration channel or attack vector. Microsoft's February 2026 guidance on OpenClaw explicitly recommended treating self-hosted agents as "untrusted code execution with persistent credentials" and running them in isolated environments.

3. Instruction Boundary: What Untrusted Content Can Influence the Agent?

If your agent browses websites, reads shared documents, or processes scraped content, the entire web becomes a potential attack surface. Unit 42 documented cases where hidden instructions in webpage footers, email signatures, and document metadata successfully hijacked agent behavior.

4. Persistence Boundary: What Survives Across Sessions?

Some agents have long-term memory—stored preferences, learned patterns, trusted source lists. If an attacker can poison this memory, they create a "sleeper agent" that appears normal until triggered. Research from late 2025 demonstrated how poisoned memory could persist for weeks, with agents defending planted false beliefs as correct when questioned.

5. Supply Chain Boundary: What Code Runs in the Agent?

AI agents increasingly support plugins, skills, and extensions. The CNCERT warning specifically highlighted that "threat actors can upload malicious skills to repositories like ClawHub that, when installed, run arbitrary commands or deploy malware." Supply chain attacks on AI frameworks have already been documented, with state-sponsored actors compromising open-source agent components.

What the Research Actually Shows

Let's get specific about what security researchers have confirmed in 2026:

Witness AI reported in March 2026 that their AI Firewall achieves a 99.3% true-positive rate against prompt injection attacks—but that defense requires network-level inspection of every AI interaction. Pattern-based detection, while effective against known techniques, struggles with novel obfuscation methods.

CrowdStrike's analysis of over 300,000 adversarial prompts identified more than 150 injection techniques. Their telemetry shows indirect prompt injection is now the dominant attack vector, with attackers embedding payloads in documents, emails, and web content rather than attempting direct chatbot manipulation.

Johann Rehberger's research, presented at Black Hat in March 2026, demonstrated persistent memory exploits where attackers could remain embedded in an agent's long-term storage, waiting for triggering keywords before activating data exfiltration.

The consensus across these findings: prompt injection is not a bug that will be fixed. It's a structural feature of how LLMs work, and defense requires architectural controls, not just better training data.

If you already rely on AI agents for serious work, this is also why verification has to be part of the workflow rather than a post-hoc cleanup step. Our guide on how to verify AI research is the operational companion to this security problem: the same habit of checking sources, outputs, and contradictions is what limits damage when an agent has been manipulated.

Practical Defenses for Knowledge Workers

You don't need to stop using AI agents. But you should use them with eyes open. Here's what actually works:

Run Agents with Minimal Privileges

Don't give your research agent access to your email, cloud storage, and production systems simultaneously. The principle of least privilege applies to AI just as it does to human employees. If an agent only needs to read documents, don't give it write access. If it only needs local files, don't give it network access.

Isolate High-Risk Activities

Processing untrusted content—web scraping, analyzing shared documents, reading email attachments—should happen in isolated environments. Don't let an agent that browses the web also have access to sensitive internal documents. Containerization, dedicated VMs, or even separate physical devices for high-risk agent activities significantly reduce blast radius.

Verify Before Trusting

When an agent makes a surprising claim or recommends an unusual action, verify it independently. The "salami slicing" attacks documented by Unit 42 work by gradually shifting an agent's behavior over multiple interactions. Each individual response seems reasonable; the cumulative effect is compromise. Cross-checking agent outputs against primary sources isn't paranoia—it's basic verification. If your workflow already depends on sourced reports, use a repeatable checklist like How to Verify AI Research Output instead of relying on instinct.

Monitor for Anomalous Behavior

AI agents should have predictable behavior patterns. If your research agent suddenly starts making network requests to unknown domains, executing shell commands, or accessing files outside its normal scope, that's a red flag. Behavioral monitoring catches successful injections that pattern-based detection misses.

Be Careful with Plugins and Skills

Every plugin is code that runs with your agent's permissions. The CNCERT warning about malicious skills isn't hypothetical—researchers have already found credential-harvesting packages in AI skill repositories. Only install plugins from verified publishers, and review what permissions they request.

Understand Your Data Exposure

Prompt injection only causes harm if there's something to steal. Before giving an agent access to sensitive documents or connected accounts, ask: what could an attacker exfiltrate if they compromised this agent? Sometimes the right answer is to limit what data the agent can access, not to trust that injection won't happen.

The Bigger Picture

The March 2026 security advisories aren't just about OpenClaw. They're about a fundamental shift in how we need to think about AI systems.

Traditional software security assumes you can patch vulnerabilities and eliminate attack vectors. Prompt injection isn't a vulnerability—it's an emergent property of how large language models process information. You can't patch it away without changing what makes LLMs useful. That same trust problem shows up on the research side too: interfaces can look polished long before they are actually reliable, which is why deep research tools can feel more credible than they are.

This means defense has to shift from prevention to containment. Assume some injection attempts will succeed. Design your workflows so that successful injection has limited impact. Monitor for anomalous behavior. Keep sensitive data isolated from agents that process untrusted content.

The organizations that thrive with AI agents won't be the ones with perfect security—they'll be the ones with resilient architectures. They'll treat AI agents like they treat human employees: with appropriate access controls, oversight mechanisms, and the understanding that any agent can be manipulated or make mistakes.

China's ban on OpenClaw in government agencies and state enterprises, reported by Bloomberg alongside the CNCERT warning, reflects this pragmatic approach: the risk isn't that AI agents are inherently dangerous, but that they require security postures we haven't fully developed yet.

For individual knowledge workers and research teams, the takeaway is simpler but no less important. AI agents are powerful tools that can dramatically accelerate research, analysis, and content creation. But they're not magic—they're software systems with real security constraints. Using them effectively means understanding those constraints and designing your workflows accordingly.

The agents that help you research, write, and analyze are also software that processes external content, executes commands, and has access to your data. Treat them with the same care you'd treat any powerful tool. The March 2026 advisories aren't a reason to abandon AI agents. They're a reminder to use them wisely.

Rabbit Hole is an AI research agent designed for deep investigation and analysis. This post reflects our team's ongoing attention to AI security developments and their implications for knowledge workers.

The OpenClaw Security Wake-Up Call: What AI Agent Users Need to Know About Prompt Injection

Quick verdict: what the OpenClaw security warning actually means

What Prompt Injection Actually Is (And Why It's Different Now)

Why Agents Make Injection Dangerous

The Five Boundaries That Matter

What the Research Actually Shows

Practical Defenses for Knowledge Workers

The Bigger Picture

Related Articles

The 2026 Buyer's Guide to AI-Powered Research Assistants

ChatGPT Deep Research vs Perplexity vs Rabbit Hole: Which One Cites Sources That Actually Exist?

AI Patent Search: From IPC Code to Cited Report in 5 Minutes

Ready to try honest research?