Puppeteer
Google's headless Chrome automation library commonly used for scraping.
What is Puppeteer?
Puppeteer is a Node.js library developed by Google that provides a high-level API to control Chrome or Chromium. It's widely used for web scraping, automated testing, PDF generation, and screenshot capture. Puppeteer is one of the most popular browser automation tools.
Puppeteer-controlled browsers typically identify themselves with "HeadlessChrome" in the user-agent string, though sophisticated users can override this. Detection also relies on the navigator.webdriver flag and other browser environment signals.
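The two signals named above can be checked with very little code. The following is a minimal sketch, not Switch's actual detection logic: `NavigatorLike`, `collectAutomationSignals`, and `looksAutomated` are hypothetical names introduced here for illustration, and the sample user-agent string is only an example of the usual headless Chrome format.

```typescript
// Minimal subset of the browser's Navigator object that this sketch inspects.
interface NavigatorLike {
  userAgent: string;
  webdriver?: boolean;
}

interface AutomationSignals {
  headlessUserAgent: boolean; // "HeadlessChrome" token present in the UA string
  webdriverFlag: boolean;     // navigator.webdriver reported as true
}

// Hypothetical helper: gathers the two signals described in the text.
function collectAutomationSignals(nav: NavigatorLike): AutomationSignals {
  return {
    headlessUserAgent: nav.userAgent.includes("HeadlessChrome"),
    webdriverFlag: nav.webdriver === true,
  };
}

// Either signal alone is treated as suspicious in this sketch.
// A client that overrides both values will pass this check.
function looksAutomated(signals: AutomationSignals): boolean {
  return signals.headlessUserAgent || signals.webdriverFlag;
}

// Illustrative input only; in a page script you would pass `navigator` instead.
const exampleSignals = collectAutomationSignals({
  userAgent:
    "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) HeadlessChrome/120.0.0.0 Safari/537.36",
  webdriver: true,
});
console.log(looksAutomated(exampleSignals));
```

Because both values can be overridden by the client, a check like this only catches unmodified Puppeteer setups; it is a first filter rather than a complete detector.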
While Puppeteer has many legitimate uses (testing, monitoring, PDF generation), it's also the tool of choice for large-scale web scraping operations. Switch detects Puppeteer through multiple signals beyond just the user-agent string.
User-Agent Strings
These are the known user-agent patterns used by Puppeteer. Use them to identify this crawler in your server logs or configure robots.txt rules.
robots.txt example:
User-agent: HeadlessChrome
Disallow: /private/
Allow: /
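To find this traffic in server logs, a short script can count requests whose user-agent contains the HeadlessChrome token. The sketch below assumes access logs in the common "combined" format, where the client IP is the first field and the user-agent is the last quoted field. `countHeadlessChromeHits` is a hypothetical helper name, and the sample log lines are made up for illustration.

```typescript
// Counts requests per client IP whose user-agent contains "HeadlessChrome".
// Assumes combined log format: IP first, user-agent as the last quoted field.
function countHeadlessChromeHits(logLines: string[]): Map<string, number> {
  const hits = new Map<string, number>();
  for (const line of logLines) {
    // Collect every double-quoted field; the last one is the user-agent.
    const quoted = line.match(/"([^"]*)"/g);
    if (!quoted || quoted.length === 0) continue;
    const userAgent = quoted[quoted.length - 1];
    if (!userAgent.includes("HeadlessChrome")) continue;

    // The client IP is the first whitespace-delimited token on the line.
    const ip = line.split(/\s+/, 1)[0];
    hits.set(ip, (hits.get(ip) ?? 0) + 1);
  }
  return hits;
}

// Made-up sample lines using documentation-reserved IP addresses.
const sampleLog = [
  '203.0.113.7 - - [07/Mar/2026:10:00:00 +0000] "GET /private/a HTTP/1.1" 200 512 "-" "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) HeadlessChrome/120.0.0.0 Safari/537.36"',
  '198.51.100.4 - - [07/Mar/2026:10:00:01 +0000] "GET / HTTP/1.1" 200 1024 "-" "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36"',
];
console.log(countHeadlessChromeHits(sampleLog));
```

This only surfaces clients that keep the default headless user-agent; sessions that override it will not appear in the count.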
Detection on the Switch Network
Sessions detected: 164
Total events: 185
Avg confidence: 95%
First detected: 2/10/2026 · Last seen: 3/7/2026
How to Manage Puppeteer
Detection confidence is high when the HeadlessChrome user-agent token and the navigator.webdriver flag are present.
Use Switch Content Gate for aggressive Puppeteer-based scraping.
Some Puppeteer traffic is legitimate (monitoring, testing), so consider allowing known IPs.
Deploy behavioral analysis to catch stealth-mode Puppeteer.
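The recommendations above can be combined into a single decision rule. The following is a rough sketch under stated assumptions, not the Switch Content Gate API: the types `TrafficSample` and `GatePolicy`, the function `decideAction`, the action names, and the request-rate threshold are all hypothetical and exist only to show how the pieces fit together.

```typescript
// Hypothetical outcome labels for this sketch.
type GateAction = "allow" | "gate" | "analyze";

interface TrafficSample {
  ip: string;
  userAgent: string;
  webdriverFlag: boolean;    // navigator.webdriver as reported by the client
  requestsPerMinute: number; // observed request rate for this client
}

interface GatePolicy {
  allowedIps: Set<string>;             // known monitoring or testing hosts
  aggressiveRequestsPerMinute: number; // assumed threshold for "aggressive"
}

function decideAction(sample: TrafficSample, policy: GatePolicy): GateAction {
  // 1. Known legitimate sources (monitoring, testing) are always allowed.
  if (policy.allowedIps.has(sample.ip)) return "allow";

  // 2. Obvious Puppeteer signals combined with a high request rate are gated.
  const puppeteerSignals =
    sample.userAgent.includes("HeadlessChrome") || sample.webdriverFlag;
  if (
    puppeteerSignals &&
    sample.requestsPerMinute >= policy.aggressiveRequestsPerMinute
  ) {
    return "gate";
  }

  // 3. Everything else goes to behavioral analysis, since stealth-mode
  //    Puppeteer can hide both the user-agent token and the webdriver flag.
  return "analyze";
}

// Illustrative policy using a documentation-reserved IP address.
const examplePolicy: GatePolicy = {
  allowedIps: new Set(["203.0.113.10"]),
  aggressiveRequestsPerMinute: 60,
};
console.log(
  decideAction(
    { ip: "198.51.100.9", userAgent: "HeadlessChrome/120.0.0.0", webdriverFlag: true, requestsPerMinute: 300 },
    examplePolicy,
  ),
);
```

Checking the allowlist first keeps known monitoring and testing hosts working even when they send the default headless user-agent. The threshold of 60 requests per minute is a placeholder to tune against your own traffic.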
Start managing Puppeteer today
Switch detects, tracks, and lets you build custom journeys for Puppeteer and 35+ other AI agents and crawlers. Set up in five minutes.
Related Agents
AdsBot-Google
Monitoring: Google's ad quality crawler checking landing pages for Google Ads compliance.
Gemini
AI Assistants: Google's AI assistant crawlers including Safety and Cloud Vertex bots.
Gemini-Deep-Research
AI Assistants: Google Gemini's Deep Research agent that performs comprehensive multi-page research.
Google-Extended
Commercial Crawlers: Google's AI training token controlling use of Googlebot-crawled content for AI.
Googlebot
Search Engines: Google's primary web crawler powering the world's largest search engine.
Claude Computer Use
Browser Agents (Anthropic): Anthropic's Claude agent that controls a real browser to complete tasks.