Stealth Scraping Techniques
Bypass anti-bot detection systems with advanced browser fingerprinting, IP rotation, user-agent spoofing, and CAPTCHA handling strategies.
1. Using stealth_mode Tool
The stealth_mode tool automatically applies anti-detection techniques including fingerprint randomization, WebRTC spoofing, and canvas noise.
Basic (3 credits)
User-agent rotation, basic header spoofing
Use for: Low-protection sites, simple scrapers
Medium (3 credits)
Basic + fingerprint randomization, WebRTC leak protection
Use for: Most commercial sites, moderate protection
Advanced (3 credits)
Medium + canvas noise, WebGL spoofing, timezone randomization
Use for: High-protection sites, Cloudflare, Akamai
2. Browser Fingerprinting
Anti-bot systems use browser fingerprinting to detect automated browsers. Randomize fingerprints to avoid detection.
User-Agent
Browser version, OS, device type
Canvas Fingerprint
Unique rendering signature
WebGL
Graphics card vendor/renderer
WebRTC
Local IP address leaks
Screen Resolution
Display dimensions
Timezone & Locale
Geographic location indicators
Fonts
Installed font list
Plugins
Browser extensions detected
stealth_mode tool automatically randomizes all these fingerprint signals, making each request appear to come from a unique real browser.3. IP Rotation & Proxies
Use rotating proxies to distribute requests across different IP addresses.
Datacenter Proxies
✅ Fast (50-150ms latency)
✅ Cheap ($1-5/GB)
❌ Easily detected
❌ Higher ban rate
Best for: Low-protection sites, high-volume scraping
Residential Proxies (Recommended)
✅ Real user IPs (hard to detect)
✅ Low ban rate
⚠️ Slower (200-500ms latency)
⚠️ Expensive ($5-15/GB)
Best for: High-protection sites, e-commerce, social media
Mobile Proxies
✅ Highest success rate (4G/5G IPs)
✅ Nearly undetectable
❌ Very expensive ($50-100/GB)
❌ Slowest (300-1000ms latency)
Best for: Maximum stealth, premium targets
4. CAPTCHA Handling
Strategies for dealing with CAPTCHA challenges.
Avoid Triggering CAPTCHAs
Use stealth mode, rotate IPs, respect rate limits, add random delays (2-5 seconds between requests)
✅ Best strategy - prevention is easier than solving
Use CAPTCHA Solving Services
Integrate with 2Captcha, Anti-Captcha, or DeathByCaptcha ($1-3 per 1,000 CAPTCHAs)
⚠️ Adds cost and latency (10-30 seconds)
Find Alternative Data Sources
Look for APIs, RSS feeds, sitemaps, or partner sites without CAPTCHA
✅ Most reliable long-term solution
Manual Intervention
Queue CAPTCHA challenges for human operators to solve
❌ Only viable for low-volume scraping
Best Practices Summary
Always start with stealth_mode level "medium"
Use residential proxies for high-protection sites
Rotate proxies every 10-20 requests
Add random delays between requests (2-5 seconds)
Match geolocation with proxy location (use localization tool)
Respect robots.txt and rate limits
Monitor ban rates and adjust strategy accordingly