Advanced Guide · 18 min read

Stealth Scraping Techniques

Bypass anti-bot detection systems with browser fingerprint randomization, IP rotation, user-agent spoofing, and CAPTCHA handling strategies.

In This Guide
1. Using stealth_mode Tool
2. Browser Fingerprinting
3. IP Rotation & Proxies
4. CAPTCHA Handling

Legal Notice: Always respect robots.txt and website terms of service. Use stealth techniques responsibly and only for legitimate purposes. Violating terms of service or scraping protected content may have legal consequences.

1. Using stealth_mode Tool

The stealth_mode tool automatically applies anti-detection techniques, including fingerprint randomization, WebRTC leak protection, and canvas noise. Which techniques are applied depends on the stealth level you select.

Stealth Levels Explained

Basic (3 credits)

User-agent rotation, basic header spoofing

Use for: Low-protection sites, simple scrapers

Medium (3 credits)

Basic + fingerprint randomization, WebRTC leak protection

Use for: Most commercial sites, moderate protection

Advanced (3 credits)

Medium + canvas noise, WebGL spoofing, timezone randomization

Use for: High-protection sites, Cloudflare, Akamai
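
The three levels are cumulative, which can be easier to see as a lookup table. The sketch below only restates the lists above; the helper names (`techniquesFor`, `levelFor`) and the `Protection` type are illustrative assumptions, not part of the CrawlForge API.

```typescript
// Stealth levels as described in this guide.
type StealthLevel = "basic" | "medium" | "advanced";

const LEVEL_ORDER: StealthLevel[] = ["basic", "medium", "advanced"];

// Techniques each level adds on top of the level below it.
const ADDED_TECHNIQUES: Record<StealthLevel, string[]> = {
  basic: ["user-agent rotation", "basic header spoofing"],
  medium: ["fingerprint randomization", "WebRTC leak protection"],
  advanced: ["canvas noise", "WebGL spoofing", "timezone randomization"],
};

// Returns the cumulative technique list for a level.
function techniquesFor(level: StealthLevel): string[] {
  const result: string[] = [];
  for (const current of LEVEL_ORDER) {
    result.push(...ADDED_TECHNIQUES[current]);
    if (current === level) break;
  }
  return result;
}

// Hypothetical helper: maps a rough protection estimate to a starting level.
type Protection = "low" | "moderate" | "high";

function levelFor(protection: Protection): StealthLevel {
  switch (protection) {
    case "low":
      return "basic";
    case "moderate":
      return "medium";
    case "high":
      return "advanced";
  }
}
```

For example, `techniquesFor("medium")` returns the two basic techniques followed by the two medium ones.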

Basic Stealth Scraping
3 credits
Bash
Advanced: Stealth + Proxy + Custom Headers
Typescript
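
A minimal sketch of how the two requests named above (basic stealth, then stealth with a proxy and custom headers) might be assembled. Every field name here (`url`, `level`, `proxy`, `headers`) and the `buildStealthRequest` helper are assumptions made for illustration; consult the stealth_mode API reference for the actual request schema and transport.

```typescript
// Hypothetical request shape; the real stealth_mode schema may differ.
interface StealthRequest {
  url: string;
  level: "basic" | "medium" | "advanced";
  proxy?: string;
  headers?: Record<string, string>;
}

interface StealthOptions {
  proxy?: string;
  headers?: Record<string, string>;
}

// Builds a request payload and rejects non-HTTP(S) targets early.
function buildStealthRequest(
  url: string,
  level: StealthRequest["level"] = "medium",
  options: StealthOptions = {}
): StealthRequest {
  if (!/^https?:\/\//i.test(url)) {
    throw new Error("url must start with http:// or https://");
  }
  const request: StealthRequest = { url, level };
  if (options.proxy) {
    request.proxy = options.proxy;
  }
  if (options.headers && Object.keys(options.headers).length > 0) {
    // Copy so later changes to the caller's object do not leak in.
    request.headers = { ...options.headers };
  }
  return request;
}

// Basic stealth scraping: level only.
const basicRequest = buildStealthRequest("https://example.com/products", "basic");

// Advanced: stealth + proxy + custom headers (placeholder proxy URL).
const advancedRequest = buildStealthRequest("https://example.com/products", "advanced", {
  proxy: "http://user:pass@proxy.example.net:8080",
  headers: { "Accept-Language": "en-US,en;q=0.9" },
});
```

The resulting objects would then be passed to whatever client you use to call the tool; that call is deliberately left out because its signature is not shown in this guide.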

2. Browser Fingerprinting

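Anti-bot systems combine the browser signals below into a fingerprint and use it to tell automated clients from real users. Randomizing those signals makes repeat requests harder to link, provided the values stay consistent with one another: a Windows user-agent paired with a macOS platform string is itself a red flag.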

What Anti-Bot Systems Track

User-Agent

Browser version, OS, device type

Canvas Fingerprint

Unique rendering signature

WebGL

Graphics card vendor/renderer

WebRTC

Local IP address leaks

Screen Resolution

Display dimensions

Timezone & Locale

Geographic location indicators

Fonts

Installed font list

Plugins

Installed plugins and detectable browser extensions

How CrawlForge Helps: The stealth_mode tool automatically randomizes all these fingerprint signals, making each request appear to come from a unique real browser.
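
To make the idea of randomization concrete, here is a rough sketch of generating a fingerprint profile whose signals agree with each other. It is not how stealth_mode is implemented; the `FingerprintProfile` type, the template lists, and `randomProfile` are all illustrative assumptions, and the sample values are placeholders.

```typescript
interface FingerprintProfile {
  userAgent: string;
  platform: string;
  timezone: string;
  locale: string;
  screen: { width: number; height: number };
}

// Each template keeps OS-dependent signals together so they never contradict.
const OS_TEMPLATES = [
  {
    userAgent:
      "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36",
    platform: "Win32",
    screens: [
      { width: 1920, height: 1080 },
      { width: 1366, height: 768 },
    ],
  },
  {
    userAgent:
      "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36",
    platform: "MacIntel",
    screens: [
      { width: 1440, height: 900 },
      { width: 2560, height: 1440 },
    ],
  },
];

// Timezone and locale are paired so they point at the same region.
const REGIONS = [
  { timezone: "America/New_York", locale: "en-US" },
  { timezone: "Europe/Berlin", locale: "de-DE" },
];

// Picks one element; rng must return a value in [0, 1) like Math.random.
function pick<T>(items: T[], rng: () => number): T {
  const index = Math.min(items.length - 1, Math.floor(rng() * items.length));
  return items[index];
}

// Builds a random but internally consistent profile.
function randomProfile(rng: () => number = Math.random): FingerprintProfile {
  const os = pick(OS_TEMPLATES, rng);
  const region = pick(REGIONS, rng);
  return {
    userAgent: os.userAgent,
    platform: os.platform,
    timezone: region.timezone,
    locale: region.locale,
    screen: pick(os.screens, rng),
  };
}
```

Passing a custom `rng` makes the output reproducible in tests; in practice the region would normally be chosen to match the proxy's location rather than at random.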

3. IP Rotation & Proxies

Use rotating proxies to distribute requests across different IP addresses.

Proxy Types Comparison

Datacenter Proxies

✅ Fast (50-150ms latency)

✅ Cheap ($1-5/GB)

❌ Easily detected

❌ Higher ban rate

Best for: Low-protection sites, high-volume scraping

Residential Proxies (Recommended)

✅ Real user IPs (hard to detect)

✅ Low ban rate

⚠️ Slower (200-500ms latency)

⚠️ Expensive ($5-15/GB)

Best for: High-protection sites, e-commerce, social media

Mobile Proxies

✅ Highest success rate (4G/5G IPs)

✅ Nearly undetectable

❌ Very expensive ($50-100/GB)

❌ Slowest (300-1000ms latency)

Best for: Maximum stealth, premium targets

IP Rotation Strategy
Typescript
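
One possible rotation strategy, sketched under assumptions: switch to the next proxy after a fixed number of requests (the checklist later in this guide suggests every 10-20) and skip any proxy that has been marked as banned. The `ProxyRotator` class and its method names are hypothetical and independent of any CrawlForge API.

```typescript
class ProxyRotator {
  private proxies: string[];
  private requestsPerProxy: number;
  private index = 0; // position of the proxy currently in use
  private used = 0; // requests already sent through the current proxy
  private banned = new Set<string>();

  constructor(proxies: string[], requestsPerProxy = 15) {
    if (proxies.length === 0) {
      throw new Error("at least one proxy is required");
    }
    if (requestsPerProxy < 1) {
      throw new Error("requestsPerProxy must be at least 1");
    }
    this.proxies = proxies.slice();
    this.requestsPerProxy = requestsPerProxy;
  }

  // Returns the proxy to use for the next request.
  next(): string {
    const current = this.proxies[this.index];
    if (this.used >= this.requestsPerProxy || this.banned.has(current)) {
      this.advance();
    }
    this.used++;
    return this.proxies[this.index];
  }

  // Call when a proxy starts returning blocks or CAPTCHAs.
  markBanned(proxy: string): void {
    this.banned.add(proxy);
  }

  // Moves to the next proxy that is not banned, wrapping around the list.
  private advance(): void {
    for (let step = 1; step <= this.proxies.length; step++) {
      const candidate = (this.index + step) % this.proxies.length;
      if (!this.banned.has(this.proxies[candidate])) {
        this.index = candidate;
        this.used = 0;
        return;
      }
    }
    throw new Error("all proxies are banned");
  }
}
```

With three proxies and `requestsPerProxy` set to 2, successive calls to `next()` yield the first proxy twice, then the second twice, and so on. Many commercial proxy providers also offer a single rotating gateway endpoint, in which case client-side rotation like this may be unnecessary.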

4. CAPTCHA Handling

Strategies for dealing with CAPTCHA challenges.

CAPTCHA Bypass Strategies
Strategy 1: Avoid Triggering CAPTCHAs

Use stealth mode, rotate IPs, respect rate limits, add random delays (2-5 seconds between requests)

✅ Best strategy - prevention is easier than solving

Strategy 2: Use CAPTCHA Solving Services

Integrate with 2Captcha, Anti-Captcha, or DeathByCaptcha ($1-3 per 1,000 CAPTCHAs)

⚠️ Adds cost and latency (10-30 seconds)

Strategy 3: Find Alternative Data Sources

Look for APIs, RSS feeds, sitemaps, or partner sites without CAPTCHA

✅ Most reliable long-term solution

Strategy 4: Manual Intervention

Queue CAPTCHA challenges for human operators to solve

❌ Only viable for low-volume scraping

Recommendation: If you're consistently hitting CAPTCHAs, you're scraping too aggressively. Slow down, rotate IPs more frequently, and use higher stealth levels before resorting to CAPTCHA solving services.
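
The recommendation above implies an order of escalation, which could be sketched roughly as follows. The random delay uses the 2-5 second range from this guide; the `ScrapeState` shape, the `escalate` helper, and the specific caps (a 20-second delay ceiling, a floor of 5 requests per proxy) are assumptions chosen only for illustration.

```typescript
// Uniform random delay in milliseconds; defaults to the guide's 2-5 second range.
function randomDelayMs(
  minMs = 2000,
  maxMs = 5000,
  rng: () => number = Math.random
): number {
  return Math.floor(minMs + rng() * (maxMs - minMs));
}

type Level = "basic" | "medium" | "advanced";

interface ScrapeState {
  delayRangeMs: [number, number];
  requestsPerProxy: number;
  level: Level;
  useSolver: boolean;
}

const MAX_DELAY_MS = 20000; // assumed ceiling for the upper delay bound
const MIN_REQUESTS_PER_PROXY = 5; // assumed floor for rotation frequency

// Applies one escalation step each time CAPTCHAs keep appearing:
// slow down, then rotate more often, then raise the level, then use a solver.
function escalate(state: ScrapeState): ScrapeState {
  const [min, max] = state.delayRangeMs;
  if (max < MAX_DELAY_MS) {
    return { ...state, delayRangeMs: [min * 2, max * 2] };
  }
  if (state.requestsPerProxy > MIN_REQUESTS_PER_PROXY) {
    const halved = Math.floor(state.requestsPerProxy / 2);
    return { ...state, requestsPerProxy: Math.max(MIN_REQUESTS_PER_PROXY, halved) };
  }
  if (state.level !== "advanced") {
    return { ...state, level: state.level === "basic" ? "medium" : "advanced" };
  }
  return { ...state, useSolver: true };
}
```

Each call returns a new state rather than mutating the old one, so the caller can log exactly which adjustment was made before a solving service is ever enabled.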

Best Practices Summary

Stealth Scraping Checklist

Always start with stealth_mode level "medium"

Use residential proxies for high-protection sites

Rotate proxies every 10-20 requests

Add random delays between requests (2-5 seconds)

Match geolocation with proxy location (use localization tool)

Respect robots.txt and rate limits

Monitor ban rates and adjust strategy accordingly
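
For the last checklist item, a sliding-window ban-rate monitor is one simple option. The sketch below treats HTTP 403 and 429 responses as ban signals, which is an assumption: some sites block with a 200 response containing a challenge page, so the check would need adapting. `BanRateMonitor` and the 10% default threshold are illustrative, not CrawlForge features.

```typescript
class BanRateMonitor {
  private outcomes: boolean[] = []; // true = request looked blocked
  private windowSize: number;

  constructor(windowSize = 100) {
    if (windowSize < 1) {
      throw new Error("windowSize must be at least 1");
    }
    this.windowSize = windowSize;
  }

  // Records one response; 403 and 429 are counted as blocks (assumption).
  record(status: number): void {
    this.outcomes.push(status === 403 || status === 429);
    if (this.outcomes.length > this.windowSize) {
      this.outcomes.shift(); // drop the oldest outcome
    }
  }

  // Fraction of blocked requests within the current window.
  banRate(): number {
    if (this.outcomes.length === 0) {
      return 0;
    }
    const blocked = this.outcomes.filter((wasBlocked) => wasBlocked).length;
    return blocked / this.outcomes.length;
  }

  // True when the recent ban rate exceeds the given threshold.
  shouldBackOff(threshold = 0.1): boolean {
    return this.banRate() > threshold;
  }
}
```

A scraper loop might call `record` after every response and, when `shouldBackOff()` returns true, lengthen delays or switch proxy type before continuing.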
