Content Migration
Extract and restructure content from legacy sites for migration to modern platforms.
The Problem
Migrating content from legacy CMS platforms means extracting thousands of pages while preserving structure, metadata, and formatting. Manual copy-paste is error-prone and painfully slow.
The Solution
CrawlForge's crawl_deep traverses entire sites by following internal links, while extract_text pulls clean content from each discovered page. Migrate hundreds of pages in a single automated run.
Code Example
// Crawl legacy site and extract all content for migration
const crawl = await mcp.crawl_deep({
  url: "https://legacy-site.com/blog",
  max_depth: 3,
  follow_links: true,
  include_patterns: ["/blog/*"],
});

// Extract clean text from each discovered page
const pages = await Promise.all(
  crawl.urls.map(url =>
    mcp.extract_text({ url, preserve_structure: true })
  )
);

console.log(`Migrated ${pages.length} pages`);
Tools Used
crawl_deep: 5 credits
extract_text: 1 credit
Estimated cost: ~6 credits per page
Ready to Get Started?
Every new account gets 1,000 free credits. No credit card required.
Related Use Cases
SEO Analysis & Auditing
Audit your site and competitors for metadata, broken links, content gaps, and ranking opportunities.
extract_metadata (2 cr), map_site (3 cr), analyze_content (3 cr)
AI Training Data Collection
Collect and structure large-scale web datasets for fine-tuning and training AI models.
batch_scrape (5 cr), extract_content (2 cr)