CrawlForge
HomeUse CasesIntegrationsPricingDocumentationBlog
CrawlForge v4.2.2: New CLI + 3 Tools for Local AI Scraping
Product Updates
Back to Blog
Product Updates

CrawlForge v4.2.2: New CLI + 3 Tools for Local AI Scraping

C
CrawlForge Team
Engineering Team
May 18, 2026
6 min read

On this page

Quick Answer

CrawlForge v4.2.2 ships a standalone CLI package and three new tools: extract_with_llm for local AI extraction via Ollama, list_ollama_models for free model discovery, and scrape_template for one-line scraping of 10 popular sites. The CLI runs anywhere npm runs and needs no MCP client. Install with `npm install -g @crawlforge/cli`.

Today we are shipping CrawlForge v4.2.2, our biggest release since launch. It brings three new tools, a standalone command-line interface, and a quiet shift in how we think about web scraping for AI: most of it should run locally, on your own machine, without API keys.

This post is the umbrella for everything in 4.2.2. Three deep-dive guides follow in the next nine days.

Table of Contents

  • What Shipped
  • The New CrawlForge CLI
  • Extract With LLM: Local AI Extraction
  • Scrape Template: Ten Sites, One Call
  • list_ollama_models: Free Model Discovery
  • Old Workflow vs v4.2.2 Workflow
  • Credit Costs
  • How to Upgrade
  • What Is Next

What Shipped

v4.2.2 adds four things:

  1. @crawlforge/cli -- a standalone command-line tool exposing all 23 CrawlForge tools to your shell. No MCP client required.
  2. extract_with_llm -- LLM-powered structured extraction that defaults to local Ollama. No external API key needed.
  3. scrape_template -- pre-built scrapers for Amazon, LinkedIn, GitHub, YouTube, Reddit, Hacker News, Stack Overflow, npm, Product Hunt, and Twitter/X.
  4. list_ollama_models -- a free discovery tool that lists models on your local Ollama instance.

Tool count goes from 20 to 23. The CLI is brand new -- it is not a tool, it is a delivery channel.

The New CrawlForge CLI

The CLI is the shortest path from intent to scraped data. You install it once, set an environment variable, and every CrawlForge tool becomes a command:

Bash

Why does this matter? Because MCP is great for AI agents, but a lot of scraping work is not an AI agent task. It is a cron job. A CI step. A one-off pull from your terminal. For that, you want JSON on stdout that pipes into jq, not a JSON-RPC handshake.

Read the complete CrawlForge CLI guide for the full command reference and real-world workflows.

Extract With LLM: Local AI Extraction

extract_with_llm is structured extraction powered by a language model. You hand it a URL and a schema, it gives you back JSON. The new part is that it defaults to local Ollama rather than calling OpenAI or Anthropic.

Typescript

Three things follow from the local-first default:

  • No third-party API costs. The LLM is free. You only pay 3 CrawlForge credits per extraction.
  • No data leaving your machine. Scraped content stays on localhost.
  • No new API key to manage. If Ollama is installed, you are done.

OpenAI and Anthropic are still supported via provider: "openai" or provider: "anthropic" for cases where you want a frontier model. Detailed guide: extract data with local LLMs.

Scrape Template: Ten Sites, One Call

scrape_template is for the long tail of scraping requests that all look the same: "get me product data from Amazon", "get me a GitHub repo's metadata", "get me the top posts on Hacker News today". You should not need to write CSS selectors for these. We did it once, we maintain it, you call it.

Bash

Ten templates ship in this release:

TemplateWhat it returnsCredits
amazonProduct title, price, rating, reviews, images1
linkedinProfile name, headline, experience, skills1
githubRepo metadata, stars, languages, README1
youtubeVideo title, views, channel, transcript1
redditPost title, score, comments, top replies1
hackernewsStory title, points, URL, comments1
stackoverflowQuestion, answers, accepted, vote counts1
npmPackage metadata, weekly downloads, versions1
producthuntProduct name, tagline, upvotes, makers1
tweetTweet text, author, engagement, replies1

Full walkthrough with code: scrape Amazon, LinkedIn, and GitHub with one tool.

list_ollama_models: Free Model Discovery

Most useful as a sanity-check before running extract_with_llm. Lists every model on your local Ollama instance with name, size, and modified date.

Bash

Costs zero credits. It does no scraping, no LLM call -- it just queries Ollama's local API on 127.0.0.1:11434 and returns the result. If you have ever wondered which model you actually have installed, this is the answer.

Old Workflow vs v4.2.2 Workflow

TaskPre-4.2.2v4.2.2
Scrape from your terminalcurl + custom parser, or boot a Node REPLcrawlforge scrape <url>
Extract structured data with an LLMextract_structured (CSS selectors) or roll your own with Puppeteer + OpenAIextract_with_llm (Ollama default)
Scrape Amazon, LinkedIn, GitHubscrape_structured with hand-maintained selectorsscrape_template (we maintain selectors)
Run scraping in CI/croncurl with API key in headerscrawlforge <cmd> with env var

Credit Costs

The three new tools follow our existing credit-cost model. No surprises:

ToolCreditsWhy
list_ollama_models0Free discovery helper
scrape_template1Single page, pre-built schema
extract_with_llm3LLM inference (provider-agnostic)

The CLI itself is free. It uses your existing API key and bills against your normal credit balance.

How to Upgrade

Existing users do not need to do anything. The new tools are live on all plans -- Free, Hobby, Professional, and Business -- and show up automatically in your MCP client.

If you want the CLI:

Bash

If you want to try Ollama-powered extraction:

Bash

What Is Next

We are working on three things for 4.3:

  • More templates -- Etsy, eBay, TikTok, Instagram, Google Maps. Send us requests on Discord.
  • Webhook delivery for batch_scrape -- get results pushed to your endpoint when long-running jobs complete.
  • CLI watch mode -- crawlforge track --watch for live diffs on monitored pages.

Ready to try the new tools? Start free with 1,000 credits -- no credit card required. Or jump straight into the deep dives: the CLI guide, local LLM extraction, and the ten scrape templates.

Tags

releasev4.2.2CLIOllamaannouncementMCP

About the Author

C

CrawlForge Team

Engineering Team

Building the most comprehensive web scraping MCP server. We create tools that help developers extract, analyze, and transform web data for AI applications.

On this page

Frequently Asked Questions

What is new in CrawlForge v4.2.2?+

v4.2.2 adds three tools (extract_with_llm, list_ollama_models, scrape_template) and a standalone CLI package (@crawlforge/cli). Tool count goes from 20 to 23. The CLI is brand new and runs without an MCP client, so you can use CrawlForge from any terminal, cron job, or CI pipeline.

Do I need the CrawlForge CLI if I already use MCP?+

No. MCP is still the best fit for AI agents (Claude, Cursor, Windsurf, etc.). The CLI exists for the other half of scraping work -- cron jobs, CI steps, one-off pulls from your terminal -- where an MCP handshake is overkill and you just want JSON on stdout that pipes into jq.

Does extract_with_llm require an OpenAI or Anthropic API key?+

No. It defaults to local Ollama, which runs entirely on your machine with no external API key required. OpenAI and Anthropic are supported as optional providers if you want frontier models, but the default path is fully local.

Which sites does scrape_template support?+

Ten sites in v4.2.2: Amazon, LinkedIn, GitHub, YouTube, Reddit, Hacker News, Stack Overflow, npm, Product Hunt, and Twitter/X. Each costs 1 credit per scrape and returns site-specific structured JSON without needing CSS selectors.

How do I upgrade to v4.2.2?+

Existing users do not need to do anything -- the new tools are live on all plans and show up automatically in your MCP client. To install the new CLI, run "npm install -g @crawlforge/cli" and set the CRAWLFORGE_API_KEY environment variable.

Related Articles

CrawlForge MCP Is Now Live: Free Web Scraping for AI Agents
Product Updates

CrawlForge MCP Is Now Live: Free Web Scraping for AI Agents

CrawlForge MCP launches today with 20 web scraping tools, MCP integration for Claude and Cursor, and a free tier with 1,000 credits. Build agents faster.

C
CrawlForge Team
|
Mar 31
|
6m
18 Web Scraping Tools in One MCP Server: The Complete CrawlForge Guide
Product Updates

18 Web Scraping Tools in One MCP Server: The Complete CrawlForge Guide

Discover all 20 web scraping tools in CrawlForge MCP - from basic URL fetching to AI-powered research. A complete reference for AI agent developers.

C
CrawlForge Team
|
Jan 7
|
10m
Welcome to CrawlForge: Enterprise Web Scraping for AI
Product Updates

Welcome to CrawlForge: Enterprise Web Scraping for AI

Introducing CrawlForge MCP - a suite of 20 specialized web scraping API tools built for modern AI apps that makes web data extraction simple and scalable.

C
CrawlForge Team
|
Dec 20
|
3m

Footer

CrawlForge

Enterprise web scraping for AI Agents. 23 specialized MCP tools designed for modern developers building intelligent systems.

Product

  • Features
  • Pricing
  • Use Cases
  • Integrations
  • Changelog

Resources

  • Getting Started
  • API Reference
  • Templates
  • Guides
  • Blog
  • FAQ

Developers

  • MCP Protocol
  • Claude Desktop
  • Cursor IDE
  • LangChain
  • LlamaIndex

Company

  • About
  • Contact
  • Privacy
  • Terms

Stay updated

Get the latest updates on new tools and features.

Built with Next.js and MCP protocol

© 2025-2026 CrawlForge. All rights reserved.