JSON-LD
Data
Definition
JSON-LD (JavaScript Object Notation for Linked Data) is a W3C-standardized method of encoding linked, structured data in JSON. It is the format Google recommends for embedding schema.org markup in web pages so that search engines can interpret their content.
How It Relates to CrawlForge
JSON-LD is embedded in a web page inside a <script type="application/ld+json"> tag and contains structured metadata about the page content: product details, article information, organization data, and more. Because it is already machine-readable, it gives scrapers clean, structured data without any need to interpret the page layout.
The CrawlForge extract_metadata tool detects and extracts JSON-LD from pages, giving you access to structured data that the site owner has already organized. This is often more reliable than parsing visual elements, because JSON-LD is designed for machine consumption rather than for display.
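As a rough illustration of why JSON-LD is convenient to scrape, the sketch below pulls every application/ld+json block out of an HTML string using only Python's standard library. It is not CrawlForge's implementation of extract_metadata. The JsonLdExtractor class, the extract_json_ld helper, and the sample page are all hypothetical names invented for this example.

```python
import json
from html.parser import HTMLParser


class JsonLdExtractor(HTMLParser):
    """Hypothetical helper: collects parsed JSON-LD blocks from an HTML page."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False  # True while inside a JSON-LD script tag
        self._buffer = []        # raw text chunks of the current block
        self.items = []          # parsed JSON-LD objects found so far

    def handle_starttag(self, tag, attrs):
        if tag == "script":
            type_attr = dict(attrs).get("type") or ""
            if type_attr.strip().lower() == "application/ld+json":
                self._in_jsonld = True
                self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            try:
                parsed = json.loads("".join(self._buffer))
            except json.JSONDecodeError:
                return  # skip blocks that are not valid JSON
            # A block may hold a single object or a list of objects.
            if isinstance(parsed, list):
                self.items.extend(parsed)
            else:
                self.items.append(parsed)


def extract_json_ld(html):
    """Return a list of all JSON-LD objects embedded in the given HTML string."""
    parser = JsonLdExtractor()
    parser.feed(html)
    parser.close()
    return parser.items


# Made-up sample page used only for this illustration.
SAMPLE_HTML = """
<html><head>
<title>Example Widget</title>
<script type="application/ld+json">
{"@context": "https://schema.org", "@type": "Product", "name": "Example Widget",
 "offers": {"@type": "Offer", "price": "19.99", "priceCurrency": "USD"}}
</script>
</head><body><h1>Example Widget</h1></body></html>
"""

if __name__ == "__main__":
    for item in extract_json_ld(SAMPLE_HTML):
        print(item["@type"], item["name"], item["offers"]["price"])
```

The product name and price come straight from the embedded metadata, so this approach does not depend on CSS selectors that can break when the visible layout changes. Real pages may wrap several entities in an @graph array or contain malformed JSON, which this sketch simply skips.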
Related Terms
Structured Data
Structured data is information organized in a predefined format that makes it easy for machines to parse and understand. On the web, it typically refers to schema.org markup embedded in HTML pages.
Schema Markup
Schema markup is a vocabulary of tags (from schema.org) that you add to HTML to improve how search engines read and represent your page. It defines types like Product, Article, Organization, and their properties.
JSON
JSON (JavaScript Object Notation) is a lightweight data interchange format that is easy for humans to read and machines to parse. It is the standard format for API responses and structured data exchange.
HTML Parsing
HTML parsing is the process of analyzing HTML markup to extract its structure and content. Parsers convert raw HTML strings into navigable tree structures that programs can query and manipulate.
Start Scraping with 1,000 Free Credits
Get started with CrawlForge today. No credit card required.