Firecrawl

Crawl, scrape, and extract structured data from websites

Firecrawl crawls websites and extracts their content. It handles JavaScript rendering, pagination, and complex site structures, and returns either clean markdown or structured data.

Overview

| Property | Value |
| --- | --- |
| Type | firecrawl |
| Category | Tool (Web Scraping) |
| Auth | API Key |

Operations

| Operation | Description |
| --- | --- |
| Scrape | Extract content from a single page |
| Crawl | Crawl multiple pages from a starting URL |
| Map | Get a sitemap of all discoverable URLs |
| Extract | Extract structured data using a schema |

Configuration

| Setting | Type | Description |
| --- | --- | --- |
| API Key | Password | Firecrawl API key |
| URL | Short input | Target URL to scrape or crawl |
| Formats | Checkbox list | markdown, html, links, screenshot |
| Max Pages | Slider | Maximum pages to crawl (1–100) |
| Only Main Content | Toggle | Exclude headers, footers, sidebars |
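
The settings above can be mirrored in code to catch invalid values before a workflow runs. The following is a minimal sketch, not Zelaxy's actual schema: the class name `FirecrawlBlockConfig`, its field names, the default values, and the URL scheme check are all assumptions made for illustration. Only the allowed formats and the 1–100 page range come from the table.

```python
from __future__ import annotations

from dataclasses import dataclass, field

# Format names taken from the Formats row of the Configuration table.
ALLOWED_FORMATS = {"markdown", "html", "links", "screenshot"}


@dataclass
class FirecrawlBlockConfig:
    """Hypothetical mirror of the block settings (not an official schema)."""

    api_key: str
    url: str
    formats: list[str] = field(default_factory=lambda: ["markdown"])  # assumed default
    max_pages: int = 10  # assumed default; the table only gives the 1-100 range
    only_main_content: bool = True  # assumed default

    def validate(self) -> None:
        """Raise ValueError if any setting falls outside the documented limits."""
        if not self.api_key:
            raise ValueError("API Key is required")
        # Assumption: only absolute http(s) URLs are accepted.
        if not self.url.startswith(("http://", "https://")):
            raise ValueError("URL must start with http:// or https://")
        unknown = set(self.formats) - ALLOWED_FORMATS
        if unknown:
            raise ValueError(f"Unsupported formats: {sorted(unknown)}")
        if not 1 <= self.max_pages <= 100:
            raise ValueError("Max Pages must be between 1 and 100")


config = FirecrawlBlockConfig(api_key="fc-example", url="https://example.com", max_pages=20)
config.validate()  # passes silently when every setting is valid
```

Validating up front keeps a bad slider value or a mistyped format name from surfacing only after the crawl has started.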

Outputs

| Field | Type | Description |
| --- | --- | --- |
| content | string | Extracted content (markdown) |
| pages | json | Array of crawled page objects |
| metadata | json | Page metadata (title, description) |
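
To show how a downstream step might read these fields, here is a small sketch that condenses a result into a few values for logging. The function name `summarize_output` and the sample data are made up for illustration. The keys inside each page object (`url`, `markdown`) are assumptions, since the table does not specify the shape of a page object.

```python
def summarize_output(output: dict) -> dict:
    """Condense a Firecrawl block result into a few fields for logging."""
    pages = output.get("pages") or []
    metadata = output.get("metadata") or {}
    return {
        "title": metadata.get("title", ""),
        "description": metadata.get("description", ""),
        "page_count": len(pages),
        "content_chars": len(output.get("content") or ""),
    }


# Made-up sample shaped like the Outputs table; the per-page keys are assumptions.
sample_output = {
    "content": "# Pricing\n\nStarter plan: $10/month",
    "pages": [
        {"url": "https://example.com/pricing", "markdown": "# Pricing"},
        {"url": "https://example.com/features", "markdown": "# Features"},
    ],
    "metadata": {"title": "Pricing", "description": "Plans and pricing"},
}

summary = summarize_output(sample_output)
print(summary)
```

The `or` fallbacks let the function tolerate a result where `pages` or `metadata` is missing or null, which may happen when only a single page is scraped.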

Example: Competitive Intelligence

Workflow:

[Starter: Competitor URL] → [Firecrawl: Crawl] → [Agent: Analyze] → [Notion: Save]

Crawl a competitor's website (landing pages, pricing, features), then have an Agent analyze the content and save a competitive analysis report to Notion.

Firecrawl Config:

  • URL: {{starter.input}}
  • Max Pages: 20
  • Only Main Content: enabled
  • Formats: markdown
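
The `{{starter.input}}` reference in the config is filled in from the Starter block's output when the workflow runs. The sketch below illustrates that idea only; it is not Zelaxy's resolver. The `resolve` function, the dictionary keys, and the example URL are hypothetical, and the Only Main Content setting is left out.

```python
import re

# Matches references such as {{starter.input}}: dotted names inside double braces.
_REFERENCE = re.compile(r"\{\{\s*([\w.]+)\s*\}\}")


def resolve(value: str, context: dict) -> str:
    """Replace each {{block.field}} reference with its value from context."""

    def lookup(match: re.Match) -> str:
        node = context
        for part in match.group(1).split("."):
            node = node[part]  # raises KeyError if the reference is unknown
        return str(node)

    return _REFERENCE.sub(lookup, value)


# Hypothetical representation of the example config above.
firecrawl_config = {
    "url": "{{starter.input}}",
    "max_pages": 20,
    "formats": ["markdown"],
}

# Hypothetical output of the Starter block.
context = {"starter": {"input": "https://competitor.example"}}

resolved = {
    key: resolve(value, context) if isinstance(value, str) else value
    for key, value in firecrawl_config.items()
}
print(resolved["url"])
```

Only string settings are passed through `resolve`, so numeric and list values such as Max Pages and Formats are left unchanged.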

Tips

  • Scrape vs Crawl — Scrape for one page, Crawl for multiple pages from a site
  • Markdown format is best for AI processing — clean text without HTML noise
  • Only Main Content removes navigation, ads, and boilerplate
  • Extract mode — define a JSON schema to get structured data from pages
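
As an illustration of the Extract tip, a schema for pulling pricing plans from a page might look like the sketch below. The field names (`plans`, `name`, `monthly_price_usd`) are hypothetical, and how the schema is supplied to the Firecrawl block is not covered here. The helper `missing_required` is a deliberately shallow check of top-level required keys, not a full JSON Schema validator.

```python
import json

# Hypothetical JSON Schema describing the data to extract from a pricing page.
pricing_schema = {
    "type": "object",
    "properties": {
        "plans": {
            "type": "array",
            "items": {
                "type": "object",
                "properties": {
                    "name": {"type": "string"},
                    "monthly_price_usd": {"type": "number"},
                },
                "required": ["name"],
            },
        },
    },
    "required": ["plans"],
}


def missing_required(record: dict, schema: dict) -> list[str]:
    """Return the top-level required keys that are absent from record."""
    return [key for key in schema.get("required", []) if key not in record]


# Serialize the schema as JSON text, ready to paste into a schema field.
schema_text = json.dumps(pricing_schema, indent=2)
print(schema_text)

# Made-up extraction result used to exercise the check.
extracted = {"plans": [{"name": "Starter", "monthly_price_usd": 10}]}
print(missing_required(extracted, pricing_schema))
```

Keeping the schema small and marking only essential fields as required tends to make extraction more forgiving when pages vary in layout.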