Package Information
Released: 7/7/2025
Downloads: 0 weekly / 0 monthly
Latest Version: 0.1.6
Author: niiotyo
Documentation
n8n WebCrawler API Node
WebcrawlerAPI is a website to markdown API for LLM and RAG. You can get content of the website or a single page with a
simple API call. Check more at WebCrawlerAPI. Our docs are available at WebCrawlerAPI Docs.
This package contains n8n community nodes for integrating with WebCrawler API.
Nodes
WebCrawlerAPI Node
The WebCrawlerAPI node allows you to scrape web content using the WebCrawlerAPI service.
Features
- Scrape content from any URL using the
/v2/scrape
endpoint - Choose output format: Markdown, Cleaned, or HTML
- Remove unwanted elements using CSS selectors
- Use AI-powered prompts to extract specific information from the page
Credentials
To use this node, you need a WebCrawlerAPI credential:
- API Key: Obtain from WebCrawlerAPI Dashboard
Parameters
- URL to scrape (required): The URL of the page to scrape
- Output format (optional): Markdown (default), Cleaned, or HTML
- CSS selectors to remove (optional): Comma-separated list of selectors (e.g.,
.ads, .popup, #sidebar
) - Prompt (optional): AI prompt to extract specific information
Response Fields
success
: Boolean indicating if the request was successfulstatus
: Status of the scraping operationmarkdown
: Content formatted as Markdowncleaned_content
: Cleaned text contentraw_content
: Raw HTML contentpage_title
: Title of the scraped page
Example Usage
- Add the WebCrawlerAPI node to your workflow
- Configure your WebCrawlerAPI credentials. Get your API key here.
- Set the URL you want to scrape
- Optionally, choose output format, add CSS selectors, or a prompt
- Execute the workflow
Installation
npm install n8n-nodes-webcrawlerapi
Development
# Install dependencies
npm install
# Build the package
npm run build
# Run in development mode
npm run dev
License
MIT