webcrawlerapi

Package Information

Released: 7/7/2025

Downloads: 0 weekly / 0 monthly

Latest Version: 0.1.6

Author: niiotyo

Available Nodes

WebCrawlerAPI

Scrape a webpage content in markdown format ready for LLM and RAG

Documentation

Banner image

n8n WebCrawler API Node

WebcrawlerAPI is a website to markdown API for LLM and RAG. You can get content of the website or a single page with a
simple API call. Check more at WebCrawlerAPI. Our docs are available at WebCrawlerAPI Docs.

This package contains n8n community nodes for integrating with WebCrawler API.

Nodes

WebCrawlerAPI Node

The WebCrawlerAPI node allows you to scrape web content using the WebCrawlerAPI service.

Features

Scrape content from any URL using the /v2/scrape endpoint
Choose output format: Markdown, Cleaned, or HTML
Remove unwanted elements using CSS selectors
Use AI-powered prompts to extract specific information from the page

Credentials

To use this node, you need a WebCrawlerAPI credential:

API Key: Obtain from WebCrawlerAPI Dashboard

Parameters

URL to scrape (required): The URL of the page to scrape
Output format (optional): Markdown (default), Cleaned, or HTML
CSS selectors to remove (optional): Comma-separated list of selectors (e.g., .ads, .popup, #sidebar)
Prompt (optional): AI prompt to extract specific information

Response Fields

success: Boolean indicating if the request was successful
status: Status of the scraping operation
markdown: Content formatted as Markdown
cleaned_content: Cleaned text content
raw_content: Raw HTML content
page_title: Title of the scraped page

Example Usage

Add the WebCrawlerAPI node to your workflow
Configure your WebCrawlerAPI credentials. Get your API key here.
Set the URL you want to scrape
Optionally, choose output format, add CSS selectors, or a prompt
Execute the workflow

Installation

npm install n8n-nodes-webcrawlerapi

Development

# Install dependencies
npm install

# Build the package
npm run build

# Run in development mode
npm run dev

License

MIT