Crawl website
Queue an asynchronous website crawl. Poll the returned operation URL for status and results.
POST
Crawl website
POST /api/v1/platform/scrapers/web/crawl
Crawl a website and return discovered page content.Crawl a website from a starting URL with an optional page limit.
Example
Authorizations
The public API key from the Credentials page.
The API secret shown when the credential is created.
Headers
Optional. Reusing the same key for the same organization returns the existing queued crawl operation.
Body
application/json
Domain or URL to crawl
Example:
"https://example.com"
Required range:
1 <= x <= 50Maximum number of pages to crawl. Overrides limit when set.
Required range:
1 <= x <= 50Maximum link depth from the starting URL.
Required range:
0 <= x <= 5Restrict discovered links to the starting hostname.
Instructions for content extraction