/scrape: non-JS scraping
POST/scrape
Use high-performance web scraping endpoint with Chrome browser TLS fingerprint, but without JavaScript execution and real browser overhead.
Request
- application/json
Body
required
URL to scrape
Custom headers to send with the request. By default, regular Chrome browser headers are sent to the target URL.
Amount of attempts.
Default value: us
Geo location.
Custom proxy URL (overrides geo field). Available starting from ULTRA plan.
Default value: 1
Follow redirects.
Default value: 10
Timeout per attempt, in seconds. Each retry will take [timeout] number of seconds.
Text which will trigger a retry from another proxy address.
Default value: 403,502
HTTP response statuses which will trigger a retry from another proxy address.
Custom JS function to extract JSON values from scraped HTML. Write&test your own extractor on https://scrapeninja.net/cheerio-sandbox/
Responses
- 200
OK
- application/json
- Schema
- Example (from schema)
Schema
info object
HTML body of the rendered page.
{
"info": {
"statusCode": 200,
"finalUrl": "https://example.com/url",
"headers": [
"content-type: xxx",
"header2: val2"
]
},
"body": "<html><body><h1>Hello World!</h1></body></html>"
}