/scrape: non-JS scraping
POST /scrape
A high-performance web scraping endpoint that uses a Chrome browser TLS fingerprint, but without JavaScript execution or the overhead of a real browser.
Request
- application/json
Body
required
URL to scrape
Custom headers to send with the request. By default, regular Chrome browser headers are sent to the target URL.
Number of attempts.
Default value: us
Geo location for basic proxy pools (you can purchase premium ScrapeNinja proxies for wider country selection and higher proxy quality). Read more about ScrapeNinja proxy setup
Premium or your own proxy URL (overrides the geo field). Read more about ScrapeNinja proxy setup
Default value: 1
Follow redirects.
Default value: 10
Timeout per attempt, in seconds. Each retry can take up to [timeout] seconds.
Text that, when found in the response, triggers a retry from another proxy address.
Default value: 403,502
HTTP response statuses which will trigger a retry from another proxy address.
Custom JS function to extract JSON values from the scraped HTML. Write and test your own extractor at https://scrapeninja.net/cheerio-sandbox/
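The body fields above can be assembled into a JSON request roughly as follows. This is a minimal sketch, not the official client: the JSON key names (`url`, `headers`, `retryNum`, `followRedirects`, `timeout`, `textNotExpected`, `statusNotExpected`, `proxy`) are assumptions inferred from the descriptions, since only `geo` is named explicitly in this section, and the "Name: value" string format for request headers is likewise assumed. The helper `build_scrape_body` is hypothetical. Defaults mirror the documented ones (`us`, `1`, `10`, `403,502`).

```python
import json


def build_scrape_body(url, *, headers=None, retry_num=None, geo="us",
                      proxy=None, follow_redirects=1, timeout=10,
                      text_not_expected=None, status_not_expected=(403, 502)):
    """Build a JSON-serializable body for POST /scrape.

    Key names are assumed from the field descriptions, not confirmed.
    """
    if not url:
        raise ValueError("url is required")

    body = {
        "url": url,                                    # required: URL to scrape
        "geo": geo,                                    # documented default: us
        "followRedirects": follow_redirects,           # documented default: 1
        "timeout": timeout,                            # seconds per attempt, default 10
        "statusNotExpected": list(status_not_expected),  # default 403,502
    }
    # Optional fields are only sent when the caller sets them.
    if headers:
        body["headers"] = list(headers)
    if retry_num is not None:
        body["retryNum"] = retry_num
    if proxy:
        body["proxy"] = proxy                          # overrides the geo field
    if text_not_expected:
        body["textNotExpected"] = list(text_not_expected)
    return body


# Serialize with Content-Type: application/json when sending the request.
payload = json.dumps(build_scrape_body("https://example.com/url",
                                       headers=["X-Demo: 1"]))
print(payload)
```

The sketch stops at serialization on purpose: the host and authentication details are not given in this section, so sending the request is left to whichever HTTP client you already use.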
Responses
- 200
OK
- application/json
Schema
info object
HTML body of the rendered page.
Example (from schema)
{
  "info": {
    "statusCode": 200,
    "finalUrl": "https://example.com/url",
    "headers": [
      "content-type: xxx",
      "header2: val2"
    ]
  },
  "body": "<html><body><h1>Hello World!</h1></body></html>"
}
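A response shaped like the example above can be unpacked as in the following sketch. It relies only on the fields shown in the example (`info.statusCode`, `info.finalUrl`, `info.headers`, `body`); the helper name `parse_scrape_response` and the choice to lowercase header names are illustrative assumptions.

```python
import json


def parse_scrape_response(raw):
    """Turn the /scrape JSON response into a flat dict.

    Header lines arrive as "name: value" strings and are split on the
    first colon only, so values containing colons stay intact.
    """
    data = json.loads(raw)
    info = data.get("info", {})

    headers = {}
    for line in info.get("headers", []):
        name, sep, value = line.partition(":")
        if sep:  # skip malformed lines without a colon
            headers[name.strip().lower()] = value.strip()

    return {
        "status": info.get("statusCode"),
        "final_url": info.get("finalUrl"),
        "headers": headers,
        "html": data.get("body", ""),
    }


# The sample below is the schema example from this section.
sample = """
{
  "info": {
    "statusCode": 200,
    "finalUrl": "https://example.com/url",
    "headers": ["content-type: xxx", "header2: val2"]
  },
  "body": "<html><body><h1>Hello World!</h1></body></html>"
}
"""

result = parse_scrape_response(sample)
print(result["status"], result["final_url"], result["headers"])
```

Note that `info.statusCode` is the status reported for the target page, so it is worth checking it separately from the HTTP status of the API call itself.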