Output Formats
Both the scrape and crawl endpoints use output.formats to select which formats are returned.
{
  "output": {
    "formats": [
      "markdown",
      "links",
      "summary"
    ]
  }
}
Supported formats
output.formats supports:
- html
- raw_html
- markdown
- links
- summary
- screenshot
- json
Default:
["markdown"]
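As a rough illustration of the list and default above, the following Python sketch builds the output object for a request body. The helper name build_output and the client-side validation are assumptions made for illustration and are not part of the API. The sketch also sends the default explicitly rather than relying on the server to apply it.

```python
# Format names and default copied from the lists above.
SUPPORTED_FORMATS = (
    "html", "raw_html", "markdown", "links", "summary", "screenshot", "json",
)
DEFAULT_FORMATS = ["markdown"]


def build_output(formats=None):
    """Return an ``output`` object for a scrape or crawl request body.

    Hypothetical helper: falls back to the documented default when no
    formats are given and rejects names that are not in the supported list.
    """
    if not formats:
        return {"formats": list(DEFAULT_FORMATS)}
    unknown = [name for name in formats if name not in SUPPORTED_FORMATS]
    if unknown:
        raise ValueError(f"unsupported formats: {unknown}")
    # Drop duplicates while keeping the caller's order.
    return {"formats": list(dict.fromkeys(formats))}


payload = {"output": build_output(["markdown", "links", "summary"])}
```

Validating on the client is optional; it only surfaces a misspelled format name before the request is sent.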
Response fields
Results are returned under data (sync scrape) or from the result endpoints (scrape/crawl result).
- html: cleaned body HTML (head/scripts removed)
- raw_html: raw HTML
- markdown: Markdown conversion
- links: extracted links
- metadata: page metadata (title, status_code, content_type, proxy_location, etc.; may expand)
- screenshot: screenshot URL (when formats includes screenshot)
- summary: AI summary
- json: structured extraction result
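The sketch below shows one way a client might pick the requested fields out of a sync scrape response. The sample response is invented for illustration, and extract_fields is a hypothetical helper. It also assumes that metadata may be present even though it is not a requested format, which the field list suggests but does not state.

```python
def extract_fields(response, requested_formats):
    """Collect the fields for the requested formats from a sync scrape response.

    Hypothetical helper: a format that was requested but is absent from
    ``data`` is skipped rather than treated as an error.
    """
    data = response.get("data") or {}
    picked = {name: data[name] for name in requested_formats if name in data}
    # Assumption: metadata is returned alongside the requested formats.
    if "metadata" in data:
        picked["metadata"] = data["metadata"]
    return picked


# Made-up response body, shaped after the field list above.
sample = {
    "data": {
        "markdown": "# Title",
        "links": ["https://example.com/a"],
        "metadata": {"title": "Title", "status_code": 200},
    }
}

fields = extract_fields(sample, ["markdown", "links", "summary"])
```

Here summary was requested but is missing from the sample, so it is left out of fields.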
Screenshot
When formats includes screenshot, you can set output.screenshot:
- viewport (default)
- full_page
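A minimal Python sketch of adding a screenshot to a request follows. It assumes that output.screenshot takes the mode name as a plain string, which the text above implies but does not spell out. The helper with_screenshot is hypothetical.

```python
# Mode names copied from the list above.
SCREENSHOT_MODES = ("viewport", "full_page")


def with_screenshot(output, mode="viewport"):
    """Return a copy of ``output`` that requests a screenshot.

    Assumption: ``output.screenshot`` is a string holding the mode name.
    """
    if mode not in SCREENSHOT_MODES:
        raise ValueError(f"unknown screenshot mode: {mode!r}")
    formats = list(output.get("formats", []))
    if "screenshot" not in formats:
        # The option only applies when the screenshot format is requested.
        formats.append("screenshot")
    return {**output, "formats": formats, "screenshot": mode}


output = with_screenshot({"formats": ["markdown"]}, mode="full_page")
```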
JSON (structured extraction)
When formats includes json, you can specify output.json:
{
  "output": {
    "formats": [
      "json"
    ]
  },
  "json": {
    "prompt": "Extract product name and price from the page.",
    "json_schema": {
      "type": "object",
      "properties": {
        "product_name": {
          "type": "string"
        },
        "price": {
          "type": "string"
        }
      },
      "required": [
        "product_name",
        "price"
      ]
    }
  }
}
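Because the extraction result is produced by a model, a client may want to check it against the schema it sent. The sketch below is a deliberately small check covering only required keys and a few basic types. It is not a full JSON Schema validator, the helper check_extraction is hypothetical, and the sample results are invented for illustration.

```python
# The json_schema from the request above.
schema = {
    "type": "object",
    "properties": {
        "product_name": {"type": "string"},
        "price": {"type": "string"},
    },
    "required": ["product_name", "price"],
}

# Only the JSON Schema types this sketch knows how to check.
JSON_TYPES = {"string": str, "object": dict, "array": list, "boolean": bool}


def check_extraction(result, schema):
    """Return a list of problems found in ``result``; empty means it passed."""
    problems = []
    for key in schema.get("required", []):
        if key not in result:
            problems.append(f"missing: {key}")
    for key, spec in schema.get("properties", {}).items():
        expected = JSON_TYPES.get(spec.get("type"))
        if key in result and expected and not isinstance(result[key], expected):
            problems.append(f"wrong type: {key}")
    return problems


ok = check_extraction({"product_name": "Desk lamp", "price": "19.99"}, schema)
bad = check_extraction({"product_name": "Desk lamp", "price": 19.99}, schema)
```

For anything beyond a quick sanity check, a dedicated JSON Schema validation library is the more reliable choice.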