Skip to main content
POST
https://api.evergreens.ai
/
v1
/
networks
/
browser
/
content
{
  "urls": [
    "https://www.evergreen.media",
    "https://www.tirol.gv.at/"
  ],
  "format": ["html", "markdown"],
  "excludeHeader": true,
  "excludeFooter": true
}
{
  "isCached": true,
  "cacheExpiresIn": 7200,
  "results": {
    "https://evergreen.media": {
      "markdown": : "...",
      "html": "<!DOCTYPE html>..."
    },
    "https://www.tirol.gv.at/": {
      "markdown": "...",
      "html": "<!DOCTYPE html>..."
    }
  }
}
Extract the full content of web pages in various formats using Browser network component. Entity Reference:
ContentEntity

Request

urls
string[]
required
An array of URLs to extract elements from.
  • Minimum of 1 URL per request.
  • Maximum of 10 URLs per request.
format
('html' | 'markdown')[]
An array specifying the desired content formats to be returned for each URL.
  • Supported formats are html and markdown.
  • You can request multiple formats in a single request.
the default format is markdown.
excludeHeader
boolean
default:false
An optional flag to exclude headers from the extracted content.
  • Defaults to false.
This rule only meet tag <header> in HTML.
An optional flag to exclude footers from the extracted content.
  • Defaults to false.
This rule only meet tag <footer> in HTML.
cache
object
An optional object to configure caching behavior for the request. By default, caching is enabled with a TTL of 120 seconds. You can customize these settings by providing the cache object.

Response

The response contains the extracted content for each requested URL in the specified formats.
isCached
boolean
default:true
Indicates whether the response was served from cache.
cacheExpiresIn
number
default:7200
The time in seconds until the cached response expires. Default is 7200 seconds (2 hours).
results
Record<URL,{html?: string; markdown?: string}>
required
A mapping of each URL to its extracted content. Each URL maps to another object where keys are content formats and values are the corresponding extracted content.
{
  "urls": [
    "https://www.evergreen.media",
    "https://www.tirol.gv.at/"
  ],
  "format": ["html", "markdown"],
  "excludeHeader": true,
  "excludeFooter": true
}
{
  "isCached": true,
  "cacheExpiresIn": 7200,
  "results": {
    "https://evergreen.media": {
      "markdown": : "...",
      "html": "<!DOCTYPE html>..."
    },
    "https://www.tirol.gv.at/": {
      "markdown": "...",
      "html": "<!DOCTYPE html>..."
    }
  }
}