Every URL on
any domain.
Discover the full shape of a website in one call. Recursive sitemap indexes, gzip compression, and malformed XML are handled silently.
Discover all URLs from a website's sitemap.
Start free with no card required. Most teams ship a first integration in under ten minutes.
Not another JSON page.
A production workflow in one call
One call in. The full URL graph out.
Before Orsa
- A browser worker, proxy account, parser, retry queue, and schema contract for one endpoint.
- Product teams wait while platform teams debug website-specific failures.
- Every new data field becomes another brittle scraper branch.
After Orsa
- One call in, the full URL graph out, with retries, rendering, and validation handled behind the API.
- The response shape is typed, documented, and ready for product code.
- Teams combine it with adjacent Orsa endpoints without adding vendors.
Teach the workflow,
then show the endpoint
Point Orsa at a URL, let the platform handle the web work, and receive a response your product can trust.
Send a URL
nytimes.com
Render, retry, enrich
Orsa handles browser execution, proxy escalation, parsing, validation, caching, and typed response shaping behind the API.
One call in. The full URL graph out.
Use the result in crawl planning without owning the extraction stack.
- REST path: /api/v1/web/scrape/sitemap. Same endpoint used by the SDK examples below.
- Input shape: URL, for example nytimes.com.
- SDKs: TypeScript, Python, cURL. Start with TypeScript, Python, or direct cURL.
- Best first use: Crawl planning. Seed crawls with the real surface area of a site, not just the homepage.
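As a rough illustration of the details listed above, here is a minimal Python sketch that builds a request for the sitemap endpoint without sending it. Only the path /api/v1/web/scrape/sitemap and the nytimes.com input come from this page. The host (api.orsa.example), the POST method, the "url" body field, and the Bearer header are all assumptions made for the example, so check the Orsa reference before relying on any of them.

```python
import json
import urllib.request

# Path taken from this page; everything else below is an assumption.
SITEMAP_PATH = "/api/v1/web/scrape/sitemap"


def build_sitemap_request(base_url: str, api_key: str, url: str) -> urllib.request.Request:
    """Build (but do not send) a request for the sitemap endpoint.

    Assumed, not documented on this page: POST method, a JSON body with a
    "url" field, and Bearer-token authentication.
    """
    body = json.dumps({"url": url}).encode("utf-8")
    return urllib.request.Request(
        base_url.rstrip("/") + SITEMAP_PATH,
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# Placeholder host and key; substitute the real values from your account.
request = build_sitemap_request("https://api.orsa.example", "YOUR_API_KEY", "nytimes.com")
# To send it: urllib.request.urlopen(request), then json.load() the response.
```

Keeping request construction separate from sending makes the assumed parts easy to inspect and adjust once the real contract is known.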
Performance your product
can actually plan around
Every endpoint page should answer the practical buying question: will this hold up once it leaves the demo?
More than the response.
The operating layer behind it
Stripe pages teach the system around the API: inputs, retries, observability, adjacent products, and the code path. This section does the same for Orsa endpoints.
- Typed response contracts for product code and AI tools.
- Browser, proxy, cache, and validation logic handled by Orsa.
- Direct fit for crawl planning and SEO ops.
Endpoint
/api/v1/web/scrape/sitemap
Example input
nytimes.com
Promise
One call in. The full URL graph out.
Pairs with
Crawl Website, Scrape Markdown, Scrape HTML
What teams ship with
every URL on any domain.
Each product page now speaks to the real workflow behind the endpoint, with concrete jobs instead of a generic feature list.
Crawl planning
Seed crawls with the real surface area of a site, not just the homepage.
SEO ops
Diff sitemaps over time to catch indexing regressions early.
Docs mirrors
Pull every docs path before you snapshot content to Markdown.
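The SEO ops job above, diffing sitemaps over time, can be sketched in a few lines of Python. This is one possible approach rather than anything Orsa ships: the names SitemapDiff and diff_sitemaps are hypothetical, the two input lists are assumed to be the "urls" arrays from two separate calls, and the example URLs under example.com are made up. URLs are compared as exact strings, so trailing slashes and query parameters are not normalized.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SitemapDiff:
    """URLs that appeared or disappeared between two sitemap snapshots."""
    added: list[str]
    removed: list[str]


def diff_sitemaps(previous: list[str], current: list[str]) -> SitemapDiff:
    """Compare two URL lists by exact string match and return sorted changes."""
    before, after = set(previous), set(current)
    return SitemapDiff(
        added=sorted(after - before),
        removed=sorted(before - after),
    )


# Made-up snapshots standing in for the "urls" field of two responses.
yesterday = ["https://example.com/", "https://example.com/pricing"]
today = ["https://example.com/", "https://example.com/blog"]

diff = diff_sitemaps(yesterday, today)
```

A non-empty removed list is the signal worth alerting on, since a page that drops out of the sitemap may also drop out of search indexes.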
Keep the code small.
Let Orsa do the messy part
Use the endpoint directly, then combine it with adjacent Orsa APIs as the workflow grows.
{
  "domain": "nytimes.com",
  "urls": [
    "https://www.nytimes.com/",
    "https://www.nytimes.com/section/world"
  ],
  "sources": ["https://www.nytimes.com/sitemap.xml"]
}
Build the full workflow,
not another point solution
The best product integrations usually combine two or three Orsa endpoints behind one customer experience.
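Before handing URLs to a second endpoint, it can help to load the sitemap response into a small typed structure and pick out the paths the next step needs. The sketch below assumes the response has exactly the three fields in the sample on this page (domain, urls, sources). SitemapResult, parse_sitemap_response, and urls_under are hypothetical helper names, not part of any Orsa SDK.

```python
import json
from dataclasses import dataclass
from urllib.parse import urlparse


@dataclass(frozen=True)
class SitemapResult:
    """Typed view of the sample response: domain, urls, sources."""
    domain: str
    urls: list[str]
    sources: list[str]


def parse_sitemap_response(payload: str) -> SitemapResult:
    """Parse a JSON payload shaped like the sample response on this page."""
    data = json.loads(payload)
    return SitemapResult(
        domain=data["domain"],
        urls=list(data["urls"]),
        sources=list(data["sources"]),
    )


def urls_under(result: SitemapResult, prefix: str) -> list[str]:
    """Return the URLs whose path starts with the given prefix."""
    return [u for u in result.urls if urlparse(u).path.startswith(prefix)]


# The sample response from this page, used as a stand-in for a live call.
sample = """
{
  "domain": "nytimes.com",
  "urls": [
    "https://www.nytimes.com/",
    "https://www.nytimes.com/section/world"
  ],
  "sources": ["https://www.nytimes.com/sitemap.xml"]
}
"""

result = parse_sitemap_response(sample)
section_urls = urls_under(result, "/section/")
```

The filtered list is what a follow-up step, such as snapshotting pages to Markdown, would iterate over.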
The questions teams ask
before shipping
Short answers for the practical details: rendering, limits, freshness, and how this fits into production.
Put this endpoint
in your product today
Try the live endpoint, then wire the same response into your app with one API key.
One API key for every Orsa endpoint · No card required to start.