Web Toolkit
On demand validation and alternative formats of web page content.
Our Web Toolkit offers access to some of the algorithms and practices that we use internally while processing the web. Learn more from the links below, or keep reading about the free and paid plans.
- Structured Data for extracting semantic data, such as schema.org, that publishers explicitly include in content. In addition to extraction, we apply common data normalization and validation practices for easier consumption by downstream clients.
- Text Content for extracting text-based formats, such as Markdown, of web pages. Our hybrid processors can analyze semantic markup, stylesheets, and common accessibility practices to build a meaningful profile of the underlying content.
Pricing
The following, self-service plans are available from your Settings page to quickly get started on your own. For more advanced requirements, contact us to discuss additional options.
Free | 100K Plan | 1M Plan | |
---|---|---|---|
Price | — | 25 USD /month | 200 USD /month |
Price (annual) | — | 270 USD /year | 2,160 USD /year |
Included Usage | 250 credits /day | 100,000 credits /month | 1,000,000 credits /month |
Structured Data | |||
↳ Upload Size (/credit) | 2 MiB | 2 MiB | 2 MiB |
↳ Max Upload Size | 4 MiB | 8 MiB | 8 MiB |
Text Content | |||
↳ Upload Size (/credit) | 2 MiB | 2 MiB | 2 MiB |
↳ Max Upload Size | 4 MiB | 8 MiB | 8 MiB |
Tools | |||
↳ Web Application | Yes | Yes | Yes |
↳ Developer API | — | Yes | Yes |
Anonymous, IP-based access is limited to 50 credits per day, but otherwise follows the Free plan.