Web Toolkit

On demand validation and alternative formats of web page content.

Our Web Toolkit offers access to some of the algorithms and practices that we use internally while processing the web. Learn more from the links below, or keep reading about the free and paid plans.

  • Structured Data for extracting semantic data, such as schema.org, that publishers explicitly include in content. In addition to extraction, we apply common data normalization and validation practices for easier consumption by downstream clients.
  • Text Content for extracting text-based formats, such as Markdown, of web pages. Our hybrid processors can analyze semantic markup, stylesheets, and common accessibility practices to build a meaningful profile of the underlying content.

Pricing

The following, self-service plans are available from your Settings page to quickly get started on your own. For more advanced requirements, contact us to discuss additional options.

Free 100K Plan 1M Plan
Price25 USD /month 200 USD /month
Price (annual)270 USD /year 2,160 USD /year
Included Usage250 credits /day 100,000 credits /month 1,000,000 credits /month
Structured Data
 ↳ Upload Size (/credit)2 MiB 2 MiB 2 MiB
 ↳ Max Upload Size4 MiB 8 MiB 8 MiB
Text Content
 ↳ Upload Size (/credit)2 MiB 2 MiB 2 MiB
 ↳ Max Upload Size4 MiB 8 MiB 8 MiB
Tools
 ↳ Web ApplicationYes Yes Yes
 ↳ Developer APIYes Yes

Anonymous, IP-based access is limited to 50 credits per day, but otherwise follows the Free plan.