Skip to main content
The Schemas page lets you create and manage schemas that define what data to extract. Once created, these schemas can be reused across multiple workflows. For example, if you need to extract store locations from 5 different websites, you can create one schema and use it for all of them.

Create a new schema

  • Go to My Schemas and click ‘Create Schema’
  • Choose how to start:
    • Create your own - Define your own data structure with custom fields
    • Copy from an existing workflow - Use the schema from one of your workflows as a starting point
    • Copy from an existing schema - Duplicate and modify one of your existing schemas
  • Add, remove, or modify fields to match your data extraction needs
  • Save your schema to use it in future workflows

Using schemas in workflows

When you create a new workflow, you can select one of your saved schemas to ensure consistent data extraction across different sources. This saves time and ensures your data always follows the same structure, making it easier to work with your extracted information.

Data Types

When defining schemas, you specify the data type for each field to ensure accurate extraction and validation. Kadoa supports the following data types:
Data TypeDescriptionExample Use Cases
STRINGString/text contentProduct names, descriptions, article headlines
NUMBERNumeric values (integers, decimals)Quantities, ratings, scores, counts
BOOLEANTrue/false valuesAvailability status, feature flags, yes/no indicators
DATEDate valuesPublication dates, deadlines, event dates
DATETIMEDate and time valuesTimestamps, scheduled times, last updated
MONEYCurrency and monetary valuesPrices, costs, revenue, discounts
IMAGEImage URLs and referencesProduct photos, thumbnails, profile pictures
LINKURLs and hyperlinksProduct pages, external links, social media
OBJECTNested/complex JSON structuresStructured metadata, complex configurations
ARRAYLists/arrays of valuesTags, categories, multiple images, feature lists
Choose the appropriate data type to ensure your data is extracted and validated correctly.

Special Field Types

Beyond regular data fields, Kadoa supports special field types for advanced use cases:

Classification Fields

Automatically categorize content into predefined labels. Useful for:
  • Sentiment analysis (Positive/Negative/Neutral)
  • Content categorization (Technology/Business/Sports)
  • Priority classification (High/Medium/Low)
Learn more about classification in the SDK →

Metadata Fields (Raw Content)

Extract raw page content in different formats:
  • HTML - Raw HTML source code
  • MARKDOWN - Markdown formatted content
  • PAGE_URL - Page URL
Learn more about metadata fields in the SDK → Need help creating a custom schema? Contact our support team for assistance.
I