We currently support Node.js/TypeScript. Python SDK coming soon.
Prerequisites
To get the most out of this guide, you’ll need to:- Create a Kadoa account
- Get your API key
1. Install
2. Extract Data
Choose Your Extraction Method
Auto-Detection
Let AI find and extract data automatically (Recommended)
Custom Schema
Define exactly what fields you want
Raw Content
Get HTML, Markdown, or plain text
Classification
Categorize content automatically
Extraction Methods
Auto-Detection
The fastest way to extract data. AI automatically identifies structured content:Custom Schema (Builder API)
Define exactly what you want to extract with type-safe field definitions:Raw Content Extraction
Extract raw HTML, Markdown, or URLs without structure:html
- Raw HTMLmarkdown
- Markdown formatted contenturl
- Page URLs
Classification
Automatically categorize content into predefined classes:Hybrid Extraction
Combine structured fields with raw content:Real-time Notifications
Get instant WebSocket notifications when data changes:EXTRACTION_STARTED
- Extraction beginsEXTRACTION_COMPLETED
- Extraction finishedDATA_CHANGED
- New data detectedVALIDATION_COMPLETED
- Validation finishedERROR
- Error occurred
Workflow Scheduling
Create reusable workflows that run on a schedule:Pagination Handling
Automatically navigate through multiple pages:Reuse Existing Schemas
Reference previously created schemas:Getting Help
If you’re stuck:- Check the examples in this documentation
- Browse the GitHub examples
- Search GitHub Issues
- Contact support at support@kadoa.com