Key Features
- AI-Powered Retrieval: Vector-based semantic search
- Milvus Processing: Production-ready vector database engine
- Collection Integration: Build from organized file collections
- Status Monitoring: Track processing and health
- Agent Integration: Direct use in agent configurations
Authentication
All endpoints require authentication using your API key:- API Key:
x-api-key: <key>
Available Endpoints
List Knowledge Bases
Retrieve all knowledge bases
Create Knowledge Base
Create a new knowledge base
Get Knowledge Base
Retrieve knowledge base details
Update Knowledge Base
Modify knowledge base settings
Delete Knowledge Base
Remove a knowledge base
Check Status
Monitor processing status
Available Collections
List collections ready for use
Processing Engine
Milvus Vector Database- High-performance vector similarity search
- Scalable for large document collections
- Advanced HNSW indexing algorithms
- Real-time updates and queries
- Production-ready for enterprise deployments
Milvus is currently the only supported processing engine. Additional engines may be added in future releases.
Knowledge Base Lifecycle
1. Creation
Define knowledge base with collections and processing engine:2. Processing
Knowledge base processes collection content:- Text extraction: Extract and chunk content
- Embedding generation: Create vector representations
- Index building: Optimize for search performance
- Validation: Ensure data quality
3. Completion
Ready knowledge bases can be used by agents for retrieval.Knowledge Base Status
pending
pending
Knowledge base created but processing not started
running
running
Collections are being processed and indexed
completed
completed
All collections processed and ready for queries
failed
failed
Processing failed - check error details and logs
Data Sources and Collections
Knowledge bases track their data sources:Successfully processed data sources with metadata
Data sources that failed processing with error details
Percentage of successfully processed data sources
Number of collections included in the knowledge base
Agent Integration
Use knowledge bases in agent configurations:Retrieval Configuration
- Search parameters: Control relevance and results
- Context integration: How retrieved content is used
- Fallback behavior: When no relevant content is found
Performance Considerations
Indexing Time
- Collection size: Larger collections take longer to process
- Content complexity: Rich documents require more processing
- Engine choice: Different engines have varying performance characteristics
Query Performance
- Index optimization: Proper indexing improves search speed
- Result filtering: Limit results for faster responses
- Caching: Frequently accessed content is cached
Scalability
- Concurrent queries: Multiple agents can query simultaneously
- Update frequency: How often content changes
- Storage requirements: Vector storage grows with content
Monitoring and Troubleshooting
Health Checks
- Processing progress: Track completion percentage
- Error rates: Monitor failed data sources
- Query performance: Response times and accuracy
Common Issues
- Collection dependencies: Ensure collections are completed
- Processing failures: Check data source formats and content
- Memory limits: Large knowledge bases may hit resource limits
Use the status endpoint to monitor knowledge base health and processing progress.