Available Knowledge Base Types
Milvus
Vector Database β
Production-ready vector database with semantic search capabilities.
Additional vector database integrations will be supported in future releases.
Milvus Knowledge Base
Milvus is a vector database that enables semantic search over your documents using dense vector embeddings. Knowledge bases support customizable chunking strategies, embedding models, and structured data processing.Basic Configuration
Creating with API
Knowledge bases are created via the API with full control over processing configuration. See Create Knowledge Base API for complete documentation.Required Fields
Unique identifier for this knowledge base instance
Must be set to
"milvus"Knowledge base ID for isolation and multi-tenancy
Retrieval Configuration
Theretrieval_config object controls how documents are retrieved:
Number of most relevant results to returnRange: 1-1000
HNSW search parameter controlling accuracy vs speed trade-offRange: 1-512
Higher values = more accurate but slower
Higher values = more accurate but slower
Distance metric for similarity searchOptions:
COSINE- Cosine similarity (recommended)L2- Euclidean distanceIP- Inner product
Number of results to skip (for pagination)
Minimum similarity score thresholdRange: 0.0-1.0
Only return results above this score
Only return results above this score
Search strategy for retrievalOptions:
dense- Vector-based semantic search only (default, fastest)hybrid- Combines dense vector + BM25 keyword searchbm25- BM25 keyword-based search onlyhybrid_rrf- Hybrid with Reciprocal Rank Fusion for improved ranking
Usage Examples
Basic Knowledge Base Agent
Hybrid Search with RRF Fusion
Query Expansion
Multiple Knowledge Bases
Chunking Strategies
Knowledge bases support three chunking strategies configured during creation:Recursive (Default)
Splits text using multiple separators in order (paragraphs β sentences β words). Best for general documents. Parameters:strategy:"recursive"chunk_size: Size in words/characters (default: 500)chunk_overlap: Overlap between chunks (default: 50)split_by:"word"|"char"(only these two options)recursive_separators: Array of separators to try in order
Hierarchical
Creates multi-level chunks preserving document structure. Ideal for academic papers and structured documents. Parameters:strategy:"hierarchical"hierarchical_block_sizes: Descending array of block sizes (e.g.,[700, 350, 150])chunk_overlap: Overlap between chunkssplit_by:"word"|"sentence"
Fixed
Simple fixed-size chunks. Fastest processing for straightforward documents. Parameters:strategy:"fixed"chunk_size: Fixed chunk sizechunk_overlap: Overlap between chunkssplit_by:"word"|"sentence"
Structured Data Processing
CSV and Excel files (.csv, .xlsx, .xls) support specialized processing:
Configuration
Structured Config Parameters
Number of rows to combine into one searchable chunk (1-20)Lower values = More precise retrieval, slower ingestion
Higher values = Faster ingestion, broader context
Higher values = Faster ingestion, broader context
Output format for table dataOptions:
csv- Comma-separated valuesmarkdown- Markdown table format
Column name containing the main text content (CSV only)Required for
row mode processingProcessing mode for CSV filesOptions:
row- One document per row (precise retrieval)file- One document per file (holistic context)
Embedding Providers
Choose your embedding model during knowledge base creation:Azure OpenAI
text-embedding-ada-002(1536 dimensions)
Mistral AI
mistral-embed(1024 dimensions)
Telekom OTC
text-embedding-bge-m3(1024 dimensions) - BGE multilingual modeljina-embeddings-v2-base-de(768 dimensions) - German-optimizedjina-embeddings-v2-base-code(768 dimensions) - Code-optimizedtsi-embedding-colqwen2-2b-v1(1024 dimensions) - TSI ColQwen2
Best Practices
Chunking
Retrieval
Next Steps
Agent Capabilities
Explore all agent capabilities including tools and streaming
Agent Types
Learn about different agent types