Embedding Service Architecture
CPU-deployable RAG pipeline with multilingual embeddings, hybrid search, and AI-powered metadata extraction
System Architecture
Core Capabilities
Text Embeddings
Generate 384-dimensional semantic vectors for 100+ languages using E5-small model
Hybrid Search
Combine semantic (dense) and keyword (sparse) search with Reciprocal Rank Fusion
Cross-Encoder Reranking
Improve search relevance by reranking top results with cross-encoder model
PDF Processing
Automatic PDF parsing, chunking, and embedding using Docling document understanding
LLM Extraction
Extract structured metadata from documents using AI-powered LLM analysis (Romanian + English)
Vector Search
Fast semantic search with metadata filtering, score thresholds, and pagination support
Specs Management
View, search, and filter extracted specifications across all documents in a collection
Example Workflows
Real test data from
Using mock data - click refresh to run real benchmarks
Simple Semantic Search
Hybrid Search + Rerank
PDF Ingestion with AI Extraction
Cross-Language Retrieval
Quick API Examples
Generate Embeddings
{
"input": "Hello world",
"input_type": "document"
}
Hybrid Search
{
"query": "password reset",
"search_mode": "hybrid",
"rerank": true,
"top_k": 5
}
Upload PDFs
FormData: files: manual.pdf chunk_size: 400 extract_specs: true
Extract Metadata
{
"text": "Router TP-Link...",
"language_hint": "ro"
}
Technology Stack
Configuration
Configure your API key and service endpoint
Default: test-api-key
The embedding service endpoint
Health Dashboard
Monitor service and dependency status
Service Status
Detailed Health
Embedder
Qdrant
Generate Embeddings
Convert text into 384-dimensional vector embeddings
Response
Collections
Manage your vector collections
New Collection
No collections
Get started by creating a new collection.
| Name | Vectors | Dimensions | Distance | Actions |
|---|---|---|---|---|
Similarity Search
Find similar vectors using text queries
Search Results
Reranked ( candidates)No results found
Extracted Specifications
View and search all extracted specifications from uploaded PDFs
Loading specifications...
Found documents with extracted specs
Vector ID: Β·
Technical Specifications:
No specifications found
Select a collection to view specs No documents with extracted specs in this collection
Metadata Extraction
Extract structured metadata from text using LLM (OpenRouter API)
Text will be truncated to 14,000 characters
Optional: Define custom fields to extract
Extracted Metadata
Product
Manufacturer
Model Number
Specifications
Categories
PDF Upload
Upload PDFs to automatically chunk, embed, and store
Configuration
Extract product specs using AI (requires API key)
Optional: Define custom extraction fields
Processing PDFs...
Converting file(s), chunking with Docling, generating embeddings, and storing vectors
Upload Results
Total PDFs
Successful
Chunks Created
chunks β’ % confidence
Vector IDs
Extracted Metadata
Product
Manufacturer
Model
Specifications
Categories