Automatic Content Intelligence
When you add content, Supermemory:- Detects the content type — PDF, code, markdown, images, video, etc.
- Extracts content optimally — Uses type-specific extraction (OCR for images, transcription for audio)
- Chunks intelligently — Applies the right chunking strategy for the content type
- Generates embeddings — Creates vector representations for semantic search
- Builds relationships — Connects new knowledge to existing memories
Smart Chunking by Content Type
Different content types need different chunking strategies. Supermemory applies the optimal approach automatically:Documents (PDF, DOCX)
PDFs and documents are chunked by semantic sections — headers, paragraphs, and logical boundaries. This preserves context better than arbitrary character splits.Code
Code is chunked using code-chunk, our open-source library that understands AST (Abstract Syntax Tree) boundaries:- Functions and methods stay intact
- Classes are chunked by method
- Import statements grouped separately
- Comments attached to their code blocks
Web Pages
URLs are fetched, cleaned of navigation/ads, and chunked by article structure — headings, paragraphs, lists.Markdown
Chunked by heading hierarchy, preserving the document structure. See Content Types for the full list of supported formats.Hybrid Memory + RAG
Supermemory combines the best of both approaches in every search:Traditional RAG
- Finds similar document chunks
- Great for knowledge retrieval
- Stateless — same results for everyone
Memory System
- Extracts and tracks user facts
- Understands temporal context
- Personalizes results per user
searchMode: "hybrid" (the default), you get both:
Search Optimization
Two flags give you fine-grained control over result quality:Reranking
Re-scores results using a cross-encoder model for better relevance:Query Rewriting
Expands your query to capture more relevant results:Why It’s “Super”
| Traditional RAG | SUPER RAG |
|---|---|
| Manual chunking config | Automatic per content type |
| One-size-fits-all splits | AST-aware code chunking |
| Just document retrieval | Hybrid memory + documents |
| Static embeddings | Relationship-aware graph |
| Generic search | Rerank + query rewriting |