Recursive split at 512 tokens with 15% overlap, respecting heading boundaries.
Multi-vector pattern: summary embedding for retrieval, raw table preserved for generation.
Layout detection filters decorative elements; informational visuals are VLM-captioned.
DePlot converts charts to markdown tables, then trends are summarised and indexed.
Tree-sitter AST chunking by function and class boundaries; generated/vendor code excluded.
VLM extraction to LaTeX plus natural-language descriptions for exact and conceptual lookups.
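The first strategy above (512-token chunks, 15% overlap, heading boundaries respected) can be sketched roughly as follows. This is a minimal illustration, not the InsightBot implementation: it assumes markdown-style `#` headings, uses whitespace-separated words as a stand-in for real tokenizer tokens, and replaces a full recursive separator-based split with a simple sliding window inside each heading section. The function names are hypothetical.

```python
import re

def split_sections(text):
    """Split text into sections, starting a new one at each markdown-style heading.

    Assumption: headings look like '# Title' through '###### Title'.
    """
    sections, current = [], []
    for line in text.splitlines():
        if re.match(r"#{1,6}\s", line) and current:
            sections.append("\n".join(current))
            current = []
        current.append(line)
    if current:
        sections.append("\n".join(current))
    return sections

def chunk_section(section, max_tokens=512, overlap_ratio=0.15):
    """Slide a window of max_tokens over one section, overlapping consecutive chunks."""
    tokens = section.split()  # whitespace words stand in for tokenizer tokens
    overlap = int(max_tokens * overlap_ratio)  # 76 tokens for 512 at 15%
    step = max_tokens - overlap
    chunks = []
    for start in range(0, len(tokens), step):
        chunks.append(" ".join(tokens[start:start + max_tokens]))
        if start + max_tokens >= len(tokens):
            break  # the window already reached the end of the section
    return chunks

def chunk_document(text, max_tokens=512, overlap_ratio=0.15):
    """Chunk each heading section independently so no chunk crosses a heading."""
    return [
        chunk
        for section in split_sections(text)
        for chunk in chunk_section(section, max_tokens, overlap_ratio)
    ]
```

Because each section is chunked on its own, overlap is only shared between chunks under the same heading, which is one plausible reading of "respecting heading boundaries".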
"Hierarchical parent-child chunking is where you go next, not where you start. Index 512-token children for precise retrieval. Return 2,048-token parents to the LLM for context."
— P3Fusion Engineering, InsightBot Chunking Architecture
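The parent-child pattern in the quote can be illustrated with a small sketch: 512-token children are what gets searched, and the 2,048-token parent containing the best-matching child is what gets returned. The code below is an assumption-laden toy, not InsightBot's code: token lists stand in for tokenizer output, a lexical term-overlap score stands in for embedding similarity, and both function names are hypothetical.

```python
def build_parent_child_index(tokens, parent_size=2048, child_size=512):
    """Cut tokens into parents, then cut each parent into children tagged with its id."""
    parents, children = [], []
    for p_start in range(0, len(tokens), parent_size):
        parent_tokens = tokens[p_start:p_start + parent_size]
        parent_id = len(parents)
        parents.append(" ".join(parent_tokens))
        for c_start in range(0, len(parent_tokens), child_size):
            child_text = " ".join(parent_tokens[c_start:c_start + child_size])
            children.append((parent_id, child_text))
    return parents, children

def retrieve_parents(query, parents, children, top_k=1):
    """Rank children against the query, then return their de-duplicated parents."""
    query_terms = set(query.lower().split())

    def score(child):
        # Toy lexical overlap; a real system would compare embeddings here.
        return len(query_terms & set(child[1].lower().split()))

    ranked = sorted(children, key=score, reverse=True)
    seen, results = set(), []
    for parent_id, _ in ranked:
        if len(results) >= top_k:
            break
        if parent_id not in seen:
            seen.add(parent_id)
            results.append(parents[parent_id])
    return results
```

The key design point is that matching happens on the small unit while generation sees the large one, so several hits inside the same parent collapse into a single context block.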
"Reranking is the upgrade we recommend to every client after their first deployment stabilises. It changes selection quality without changing indexing infrastructure."
— P3Fusion Engineering, InsightBot Production Optimisation
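To make concrete why reranking leaves indexing untouched, here is a hedged sketch of the second stage only: it takes whatever candidates the existing retriever returned and reorders them with a separate scoring function. The `rerank` and `term_overlap_score` names are hypothetical, and the term-overlap scorer is a placeholder; in practice a cross-encoder or hosted reranking model would typically fill that role.

```python
from typing import Callable, List

def rerank(
    query: str,
    candidates: List[str],
    score_fn: Callable[[str, str], float],
    top_n: int = 3,
) -> List[str]:
    """Re-score first-stage candidates and keep the top_n highest-scoring passages."""
    scored = [(score_fn(query, text), position, text)
              for position, text in enumerate(candidates)]
    # Sort by score descending; ties keep the first-stage order.
    scored.sort(key=lambda item: (-item[0], item[1]))
    return [text for _, _, text in scored[:top_n]]

def term_overlap_score(query: str, passage: str) -> float:
    """Placeholder scorer: fraction of query terms that appear in the passage."""
    query_terms = set(query.lower().split())
    passage_terms = set(passage.lower().split())
    if not query_terms:
        return 0.0
    return len(query_terms & passage_terms) / len(query_terms)
```

Since `rerank` only consumes the retriever's output, swapping in a stronger `score_fn` changes which passages reach the LLM without re-embedding or re-indexing anything.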
AWS Generative AI Competency Partner. P3Fusion builds enterprise RAG systems — InsightBot for unstructured documents, FusionReport for structured databases, and custom RAG for any corpus. Every deployment uses format-aware ingestion and content-type-specific indexing from day one.