RAG

7 articles in this category

AI News, AI Infrastructure, RAG

Alibaba's VimRAG: Optimizing Multimodal RAG with Memory Graphs and Token Budgeting

Alibaba’s VimRAG framework raises its multimodal retrieval score to 50.1 on Qwen3-VL-8B-Instruct by using a dynamic directed acyclic memory graph.

AI News, Artificial Intelligence, RAG

RAG vs. Context Stuffing: Benchmarking Efficiency and Reliability in Large Context Windows

Benchmarks show that RAG uses 2.7x fewer input tokens and nearly halves latency compared with brute-force context stuffing on the same query.

AI News, RAG, Agentic AI

VectifyAI Launches Mafin 2.5 and PageIndex: Achieving 98.7% Financial RAG Accuracy

VectifyAI has launched Mafin 2.5 and the open-source PageIndex framework, reporting a record-breaking 98.7% accuracy on FinanceBench. PageIndex replaces traditional vector similarity with hierarchical tree indexing to avoid losing structural context in complex financial documents such as SEC filings and balance sheets.

AI News, RAG, Startups

Building a RAG-Based AI Platform

Sardor Albakiev launched Swipies AI, a RAG platform intended to help individuals and businesses make more efficient use of their data.

AI News, RAG, Software Engineering

Code-Aware RAG Tool for Developers Seeks Feedback

A new RAG tool uses AST-based code chunking and dependency graph expansion for more relevant code retrieval.

AI News, RAG, Architecture

Bidirectional Data Flow Architecture for AI Agents with MongoDB Atlas

MongoDB Atlas enables bidirectional data flow for AI agents, achieving 90% intent accuracy in rental searches.

AI News, RAG, AI Optimization

How to Reduce Cost and Latency of Your RAG Application Using Semantic LLM Caching

Semantic LLM caching cuts RAG API costs by reusing responses for similar queries, saving up to 80% on repeated requests.
