RAG

7 articles in this category

AI News, AI Infrastructure, RAG

Alibaba's VimRAG: Optimizing Multimodal RAG with Memory Graphs and Token Budgeting

Alibaba's VimRAG framework raises multimodal retrieval performance to a score of 50.1 on Qwen3-VL-8B-Instruct by using a dynamic directed acyclic memory graph.

Apr 10, 2026

AI News, Artificial Intelligence, RAG

RAG vs. Context Stuffing: Benchmarking Efficiency and Reliability in Large Context Windows

Benchmarking shows RAG uses 2.7x fewer input tokens and cuts latency nearly in half compared to brute-force context stuffing for the same query.

Feb 24, 2026

AI News, RAG, Agentic AI

VectifyAI Launches Mafin 2.5 and PageIndex: Achieving 98.7% Financial RAG Accuracy

VectifyAI has launched Mafin 2.5 and the open-source PageIndex framework, achieving a record-breaking 98.7% accuracy on FinanceBench. By replacing traditional vector similarity with hierarchical tree indexing, PageIndex solves the problem of structural context loss in complex financial documents like SEC filings and balance sheets.

Feb 22, 2026

AI News, RAG, Startups

Building a RAG-Based AI Platform

Sardor Albakiev launched Swipies AI, a RAG platform that aims to help individuals and businesses make better use of their data.

Jan 8, 2026

AI News, RAG, Software Engineering

Code-Aware RAG Tool for Developers Seeks Feedback

A new RAG tool uses AST-based code chunking and dependency graph expansion for more relevant code retrieval.

Dec 13, 2025

AI News, RAG, Architecture

Bidirectional Data Flow Architecture for AI Agents with MongoDB Atlas

MongoDB Atlas enables bidirectional data flow for AI agents, achieving 90% intent accuracy in rental searches.

Dec 2, 2025

AI News, RAG, AI Optimization

How to Reduce Cost and Latency of Your RAG Application Using Semantic LLM Caching

Semantic LLM caching cuts RAG API costs by reusing responses for similar queries, saving up to 80% on repeated requests.

Nov 11, 2025