What did the ablation study reveal about Agentic RAG complexity on local 7B models?

The study using Qwen2.5-7B-Instruct on perturbed HotpotQA showed the full pipeline achieved 53.2% EM and 61.6% F1, significantly beating single-pass baselines, but deeper loops offered no substantive benefit beyond two iterations.

Why does fixed hybrid retrieval outperform adaptive routing?

Experiments found fixed hybrid retrieval via reciprocal rank fusion outperformed rule-based routing by 1.8 EM and 1.9 F1 points; adaptive routing suffered false triggers from named entities, degrading performance.

What should developers prioritize for efficient local RAG systems?

Under fixed compute budgets, simplified fixed designs prove more competitive than complex adaptive variants. Core gains stem from moderate retrieval loops, so developers should prioritize retrieval robustness over intricate control logic.

拆解智能體RAG：基於本地7B模型的多跳問答元件消融研究

本文針對資源受限環境下智能體檢索增強生成（Agentic RAG）系統的複雜性提出質疑，透過嚴格的消融實驗揭示了各元件的真實貢獻。研究基於Qwen2.5-7B-Instruct本地模型，在HotpotQA干擾開發集上進行了全面評估。實驗表明，完整智能體管線在精確匹配（EM）和F1分數上均顯著優於單遍檢索基線。關鍵發現包括：固定混合檢索透過倒數秩融合策略優於基於規則的自適應路由，後者易因命名實體誤觸發；兩遍檢索迭代即可捕捉五遍迭代95%的收益，深層循環並無實質幫助；查詢分解與交叉編碼器重排序雖具統計顯著性，但增益相對較小。研究證明，在固定本地模型預算下，簡化且固定的設計往往比複雜的自適應版本更具競爭力，核心收益源於適度的檢索循環而非過度複雜的控制邏輯。

Sources

arXiv