深入RAG架构:分块策略、混合检索与重排序的工程实现
Get 20 candidates(cheap,fast) candidates=self.hybrid.search(query,k=20) Rerank with cross-encoder(expensive,accurate) pairs=[(query,doc)for doc in candidates] scores=self.reranker.predict(pairs) Return top k ...