The original post: /r/localllama by /u/Sarcinismo on 2025-02-10 10:33:39.
Hi All,
Curious to hear if you worked on RAG use cases with 20+ million documents and how you handled such scale from latency, embedding and indexing perspectives.
You must log in or register to comment.