Discussion about this post

User's avatar
Neural Foundry's avatar

Spot on about the 10k doc threshold, this is where most enterprise pilots quietly die. The hierarchical retrieval sugestion is right but the graph approach is probly where things go long-term. I worked on a legal discovery tool in 2021 that hit semantic collapse around 8k case files, we ended up bolting on manual metadata tagging which defeats the whole purpose. The 87% precision drop stat is wild but tracks with what I saw.

Expand full comment

No posts

Ready for more?