What we learned indexing 10,000+ past papers
A short look at normalizing board names, session codes, and duplicate detection so search stays fast as the library grows.
Scaling to 10,000+ papers required stronger normalization and duplicate handling.
These indexing updates keep query speed consistent as archive size grows.
