The National Digital Library of India is a 24×7 virtual knowledge repository of millions of books and publications. The contents are sourced from Educational boards, institutions, publishers, online video tutorials etc. and hence are formatted differently. Library Scientists needed to capture the data and metadata in a uniform format to enable proper searching among the millions of available resources. Given the volume of data, the semi-manual effort was not scalable and error-prone.
Alumnus automated the entire Data Acquisition process. This included Rule Engines to facilitate mapping, curation, content stitching and post-processing.