We collect, process, standardize and enhance over one million individual files in different formats, layouts and languages to produce a centralized digital database of registered voters. The majority of the content is derived from the India Electoral Rolls with additions and crosschecks from India Post address data.
Voter registration is compulsory in India, which means that every citizen 18 years of age or older must register to vote. We collect voter data from the Electoral Authority in each of India’s 29 States and 7 Union Territories.
To increase the value of our database, we aggregate data between consecutive years, which allows tracking of more than 1 billion Indian citizens through address changes, name changes and other updates to voter information.
Data in the India Voter Registration Roll consists of all registered voters 18 years of age or older from all States and Union Territories. Each record contains the government-issued ID number (EPIC number), name, relation’s name, address, age, gender, polling station name, postal code, and other administrative division fields, as well as geographic coordinates.
The EPIC number, or Electoral Photo Identity Card number, is a unique voter ID issued to each registered voter. EPIC numbers stay the same even when a voter changes location and re-registers in a new Constituency, which means that voter information is trackable over time and location.
During the 2014 election, there were 834,101,497 registered voters, including 437,051,538 males and 397,049,941 females. Because we aggregate the data between data releases, a complete history of each voter is provided, including past addresses and any name changes.
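Because the EPIC number stays constant across releases, a per-voter history can be assembled by grouping yearly records on that key. The following is a minimal sketch of the idea, not our production pipeline: the record layout (`epic`, `name`, `address`), the function name `build_history`, and the sample EPIC value are assumptions made for illustration.

```python
from collections import defaultdict

def build_history(releases):
    """Group yearly records by EPIC number and keep one entry per change.

    `releases` is an iterable of (year, records) pairs, where each record is
    a dict with hypothetical keys "epic", "name" and "address".
    """
    history = defaultdict(list)
    # Process releases in chronological order so history reads oldest first.
    for year, records in sorted(releases, key=lambda pair: pair[0]):
        for rec in records:
            snapshot = (rec["name"], rec["address"])
            entries = history[rec["epic"]]
            # Record a new entry only when name or address actually changed.
            if not entries or entries[-1][1:] != snapshot:
                entries.append((year, *snapshot))
    return dict(history)

# Made-up sample data; the EPIC value and addresses are not real records.
sample = [
    (2015, [{"epic": "ABC1234567", "name": "A. Kumar", "address": "New address"}]),
    (2014, [{"epic": "ABC1234567", "name": "A. Kumar", "address": "Old address"}]),
    (2016, [{"epic": "ABC1234567", "name": "A. Kumar", "address": "New address"}]),
]
voter_history = build_history(sample)
```

In this sketch the unchanged 2016 record adds nothing, so the voter's history holds two entries: the 2014 address followed by the 2015 address.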
We compile data from Final or Draft Electoral Rolls as published by the corresponding Electoral Authority. Full addresses and postal codes are standardized and enhanced with additional government data from India Post. By combining official voter data with India Post data, we ensure accuracy and timeliness superior to that obtainable from the individual sources.
Raw data from India comes in an immense variety of layouts and in multiple languages. We parse, cleanse and normalize data fields for easy consumption. For example, we separate the name of the voter’s relative and the relationship type into mother’s, father’s or husband’s name fields. We also derive an approximate year of birth from the voter’s age.
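The two normalization steps mentioned above can be sketched as follows. This is an illustration under stated assumptions rather than our actual parser: the single-letter relation codes (`F`, `M`, `H`), the output field names, and the use of the roll's publication year as the reference year are all hypothetical.

```python
from typing import Dict, Optional

# Hypothetical relation codes; real rolls vary by state and language.
RELATION_FIELDS = {"F": "father_name", "M": "mother_name", "H": "husband_name"}

def split_relation(relation_type: str, relation_name: str) -> Dict[str, Optional[str]]:
    """Place the relative's name into the field matching the relation type."""
    fields: Dict[str, Optional[str]] = {
        "father_name": None,
        "mother_name": None,
        "husband_name": None,
    }
    key = RELATION_FIELDS.get(relation_type.strip().upper())
    if key is not None:
        fields[key] = relation_name.strip()
    return fields

def approx_birth_year(age: int, roll_year: int) -> int:
    """Approximate year of birth; may be off by one depending on the birthday."""
    return roll_year - age

relation = split_relation("h", "  Ram Kumar ")
birth_year = approx_birth_year(30, 2014)
```

The result is only approximate because the roll records an age in whole years, not a date of birth.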
In addition to standardizing raw source data, we significantly enhance the voter roll by adding various data elements based on India Post. We add administrative divisions, such as Taluk, Circle, and District name, to the datasets. We also add geographic coordinates to each record based on India PIN (Postal Index Number) Code data. The PIN Code is a six-digit code that India Post uses to number post offices.
The data is available in the language prevalent in a particular State or Territory. There are 14 languages in which the data is published, with the majority being in Hindi and English. If the data is available in multiple languages, we make the data available in all of them.
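A PIN-based coordinate lookup of the kind described above might look like the sketch below. The function names, the lookup table, and the coordinate values are assumptions for illustration only; the check for a non-zero first digit reflects the PIN zone numbering (1 to 9) as we understand it.

```python
import re
from typing import Dict, Optional, Tuple

# Six digits, first digit 1-9 (assumed to be the postal zone).
PIN_RE = re.compile(r"[1-9][0-9]{5}")

def normalize_pin(raw: str) -> Optional[str]:
    """Strip whitespace and return the PIN if it is well formed, else None."""
    digits = re.sub(r"\s+", "", raw)
    return digits if PIN_RE.fullmatch(digits) else None

# Hypothetical lookup table; the coordinates are illustrative values only.
PIN_CENTROIDS: Dict[str, Tuple[float, float]] = {"110001": (28.63, 77.22)}

def coordinates_for(
    raw_pin: str, centroids: Dict[str, Tuple[float, float]] = PIN_CENTROIDS
) -> Optional[Tuple[float, float]]:
    """Return (latitude, longitude) for a PIN, or None if invalid or unknown."""
    pin = normalize_pin(raw_pin)
    if pin is None:
        return None
    return centroids.get(pin)
```

Returning `None` for both malformed and unknown PINs keeps the sketch short; a real pipeline would likely distinguish the two cases.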
To ensure the most accurate data rendition possible, we apply custom-built machine transliteration algorithms that draw on our extensive background in linguistics and the expertise of in-country language specialists who are native speakers. We also convert all non-English source data into Roman characters by applying our custom linguistic rules and language exceptions.
The data is typically updated on an annual schedule as it becomes available from the State and Union Territory Electoral Authorities. We constantly monitor the data release schedule and acquire, process, and make the updated data available soon thereafter. Data updates are available as either a full database replacement or a delta update containing changes and additions only. An automated process notifies you when the newest version of the database is available for download.
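On the consumer side, applying a delta update could be as simple as an upsert keyed on the EPIC number, since a delta carries changes and additions only. The sketch below assumes records are dicts with a hypothetical `epic` key and that the local copy is held in memory; the delivery format of the actual delta files is not shown here.

```python
from typing import Dict, Iterable

Record = Dict[str, str]

def apply_delta(current: Dict[str, Record], delta: Iterable[Record]) -> Dict[str, Record]:
    """Return a new database with changed records replaced and new ones added.

    A delta contains changes and additions only, so nothing is ever removed.
    """
    updated = dict(current)  # shallow copy so the input is left untouched
    for record in delta:
        updated[record["epic"]] = record
    return updated

# Made-up sample records for illustration.
database = {"ABC1234567": {"epic": "ABC1234567", "address": "Old address"}}
delta_update = [
    {"epic": "ABC1234567", "address": "New address"},  # changed record
    {"epic": "XYZ7654321", "address": "First address"},  # newly added voter
]
refreshed = apply_delta(database, delta_update)
```

A full database replacement needs no such merge step: the new file simply supersedes the old one.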
Our database is used for many purposes. For example, a credit reporting agency uses Apriori data to verify its internal data against our cleansed and extended government-issued database of registered voters. This cross-reference helps the agency provide accurate and up-to-date consumer credit information to its clients.
Another client is a well-known international data aggregator that uses our India data to provide an API solution to its customers in e-commerce and the finance industry on a per-transaction basis for incorporation in due-diligence processes.
Information contained in the India Voter Registration Roll is public data. It is acquired from Government of India Electoral Authorities and India Post. This data is considered personal information under the IT Act and associated Policy Rules, and therefore reasonable security practices and procedures must be applied by any company that gains access to the data through Apriori LLC.