I gang

PDF to text conversion for voter rolls in India

The objective is to create a command-line tool that will convert a (structured) PDF file containing publicly available voter rolls in India, into text.

The output will be stored in a CSV sheet. There are further details in the attached instruction file.

Some points to bear in mind:

- The text is in the Devanagari character set (i.e. in the Hindi language)

- The voter rolls are arranged in a grid (3 columns and n rows) - see attached PDF

- There are known issues with fidelity of information during a simple copy-paste from PDF to text

- The tool is expected to be run on a Linux system and take two command-line parameters: the path + file name of the source PDF and the path + file name of the output CSV file

Færdigheder: Java, Linux, PDF, Perl, Python

Se mere: source linux information, india java, java objective, voter, source india, linux pdf, copy pdf, path pdf, java create pdf, command line java linux, perl pdf csv file, perl convert pdf, linux convert pdf text, pdf java create, java details hindi, convert pdf line, csv text file, linux convert java, simple java file output, csv pdf, hindi devanagari, pdf convert line, hindi text pdf, pdf text grid, hindi pdf text

Om arbejdsgiveren:
( 8 bedømmelser ) Mumbai, India

Projekt-ID: #4084852

Tildelt til:


Hi, i have significant experience working with automated pdf extraction - will be able to deliver quality code in 2 milestones, the first one will demonstrate the data extraction with accuracy, the second will be to Mere

$300 USD in 7 dage
(4 bedømmelser)

3 freelancere byder i gennemsnit $283 for dette job


Beat work guaranteed. Please check PM.

$250 USD in 4 dage
(0 bedømmelser)

Hi i have extensive experience in python .I can do this within a week.

$300 USD in 7 dage
(0 bedømmelser)