In progress

PDF to text conversion for voter rolls in India

The objective is to create a command-line tool that converts a structured PDF file containing publicly available Indian voter rolls into text.

The output will be stored in a CSV file. Further details are in the attached instruction file.

Some points to bear in mind:

- The text is in the Devanagari script (the language is Hindi)

- The voter rolls are arranged in a grid (3 columns and n rows); see the attached PDF

- A simple copy-paste from PDF to text is known to lose fidelity

- The tool is expected to be run on a Linux system and take two command-line parameters: the path + file name of the source PDF and the path + file name of the output CSV file
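
The points above could be sketched in Python roughly as follows. This is only an illustration under stated assumptions, not the awarded solution: the function names, the `(x, y, text)` fragment shape, `page_width`, and `row_tolerance` are all hypothetical, the PDF extraction step is deliberately left as a placeholder, and the CSV layout (one grid row per CSV row) is a guess, since the attached instruction file is not reproduced here. Unicode NFC normalisation is included as one small, partial precaution for Devanagari text; it does not by itself solve the copy-paste fidelity issues.

```python
import argparse
import csv
import unicodedata


def parse_args(argv=None):
    """Parse the two positional parameters named in the brief."""
    parser = argparse.ArgumentParser(
        description="Convert a voter-roll PDF to CSV (sketch)."
    )
    parser.add_argument("source_pdf", help="path + file name of the source PDF")
    parser.add_argument("output_csv", help="path + file name of the output CSV")
    return parser.parse_args(argv)


def cells_in_reading_order(fragments, page_width, columns=3, row_tolerance=5.0):
    """Group (x, y, text) fragments into grid rows, ordered left to right.

    Assumption: each fragment is the top-left corner of one voter box plus
    its text, with y increasing down the page.
    """
    column_width = page_width / columns
    rows = []  # list of (row_y, {column_index: text})
    for x, y, text in sorted(fragments, key=lambda f: f[1]):
        col = min(int(x // column_width), columns - 1)
        # NFC puts combining marks into a canonical composed form.
        text = unicodedata.normalize("NFC", text)
        if rows and abs(rows[-1][0] - y) <= row_tolerance:
            rows[-1][1][col] = text  # same grid row as the previous fragment
        else:
            rows.append((y, {col: text}))  # start a new grid row
    return [[cells.get(c, "") for c in range(columns)] for _, cells in rows]


def write_csv(rows, output_path):
    """Write rows as UTF-8 so Devanagari text is preserved."""
    with open(output_path, "w", encoding="utf-8", newline="") as handle:
        csv.writer(handle).writerows(rows)


def extract_fragments(source_pdf):
    """Placeholder: a real tool would call a PDF library here."""
    raise NotImplementedError("PDF extraction is outside this sketch")


def main(argv=None):
    args = parse_args(argv)
    fragments, page_width = extract_fragments(args.source_pdf)
    write_csv(cells_in_reading_order(fragments, page_width), args.output_csv)
```

On Linux this would be saved as a script (the name `voter_roll_to_csv.py` is hypothetical), given the usual `if __name__ == "__main__": main()` guard, and run as `python3 voter_roll_to_csv.py input.pdf output.csv`. Grouping by coordinates rather than by extraction order is a design choice aimed at the 3-column grid, where text from neighbouring boxes can otherwise come out interleaved.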

Skills: Java, Linux, PDF, Perl, Python


About the employer:
(8 reviews) Mumbai, India

Project ID: #4084852

Awarded to:

mkoteshwar

Hi, I have significant experience working with automated PDF extraction. I will be able to deliver quality code in 2 milestones: the first will demonstrate the data extraction with accuracy, the second will be to ...

$300 USD in 7 days
(4 reviews)
2.8

3 freelancers are bidding an average of $283 for this job

nitinrajpal86

Best work guaranteed. Please check PM.

$250 USD in 4 days
(0 reviews)
0.0
rohitdwivedi90

Hi, I have extensive experience in Python. I can do this within a week.

$300 USD in 7 days
(0 reviews)
0.0