I need to create a python script to extract text from pdf file. These pdf have dates and some comments, which are underlined. I need to extract date and count underlined comments from pdf in a file.
1.) I need to extract file date i.e January 30, 2014 in the attached file.
2.) I need to count the underlined headers such as attached file has 2 headers.
3.) I need to count the total number of comments. attached file has 3 comments. so output will be 3. There might be other numbers present in the file text such as 1., 3. 55., so use logic to count numbers from the bullet not any other number in the similar format.
23 freelancers are bidding on average €38 for this job
Dear, Extract data from PDF file is one my work and I sure I can do this job. I use Python/PHP/JAVA for this task. Let us start this project. Regards, Njaka http://www.freelancer.com/u/a6jack.html