
Closed
Posted
Paid on delivery
I have a folder full of supplier bills in PDF format and I need a clean, repeatable Python script that pulls everything of value out of them and drops it neatly into an Excel workbook. Here is what I expect:
• The script must capture every text field that appears on each bill (invoice number, dates, vendor, totals and any other descriptors).
• It should identify and export any tabular line-item sections so that quantities, descriptions and prices land in true Excel rows and columns, not as a single block of text.
• Embedded images or logos also need to be saved out (ideally into a sub-folder) with a reference back to the originating invoice inside the Excel sheet.
Python tools such as pdfplumber, PyPDF2, camelot, tabula-py, pandas and openpyxl are all fine; choose the combination you’re most comfortable with as long as the final deliverable is a .py file plus an example .xlsx that mirrors the source PDFs accurately.
Acceptance will be based on:
1. Running the script locally on a sample batch of PDFs with no manual tweaks.
2. Seeing all text and table content laid out cleanly in Excel.
3. Having each extracted image saved separately and indexed in the sheet.
If additional libraries are required, let me know the pip install commands so I can replicate the environment quickly.
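A minimal sketch of how the text-field and line-item parts of this brief might be approached, assuming pdfplumber for reading and openpyxl for writing (pip install pdfplumber openpyxl). The label patterns in FIELD_PATTERNS, the sheet names and every function name here are hypothetical illustrations, not part of the posting; real supplier layouts would need their own patterns. Image export is deliberately left out of this sketch.

```python
import re
from pathlib import Path

# Hypothetical label patterns (assumption): adjust to the wording on real bills.
FIELD_PATTERNS = {
    "invoice_number": re.compile(
        r"Invoice\s*(?:Number|No\.?|#)\s*[:\-]?\s*(\S+)", re.I),
    "invoice_date": re.compile(
        r"\bDate\b\s*[:\-]?\s*(\d{1,2}[/\-.]\d{1,2}[/\-.]\d{2,4})", re.I),
    "total": re.compile(
        r"\bTotal\b\s*[:\-]?\s*(?:[^\d\s]{1,3}\s*)?([\d,]+(?:\.\d{1,2})?)", re.I),
}


def parse_fields(text):
    """Return the first match for each labelled field, or None if absent."""
    fields = {}
    for name, pattern in FIELD_PATTERNS.items():
        match = pattern.search(text)
        fields[name] = match.group(1) if match else None
    return fields


def clean_cell(cell):
    """Empty cells may arrive as None and cells may contain line breaks."""
    return "" if cell is None else " ".join(str(cell).split())


def table_to_rows(table, source):
    """Treat the first row as the header; tag each line item with its PDF."""
    if not table:
        return []
    header = [clean_cell(c) for c in table[0]]
    rows = []
    for raw in table[1:]:
        cells = [clean_cell(c) for c in raw]
        if not any(cells):  # skip fully empty rows
            continue
        rows.append({"source_pdf": source, **dict(zip(header, cells))})
    return rows


def extract_invoice(pdf_path):
    """Read one PDF and return (header fields, line-item rows)."""
    import pdfplumber  # third-party, imported lazily

    pdf_path = Path(pdf_path)
    text_parts, line_items = [], []
    with pdfplumber.open(pdf_path) as pdf:
        for page in pdf.pages:
            text_parts.append(page.extract_text() or "")
            for table in page.extract_tables():
                line_items.extend(table_to_rows(table, pdf_path.name))
    fields = parse_fields("\n".join(text_parts))
    fields["source_pdf"] = pdf_path.name
    return fields, line_items


def write_workbook(invoices, line_items, out_path):
    """Write one sheet of header fields and one sheet of line items."""
    from openpyxl import Workbook  # third-party, imported lazily

    wb = Workbook()
    ws = wb.active
    ws.title = "Invoices"
    columns = ["source_pdf", *FIELD_PATTERNS]
    ws.append(columns)
    for inv in invoices:
        ws.append([inv.get(c) for c in columns])

    items = wb.create_sheet("LineItems")
    item_cols = []
    for row in line_items:  # union of headers, in first-seen order
        for key in row:
            if key not in item_cols:
                item_cols.append(key)
    items.append(item_cols)
    for row in line_items:
        items.append([row.get(c, "") for c in item_cols])
    wb.save(out_path)
```

A batch run could loop over Path("bills").glob("*.pdf"), call extract_invoice on each file, collect the results and pass them to write_workbook; the folder name is again only an example. Taking the first table row as the header is a simplification that will not hold for every invoice layout.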
Project ID: 40213885
44 proposals
Remote project
Active 24 days ago
44 freelancers are bidding an average of ₹971 INR on this job

Hello, I have several years of experience with Python programming and automated processing of PDF files. I can prepare a script to gather data from the PDFs and generate Excel worksheets based on that data. Could you share a few PDF files to review? Thanks.
₹1,008 INR in 1 day
7.1

Hi, I noticed that you are looking for a skilled developer with experience in PDF data extraction. I can get it done, as I have worked on similar projects that involved extracting data from PDFs and building question papers from that data using AI. With that experience, I'm sure I'll be able to complete this project in a short amount of time. Let's talk more in DM.
₹1,500 INR in 2 days
3.5

Hi, I can build a clean, repeatable Python script that extracts all usable data from supplier bill PDFs and exports it into a well-structured Excel workbook, ready for analysis or downstream processing.
What the script will do
=> Read a folder of PDF bills in one run (no manual tweaks)
=> Extract all text fields (invoice number, dates, vendor, totals, descriptors, etc.)
=> Detect and export tabular line items into proper Excel rows and columns
=> Extract embedded images/logos and save them to a sub-folder
=> Reference each extracted image back to its source invoice inside Excel
Technical Approach
=> Python using pdfplumber / Camelot / tabula-py (tables), PyPDF2 (text), pandas + openpyxl for Excel output
=> Robust handling for multi-page invoices and varying layouts
=> Clear logging so runs are traceable and repeatable
Deliverables
=> Fully commented .py script
=> Example .xlsx generated from your sample PDFs
=> Images saved in an organised folder with invoice references
=> List of required libraries with exact pip install commands
Acceptance-ready
=> Script runs locally on a batch of PDFs without intervention
=> Text and tables appear cleanly structured in Excel
=> All images extracted and indexed correctly
Happy to review a sample invoice first to lock in extraction rules and ensure accuracy.
Pavan Kumar A
₹1,200 INR in 3 days
3.2

Dear hiring manager, I have gone through your project and understand your work. I can do this as I have previous experience with the same kind of work. Looking forward to hearing back from you. Thank you.
₹800 INR in 1 day
1.6

I can build a clean, reusable Python script that processes a folder of supplier bill PDFs and extracts all text fields, tabular line items, and embedded images into a well-structured Excel workbook. The solution will use proven libraries such as pdfplumber, camelot/tabula-py, pandas, and openpyxl to ensure text is captured accurately, tables are converted into true Excel rows and columns, and images/logos are saved into a subfolder with clear references back to each invoice. The script will run locally without manual tweaks, include clear setup instructions and pip install commands, and be easy to rerun or extend as new PDFs are added.
₹1,050 INR in 7 days
0.6

Hello, I can write a clean, reusable Python script to extract all data from your supplier PDF bills into Excel. The script will capture all text fields, properly extract line-item tables into rows and columns, and save embedded images/logos into a separate folder with references back to the invoice in Excel. I’m comfortable using pdfplumber, camelot/tabula-py, pandas, and openpyxl, and I’ll provide:
- The complete .py script
- A sample output .xlsx file
- Clear pip install commands for all required libraries
The script will run locally without manual changes and mirror the source PDFs accurately. Ready to start as soon as you share sample files.
Regards, Vishnu Gurjar
₹600 INR in 1 day
0.0

I am a professional Data Entry specialist. I am an expert in various types of data copy-paste work. I have carefully reviewed the job you posted. I am very comfortable with this kind of work and I have good experience in it. I will complete your task manually and within a very short time. I consider myself qualified for this job. If you think I am suitable for this work, please feel free to message/knock me. I am ready to work with you.
₹600 INR in 2 days
0.0

I am a dedicated and detail-oriented freelancer with experience in translation and content accuracy. I specialize in English–Hindi and Hindi–English translation, ensuring clear meaning, correct grammar, and natural flow. I always deliver work on time and follow client instructions carefully. Quality and client satisfaction are my top priorities. I am comfortable working on documents, articles, subtitles, and basic technical or general content. I communicate clearly and am open to revisions until the client is fully satisfied. I am eager to build long-term professional relationships through reliable and high-quality work.
₹1,000 INR in 7 days
0.0

As an experienced and versatile freelancer, I am the perfect fit for your project! My skills in extracting data from PDFs, combined with my expertise in Python libraries such as pdfplumber, PyPDF2, camelot, tabula-py, pandas and openpyxl, align perfectly with your needs. Over the years, I have honed my skills in developing robust and repeatable scripts that can tackle large batches of data conversion tasks like yours with no manual tweaks. With me onboard, you can expect a script that captures every text field on the bills, including specific descriptors like invoice numbers, dates, vendor details and invoice totals. I will also ensure seamless export of any tabular line-item sections into an organized Excel format. Having worked extensively with embedded images and logos in my career, I will not only save them out accurately but also provide references back to their respective invoices within the Excel workbook. I am committed to delivering high-quality solutions within deadlines, without any compromise. Having successfully executed various complex coding tasks over the years, especially those involving advanced data extraction, adopting further libraries or technologies needed for your project will pose no challenge. Let's collaborate and ensure the result is a Python script that transforms your supplier bills into a neat Excel workbook, along with an example .xlsx file demonstrating its accuracy.
₹1,050 INR in 1 day
0.0

Hello, I can accurately retype your printed documents into a clean Microsoft Word file with exact formatting, punctuation, and paragraph structure. I have experience in:
• Printed text to Word typing
• Verbatim retyping with high accuracy
• Proper Word styles (Heading, body text, bullet points)
• Proofreading and clean layout
I will make sure every word and line matches your source pages and deliver a properly formatted DOCX file. I am available to start immediately. Please share sample pages if possible. Thank you.
₹1,050 INR in 3 days
0.0

I can deliver exactly what you’re looking for: a clean, repeatable Python solution that extracts all valuable data from supplier PDF bills and converts it into a well-structured Excel workbook with zero manual intervention.
What I’ll handle:
• Full extraction of all textual fields (invoice numbers, dates, vendor details, totals, tax info, and any other descriptors).
• Accurate detection and export of tabular line items so quantities, descriptions, rates, and amounts land in proper Excel rows and columns.
• Extraction of embedded images/logos into a separate folder, with clear references back to the source invoice inside Excel.
• A final .py script and a sample .xlsx that mirrors the PDFs cleanly and accurately.
• Clear pip install commands so you can replicate the environment instantly.
I’ll use the most reliable combination of tools like pdfplumber, Camelot/Tabula, pandas, and openpyxl, choosing what best fits the structure of your PDFs to ensure consistent results across batches.
Why I’m a good fit: I’ve already implemented this same type of PDF data extraction and structuring logic in an Android application, so I’m very familiar with handling varied invoice layouts, edge cases, and automation workflows. Translating this into a Python-based batch solution will be straightforward for me. You’ll be able to run the script locally on any folder of PDFs and get clean, analysis-ready Excel output every time. Happy to get started right away.
₹1,050 INR in 7 days
0.0

a gig by clearly explaining their relevant skills, showing proper understanding of the client’s requirements, maintaining honest communication, offering a realistic delivery time, and demonstrating commitment to completing the work with quality and responsibility.
₹1,050 INR in 7 days
0.0

Hello, I can build a clean, fully repeatable Python solution that extracts all valuable data from supplier bill PDFs and exports everything into a well-structured Excel workbook, exactly as you described.
₹600 INR in 7 days
0.0

I can build a clean, repeatable Python script that automatically extracts all usable data from your supplier invoice PDFs and delivers it in a well-structured Excel workbook, exactly as you specified.
What you’ll get
• Complete text extraction from every invoice (invoice number, dates, vendor details, totals, descriptors; nothing skipped)
• True tabular extraction of line items (quantities, descriptions, prices in proper rows & columns, not raw text)
• All embedded images/logos extracted into a dedicated folder
• Clear indexing in Excel linking each image back to its source invoice
• One-click execution on a folder of PDFs, with no manual cleanup
Technical approach
• Python-based pipeline using pdfplumber + camelot + pandas + openpyxl
• Designed to handle batch processing reliably
• Clean .py script + example .xlsx output that mirrors the PDFs
• Full list of pip install commands provided for fast environment setup
₹650 INR in 3 days
0.2

Hello, I am an experienced Python developer specializing in PDF data extraction and Excel automation. I can create a clean, repeatable script that extracts all text fields, tabular line items, and embedded images from your supplier bills and organizes them accurately in Excel. Each image will be saved in a subfolder with a reference to its originating invoice, and all tables will be correctly formatted into rows and columns. I am proficient with pdfplumber, PyPDF2, camelot, tabula-py, pandas, and openpyxl, and I will provide a ready-to-run .py script along with a sample .xlsx file reflecting your PDFs perfectly. I will also include any necessary pip commands for replicating the environment. The script will run locally with zero manual tweaks, ensuring reliable, repeatable results. I am ready to start immediately.
₹600 INR in 7 days
0.0

I offer accurate and reliable data entry services, converting PDF files into well-organized Excel spreadsheets. I pay close attention to detail to ensure all data matches the source document. I am committed to meeting deadlines, following client instructions, and double-checking the data to minimize errors and deliver high-quality results.
₹1,050 INR in 7 days
0.0

Hello, I can build a clean, repeatable Python script that extracts all relevant data from your supplier PDF bills and exports everything neatly into an Excel workbook, ready to run locally with no manual tweaks. I’ve worked with pdfplumber, camelot/tabula-py, pandas, and openpyxl to handle both text fields and structured tables accurately.
What I’ll deliver
• Full extraction of all text fields (invoice number, dates, vendor, totals, descriptors)
• Proper parsing of line-item tables into true Excel rows/columns
• Embedded images/logos saved to a sub-folder with references indexed back in Excel
• A single, well-documented .py script + sample .xlsx output
• Clear pip install commands and README for quick setup
Quality & reliability
• Handles multi-page PDFs and common layout variations
• Robust error handling and logging
• Clean, reproducible outputs you can rerun on new batches
• Organized Excel structure (separate sheets for invoices, line items, image index)
If you can share a few sample PDFs, I’ll tailor the parsers to your layout so extraction is accurate from the first run.
Thanks
Hemangi Chhaya
₹1,050 INR in 7 days
0.0

Hi, I’ve carefully reviewed your requirements and understand that you need a reliable Python script to extract all valuable data from supplier PDF bills and export it cleanly into an Excel workbook, without any manual changes. I have experience building PDF data extraction solutions in Python, including capturing invoice details such as invoice number, dates, vendor information, totals, and other text fields. I can also extract line-item tables into proper Excel rows and columns, and save embedded images or logos into a separate folder with clear references to the original invoice inside the Excel file. I am comfortable using tools like pdfplumber, PyPDF2, camelot, tabula-py, pandas, and openpyxl, and will choose the best combination based on your PDF layout.
The final delivery will include:
• A clean, well-structured .py script
• An example .xlsx file matching the source PDFs
• All extracted images saved separately and indexed in Excel
• Clear pip install commands for easy setup
I also bring practical experience from my role as a Data Entry Operator at Muslim Hands, where I worked with supplier bills, invoices, and structured data, with a strong focus on accuracy and consistency. This real-world experience helps ensure the output is both technically correct and business-ready. I’d be happy to review a sample PDF and get started.
Best regards, Aamish
₹700 INR in 2 days
0.0

I am the best fit for this project, delivering accurate, reliable work with clear communication and on-time completion.
₹1,450 INR in 6 days
0.0

Dear Hiring Manager, I am writing to express my interest in the Data Entry and AI Integration Specialist position. I bring strong experience in accurate data entry, data validation, and supporting AI-driven workflows to improve efficiency and data quality. I am detail-oriented, comfortable working with large datasets, and experienced in collaborating with technical teams to ensure smooth integration between data systems and AI tools. I am confident my skills and reliability would make a positive contribution to your team. Thank you for considering my application. I would welcome the opportunity to discuss how I can support your organization’s goals. Sincerely, Zain Rasool Butt
₹1,050 INR in 7 days
0.0

DELHI, India
Payment method verified
Member since Jul 13, 2018