Crawl an entire website and convert to PDF or ODF

Create a web form that accepts two required values (e-mail address and website URL), two optional values that must be entered together (username and password), and an 'execute' button that starts processing.

For example, the e-mail address provided is: [Posting contact details is Prohibited by [url removed, login to view] Admin] and the URL is: http://stig.test.org.

Username: Password:

The e-mail and URL values must pass basic validation. The username and password fields should accept special characters.
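The input rules above could be checked along these lines. This is only a minimal sketch: the function name `validate_request`, the loose e-mail pattern, and the error strings are assumptions made for illustration, and "basic validation" may well mean something stricter in practice.

```python
import re
from urllib.parse import urlsplit

# Deliberately loose pattern: exactly one "@", no whitespace, a dot in the domain part.
EMAIL_RE = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")


def validate_request(email, url, username="", password=""):
    """Return a list of error messages; an empty list means the input is acceptable."""
    errors = []
    if not EMAIL_RE.match(email or ""):
        errors.append("email: invalid")
    try:
        parts = urlsplit(url or "")
        url_ok = parts.scheme in ("http", "https") and bool(parts.hostname)
    except ValueError:  # raised by urlsplit for malformed URLs such as "http://["
        url_ok = False
    if not url_ok:
        errors.append("url: invalid")
    # Username and password are optional, but must be supplied together.
    if bool(username) != bool(password):
        errors.append("credentials: incomplete")
    return errors
```

The same function could back both the web form and the API, so the two entry points reject exactly the same inputs. The username and password are not pattern-checked at all, which is what lets them carry special characters.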

After entering both values, the user clicks the 'execute' button.

Also create a web API that accepts the same four values.

Store email address

Crawl the website URL ([url removed, login to view]) with no depth limit, staying within the domain ([url removed, login to view]).
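One way to read "no depth limit within the domain" is a breadth-first walk that follows every link but only queues URLs on the starting host. The sketch below assumes exactly that interpretation (same host name, http/https only). The names `in_scope`, `crawl_order`, and the `get_links` callback are hypothetical, and the callback stands in for the real fetch-and-parse step so the traversal logic can be shown on its own.

```python
from collections import deque
from urllib.parse import urldefrag, urljoin, urlsplit


def in_scope(start_url, candidate):
    """True when candidate is http(s) and on the same host as start_url."""
    start, cand = urlsplit(start_url), urlsplit(candidate)
    return cand.scheme in ("http", "https") and cand.hostname == start.hostname


def crawl_order(start_url, get_links):
    """Breadth-first walk with no depth limit.

    get_links(url) must return the href values found on that page.
    """
    seen, queue, order = {start_url}, deque([start_url]), []
    while queue:
        page = queue.popleft()
        order.append(page)
        for href in get_links(page):
            # Resolve relative links and drop #fragments so each page is visited once.
            target = urldefrag(urljoin(page, href)).url
            if target not in seen and in_scope(start_url, target):
                seen.add(target)
                queue.append(target)
    return order
```

The `seen` set is what keeps an unlimited-depth crawl from looping forever on pages that link back to each other.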

Must also be able to enter password-protected areas using the supplied username/password credentials, whether the site prompts for them in a text box or accepts them within the URL (http://username:[url removed, login to view]).
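Because the username and password may contain special characters, they would need percent-encoding before being placed inside a URL. A minimal sketch, assuming the user:password@host URL form; the helper name `with_credentials` is made up for illustration:

```python
from urllib.parse import quote, urlsplit, urlunsplit


def with_credentials(url, username, password):
    """Return url with percent-encoded user info, e.g. http://user:p%40ss@host/."""
    parts = urlsplit(url)
    host = parts.hostname or ""
    if parts.port:
        host = f"{host}:{parts.port}"
    # safe="" also encodes ":", "@" and "/" so they cannot be mistaken for URL delimiters.
    userinfo = f"{quote(username, safe='')}:{quote(password, safe='')}"
    return urlunsplit((parts.scheme, f"{userinfo}@{host}", parts.path, parts.query, parts.fragment))
```

Credentials embedded this way end up in logs and browser history, so a real implementation might prefer to send them in an Authorization header or a login form submission instead.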

Convert HTML, images, CSS, and script output (PHP/XML) into PDF or ODF (Open Document Format). In other words, generate a 'snapshot' of what a browser would display, saved as a PDF/ODF.

Combine all these pages into a single document.

Name file <server.domain.extension>-<mm-dd-yyyy>-<24hr:min:sec>.pdf
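The naming pattern above can be produced with standard date formatting. A short sketch, assuming the first component is the host name of the crawled URL; the function name `snapshot_filename` is hypothetical:

```python
from datetime import datetime
from urllib.parse import urlsplit


def snapshot_filename(url, when, extension="pdf"):
    """Build <host>-<mm-dd-yyyy>-<HH:MM:SS>.<extension> from the crawled URL."""
    host = urlsplit(url).hostname
    if not host:
        raise ValueError(f"no host name in {url!r}")
    # %H is the 24-hour clock. Colons are legal in Linux file names but not on Windows.
    return f"{host}-{when:%m-%d-%Y}-{when:%H:%M:%S}.{extension}"
```

For example, `snapshot_filename("http://stig.test.org/index.html", datetime(2011, 2, 9, 14, 5, 30))` gives `stig.test.org-02-09-2011-14:05:30.pdf`.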

Upload the document via FTP to the supplied web server.

If work order entered through webform:

Generate retrieval URL

Send the retrieval URL to the stored e-mail address [Posting contact details is Prohibited by [url removed, login to view] Admin] originally provided in step 1, with a unique transaction number in the subject line and body.
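The notification step might be sketched as follows. The transaction number is generated here as a random UUID, which is an assumption (a database sequence would work as well). The sender address, the subject wording, and the name `build_notification` are placeholders.

```python
import uuid
from email.message import EmailMessage


def build_notification(to_addr, retrieval_url, sender="noreply@example.com"):
    """Return (transaction_id, message); the id appears in both subject and body."""
    transaction_id = uuid.uuid4().hex  # random 32-character hex string
    msg = EmailMessage()
    msg["From"] = sender  # placeholder sender address
    msg["To"] = to_addr
    msg["Subject"] = f"Website snapshot ready [transaction {transaction_id}]"
    msg.set_content(
        f"Transaction: {transaction_id}\n"
        f"Your document can be retrieved here:\n{retrieval_url}\n"
    )
    return transaction_id, msg
```

The returned message can be handed to `smtplib.SMTP(...).send_message(msg)` on whatever mail relay the deployment environment provides.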

If work order entered through API:

Return the document payload over the open HTTP connection.

In case of timeout, fall back to email delivery described above.
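The API delivery rule (payload if it is ready in time, e-mail otherwise) could look roughly like this. It is a sketch under several assumptions: the crawl-and-convert work is wrapped in a callable `job`, the timeout is a fixed number of seconds, and `send_email` is a hypothetical callback that performs the e-mail delivery once the document exists.

```python
from concurrent.futures import ThreadPoolExecutor, TimeoutError


def deliver(job, timeout_seconds, send_email):
    """Run job(); return its payload if it finishes in time, else hand off to e-mail."""
    pool = ThreadPoolExecutor(max_workers=1)
    future = pool.submit(job)
    try:
        payload = future.result(timeout=timeout_seconds)
        return "payload", payload
    except TimeoutError:
        # The job keeps running in the background; e-mail the result once it completes.
        future.add_done_callback(lambda f: send_email(f.result()))
        return "email", None
    finally:
        # Do not block the HTTP response while the worker thread finishes.
        pool.shutdown(wait=False)
```

On a timeout the caller would answer the HTTP request with a short "delivery by e-mail" notice, while the worker thread finishes the document and triggers the e-mail.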


We can provide server support, but we prefer that you develop and test in your own environment and then provide instructions and support for deploying in ours. A Linux (CentOS) implementation is preferred.


I will need 1 week to verify the completeness of the deliverable.


Please see attached example file.

Take note of source URL and timestamp at the bottom of each page.

If interested, please include an example description of your API call framework.

Skills: Apache, Linux, PHP, Software Architecture, Web Design


About the employer:
(0 reviews) San Jose, United States

Project ID: #957356

Awarded to:


I have several years' experience in Linux/Unix and web development, mainly with Python, Java, C/C++, and PHP. My preferred framework is Django (Python-based), but I learn quickly and would be willing to adapt to whateve...

$500 USD in 3 days
(0 reviews)

4 freelancers are bidding on average $663 for this job


Hello Please see PM. Regards, Chandni

$750 USD in 5 days
(12 reviews)

Hello, Please check PM.

$700 USD in 5 days
(8 reviews)

Please look PM.

$700 USD in 3 days
(0 reviews)