I need a web scraper written for the following url:
[login to view URL]
All information needed is available on the main page. The number of rows will vary.
The output should be a pipe (|) delimited file with the following column mappings:
origin_city --> data located in the "Pickup” column before the comma and two letter state abbreviation
origin_state --> data is the two letter abbreviation located in the "Pickup" column after the comma
ship_date --> will have to click on the blue + under the "Expand All" column to retreive data from "Pickup Date" column, changed to the YYYY-MM-DD format
destination_city --> data located in the "Destination" column before the comma and two letter state abbreviation
destination_state --> data is the two letter abbreviation located in the "Destination" column after the comma
receive_date --> leave blank
trailer_type --> data located in the "Equipment" column
load_size --> add the text "Full"
weight --> leave blank
length --> leave blank
width --> leave blank
height --> leave blank
trip_miles --> leave blank
pay_rate --> leave blank
contact_phone --> add the text "334-585-7075"
contact_name --> leave blank
tarp_required --> leave blank
comment --> data located in the "Loads" column, add the text "Loads=" before data
load_number --> will have to click on the blue + under the "Expand All" column to retreive data from "Load ID" column
commodity --> will have to click on the blue + under the "Expand All" column to retreive data from "Commodity" column
If there are mulitple rows once the blue + is clicked and expanded, each of those additional rows will need to be added using the same "origin_city", "origin_state", "destination_city", "destination_state", and "trailer_type" that was listed.
The first line of the output should contain all of the column headers.
Any field that contain no data should be left blank.
Please do not use words like "null" or "blank" in blank columns.
Below is a sample output of the first 5 columns using sample data:
The deliverable will be a Perl .pl file that must run on
Ubuntu Linux and must use Modern::Perl. The Perl .pl file
should be called '[login to view URL]' and the output file should be
called '[login to view URL]'
It will be scheduled in cron to run unattended every 15 minutes.
Please specific what language/OS/modules you plan to use.
Also, please include the word "raccoon" in your bid so I know that
you read this description.