Web Spider Application To Extract Relevant Email Addresses And Info

We need a talented developer to build a simple application that does the following:

1. We input a "File Name", a "Criteria Name", a "Source" and a "URL to a search engine" query, such as: [url removed, login to view]

PART A)

2. The application then crawls through all the result pages generated by this URL query. On every page that is crawled, the application extracts email addresses based on one of the following criteria:
A) any email address containing the word "owner"
B) any email address seen in each link
C) any email address with a particular domain ("[url removed, login to view]")

3. For each match, the application saves the following to a comma-delimited file:
# of Match, Criteria Name (from input in step #1), Date, Source (from input in step #1), Web Page URL, Email Address Extracted

PART B)

Part B is mainly for crawling through results that are blogs. The idea is the same, but the data being extracted is different. In this case, instead of choosing among the three criteria options, the application does the following for each link: search for "email me" within the linked page to find the email address, and follow the "about" link to reach the blog author's profile.

3. For each match, the application saves the following to a comma-delimited file (each match might require an additional crawl to the "about" page for the additional information):
# of Match, Criteria Name (from input in step #1), Date, Source (from input in step #1), Web Page URL, Email Address Extracted, First Name (If Available), Last Name (If Available), City (If Available), State (If Available), URL of Blogger Profile (If Available), Age (If Available)
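To make the Part A matching rules and output layout concrete, here is a minimal sketch in Python. It is for illustration only and is not the required implementation (the .NET preference still applies). The function names `extract_emails` and `write_matches`, the criterion keywords "owner", "any" and "domain", and the simplified email pattern are all assumptions made for this example. Fetching and crawling the result pages is left out; the sketch starts from page text that has already been downloaded.

```python
import csv
import re
from datetime import date

# Simplified pattern (an assumption); it does not cover every valid address form.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def extract_emails(page_text, criterion, domain=None):
    """Return the unique addresses in page_text that satisfy one Part A criterion.

    criterion is a hypothetical keyword: "owner" (A), "any" (B) or "domain" (C).
    """
    if criterion not in ("owner", "any", "domain"):
        raise ValueError("unknown criterion: " + criterion)
    if criterion == "domain" and not domain:
        raise ValueError("the 'domain' criterion needs a domain")

    seen = set()
    matches = []
    for email in EMAIL_RE.findall(page_text):
        key = email.lower()
        if key in seen:  # skip duplicates on the same page
            continue
        seen.add(key)
        if criterion == "owner" and "owner" not in key:
            continue
        if criterion == "domain" and not key.endswith("@" + domain.lower()):
            continue
        matches.append(email)
    return matches


def write_matches(out, emails, criteria_name, source, page_url, start=1, today=None):
    """Write one CSV row per match in the Part A column order.

    Columns: # of Match, Criteria Name, Date, Source, Web Page URL, Email Address.
    Returns the next match number so numbering can continue across pages.
    """
    today = today or date.today()
    writer = csv.writer(out)
    number = start
    for email in emails:
        writer.writerow(
            [number, criteria_name, today.isoformat(), source, page_url, email]
        )
        number += 1
    return number
```

A caller would run `extract_emails` on each crawled page and pass the result to `write_matches` along with the "Criteria Name" and "Source" inputs from step 1, carrying the returned match number over to the next page. Part B would add the extra profile columns to the row in the same way.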
1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.
2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):
a) For web sites or other server-side deliverables intended to only ever exist in one place in the Buyer's environment--Deliverables must be installed by the Seller in ready-to-run condition in the Buyer's environment.
b) For all others including desktop software or software the buyer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request.
3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement).
We are open to any development language, although anything in the .NET world would be preferred. The application must run under Windows.