For this exciting project you will be scraping a large content website.
Our target price for this project is $15.
We will give you the address of a websites each with at lest a few hundred pages
For all of the normal content/article pages, you will need to:
1) Scrape the content
2) Result should be presented as CSV file
3) Parse content and save the following fields: title, abstract, keywords, body, category (some fields may not be available for this particular content)
4) Remove specific string patterns that we define.
The resulting content must be free of any images and html tags, but must maintain spaces and paragraph indicator.
We are looking to complete this project quickly – By May 2nd.
We will need the freelancer to show us a small number of records for our approval before going and completing the project.
Please use the phrase super-scraper in your response, so we know you have read this description.
We expect to have additional work like this.
15 freelancere byder i gennemsnit $138 på dette job
Hi there, can you please supply the URLs that you want scraped and I will provide you with a sample of my work, free of charge and obligation. Thanks
super-scraper Hello, see this link for similar work by me for collecting WOW games gold coin prices. [login to view URL] Waiting for your reply. Thanks.