First script requirements:
1. Go through every article and extract website links and copy it to some database (txt or something), get names from that article and generate those names with website domain (example : if the extracted website link is [login to view URL], and the names are John Snow, Jane Doe, then the script will generate [login to view URL]@[login to view URL] and [login to view URL]@test.com.
2. Since the website will ban you after 50 articles, script needs to have some IP changing solution (Tor circle changing, Identity changing, socks5 etc...)
3. If the article doesn't have a website, the script needs to copy the title and search it via search engine (Google, DuckGoDuck, Yahoo etc...) with a prefix that i will provide later and copy it to database then again find names in the article and generate those names with finded domain.
4. Since some of the articles have email in image, the script needs to have OCR to harvest emails from those pictures.
5. I will need an easy explanation how can i change the domain of main extraction website(example from .com to .net etc...) and the PREFIX for searching tittles of articles with no websites.
Second script requirements:
The second script will go through every website from the database that first script made and go through every page of those websites crawling and gathering emails.
The project will be divided in 3 milestones.
14 freelancere byder i gennemsnit $187 på dette job
Hi There! I have more than 8-year experience in this field. Would you please share more details about the project? I am really interested to work with you for long. Best Regards, Santosh