The job is to scrape data from the following web page: "[url removed, login to view]".
The job must be done WITHOUT the use of browser emulation (selenium, phantomjs, etc.) or macros. We already have a solution that uses selenium but it is much too slow for our purposes. We prefer that the job be done in python, but this is not a requirement.
We can provide code to solve the captcha, that is not the challenging aspect of this job. If you determine that the job cannot be done without browser emulation, we can make an exception if your solution is able to scrape pages in a timely manner (under 5 seconds per page).
To test the program, you can select the default court (Junta Especial No. 1) and for No. de Expediente you can use the value: 123/2015. It is not necessary to parse the results. Just write the results to a file.
Before bidding, please prove that you actually visited the site and are sure you can do the job. To prove that you visited the site, please mention something on the result page (using the example expediente 123/2015).
33 freelancers are bidding on average $3305 for this job
Hi, I have developed similar scraper/crawler and data/web automation projects. Please let me know if you are interested and I am available to start right away.
How many file numbers will there be per Junta? Do you provide file numbers or can I use numbers like 001/2017? I can provide you with samples and I can scrape faster than 5 seconds per page.
Hello, I can scrape this in python without using selenium. It would much faster then 5s. If you have code for captcha even better, I can have this done in 1 day Josef