Collect information from a website (using script) and deliver results as XLS or CSV file
Budget $30-250 USD
Project Summary:
Create an excel (.xls) or .csv file containing information extracted from the New York Department of State Website that publishes licensing information for real estate licensees across New York State.
Data Source:
The data is all available in a single, publicly accessible website (with no registration required for search). The web interface offers several filter criteria to generate a set of search results. The web interface permits search results to be displayed in no more than 30 rows per page.
In the result set, each row contains the name of one brokerage, which is an embedded hyperlink to a page with detailed information about the brokerage. Each detailed information includes information about the brokerage as well as several principals associated.
Extraction Algorithm:
The result set should include only information search results for PRINCIPAL OFFICE entries within WESTCHESTER county.
For each PRINCIPAL OFFICE, the result set should include information for each RELATED PARTY NAME if the LICENSE TYPE is one of:
*Corporate Broker:Corp. Broker
*Limited Liability Company Broker:LL Comp Broker
*Trade Name Broker:TN Broker
*Individual Broker:Indv. Broker
For each PRINCIPAL OFFICE, the result set should include [RELATED PARTY NAME, LICENSE TYPE and ADDRESS] for each RELATED PARTY NAME if the LICENSE TYPE is one of:
*Corporate Broker:Corp. Broker
*Limited Liability Company Broker:LL Comp Broker
*Trade Name Broker:TN Broker
*Individual Broker:Indv. Broker
For each PRINCIPAL OFFICE, the result set should include [RELATED PARTY NAME, LICENSE TYPE, and ADDRESS] for each RELATED PARTY NAME if the LICENSE TYPE is "Branch Office:Branch Office" and the ADDRESS for the RELATED PARTY NAME is in New York, NY
Result Set Format:
*The Registered Name
*Main Address parsed into
--Full Street Address including floor or suite number where appropriate
--City
--Zip
*Area Code & Telephone Number (if available)
*Related Party Name
*License Type
*Related Party Name Address
NOTE: The result set can contain duplicated information for a Registered Name, if there are multiple Related Party Names associated.
Tildelt til:
36 freelancere byder i gennemsnit $116 på dette job
Hello, i have expertise in web scraping. If interested in my bid please contact me. Best Regards.
I worked on many similar projects, I have big experience in data mining projects. I can finish this task in short time, with the best quality.
Hello, I am an expert in data extraction with over 5 years of experience. Please provide URL. Thanks, Alex
Sir, I can do the project. Refer PMB. Looking for further discussions in this matter. with thanks and regards
Hi, can you provide the paginated URL where the paginated elements are displayed? Thanks.
Hi, Ready to start your work. Eagerly awaiting for your positive reply. Please check your inbox for further details. Thanks, Shaik.
I can do this with imacros for Firefox. I've already done similar projects. see examples in portfolio or on my website [url removed, login to view]