I am a consultant advising companies in a certain industry on cross-border business.
I need a piece of software to (i) collect data from an online directory and (ii) write the collected data to a CSV file.
The website has a database of recent industry news, organized by details like (i) company, (ii) address, (iii) project type, (iv) project completion. Note that the website doesn't simply list this news; you have to run a search. For example, if you search for a "project type" since a given "date", it will show the above details for every matching project in that period. On average, several thousand projects are listed for each monthly period.
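To illustrate the kind of search described above, here is a minimal Python sketch of how such a query could be built. Everything specific in it is an assumption: the base URL, the parameter names (`project_type`, `since`, `page`) and the idea that the search runs as a plain GET request are placeholders, since the actual site may use a form submission or different field names.

```python
from datetime import date
from urllib.parse import urlencode

# Hypothetical endpoint; replace with the real search page of the directory.
BASE_URL = "https://example.com/search"


def build_search_url(project_type, since, page=1):
    """Build a search URL for one project type since a given date.

    The parameter names are assumptions for illustration only.
    """
    params = {
        "project_type": project_type,
        "since": since.isoformat(),  # e.g. 2024-01-01
        "page": page,                # several thousand results likely span pages
    }
    return f"{BASE_URL}?{urlencode(params)}"
```

With several thousand projects per month, the results will probably be split across pages, which is why the sketch carries a page number.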
The task would be for the software to run a general search as described above, collect the recent project details from the website, and write that data in a structured format to a CSV file.
The website presents the relevant data in a way that is not ideal for extraction. Basically, the information categories listed above are separated only by commas (","). Accordingly, it will be necessary to run some filters on the output in order to categorize it correctly. The CSV file would again be sorted by details like (i) company, (ii) address, (iii) project type, (iv) project completion.
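The filtering and CSV step could look roughly like the Python sketch below. It rests on an assumption about the layout of each result line: the company comes first, the project type and completion come last, and whatever sits in between is the address (which may contain commas itself). The function names and column names are made up for illustration; the real rules depend on what the website actually returns.

```python
import csv

FIELDS = ["company", "address", "project_type", "project_completion"]


def parse_record(line):
    """Split one comma-separated result line into the four categories.

    Assumption: first item = company, last two items = project type and
    completion, everything in between = address.
    """
    parts = [p.strip() for p in line.split(",")]
    if len(parts) < 4:
        raise ValueError(f"expected at least 4 comma-separated items: {line!r}")
    return {
        "company": parts[0],
        "address": ", ".join(parts[1:-2]),
        "project_type": parts[-2],
        "project_completion": parts[-1],
    }


def write_csv(lines, out):
    """Parse all non-empty lines, sort them, and write them as CSV to `out`."""
    rows = [parse_record(line) for line in lines if line.strip()]
    # Sort by company, then address, then project type, then completion.
    rows.sort(key=lambda row: tuple(row[field] for field in FIELDS))
    writer = csv.DictWriter(out, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(rows)
    return len(rows)
```

To write an actual file, open it with `open("projects.csv", "w", newline="", encoding="utf-8")` and pass the file object as `out`. The `csv` module quotes any field that contains a comma, so multi-part addresses stay in one column.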