We are looking for a professional / company to design us a bespoke software for data cleansing and extraction into template based csv files. The input will be through csv files.
Input file will be a csv and it is essential that all the columns/fields in the file are stored.
The product should be able to provide extract to any number of columns which the user can select through drop down or in any simple manner (keeping in mind that the end users are not technically efficient).
It is essential that all the code is provided at the end of the project and the project will be termed successful only after successful installation and after complete data extraction cycle.
This product is a stand alone product and we are not very fussed about making it a web based system. Please mention the technology that you would be using when bidding for the product.
Please note that the deliverable includes complete documentation with screen shots of each and every step involved (installation and operational).
## Deliverables
It is essential that the final product meets the following
* Ability to import and store data from csv files in the provided format
* Ability to clean data(each column) using rules and store it example strip HTML tags
* Ability to store data with time stamps and file names
* Tracking on imported records by date/time stamp
* Ability to extract data in csv with various formats
* Ability to provide different column headings depending on the format selected for extracts
* Ability to extract data on file names that were used to import the data
* Should be able to store the input and output formats in template files so that the next time and import or export is to be run the software can allow the user to use the existing templates
* Should be able to support upto 5 million records
* User authentication and audit needed
* Job to setup periodic backup and restore function to restore from a given backup file.