This project has two parts and an optional third part we'd like a quote on and overall needs to provide for later extension expansion.
1. Scrape data from PDF files and enter data into MySQL DB. The data consist of the name and features of individual health insurance policies from different insurance companies. Initially, the data will be of policies from one state. The data will consist of text and numbers, including, cost or premium, amount of deductible, age of insured, and other specific features. (A sample of a PDF from one insurance company is attached for reference. The PDF shows features but not rates. Those should be in a separate table because they will need to be updated periodically). Overall there will be data from perhaps 100 policies (more or less) and each policy will have multiple premium costs based on different ages and other parameters (such as smoking/non-smoking). There also will be data for the cost of adding children to policy. Also attached is a rate spreadsheet that will be similar to what will be provided for rate information.
2. Part two involves data processing. A person will enter into an online form certain data elements (called a "census") about a group of individuals. For example, it will have a total number of individuals, the average age of the individuals, smoker versus non-smoker, etc. The number of individuals in the census are from 1 to n. The data processing engine we need will take this information and match it against the database to come up with a rough estimate of the total premium costs for the group. We will want three estimates for each census. For example, a census of 10 people has 5 people average age of 25 and 5 average of 55. We want an estimate of the cost to insure them all in total with one estimate using lower cost high deductible policies, one using mid-cost mid-deductible policies, and one higher cost using low deductible policies. The engine will come up with an average rate for each age group then multiply by 5 and then added together. That is done for each high, medium, low cost policies for three aggregate number estimates. The screen for data entry needs to be clean and uncluttered for use by administrators and others (it will not be publicly available - only to those with login).
3. OPTIONAL - We also would like feedback on the potential cost of additionally building online tables of plan information to enable individuals to sort and choose policies from the available options. They would input certain features, such as choose a high, medium, or low deductible and then be presented with a list of plans. See Vimo, ehealthinsurance for an idea of what we'd like (but we want it to be better).
54 freelancers are bidding on average $5450 for this job
this is a nice project,i would love to work on it if you have a fair budget and if you are looking for a top quality work!,we can talk about the optional part as well!