I have a mySQL .SQL data file which needs some cleaning up, with a script (not manually).
What needs to be done:
1) The 'Details' field is currently a messy aggregation of five information fields.
Color, Size, Fabric, Weight, Producing Time
Example:
a) 'COLOR:pink,green,purple,blue\r\nSIZE:s,m,l\r\nFABRIC:100%cotton\r\nWEIGHT:0.8kg'
b) 'Color:blue,size:S,M,L,Fabric:100%cotton,weight:0.2kg,ProducingTime:7days'
The table needs to be redefined and this field needs to be parsed/split out into the 5 separate fields.
2) There are some spelling errors and typos within the fields. I need the unique values (groupBy) list of each field so I can provide input on what needs to be cleaned up/reconciled to an approved list of Correct values. Then you need to be able to apply that 'correct list' to the data.
----
Here is the current [login to view URL] and there are 13000 records in the table.
This is the [login to view URL]
create table products (
id int not null,
offer_no varchar(32),
brand varchar(32),
category varchar(32),
subcategory varchar(32),
quantity int not null default 0,
quantity_min int not null default 0,
price varchar(32),
style varchar(32),
weight varchar(32),
details tinytext,
image_1 varchar(128),
image_2 varchar(128),
image_3 varchar(128),
image_4 varchar(128),
primary key (id))