I have the attached XML file. I need the sentences in it recreated as plain text, one sentence per line. The program should generate 3 files: the 1st with all words in their infinitive form; the 2nd with all sentences in Subject Verb Object (SVO) form, so it needs to check what role each word plays in the sentence and put those words in the right order without changing the other words; the 3rd a combination of the 1st and 2nd. Encoding MUST be UTF-8. Sentences start with . A word is in ciągnie tags and its infinitive in ciągnąć tags. Parts of a sentence are in subst:sg:nom:m2 tags. If a tag contains "subst" and "nom", the word is the Subject; if it contains "perf" or "imperf", it is the Verb; if it contains "subst" and anything but "nom", it is an Object.
The application can run on Linux or Windows, and it can be a script. My preference is Linux with both a GUI and command-line usage, but that is not obligatory.
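To make the tagging rule concrete, here is a rough Python sketch of how the Subject/Verb/Object check and the SVO reordering might look. It is only an illustration under assumptions: the function names classify and to_svo are hypothetical, the input is assumed to be already extracted from the XML as (word, tag) pairs, every sample tag other than subst:sg:nom:m2 is invented, and the choice to move only the first subject, verb and object (swapping them among the positions they already occupy) is one possible reading of "not changing other words".

```python
def classify(tag):
    """Map a colon-separated tag such as 'subst:sg:nom:m2' to 'S', 'V', 'O' or None."""
    parts = tag.strip().split(":")
    if "subst" in parts:
        # "subst" with "nom" is the Subject, "subst" without "nom" is an Object.
        return "S" if "nom" in parts else "O"
    if "perf" in parts or "imperf" in parts:
        return "V"
    return None


def to_svo(tokens):
    """Return the (word, tag) pairs with the first S, V and O placed in SVO order.

    Assumption: only the first token of each role is moved, and the three
    tokens are swapped among the positions they already occupy, so every
    other word stays where it was.
    """
    first = {}
    for index, (_, tag) in enumerate(tokens):
        role = classify(tag)
        if role is not None and role not in first:
            first[role] = index
    if len(first) < 3:
        # No subject, verb or object found: leave the sentence unchanged.
        return list(tokens)
    slots = sorted(first.values())
    result = list(tokens)
    for slot, role in zip(slots, "SVO"):
        result[slot] = tokens[first[role]]
    return result


# Hypothetical sample sentence in Object-Verb-Subject order (tags are illustrative).
sample = [
    ("psa", "subst:sg:acc:m2"),
    ("mocno", "adv"),
    ("ciągnie", "fin:sg:ter:imperf"),
    ("kot", "subst:sg:nom:m2"),
]
print(" ".join(word for word, _ in to_svo(sample)))  # prints "kot mocno ciągnie psa"
```

Reading the real tags out of the XML and writing the three UTF-8 output files would sit around these two functions; that part depends on the actual element names in the attached file, so it is left out here.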
13 freelancers are bidding on average $106 for this job
I have some experience with this kind of work. I wrote Python code that read a large XML file and inserted its contents into the Knowledge Base of a Semantic Web project (at my university)... So I can do exactly this...