Indexer for Word documents with Python
- Status: Closed
- Prize: €35
- Entries received: 2
- Winner: ashki98
Contest Instructions
I have an MS Word 2010 document and want to automate the creation of its index while keeping a human component, so that the index is tailor-made.
The Python code I need, using NLTK, should perform the following steps:
1. Extract only the words that start with a capital letter from a given Microsoft Word document.
2. Tokenize those words and create an MS Excel spreadsheet with two columns (word, frequency).
3. [I will edit the Excel spreadsheet by hand to make sure only the desired words get into the index.]
4. Afterwards, the words left in the Excel spreadsheet should be the source for creating the index. This should be done by inserting { XE "word" } after the corresponding word in the original MS Word file.
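As a rough illustration of steps 1 and 2, the counting logic might look like the sketch below. It makes several assumptions that are not part of the request: the document text has already been read into a plain string (for example from the paragraphs of the .docx file), a regular expression stands in for an NLTK tokenizer, and the table is written as CSV, which Excel can open, rather than as a native .xlsx file. The names `capitalized_word_counts` and `write_frequency_table` are hypothetical.

```python
import csv
import io
import re
from collections import Counter

# A "word" here is a run of Unicode letters; digits and underscores are excluded.
WORD = re.compile(r"[^\W\d_]+")


def capitalized_word_counts(text):
    """Count only the words whose first letter is uppercase."""
    words = WORD.findall(text)
    return Counter(w for w in words if w[0].isupper())


def write_frequency_table(counts, out):
    """Write a two-column table (word, frequency) to a file-like object."""
    writer = csv.writer(out)
    writer.writerow(["word", "frequency"])
    # Most frequent first, ties broken alphabetically.
    for word, freq in sorted(counts.items(), key=lambda kv: (-kv[1], kv[0])):
        writer.writerow([word, freq])


# Small demonstration on an in-memory string instead of a real document.
sample_text = "Goethe met Schiller in Weimar. Goethe wrote."
buffer = io.StringIO()
write_frequency_table(capitalized_word_counts(sample_text), buffer)
print(buffer.getvalue())
```

Words that are capitalized only because they begin a sentence are counted too, which is one reason the manual review in step 3 is useful.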
Perhaps you can create two separate code snippets for the automation.
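For step 4, one possible approach is to work directly on the document's XML, where an index entry is stored as a complex field: a run with a `fldChar` of type `begin`, a run with an `instrText` holding ` XE "word" `, and a run with a `fldChar` of type `end`. The sketch below uses only the standard library and operates on an already parsed XML tree; in a real .docx file that XML lives in `word/document.xml` inside the zip archive. The function names are hypothetical, the term list is assumed to come from the edited spreadsheet, and as a simplification the field is placed after the run that contains the word rather than directly after the word itself.

```python
import re
import xml.etree.ElementTree as ET

W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
XML_SPACE = "{http://www.w3.org/XML/1998/namespace}space"
ET.register_namespace("w", W_NS)


def w(tag):
    """Return a tag or attribute name qualified with the WordprocessingML namespace."""
    return f"{{{W_NS}}}{tag}"


def xe_field_runs(term):
    """Build the three runs of a complex field holding { XE "term" }."""
    begin = ET.Element(w("r"))
    ET.SubElement(begin, w("fldChar"), {w("fldCharType"): "begin"})
    instr_run = ET.Element(w("r"))
    instr = ET.SubElement(instr_run, w("instrText"), {XML_SPACE: "preserve"})
    instr.text = f' XE "{term}" '
    end = ET.Element(w("r"))
    ET.SubElement(end, w("fldChar"), {w("fldCharType"): "end"})
    return [begin, instr_run, end]


def mark_index_entries(body, terms):
    """Insert an XE field after each run containing a term; return the number inserted."""
    patterns = {t: re.compile(rf"\b{re.escape(t)}\b") for t in terms}
    inserted = 0
    for paragraph in list(body.iter(w("p"))):
        # Only runs that are direct children of the paragraph are considered.
        for run in list(paragraph.findall(w("r"))):
            text = "".join(t.text or "" for t in run.findall(w("t")))
            for term, pattern in patterns.items():
                if pattern.search(text):
                    position = list(paragraph).index(run) + 1
                    for offset, field_run in enumerate(xe_field_runs(term)):
                        paragraph.insert(position + offset, field_run)
                    inserted += 1
    return inserted


# Demonstration on a tiny hand-written fragment instead of a real document.
sample = (
    f'<w:body xmlns:w="{W_NS}"><w:p><w:r>'
    "<w:t>Goethe lived in Weimar.</w:t></w:r></w:p></w:body>"
)
body = ET.fromstring(sample)
print(mark_index_entries(body, ["Goethe"]))
```

A word that Word has split across several runs would be missed by this sketch, so a production version would need to merge or split runs before matching.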
Employer Feedback
“It was a great pleasure to work with ashki98. He copes with every problem in a very successful way. Thanks”
FasaniVerlag, Germany.
Top entries from this contest
- abdohusseinelab2, Egypt