Indexer for Word documents with Python

  • Status: Closed
  • Præmier: €35
  • Modtagne indlæg: 2
  • Vinder: ashki98

Konkurrence Instruktioner

I have a MS Word 2010 document and want to automatise the creation of the index with a human component, so the index is tailor-made.

The python code with NLTK I need should do the following steps:
1. Extract only the words which start with a capital letter of a given Microsoft Word document.
2. Tokenize only the words and create a MS Excel datasheet with two rows (word, frequency).
3. [I want to edit the Excel datasheet, to make sure, only the desired words get into the index]
4. Afterwards the words in the Excel datasheet should be the source for creating the index. This should be done with inserting { XE “word” } after the particular word in the original MS Word file.

Perhaps you can create two different snippets of code for automation.

Anbefalede færdigheder

Arbejdsgiverfeedback

“It was a great pleasure to work with ashki98. He copes with every problem in a very successful way. Thanks”

Profilbillede FasaniVerlag, Germany.

Bedste indlæg fra denne konkurrence

Se flere indlæg

Offentlig Præciserings Opslagstavle

  • ashki98
    ashki98
    • 6 år siden

    Can you please clarify point 4 & 5 so that I can modify my code to your requirements.

    • 6 år siden
  • rightroad
    rightroad
    • 6 år siden

    I'm working on it :) don't hesitate to contact me if you want to provide more details.. Best Regards.

    • 6 år siden

Sådan kommer du i gang med konkurrencer

  • Opret din konkurrence

    Opret din konkurrence Hurtigt og nemt

  • Få tonsvis af indlæg

    Få tonsvis af indlæg Fra hele verden

  • Tildel det bedste indlæg

    Tildel det bedste indlæg Download filerne - Nemt!

Opret en Konkurrence Nu eller slut dig til os i dag!