Here is the scenario. I have many pdf files. I have highlighted text that appears as comments in those files. I have exported just those comments into text files. Those text files appear with an author tag, time-stamp, date, and page number. They are ordered by page number. I need everything removed (author, timestamp, date, blank lines, etc) EXCEPT for the comments and the page numbers (keep these).
The main issue here is the timestamp is not consistent. It follows no specific pattern. So I need someone who can make a script that will accomplish what I described and have that script work on a mac. I can provide demo pdfs with comments to work with.
I would prefer if this was a sed/bash/grep/regex/ or applescript.
6 freelancers are bidding on average $117 for this job
I have been working in building unix shell scripts for over 4+ years. I have worked on several scenarios as mentioned in the requirement. And i am well versed with all the logics used in unix shell scripting.