I am looking for someone to help me develop an application that parses Microsoft Word documents according to a predefined set of rules and highlights problem areas. It could be a standalone application written in C or Java, or a Word plug-in written in Visual Basic. I am open to other language suggestions if they make sense.
The parser should be able to do the following:
1. Identify syntactic and semantic problems - I imagine the easiest way to do this would be to incorporate the code of the Link Grammar Parser, or a similar parser, into the application.
2. Identify other structural problems according to a set of rules that I would provide in a separate file. I am open to suggestions as to what grammar should be used to formulate these rules (probably some parsing expression grammar). The rules should be able to, e.g., require a closing parenthesis for every opening parenthesis (including nested parentheses), detect double spaces, etc.
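To make the two example rules in item 2 concrete, here is a rough sketch of the kind of check I have in mind. It is written in Python only for brevity; the same logic would carry over to Java or C. It assumes the document text has already been extracted from Word as a plain string, and all function names (check_parentheses, check_double_spaces, check_text) are made up for illustration. In the real application the rules would be read from the separate rules file rather than hard-coded like this.

```python
import re


def check_parentheses(text):
    """Return (offset, message) pairs for unbalanced parentheses.

    A stack of open-parenthesis offsets is used so that nested
    parentheses are matched correctly.
    """
    problems = []
    stack = []  # offsets of "(" that have not been closed yet
    for offset, ch in enumerate(text):
        if ch == "(":
            stack.append(offset)
        elif ch == ")":
            if stack:
                stack.pop()  # closes the most recent open parenthesis
            else:
                problems.append((offset, "closing parenthesis without opening parenthesis"))
    # Anything left on the stack was opened but never closed.
    for offset in stack:
        problems.append((offset, "opening parenthesis is never closed"))
    return problems


def check_double_spaces(text):
    """Return (offset, message) pairs for runs of two or more spaces."""
    return [(m.start(), "multiple consecutive spaces")
            for m in re.finditer(r" {2,}", text)]


def check_text(text):
    """Run both hard-coded example rules and return problems sorted by offset."""
    return sorted(check_parentheses(text) + check_double_spaces(text))


if __name__ == "__main__":
    for offset, message in check_text("a (b (c) d  e"):
        print(offset, message)
```

The offsets returned are positions in the extracted text, which the application would then map back to ranges in the Word document in order to highlight them.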
The application should either run as a standalone application on Microsoft Windows, in which case I would also need an installer, or work as a plug-in to Microsoft Word.