Need accurate character count of Word documents using antiword. Currently, MS Word docs are placed in temp directory, parsed by antiword, with resulting character count. Counts are coming in anywhere from near perfect to 5% variance - I need less than 1 percent variance compared to what Word reports. Have code in place to remove any whitespace above two spaces after end-sentence punctuation, and to include tabs and returns.
Code written for php on Linux box. Need soon, as larger project is on hold until counts are to within 1 percent every time! Need someone to review and to correct or make additions to code to bring counts within 1% - consistently.
Will pay for project when complete, tested, and working well every time. Need someone on it soon, and someone who will stay in touch. Please do not expect payment ahead of time. The code is there, it just needs tweaking and someone needs to know antiword inside and out and have the experience to fix existing code. (four lines).
Code will be provided to winning bidder.