Accurate PDF OCR for .doc files