Accurate OCR for PDF to DOC conversion