Extract text from PDF to DOC