Extract text from PDF and save as DOC
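
The title names a two-step task: read the text out of a PDF, then write it into a Word document. The source names no tools, so everything in this sketch is an assumption: the third-party packages `pypdf` and `python-docx`, the helper names `to_paragraphs`, `extract_pdf_text`, `save_as_docx` and `convert`, and the choice to write `.docx` rather than the legacy binary `.doc` format, which `python-docx` cannot produce. It is one possible approach, not a definitive implementation.

```python
import re

# Control characters that XML, and therefore .docx, cannot store.
# PDF extraction sometimes emits them (for example form feeds).
_CONTROL_CHARS = re.compile(r"[\x00-\x08\x0b\x0c\x0e-\x1f]")


def to_paragraphs(text):
    """Split extracted text into clean paragraphs, using blank lines as breaks."""
    text = _CONTROL_CHARS.sub("", text)
    blocks = re.split(r"\n\s*\n", text)
    # Collapse the hard line breaks that extraction leaves inside a paragraph.
    return [" ".join(block.split()) for block in blocks if block.strip()]


def extract_pdf_text(pdf_path):
    """Return the text of all pages, with a blank line between pages."""
    # Assumed dependency, imported lazily: pip install pypdf
    from pypdf import PdfReader

    reader = PdfReader(pdf_path)
    # extract_text() can return an empty result for pages without a text layer.
    return "\n\n".join(page.extract_text() or "" for page in reader.pages)


def save_as_docx(paragraphs, docx_path):
    """Write each string in `paragraphs` as one paragraph of a .docx file."""
    # Assumed dependency, imported lazily: pip install python-docx
    from docx import Document

    document = Document()
    for paragraph in paragraphs:
        document.add_paragraph(paragraph)
    document.save(docx_path)


def convert(pdf_path, docx_path):
    """Hypothetical end-to-end helper: PDF text in, Word document out."""
    save_as_docx(to_paragraphs(extract_pdf_text(pdf_path)), docx_path)
```

With both packages installed, `convert("input.pdf", "output.docx")` would run the whole pipeline; the file names are placeholders. Only plain text is carried over, so fonts, images, and table layout are lost, and scanned PDFs without a text layer would need OCR first.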