OCR technology for converting PDFs