Seamless PDF text extraction