|
OCR conversion to searchable PDF
The OCR process identifies text in each page with near perfect accuracy and create a hidden text layer within the same PDF file in such a way that each text word is exactly positioned behind its appearance in the image. This recognized text can now be used in three ways:
You can easily OCR any image-based PDF file or PDF file created by scanning. Docsvault can convert your documents into editable text (Notepad) and fully searchable OCR’d PDF files, the perfect format to store and share text based information over or beyond your organization.
In a searchable PDF, the original scanned image is retained so any human can read the document. The textual content that is extracted via OCR is put behind the image so search indexers can see it and you can select it as text in any PDF Editor. PDF searchable is very useful for you where you would like to have your documents in PDF format as well as have the ability to search the documents by it’s contents. PDF searchable files provide a reliable and easy way of searching PDF documents.You can retrieved the OCR'd documents either by browsing through Docsvault or by searching for a document using Search Option. The indexing provided by Docsvault indexer provides high performance of retrieval. [Indexing is a system service that helps you to quickly find files on your computer using text searches. When you perform OCR on Portable Document Format (PDF) recognized text is available to the index, making it possible to find relevant PDF files when you search.]
While performing OCR, the program analyzes the image and detects areas that contain text.
Page url: http://www.docsvault.com/online-help/professional/index.html?ocr_to_searchable_pdf_file.html |