At the DLS Technology Roundtable on June 9, 2009, we discussed to what extent Google indexes PDF files.
Google has fully indexed documents directly saved as PDFs since 2001. In October 2008, it expanded its scope by also indexing documents scanned into PDF format. These documents are more difficult to index, because scanning creates a digital picture of a page. Google uses optical character recognition to index these scanned files. For more information, see the announcement at http://googleblog.blogspot.com/2008/10/picture-of-thousand-words.html.
Do you have a technology tip or information you’d like to share with other DLS libraries? Send it to Jane Plass for possible publication as a future Tech Tidbit on our Here & Now blog.
Discussion of products and/or services is for information only and does not imply suitability for your library.
Comments