University of California professor Geoffrey Nunberg published a scholarly look at Google’s book scanning and search project in the Chronicle of Higher Education a few days ago.
In the article, Nunberg points out many of the mistakes in Google’s library, such as:
Listing 1899 as the publication date for books like Raymond Chandler’s Killer in the Rain, Robert Shelton’s biography of Bob Dylan and Stephen King’s Christine, to name a few.
Searching for the word Internet in books published before 1950 produced more than 500 results.
Searching for Medicare in books published before 1950 got almost 1,600 hits.
An entry for the book The Mosaic Navigator: The Essential Guide to the Internet Interface was apparently written in 1939 by Sigmund Freud and Katherine Jones.
Nunberg thinks that Google got into literary water a little over its head when it took on this project:
I have the sense that a lot of the initial problems are due to Google’s slightly clueless fumbling as it tried master a domain that turned out to be a lot more complex than the company first realized. It’s clear that Google designed the system without giving much thought to the need for reliable metadata. In fact, Google’s great achievement as a Web search engine was to demonstrate how easy it could be to locate useful information without attending to metadata or resorting to Yahoo-like schemes of classification. But books aren’t simply vehicles for communicating information, and managing a vast library collection requires different skills, approaches, and data than those that enabled Google to dominate Web searching.
In other words, it’s just possible that classifying books to make them useful and searchable might be a little more complicated than running an engine for common searches, like the lyrics of a song or the dates of a war.
Have you used Google’s book search and found it lacking? Do you have horror stories about trying to find information only to come up with mangled results? Let me know your horror stories (or even your praise) in the comments.
I use Google book search all the time when looking for historical information. Many times I would be going through a book page by page, and the scanner's hand is visible in the scan!
Google Book Search disappoints the scholarly searcher
University of California professor Geoffrey Nunberg published a scholarly look at Google’s book scanning and search project in the Chronicle of Higher Education a few days ago.
In the article, Nunberg points out many of the mistakes in Google’s library, such as:
Nunberg thinks that Google got into literary water a little over its head when it took on this project:
In other words, it’s just possible that classifying books to make them useful and searchable might be a little more complicated than running an engine for common searches, like the lyrics of a song or the dates of a war.
Have you used Google’s book search and found it lacking? Do you have horror stories about trying to find information only to come up with mangled results? Let me know your horror stories (or even your praise) in the comments.