Google Book Search disappoints the scholarly searcher
University of California professor Geoffrey Nunberg published a scholarly look at Google’s book scanning and search project in the Chronicle of Higher Education a few days ago.
In the article, Nunberg points out many of the mistakes in Google’s library, such as:
- Listing 1899 as the publication date for books like Raymond Chandler’s Killer in the Rain, Robert Shelton’s biography of Bob Dylan and Stephen King’s Christine, to name a few.
- Searching for the word Internet in books published before 1950 produced more than 500 results.
- Searching for Medicare in books published before 1950 got almost 1,600 hits.
- An entry for the book The Mosaic Navigator: The Essential Guide to the Internet Interface was apparently written in 1939 by Sigmund Freud and Katherine Jones.
Nunberg thinks that Google got into literary water a little over its head when it took on this project:
I have the sense that a lot of the initial problems are due to Google’s slightly clueless fumbling as it tried master a domain that turned out to be a lot more complex than the company first realized. It’s clear that Google designed the system without giving much thought to the need for reliable metadata. In fact, Google’s great achievement as a Web search engine was to demonstrate how easy it could be to locate useful information without attending to metadata or resorting to Yahoo-like schemes of classification. But books aren’t simply vehicles for communicating information, and managing a vast library collection requires different skills, approaches, and data than those that enabled Google to dominate Web searching.
In other words, it’s just possible that classifying books to make them useful and searchable might be a little more complicated than running an engine for common searches, like the lyrics of a song or the dates of a war.
Have you used Google’s book search and found it lacking? Do you have horror stories about trying to find information only to come up with mangled results? Let me know your horror stories (or even your praise) in the comments.
-
http://www.hypercrit.net Michael Becker
-
http://thedispersalofdarwin.wordpress.com/ Michael D. Barton
-
http://intensedebate.com/people/becker Michael Becker
-
http://www.hypercrit.net Michael Becker
Twitter
- 5 Reasons QR Code Marketing Is Broken (and How to Fix It) http://t.co/PlnRoigE via @zite 4 hours ago
- Good god, I haven't gotten a change to take my fingers off my keyboard all day. 11 hours ago
- Damn it all! Will nothing keep #Firebug from slowing #Firefox to a crawl?! 2012/05/16
My Instapaper Queue




