Google’s Updates Ngram Viewer, Showing How Words Have Evolved Over time

Google announced earlier today that version 2.0 of the popular Google Books Ngram Viewer is now available online. What’s an Ngram Viewer? In a nutshell, Ngram Viewer lets you find and visualize how words and phrases have developed and been used over time using the 30 million print books Google has scanned working with libraries located around the world as its dataset.

The service debuted in December, 2010 at the time this research paper was published in Science.

Ngram Viewer was developed as a research tool for linguists, lexicographers, historians and others but has proven to be popular tool for others. Google says that more than 45 million word comparison graphs have been created in Ngram Viewer’s first 22 months.

In a Google Research Blog Post, Google Engineering Manager and Ngram Viewer co-creator, John Orwant, says that version 2.0 is using a new dataset with material from more books.

Orwant adds that along with more data, the optical character recognition (OCR) that Google uses when scanning books is better, and Google has also made improvements in how it deals with the metadata provided by both publisher and library partners.

The quality of Google’s scanning and metadata has been under scrutiny since the beginning of the project.

We covered some of the initial problems with Ngram Viewer when it launched in “When OCR Goes Bad: Google’s Ngram Viewer & The F-Word.” Note: Adult language used in the article and demo searches. 

As an example, the “medial S” appears to still be causing inaccurate results.

Here’s the current version of a search used in the story where you’ll see some of the same issues raised back in 2010.

Of course no scanning method, metadata source or database are 100% perfect, but that doesn’t mean you shouldn’t take advantage of what Ngram Viewer offers. Our only advice, as is the case with any database or reference resource, is to review and question what you find.

Ngram Version 2.0 also can now automatically automatically identify parts of speech and compare how a word is used. For example, how the word “cheer” is used as a verb and noun over time:

With the new version, you can also now add, subtract, multiply and divide Ngram counts. For instance, you can see how “record player” rose as the popularity of “Victrola” declined:

You can learn more about how Ngram Viewer works on this info page.

With a bit of understanding of what Ngram Viewer can and can’t do, because of its size, it’s a unique resource that can be both educational, informative and even fun for just about anyone who is interested in the history of how language evolves.

Related Topics: Channel: Other | Google: Other


About The Author: is a librarian, author, and an online information analyst based in suburban Washington, DC. He is the co-founder and co-editor of INFOdocket and FullTextReports.com and prior to that was founder/editor of ResourceShelf and DocuTicker for 10 years. He has worked for Blekko, Ask.com, and at Search Engine Watch where he was news editor. In 2001, Price was the co-author (with Chris Sherman) of the best-selling book The Invisible Web.

Connect with the author via: Email


SMX - Search Marketing Expo

SearchCap:

Get all the top search stories emailed daily!  

Like This Story? Please Share!

Other ways to share:

Like Our Site? Follow Us!

Subscribe to Our Feed! Join our LinkedIn Group Check out our Tumblr! See us on Pinterest Get Search Engine Land on your mobile device!
 

Read before commenting! We welcome constructive comments and allow any that meet our common sense criteria. This means being respectful and polite to others. It means providing helpful information that contributes to a story or discussion. It means leaving links only that substantially add further to a discussion. Comments using foul language, being disrespectful to others or otherwise violating what we believe are common sense standards of discussion will be deleted. Comments may also be removed if they are posted from anonymous accounts. You can read more about our comments policy here.

Comments are closed.

Get Our News, Everywhere!

 
  • Advertise With Us
 

Click to watch SMX conference video

Join us at an upcoming SMX event:

North America

EMEA

APAC

Search Engine Land produces SMX, the Search Marketing Expo conference series. SMX events deliver the most comprehensive educational and networking experiences - whether you're just starting in search marketing or you're a seasoned expert.

SMX Site » | SMX Difference » | SMX News »




 

Search Engine Land Periodic Table of SEO Ranking Factors

Get Your Copy
Read The Full SEO Guide