Ngram Viewer
The Google Books Ngram Viewer is a tool for analyzing word and phrase frequencies across the Google Books corpus.
According to Culturomics (retrieved 12:56, 18 December 2010 (CET)), “The Google Labs N-gram Viewer is the first tool of its kind, capable of precisely and rapidly quantifying cultural trends based on massive quantities of data. It is a gateway to culturomics! The browser is designed to enable you to examine the frequency of words (banana) or phrases ('United States of America') in books over time. You'll be searching through over 5.2 million books: ~4% of all books ever published!”
See also: Visualization
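The frequency-over-time measure described above can be approximated locally from downloaded data files. The following is a minimal sketch, not Google's implementation: it assumes tab-separated rows of the form n-gram, year, match count, plus a separate table of total word counts per year. The column layout, the function name `relative_frequency`, and all sample numbers are assumptions made up for illustration; check the actual data set documentation before adapting it.

```python
import csv
import io
from collections import defaultdict

# Illustrative, made-up rows in an ASSUMED tab-separated layout:
# ngram, year, match_count. Real data set files may use other columns.
SAMPLE_ROWS = """banana\t1900\t12
banana\t1950\t40
apple\t1900\t30
banana\t2000\t90
"""

# Hypothetical total word counts per year (also made up for illustration).
SAMPLE_TOTALS = {1900: 1_000_000, 1950: 2_000_000, 2000: 3_000_000}


def relative_frequency(rows_file, totals, target):
    """Return {year: match_count / total_words} for one n-gram."""
    counts = defaultdict(int)
    for row in csv.reader(rows_file, delimiter="\t"):
        if len(row) < 3:
            continue  # skip blank or malformed lines
        ngram, year, match_count = row[0], int(row[1]), int(row[2])
        if ngram == target:
            counts[year] += match_count
    # Only keep years for which a total is known, in chronological order.
    return {y: counts[y] / totals[y] for y in sorted(counts) if y in totals}


if __name__ == "__main__":
    freqs = relative_frequency(io.StringIO(SAMPLE_ROWS), SAMPLE_TOTALS, "banana")
    for year, freq in freqs.items():
        print(f"{year}\t{freq:.8f}")
```

Dividing by the yearly total matters because the number of digitized books differs greatly from year to year, so raw counts alone would mostly reflect corpus size rather than usage.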
Links
- Tools and data
- Ngrams Viewer (Google's tool)
- Data Sets: CSV files for local data processing.
- Official and semi-official
- Inside Google Books (informal Google blog), e.g.:
- Inside Google Books (December 16, 2010, cross-posted to the Google blog)
- Other
- Culturomics, e.g.:
- The cultural genome: Google Books reveals traces of fame, censorship and changing languages, Discover Magazine, December 16, 2010.
- Alternative online corpora
- The Corpus of Historical American English (COHA), by Mark Davies, Brigham Young University. A smaller corpus restricted to American English, but with better search capabilities.
References
- Michel, Jean-Baptiste; Yuan Kui Shen, Aviva Presser Aiden, Adrian Veres, Matthew K. Gray, The Google Books Team, Joseph P. Pickett, Dale Hoiberg, Dan Clancy, Peter Norvig, Jon Orwant, Steven Pinker, Martin A. Nowak, and Erez Lieberman Aiden (2010). Quantitative Analysis of Culture Using Millions of Digitized Books. Science. ScienceMag online version (12/16/2010).
- Lieberman, Erez; Jean-Baptiste Michel, Joe Jackson, Tina Tang, and Martin Nowak (2007). Quantifying the Evolutionary Dynamics of Language. Nature 449.