Title |
Assessing the Usefulness of Google Books’ Word Frequencies for Psycholinguistic Research on Word Processing
|
---|---|
Published in |
Frontiers in Psychology, January 2011
|
DOI | 10.3389/fpsyg.2011.00027 |
Pubmed ID | |
Authors |
Marc Brysbaert, Emmanuel Keuleers, Boris New |
Abstract |
In this Perspective Article we assess the usefulness of Google's new word frequencies for word recognition research (lexical decision and word naming). We find that, despite the massive corpus on which the Google estimates are based (131 billion words from books published in the United States alone), the Google American English frequencies explain 11% less of the variance in the lexical decision times from the English Lexicon Project (Balota et al., 2007) than the SUBTLEX-US word frequencies, based on a corpus of 51 million words from film and television subtitles. Further analyses indicate that word frequencies derived from recent books (published after 2000) are better predictors of word processing times than frequencies based on the full corpus, and that word frequencies based on fiction books predict word processing times better than word frequencies based on the full corpus. The most predictive word frequencies from Google still do not explain more of the variance in word recognition times of undergraduate students and old adults than the subtitle-based word frequencies. |
X Demographics
As of 1 July 2024, you may notice a temporary increase in the numbers of X profiles with Unknown location. Click here to learn more.
Geographical breakdown
Country | Count | As % |
---|---|---|
United Kingdom | 1 | 50% |
Germany | 1 | 50% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Scientists | 2 | 100% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
Nigeria | 9 | 10% |
Germany | 2 | 2% |
United Kingdom | 2 | 2% |
United States | 2 | 2% |
Japan | 1 | 1% |
Belgium | 1 | 1% |
Unknown | 73 | 81% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Student > Ph. D. Student | 19 | 21% |
Researcher | 18 | 20% |
Student > Bachelor | 11 | 12% |
Student > Master | 10 | 11% |
Other | 4 | 4% |
Other | 20 | 22% |
Unknown | 8 | 9% |
Readers by discipline | Count | As % |
---|---|---|
Psychology | 26 | 29% |
Linguistics | 22 | 24% |
Arts and Humanities | 7 | 8% |
Computer Science | 6 | 7% |
Social Sciences | 5 | 6% |
Other | 15 | 17% |
Unknown | 9 | 10% |