{"id":2510,"date":"2011-01-05T03:44:18","date_gmt":"2011-01-05T08:44:18","guid":{"rendered":"http:\/\/blogs.agu.org\/landslideblog\/?p=2510"},"modified":"2011-01-05T03:44:18","modified_gmt":"2011-01-05T08:44:18","slug":"google-ngrams-the-use-of-the-word-landslide","status":"publish","type":"post","link":"https:\/\/blogs.agu.org\/landslideblog\/2011\/01\/05\/google-ngrams-the-use-of-the-word-landslide\/","title":{"rendered":"Google Ngrams &#8211; the use of the word &#8220;landslide&#8221;"},"content":{"rendered":"<p>Google now has about a million books online dating from 1500 to 2008.\u00a0 An interesting tool that they have provided, and which is great fun with which to play, is the <a href=\"http:\/\/ngrams.googlelabs.com\/\">NGram viewer<\/a>,\u00a0which allows the user to search for, and graph,\u00a0the occurrence of specific words through time in these texts.\u00a0Note that the data are presented as the use of the word as a percentage of the total words in the texts.\u00a0<\/p>\n<p>It is quite fun to compare the use of four different words describing natural hazards.\u00a0 This graph shows the occurrence of the words &#8220;earthquake&#8221;, &#8220;flood&#8221; and &#8220;volcano&#8221; from 1500 to 2008 in the &#8220;Google Million&#8221; set of English books:<\/p>\n<p style=\"text-align: center\"><a href=\"https:\/\/blogs.agu.org\/landslideblog\/files\/2011\/01\/11_01-Ngram-1.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-2511\" src=\"https:\/\/blogs.agu.org\/landslideblog\/files\/2011\/01\/11_01-Ngram-1.jpg\" alt=\"\" width=\"554\" height=\"212\" srcset=\"https:\/\/blogs.agu.org\/landslideblog\/files\/2011\/01\/11_01-Ngram-1.jpg 924w, https:\/\/blogs.agu.org\/landslideblog\/files\/2011\/01\/11_01-Ngram-1-300x114.jpg 300w\" sizes=\"auto, (max-width: 554px) 100vw, 554px\" \/><\/a><\/p>\n<p>The dataset starts very noisy and then becomes more stable, with a substantial increase in the use of the terms from about 1750 
onwards.\u00a0 So let&#8217;s focus on the period 1800 to 2008:<\/p>\n<p style=\"text-align: center\"><a href=\"https:\/\/blogs.agu.org\/landslideblog\/files\/2011\/01\/11_01-Ngram-3.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-2512\" src=\"https:\/\/blogs.agu.org\/landslideblog\/files\/2011\/01\/11_01-Ngram-3.png\" alt=\"\" width=\"540\" height=\"198\" srcset=\"https:\/\/blogs.agu.org\/landslideblog\/files\/2011\/01\/11_01-Ngram-3.png 900w, https:\/\/blogs.agu.org\/landslideblog\/files\/2011\/01\/11_01-Ngram-3-300x110.png 300w\" sizes=\"auto, (max-width: 540px) 100vw, 540px\" \/><\/a><\/p>\n<p>Notice how the use of the word\u00a0&#8220;earthquake&#8221;\u00a0has some large spikes, whereas &#8220;volcano&#8221; is smoother.\u00a0 I don&#8217;t know enough about the history of these terms and events to be able to interpret this, but compare the above with the\u00a0same data for the\u00a0use of the word landslide:<\/p>\n<p style=\"text-align: center\"><a href=\"https:\/\/blogs.agu.org\/landslideblog\/files\/2011\/01\/11_01-Ngram-4.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-2513\" src=\"https:\/\/blogs.agu.org\/landslideblog\/files\/2011\/01\/11_01-Ngram-4.png\" alt=\"\" width=\"540\" height=\"198\" srcset=\"https:\/\/blogs.agu.org\/landslideblog\/files\/2011\/01\/11_01-Ngram-4.png 900w, https:\/\/blogs.agu.org\/landslideblog\/files\/2011\/01\/11_01-Ngram-4-300x110.png 300w\" sizes=\"auto, (max-width: 540px) 100vw, 540px\" \/><\/a><\/p>\n<p>The pattern is completely different (and note that the axis scale is also rather different).\u00a0 The use of the word landslide started to increase from 1800 onwards, and appears to have risen essentially\u00a0monotonically thereafter.\u00a0 There is a slight dip in the period from 1940 to 1950 (the effect of World War 2?), and the data for the term have become noisier in recent years.\u00a0 This could of course be a reflection of the use of the 
term in the context of elections (i.e. &#8220;landslide victory&#8221; and suchlike), but a search for that specific term suggests not:<\/p>\n<p style=\"text-align: center\"><a href=\"https:\/\/blogs.agu.org\/landslideblog\/files\/2011\/01\/11_01-Ngram-5.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-2514\" src=\"https:\/\/blogs.agu.org\/landslideblog\/files\/2011\/01\/11_01-Ngram-5.png\" alt=\"\" width=\"540\" height=\"198\" srcset=\"https:\/\/blogs.agu.org\/landslideblog\/files\/2011\/01\/11_01-Ngram-5.png 900w, https:\/\/blogs.agu.org\/landslideblog\/files\/2011\/01\/11_01-Ngram-5-300x110.png 300w\" sizes=\"auto, (max-width: 540px) 100vw, 540px\" \/><\/a><\/p>\n<p>Finally, compare the term &#8220;landslide&#8221; with the older term &#8220;landslip&#8221;:<\/p>\n<p style=\"text-align: center\"><a href=\"https:\/\/blogs.agu.org\/landslideblog\/files\/2011\/01\/11_01-Ngram-6.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-2515\" src=\"https:\/\/blogs.agu.org\/landslideblog\/files\/2011\/01\/11_01-Ngram-6.png\" alt=\"\" width=\"540\" height=\"198\" srcset=\"https:\/\/blogs.agu.org\/landslideblog\/files\/2011\/01\/11_01-Ngram-6.png 900w, https:\/\/blogs.agu.org\/landslideblog\/files\/2011\/01\/11_01-Ngram-6-300x110.png 300w\" sizes=\"auto, (max-width: 540px) 100vw, 540px\" \/><\/a><\/p>\n<p>It is clear that the latter term was much more common than &#8220;landslide&#8221; through to about 1900, and that as the use of the term &#8220;landslide&#8221; became more frequent the use of &#8220;landslip&#8221; reduced markedly.\u00a0<\/p>\n<p>I&#8217;ll come back to Google Ngrams in a subsequent post to look at the use of landslide terminology in more detail.\u00a0 It is a pretty cool tool.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Google now has about a million books 
online dating from 1500 to 2008.\u00a0 An interesting tool that they have provided, and which is great fun with which to play, is the NGram viewer,\u00a0which allows the user to search for, and graph,\u00a0the occurrence of specific words through time in these texts.\u00a0Note that the data are presented as the use of the word as a percentage of the total words in the &hellip;<\/p>\n","protected":false},"author":22,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[59,515,185,517,516,107],"class_list":["post-2510","post","type-post","status-publish","format-standard","hentry","category-uncategorized","tag-flood","tag-google","tag-landslide","tag-landslip","tag-ngram","tag-volcano"],"_links":{"self":[{"href":"https:\/\/blogs.agu.org\/landslideblog\/wp-json\/wp\/v2\/posts\/2510","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blogs.agu.org\/landslideblog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blogs.agu.org\/landslideblog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blogs.agu.org\/landslideblog\/wp-json\/wp\/v2\/users\/22"}],"replies":[{"embeddable":true,"href":"https:\/\/blogs.agu.org\/landslideblog\/wp-json\/wp\/v2\/comments?post=2510"}],"version-history":[{"count":0,"href":"https:\/\/blogs.agu.org\/landslideblog\/wp-json\/wp\/v2\/posts\/2510\/revisions"}],"wp:attachment":[{"href":"https:\/\/blogs.agu.org\/landslideblog\/wp-json\/wp\/v2\/media?parent=2510"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blogs.agu.org\/landslideblog\/wp-json\/wp\/v2\/categories?post=2510"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blogs.agu.org\/landslideblog\/wp-json\/wp\/v2\/tags?post=2510"}],"curies":[{"name":"wp","href":"http
s:\/\/api.w.org\/{rel}","templated":true}]}}