Googlewhack
A Googlewhack is a type of contest for finding a Google search query consisting of exactly two words without quotation marks, that returns exactly one hit. A Googlewhack must consist of two actual words found in a dictionary. A Googlewhack is considered legitimate if both of the searched-for words appear in the result page. For example, typing an asterisk (*) into the Google Search Bar turns up no results.
Published googlewhacks are short-lived, since when published to a web site, the new number of hits will become at least two, one to the original hit found, and one to the publishing site.[1]
History
The term Googlewhack, coined by Gary Stock, first appeared on the web at UnBlinking on January 8, 2002.[2] Subsequently, Stock created The Whack Stack, at googlewhack.com, to allow the verification and collection of user-submitted Googlewhacks.
Googlewhacks were the basis of British comedian Dave Gorman's comedy tour Dave Gorman's GoogleWhack Adventure and book of the same name.[3] In these Gorman tells the true story of how, while attempting to write a novel for his publisher, he became obsessed with Googlewhacks and travelled across the world finding people who had authored them. Although he never wrote his novel, Dave Gorman's GoogleWhack Adventure went on to be a Sunday Times #1 best seller in the UK.
Participants at Googlewhack.com discovered the sporadic "cleaner girl" bug in Google's search algorithm where "results 1-1 of thousands" were returned for two relatively common words[4] such as Anxiousness Scheduler[5] or Italianate Tablesides.[6]
Googlewhack went offline in November 2009 after Google stopped providing definition links. Gary Stock stated on the game's web page soon afterwards that he was pursuing solutions for Googlewhack to remain viable. However, the game has not come back into play, and there is no word of when or if that will happen.
Score
Some people propose the googlewhack "score", which is the product of the hits of the individual words.[7] Thus a googlewhack score is highest when the individual words produce a large number of hits.
Variations
New Scientist has discussed the idea of a Googlewhackblatt, which is similar to a Googlewhack except that it involves finding a single word that produces only one Google result. Lists of these have become available, but as with Googlewhacks they result in the Googlewhackblatt status of the word being destroyed—unless it is blocked by robots.txt or the word does not produce any Google results before it is added to the list, thus forming the Googlewhackblatt Paradox. Those words that do not produce any Google search results at all are known as Antegooglewhackblatts before they are listed—and subsequently elevated to Googlewhackblatt status if it is not blocked by robots.txt.
Feedback stories are also available on the New Scientist website, thus resulting in the destruction of any existing Googlewhackblatts that are ever printed in the magazine. Antegooglewhackblatts that are posted on the Feedback website become known as Feedbackgooglewhackblatts as their Googlewhackblatt status is created. In addition, New Scientist has more recently discovered another way to obtain a Googlewhackblatt without falling into the Googlewhackblatt Paradox. One can write the Googlewhackblatt on a website, but backwards, and then search on elgooG to view the list properly while still keeping the Googlewhackblatt's status as a Googlewhackblatt.
In contrast to Googlewhacks, many Googlewhackblatts and Antegooglewhackblatts are nonsense words or uncommon misspellings that are not in dictionaries and probably never will be.
A practical use of specially constructed Googlewhackblatts was proposed by Leslie Lamport (although he did not use the term).[8]
Research applications
The probabilities of internet search result values for multi-word queries was studied in 2008 with the help of Googlewhacks.[9][10][11] Based on data from 351 Googlewhacks from the whackstack, the Heaps' law coefficient for the indexed World Wide Web (about 8 billion pages in 2008) was measured to be . This result is in line with previous studies which used under 20,000 pages.[12] The googlewhacks were a key in calibrating the model so that it could be extended automatically to analyse the relatedness of word pairs.
See also
- Googlefight
- Hapax legomenon
- Statistically Improbable Phrases – finds phrases in Amazon books unlikely to appear in any other book indexed
References
- ↑ "Googlewhack official rules". Googlewhack.com. Retrieved 2014-03-28.
- ↑ "Googlewhacking: The Search for The One True Googlewhack". Unblinking.com. Retrieved 2014-03-28.
- ↑ Gorman, Dave (2005). Dave Gorman's Googlewhack! adventure. London: Ebury. ISBN 0091897424.
- ↑ "Googlewhack NACK!". Googlewhack.com. Retrieved 2014-03-28.
- ↑ Essex, Mike (2012-02-13). "Anxiousness Scheduler". Blog.blagman.co.uk. Retrieved 2014-03-28.
- ↑ "italianate tablesides". Googlewhack.com. Retrieved 2014-03-28.
- ↑ googlewhack scoring is discussed numerous places, e.g.,:
- ↑ Archival References to Web Pages, Ninth International World Wide Web Conference: Poster Proceedings (May 2000)
- ↑ Lansey JC, Bukiet B (January 2009). "Internet Search Result Probabilities, Heaps' Law and Word Associativity". Journal of Quantitative Linguistics 16 (1): 40–66. doi:10.1080/09296170802514153.
- ↑ Googlewhacks for Fun and Profit on YouTube Google Tech Talk 2008
- ↑ "Poster Presentation" (PDF). Retrieved 2014-03-28.
- ↑ Ricardo Baeza-Yates and Berthier Ribeiro-Neto, Modern Information Retrieval, ACM Press, 1999.
Further reading
- Bowman, Lisa M (29 January 2002). "Have you Googlewhacked?". CNET News. Retrieved 2012-12-31.
- "'Googlewhacking' a new activity for the searchers". London: Reuters. 6 February 2002.
- "'A New Word On the Internet'". New York: The New Yorker. 1 July 2015.