Hatebase
Hatebase is a joint project of the Sentinel Project for Genocide Prevention and Mobiocracy, described on its website as an "online repository of structured, multilingual, usage-based hate speech". It analyzes speech and written content (including radio transcripts, transcripts of spoken web content, tweets, and articles) and identifies hate speech patterns within that content in order to predict potential regional violence.[1] The full source code for the API is available as open source on GitHub.[2]
History
The introduction of Hatebase was announced on the Sentinel Project blog on March 25, 2013.[3][4] The initiative is led by Timothy Quinn of Mobiocracy.[5][3]
Description
In an article for Foreign Policy, Joshua Keating described Hatebase as follows: "There are two main features to Hatebase. The first is a Wikipedia-like interface which allows users to identify hate speech terms by region and the group they refer to. This could have some value for researchers, but Hatebase's developers are especially excited by the second main feature, which allows users to identify instances when they've heard these terms used."[6] The example of the Rwandan Genocide was cited in that article and also in a Maclean's article about Hatebase: in the months leading up to the genocide, radio stations sought to dehumanize Tutsis in the eyes of Hutus by repeatedly referring to them as cockroaches.[5]
The site's regional and multilingual focus was deemed particularly useful for identifying words that can be construed as hateful in some languages and contexts but that outsiders would not recognize. Examples include the word "sakkiliya" in Sinhalese (a language spoken in Sri Lanka), used to refer to a Tamil person as 'a very unhygienic or uncultured person',[7] and the reference to Tutsis as cockroaches by Rwandan radio stations, which an outsider might simply take as evidence that the region was suffering from a literal cockroach infestation.[8][6] This relates to the challenge of distinguishing subtly different uses of the same or similar words, one of which connotes hate while the other does not.[6] In the context of language that equates humans with pollution or stains, this is also called the human stain problem.
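The idea of tagging terms by region and target group can be illustrated with a minimal Python sketch. It is not Hatebase's actual data model or API: the `TermEntry` type, the `VOCABULARY` list, and the `flag_terms` function are hypothetical names introduced here, and the two entries simply restate the examples given above.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class TermEntry:
    term: str          # word as it would appear in monitored text
    language: str      # language the entry is recorded under
    region: str        # region where the usage is considered hateful
    target_group: str  # group the term is directed at


# Illustrative entries built from the two examples in the text above.
# "cockroaches" is the English gloss used in the article, standing in
# for the original-language term.
VOCABULARY = [
    TermEntry("sakkiliya", "Sinhalese", "Sri Lanka", "Tamil"),
    TermEntry("cockroaches", "English gloss", "Rwanda", "Tutsi"),
]


def flag_terms(text: str, region: str, vocabulary=VOCABULARY) -> list[TermEntry]:
    """Return entries whose term occurs in `text` and whose region matches."""
    tokens = set(re.findall(r"\w+", text.lower()))
    return [
        entry
        for entry in vocabulary
        if entry.region == region and entry.term.lower() in tokens
    ]
```

Under these assumptions, a sentence containing "cockroaches" is flagged when the region is "Rwanda" but not when it is some other region. The region filter only narrows the candidates: a literal report of an insect infestation in the same region would still match, so the ambiguity described above remains a matter for human review.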
Another related challenge is to control for the ambient level of casual hate speech in society (such as in YouTube comments): in some societies and contexts, hateful language may not be accompanied or followed by violence, whereas in others it might. For this reason, the evidence was only considered valuable in conjunction with other evidence about the risk and threat of violence, and the project concentrated its efforts on mapping hate speech in regions with a history of violence.[6]
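One way to picture controlling for the ambient level is to compare the observed rate of flagged terms with a region's usual rate, and to act on the result only when other risk evidence is present. The sketch below is an assumption-laden illustration, not a method documented by the project: the function names, the `baseline_rate` input, and the default `threshold` of 2.0 are all hypothetical.

```python
def relative_rate(sightings: int, total_messages: int, baseline_rate: float) -> float:
    """Ratio of the observed hate-term rate to the region's usual (ambient) rate."""
    if total_messages <= 0:
        raise ValueError("total_messages must be positive")
    if baseline_rate <= 0:
        raise ValueError("baseline_rate must be positive")
    return (sightings / total_messages) / baseline_rate


def warrants_review(ratio: float, history_of_violence: bool, threshold: float = 2.0) -> bool:
    """Flag only when speech is elevated AND other risk evidence is present.

    The threshold is an arbitrary placeholder, not a value used by Hatebase.
    """
    return history_of_violence and ratio >= threshold
```

A ratio well above 1 indicates that flagged terms are appearing more often than is usual for that region. In this sketch an elevated ratio alone is never enough to flag a region, which mirrors the point that hate speech evidence was only considered valuable alongside other indicators.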
API
The application programming interface (API) for Hatebase is available on GitHub, along with all the source code.[2] Information about the API can also be found at Programmable Web[9] and Mashape.[10]
Reception
The launch of Hatebase was covered in Wired Magazine,[7] and the story was picked up and discussed on Slashdot.[11] Hatebase was also covered in two Canadian publications, Metro News[8] and the weekly Maclean's.[5]
Joshua Keating covered Hatebase in an article for Foreign Policy.[6] A week later, the magazine published a response letter by Gwyneth Sutherlin, a doctoral candidate at the University of Bradford, pointing out potential problems and limitations of the approach used by Hatebase.[12]
References
- [1] "Hatebase". Hatebase. Retrieved May 19, 2014.
- [2] "Hatebase API Wrapper". GitHub. Retrieved May 19, 2014.
- [3] Quinn, Timothy (March 25, 2013). "Introducing Hatebase: the world’s largest online database of hate speech". Retrieved May 19, 2014.
- [4] "Introducing Hatebase: the world’s largest online database of hate speech (The Sentinel Project for Genocide Prevention)". International Coalition for the Responsibility to Protect. March 25, 2013. Retrieved May 19, 2014.
- [5] "Hatebase: An anti-genocide app. An NGO hopes tracking hateful tweets can flag mounting ethnic conflict, and even prevent genocide". Maclean's. May 8, 2013. Retrieved May 19, 2014.
- [6] Keating, Joshua (April 1, 2013). "Mapping hate speech to predict ethnic violence". Foreign Policy. Retrieved May 19, 2014.
- [7] Shubber, Kadhim (April 5, 2013). "Crowdsourced hate speech database could spot early signs of genocide". Wired Magazine. Retrieved May 19, 2014.
- [8] Smith Cross, Jessica (April 11, 2013). "‘Hatebase’ aims to prevent genocide by tracking hate speech". Metro News. Retrieved May 19, 2014.
- [9] "Hatebase API". Programmable Web. Retrieved May 19, 2014.
- [10] "Hatebase". Mashape. Retrieved May 19, 2014.
- [11] "Hatebase Tries To Scan For Precursors of Genocide In Language". Slashdot. April 6, 2013. Retrieved May 19, 2014.
- [12] Keating, Joshua; Sutherlin, Gwyneth (April 11, 2013). "Letters: The problem with mapping hate speech". Foreign Policy. Retrieved May 19, 2014.