<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>VentureBeat &#187; hate speech</title>
	<atom:link href="http://venturebeat.com/tag/hate-speech/feed/" rel="self" type="application/rss+xml" />
	<link>http://venturebeat.com</link>
	<description>News About Tech, Money and Innovation</description>
	<lastBuildDate>Sun, 26 May 2013 05:17:35 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='venturebeat.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://0.gravatar.com/blavatar/c6d8c27ffa1c5a7f106f97e434437baf?s=96&#038;d=http%3A%2F%2Fs2.wp.com%2Fi%2Fbuttonw-com.png</url>
		<title>VentureBeat &#187; hate speech</title>
		<link>http://venturebeat.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://venturebeat.com/osd.xml" title="VentureBeat" />
	<atom:link rel='hub' href='http://venturebeat.com/?pushpress=hub'/>
<copyright>Copyright 2013, VentureBeat</copyright>		<item>
		<title>Twitter &#8216;Hate Map&#8217; shows where racist, homophobic, and offensive tweets originate</title>
		<link>http://venturebeat.com/2013/05/14/twitter-hate-map-shows-where-racist-homophobic-and-offensive-tweets-originate/</link>
		<comments>http://venturebeat.com/2013/05/14/twitter-hate-map-shows-where-racist-homophobic-and-offensive-tweets-originate/#comments</comments>
		<pubDate>Tue, 14 May 2013 21:25:43 +0000</pubDate>
		<dc:creator>John Koetsier</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Health]]></category>
		<category><![CDATA[Media]]></category>
		<category><![CDATA[Social]]></category>
		<category><![CDATA[featured]]></category>
		<category><![CDATA[hate map]]></category>
		<category><![CDATA[hate speech]]></category>
		<category><![CDATA[Humboldt State University]]></category>
		<category><![CDATA[Twitter]]></category>

		<guid isPermaLink="false">http://venturebeat.com/?p=737541</guid>
		<description><![CDATA[<p>Students at Humboldt State University in California individually reviewed 150,000 geocoded tweets containing racist, homophobic, or otherwise offensive terms to build a "hate map" indicating where people in the U.S. are most&#160;bigoted.</p>
]]></description>
				<content:encoded><![CDATA[<p><a href="http://venturebeat.files.wordpress.com/2013/05/screen-shot-2013-05-14-at-1-41-43-pm.png" target="_blank"><img class="aligncenter size-full wp-image-737550" alt="hate speech maps" src="http://venturebeat.files.wordpress.com/2013/05/screen-shot-2013-05-14-at-1-41-43-pm.png?w=1024&#038;h=483" width="1024" height="483" /></a>Students at Humboldt State University in California individually reviewed 150,000 geocoded tweets containing racist, homophobic, or otherwise offensive terms to <a href="http://users.humboldt.edu/mstephens/hate/hate_map.html#" target="_blank">build a &#8220;hate map&#8221;</a> indicating where people in the U.S. are most bigoted.</p>
<p>Or, at least, where they&#8217;re the most open about displaying their antisocial views.</p>
<p>The picture doesn&#8217;t look good for the Eastern states, although admittedly the bulk of the population is there as well. Areas in Virginia, North Carolina, Texas, and Alabama show up bright red on the map, as do areas in more central states Indiana, Iowa, and Minnesota.</p>
<p>The map is part of a larger project, called the Geography of Hate, by Humboldt State professor Dr. Monica Stephens. The data that forms the map comes from an analysis of every tweet posted between June 2012 and April 2013 that contained at least one of 10 designated &#8220;hate words,&#8221; including dyke, fag, chink, gook, wetback, and cripple.</p>
<p><div id="attachment_737570" class="wp-caption alignright" style="width: 287px"><a href="http://venturebeat.files.wordpress.com/2013/05/screen-shot-2013-05-14-at-2-13-28-pm.png" target="_blank"><img class="size-medium wp-image-737570" alt="California seems relatively hate-free" src="http://venturebeat.files.wordpress.com/2013/05/screen-shot-2013-05-14-at-2-13-28-pm.png?w=277&#038;h=400" width="277" height="400" /></a><div class="vb_image_source"><span>Source:</span> Hate Map</div><p class="wp-caption-text">California seems relatively hate-free</p></div>
<p>But while the original list of tweets was generated by a machine, every single one of the 150,000 tweets containing one of the target words was individually examined by undergraduate students. As the project description states:</p>
<blockquote><p>Because algorithmic sentiment analysis would automatically classify any tweet containing &#8220;hate words&#8221; as &#8220;negative,&#8221; this project relied upon the HSU students to read the entirety of tweet and classify it as positive, neutral or negative based on a predefined rubric. Only those tweets that were identified by human readers as negative were used in this analysis.</p>
</blockquote>
<p>To protect the identity of potentially racist, homophobic, or otherwise bigoted Twitter users, the tweets were aggregated up to the county level, and counties with high levels of hate speech were colored red on the map. Areas with moderate levels &#8212; though still higher than the national average &#8212; are varying shades of blue, and unshaded areas were below the national average.</p>
<p>Smaller towns seem to have a higher incidence of hate speech &#8212; in Virginia, for example, Palmyra is more hateful on Twitter than Richmond. And in Louisiana, New Orleans and Baton Rouge are less hateful than smaller towns nearby.</p>
<p><em>Image credits: Geography of Hate</em></p>
<br />Filed under: <a href='http://venturebeat.com/category/big-data/'>Big Data</a>, <a href='http://venturebeat.com/category/health/'>Health</a>, <a href='http://venturebeat.com/category/media/'>Media</a>, <a href='http://venturebeat.com/category/social/'>Social</a>]]></content:encoded>
			<wfw:commentRss>http://venturebeat.com/2013/05/14/twitter-hate-map-shows-where-racist-homophobic-and-offensive-tweets-originate/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	<enclosure url="http://venturebeat.files.wordpress.com/2013/05/screen-shot-2013-05-14-at-1-41-43-pm.png?w=160" /><source url="http://venturebeat.com/2013/05/14/twitter-hate-map-shows-where-racist-homophobic-and-offensive-tweets-originate/">Twitter &#8216;Hate Map&#8217; shows where racist, homophobic, and offensive tweets originate</source>
		<media:thumbnail url="http://venturebeat.files.wordpress.com/2013/05/screen-shot-2013-05-14-at-1-41-43-pm.png?w=160" />
		<media:content url="http://venturebeat.files.wordpress.com/2013/05/screen-shot-2013-05-14-at-1-41-43-pm.png?w=160" medium="image">
			<media:title type="html">hate speech maps</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/6d4d24b12c84be6eecddf121bc3fee48?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">johnkoetsier</media:title>
		</media:content>

		<media:content url="http://venturebeat.files.wordpress.com/2013/05/screen-shot-2013-05-14-at-1-41-43-pm.png" medium="image">
			<media:title type="html">hate speech maps</media:title>
		</media:content>

		<media:content url="http://venturebeat.files.wordpress.com/2013/05/screen-shot-2013-05-14-at-2-13-28-pm.png?w=277" medium="image">
			<media:title type="html">California seems relatively hate-free</media:title>
		</media:content>
	</item>
		<item>
		<title>Twitter is under pressure to identify the racist users behind #unbonjuif</title>
		<link>http://venturebeat.com/2013/01/24/twitter-is-under-pressure-to-identify-the-racist-users-behind-unbonjuif/</link>
		<comments>http://venturebeat.com/2013/01/24/twitter-is-under-pressure-to-identify-the-racist-users-behind-unbonjuif/#comments</comments>
		<pubDate>Thu, 24 Jan 2013 20:09:07 +0000</pubDate>
		<dc:creator>Christina Farr</dc:creator>
				<category><![CDATA[Social]]></category>
		<category><![CDATA[court ruling]]></category>
		<category><![CDATA[discrimination]]></category>
		<category><![CDATA[French Twitter]]></category>
		<category><![CDATA[hate speech]]></category>
		<category><![CDATA[Internet policy]]></category>
		<category><![CDATA[racist tweets]]></category>

		<guid isPermaLink="false">http://venturebeat.com/?p=609972</guid>
		<description><![CDATA[<p>A French court ruled today that Twitter must identify its racist tweeters so they can be prosecuted, a decision that the social networking company is still&#160;evaluating.</p>
]]></description>
				<content:encoded><![CDATA[<p><a href="http://venturebeat.com/2012/10/09/sproutsocial-grows-revenue-750-as-social-media-bursts-out-of-the-marketing-department/twitter-explosion/" rel="attachment wp-att-547739"><img class="aligncenter size-full wp-image-547739" alt="twitter-explosion" src="http://venturebeat.files.wordpress.com/2012/10/twitter-explosion.jpg?w=665&#038;h=414" width="665" height="414" /></a></p>
<p>A French court ruled today that Twitter must identify its racist tweeters in France so they can be prosecuted, a decision that the social networking company is still evaluating.</p>
<p>This decision only applies to Twitter users in France, and it follows a slew of anti-Semitic tweets that violate that country&#8217;s laws on hate speech. In France, a nation with strict laws against anti-Semitism, <a href="http://www.slate.fr/story/58669/twitter-racisme-proces" target="_blank">no one has ever faced legal action for racist tweets</a>.</p>
<p>French ministers are also facing pressure from anti-discrimination groups in France. In October, the French Union of Jewish Students (UEJF) launched a petition alleging that it is too difficult to report and quickly remove offensive content.</p>
<p>The offending tweets used the hashtag #unbonjuif, which means &#8220;a good Jew&#8221; and was created to ridicule the Jewish community. Two others &#8212; #SiMonFilsEstGay (#Ifmysonwasgay) and #unbonmusulman (#agoodMuslim) &#8212; also surfaced in October. According to the French newspaper Le Monde, #unbonjuif was the third most tweeted subject in France on Oct. 10, and continued to be used for several days.</p>
<p>Twitter said today in a statement that &#8220;we are currently reviewing the court&#8217;s decision.&#8221; The company&#8217;s lawyer, Alexandra Neri, argued in October that Twitter&#8217;s data on users was collected and stored in California and that the French justice system would need to appeal to American judges to hand over this data. But the company agreed to delete the offensive tweets.</p>
<p>During the case, Twitter&#8217;s lawyers also argued that there are numerous methods to report or flag abusive posts.</p>
<p>Twitter is weighing its commitment to user expression against the potential damage that such objectionable tweets may inflict. It is already taking steps by proactively removing terms like &#8220;swastika&#8221; from its trending topics list, <a href="http://www.google.com/hostednews/afp/article/ALeqM5gMdGDU3HD4fGRjd9YCBZ-amQVB-w" target="_blank">and as AFP reports</a>, it has deleted some of the anti-Semitic tweets from October. Among them:</p>
<div id="attachment_610007" class="wp-caption aligncenter" style="width: 310px"><a href="http://venturebeat.com/2013/01/24/twitter-is-under-pressure-to-identify-the-racist-users-behind-unbonjuif/screen-shot-2013-01-24-at-11-32-33-am/" rel="attachment wp-att-610007"><img class="size-medium wp-image-610007" alt="One user tweeted, &quot;A Good Jew can inflate his tire with his nose.&quot;" src="http://venturebeat.files.wordpress.com/2013/01/screen-shot-2013-01-24-at-11-32-33-am.png?w=300&#038;h=83" width="300" height="83" /></a><p class="wp-caption-text">&#8220;A Good Jew can inflate his tire with his nose.&#8221;</p></div>
<p>Twitter&#8217;s official position is that it does not moderate content. But it can instantly remove potential child abuse and suspend accounts. Last year, in an unprecedented move, Twitter complied with a request by German authorities to block the account of a neo-Nazi group.</p>
<p>This case may test the refusal of Twitter and other social networking sites to moderate content.</p>
<br />Filed under: <a href='http://venturebeat.com/category/social/'>Social</a>]]></content:encoded>
			<wfw:commentRss>http://venturebeat.com/2013/01/24/twitter-is-under-pressure-to-identify-the-racist-users-behind-unbonjuif/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	<enclosure url="http://venturebeat.files.wordpress.com/2013/01/screen-shot-2013-01-24-at-11-32-33-am.png?w=160" /><source url="http://venturebeat.com/2013/01/24/twitter-is-under-pressure-to-identify-the-racist-users-behind-unbonjuif/">Twitter is under pressure to identify the racist users behind #unbonjuif</source>
		<media:content url="http://2.gravatar.com/avatar/54db9fa0da02d1fe98a5197333d6d08f?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">christinafarr</media:title>
		</media:content>

		<media:content url="http://venturebeat.files.wordpress.com/2012/10/twitter-explosion.jpg" medium="image">
			<media:title type="html">twitter-explosion</media:title>
		</media:content>

		<media:content url="http://venturebeat.files.wordpress.com/2013/01/screen-shot-2013-01-24-at-11-32-33-am.png?w=300" medium="image">
			<media:title type="html">One user tweeted, &#34;A Good Jew can inflate his tire with his nose.&#34;</media:title>
		</media:content>
	</item>
		<item>
		<title>Data scientists develop tool to exterminate spammers and trolls</title>
		<link>http://venturebeat.com/2012/10/21/impermium/</link>
		<comments>http://venturebeat.com/2012/10/21/impermium/#comments</comments>
		<pubDate>Sun, 21 Oct 2012 17:16:16 +0000</pubDate>
		<dc:creator>Christina Farr</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Business]]></category>
		<category><![CDATA[Dev]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[data scientists]]></category>
		<category><![CDATA[hate speech]]></category>
		<category><![CDATA[spam]]></category>

		<guid isPermaLink="false">http://venturebeat.com/?p=560338</guid>
		<description><![CDATA[<p>According to a Silicon Valley-based startup, there is a "big data solution" for one of the Internet's most common&#160;afflictions.</p>
]]></description>
				<content:encoded><![CDATA[<p><a href="http://venturebeat.com/2012/10/21/impermium/impermium-kaggle/" rel="attachment wp-att-560876"><img class="alignnone size-full wp-image-560876" title="impermium-kaggle" alt="" src="http://venturebeat.files.wordpress.com/2012/10/impermium-kaggle.jpg?w=655&#038;h=435" height="435" width="655" /></a></p>
<p>According to a Silicon Valley-based startup, there is a &#8220;big data solution&#8221; for one of the Internet&#8217;s most common afflictions.</p>
<p><a href="http://impermium.com" target="_blank">Impermium</a>, a Silicon Valley-based, venture-backed company that specializes in removing spam, has launched a new tool that scans tens of millions of comments across the web. It spots highly offensive words and automatically removes them.</p>
<p>According to the company&#8217;s chief executive, Mark Risher, a tool like this is the first of its kind as algorithms have not traditionally been able to handle “internet speak” (coooooool, skilz, pr0n, and so on).</p>
<p>It took several months and the brainpower of leading data scientists to develop the technology. Spammers have evolved to become more sophisticated. &#8220;These are people that are actively trying to avoid getting caught,&#8221; said Risher in a phone interview with VentureBeat.</p>
<p>&#8220;Trolls and hackers and social spammers are changing their techniques to avoid detection,&#8221; he said.</p>
<p>As Facebook and Twitter reach their ascendancy, it has become more imperative that we find a way to remove spam. Recent data from security company Barracuda Labs sheds light on the extent of the problem. The study found that one in four Facebook users have received a virus or malware, often something posted to their public wall.</p>
<p>It&#8217;s a more profound problem for small businesses &#8212; the appearance of hate speech on their site is a reputation killer.</p>
<h3>The competition</h3>
<p>Rather than use an internal data services team, the company turned to <a href="http://kaggle.com" target="_blank">Kaggle</a>, a startup that hosts data-driven competitions.</p>
<p>From around the world, statisticians competed to build a tool that combines machine learning and natural language processing to root out malicious commentary. The winner received $7,000 &#8212; their solution was the most accurate with a false positive rate of less than one percent.</p>
<p>After posting the competition, the company received 154 submissions from contenders, and made some interesting discoveries about the nature of hate speech. In the words of Kaggle&#8217;s CEO, Anthony Goldbloom:</p>
<h3>The key takeaways</h3>
<ul>
<li>North Dakota is the most rambunctious state (6.4 percent of traffic was insulting) and Maine is the least (only 3.7 percent of traffic was considered spam or hate speech).</li>
<li>People on the internet are more malicious than you might expect (&#8220;there was no difficulty finding enough insults&#8221;).</li>
<li>Text with the word &#8220;mom&#8221; is more likely to constitute an insult.</li>
<li>The word f*#k is a surprisingly poor indicator that text contains an insult. It is as often “f*#k yeah” as it is “f*#k you”.</li>
</ul>
<p>According to Risher, the task was more technically challenging than expected, mainly because it takes a trained human eye to detect hate speech. For instance, the algorithm might flag a user for regurgitating lyrics to a popular rap song. Alternatively, it may fail to detect a malicious comment that is veiled in sarcasm.</p>
<p>&#8220;People are continuing to find new ways to insult each other,&#8221; said Risher. The self-described &#8220;spam Czar&#8221; left his job at Yahoo to focus on social media sites. In his former career, he was responsible for mitigating email spam.</p>
<p>Launching this week, the new tool, called &#8220;Intelligent Content Protection,&#8221; is already used by content-heavy sites like WordPress, Disqus and Livefyre. Pricing is flexible, but it&#8217;s about $2,000 to $3,000 per month, significantly less than the cost of a human editor.</p>
<p>Impermium launched in 2011, and has received funding from Charles River Ventures, Accel Partners, and others.</p>
<p><a href="http://www.shutterstock.com/pic-44605993/stock-photo-computer-on-a-desk-with-a-crime-scene-label.html?src=csl_recent_image-1" target="_blank"><em>Image via Shutterstock</em></a></p>
<br />Filed under: <a href='http://venturebeat.com/category/big-data/'>Big Data</a>, <a href='http://venturebeat.com/category/business/'>Business</a>, <a href='http://venturebeat.com/category/dev/'>Dev</a>]]></content:encoded>
			<wfw:commentRss>http://venturebeat.com/2012/10/21/impermium/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	<enclosure url="http://venturebeat.files.wordpress.com/2012/10/impermium-kaggle.jpg?w=160" /><source url="http://venturebeat.com/2012/10/21/impermium/">Data scientists develop tool to exterminate spammers and trolls</source>
		<media:content url="http://2.gravatar.com/avatar/54db9fa0da02d1fe98a5197333d6d08f?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">christinafarr</media:title>
		</media:content>

		<media:content url="http://venturebeat.files.wordpress.com/2012/10/impermium-kaggle.jpg" medium="image">
			<media:title type="html">impermium-kaggle</media:title>
		</media:content>
	</item>
	</channel>
</rss>
