<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>VentureBeat &#187; data scientist</title>
	<atom:link href="http://venturebeat.com/tag/data-scientist/feed/" rel="self" type="application/rss+xml" />
	<link>http://venturebeat.com</link>
	<description>News About Tech, Money and Innovation</description>
	<lastBuildDate>Sat, 18 May 2013 09:05:13 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='venturebeat.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://0.gravatar.com/blavatar/c6d8c27ffa1c5a7f106f97e434437baf?s=96&#038;d=http%3A%2F%2Fs2.wp.com%2Fi%2Fbuttonw-com.png</url>
		<title>VentureBeat &#187; data scientist</title>
		<link>http://venturebeat.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://venturebeat.com/osd.xml" title="VentureBeat" />
	<atom:link rel='hub' href='http://venturebeat.com/?pushpress=hub'/>
<copyright>Copyright 2013, VentureBeat</copyright>		<item>
		<title>Want a career in &#8216;big data&#8217;? You&#8217;ll need more than a degree</title>
		<link>http://venturebeat.com/2013/04/29/want-a-career-in-big-data-youll-need-more-than-a-degree/</link>
		<comments>http://venturebeat.com/2013/04/29/want-a-career-in-big-data-youll-need-more-than-a-degree/#comments</comments>
		<pubDate>Mon, 29 Apr 2013 17:06:31 +0000</pubDate>
		<dc:creator>Rebecca Grant</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Business]]></category>
		<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[business intelligence]]></category>
		<category><![CDATA[data scientist]]></category>
		<category><![CDATA[education]]></category>
		<category><![CDATA[featured]]></category>
		<category><![CDATA[professor]]></category>
		<category><![CDATA[student]]></category>
		<category><![CDATA[study]]></category>
		<category><![CDATA[universitiy]]></category>

		<guid isPermaLink="false">http://venturebeat.com/?p=727227</guid>
		<description><![CDATA[<p>Teradata's third State of Business Intelligence survey finds a significant gap between students interested in data-savvy careers and the number of employable candidates in the&#160;field.</p>
<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=venturebeat.com&#038;blog=342986&#038;post=727227&#038;subd=venturebeat&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://venturebeat.com/2013/04/29/want-a-career-in-big-data-youll-need-more-than-a-degree/shutterstock_127082789/" rel="attachment wp-att-727239"><img class="alignnone size-full wp-image-727239" alt="shutterstock_127082789" src="http://venturebeat.files.wordpress.com/2013/04/shutterstock_127082789.jpg?w=1000&#038;h=664" width="1000" height="664" /></a>Data scientists are like the elite athletes of the enterprise world, but there are far too few of them to meet demand.</p>
<p><a href="http://www.teradata.com" target="_blank">Teradata</a> released the third <a href="http://www.teradata.com/News-Releases/2013/Solving-the-Jobs-Gap-for-Big-Data-Analytics-Careers-Requires-Access-to-Cutting-Edge-Technology-and-Big-Data-Survey/" target="_blank">State of Business Intelligence Survey</a> today, which found that interest in big data analytics careers is on the rise. However, there is a significant gap between the number of students interested in these opportunities and the amount of hirable talent in the pipeline.</p>
<p>Working as a data scientist (or related position) requires taking a large data set, modeling it, and gleaning insights. As the hype surrounding big data continues, the demand for people with these skill sets is increasing, along with salaries. But just like top tier athletics, not everyone has the requisite talent or ability to follow a career in this field and <a href="http://venturebeat.com/2012/08/07/study-data-scientists-top-dogs/">there is a major, global shortage of data scientists.</a> And despite the fact that two out of three students surveyed agree or strongly agree that there are job opportunities for them in data-savvy careers, only 16 percent are actually considering careers in this area.</p>
<p>Accordingly, employers still struggle to find potential hires. Analytics hiring managers reported that their largest needs for recent graduates are for IT or systems analytics (35%), program developers (32%), data managers (30%), and business analytics (22%). In the survey, one-third of employers identified a lack of experience as their greatest challenge, followed by insufficient business skills, insufficient technical skills, and a general lack of candidates. A lack of communication skills is also an issue.</p>
<p>Preparing students for real-world employment is partly the responsibility of the teachers. Forty-one percent of professors reported an increase in the number of BI or analytics courses at their university, but they also identified six areas where businesses can help them meet the challenges of creating qualified employees: providing large data sets, suitable cases, staying current with practices, technical support and training, realistic and meaningful experiences, and access to contemporary enterprise software.</p>
<p>The tech industry is a competitive job market and companies are always looking for qualified, experienced data scientists who combine technical expertise with creative thinking and communication skills. The responsibility does not lay only on the students, but on the universities, teachers, and companies that turn them from students into successful employees. A degree alone is not enough.</p>
<p>The survey was conducted by Barbara Wixom, an associate professor of commerce at University of Virginia&#8217;s McIntire School for Commerce and a research affiliate at MIT&#8217;s Sloan School of management. The study was sponsored by the Teradata University Network, which includes 3,400 faculty members, 1,600 universities, and thousands of students. TUN is a free web portal for students and teachers working in this field.</p>
<p><em>Photo Credit: Shutterstock</em></p>
<br />Filed under: <a href='http://venturebeat.com/category/big-data/'>Big Data</a>, <a href='http://venturebeat.com/category/business/'>Business</a>, <a href='http://venturebeat.com/category/enterprise/'>Enterprise</a>  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=venturebeat.com&#038;blog=342986&#038;post=727227&#038;subd=venturebeat&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://venturebeat.com/2013/04/29/want-a-career-in-big-data-youll-need-more-than-a-degree/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	<enclosure url="http://venturebeat.files.wordpress.com/2013/04/shutterstock_127082789.jpg?w=160" /><source url="http://venturebeat.com/2013/04/29/want-a-career-in-big-data-youll-need-more-than-a-degree/">Want a career in &#8216;big data&#8217;? You&#8217;ll need more than a degree</source>
		<media:content url="http://0.gravatar.com/avatar/fec4e66421afed673eb1ac50b8f839d8?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">rebeccaggrant</media:title>
		</media:content>

		<media:content url="http://venturebeat.files.wordpress.com/2013/04/shutterstock_127082789.jpg" medium="image">
			<media:title type="html">shutterstock_127082789</media:title>
		</media:content>
	</item>
		<item>
		<title>New York University wants to train the next generation of data scientists</title>
		<link>http://venturebeat.com/2013/02/19/new-york-university-launches-initiative-to-train-the-next-generation-of-data-scientists/</link>
		<comments>http://venturebeat.com/2013/02/19/new-york-university-launches-initiative-to-train-the-next-generation-of-data-scientists/#comments</comments>
		<pubDate>Tue, 19 Feb 2013 17:57:23 +0000</pubDate>
		<dc:creator>Christina Farr</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Entrepreneur]]></category>
		<category><![CDATA[New York]]></category>
		<category><![CDATA[data research]]></category>
		<category><![CDATA[data science shortage]]></category>
		<category><![CDATA[data scientist]]></category>
		<category><![CDATA[featured]]></category>
		<category><![CDATA[machine learning experts]]></category>
		<category><![CDATA[new data science program]]></category>
		<category><![CDATA[New york startups]]></category>
		<category><![CDATA[New York University data science]]></category>
		<category><![CDATA[NYU]]></category>
		<category><![CDATA[training in data science]]></category>

		<guid isPermaLink="false">http://venturebeat.com/?p=624474</guid>
		<description><![CDATA[<p>NYU will create a Center for Data Science composed of faculty from across the university and will offer a variety of graduate-degree programs to its&#160;students.</p>
<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=venturebeat.com&#038;blog=342986&#038;post=624474&#038;subd=venturebeat&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://venturebeat.com/2013/02/19/new-york-university-launches-initiative-to-train-the-next-generation-of-data-scientists/nyu/" rel="attachment wp-att-624531"><img class="alignleft size-full wp-image-624531" alt="nyu" src="http://venturebeat.files.wordpress.com/2013/02/nyu.jpg?w=655&#038;h=437" width="655" height="437" /></a></p>
<p>Everybody and their mother is on the hunt for a data scientist, dubbed the &#8220;sexiest job of the 21st century,&#8221; by the Harvard Business Review.</p>
<p>To train the next generation, New York University (NYU) has launched a data science and statistics initiative, which is intended to drive breakthrough research in other programs, including medicine and technology. <a href="http://datascience.nyu.edu" target="_blank">The new website</a> for the program <a href="http://www.mckinsey.com/insights/mgi/research/technology_and_innovation/big_data_the_next_frontier_for_innovation" target="_blank">cites research</a> from McKinsey that projects the U.S. alone will need 140,000 to 190,000 people with deep analytic skills by 2018 as well as 1.5 million managers and analysts for &#8220;big data&#8221; jobs.</p>
<p><span style="font-size:13px;line-height:19px;">NYU has created a Center for Data Science composed of faculty from across the university and will offer a variety of graduate-degree programs to its students. A</span><span style="font-size:13px;line-height:19px;"> two-year master’s program will begin accepting applications this month and will commence this fall. In addition, the university plans to establish a doctoral program in the field.</span></p>
<p>“Data Science is becoming a necessary tool to answer some of the big scientific questions and technological challenges of our times,” said Gerard Ben Arous, director of NYU’s Courant Institute of Mathematical Sciences, which will house the Center for Data Science.</p>
<p>Machine learning expert Yann LeCun will lead the initiative. He said in a statement that we can use a variety of data sources to answer questions like, &#8220;<em>How does the brain work</em>?&#8221; The new program will focus on managing a vast volume of data, but also on exploiting it in ways that can yield myriad &#8220;social and economic benefits,&#8221; according to the statement.</p>
<p><span style="font-size:13px;">As we reported when<a href="http://venturebeat.com/2012/05/03/ebays-new-east-coast-hq-boosts-new-york-tech-scene/"> eBay opened its New York office</a>, the city&#8217;s startups and tech companies have a keen interest in analytics and &#8220;big data.&#8221; </span></p>
<p><span style="font-size:13px;">Bloomberg Ventures&#8217; Matt Turck, who also runs the <a href="http://www.meetup.com/NYC-Data-Business-Meetup/" target="_blank">Data Business meetup</a> in New York, said with initiatives like these, the city is in a position to become a &#8220;big data mecca.&#8221; </span>&#8220;[But] let&#8217;s just hope that all the graduates don&#8217;t get scooped up by hedge funds or the Facebooks of the world,&#8221; he warned. <img style="font-size:13px;" alt="" src="https://mail.google.com/mail/ca/u/1/images/cleardot.gif" /></p>
<p>Jacob Brody, a New York based partner at seed fund MESA+, said this is a first step in addressing the talent shortage for data scientists in New York. &#8220;So many problems currently being solved by companies in New York require expertise in Big Data,&#8221;  he said.</p>
<p>&#8220;The program will help startups somewhat in the short term, but it will help longer term when alumni leave big jobs in a few years to join startups or start their own,&#8221; said Brody, who is also a former tech reporter for VentureBeat. &#8220;<span style="font-size:13px;line-height:19px;">It&#8217;s about creating an ecosystem, and this is a start,&#8221; he explained. </span></p>
<p>NYU isn&#8217;t the first university to offer training to budding data scientists: Syracuse University has a certificate in advanced study in data science, Columbia University <a href="http://www.bloomberg.com/news/2012-07-30/columbia-university-to-create-data-sciences-institute-in-nyc.html" target="_blank">recently launched </a>its own Data Science Institute, and Stanford University has classes in data mining. <a href="http://datascience101.wordpress.com/2012/04/09/colleges-with-data-science-degrees/" target="_blank">Check out this fairly comprehensive list.</a></p>
<p><em><a href="http://www.shutterstock.com/cat.mhtml?searchterm=university+grads&amp;search_group=&amp;lang=en&amp;search_source=search_form#id=95717242&amp;src=D37988F6-7ABB-11E2-9FD0-10D437D0D1A0-1-4" target="_blank">Graduation hats image </a>via Shutterstock </em></p>
<br />Filed under: <a href='http://venturebeat.com/category/big-data/'>Big Data</a>, <a href='http://venturebeat.com/category/entrepreneur/'>Entrepreneur</a>, <a href='http://venturebeat.com/category/new-york/'>New York</a>  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=venturebeat.com&#038;blog=342986&#038;post=624474&#038;subd=venturebeat&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://venturebeat.com/2013/02/19/new-york-university-launches-initiative-to-train-the-next-generation-of-data-scientists/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	<enclosure url="http://venturebeat.files.wordpress.com/2013/02/nyu.jpg?w=160" /><source url="http://venturebeat.com/2013/02/19/new-york-university-launches-initiative-to-train-the-next-generation-of-data-scientists/">New York University wants to train the next generation of data scientists</source>
		<media:content url="http://2.gravatar.com/avatar/54db9fa0da02d1fe98a5197333d6d08f?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">christinafarr</media:title>
		</media:content>

		<media:content url="http://venturebeat.files.wordpress.com/2013/02/nyu.jpg" medium="image">
			<media:title type="html">nyu</media:title>
		</media:content>

		<media:content url="https://mail.google.com/mail/ca/u/1/images/cleardot.gif" medium="image" />
	</item>
		<item>
		<title>Data scientists: supermodels, football stars &#8230; or artists?</title>
		<link>http://venturebeat.com/2012/08/27/data-scientists-supermodels-football-stars-or-artists/</link>
		<comments>http://venturebeat.com/2012/08/27/data-scientists-supermodels-football-stars-or-artists/#comments</comments>
		<pubDate>Mon, 27 Aug 2012 17:57:36 +0000</pubDate>
		<dc:creator>John Koetsier</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Cloud]]></category>
		<category><![CDATA[Enterprise]]></category>
		<category><![CDATA[CloudBeat 2012]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[data artist]]></category>
		<category><![CDATA[data scientist]]></category>
		<category><![CDATA[editor's pick]]></category>
		<category><![CDATA[featured]]></category>
		<category><![CDATA[teradata]]></category>

		<guid isPermaLink="false">http://venturebeat.com/?p=506924</guid>
		<description><![CDATA[<p><span class="post-label editors-pick">Editor's Pick</span> The appeal of "big data" has made data scientists a hot item. But perhaps a better term would be "data artists," befitting the artfulness that goes into interpreting and illustrating big&#160;datasets.</p>
<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=venturebeat.com&#038;blog=342986&#038;post=506924&#038;subd=venturebeat&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<div class="post-meta-blurb post-meta-before blurb-cat-cloud"><div class="event-boilerplate"><div class="logo-date-wrap"><a href="http://cloudbeat2013.com" data-vb-ga-outbound="CB2013boilerplateTOP" target="_blank"><img src="http://venturebeat.files.wordpress.com/2013/02/cloudbeat2013-boilerplate.png" alt="CloudBeat 2013" style="margin-top:5px;"></a><div class="date-location"><strong>Sept. 9 - 10, 2013</strong><br>San Francisco, CA</div></div><a href="http://cloudbeat2013-CB2013boilerplateTOP.eventbrite.com/" class="cta" data-vb-ga-outbound="CB2013boilerplateTOP" target="_blank">Early Bird Tickets on Sale</a></div></div><p><a href="http://venturebeat.com/2012/08/27/data-scientists-supermodels-football-stars-or-artists/data-art/" rel="attachment wp-att-518725"><img class="aligncenter size-full wp-image-518725" title="data-art" alt="" src="http://venturebeat.files.wordpress.com/2012/08/data-art.jpg?w=665&#038;h=407" height="407" width="665" /></a>The appeal of &#8220;big data&#8221; has made <a href="http://venturebeat.com/2012/07/17/data-scientist/">data scientists</a> a hot item, commanding respect and income worthy of <a href="http://venturebeat.com/2012/08/07/study-data-scientists-top-dogs/">supermodels or football stars</a>.</p>
<p>But perhaps a better term would be &#8220;data artists,&#8221; befitting the artfulness that goes into interpreting and illustrating big datasets. Perhaps these scientists are not the Einsteins and Edisons but the Van Goghs and Picassos of the big data revolution.</p>
<div style="float:right;width:245px;background-color:#ffffff;padding:10px;border:4px dotted #C2ECFC;">
<p><a href="http://venturebeat.com/events/cloudbeat2012/"><img class="alignleft  wp-image-510714" title="CloudBeat2012" alt="CloudBeat 2012" src="http://venturebeat.files.wordpress.com/2012/08/cloudbeat2012.jpg?w=241&#038;h=29" height="29" width="241" /></a><em><a href="http://venturebeat.com/events/cloudbeat2012/">CloudBeat 2012</a> is assembling the biggest names in the cloud’s evolving story to learn about real cases of revolutionary cloud adoption. <a href="http://cloudbeat2012.eventbrite.com/" target="_blank">Register now and save 25 percent!</a> The early-bird discount ends September 14.</em></p>
</div>
<p>The current revolution in big data is inherent in the name: datasets trending to petabyte size (that&#8217;s 1 million GBs) or more. And it&#8217;s all about complexity: unstructured data, multiple datasets, and structured data, all jumbled up. (Even <a href="http://venturebeat.com/2012/08/27/these-are-the-skills-you-need-to-be-a-data-scientist-at-facebook/">Facebook is looking to hire a data scientist</a>.) So big data requires big brains and massive computing brawn.</p>
<p>But it requires more, too, much more, according to author and data scientist Bill Franks.</p>
<p>He&#8217;s the author of <a href="http://www.amazon.com/Taming-Data-Tidal-Wave-Opportunities/dp/1118208781" target="_blank">Taming the Big Data Tidal Wave</a> and the chief analytics officer for <a href="http://www.teradata.com/" target="_blank">Teradata</a>, the data warehousing and analytics company.</p>
<p>According to Franks, even more than size and complexity, big data is increasingly about creativity: finding interesting patterns and following them down the rabbit hole.</p>
<p>Sometimes, nothing turns up. Other times, data artists discover wonderland. A very profitable wonderland.</p>
<h3>What is big data?</h3>
<p><a href="http://venturebeat.com/2012/08/27/data-scientists-supermodels-football-stars-or-artists/sea-of-data/" rel="attachment wp-att-506973"><img class="alignright  wp-image-506973" title="sea-of-data" alt="" src="http://venturebeat.files.wordpress.com/2012/08/sea-of-data.jpg?w=331&#038;h=334" height="334" width="331" /></a>&#8220;You will not find an agreed-upon definition for &#8216;big data,&#8217;&#8221; Franks says. &#8220;I&#8217;ve had vigorous debates about that.&#8221;</p>
<p>The general guidelines he offers are a combination of factors: volume, size, variety, and complexity. One of the most interesting and relevant is complexity. Big data doesn&#8217;t always fit into neat columns. Some data works in standard traditional SQL (structured query language) databases, and some does not.</p>
<p>For example, web analytics data from server logs is fairly structured, but tweets or Instagram photos are not. Sensor data reporting customer traffic in a retail store may be neat and clean, but data artists will probably want to match it up against an entirely different dataset: sales.</p>
<p>And then factor in weather data, and time of year, and the price of gasoline. Probably not the position of Pluto, but in big data you never really know what bit of information might prove to be critical.</p>
<h3>&#8216;Data artists&#8217; &#8230; really?</h3>
<p>&#8220;There have always been individuals who take a company&#8217;s data and find interesting patterns using data-mining and predictive analytics,&#8221; says Franks. &#8220;The technical abilities to do so are now table stakes to be successful in a business environment.&#8221;</p>
<p><a href="http://venturebeat.com/2012/08/27/data-scientists-supermodels-football-stars-or-artists/paint-brushes/" rel="attachment wp-att-506989"><img class="alignright  wp-image-506989" title="paint-brushes" alt="" src="http://venturebeat.files.wordpress.com/2012/08/paint-brushes.jpg?w=350&#038;h=234" height="234" width="350" /></a>But now in the era of big data, these technical abilities need to be married with softer skills: commitment, and especially creativity.</p>
<p>So the key to being a good data artist, Franks suggests, is creativity.</p>
<p>&#8220;There&#8217;s no need to be creative if you have the exact data for the exact problem … but the reality is that you have to make assumptions, deal with inconsistencies, and then choose a model that may not be perfect.&#8221;</p>
<p>That&#8217;s what takes creativity, and the best data scientists are, in fact, artists.</p>
<p>The tools of their trade, of course, are not palette, brush, and canvas. And increasingly they are not traditional databases and SQL, but tools like Hadoop and MapReduce, tools for processing huge amounts of data across large number of machines. Or hand-coding spur-of-the-moment solutions in Java or other languages.</p>
<h3>And the results?</h3>
<p>The business results are clear. Better products, better customer service, better asset utilization: All results of using big data.</p>
<p><a href="http://venturebeat.com/2012/08/27/data-scientists-supermodels-football-stars-or-artists/cash-9/" rel="attachment wp-att-506997"><img class="alignright  wp-image-506997" title="cash" alt="" src="http://venturebeat.files.wordpress.com/2012/08/cash2.jpg?w=350&#038;h=298" height="298" width="350" /></a>Walmart&#8217;s rise over the last few decades was largely a technology-powered revolution in retail. Google&#8217;s capability to be Google is entirely dependent on the company&#8217;s success to acquire and manage massive sets of data. The same is true of Facebook&#8217;s capability to keep almost a billion people connected.</p>
<p>But it&#8217;s not just about the big names in the technology world. Airlines scheduling flights and setting prices use big data to maximize profitability. Banks stocking ATMs with cash manage datasets on seasons and events in addition to their own customer data. And Teradata now has 36 members of its &#8220;Petabyte Club&#8221; &#8212; clients who have at least one but often dozens or scores of petabytes of data &#8212; up from just five in 2007.</p>
<p>&#8220;Big data is so much in the news lately that some might argue it&#8217;s overhyped,&#8221; Franks says. &#8220;But so many people are getting excited … they can&#8217;t all be wrong. And the new value that can be driven is just really cool.&#8221;</p>
<p>Franks sums it the job of a data artist this way:</p>
<p>&#8220;You need to use imperfect data and imperfect methods in an insufficient timeframe to get enough data to make a good business decision.&#8221;</p>
<p>The end result? Green on the balance sheet.</p>
<p><em>Image credit: <a href="http://www.shutterstock.com/pic-101276017/stock-vector-leaves-wave-effect-optical-art-green-and-blue-large.html?src=921ba8ff6b648a91cda91765526d3490-1-0" target="_blank">Atypeek Design/ShutterStock</a>, <a href="http://www.shutterstock.com/pic-104929805/stock-photo-man-adrift-in-tiny-baot-in-binary-ocean.html?src=630a3118bef9fa91c1931b297d2f4046-1-0" target="_blank">Bruce Rolf/ShutterStock</a>, <a href="http://www.shutterstock.com/pic-82683514/stock-photo-paint-brushes.html?src=c6054fe6ef8674a96bfeb21a503fab39-1-2" target="_blank">Val Lawless/ShutterStock</a>, <a href="http://www.shutterstock.com/pic-83630620/stock-photo-wads-of-dollars-d-image-isolated-white-background.html?src=e059204c8e7d96ee8cbe8595043e76b3-1-77" target="_blank">AnatolyM/ShutterStock</a>, <a href="http://www.flickr.com/photos/medul_la/5891727279/" target="_blank">medul.la</a> via <a href="http://photopin.com" target="_blank">photo pin</a> <a href="http://creativecommons.org/licenses/by/2.0/" target="_blank">cc</a></em></p>
<br />Filed under: <a href='http://venturebeat.com/category/big-data/'>Big Data</a>, <a href='http://venturebeat.com/category/cloud/'>Cloud</a>, <a href='http://venturebeat.com/category/enterprise/'>Enterprise</a>  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=venturebeat.com&#038;blog=342986&#038;post=506924&#038;subd=venturebeat&#038;ref=&#038;feed=1" width="1" height="1" /><style type="text/css">.blurb-cat-cloud .event-boilerplate {
width:278px;
margin:0px 0px 10px 20px;
padding:10px;
float:right;
border:1px solid #e4e4e4;
font-family: 'Open Sans', sans-serif;
color:#000;
}
.blurb-cat-cloud .event-boilerplate .logo-date-wrap {
width:100%;
display:block;
float:left;
margin-bottom:8px;
}
.blurb-cat-cloud .event-boilerplate img {
float:left;
}
.blurb-cat-cloud .event-boilerplate .date-location {
float:right;
font-size:12px;
line-height:14px;
text-align:center;
padding-left:7px;
padding-top:5px;
padding-bottom:3px;
border-left:1px solid #e6e6e6;
}
.blurb-cat-cloud .event-boilerplate .cta {
display:block;
clear:both;
width:100%;
border-radius:5px;
border:1px solid #1864b1;
color:#fff;
text-shadow: 0px -1px 0px rgba(0,0,0,0.3);
text-align:center;
text-decoration:none;
font-weight:600;
font-size:18px;
line-height:17px;
padding:4px 0px 6px 0px;
background: #1f80e4;
background: -moz-linear-gradient(top,  #1f80e4 0%, #1862ae 100%);
background: -webkit-gradient(linear, left top, left bottom, color-stop(0%,#1f80e4), color-stop(100%,#1862ae));
background: -webkit-linear-gradient(top,  #1f80e4 0%,#1862ae 100%);
background: -o-linear-gradient(top,  #1f80e4 0%,#1862ae 100%);
background: -ms-linear-gradient(top,  #1f80e4 0%,#1862ae 100%);
background: linear-gradient(to bottom,  #1f80e4 0%,#1862ae 100%);
filter: progid:DXImageTransform.Microsoft.gradient( startColorstr='#1f80e4', endColorstr='#1862ae',GradientType=0 );
}</style>]]></content:encoded>
			<wfw:commentRss>http://venturebeat.com/2012/08/27/data-scientists-supermodels-football-stars-or-artists/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	<enclosure url="http://venturebeat.files.wordpress.com/2012/08/data-art.jpg?w=160" /><source url="http://venturebeat.com/2012/08/27/data-scientists-supermodels-football-stars-or-artists/">Data scientists: supermodels, football stars &#8230; or artists?</source>
		<media:thumbnail url="http://venturebeat.files.wordpress.com/2012/08/data-art.jpg?w=160" />
		<media:content url="http://venturebeat.files.wordpress.com/2012/08/data-art.jpg?w=160" medium="image">
			<media:title type="html">data-art</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/6d4d24b12c84be6eecddf121bc3fee48?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">johnkoetsier</media:title>
		</media:content>

		<media:content url="http://venturebeat.files.wordpress.com/2012/08/data-art.jpg" medium="image">
			<media:title type="html">data-art</media:title>
		</media:content>

		<media:content url="http://venturebeat.files.wordpress.com/2012/08/cloudbeat2012.jpg" medium="image">
			<media:title type="html">CloudBeat2012</media:title>
		</media:content>

		<media:content url="http://venturebeat.files.wordpress.com/2012/08/sea-of-data.jpg" medium="image">
			<media:title type="html">sea-of-data</media:title>
		</media:content>

		<media:content url="http://venturebeat.files.wordpress.com/2012/08/paint-brushes.jpg" medium="image">
			<media:title type="html">paint-brushes</media:title>
		</media:content>

		<media:content url="http://venturebeat.files.wordpress.com/2012/08/cash2.jpg" medium="image">
			<media:title type="html">cash</media:title>
		</media:content>
	</item>
		<item>
		<title>These are the skills you need to be a data scientist at Facebook</title>
		<link>http://venturebeat.com/2012/08/27/these-are-the-skills-you-need-to-be-a-data-scientist-at-facebook/</link>
		<comments>http://venturebeat.com/2012/08/27/these-are-the-skills-you-need-to-be-a-data-scientist-at-facebook/#comments</comments>
		<pubDate>Mon, 27 Aug 2012 17:17:18 +0000</pubDate>
		<dc:creator>Christina Farr</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Social]]></category>
		<category><![CDATA[data science]]></category>
		<category><![CDATA[data scientist]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[featured]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://venturebeat.com/?p=519943</guid>
		<description><![CDATA[<p>Props to Facebook HR! Finally, a detailed description of what a "data scientist" actually is. Join the team that analyzes raw data to probe the motivations of human behavior (for&#160;fun).</p>
<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=venturebeat.com&#038;blog=342986&#038;post=519943&#038;subd=venturebeat&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://venturebeat.com/2012/08/27/these-are-the-skills-you-need-to-be-a-data-scientist-at-facebook/screen-shot-2012-08-27-at-10-03-39-am/" rel="attachment wp-att-519972"><img class="alignnone size-full wp-image-519972" title="Screen shot 2012-08-27 at 10.03.39 AM" src="http://venturebeat.files.wordpress.com/2012/08/screen-shot-2012-08-27-at-10-03-39-am.png?w=621&#038;h=497" alt="" width="621" height="497" /></a></p>
<p>Finally, a clear description of what it means to be a data scientist! Facebook is seeking a <a href="http://www.facebook.com/careers/department?dept=data&amp;req=a2KA0000000LjX4MAK" target="_blank">&#8220;data scientist&#8221;</a> to join the team of a dozen researchers in the Menlo Park HQ tasked with uncovering insights from the most extensive data set on human relationships ever assembled. The job is lumped into the general category of &#8220;software engineering careers,&#8221; but is a far more product-driven role than you might expect.</p>
<div style="float:right;width:245px;background-color:#ffffff;padding:10px;border:4px dotted #C2ECFC;">
<p><a href="http://venturebeat.com/events/cloudbeat2012/"><img class="alignleft  wp-image-510714" title="CloudBeat2012" src="http://venturebeat.files.wordpress.com/2012/08/cloudbeat2012.jpg?w=241&#038;h=29" alt="CloudBeat 2012" width="241" height="29" /></a><em><a href="http://venturebeat.com/events/cloudbeat2012/">CloudBeat 2012</a> is assembling the biggest names in the cloud’s evolving story to learn about real cases of revolutionary cloud adoption. Unlike other cloud events, customers &#8212; the users of cloud technologies &#8212; will be front and center. Their discussions with vendors and other experts will give you rare insights into what really works, who&#8217;s buying what, and where the industry is going. <a href="http://cloudbeat2012.eventbrite.com/" target="_blank">Register now and save 25 percent!</a> The early-bird discount ends September 14.</em></p>
</div>
<p><a href="http://www.facebook.com/careers/department?dept=data&amp;req=a2KA0000000LjX4MAK" target="_blank">According to the job description</a>, the ideal candidate is a strong communicator with an appreciation for products, someone with expertise in the full gamut of technical skills. To qualify, you&#8217;ll need:</p>
<ul>
<li>Fluency with at least one scripting language (Python or PHP)</li>
<li>Experience working with large data sets. Bonus points if you can work with distributed computing tools, such as Map/Reduce, Hadoop, Hive, etc.</li>
<li>Familiarity with relational databases and SQL</li>
<li>A strong passion for empirical research and for answering hard questions with data</li>
<li>Ability to communicate findings to product managers and engineers</li>
<li>Ability to answer and identify product questions using appropriate statistical techniques on available data</li>
</ul>
<p>Who is this wunderkind? This data hero &#8212; or <a href="http://venturebeat.com/2012/08/27/data-scientists-supermodels-football-stars-or-artists/">data artist</a>? It&#8217;s certainly not your typical low-level developer or analyst, many of whom have taken to passing themselves off as data scientists. You can&#8217;t blame them &#8212; it&#8217;s the tech industry&#8217;s sexy new profession. From LinkedIn to Netflix, every tech company worth its salt has developed a data-dedicated team. <a href="http://www.technologyreview.com/featured-story/428150/what-facebook-knows/" target="_blank">According to a rather brilliant feature in Technology Review, Facebook is set to double its data team in the next 12 months.</a></p>
<p>The right candidate for this job will be willing and able to find Facebook new ways to make money from personal information, whether it&#8217;s our family photos, conversations, life achievements, or other tidbits of data. Under the company&#8217;s top data scientist Cameron Marlow, the team&#8217;s goal is to find patterns among the data that the product team can make use of. For instance, data scientists helped Facebook identify users you may know but haven&#8217;t &#8220;friended.&#8221; The team is working with terabytes of potentially valuable data that is lodged in the company&#8217;s servers.</p>
<p>There still isn&#8217;t much consensus on what it actually means to be a data scientist, so I was relieved to stumble upon this opening, which sheds some light. Still, even Facebook&#8217;s HR department is going in a bit blind. Even Marlow prefers to go by the academic-sounding term of &#8220;chief sociologist.&#8221;</p>
<p>The job description says the ideal candidate will have a Ph.D or masters degree. Technology Review reported that several members of the team are trained in sociology or social psychology. In their free time, they can use Facebook&#8217;s raw data to uncover patterns in human behavior, and publish the results in academic journals.</p>
<p>The salary isn&#8217;t specified, but <a href="http://venturebeat.com/2012/08/07/study-data-scientists-top-dogs/#QLOh95j4xuFrqDh0.99">according to a recent study</a>, it averages at $55,000 for a data analyst to $132,000 for a vice president of analytics. This is Facebook, so expect to earn significantly more than that.</p>
<p><a href="http://venturebeat.com/2012/07/17/data-scientist/">Click here to read more about what it means to be a data scientist</a>.</p>
<br />Filed under: <a href='http://venturebeat.com/category/big-data/'>Big Data</a>, <a href='http://venturebeat.com/category/social/'>Social</a>  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=venturebeat.com&#038;blog=342986&#038;post=519943&#038;subd=venturebeat&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://venturebeat.com/2012/08/27/these-are-the-skills-you-need-to-be-a-data-scientist-at-facebook/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	<enclosure url="http://venturebeat.files.wordpress.com/2012/08/screen-shot-2012-08-27-at-10-03-39-am.png?w=160" /><source url="http://venturebeat.com/2012/08/27/these-are-the-skills-you-need-to-be-a-data-scientist-at-facebook/">These are the skills you need to be a data scientist at Facebook</source>
		<media:content url="http://2.gravatar.com/avatar/54db9fa0da02d1fe98a5197333d6d08f?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">christinafarr</media:title>
		</media:content>

		<media:content url="http://venturebeat.files.wordpress.com/2012/08/screen-shot-2012-08-27-at-10-03-39-am.png" medium="image">
			<media:title type="html">Screen shot 2012-08-27 at 10.03.39 AM</media:title>
		</media:content>

		<media:content url="http://venturebeat.files.wordpress.com/2012/08/cloudbeat2012.jpg" medium="image">
			<media:title type="html">CloudBeat2012</media:title>
		</media:content>
	</item>
		<item>
		<title>The elephant in the room: Are you ready to face big data?</title>
		<link>http://venturebeat.com/2011/12/19/the-elephant-in-the-room-are-you-ready-to-face-big-data/</link>
		<comments>http://venturebeat.com/2011/12/19/the-elephant-in-the-room-are-you-ready-to-face-big-data/#comments</comments>
		<pubDate>Mon, 19 Dec 2011 19:52:42 +0000</pubDate>
		<dc:creator>Anjul Bhambhri</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Business]]></category>
		<category><![CDATA[data scientist]]></category>

		<guid isPermaLink="false">http://venturebeat.com/?p=367098</guid>
		<description><![CDATA[<p><span class="post-label guest-post">Guest Post</span>
<p>Our use of social media, smart devices and the Internet all create huge volumes of data. Behind the scenes, machine-to-machine interactions, sensors, recommendation engines and APIs drive a proliferation of information. Much of this data &#8212; in some studies as&#160;&#8230;</p>
<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=venturebeat.com&#038;blog=342986&#038;post=367098&#038;subd=venturebeat&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Our use of social media, smart devices and the Internet all create huge volumes of data. Behind the scenes, machine-to-machine interactions, sensors, recommendation engines and APIs drive a proliferation of information. Much of this data &#8212; in some studies as much as 80 percent &#8212; has little to no structure and much of it is generated at machine speed; at high velocity. This kind of &#8220;big data&#8221; &#8212; extremely large data sets that have little to no structure and are growing every minute &#8212; is hard for databases and analysis tools to handle just in time. But it also contains a wealth of valuable information waiting to be found.</p>
<p><a href="http://venturebeat.com/2011/12/19/the-elephant-in-the-room-are-you-ready-to-face-big-data/elephant/" rel="attachment wp-att-367136"><img class="alignleft  wp-image-367136" title="Elephant" src="http://venturebeat.files.wordpress.com/2011/12/elephant.jpg?w=338&#038;h=338" alt="" width="338" height="338" /></a>Often, the clues that lie in big data can be the key to an enterprise’s future success, offering insights to optimize supply chains, uncover consumer behavior patterns, identify traffic and energy patterns, and much more. Rather than be a burden, all of this newfound knowledge can allow businesses to make effective long-term and real-time business decisions.</p>
<p>How can businesses grapple with the task of not only managing this data explosion but more importantly, extracting value from it and turning it into an asset? How do you remove the noise from this high velocity, highly variable data and discover key insights while they are still relevant?</p>
<p>That’s a question I and and other panelists broached at VentureBeat’s recent <a href="http://venturebeat.com/2011/11/11/cloudbeat-breakout-session-the-new-social-layer/">CloudBeat</a> session on “The New Social Layer.” The session examined how companies are using social media to gain insight into what customers think.</p>
<p>Those of us on the panel reached a surprising agreement: Much of the big data tools and technology needed for businesses to begin exploring this data is already in place. So why are so few companies making use of this data? What they’re missing is the right culture and people.</p>
<p>What struck me about this agreement was that we had executives from a startup, a major media company, an SMB and a Fortune 500 firm all agreeing on this catalyst to succeed with big data.</p>
<p>Organizations will always need individuals with strong traditional data management skills, but today business and IT staff need to collaborate in order for enterprises and SMBs to evolve. The individuals who will be highly sought by organizations will be people who can serve as change agents, pushing organizational collaboration and information innovation. I have been speaking extensively this year about the need for data scientists and big data skills and plan on continuing to do so.</p>
<p>At IBM, part of my job is to help develop new big data products and partner with customers to figure out the cultural shift needed. One company I’ve worked with, Vestas Wind Systems, a Danish energy company, is using our big data analytics technology to determine the best locations for wind turbine placement. Vestas is able to analyze everything from weather reports, tidal phases and satellite images to geospatial and sensor data, and to use that analysis to select locations that will generate optimal return on investment. This type of analysis used to take weeks but can now be done in less than an hour. Once the turbine has been sited and assembled, engineers can use these big data solutions to predict turbine performance and determine the best times to schedule maintenance.</p>
<p>That’s just one example of what is possible today. In the future, advanced and predictive analytics will help business users gain powerful insights into data so they can forecast future scenarios. One form of predictive analytics currently underway is analyzing consumer sentiment, a new phenomenon where companies can mine the countless insights gained from streaming human interactions across social networks and the Internet to establish consumer behavior or corporate best practices.</p>
<p>Dutch entertainment company RTL Nederland, for example, is using predictive analytics to mine viewer sentiment and feedback from blogs and social media sites, allowing the company to make real-time adjustments to improve viewer satisfaction for shows like “The X Factor” and “So You Think You Can Dance.” For “The X Factor” alone, RTL analyzed over 70,000 online conversations, enabling it to address viewer likes and dislikes so it could effectively tweak its programming.</p>
<p>Lastly, whether we were aware of it at the time or not, many of us witnessed big data analytics in action on the TV show Jeopardy! this past February, when IBM’s <a href="http://www-03.ibm.com/innovation/us/watson/index.html" target="_blank">Watson</a>, faced off against Jeopardy champions Brad Rutter and Ken Jennings. Zipping through category after category (with a few stumbles along the way), Watson demonstrated its ability to search roughly 200 million pages of text and buzz in once it reached a “confidence threshold.” Being able to understand Natural Language – the way humans speak to each other – including puns, double entendres and pop culture references, was a groundbreaking achievement for technology. Being able to do it all in about three seconds blew the human competition away. The reach of this exciting new big data analytics technology will be broad and will certainly play a role in many areas of our future.</p>
<p>Yes, the amount of big data that exists can be intimidating, but it’s exciting to envision the countless possibilities it can achieve for businesses. In order to capitalize on big data, businesses need to recast their views of their data problem. Instead of looking for quick fixes to handle the vast amounts of incoming data, they should look at it as an opportunity to make a significant impact on their business. If they have people on staff dedicated to determining the types of insights they’d like to uncover from their data, the technology exists to help them find it.</p>
<p><em>Anjul Bhambhri is vice president of Big Data Products at IBM.</em></p>
<p>[Image via <a href="http://www.shutterstock.com/gallery-85920p1.html" target="_blank">Eric Isselée</a>/<a href="http://www.shutterstock.com" target="_blank">Shutterstock</a>]</p>
<br />Filed under: <a href='http://venturebeat.com/category/big-data/'>Big Data</a>, <a href='http://venturebeat.com/category/business/'>Business</a>  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=venturebeat.com&#038;blog=342986&#038;post=367098&#038;subd=venturebeat&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://venturebeat.com/2011/12/19/the-elephant-in-the-room-are-you-ready-to-face-big-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	<enclosure url="http://venturebeat.files.wordpress.com/2011/12/elephant.jpg?w=140" /><source url="http://venturebeat.com/2011/12/19/the-elephant-in-the-room-are-you-ready-to-face-big-data/">The elephant in the room: Are you ready to face big data?</source>
		<media:content url="http://0.gravatar.com/avatar/f59aef76cbc94fe88b2255b07bd333df?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">venturebeat1</media:title>
		</media:content>

		<media:content url="http://venturebeat.files.wordpress.com/2011/12/elephant.jpg" medium="image">
			<media:title type="html">Elephant</media:title>
		</media:content>
	</item>
		<item>
		<title>Do you need a data scientist?</title>
		<link>http://venturebeat.com/2011/11/28/do-you-need-a-data-scientist/</link>
		<comments>http://venturebeat.com/2011/11/28/do-you-need-a-data-scientist/#comments</comments>
		<pubDate>Mon, 28 Nov 2011 17:54:09 +0000</pubDate>
		<dc:creator>Ciara Byrne</dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Entrepreneur]]></category>
		<category><![CDATA[data science]]></category>
		<category><![CDATA[data scientist]]></category>
		<category><![CDATA[decision science]]></category>
		<category><![CDATA[marketing analytics]]></category>
		<category><![CDATA[product analytics]]></category>
		<category><![CDATA[startup]]></category>

		<guid isPermaLink="false">http://venturebeat.com/?p=358053</guid>
		<description><![CDATA[<p>Some of the world&#8217;s biggest tech companies from Google to Facebook are data-driven, but few startup founders have any idea what a data scientist does, never mind whether they should hire one. Here is VentureBeat&#8217;s guide to data science for&#160;&#8230;</p>
<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=venturebeat.com&#038;blog=342986&#038;post=358053&#038;subd=venturebeat&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.shutterstock.com" target="_blank"><img class="alignleft  wp-image-358082" title="Data analyst image from Shutterstock" src="http://venturebeat.files.wordpress.com/2011/11/shutterstock_21637444.jpg?w=400&#038;h=267" alt="" width="400" height="267" /></a>Some of the world&#8217;s biggest tech companies from Google to Facebook are data-driven, but few startup founders have any idea what a data scientist does, never mind whether they should hire one. Here is VentureBeat&#8217;s guide to data science for startups.</p>
<p><strong>What does a data scientist do?</strong></p>
<p>DJ Patil led LinkedIn&#8217;s data science team and is now the <a href="http://greylockvc.com/2011/08/09/greylock-partners-welcomes-dj-patil/" target="_blank">Data Scientist in residence at Greylock Partners</a>. His free ebook “<a href="http://www.amazon.com/Building-Data-Science-Teams-ebook/dp/B005O4U3ZE" target="_blank">Building Data Science Teams</a>” provides an excellent introduction to the basic areas of data science and how to build a team.</p>
<p>For startups, the most relevant applications of data science are probably decision science and product and marketing analytics. Decision science, as the name implies, allows you to identify and monitor key metrics for your business and answer strategic questions like “Which country should we expand into next?” or “What is the impact on the business if we lose this client?”. Google&#8217;s data science team even <a href="http://venturebeat.com/2011/09/20/people-analytics-google-hr/">drives its HR policies</a>.</p>
<p>Product analytics covers anything from how users are reacting to new features to developing standalone data products. LinkedIn&#8217;s “<a href="http://webupon.com/social-networks/how-does-the-people-you-may-know-feature-work-on-linkedin/" target="_blank">People you may know</a>” feature and <a href="http://www.amazon.com/gp/help/customer/display.html?ie=UTF8&amp;nodeId=13316081" target="_blank">Amazon&#8217;s recommendation system</a> are data-driven features that attempt to keep users on the site longer or drive more sales.</p>
<p>Using data to showcase or market a product is the domain of marketing analytics. One of the best known examples is <a href="http://radar.oreilly.com/2011/06/dating-data-okcupid-oktrends.html" target="_blank">okCupid&#8217;s okTrends blog</a>, which features posts like “<a href="http://blog.okcupid.com/index.php/the-case-for-an-older-woman/" target="_blank">The case for an older woman</a>” or “<a href="http://blog.okcupid.com/index.php/the-4-big-myths-of-profile-pictures/" target="_blank">The 4 big myths of profile photos</a>”. The blog drives massive traffic to the site and is regularly covered in the media.</p>
<p><strong>Who are the data scientists?</strong></p>
<p>Since data science is a new area, practitioners often migrate from other fields. You may see maths, statistics, machine learning or computer science on their resumes or a data-intensive field like meteorology. Data scientists want to be of central importance to a business, especially when it&#8217;s a startup. The best data scientists are both intensely curious and great communicators. They answer important questions and tell good stories using data.</p>
<p><strong>What is data infrastructure? </strong></p>
<p>Data scientists need specialized tools to manage and process large amounts of data. The minimum you need to get started is simple data access, usually via a database. Larger-scale or less uniform data may require a tool like <a href="http://hadoop.apache.org/" target="_blank">Hadoop</a>, an open source platform for distributed processing of large data sets across clusters of computers, as well as someone with the technical expertise to use it. Data stores like <a href="http://cassandra.apache.org/" target="_blank">Cassandra</a> are designed to perform well on very large datasets. These are some of the most commonly used tools, but there are many others for tasks such as streaming data collection, querying non-relational databases and job scheduling.</p>
<p><strong>When do you need to hire a data scientist?</strong></p>
<p>VentureBeat talked to data scientist <a href="http://www.youtube.com/watch?v=c2z_A0fDeqw" target="_blank">Cathy O&#8217;Neil</a>, who herself works for a startup (<a href="http://www.intentmedia.com/" target="_blank">Intent Media</a>), about when you need to hire a data scientist. If your data volume is growing, you don&#8217;t know if you are seeing noise or information in your data, or in general, if you are not running your business sufficiently quantitatively, then you may need to consider hiring. The following is our brief Q&amp;A with O&#8217;Neil:</p>
<p><strong>VentureBeat</strong><em>:</em> How much data is enough for a startup to justify hiring a data scientist?</p>
<p><strong>Cathy O&#8217;Neil</strong>: Too much to fit on an Excel spreadsheet. And it&#8217;s not just how much, it&#8217;s really about how high quality the data is; the best is for it to be clean and for it to not be public, or at least not generally used for the purpose that your business uses it for.</p>
<p><strong>VB</strong>: Can a data scientist help you when your startup is still trying to find a product-market fit?</p>
<p><strong>CO</strong>: Yes, in various ways. If your business model is itself quantitative, say you are trying to make money on some inefficiency of some market, a data scientist can help quantify the potential for the business. This is traditionally the job of a business person but can get pretty quantitative so is best done in cooperation with an analyst.</p>
<p>More critically, a data scientist can estimate how much and what kind of data will be needed for a given business model. Depending on the business, getting access to high quality data can be a big expense so this is important.</p>
<p><strong>VB</strong><em>: </em>When you won&#8217;t have &#8220;expected numbers&#8221; based on past performance and maybe your business model isn&#8217;t fixed yet, how can you make forecasts?</p>
<p><strong>CO</strong>: This is tricky and a data scientist can do things like using proxy data, combined with domain expertise, to try to estimate things. In other words, if you have information about how other companies in related fields are doing, you can adjust their numbers to fit to yours, with quantitative estimates of how correlated the fields are.</p>
<p><strong>VB</strong>: Data scientists sometimes use expensive proprietary software but startups don&#8217;t tend to have a lot of cash. Can they also use open source or low-cost tools?</p>
<p><strong>CO</strong>: Yes, but you either have to hire a software engineer who has some data science skills, or you may need to have engineers spend quite a bit of time setting things up for the data scientists. In general data scientists don&#8217;t have the kind of technical skills that engineers do, and the engineers may need to develop some tools in-house to make things easy for the data scientist to do their thing.</p>
<p><strong>VB</strong><em>: </em>Is it ever useful to hire a data scientist part-time or for a particular project?</p>
<p><strong>CO</strong>: Sure, you can advertise on LinkedIn for a data science consultant. This may be a good idea at the beginning of your business to be sure you&#8217;re collecting the right kind and enough good data to follow your business plan.</p>
<p><strong>VB</strong>: Startup founders are often software developers, but few have expertise in data science. Any advice for hiring?</p>
<p><strong>CO</strong>: My advice is, don&#8217;t be dismissive when data scientists are bad at programming, because that&#8217;s not what they&#8217;ve been focusing on. Make sure they know what they need to know about analytics, and figure out how to give them enough support to do those things.</p>
<p><strong>VB</strong>: Any other advice on handling data scientists?</p>
<p><strong>CO</strong>: Make sure you know what&#8217;s sexy about your data.</p>
<br />Filed under: <a href='http://venturebeat.com/category/business/'>Business</a>, <a href='http://venturebeat.com/category/entrepreneur/'>Entrepreneur</a>  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=venturebeat.com&#038;blog=342986&#038;post=358053&#038;subd=venturebeat&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://venturebeat.com/2011/11/28/do-you-need-a-data-scientist/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	<enclosure url="http://venturebeat.files.wordpress.com/2011/11/shutterstock_21637444.jpg?w=160" /><source url="http://venturebeat.com/2011/11/28/do-you-need-a-data-scientist/">Do you need a data scientist?</source>
		<media:content url="http://2.gravatar.com/avatar/221fcc5849a699e28bc5a72b2f9bc4a4?s=96&#38;d=http%3A%2F%2F2.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">deciarab</media:title>
		</media:content>

		<media:content url="http://venturebeat.files.wordpress.com/2011/11/shutterstock_21637444.jpg" medium="image">
			<media:title type="html">Data analyst image from Shutterstock</media:title>
		</media:content>
	</item>
	</channel>
</rss>
