Wolfram Alpha — it’s like plugging into an electronic brain

Scientist Stephen Wolfram has built a new engine, called Wolfram Alpha, that apparently can compute answers to factual questions more powerfully than Google.

Wolfram has just posted about the effort, which has taken years of working in stealth and involves more than a hundred workers. He explains the basics of how his “computational knowledge engine” works: You ask it factual questions (such as “How many protons are in a hydrogen atom?”), and it computes answers for you.

Many details about the engine, scheduled to launch in May, have yet to be released. However, Wolfram has shown it to search engine expert Nova Spivack. In a long post, Spivack calls the effort “almost absurdly ambitious” but concludes that it works, and claims that the engine has the potential to touch our lives as deeply as Google.

The engine doesn’t return documents that might contain the answer, like Google does, and it isn’t a giant database, like Wikipedia. Nor does it resort to natural language to return documents, like Powerset does. Rather, Wolfram (pictured left) has created a proprietary system based on fields of knowledge, containing terabytes of curated data and millions of lines of algorithms to represent real-world knowledge as we know it.

You ask it questions in a bar that looks very much like Google’s search bar, but it uses natural language to understand your question or even abbreviated notation. It then provides detailed answers.

As Spivack summarizes, the vision seems to be to create a system that can do for formal knowledge (all the formally definable systems, heuristics, algorithms, rules, methods, theorems, and facts in the world) what search engines have done for informal knowledge (all the text and documents in various forms of media).

Wolfram explains that the engine is built on two fundamental stepping stones: his works Mathematica and A New Kind of Science:

With Mathematica, I had a symbolic language to represent anything—as well as the algorithmic power to do any kind of computation. And with NKS, I had a paradigm for understanding how all sorts of complexity could arise from simple rules. But what about all the actual knowledge that we as humans have accumulated?

But if one’s already made knowledge computable, one doesn’t need to do that kind of natural language understanding. All one needs to be able to do is to take questions people ask in natural language, and represent them in a precise form that fits into the computations one can do…I wasn’t at all sure it was going to work. But I’m happy to say that with a mixture of many clever algorithms and heuristics, lots of linguistic discovery and linguistic curation, and what probably amount to some serious theoretical breakthroughs, we’re actually managing to make it work.

…It’s certainly the most complex project I’ve ever undertaken. Involving far more kinds of expertise—and more moving parts—than I’ve ever had to assemble before. And—like Mathematica, or NKS— the project will never be finished.

Note that last part — “the project will never be finished.” It’s clear that this is a massive undertaking and that it faces serious challenges. Spivack, who has toiled away at his own knowledge engine based on semantics (a company called Twine) clearly has a lot of admiration for Wolfram and is diplomatic about his criticism, couching it lower in his piece. There is a host of “hairy questions,” such as the fact that many facts in life are “fuzzy,” such as the scientific evidence of global warming. Even here, though, Wolfram has taken pains in his model to provide multiple answers, and Spivack appears to conclude that this is not a big problem. And while Spivack calls Wolfram’s system and the engine’s user interface “beautiful,” he cautions that it was also designed by and for people “with IQ’s somewhere in the altitude of Wolfram’s — some work will need to be done dumbing it down a few hundred IQ points so as to not overwhelm the average consumer with answers that are so comprehensive that they require a graduate degree to fully understand.”

Notably, the engine is not built using standard semantic web languages such as RDF, OWL and Sparql, in part because these ontologies are too difficult to build and curate for such a wide field of knowledge.

According to Spivack:

This is not to say that Wolfram Alpha IS a cellular automata itself — but rather that it is similarly based on fundamental rules and data that are recombined to form highly sophisticated structures. The knowledge and intelligence it contains are extremely modularized and can be used to synthesize answers to factual questions nobody has asked yet. The questions are broken down to their basic parts and then simple reasoning takes places, and answers are computed on the vast knowledge base in the system. It appears the system can make inferences and do some basic reasoning across what it knows — it is not purely reductionist in that respect; it is generative, it can synthesize new knowledge, if asked to.

Wolfram Alpha perhaps represents what may be a new approach to creating an “intelligent machine” that does away with much of the manual labor of explicitly building top-down expert systems about fields of knowledge (the traditional AI approach, such as that taken by the Cyc project), while simultaneously avoiding the complexities of trying to do anything reasonable with the messy distributed knowledge on the Web (the open-standards Semantic Web approach). It’s simpler than top down AI and easier than the original vision of Semantic Web.

Where Google is a system for FINDING things that we as a civilization collectively publish, Wolfram Alpha is for ANSWERING questions about what we as a civilization collectively know. It’s the next step in the distribution of knowledge and intelligence around the world — a new leap in the intelligence of our collective “Global Brain.” And like any big next-step, Wolfram Alpha works in a new way — it computes answers instead of just looking them up.

Apparently, the service will offer an API, so other developers can build on it.

I can’t wait to use this new engine. I remember when Powerset first emerged, making claims that it could use natural language to understand your questions, and generated a lot of hype. The company didn’t live up to the hype but at least offered a valuable contribution to the search engine field. Wolfram Alpha has the feel of something somewhat more realistic, because the magnitude of its task is so clearly obvious from the beginning, and because the founder concedes from the beginning this is a work in progress.

[Image credit: NNDB]

Next Story:
Previous Story:

Tags:

People:




Photo of Matt Marshall

About the Author, Matt Marshall

Matt Marshall is editor and CEO of VentureBeat. Follow him on Twitter at @mmarshall, and follow VentureBeat on Twitter at @venturebeat.

  • I really appreciate your help, it is very useful for me,you will get good grades!
    You will be successful.
  • This is really cool , thank you for sharing ,

    Funky Monkey ,

    http://telcomworld.com
  • Vincent X
    Its nice that someone has put everything into this.
    I'm sure it will be better or a vast improvement that in the past search engines.
    Its good for our future.
    Keep on getting better.
    I hope you get very close to what you/we are trying to achieve.
    Good luck.
    I cant wait till it's Launch in May 2009.
  • Readers interested in some thoughts on Wolfram Alpha will might find my blog on the subject interesting. It's at

    http://www.alpha-tips.com/alphatips/blog

    Thanks!

    --David (the blog's author...)
  • Hey, also be sure to check out our new search engine, Duck Duck Go: http://www.duckduckgo.com/. More info about us (and why we're better/different) can be found at: http://www.duckduckgo.com/about.html.

    We also have some semantic properties, e.g. ambigious keyword detection: http://www.duckduckgo.com/?q=apple, as well as have zero-click info, e.g. http://duckduckgo.com/?q=Futurama.

    Take care,

    Gabriel Weinberg
    Founder & CEO, Duck Duck Go

  • Twine_watcher
    Maybe Nova Spivack should focus more on getting Twine's new nonsense ranking algorithm right before he hypes any Wolfram Alpha offering.

    Twine's algorithm has people listed at #13 who've only posted 1 item and people who haven't actively used Twine in over six months in the Top 100 users.

    Meanwhile, not a single user who HAS been posting thousands of items and is active has been included in the Top 100.

    He's familiar with the dangers of hype. He claimed Twine has "explosive growth" and that proved to be mostly due to spambots whilst actual user engagement has dropped from a high of 16 minutes to 3 minutes.

    Yup, he should work on fixing and delivering Twine's own smarts first before he goes and worries about how smart anyone else's solutions are.
  • I too am skeptical. Obviously, I'll have to wait until it's released to see what it can do, but what I've read about Wolfram Alpha so far seems neither feasible nor useful. And there's another issue -- transparency. This appears to have been a "secret project", not something developed as an open-source project by a world-wide community. If such a thing is feasible and useful, why has it not been built by the smartest people in the world, open source hackers?
  • erdevs
    WolframAlpha is cataloging your criticisms and plotting its revenge...
  • Bob Agnes
    I hope it's as good or better than advertised. Why not wait until we've had a chance to play with it, before spouting all of the negative comment. Wolfram is a briliiant hard working thinker. We should encourage people like him to keep reaching for the next breakthrough for everyone's benefit.
  • Miramon
    Going to have to wait till May, apparently, to see if it's any good. Wolfram has done good things, of course, but he also seems to have a very big head. IMO A New Kind of Science was about ten pounds too heavy for its actual content, and its pervasive lack of citation of living automata researchers -- who sometimes had already investigated claims said to be new and original in the book -- was extremely arrogant and annoying.

    All prior "semantic search" that has supposedly been better than Google has not been, from PowerSet on down. I wouldn't get hopes too high for the Turing Test, cyberiban -- this thing will be lucky if it can even distinguish subject and object through NLP parsing (a principal PowerSet claim), much less figure out what the meaning of a complex query might be.
  • Type your comment here.
  • Needs a better name.
  • This is getting hyped wayyyy too much and much too quickly.

    I can't help but be a tad bit skeptical.
  • Mark
    What do You think about www.coount.com - step by step online calculator?
  • "Nor does it resort to natural language to return documents, like Powerset does. "

    "You ask it questions in a bar that looks very much like Google’s search bar, but it uses natural language to understand your question or even abbreviated notation. "

    Don't they contradict ?

    Any system whether Google, Powerset, Cuil or Wolfram Alpha cannot afford to not use NLP. In fact all of them use it to a certain extent.
  • The API opens up some really interesting platform plays. Plug twitter into that and you have framing for all sorts of knowledge networking domains/spaces. I'm from Missouri on all of this stuff (game changers), but conceptually, a really fresh idea. Mechanical Turk, for real.
  • cyberiban
    This is the worlds first real contender for the Turing Test, if it delivers.

    In some ways, it would be better than passing the turing test. If it can answer previously unanswered questions. Which appears to be a potential, it could actually be the first step toward furthering human understanding. A superior source of reference to other humans.

    Maybe the singlarity gets just a little closer this May!
  • Mrs. Smith
    typo - will over s/b will offer
  • swag
    And anyone else think that Wolfram's photo here is an homage to George Costanza's bio on the Vandelay Industries corporate Web site?
  • Ha.
  • Stu
    How about an example showing that it's actually better than Google? I put the exact same query into Google -- "How many protons are in a hydrogen atom?" -- and it didn't merely "return documents that might contain the answer," it gave the answer to the question and provided its source.
  • s
    How many protons are in a Na22?
  • swag
    The problem with this talk is that only a small fraction of my Google use is to answer specific questions. Most of the time it's for finding other sites, references, open source software packages, procedural info such as a cooking recipe for preparing cornish hens or how to rip audio from a DVD, etc., what have you.

    Meaning: the market for this is much, much smaller than Google.
  • galev
    Right. Google is a search engine, WolframAlpha is not.
  • Chris
    Cute - now try searching for any other atom on the periodic table.
  • Bob Agnes
    I did. It gave the answer for iron (FE) 56.

    Bob A.
  • k
    Cuil never really got a fair chance. Maybe the black UI had something to do with it.
    Anyway, I would like to give this one a try.
    I would like to know if it's better than Yahoo Answers or Mahalo Answers, both are powered by humans. Maybe Wolfram needs to look at those two companies first when giving an answer.
blog comments powered by Disqus