Facebook today showed off the latest progress in its artificial intelligence research. The most impressive achievement is a new system that lets people ask questions about photos on Facebook using voice input and then receive audible answers about the photos in response.

The mobile-based system, called Visual Q&A, shows how Facebook can use multiple approaches to a type of artificial intelligence called “deep learning”—specifically, convolutional neural networks and more modern end-to-end memory networks—to turn its existing data into something that can be consumed by an audience that normally wouldn’t be able to access it.

“Think of what this might mean to the 285 million people globally who have low vision capabilities or the 40 million who are blind. Instead of being left out of the experience when friends share photo content, they’ll be able to participate,” Facebook chief technology officer Mike Schroepfer wrote in a blog post on the news.

Facebook isn’t releasing this app to the public, but the company has allowed some people to try it out, and a video documenting the research is pretty moving.

This work builds on Facebook’s ability to answer questions about textual information using AI, which Schroepfer demonstrated on stage at the company’s F8 developer conference in March.

Google and Baidu, among other companies, are also moving quickly to smarten up their products with deep learning.

Alongside the Visual Q&A system, Facebook is also announcing improvements in computer vision today.

“Our team has created not only a system that has taught machines this skill, but also a state-of-the-art research system that can segment images 30 percent faster than most other systems, using 10x less training data across industry benchmarks,” Schroepfer wrote.

Facebook’s computers can now also make inferences about whether virtual blocks stacked unevenly will topple over. The system is 90 percent accurate, according to Schroepfer’s blog post.

Facebook researchers have also been trying to teach a computer to play the Chinese board game Go at a high level.

“The Go player we’ve built is getting close to being able to compete with the best humans. We’ve only been working on it for a few months, but it’s already on par with the other AI-powered systems that have been published and is as good as a very strong amateur human player,” Schroepfer wrote.

This might seem like nothing more than a fun application, much like IBM’s Deep Blue supercomputer taking on chess grandmaster Garry Kasparov in 1997. But Facebook has nearly a billion and a half users now. In the future, a whole lot of people may be able to take advantage of this technology whenever they want.
