New York Times lets anyone search its 2.8 million articles

[Disclosure: The New York Times syndicates VentureBeat content]

The New York Times has been exposing more data to external websites and applications recently, and now it’s providing the most promising access of all: the ability to search the Times’ entire online article archive going back to 1981.

The Times is doing so through application programming interfaces (APIs), which let third-party developers access the data easily.

I was already impressed by the paper's earlier attempts to share campaign finance data, other relevant information about Congress, and fun stuff like movie reviews, but the new API unlocks the most valuable thing The Times has to offer: 2.8 million articles. You can search 35 separate fields, including title, publication date, byline, and organizations mentioned.
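To give a rough idea of what a field-scoped search might look like from a developer's side, here is a minimal sketch in Python. It only builds a request URL and does not contact any server. The endpoint, the `query` and `api-key` parameter names, and the `field:"value"` syntax are all assumptions made for illustration, not the documented Times API; check the paper's developer documentation for the real details.

```python
from urllib.parse import urlencode

# Hypothetical endpoint and parameter names, for illustration only.
# These are NOT taken from the Times' documentation.
BASE_URL = "https://api.example.com/article-search"


def build_search_url(api_key, **fields):
    """Build a search URL that restricts the query to specific fields."""
    # Combine the filters as field:"value" pairs (an assumed syntax),
    # sorted by field name so the same input always gives the same URL.
    query = " ".join(
        f'{name}:"{value}"' for name, value in sorted(fields.items())
    )
    # urlencode percent-escapes the quotes, colons, and spaces.
    return BASE_URL + "?" + urlencode({"query": query, "api-key": api_key})


url = build_search_url("YOUR_KEY", title="campaign finance", byline="Keller")
print(url)
```

A news or reference site could build URLs like this on the fly, for example to pull up archive articles that mention an organization named in one of its own stories.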

Of course, APIs are only valuable if developers are using them in interesting ways. I haven’t seen too many sites or applications using the Times APIs yet, though I haven’t looked too hard. I’m betting the Article Search API will see more use than any of the others — adding access to this content would be a great addition to news or reference sites. And compared to, say, campaign finance data, article searches seem like a much more compelling way to draw readers back to The Times’ site.

What’s less clear is how The Times plans to make money from these APIs. After all, there has been plenty of speculation about the paper’s finances, and editor Bill Keller has been openly discussing the possibility of charging for certain kinds of content. For now, there’s no charge to access the data, although the terms of service do leave the possibility open. I certainly think the Article Search API is something the Times could charge for — to my mind, the archive (not the columns) was what I was really paying for when I signed up for the abortive Times Select online subscription service.

By the way, The Times’ APIs were built by a San Francisco startup called Mashery. [Update: I was wrong. Mashery handles some of the infrastructure behind the APIs, namely the traffic throttling and key generation, but the APIs themselves were built by Times developers.] I had a chance to talk to Mashery chief executive Oren Michels last month, and he said the standard method for making money from APIs is to draw users back to the site. However, from what I hear, The Times’ problems (and the problems at most newspapers) have less to do with traffic and more to do with making money from existing visitors and ads. Michels argued that APIs are a good funnel for potential paying customers, too; among Mashery customers, Thomson Reuters’ Open Calais initiative is an example of this. Even in the downturn, more and more companies are interested in opening themselves up through APIs, Michels said.

“In a recession or a depression, you can’t ignore things that create new distribution channels,” he said.

Anyway, it’s still early to judge The Times’ initiative. Let’s see what the paper does with its coolest API yet — for now, I’m just glad this is available. The Times’ post announcing the API begins, “Finally!” which captures my feelings exactly.

About the Author, Anthony Ha

Anthony is VentureBeat's assistant editor, as well as its reporter on enterprise technology, cloud computing, and tech policy. Before joining VentureBeat in 2008, Anthony worked at the Hollister Free Lance, where he won awards from the California Newspaper Publishers Association for breaking news coverage and writing. He attended Stanford University and now lives in San Francisco. Reach him at anthony@venturebeat.com. You can also follow Anthony on Twitter.

  • APIs seem to be just another marketing tool. When you get lots of blog posts about it and attention from techies, you are seen as keeping up with the changing times. Soon enough I will come up with a Kitchen API to access the contents of my fridge.

    Cheers !!
  • Laura Merling
    Hi Anil ...... Tesco (grocer in UK) announced an API a month or so ago. They are hoping to be in your fridge soon.....
  • harrisj
    "By the way, The Times’ APIs were built by a San Francisco startup called Mashery." is not correct. Mashery is a startup that handles key generation and traffic throttling and that is the service they provide for the NY Times among other clients. But the APIs were built and maintained by the developers at the NY Times.
  • Wow, that's a pretty bad misunderstanding or miscommunication, sorry about that. I'm trying to figure out how we crossed wires like that. Once I have my facts straight I will correct ASAP.
  • Laura Merling
    Hi Anthony, great article! Yes, the NYTimes folks built their APIs themselves and we (Mashery) provide the infrastructure for key issuing, rate limiting, throttling, etc.

    A big congrats to the NYTimes dev team! They have done an amazing job with their API and they are breaking ground for the industry. It is fun to watch and to be a part of it!
  • The New York Times has done an excellent job with their new API search site. What a great resource for anyone who is trying to research Times articles.
    Mike K from: http://www.homeloanmortgagemodification.com