
Just add humans: Oxford medical study underscores the missing link in chatbot testing
Headlines have been blaring it for years: Large language models (LLMs) can not only pass medical licensing exams but also outperform humans. GPT-4 could correctly answer U.S. medical exam licensing questions 90% of the time, even in the prehistoric AI days of 2023. Since then, LLMs have gone on to best the residents taking those exams and licensed physicians.

The 3 biggest bombshells from this week’s AI extravaganza
Basketball has March Madness. Tech has the Consumer Electronics Show. AI has been waiting for its big moment—and this week may finally be it.