Element AI today released a search tool that combs the COVID-19 Open Research Dataset, a repository of over 44,000 scholarly articles about COVID-19 and related coronaviruses, for papers that researchers might find useful. Users can enter natural language terms, phrases, and keywords to surface articles that contain semantically similar content, or paste paragraphs of text or questions into the search bar to return articles with only the most important sentences highlighted.
A deluge of studies on the novel coronavirus, which is projected to sicken millions of people, has hit the web in the months since the outbreak began. (According to Reuters, at least 153 preprint studies about COVID-19 have been made publicly available as of March 24.) They promise insights into the virus’ spread, but many haven’t been peer-reviewed, making it difficult for stakeholders to sort the wheat from the chaff.
To help with that sorting, Element AI’s tool draws on technology from the company’s Knowledge Scout product, which uses AI to capture the relationships between different pieces of information so that it learns and improves over time while building a repository of tacit knowledge. Element AI says the platform will be progressively updated in the coming weeks with additional COVID-19 data sets, along with features that include open-domain question-answering capabilities, query-driven summarization, and topic discovery.
The launch of Element AI’s platform follows that of Vespa’s CORD-19 Search, which similarly trawls the COVID-19 Open Research Dataset for vetted research papers. For its part, Korea University’s DMIS Lab this month released Covidsearch, which provides real-time question-answering on 31,000 COVID-19-related articles with results that highlight relevant biomedical entities. And the Allen Institute for AI offers a no-frills platform that searches the full text of the COVID-19 Open Research Dataset.
The AI underpinning these and other COVID-19 search tools learns from signals (i.e., data derived from various inputs). Each signal informs the system’s predictions, so that over time it learns which resources are relevant to a given search query and which are not. Natural language processing enables the system to understand a piece of research in the context of a data set. Natural language search, a specialized application of AI that creates a “word mesh” from free-flowing text, akin to a knowledge graph, connects similar concepts that are related to larger ideas so that the same answer is returned regardless of how a query is phrased.
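The idea of returning the same answer however a query is phrased can be illustrated with a toy sketch. The Python below is not Element AI’s method or that of any tool named here. It assumes a tiny hand-written concept map (`CONCEPTS`), a made-up stopword list, and two invented paper titles, and it ranks documents by cosine similarity over concept counts. Production systems would typically rely on learned representations rather than a lookup table.

```python
import math
import re
from collections import Counter

# Hypothetical concept map: different surface words point to one shared concept.
CONCEPTS = {
    "spread": "transmission", "spreads": "transmission",
    "transmission": "transmission", "contagion": "transmission",
    "vaccine": "vaccine", "vaccines": "vaccine", "immunization": "vaccine",
    "coronavirus": "coronavirus", "covid": "coronavirus", "sars": "coronavirus",
}

# Illustrative stopword list; real systems use far larger ones.
STOPWORDS = {"how", "does", "the", "of", "in", "for", "a", "is"}

# Invented documents, used only for this example.
DOCS = {
    "paper_a": "Airborne transmission of the coronavirus in enclosed spaces",
    "paper_b": "Early vaccine candidates for the coronavirus",
}


def concept_vector(text):
    """Lowercase and tokenize the text, then count concepts instead of raw words."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return Counter(CONCEPTS.get(t, t) for t in tokens if t not in STOPWORDS)


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a if k in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def search(query, docs):
    """Return document ids ordered from most to least similar to the query."""
    q = concept_vector(query)
    return sorted(docs, key=lambda d: cosine(q, concept_vector(docs[d])), reverse=True)


if __name__ == "__main__":
    # Two differently phrased queries about the same concept.
    print(search("how does covid spread", DOCS))
    print(search("coronavirus contagion", DOCS))
```

Because both queries collapse to the same concepts ("coronavirus" and "transmission"), they rank the transmission paper first even though they share no words with each other.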
It’s too soon to say how big an impact semantic search tools might have on ongoing COVID-19 research, but they could help weed out the more questionable research that has come to light. One such paper suggests a link between the new coronavirus and HIV, while another claims the virus came from outer space.