COVID-19 Open Research Dataset (CORD-19)

Dataset combining all scholarly articles about or related to COVID-19.
United States


In response to the COVID-19 pandemic, the Allen Institute for AI has partnered with leading research groups to prepare and distribute the COVID-19 Open Research Dataset (CORD-19). This free resource contains over 47,000 scholarly articles, including over 36,000 with full text, about COVID-19 and the coronavirus family of viruses, and is provided for use by the global research community.

This dataset is intended to mobilize researchers to apply recent advances in natural language processing to generate new insights in support of the fight against this infectious disease. The corpus will be updated weekly.

Language: English


Data Provider
Allen Institute for AI