EpidemicsDisease can be monitored, predicted by analyzing views of Wikipedia articles

Published 29 December 2014

Scientists can now monitor and forecast diseases around the globe more effectively by analyzing views of Wikipedia articles. Researchers were able successfully to monitor influenza in the United States, Poland, Japan and Thailand, dengue fever in Brazil and Thailand, and tuberculosis in China and Thailand. They were also able to forecast all but one of these, tuberculosis in China, at least twenty-eight days in advance.

Scientists can now monitor and forecast diseases around the globe more effectively by analyzing views of Wikipedia articles, according to a team from Los Alamos National Laboratory.

“A global disease-forecasting system will improve the way we respond to epidemics,” scientist Sara Del Valle said. “In the same way we check the weather each morning, individuals and public health officials can monitor disease incidence and plan for the future based on today’s forecast.”

An LANL release reports that Del Valle and her team were able successfully to monitor influenza in the United States, Poland, Japan and Thailand, dengue fever in Brazil and Thailand, and tuberculosis in China and Thailand. The team was also able to forecast all but one of these, tuberculosis in China, at least twenty-eight days in advance. This means, Del Valle said, that people start searching for disease-related information on Wikipedia before they seek medical attention.

The paper also shows the potential to transfer models across different regions; that is, one can “train” a computer model using public health data in one location and use it in another. For example, researchers could create models using data from Japan to track and forecast disease in Thailand. This is particularly important for countries that do not offer reliable disease data.

“The goal of this research is to build an operational disease monitoring and forecasting system with open data and open source code,” Del Valle said. “This paper shows we can achieve that goal.”

— Read more in Nicholas Generous et al., “Global Disease Monitoring and Forecasting with Wikipedia,” PLOS Computational Biology (13 November 2014) (DOI: 10.1371/journal.pcbi.1003892)