Internet crawling helps in tracking infectious disease outbreaks

Published 9 July 2008

New Web crawling tool helps identify and locate outbreaks of disease around the world

Could Internet discussion forums, listservs, and online news outlets be an informative source of information on disease outbreaks? A team of researchers from Children’s Hospital Boston and Harvard Medical School thinks so, and it has launched a real-time, automated data-gathering system called HealthMap to gather, organize and disseminate this online intelligence. “Web-based electronic information sources,” say John Brownstein and colleagues from the HealthMap project, “can play an important role in early event detection and support situational awareness by providing current, highly local information about outbreaks, even from areas relatively invisible to traditional global public health efforts.” Information overload, however, and difficulties in distinguishing “signal from noise” pose substantial barriers to fully using this information. To overcome these problems, the authors created the freely accessible HealthMap Project, which they describe as a “multistream real-time surveillance platform that continually aggregates reports on new and ongoing infectious disease outbreaks.” These reports are organized and disseminated in a variety of ways, including creating disease maps and “situational awareness windows.”

Ultimately, say Brownstein and colleagues, the use of news media and other nontraditional sources of surveillance data can “facilitate early outbreak detection, increase public awareness of disease outbreaks prior to their formal recognition, and provide an integrated and contextualized view of global health information.”