C. M. Rivers, PhD MPH
Network Dynamics and Simulation Science Laboratory, Virginia Bioinformatics Institute, Virginia Tech
I present case tree plots and checkerboard plots for visualizing contagions. The visualizations are best suited for diseases like SARS, MERS-CoV and H7N9 for which there are a limited (less than 200) number of cases, with data available on human to human transmission. They a) allow for easy estimation of epidemiological parameters like basic reproduction number b) indicate the frequency of introductory events, e.g. spillovers in the case of zoonoses c) represent patterns of case attributes like patient sex both by generation and over time.
Zoonoses represent an estimated 58% of all human infectious disease pathogens, and 73% of emerging infectious pathogens (Woolhouse 2005). Careful tracking of zoonotic disease is a major focus of global public health protection strategy. Recent examples of zoonotic outbreaks include Severe Acute Respiratory Syndrome (SARS), H1N1, and Middle East Respiratory Syndrome (MERS-CoV), which have caused thousands of deaths combined (Christian 2004, Domínguez-Cherit 2009, World Health Organization 2014). Early identification of new outbreaks is critical to successful containment of these diseases.
The current toolkit for visualizing data from these emerging diseases is limited. One popular option is the the epidemic curve, which is a histogram of new cases over time. Epidemic curves are limited in that they do not indicate how cases are related to one another, nor can they represent the presence of an animal source. Network diagrams are a useful though less popular option. These diagrams can depict individual human clusters, but often do not have a time component, and cannot represent constellations of unconnected clusters. Furthermore, network diagrams typically require complete information about the structure of the transmission tree. Here we introduce case tree plots and checkerboard plots to address those weaknesses and more clearly represent zoonotic outbreaks.
I present two new visualizations, case tree plots and checkerboard plots, for visualizing emerging zoonoses. Code for the plots are available in the open source python package epipy, which is available on github at www.github.com/cmrivers/epipy. The documentation is avaliable at www.cmrivers.github.io/epipy. Epipy relies heavily on the networkx (Hagberg 2008) and pandas (McKinney 2010) packages. In addition to the visualizations introduced here, epipy includes a number of functions for common epidemiology calculations, like odds ratio and relative risk. A function that generates realistic example data is also provided. All plots, data and tables in this manuscript were generated using epipy.