Through University Faculty Disease Spread Through University Faculty Jocelyn Bell, Bethany Kubik and Brian Macdonald United States Military Academy Abstract Simulation We modeled the network that exists among the teaching staff at the United States Military Academy. We used the network to study a hypothetical disease outbreak among the staff using a computer simulation. We used this information to understand the nature of how the disease might spread. We also used classic network science methods together with the simulation results to determine which faculty members would be likely to infect the greatest number of people. The program that we ran was written in R and simulated an SIR (susceptible infected recovery) situation. We began by assuming that a single node was infected. In each step the program determined through probability if the connected nodes were then infected. The program ran until no nodes were infected. We ran the simulation one hundred times for each node and computed the average number of nodes infected by the original infected node. We determined that if the probability of infection was less that 37%, then the disease would likely die out within one or two steps. This percentage is known as the epidemic threshold and the average infected nodes were calculated using this probability. Background We created a network of the faculty in the mathematics department at the United States Military Academy. A node represents a single individual. Edges represent shared office space or shared classroom space. Our goal is to understand how a disease, such as a cold or flu virus, spreads among faculty that share offices or classrooms. We created the network below for analysis and simulation. Results The simulation ranked the faculty in order of the average number of people they infected. The following table lists the ten faculty with the highest rank and the numerical value of the average number of people each faculty member infected. Network The data for this was taken from the Fall 2012 classroom list and the Fall 2012 phone list (for office data). Some changes were made to the classroom schedule after this list was made. We decided to stick with the original list since there were additional changes made mid semester. There were a few mislabeled offices in the phone list which we rectified. We defined an office to mean the room where a faculty member’s working desk is located. There are 73 vertices in the graph. We compared the nodes of the above list with the nodes in the top ten in each centrality measure category. To find which centrality measure best predicted a high infection rate, we counted the number of nodes that appeared on both lists. This is displayed as a percentage in the table below. Next we determined the correlation coefficients between the network centrality measures and the average number of infected nodes. The results are tabulated in the second line of the table above. Eigenvector and Degree are the most highly correlated; visual representations of these correlations appear in the scatter plots below. Centrality We computed standard centrality measures from the network with the following results. It should be noted that for some of the centrality measures a tie was produced. In those cases we listed the individuals alphabetically. Future Work Although shared office and classroom space is a good indicator of the probability of a disease communication, we can do better. Our next step is to include edges between the faculty that teach the same course and meet at least once a week. From this increase in data, it would be wise to weight the edges since multiple connections may exist between two individuals. Ultimately, we plan to apply this method to the personnel at a hospital instead of a school. This would allow us to pinpoint nodes, that is, staff and patients, that have a high probability of infecting others. We could then discern what measures and precautions could be taken to avoid the spreading of a disease.