Repository logo
 

Determining disease outbreak influence from voluminous epidemiology data on enhanced distributed graph-parallel system

dc.contributor.authorShah, Naman, author
dc.contributor.authorPallickara, Sangmi Lee, advisor
dc.contributor.authorPallickara, Shrideep, committee member
dc.contributor.authorTurk, Daniel E., committee member
dc.date.accessioned2017-09-14T16:06:49Z
dc.date.available2017-09-14T16:06:49Z
dc.date.issued2017
dc.description.abstractHistorically, catastrophe has resulted from large-scale epidemiological outbreaks in livestock populations. Efforts to prepare for these inevitable disasters are critical, and these efforts primarily involve the efficient use of limited available resources. Therefore, determining the relative influence of the entities involved in large-scale outbreaks is mandatory. Planning for outbreaks often involves executing compute-intensive disease spread simulations. To capture the probabilities of various outcomes, these simulations are executed several times over a collection of representative input scenarios, producing voluminous data. The resulting datasets contain valuable insights, including sequences of events that lead to extreme outbreaks. However, discovering and leveraging such information is also computationally expensive. This thesis proposes a distributed approach for aggregating and analyzing voluminous epidemiology data to determine the influential measure of the entities in a disease outbreak using the PageRank algorithm. Using the Disease Transmission Network (DTN) established in this research, planners or analysts can accomplish effective allocation of limited resources, such as vaccinations and field personnel, by observing the relative influential measure of the entities. To improve the performance of the analysis execution pipeline, an extension to the Apache Spark GraphX distributed graph-parallel system has been proposed.
dc.format.mediumborn digital
dc.format.mediummasters theses
dc.identifierShah_colostate_0053N_14415.pdf
dc.identifier.urihttps://hdl.handle.net/10217/184038
dc.languageEnglish
dc.language.isoeng
dc.publisherColorado State University. Libraries
dc.relation.ispartof2000-2019
dc.rightsCopyright and other restrictions may apply. User is responsible for compliance with all applicable laws. For information about copyright law, please see https://libguides.colostate.edu/copyright.
dc.subjectdistributed analytics
dc.subjectepidemiological PageRank
dc.subjectNAADSM influential analysis
dc.subjectenhanced distributed graph-parallel system
dc.subjectdisease propagation network
dc.subjectextended Apache Spark Graphx
dc.titleDetermining disease outbreak influence from voluminous epidemiology data on enhanced distributed graph-parallel system
dc.typeText
dcterms.rights.dplaThis Item is protected by copyright and/or related rights (https://rightsstatements.org/vocab/InC/1.0/). You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s).
thesis.degree.disciplineComputer Science
thesis.degree.grantorColorado State University
thesis.degree.levelMasters
thesis.degree.nameMaster of Science (M.S.)

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Shah_colostate_0053N_14415.pdf
Size:
4.26 MB
Format:
Adobe Portable Document Format