Accurate prediction of protein function using GOstruct

dc.contributor.author: Sokolov, Artem, author
dc.contributor.author: Ben-Hur, Asa, advisor
dc.contributor.author: Anderson, Chuck, committee member
dc.contributor.author: McConnell, Ross M., committee member
dc.contributor.author: Wang, Haonan, committee member
dc.date.accessioned: 2007-01-03T05:49:30Z
dc.date.available: 2007-01-03T05:49:30Z
dc.date.issued: 2011
dc.description.abstract: With the growing number of sequenced genomes, automatic prediction of protein function is one of the central problems in computational biology. Traditional methods employ transfer of functional annotation on the basis of sequence or structural similarity and are unable to effectively deal with today's noisy high-throughput biological data. Most of the approaches based on machine learning, on the other hand, break the problem up into a collection of binary classification problems, effectively asking the question "does this protein perform this particular function?"; such methods often produce a set of predictions that are inconsistent with each other. In this work, we present GOstruct, a structured-output framework that answers the question "what function does this protein perform?" in the context of hierarchical multilabel classification. We show that GOstruct is able to effectively deal with a large number of disparate data sources from multiple species. Our empirical results demonstrate that the framework achieves state-of-the-art accuracy in two of the recent challenges in automatic function prediction: Mousefunc and CAFA.
dc.format.medium: born digital
dc.format.medium: doctoral dissertations
dc.identifier: Sokolov_colostate_0053A_10688.pdf
dc.identifier.uri: http://hdl.handle.net/10217/52071
dc.language: English
dc.language.iso: eng
dc.publisher: Colorado State University. Libraries
dc.relation.ispartof: 2000-2019
dc.rights: Copyright and other restrictions may apply. User is responsible for compliance with all applicable laws. For information about copyright law, please see https://libguides.colostate.edu/copyright.
dc.subject: machine learning
dc.subject: protein function prediction
dc.title: Accurate prediction of protein function using GOstruct
dc.type: Text
dcterms.rights.dpla: This Item is protected by copyright and/or related rights (https://rightsstatements.org/vocab/InC/1.0/). You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s).
thesis.degree.discipline: Computer Science
thesis.degree.grantor: Colorado State University
thesis.degree.level: Doctoral
thesis.degree.name: Doctor of Philosophy (Ph.D.)

Files

Original bundle
Name: Sokolov_colostate_0053A_10688.pdf
Size: 1.39 MB
Format: Adobe Portable Document Format