Abstract
We address the important bioinformatics problem of predicting protein function from a protein's primary sequence. We consider the functional classification of G-Protein-Coupled Receptors (GPCRs), whose functions are specified in a class hierarchy. We tackle this task using a novel top-down hierarchical classification system where, for each node in the class hierarchy, the predictor attributes to be used in that node and the classifier to be applied to the selected attributes are chosen in a data-driven manner. Compared with a previous hierarchical classification system selecting classifiers only, our new system significantly reduced processing time without significantly sacrificing predictive accuracy.
Original language | English |
---|---|
Pages (from-to) | 191-210 |
Number of pages | 20 |
Journal | International Journal of Data Mining and Bioinformatics |
Volume | 4 |
Issue number | 2 |
DOIs | |
Publication status | Published - Mar 2010 |
Keywords
- hierarchical classification
- supervised learning
- attribute selection
- feature selection
- classifier selection
- protein function prediction
- GPCR
- G-protein coupled receptor
- bioinformatics