Ве молиме користете го овој идентификатор да го цитирате или поврзете овој запис:
http://hdl.handle.net/20.500.12188/22915
Наслов: | Hierarchical protein classification based on gene ontology and decision trees | Authors: | Ivanoska, Ilinka Trivodaliev, Kire Kalajdziski, Slobodan Mirceva, Georgina |
Keywords: | C4.5 Classification, Gene ntology, Protein function prediction | Issue Date: | 2010 | Conference: | ICT Innovations 2010 | Abstract: | Proteins are the most important cell parts, therefore, knowing their exact function is of a great significance. However, the function of large amount of proteins is still unknown. In addition, today, biologists persist on hierarchical organization the living world, and thus in protein databases also. There are many protein classification algorithms proposed determining the protein function, but, only a few of them take into consideration these hierarchical structures. The Gene Ontology (GO) is a protein and gene database structured as a controlled hierarchical vocabulary of terms to describe protein functions. This paper introduces a new hierarchical multi-label protein classifier that uses the relationships among the GO terms. First, protein descriptors are extracted from the structural coordinates stored in the Protein Data Bank (PDB) files. Then, a modified C4.5 algorithm is applied to select the most appropriate descriptor features for protein classification based on the GO hierarchy. An evaluation of this approach is presented, and the results show that the hierarchical structure of GO is important for improving the accuracy of the classification problem at higher levels. | URI: | http://hdl.handle.net/20.500.12188/22915 |
Appears in Collections: | Faculty of Computer Science and Engineering: Conference papers |
Files in This Item:
File | Опис | Size | Format | |
---|---|---|---|---|
DraskoNakikICTIn.pdf | 7.48 MB | Adobe PDF | View/Open |
Записите во DSpace се заштитени со авторски права, со сите права задржани, освен ако не е поинаку наведено.