Data mining framework for protein function prediction
Determining the functions of uncharacterized proteins from sequences remains a challenge despite the growth of the number of prediction methods. This is due to the nature of the inherent limitations of current tools and databases and the ambiguity of the function definition. Additionally, standard m...
Published in: | Proceedings - International Symposium on Information Technology 2008, ITSim |
---|---|
Main Author: | |
Format: | Conference paper |
Language: | English |
Published: |
2008
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-57349104125&doi=10.1109%2fITSIM.2008.4631683&partnerID=40&md5=e85c28539d521ebfd983e31610318d89 |
id |
2-s2.0-57349104125 |
---|---|
spelling |
2-s2.0-57349104125 Rahman S.A.; Hussein Z.A.M.; Bakar A.A. Data mining framework for protein function prediction 2008 Proceedings - International Symposium on Information Technology 2008, ITSim 2 10.1109/ITSIM.2008.4631683 https://www.scopus.com/inward/record.uri?eid=2-s2.0-57349104125&doi=10.1109%2fITSIM.2008.4631683&partnerID=40&md5=e85c28539d521ebfd983e31610318d89 Determining the functions of uncharacterized proteins from sequences remains a challenge despite the growth of the number of prediction methods. This is due to the nature of the inherent limitations of current tools and databases and the ambiguity of the function definition. Additionally, standard methods of functional assignment involve sequence alignment to a gene function often fail to find the significant matches. This paper proposes a framework of machine learning method in predicting protein function irrespective of sequence similarity. The framework aims to provide a workflow on predicting protein function that combines both data mining and machine learning algorithms. Three main components are involved: pre-processing, model development and testing & evaluation. The study is expected to create a new method on feature selection processes towards predicting protein functional classes in addition to complementing the existing conventional method of functional assignment. © 2008 IEEE. English Conference paper |
author |
Rahman S.A.; Hussein Z.A.M.; Bakar A.A. |
spellingShingle |
Rahman S.A.; Hussein Z.A.M.; Bakar A.A. Data mining framework for protein function prediction |
author_facet |
Rahman S.A.; Hussein Z.A.M.; Bakar A.A. |
author_sort |
Rahman S.A.; Hussein Z.A.M.; Bakar A.A. |
title |
Data mining framework for protein function prediction |
title_short |
Data mining framework for protein function prediction |
title_full |
Data mining framework for protein function prediction |
title_fullStr |
Data mining framework for protein function prediction |
title_full_unstemmed |
Data mining framework for protein function prediction |
title_sort |
Data mining framework for protein function prediction |
publishDate |
2008 |
container_title |
Proceedings - International Symposium on Information Technology 2008, ITSim |
container_volume |
2 |
container_issue |
|
doi_str_mv |
10.1109/ITSIM.2008.4631683 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-57349104125&doi=10.1109%2fITSIM.2008.4631683&partnerID=40&md5=e85c28539d521ebfd983e31610318d89 |
description |
Determining the functions of uncharacterized proteins from sequences remains a challenge despite the growth of the number of prediction methods. This is due to the nature of the inherent limitations of current tools and databases and the ambiguity of the function definition. Additionally, standard methods of functional assignment involve sequence alignment to a gene function often fail to find the significant matches. This paper proposes a framework of machine learning method in predicting protein function irrespective of sequence similarity. The framework aims to provide a workflow on predicting protein function that combines both data mining and machine learning algorithms. Three main components are involved: pre-processing, model development and testing & evaluation. The study is expected to create a new method on feature selection processes towards predicting protein functional classes in addition to complementing the existing conventional method of functional assignment. © 2008 IEEE. |
publisher |
|
issn |
|
language |
English |
format |
Conference paper |
accesstype |
|
record_format |
scopus |
collection |
Scopus |
_version_ |
1809677613709393920 |