Data mining framework for protein function prediction

Determining the functions of uncharacterized proteins from sequences remains a challenge despite the growth of the number of prediction methods. This is due to the nature of the inherent limitations of current tools and databases and the ambiguity of the function definition. Additionally, standard m...

Full description

Bibliographic Details
Published in:Proceedings - International Symposium on Information Technology 2008, ITSim
Main Author: Rahman S.A.; Hussein Z.A.M.; Bakar A.A.
Format: Conference paper
Language:English
Published: 2008
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-57349104125&doi=10.1109%2fITSIM.2008.4631683&partnerID=40&md5=e85c28539d521ebfd983e31610318d89
id 2-s2.0-57349104125
spelling 2-s2.0-57349104125
Rahman S.A.; Hussein Z.A.M.; Bakar A.A.
Data mining framework for protein function prediction
2008
Proceedings - International Symposium on Information Technology 2008, ITSim
2

10.1109/ITSIM.2008.4631683
https://www.scopus.com/inward/record.uri?eid=2-s2.0-57349104125&doi=10.1109%2fITSIM.2008.4631683&partnerID=40&md5=e85c28539d521ebfd983e31610318d89
Determining the functions of uncharacterized proteins from sequences remains a challenge despite the growth of the number of prediction methods. This is due to the nature of the inherent limitations of current tools and databases and the ambiguity of the function definition. Additionally, standard methods of functional assignment involve sequence alignment to a gene function often fail to find the significant matches. This paper proposes a framework of machine learning method in predicting protein function irrespective of sequence similarity. The framework aims to provide a workflow on predicting protein function that combines both data mining and machine learning algorithms. Three main components are involved: pre-processing, model development and testing & evaluation. The study is expected to create a new method on feature selection processes towards predicting protein functional classes in addition to complementing the existing conventional method of functional assignment. © 2008 IEEE.


English
Conference paper

author Rahman S.A.; Hussein Z.A.M.; Bakar A.A.
spellingShingle Rahman S.A.; Hussein Z.A.M.; Bakar A.A.
Data mining framework for protein function prediction
author_facet Rahman S.A.; Hussein Z.A.M.; Bakar A.A.
author_sort Rahman S.A.; Hussein Z.A.M.; Bakar A.A.
title Data mining framework for protein function prediction
title_short Data mining framework for protein function prediction
title_full Data mining framework for protein function prediction
title_fullStr Data mining framework for protein function prediction
title_full_unstemmed Data mining framework for protein function prediction
title_sort Data mining framework for protein function prediction
publishDate 2008
container_title Proceedings - International Symposium on Information Technology 2008, ITSim
container_volume 2
container_issue
doi_str_mv 10.1109/ITSIM.2008.4631683
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-57349104125&doi=10.1109%2fITSIM.2008.4631683&partnerID=40&md5=e85c28539d521ebfd983e31610318d89
description Determining the functions of uncharacterized proteins from sequences remains a challenge despite the growth of the number of prediction methods. This is due to the nature of the inherent limitations of current tools and databases and the ambiguity of the function definition. Additionally, standard methods of functional assignment involve sequence alignment to a gene function often fail to find the significant matches. This paper proposes a framework of machine learning method in predicting protein function irrespective of sequence similarity. The framework aims to provide a workflow on predicting protein function that combines both data mining and machine learning algorithms. Three main components are involved: pre-processing, model development and testing & evaluation. The study is expected to create a new method on feature selection processes towards predicting protein functional classes in addition to complementing the existing conventional method of functional assignment. © 2008 IEEE.
publisher
issn
language English
format Conference paper
accesstype
record_format scopus
collection Scopus
_version_ 1809677613709393920