Composition of Feature Relevancy Based Biomarker Gene Selection in Gene Expression Dataset
1M.Pyingkodi, Department of Computer Applications, Kongu Engineering College, Erode, Tamilnadu, India.
2Dr. S.Shanthi, Department of Computer Applications, Kongu Engineering College, Erode, Tamilnadu, India.
Manuscript received on November 20, 2019. | Revised Manuscript received on November 28, 2019. | Manuscript published on 30 November, 2019. | PP: 7480-7484 | Volume-8 Issue-4, November 2019. | Retrieval Number: D5324118419/2019©BEIESP | DOI: 10.35940/ijrte.D5324.118419
Open Access | Ethics and Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: Cancer gene selection plays a prominent work in the area of Bioinformatics. Gene selection methods aim to retain relevant genes and remove redundant genes. This proposed technique deals on gene selection techniques based on information theory. By investigating the information theory based on composition of feature relevancy, we consider that a excellent gene technique method could boost novel classification of the cancer gene data while reducing gene redundancy. Therefore, a modified gene selection technique called Composition of Feature Relevancy (CFR) is carried out. To assess CFR, the experiments are carrying out on five real-world cancer gene expression data sets and three best classifiers (KNN, Support Vector Machine and Random forest). The modified gene selection technique gives best outcome when competing to other recent technique in terms of accuracy and sensitivity in classification.
Keywords: Gene Selection, Composition of Feature Relevancy, Information Theory, Classification, Random forest, Cancer.
Scope of the Article: Classification.