Show simple item record

dc.contributor.authorRout, R.K.
dc.contributor.authorUmer, S.
dc.contributor.authorKhandelwal, M.
dc.contributor.authorPati, S.
dc.contributor.authorMallik, S.
dc.contributor.authorBalabantaray, B.K.
dc.contributor.authorQin, H.
dc.date.accessioned2024-08-05T18:23:28Z
dc.date.available2024-08-05T18:23:28Z
dc.date.issued2023-04-19
dc.identifier.citationRout RK, Umer S, Khandelwal M, Pati S, Mallik S, Balabantaray BK and Qin H (2023) Identification of discriminant features from stationary pattern of nucleotide bases and their application to essential gene classification. Front. Genet. 14:1154120. doi: 10.3389/fgene.2023.1154120
dc.identifier.issn1664-8021
dc.identifier.doi10.3389/fgene.2023.1154120
dc.identifier.urihttp://hdl.handle.net/10150/673653
dc.description.abstractIntroduction: Essential genes are essential for the survival of various species. These genes are a family linked to critical cellular activities for species survival. These genes are coded for proteins that regulate central metabolism, gene translation, deoxyribonucleic acid replication, and fundamental cellular structure and facilitate intracellular and extracellular transport. Essential genes preserve crucial genomics information that may hold the key to a detailed knowledge of life and evolution. Essential gene studies have long been regarded as a vital topic in computational biology due to their relevance. An essential gene is composed of adenine, guanine, cytosine, and thymine and its various combinations. Methods: This paper presents a novel method of extracting information on the stationary patterns of nucleotides such as adenine, guanine, cytosine, and thymine in each gene. For this purpose, some co-occurrence matrices are derived that provide the statistical distribution of stationary patterns of nucleotides in the genes, which is helpful in establishing the relationship between the nucleotides. For extracting discriminant features from each co-occurrence matrix, energy, entropy, homogeneity, contrast, and dissimilarity features are computed, which are extracted from all co-occurrence matrices and then concatenated to form a feature vector representing each essential gene. Finally, supervised machine learning algorithms are applied for essential gene classification based on the extracted fixed-dimensional feature vectors. Results: For comparison, some existing state-of-the-art feature representation techniques such as Shannon entropy (SE), Hurst exponent (HE), fractal dimension (FD), and their combinations have been utilized. Discussion: An extensive experiment has been performed for classifying the essential genes of five species that show the robustness and effectiveness of the proposed methodology. Copyright © 2023 Rout, Umer, Khandelwal, Pati, Mallik, Balabantaray and Qin.
dc.language.isoen
dc.publisherFrontiers Media S.A.
dc.rights© 2023 Rout, Umer, Khandelwal, Pati, Mallik, Balabantaray and Qin. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY).
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subjectclassification
dc.subjectco-occurrence matrix
dc.subjectDNA
dc.subjectessential genes
dc.subjectfeature analysis
dc.titleIdentification of discriminant features from stationary pattern of nucleotide bases and their application to essential gene classification
dc.typeArticle
dc.typetext
dc.contributor.departmentDepartment of Pharmacology and Toxicology, University of Arizona
dc.identifier.journalFrontiers in Genetics
dc.description.noteOpen access journal
dc.description.collectioninformationThis item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at repository@u.library.arizona.edu.
dc.eprint.versionFinal Published Version
dc.source.journaltitleFrontiers in Genetics
refterms.dateFOA2024-08-05T18:23:28Z


Files in this item

Thumbnail
Name:
fgene-14-1154120.pdf
Size:
1.583Mb
Format:
PDF
Description:
Final Published Version

This item appears in the following Collection(s)

Show simple item record

© 2023 Rout, Umer, Khandelwal, Pati, Mallik, Balabantaray and Qin. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY).
Except where otherwise noted, this item's license is described as © 2023 Rout, Umer, Khandelwal, Pati, Mallik, Balabantaray and Qin. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY).