Document Type : Original Research Paper

Authors

1 Department of Computer and Information Technology Engineering, Payame Noor University (PNU), Tehran, Iran.

2 Department of Information Technology, Faculty of Industrial and Systems Engineering, Tarbiat Modares University (TMU), Tehran, Iran

3 Department of Data Science, Tarbiat Modares University (TMU), Tehran, Iran.

Abstract

Background and Objectives: One of the important topics in oncology treatment and prevention is the identification of genes that initiate cancer in cells. These genes are known as cancer driver genes (CDGs). Identification of the CDGs is important both for a basic understanding of cancer and to help find new therapeutic or biomarker goals. Several computational methods to find the genes responsible for cancer have been developed based on genome data. However, many of these methods find key mutations in genomic data to predict which genes are responsible for cancer. These methods depend on the mutation and genome data and often show a high rate of false positives in the results. In this study, we proposed an influence maximization-based approach, CinfuMax, which can detect the genes responsible for cancer without needing information on mutations.
Methods: In this method, the concept of influence maximization and the independent cascade model are employed. Firstly, the gene regulatory network for breast, lung and colon cancers was built using regulatory interactions and gene expression data. Next, we implemented an independent cascade diffusion algorithm on the networks to compute each gene's coverage. Finally, the genes with the highest coverage were classified as driver.
Results: The results of the proposed method were compared to 19 other computational and network-based methods based on the F-measure and the number of detected driver genes. The results demonstrated that the proposed method produces better results than other methods. Also, CinfuMax is able to detect 18, 19 and 22 individual driver genes in three breast, lung and colon cancers, respectively, which have not been identified in any of the previous methods.
Conclusion: The results show that independent cascading methods to identify driver genes perform better than linear threshold methods. Driver genes are also classified in terms of influence speed and have identified the genes with the highest diffusion rate in each type of cancer. Identification of these genes can be useful for molecular therapies and drug purposes.

Keywords

Main Subjects

Open Access

This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit: http://creativecommons.org/licenses/by/4.0/

 

Publisher’s Note

JECEI Publisher remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

 

Publisher

Shahid Rajaee Teacher Training University


LETTERS TO EDITOR

Journal of Electrical and Computer Engineering Innovations (JECEI) welcomes letters to the editor for the post-publication discussions and corrections which allows debate post publication on its site, through the Letters to Editor. Letters pertaining to manuscript published in JECEI should be sent to the editorial office of JECEI within three months of either online publication or before printed publication, except for critiques of original research. Following points are to be considering before sending the letters (comments) to the editor.


[1] Letters that include statements of statistics, facts, research, or theories should include appropriate references, although more than three are discouraged.

[2] Letters that are personal attacks on an author rather than thoughtful criticism of the author’s ideas will not be considered for publication.

[3] Letters can be no more than 300 words in length.

[4] Letter writers should include a statement at the beginning of the letter stating that it is being submitted either for publication or not.

[5] Anonymous letters will not be considered.

[6] Letter writers must include their city and state of residence or work.

[7] Letters will be edited for clarity and length.

CAPTCHA Image