Volume 21, Issue 10 (October 2023)                   IJRM 2023, 21(10): 809-818 | Back to browse issues page


XML Persian Abstract Print


Download citation:
BibTeX | RIS | EndNote | Medlars | ProCite | Reference Manager | RefWorks
Send citation to:

Asadzadeh A, Ghorbani N, Dastan K. Identification of druggable hub genes and key pathways associated with cervical cancer by protein-protein interaction analysis: An in silico study. IJRM 2023; 21 (10) :809-818
URL: http://ijrm.ir/article-1-3133-en.html
1- Department of Biology, Faculty of Science, Nour Danesh Institute of Higher Education, Meymeh, Isfahan, Iran. , az.asadzadeh@gmail.com
2- Department of Microbiology, Faculty of Basic Sciences, Lahijan Branch, Islamic Azad University, Lahijan, Iran.
Full-Text [PDF 1631 kb]   (494 Downloads)     |   Abstract (HTML)  (499 Views)
Full-Text:   (84 Views)

1. Introduction
Cervical cancer (CC) is the fourth most common cancer among women worldwide (1). According to the database of research on cancer, CC affects more than half a million women and causes 0.3 million deaths annually (1, 2).
CC affects the lower part of the uterus, which is connected to the vagina. The progression of CC occurs slowly, taking up to 10 yr to become a precancerous lesion. On the other hand, the asymptomatic nature of CC makes patients unaware of it. Therefore, patients should perform a cervical screening test to discover the presence of cancer cells (1-3).
Symptoms of CC in advanced stages include vaginal bleeding, pelvic pain, prolonged menstrual bleeding, and pain during intercourse (3, 4). The main methods of treating CC are surgery and radiotherapy, however it may lead to posttreatment problems such as recurrence, metastasis, and drug resistance (2, 3).
Emphasized factors in the development of CC include co-infection with the pathogen, reproductive factors, increasing gene expression, or in some cases, decreasing gene expression in the cervix cell line, sexual behavior, obesity, smoking, and long-term use of hormones, and prevention of pregnancy (5).
The mechanisms involved in developing CC are very complex, and hub genes, RNAs, and different signaling pathways are related to it (6-8). Krüppel-like factor 4 and estrogen receptor 1 are closely related to the poor prognosis of patients with CC. Endothelin-3 and endothelin B receptors may play an important role in CC development (8). In a protein network, some nodes are defined as genes with high correlation in candidate modules, which are called hub genes. CDC45, GINS2, MCM2, and PCNA, have been reported as hub genes that are associated with the prognosis of CC patients (7).
One of the best biological networks that recently received much attention is protein-protein interaction (PPI) network analysis. This is an in silico method to identify proteins associated with various diseases. This method presents the useful information about how proteins interact with each other and their roles (9, 10).
Detection of differentially expressed genes (DEGs) in the first step of PPI analysis allows the identification of key biomarkers, which helps in the early diagnosis and treatment of CC and can potentially increase the patient's life expectancy. In addition, gene expression data helps identify pathways and molecular information to select effective targets for drug design (9, 11).
Our present study uses gene expression patterns and PPI network analysis to identify druggable hub genes and molecular pathways involved in CC.

2. Materials and Methods
2.1. Selection of gene expression dataset
This study was conducted by in silico analysis. Gene expression omnibus (GEO) is a widely used database for gene expression and RNA methylation profiling (12). In this database, searches were limited by study keyword, type of organism, study type, and entry type. In this step, 2 gene expression datasets for CC were obtained for further analysis.

2.2. Data analysis and DEGs identification
DEGs between CC and normal samples in selected datasets were analyzed by GEO2R separately. Adjusted p-value < 0.05 as the cut-off criteria was determined. Up-regulated and down-regulated genes were selected based on log2 (FC) value > 1 and log2 (FC) value < -1, respectively. In the next step to exhibit the overlap of DEGs between the 2 datasets, a Venn diagram was drawn by Funrich software for up-regulated and down-regulated genes.

2.3. Gene ontology (GO) and Kyoto encyclopedia of genes and genomes (KEGG) pathway analysis
GO assessment and KEGG pathway analysis were performed by EnrichR (13). For this purpose, all common up-regulated and down-regulated genes were used as input data in EnrichR.

2.4. PPI network analysis
Online software STRING 12.0 was used to generate a PPI (14). To construct the network by STRING, interaction scores of < 0.4 were selected, and disconnected nodes were deleted. Finally, the network was formed by using text mining, neighborhood, experiments, gene fusion, databases, co-occurrence, and co-expression as active interaction sources. In the next step, for the PPI network visualization, Cytoscape software version 3.9.1 was used, and 10 key genes were detected by the CytoHubba plugin.

2.5. Expression levels of hub genes in CC
To validate the differential expression levels of mRNA between CC and normal tissue, the online gene expression profiling interactive analyzer database was used.

2.6. Candidate drugs for druggable hub genes
Obtained hub genes were analyzed to search for their target drugs in the drug gene interaction database (https://www.dgidb.org/). Drug gene interaction databases from different sources, including ChEMBL, DrugBank, Ensembl, NCBI Entrez, and PharmGKB, were used to obtain candidate drugs (15, 16).

3. Results
3.1. Selection of gene expression dataset
Based on the following filtration: CC (study keyword), Homo sapiens (organism), expression profiling by an array (study type) and series (entry type), 2 datasets of GSE63514 and GSE9750 were obtained from NCBI-GEO. GSE63514 was based on GPL570 platforms (Affymetrix Human Genome U133 Plus 2.0 Array) and comprised 128 cervical specimens (104 cases and 24 controls), and GSE9750 was based on GPL96 platforms (Affymetrix Human Genome U133A Array) comprising 66 samples (42 cases and 24 controls).

3.2. Data analysis and DEG identification
2 datasets of GSE63514 and GSE9750 were analyzed by GEO2R, separately. After ensuring the normal distribution of the data in the box plots, up- and down-regulated genes were identified. 1374 and 1671 genes in GSE63514, and GSE9750 were up-regulated, respectively. Among the studied genes, 2486 genes in GSE63514 and 889 genes in the GSE9750 dataset were down-regulated. Funrich software showed that 475 over-expressed genes and 492 down-regulated genes were common among GSE63514, and GSE9750 (Figure 1).

3.3. GO and KEGG pathway analysis
The EnrichR database performed GO assessment and KEGG analysis. Table I shows the top 2 enriched GO biological processes and KEGG pathways. The common up-regulated DEGs were highly clustered in keratinocyte differentiation (GO: 0030216) and epidermal cell differentiation (GO: 0009913). The common down-regulated gene is involved in the DNA metabolic process (GO: 0006259) and DNA replication (GO: 0006260). KEGG 2021 human analyses revealed that common up-regulated DEGs were significantly enriched in the arachidonic acid metabolism and chemical carcinogenesis. KEGG pathways for down-regulated genes were cell cycle and DNA replication.

3.4. PPI network analysis
Results from the PPI network analysis of the STRING showed 924 nodes and 13,688 edges (Figure 2A). Output DSV file of STRING visualized by Cytoscape 3.9.1 and analyzed. In PPI network with 924 nodes and 13,688 edges, 10 hub genes were determined by the CytoHubba plugin, which includes NCAPG, KIF11, TTK, PBK, MELK, ASPM, TPX2, BUB1, TOP2A, and KIF2C respectively, based on the rank (Figure 2B).

3.5. Expression levels of hub genes in CC
To analyze the differential expression levels of the hub genes identified by the CytoHubba plugin, GEPIA was used. The mRNA expression levels of NCAPG, KIF11, TTK, PBK, MELK, ASPM, TPX2, BUB1, TOP2A, and KIF2C were increased in tumor tissue compared to those in normal tissues (num [T] = 306; num [N] = 13) (Figure 3).

3.6. Candidate drugs for druggable hub genes
The drug-gene interaction database is an online server that provides useful information about drug-gene interactions using publications, databases, and other sources (15, 16). The selected hub genes were subjected to the drug gene interaction database, and druggable hub genes were found. KIF11, TTK, PBK, MELK, and TOP2A were recognized as druggable genes among the studied genes. Candidate drugs are listed in table II.








4. Discussion
CC is a serious problem in women's health. The slow progress of this type of cancer and the absence of symptoms in the early stages can cause delayed onset in diagnosis. In diseases related to the uncontrolled proliferation of cells, the molecular study of carcinogenesis mechanisms is important to achieve early detection methods and prevention of metastasis.
In silico approaches help us to achieve important results with low cost and time. In previous studies, we have shown that bioinformatics methods are a suitable option in the design of enzyme inhibitors, drugs, and vaccines (17-21).
Microarray technology and PPI networks, as in silico methods, are used to investigate cancer biomarkers and cellular mechanisms. Using these methods makes it possible to analyze gene clusters whose expression decreases or increases simultaneously. In this study, gene expression patterns and PPI network were studied to obtain key pathways and druggable hub genes in CC.
Common DEGs were analyzed by comparing the 2 datasets, GSE63514, and GSE9750 of CC obtained from the GEO database. 475 over-expressed genes and 492 down-regulated genes were identified. According to GO and KEGG pathway analysis, most DEG genes participated in keratinocyte differentiation, epidermal cell differentiation, DNA metabolic process, arachidonic acid metabolism, chemical carcinogenesis, cell cycle, and DNA replication pathways. The most important cause of CC is related to human papillomavirus infection, which infects epithelial cells, so the virus replication cycle is closely related to the differentiation process of infected keratinocytes (22). Targeting the arachidonic acid metabolism was used as a therapeutic method against CC (23). Among all DEGs in the PPI network, 10 hub genes were determined, which includes NCAPG, KIF11, TTK, PBK, MELK, ASPM, TPX2, BUB1, TOP2A, and KIF2C. Their differential expression levels were validated by the CytoHubba plugin and GEPIA, respectively. The role of all these genes has been discussed as cancer biomarkers. NCAPG is a prominent molecular target in many types of cancers, and overexpression of this gene plays an important role in carcinogenesis and tumor progression (24).
KIF11 is a member of the kinesin family, and this gene has been over-expressed in tumor tissues. The functional study of KIF11 has shown that all stages of mitosis and cell division depend on it, and its main role is related to the formation and maintenance of bipolar spindle or cytokines (25). In a study by Zhou et al., on differentially expressed genes in adrenocortical carcinoma, they observed that compared to normal tissues, the expression of KIF11 is significantly increased in adrenocortical carcinoma samples (26). Another gene found as a key gene in this study is tyrosine/threonine kinase (TTK), which is also effective in breast cancer, according to previous reports. Mishra et al., reported that hub genes, including ASPM, BUB1, KIF2C, MELK, PBK, and TOP2A, are oncogenes, and their expression is increased in all hepatocellular carcinoma samples (27). Tpx2, in addition to regulating the mitotic spindle, plays an important role in cell-cycle kinase Aurora A activation, and its overexpression is associated with the development of various cancers (28). For this reason, in many studies, TPX2 has been proposed as a marker for the diagnosis and prognosis of malignancies (29). These research confirm the validity of the key genes obtained in this study.
Finally, among the studied genes, KIF11, TTK, PBK, MELK, and TOP2A were recognized as druggable genes. Most obtained drugs are inhibitors, and in many researches, their anti-cancer roles have been mentioned (30). Litronesib exerts its antitumor activity by selectively inhibiting the mitosis-specific kinesin Eg5 (31). Hesperadin is an aurora kinase inhibitor. Amrobicin and idronoxil are 2 potent inhibitors of topoisomerase II (32).
The most important drugs approved for the treatment of CC are bevacizumab, topotecan hydrochloride, pembrolizumab, and bleomycin sulfate. Bevacizumab is a class of monoclonal antibodies that work by binding to the VEGF protein, and it blocks the growth of blood vessels around the tumor. The antitumor mechanism of topotecan hydrochloride and bleomycin sulfate is related to binding to the DNA of cancer cells and other rapidly growing cells. Pembrolizumab binds to the PD-1 protein on the surface of T cells to attack and kill cancer cells (33-36).

5. Conclusion
This study identified 967 genes (475 over-expressed and 492 down-regulated genes) as DEGs in CC. Our results showed that key genes including NCAPG, KIF11, TTK, PBK, MELK, ASPM, TPX2, BUB1, TOP2A, and KIF2C might have effective roles in CC. In addition, drug compounds targeting the hub gene were identified. These drugs can potentially be used to treat patients with CC. However, future validation by both in vitro and in vivo studies is inevitable.

Acknowledgments
The authors are grateful for the useful guidance of Dr. Afshin Fasihi and for the support and encouragement of all members of the Isfahan Pharmaceutical Sciences Research Center, Isfahan, Iran.

Conflict of Interest
The authors declared that there is no conflict of interest.
Type of Study: Original Article | Subject: Reproductive Oncology

Send email to the article author


Rights and permissions
Creative Commons License This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Designed & Developed by : Yektaweb