Insights of Rv2921c (Ftsy) Gene of Mycobacterium tuberculosis H 37 Rv To Prove Its Significance by Computational Approach

After seeing the epidemic of tuberculosis in our country and across world according to WHO report, we right now present with an emergence of treatment/research against this disease. In this study, we therefore exaggerate some important aspects of ftsY (Rv2921c) gene of Mycobacterium tuberculosis ( M tuberculosis ) which is a GTP binding and hydrolyzing protein. This gene is 1269bp long and contains four GTP binding motif. The above said protein is involved in Signal Recognition Particle (SRP) pathway. The M tuberculosis SRP pathway comprises of two proteins ffh (Rv2916c), ftsY and a RNA subunit 4.5s RNA. This protein interacts with ffh (Rv2916c) gene and another predicted GTP binding protein known as Rv3362c. The said protein is an important part of protein export system which is an essential process for importing and exporting protein that are synthesized in cytoplasm to plasma membrane and other organelles. Thus, this protein might also be important for pathogenesis. This study enlists the effect of disruption of ftsY gene on its interaction with ffh and secretion system. Hence this study might be an important step in the way of eradication of this disease.


Introduction
Tuberculosis is a fatal disease which is broadly conveyed and open. It is caused by Mycobacterium tuberculosis H 37 Rv (M. tuberculosis H 37 Rv) which is a gram-positive and aerobic bacterium [1]. Pathogenic strain H37Rv of this bacterium is differing from other non-virulent strains like Mycobacterium smegmatis (M. smegmatis) in various means [2]. After crossing the respiratory tract, this bacterium permanently resides in the alveolar macrophages where they reside for long time without any hindrance by host immune system [3]. It is very important to decrease the level of this disaster which appears to be at its peak and to comprehend this situation, struggle is going on. As it is already known that, BCG is the only candidate vaccine available till now for the curative therapy of tuberculosis (TB). There are many more drugs used in the treatment of this disease but not as effective as BCG vaccine, although in some of the cases BCG also fails to protect an individual from the drastic nature of this disease [4]. In the year 2016 only, there were 6 million novel cases had been reported which develop resistance to Rifampicin (RRTB)-the most effective and first line drug to battle TB [5]. Although there were some strategies made by WHO to end this disease and the management also succeed to a minor level, we right now deal with the situation that need a quick treatment with respect to the total population and the environment of our nation. M. tuberculosis H 37 Rv and HIV co infection are prone for diminishing the CD4+ T cell population which is known as cell mediated arm of immunity and this co-infection is now widespread in all over the world [6]. Persons with this co-infection are lessening host cell survival due to multifaceted nature of both these bacterium [7]. Thus, as to figure out problem of the pathogenesis of this bacterium, we have to figure out important aspects of this bacterium that may play significant role in its survival inside host cell and avoid many host immunological barrier [8]. To grasp trouble pathogenesis, it is imperative to portray significant highlights of M. tuberculosis H 37 Rv that sanction it to evade the host barrier framework and add to its destructiveness [9]. Many previous studies show the importance of GTP binding and hydrolyzing genes in the survival of the many prokaryotes as well as eukaryotes.
GTP related genes are most significant molecules in various signaling mechanism. As Guanosine Triphosphatases (GTPases) are known to accept a basic part in the survival and bind of various pathogens so accordingly the qualities which tie to GTP likewise have an imperative part in its survival inside the host macrophages [10,11]. GTPases are generally called sub-nuclear switch proteins [12][13][14]. The key piece of these proteins incorporates restriction in phagosomes improvement, enabling pathogens to get shielded from making tracks in an opposite direction from lysosomes and unsafe free radicals prompted as inborn invulnerable reactions of the host after disease by this bacterium. This discernment gives another path to the advance of threatening to TB drugs [15,16]. In a past couple of years, an expansive work has been done to understand the piece of GTPases in the improvement and advancement of organisms to make them not vulnerable to immune system of the host [17,18]. GTP-restricting proteins (G-proteins) are much monitored signaling substance that takes an interest in cell signaling and bacterial pathogenesis by controlling the movement of related GTPases [19,21]. These proteins especially attach and hydrolyze GTP, which in this way orders or inactivates the GTPases consistently GTPases are particularly checked and work through RNA or ribosome legitimate. G1, G2, G3 and G4 subjects are responsible for specific participation with the guanine nucleotide and effectors proteins.
The underlying two segments are related with interchanges with the phosphate part of the GTP iota and the last segment is locked in with nucleotide specificity [22][23][24]. The accord grouping contains three concord sequence, GXXXXGK, DXXG and NKXD. According to previously reported literatures this protein acts as Signal Recognition Particle (SRP) in combination with Rv2916c (ffh) [25]. The both genes act as a unit to introduce some cytoplasmic protein across plasma membrane or some other cell organelle. There is an assortment of major pathways have been involved in the protein exporting system of M. tuberculosis H 37 Rv like comprehensive Secretion (Sec) pathway, Twin-Arginine Translocation (Tat) pathway and some inconsequential pathways involved in ESAT-6 secretion system (Esx) and SRP pathway [26]. General secretion pathway system cooperates with post translational secretion of proteins whereas SRP pathways involved in co translational exporting of proteins [27]. SRP is a cytoplasmic ribonucleoprotein and is well conserved in eukaryotes and prokaryotes with somewhat varying composition. The all-around contemplated Escherichia coli (E. coli) SRP framework involves 4.5S RNA, ffh (SRP54) and ftsY (SRα), while the greater part of the bacterial SRP pathways comprise of just a single protein, SRP54 and a 4.5S RNA particle. The M. tuberculosis H 37 Rv SRP pathway comprises of two proteins ffh, ftsY and an RNA subunit 4.5S RNA. The said protein is an important part of protein export system which is an essential process for importing and exporting protein that is synthesized in the numerous organelles. Thus, this protein might also be important for pathogenesis.
Hence in this literature, author wants to describe some Insilico aspects about a GTP binding protein known as Rv2921c (ftsY) of M. tuberculosis H 37 Rv. The study comprises of comparison of sequences of ftsY gene of M. tuberculosis H 37 Rv with the other species of the bacterium, search of various interactive partners, phosphorylation capacity and mutation study [28]. As we noted above that ftsY contains GTP binding and hydrolyzing properties which is also important for its activity therefore by mutating specific amino acid of the motif should change the ability of the GTP binding and hydrolyzing properties which thus affects secretion. Also, mutation should also affect its interaction with ffh gene. Therefore, this study gives us the initial steps for proofing the actual mechanism of secretion system of SRP and thus might help in treatment procedure as shown in Table 1.

Retrieval of Protein Sequence Database
Mycobrowser database has been used for retrieving sequence (gene and protein) of ftsY (Rv2921c) gene [29]. This ftsY (Rv2921c) protein sequence has been disentangled in FASTA format ftsY is predicted to be involved in insertion and reception of various proteins in the outer side of cell membrane and contains ATP/GTP binding motif. MUSCLE online server for multiple sequence alignment approach for ftsY protein analysis. MUSCLE remains for Multiple Sequence Comparison by Log-Expectation it is professed to accomplish both better normal exactness and preferred speed over ClustalW2 or T-Coffee, contingent upon the picked choices [30,31].

Interaction Study by String
STRING database server is utilized for demonstrating the protein-protein association between two or more entities. In the cell cytoplasm, a protein may collaborate with different proteins and work in the web-like manner. The associations incorporate direct (physical) and aberrant (functional) interactions; They branch from computational expectation, from arrangement pass on among life forms and from connections collected from other primary databases [32]. The total number of interactions for each dataset had been kept in STRING and measure somewhere in the range of 0 and 1. The principle of its working is as the score is <0.4 means low interaction, score is between 0.4 to 0.7 means medium interaction and score is >0.7 is high interaction [33].

Prediction of Sub Cellular Localization
Protein Subcellular Localization and prediction confined protein destinations. TBpred is a subcellular localization prediction method for mycobacterial proteins depend on support vector machine learning (profile portion SVM) to foresee the local subcellular sites. Several parameters may be tuned for their appropriate values to get optimum results. The nth SVM model learns from nth class samples with positive labels and rest other samples with negative labels. Prediction of an unknown sample is based upon the maximum score out of four scores, generated by four models specific to four different subcellular compartments [34].

Prediction of Phosphorylation site
DEPP (Disorder Enhanced Phosphorylation Predictor) server evaluates the number of phosphorylated serine, threonine and tyrosine site. This server is depending on the Support Vector Machines prepared on succession profiles improved by data from the spatial setting of tentatively distinguished P-locales. DEPP server predicted the phosphorylated sites of serine, threonine and tyrosine with accurate score. DEPP server is able for predicting phosphorylation by the serine kinases PKA, PKC, MAPK, CKII and by the tyrosine kinases SRC. The nature of expectations is incredibly reliant on the nature of submitted protein structures. Incorrect or inadequate protein structures may prompt wrong forecasts [35,36].

Prediction of B-cell and T-cell epitopes
The prediction of (B & T-cell) epitopes found in ftsY (Rv2921c) protein was done by various online bioinformatics tools. B-cells epitopes prediction was done by using BCPREDS server, T-cell epitopes prediction by the ABCpred server and (MHC-Class II Binding Peptide Prediction) is done by ProPred tool [37][38][39][40].

Model Building
Structure modeling of ftsY protein was done by using Iterative Threading Assembly Refinement (I-TASSER). I-TASSER is utilized for the protein structure and function expectation [41][42][43][44]. I-TASSER needs the FASTA prearranged sequence of protein and assembles the 3D model of protein by Ab Initio display approach. I-TASSER server is an online stage for protein structure and functions forecasts [45]. I-TASSER pursues three phases to envision the 3D model of the protein. For advance illustration of the secondary structure of the protein, this tool secretly introduced Local Meta Threading Server (LOMETS) which uses H, E and C articulate for alpha-helix, beta-sheet and curl respectively. In I-TASSER server there are also predicted the desired dynamic binding sites of our selected protein were anticipated by COACH online server. Prior to dynamic site-specific docking, the affirmation of actual binding pocket is important. The binding pocket is the site of protein where the ligand interacts reversibly or irreversibly [46]. COACH server is a metaserver, it starts from given structure of target protein; At that point it will make correlative ligand restricting site forecast using the two relative systems, TM-site and S-site, which perceive ligand restricting plan from the database (BioLiP) protein work database by restricting specific substructure and collection profile examination. In the COACH server, yield has positioned top 10 displays by the bunch measure, given C-score, PDB hit, ligand name, complex structure download and agreement restricting buildup. Range estimations of C-score prediction lie somewhere in the range of 0 and 1, where the most noteworthy score demonstrate greater unwavering quality [47].

Model Validation
The evaluation of approval of the created protein structure has been completed by online server RAMPAGE (Ramachandran Plot Analysis). The RAMPAGE server endorses the protein structure on the hypothesis of φ, ψ purpose of individual stores [48][49][50]. The approval of protein was performed by the structure Analysis and

Mutational Analysis of The Protein
In the Mutational assessment of ftsY (Rv2921c) had been finished by via I-MUTANT 3.0 suite server. For studying of all the energy proteins, thinking about analysis of protein quality, free vitality change (ΔΔG) upon single point transformations may in engage the clearing up of process. The vast majority of the ΔΔG esteem is about zero (around 32% of the ΔΔG edifying record ranges -0.5 kcal/ mole) and both the consideration and suggestion of ΔΔG might be either positive or negative for a similar change blurring the relationship among imprecise and expected ΔΔG esteem. Remembering the last goal to vanquish this issue, we describe another pointer that confines among the three change classes: destabilizing changes (ΔΔG<−1.0 kcal/mol), counter balancing changes (ΔΔG>1.0 kcal/mole), and impartial changes (−1.0≤ΔΔG≤1.0 kcal/mole). For the I-MUTANT 3.0 suite score, DDG<-0.5 means (extensive decline of stability), DDG>0.5 infers (increment of stability) and -0.5<=DDG<=0.5 means (impartial stability) [53,54]. For the figure of protein, consistency change upon a singular point alteration was foreseen by I-MUTANT 3.0 Suite server.

Molecular Docking
Docking has been studied for confirming protein-protein and protein-DNA interaction analysis. Complexes formed after docking were visualized by the PyMOL. Molecular docking of ffh (Rv2916c) with wild type and mutant ftsY protein revealed the variation of binding energies, formation of hydrogen bonds and their distances. Docking of the two proteins we have been used HDOCK (http:// hdock.phys.hust.edu.cn/) docking server [55]. This server takes input in form of either FASTA sequence or PDB structure. This feature of taken up FASTA sequence makes this server easily available for new comers. The output of the server contains docking score and RMSD value. The lower the RMSD value stronger is the association. HDOCK predicted the protein-protein and protein-DNA/RNA docking. The interactions of the protein-protein and protein-DNA/RNA play an essential role in the assortment of biological process. For the docking, HDOCK is the novel tool for hybrid docking algorithm of template-based modeling and free docking, in which cases with the misleading templates it can be rescued by the free docking protocol. The docking process is fast and consumes about 10-20 min for a docking run [56,57]. HDOCK server performance will become enhanced when more predictions were considered.

Retrieval of Target Protein Sequence
The genomic and proteomic sequences for the gene ftsY (Rv2921c) have been retrieved in FASTA format from Mycobrowser database. The gene is 1269bp long and encodes protein of 43kDa. The predicted function of this protein is involving in reception and insertion of the protein at membrane site. Probably it functions as Signal Recognition Particle (SRP). The protein sequence contains four GTP binding motif (DXXG).

Protein-Protein Interaction
For the prediction of protein-protein interaction of ftsY protein is involved with the secretion system, involved the insertion of promising membrane proteins and it is in cytoplasmic membrane. The ftsY protein acts as a receptor for the complex formed by the signal recognition particle and the ribosome-nascent chain. The prediction of ftsY functional partner by STRING are the ffh, rplJ, rplQ, Rv3362c, rplK, rplR, rplS, rpmA, rplW, rpmG1 proteins due to the decreasing order of the String server version 10.5 score. The interactions score of ftsY and ffh interaction are very high 0.995 and both gene having same function and act as SRP protein Figure 1.

Multiple Sequence Alignment
Multiple sequence alignment of the protein ftsY (Rv2921c) of M. tuberculosis with other species of the bacterium like Mycocaterium bovis, Mycocaterium marinum, Mycocaterium smegmatis, Mycocaterium leprae has been done by using MUSCLE online server. After putting sequence in FASTA format for all five proteins the server aligns sequence and gives the perfect matches. The percent identity matrix proves its identity among all species. The result also shows the presence of consensus sequence DXXG in all sequences ( Figure 2). The presence of the motif in all species is might be an indicator for its importance of the process in the survival of these prokaryotes.
. Figure 2: Protein-protein interaction study by the String server version 10.5: Interaction study of ftsY gene has been check by STRING database server which found that this gene interacts with several other genes including ffh gene which is also involved with co translational secretion system and Rv3362c which is probable GTP binding proteins.

Prediction of Sub Cellular Localization
Prediction of the subcellular localization of a protein by using the TBpred server the length of this gene is 422 amino acid residues and the selected approach are Dipeptide composition based SVM. There are different class wise scores from SVM models like cytoplasmic protein, integral membrane protein, secretory protein and protein attached to membrane by lipid anchor. At last, this TBpred server finally predicts that ftsY protein is localized in membrane as an integral membrane protein. DEPP (Disorder Enhanced Phosphorylation Predictor) server evaluates the number of phosphorylated serine, threonine and tyrosine site. Notwithstanding serine, threonine and tyrosine result shows that in ftsY protein there were 8 phosphorylated serine residues out of 18 residues, 9 phosphorylated threonine residues out of 25 residue and 1 phosphorylated tyrosine residue out of 2 residues. In DEPP statistic score are 44.44% are phosphorylated serine, 36% are phosphorylated threonine and 50% tyrosine, phosphorylation prediction also shown in Figure 3. The nature of expectations is incredibly reliant on the nature of submitted protein structures. Incorrect or inadequate protein structures may prompt wrong predictions.

Prediction of B-Cell and T-Cell Epitopes
The prediction of B-cell epitopes by ABCpred server which has predicted the epitopes on this gene taking the overlapping window of 14 amino acids that consequences in the preeminent possibility of the score is 0.90 from the residues region "SVLLVVGVNGTGKT" that starts at 224 th position as shown in supplementary Figure 4a. The BCPREDS server are B-cell epitopes prediction which have shows two types prediction as Fixed length epitopes prediction by BCPred and flexible length epitopes prediction by FBCpred which have been shown in (Figure 4b) with the set specificity at 90% and epitopes length set on 14. B cell epitopes prediction fixed length method are predicted on residues 278 th -291 th , 165 th -178 th , and 114 th -127 th and for Flexible length epitopes prediction residues committed are as 282 th -295 th , 109 th -122 nd , 165 th -178 th and 330 th -343 th in the ftsY protein. For the T-Cell epitopes prediction there is multiple DR-β1 (DRB) alleles were used like HLA-DRB1*0101, HLA-DRB1*0102, HLA-DRB1*0301 which is the T-cell epitopes prediction for the prediction with the MHC Class-II binding region in the antigenic protein sequence of ftsY protein. The predicted binder was visualizing in graphical peak interface as well as in color residue in an HTML interface. Two consensus epitopes were LVVIAALTL in (DRB1_0101) at position 15 th -23 rd in (DRB1_0102) epitopes are present were -LWIATAVIA and LVVIAALTL, at the 5 th -13 th and 15 th -23 rd and in (DRB1_0301) VVIAALTLG residue are present 16 th -24 th position. In MHC Class-II binding peptide prediction result there are individual alleles at 1% threshold as shown in T-cell epitopes prediction. While at 3% threshold it gives another residue,(not mentioned in the article). For T-cell epitopes prediction result are shown in Figure 4c.

Model Building
The structural modeling of the wild type and mutant ftsY (Rv2921c) protein and its interactive partner ffh (Rv2916c) protein were prepared from I-TASSER. The quality of modelled protein depends upon the percentage of the favorable region lies above 90% of the value of C-score and RMSD value. The C-score is the confidence score for each model. It is computed by threading layouts arrangement. The C-score changes inside the range from -5 to 2 and higher certainty show is controlled by the higher estimation of C-score. At long last, I-TASSER creates top 5 models according to C score and positioned by group measure among which the figure with higher C score. For protein modeling we modeled the ffh (Rv2916c), ftsY (Rv2921c), mutant ftsY (D45) in this protein modeling we mutate 45 number residue aspartates into alanine, mutant ftsY (D71) residue, mutant ftsY (D312) residue and mutant ftsY (D367) residues as shown in Figure 5.

Model Validation
After modelling of structure, the protein structure was validated through and SAVES server (RAMPAGE, ERRAT and Verify3D). The demonstrated protein was validated by RAMPAGE (Ramachandran plot investigation) which is an online server. After examination of Ramachandran plot of our proteins, the structure demonstrated that have been present in a favored region. Although, other residues were laid in the allowed region and number of residues were laid in outlier region. These parameters of protein structure demonstrating that our displayed protein was of good quality stable and adequate. ERRAT is an online server which approves the protein structure on the premise of the nuclear connection between various sorts of atoms. The ERRAT analysis shows overall quality factor of our model protein is good and satisfactory. The Verify3D strategy evaluates protein structure by utilizing three-dimensional profiles. This program examines the similarity of a nuclear model (3D) with its own amino acid sequence which is 1 dimensional. Every deposit is doled out a basic class in radiance of its area and condition (alpha, beta, circle, polar, non-polar and so on). The score ranges from -1 (poor score) to +1 (great score). 82.29 -95.25% of the buildup had a found the middle value of 3D-1D score >=0.2 that is perceptive for our demonstrated protein result. is pass as shown in Table 2 Verify 3D result as shown in The protein model ftsY (Rv2921c) validation which is done by SAVES metaserver results convoluted that all of the protein model build are good and satisfactory which is shown in Table 2.

Mutational Analysis
In ftsY (Rv2921c) protein had been characterized as a GTP binding protein which has consensus binding sequence DXXG which promote the link sub sites for binding in Mg 2+ and -phosphate of GTP. In the consensus sequence DXXG, aspartate is crucial for GTP binding and hydrolyzing activity; Therefore we study the prediction of stability change upon the single point protein mutation by I-Mutant suite. Earlier studies have been shown that for GTP binding protein, when aspartate mutate with the alanine then the functional protein may loss the function. So, we mutate the DXXG aspartate into the Alanine for predicting the stability of the protein after change the single amino acid. In I-Mutant suite studies, we have seen that there are four consensus sequences in this protein on residue 45, 71, 312 and 367. After the analysis of predicted score by I-Mutant we have found that large stability decreased on the 45-position residue because the parameter of stability decreasing value is when -0.5 or below value towards negative and our outcome result is -1.39 which clearly shows at the position on 45 there will largely decreasing value are shown in Table 3.

Molecular Docking
In the docking content, we here doing protein-protein docking for knowing the interaction changes upon mutation of important residue aspartate involved in GTP binding and hydrolyzing motif in the protein. Aspartate would be mutating with alanine. The wild type ffh protein dock with wild type ftsY as a control value for our study for the experimental study like wild type ffh protein dock with mutant ftsY (D45A), ftsY (D71A) , ftsY (D312A) , ftsY (D367A) to identify change in binding patterns using HDOCK docking server [14,15] as shown in Figure 6. The binding energy and formation of hydrogen bonds to each molecule by the drug were calculated and the RMSD value was 0.38 the highest for D312 are seen in Table 4.

Discussion
In the current circumstance, we can see that there no defensive and healing treatment to destroy tuberculosis totally aside from BCG. Past many years of research as of now demonstrates that BCG gives constrained insurance against tuberculosis yet Fails in securing MDR, TDR and XDR instances of tuberculosis. There is a persistent exertion has been put by researchers with the end goal to build the adequacy of the antibody and in looking for new medication targets. GTP binding genes come out to be as novel targets for treatment of this disease [58][59]. GTP binding and hydrolyzing protein ftsY (Rv2921c) has been proved for possessing the same activity in many studies, this gene strongly interacts with ffh gene (Rv2916c) with score of 0.995 [32]. At the other hand, ftsY protein is also interacts with another predicted partner Rv3362 which also possess GTP binding activity [33]. Multiple sequence alignment result shows that this gene is universally present in all species of this bacterium with having some minute differences. All species contains same GTP binding motif DXXG at same location [30,31]. This protein is predicted to be present as an integral membrane protein [34]. Phosphorylation site prediction gives us result that this protein phosphorylated at serine, threonine and tyrosine [35,36]. The prediction of (B and T cell epitopes prediction) has been done by using ABCpred, BCpred, FBCPred and ProPred server [38][39][40]. 3D model of this protein has been made by I-TASSER server [43,44] and validated by SAVES server [51,52]. Mutational analysis has been done by I-Mutant 3.0 server and it shows that mutation on 45 residue aspartates with alanine at 25°C temperature and pH 7 noted the larger decrease in stability [53,54]. Molecular docking results of these proteins before and after mutation prove that interaction between these two proteins decrease maximal at position 312 residue with having value of 0.38 [56]. In summarizing our work we can narrate the essentiality of ftsY gene in secretion process and thus in pathogenesis of this bacterium. The two above mentioned proteins work in cooperative manner and produce a summative effect. The two proteins is key Regulator for the co-translational secretion system and thus might also play an important role in pathogenesis of the bacterium.

Conclusion
We need an emergent step to stop TB epidemic all around the world. This study concludes with knowledge of some important aspects of ftsY gene and its interaction with ffh gene. This interaction is very much important for one of the secretory pathways assembled in prokaryotes. Although the experiments enlisted in this manuscript are not enough, but these experiments provide us with better knowledge and initial steps in further in vitro and in vivo experimental works.