Genetic Annotation of BZIP Transcription Factor Family Genes in Soybean by Using Different Soft Wares

Transcription Factors play a vital role in almost all the biological processes. After the widespread sequencing...

or recruit the RNA polymerases access to DNA template [1].
Transcription factors are commonly defined as the sequencespecific DNA-binding proteins which can either repress or activate the transcription process. Now, it has been investigated that the genome of plants encodes more transcription factors than the genome of animals, which is a clear indicator of the fact that the complex nature of transcriptional regulation in plants is at par with that of the animals [2] as nearly every biological process is directly influenced or regulated by these transcription factors [3]. This fact is evident through example that there is a specific class of transcription factors called "general transcription factors" and without these factors, the process of transcription will not occur in eukaryotes [4].
Studies revealed that the transcription factors have been precisely involved in the process of cell development, division, differentiation and migration. The transcription factors of Arabidopsis thaliana have been deeply studied since its genome is being sequenced as a model specie [3,5,6]. After the widespread sequencing of genomic and cDNA, it was brought under notice that the legumes encode more than 2,000 transcription factors per genome. Despite of the fact that these transcription factors play pivotal roles in legume revolution, plant development and differentiation, still less than 1% of these factors have been completely characterized genetically.

Review of Literature
The use of Bioinformatics approaches has been contributory in identification of the putative Transcription factors in the plants.
Transcription factors are commonly defined by using the DNAbinding domain types which are held by the proteins in a family while the putative transcription factor genes are identified mainly due to DNA sequences present inside of the genes which encode the known DNA binding [2,13,14]. Wolfong et al. in 1997 were successful in isolating a cDNA which encodes a novel BZIP protein Liao et al. [15], in 2008 studied 131 BZIP type of transcription factor genes in soybean. They performed the expression analysis of these genes against different stresses, and then from all of the genes, three (03) genes were additionally investigated in relation to stress tolerance, transcriptional activation and DNA-binding specificity.
Their results indicated that those transgenic Arabidopsis plants which over express the three additionally investigated genes were less sensitive towards ABA but were more tolerant to the freezing and salt stresses. Soybean uni genes were analyzed by Tian et al. [17] in 2004through EST assembly and as result more than 1,000 transcription factor genes were identified. Arabidopsis possess at least four times as many BZIP genes as worm, yeast and humans [2]. Through molecular and genetic studies of the few of these Arabidopsis thaliana BZIP genes, it is evident that they regulate varied biological processes, for example, stress and light signaling, pathogen defense, flower development and seed maturation. data [28]. The genes of BZIP family downloaded from Soy KB, used for the analysis are shown in Table 1.  Figure 1 According to Rzhetsky and Nei [29] the history of evolution was derived by using the Minimum Evolution method.
Here the sum of branch length of an optimal tree is demonstrated as 175.29000652. Now, like those of the evolutionary distances which are used to infer the phylogenetic tree, similarly a tree is sketched to a scale having branches of the same length units.

Moreover, evolutionary distances are computed by using maximum
Composite Likelihood method [30] and positions were in total present in the final dataset while MEGA-X was used for computing the Evolutionary analyses [33]. Figure 2 Here, the Maximum Likelihood method and the Tamura-Nei model were used to derive the evolutionary history [34]. The tree carrying the highest log likelihood (-219250.10)

Use of Maximum Likelihood Method for Evolutionary Analysis
is depicted above. Neighbor-Join and Bio NJ algorithms were    the Evolutionary analyses [33]. Figure 5 Here, the UPGMA method was used to derive the evolutionary history [35]. positions were in total present in the final dataset while MEGA-X was used for computing the Evolutionary analyses [33].  positions were in total present in the final dataset while MEGA-X was used for computing the Evolutionary analyses [33].

Conclusion
Transcription factors along with the addition of RNA polymerase complex physically interact along with other proteins to cause variations in the gene transcription [36]. Now, the exact structures of these complexes are still unidentified for most of the plant genes, even though this knowledge is like a precondition to understand the combining control of transcription [37]. Lastly, the genes network being regulated by a "single" transcription factor and its allies are the small part of a greater genetic regulatory network which