Prediction of protein function is one of the key tasks of bioinformatics. The prediction of protein function is dependent on secondary structure and their amino acid sequence. Further, use of bioinformatics has revealed construction blocks of several proteins in recent years. The construction of 3D protein structure and prediction of biological function is increasing day by day. The region of protein actually interacted and involved in biological function is called domain/motifs. Therefore, prediction of protein structure is highly useful. However, it gives additional trouble to save protein structures and their features based on domain/motifs.
Usually most of the signaling transduction pathways use protein kinases to phosphorylate target protein by phosphorylation process. However, how protein kinases are able to achieve specificity needed to control several cellular functions remains a significant question . Nonetheless, many distinct stimuli can activate the protein kinase cascade which phosphorylates versatile molecule ERK and controls multiple cellular signals . Moreover, it is still uncertain how single ERK molecule is evolved to control comprehensive arrays of cellular and molecular mechanism. One of the possibilities is the recruitment of ERK interacting proteins. After interaction with these proteins, ERK controls array of cellular activity like cell proliferation, cell migration, survival, neuronal growth and neurotransmission. In addition, ERK dependent pathway is influenced by various stimulations such as growth, cytokines, viruses, second messengers, carcinogens and transforming agents  .
The protein kinases family is huge with lots of variation. However, a very few differ in size, sub cellular distribution and mode of action. The catalytic accuracy of protein kinases is very critical as they work as regulatory molecules for biological functions. Further, domain/motif of a protein serves as actual site of interaction for the manifestation of a specific biological function. Hence prediction of domain/motif or sites of proteins has been done using bioinformatics tools over the past decade. The prediction of domain/motif actually helps researchers to recognize proteins binding sites, secondary structures, 3D structure and building blocks . However, there are only a few techniques which are used for general investigation of significant motifs in protein structures .
A protein attains various conformations to perform diverse functions. Further, different conformations of proteins actually determine possibility of specific motifs leading to several functions. Further, in silico identification of ERK interacting proteins and physicochemical analysis was reported earlier  (Table 1). These were namely; ephrin type-B receptor 2 isoform 2 precursor (EPHB2), mitogen-activated protein kinase 1 (MAPK1), interleukin-17 receptor D precursor (IL17RD), WD repeat domain containing 83 (WDR83), tescalcin (Tesc), mitogen-activated protein kinase kinase kinase 4 (MAPP3K4), kinase suppressor of Ras2 (KSR2), mitogen-activated protein kinase kinase 6 (MAP3K6), UL16 binding protein 2 (ULBP2), UL16 binding protein 1 (ULBP1), dual specificity phosphatase 14 (DUSP14), dual specificity phosphatase 6 (DUSP6), hyaluronan-mediated motility receptor (RHAMM), kinase D interacting substrate of 220 kDa (KININS220), membrane-associated guanylate kinase (MAGI3), phosphoprotein enriched in astrocytes 15(PEA15), typtophenyl-tRNA synthetase,
Table 1. ERK interacting proteins.
cytoplasmic (WARS), dual specificity phosphatase 9 (DUSP9), mitogen-activated protein kinase kinase kinase 1 (MAP3K1), UL16 binding protein 3 (ULBP3), SLAM family member 7 isoform a precursor (SLAMMF7) and mitogen activated protein kinase kinase kinase 11 (MAP3K11). Majority of these are kinase and interact with ERK. Hence, they may contain similar secondary structures and domain/motifs. Moreover, no study of secondary structure and domain/motif is available about ERK interacting proteins. Hence, it promotes us to identify secondary structure and prediction of domain/motif of ERK interacting proteins using in silico. Such studies are useful to understand ERK dependent pathways and helpful to design engineered molecules for regulating them in disease situation.
2. Materials and Methods
For identification of secondary structure of ERK interacting proteins were retrieved in earlier study (Table 1) . Thereafter, secondary structure of these protein were done by Self-Optimized Prediction Method with Alignment (SOPMA) (https://npsa-prabi.ibcp.fr/NPSA/npsa_sopma.html) using aa sequence of protein in FASTA format (Figure 1).
For identification of domain/motif search among ERK interacting protein, motif search, (https://www.genome.jp/tools/motif) a computer-based database was used. There is a choice of computer-based application available for identification of motifs. These were most frequently characterized by a single function namely prediction of phosphorylation sites . In this paper, PROSITE was used for documentation of motif and domain of proteins. PROSITE is an annotated collection of protein families and domains classifying motif descriptors. PROSITE's motif descriptors are either prosite patterns (ProPat) or prosite profiles (ProPro) received from various alignments of homologous sequences. ProPat are descriptors of qualitative motifs. As the sequence
Figure 1. Schematic representation of procedure used for identification of secondary structure of ERK interacting proteins and prediction of their domain/motif.
of biologically significant motifs is evolutionarily preserved, a multiple alignment of them can be decreased to an expression of agreement called regular expression or pattern. Every location of such a pattern can be taken up by any residue from a given set of acceptable substances, and a variable number of times within a specified spectrum for variable number of times within a specified spectrum. In addition, ProPro is a quantitative protein domain demonstration. ProPat is more complicated. Usually the ProPro corresponds with the protein domain .
3. Results and Discussion
In this study, out of 22 ERK interacting proteins; 12 proteins namely; EPHB2, MAPK1, WDR83, TESC, MAP3K4, DUSP14, KSR2, MAP3K6, MAGI3, WARS, MAP3K1 and MAP3K11 showed both ProPat and ProPro. The highest number of ProPat was found in EPHB2. Further, 6 proteins namely; IL17RD, DUSP6, KIDINS220, PEA15, DUSP9 and SLAMF7 interacting proteins showed only ProPro. However, remaining 4 ERK interacting proteins namely; ULBP2, ULBP1, RHAMM and ULBP3 do not show both ProPat and ProPro. Among above mentioned 12 proteins, 6 interacting proteins viz, MAPK1, MAP3K4, KSR2, MAP3K6, MAP3K1 and MAP3K11 showed serine/threonine kinases active site signature and protein kinase ATP binding region signature domain commonly in their pattern whereas remaining namely EPHB2, WDR83, TESC, DUSP14, MAGI3 and WARS showed different protein kinase protein family domain pattern. In addition, in case of ProPro 7 proteins namely; EPHB2, MAPK1, MAP3K4, KSR2, MAP3K6, MAP3K1 and MAP3K11, indicated protein kinase domain profile commonly and other 3 proteins namely; DUSP9, DUSP14 and DUSP6 represented commonly dual specificity protein phosphatase family profile and tyrosine specific protein phosphatase family profile. The details of ProPat and Pro Pro are listed in Table 3 and Figures 1-10 (Supplementary materials).
Figure 2. Left panel-No of motifs of ephrin type B receptor 2 isoform 2 precursor; Right panel-No of motifs of mitogen-activated protein kinase 1.
Figure 3. Left upper panel-No of motifs of interleukin-17 receptor D precursor; Left lower panel-No of motifs of WD repeat domain-containing protein 83 Right panel-No of motifs of tescalcin.
Figure 4. Left panel-No of motifs of mitogen activated protein kinase kinase kinase 4; Right panel: no of motifs of kinase suppressor of ras 2.
Figure 5. Left panel-No of motifs of mitogen-activated protein kinase kinase 6; Right upper panel-No of motifs of UL16 binding protein 2; Right lower panel-No of motifs of UL16 binding protein 1.
Figure 6. Left panel-No of motifs of dual specificity phosphatase 14; Right panel-No of motifs of dual specificity phosphatase.
Figure 7. Left upper panel-No of motifs of hyaluronan-mediated motility receptor; Left lower panel-No of motifs of kinase D-interacting substrate of 220 kDa Right panel-No of motifs of membrane associated guanylate kinase.
Figure 8. Left upper panel-No of motifs of phosphoprotein enriched in astrocytes 15; Left lower panel -No of motifs of tryptophanyl-tRNA synthetase, cytoplasmic Right panel-No of motifs of dual specificity phosphatase 9.
Figure 9. Left panel: No of motifs of mitogen-activated protein kinase kinase kinase 1; Right upper panel-No of motifs of UL16 binding protein 3; Right lower panel-No of motifs of SLAM family 7 isoform a precursor.
Figure 10. No of motifs of mitogen activated protein kinase kinase 11.
The secondary structure of 22 ERK interacting are mentioned in Table 2 and Figures 11-32 (Supplementary materials). Out of 22 ERK interacting proteins 12 proteins namely; EPHB2, IL17RD, KSR2, ULBP1, DUSP6, KIDINS220, MAGI3, DUSP9, MAP3K1, SLAMF7, MAP3K11 showed higher % of random coil. Interestingly, when the dimensions of a number of proteins are compared, variations in the population of residual structure in the denatured state leads to deviations from ideal random coil behavior . Moreover, these types of proteins do not attain conformational changes like secondary structure, tertiary structure and any other conformations . Usually, in this situation, protein structure has been detectable in multidimensional magnetic resonance which depends on particular peptide-peptide communication. Likewise, parts of random coil appear merely
Table 2. Secondary structure data of ERK interacting proteins.
Figure 11. Secondary structure of EPHB2.
Figure 12. Secondary structure of MAPK1.
Figure 13. Secondary structure of IL17RD.
Figure 14. Secondary structure of WDR83.
Figure 15. Secondary structure of TESC.
Figure 16. Secondary structure of MAP3K4.
Figure 17. Secondary structure of KSR2.
Figure 18. Secondary structure of MAP3K6.
Figure 19. Secondary structure of ULBP2.
Figure 20. Secondary structure of ULBP1.
Figure 21. Secondary structure of DUSP14.
Figure 22. Secondary structure of DUSP6.
Figure 23. Secondary structure of RHAMM.
Figure 24. Secondary structure of KININS220.
Figure 25. Secondary structure of MAGI3.
Figure 26. Secondary structure of PEA15.
Figure 27. Secondary structure of WARS.
Figure 28. Secondary structure of DUSP9.
Figure 29. Secondary structure of MAP3K1.
Figure 30. Secondary structure of ULBP3.
Figure 31. Secondary structure of SLAMF7.
Figure 32. Secondary structure of MAP3K11.
as a lack of “electron density” or contrast in the pictures generated in crystallography experiments. Circular dichroism is used to distinguish random coils. Moreover, the complete denaturation of protein lowers the percentage of random coil. We found that ERK interacting proteins were rich in random coil and alpha helix. Hence, ERK interacting proteins are not denatured and may perform various functions . Moreover, 9 proteins namely MAPK1, TESC, MAP3K4, MAP3K6, ULBP2, RHAMM, WARS, ULBP3, PEA15, showed high percentage of alpha helix and WDR83 showed extended strand.
Long-term synaptic plasticity is the consequence of multiple gene expressions. ERK is one of those genes engaged in long-term memory and learning. A cascade of regulative genes has been recruited to particular genes including ERK in the process of long-term learning and memory formation  . Moreover, chromatin remodeling is decisive for expression of genes . ERK has been found to interact with a host of proteins to perform various cellular functions. Therefore, these proteins are important and involved in several diseases/disorders associated to the brain. Further, ERK interacting proteins have been reported to contain higher percentage of hydrophobic amino acid leucine .
Additionally, leucine works as an mTOR activator. mTOR also known as a mammalian target of rapamycin and FK 506-binding protein 12 rapamycin-associated protein 1 (FRAP1). FRAP1 is a kinase and encoded by the mTOR gene. mTOR links with other proteins and serves as a core component of two distinct protein complexes, mTOR complex 1 and mTOR complex 2. These both complexes work as a serine/threonine protein kinase. Being a core component of mTORC 2, mTOR works as a tyrosine protein kinase and promotes activation of insulin receptors as well as insulin like growth factor 1 receptors. In addition, mTOR modulates cell growth, proliferation and neuronal plasticity through protein synthesis regulation . Specifically, mTOR controls the translation of a small segment of mRNAs containing a large secondary structure at their 5' UTR or oligopyrimidine at their 5' ends (TOP mRNAs)  . This kinase can be triggered by distinct extracellular signals and controls protein synthesis at initiation stage primarily by phosphorylation of at least two downstream targets mainly p70S6 kinase (p70S6 K) and eukaryotic initiation factor 4E-binding proteins .
In neurons, mTOR has been reported in the synaptic region and mediates synthesis of locally-translated proteins. Also, mTOR is up-regulated in an activity dependent manner and critical for different forms of synaptic plasticity including long-term potentiation (LTP)  . Nonetheless, little is known about the extracellular signals activated by training and trigger mTOR for protein synthesis regulation during memory consolidation. However, mTOR is required for memory formation in many learning tasks  . BDNF induces rapamycin-sensitive synaptic potentiation   and regulates translation of 5' TOP mRNA encoded proteins at dendrites through an mTOR-dependent pathway    . Taken together, it can be concluded that leucine rich food supplement can be helpful for learning and memory.
In addition, ERK interacting proteins showed higher serine/threonine kinases active site signature and protein kinase ATP binding region signature domain commonly in ProPat. They showed protein kinase domain, dual specificity protein phosphatase and tyrosine specific protein phosphatase family domain in ProPro. Several studies have suggested that learning and memory are mediated by neuronal plasticity, strengthening of LTP , synaptogenesis , modulation of intrinsic excitability  and adult neurogenesis . All the above-mentioned processes may require protein kinases (enzymes that transfer phosphate groups to side chains of particular amino acids (serine/threonine or tyrosine) of target proteins). Moreover, phosphorylation at one or more sites of a protein has been involved to alter the conformation of target proteins affecting their functions or interaction with other proteins . Analysis of the Allen Mouse Brain Atlas suggests that at least 260 kinases are expressed in adult mouse brain. The involvement of these kinases has been implicated in learning and memory. These findings support our results as an ERK interacting proteins showed richness in protein kinase domain. Such studies are helpful to understand ERK mediated cell singling pathways including neurodegenerative diseases.
Table 3. ERK interacting proteins and their motifs.
Majority of the ERK interacting proteins are rich in random coil and alpha helix % (Table 2). Further, most of interacting proteins have serine/threonine kinases active site signature and protein kinase ATP binding region. In addition, few have commonly dual specificity protein phosphatase family and tyrosine specific protein phosphatase domains (Table 3). Such study may be helpful to design engineered molecules for regulating ERK dependent pathways in disease condition.
VP acknowledges Science and Engineering Research Board (SERB), Government of India for providing financial support (SB/YS/LS-200/2013). KK acknowledges SERB for Junior Research Fellowship and Indian Council of Medical Research (ICMR) for Senior Research Fellowship (Letter No. 45/72/l 8-PHA/BMS/OL).
 Biondi, M.R. and Nebreda, R.A. (2003) Signalling Specificity of Ser/Thr Protein Kinases through Docking-Site-Mediated Interactions. Biochemical Journal, 372, 1-13.
 Fantz, D.A., Jacobs, D., Glossip, D. and Kornfeld, K. (2001) Docking Sites on Substrate Proteins Direct Extracellular Signal Regulated Kinase to Phosphorylate Specific Residues. Journal of Biological Chemistry, 276, 27256-27265.
 Kurrey, K. and Paramanik, V. (2018) Identification and Physiochemical Analysis of ERK Interacting Proteins Using Bio-Computational Tools. World Journal of Neuroscience, 8, 303-313.
 Ebisuya, M., Kondoh, K. and Nishida, E. (2005) The Duration, Magnitude and Compartmentalization of ERK MAP Kinase Activity: Mechanisms for Providing Signaling Specificity. Journal of Cell Science, 118, 2997-3002.
 Rajasekaran, S.,Balla, S., Gradie, P., Gryk, M.R., Kadaveru, K., Kundeti, V., et al. (2009) Minimotif Miner 2nd Release: A Database and Web System for Motif Search. Nucleic Acid Research, 37, D185-D190.
 Sigrist, C.J., Cerutti, L., Hulo, N., Gattiker, A., Falquet, L., Pagni, M., et al. (2002) PROSITE: A Documented Database Using Patterns and Profiles as Motif Descriptors. Briefings in Bioinformatics, 3, 265-274.
 Kohn, J.E., Millett, I.S., Jacob, J., Zagrovic, B., Zagrovic, B., Dillon, T.M., Cingel, N., et al. (2004) Random-Coil Behavior and the Dimensions of Chemically Unfolded Proteins. Proceedings of the National Academy of Sciences of the United States of America, 101, 12491-112496.
 Taubenfeld, M.S., Milekic, H.M., Monti, B. and Alberini, M.C. (2001) The Consolidation of New but Not Reactivated Memory Requires Hippocampal C/EBPβ. Nature Neuroscience, 4, 813-818.
 Ruvinsky, I. and Meyuhas, O. (2006) Ribosomal Protein S6 Phosphorylation: From Protein Synthesis to Cell Size. Trends in Biochemical Sciences, 31, 342-348.
 Slipczuk, L., Bekinschtein, P., Katche, C., Cammarota, M.I., Izquierdo, I. and Medina, J.H. (2009). BDNF Activates mTOR to Regulate GluR1 Expression Required for Memory Formation. PLoS ONE, 4, e6007.
 Costa-Mattioli, M., Sossin, W.S., Klann, E. and Sonenberg, N. (2009) Translational Control of Long-Lasting Synaptic Plasticity and Memory. Neuron, 61, 10-26.
 Tang, S.J., Reis, G., Kang, H., Gingras, A., Sonenberg, N. and Schuman, E.M. (2002) A Rapamycin-Sensitive Signaling Pathway Contributes to Long-Term Synaptic Plasticity in the Hippocampus. Proceedings of the National Academy of Sciences of the United States of America, 99, 467-472.
 Cammalleri, M., Lutjens, R., Berton, F., King, R.A., Simpson, C., Francesconi, W., et al. (2003) Time-Restricted Role for Dendritic Activation of the mTOR-p70S6K Pathway in the Induction of Late-Phase Long-Term Potentiation in the CA1. Proceedings of the National Academy of Sciences of the United States of America, 100, 14368-14373.
 Bekinschtein, P., Katche, C., Slipczuk, L.N., Igaz, M.L., Cammarota, M., Izquierdo, I., et al. (2007) TOR Signaling in the Hippocampus Is Necessary for Memory Formation. Neurobiology of Learning and Memory, 87, 303-307.
 Lu, Y., Christian, K. and Lu, B. (2008) BDNF: A Key Regulator for Protein Synthesis-Dependent LTP and Long-Term Memory? Neurobiology of Learning and Memory, 89, 312-323.
 Takei, N., Inamura, N., Kawamura, M., Namba, H., Hara, K., Yonezawa, K., et al. (2004) Brain-Derived Neurotrophic Factor Induces Mammalian Target of Rapamycin-Dependent Local Activation of Translation Machinery and Protein Synthesis in Neuronal Dendrites. Journal of Neuroscience, 24, 9760-9769.
 Giese, K.P. (2012) Long-Term Potentiation and Memory. In: Giese, K.P., Ed., Memory Mechanism and Health Disease: Mechanistic Basis of Memory, World Scientific, Singapore, 1-17.
 Stewart, M.G. and Popov, V.I. (2012) Structural Synaptic and Dendritic Spine Plasticity in the Hippocampus. In: Giese, K.P., Ed., Memory Mechanisms in Health and Disease, World Scientific Publishing, Singapore, 19-52.
 Papoutsi, A., Sidiropoulou, K., Cutsuridis, V. and Poirazi, P. (2013) Induction and Modulation of Persistent Activity in a Layer V PFC Microcircuit Model. Frontiers of Neural Circuits, 7, 161.
 Stone, S.S.D., Frankland, P.W. (2012) Adult Hippocampal Neurogenesis and Memory. In: Giese, K.P., Ed., Memory Mechanisms in Health and Disease, World Scientific Publishing, Singapore, 81-146.
 Kim, S.Y., Hwang, G.S., Han, H. and Cho, M. (2011) Phosphorylation Alters Backbone Conformational Preferences of Serine and Threonine Peptides. Proteins: Structure, Function, and Bioinformatics, 79, 3155-3165.