Systematic identification and quantification of substrate specificity determinants in human protein kinases = Identificación y cuantificación sistemática de determinantes de la especificidad por sustrato en las proteínas quinasas de humano

Author

Alonso Tarajano, Manuel Alejandro

Director

Aloy, Patrick, 1972-

Mosca, Roberto

Date of defense

2013-10-24

Legal Deposit

B. 26354-2013

Pages

153 p.



Department/Institute

Universitat de Barcelona. Departament de Bioquímica i Biologia Molecular (Farmàcia)

Abstract

In humans, 518 protein kinases have been reported, and many of them are involved in multiple cellular processes and in important pathologies. Kinases differ in their specificity for the sequences they phosphorylate, and it has been shown that their in vivo specificity is guided by several elements: the sequence surrounding the phosphorylated amino acid in the substrate, co-localization with the substrate, interactions mediated by docking sites, and the association of kinases with adaptor or scaffold (AS) proteins. The objective of this thesis has been to identify and quantify the contribution to specificity of i) the phosphorylated site and its surrounding residues in sequence and ii) the association of kinases with AS proteins. By integrating data from public resources we compiled a set of kinase-phosphorylation site associations in human covering 325 kinases (62.7%), 1856 substrates and 5946 phosphorylation sites. We used sequence logos to represent the phosphorylation motifs recognized by the kinases in our set, and we used these logos to guide the classification of kinases according to the residue composition of the sequences surrounding the phosphorylation site. We used position-specific scoring matrices (PSSMs) as the probabilistic representation of the sequences phosphorylated by the kinases. Based on the score in the PSSMs relative to the phospho-acceptor amino acid, we classified several residues as specificity-determinant residues (SDRs) for several kinase families. The identity, position in the sequence alignment and frequency of the SDRs identified vary considerably among the kinase families analyzed. The statistical significance of the PSSMs was assessed taking into account their recall and their information content (IC). We found negative correlations between the number of seed sequences and i) the recall of the PSSMs and ii) the IC of the PSSMs.
Based on the IC value, and in comparison with random distributions, we found that statistically significant PSSMs differ from non-significant ones in their IC, recall, number of seed phosphorylation sites and AUC-ROC. We have developed a computational strategy for the identification of proteins with a known function as adaptors or scaffolds of human kinases (kAS). In total, we identified a set of 191 kAS proteins, which is enriched in domains and functional terms that support their role as AS. These 191 kAS proteins associate with 55% of human kinases, which suggests that association with this type of AS molecule is common among human kinases. Our results suggest that, when compared to random proteins, kAS proteins are five times more likely to interact with a significantly large fraction of the substrates of the kinases with which the kAS are associated. Starting from a set of 156 human kinases for which at least five substrates are available, we identified a set of 279 proteins with a potential function as adaptor or scaffold (pAS). This set is enriched in protein domains and functional terms that are related to the predicted function and that also suggest a relationship to signalling processes. Our analysis of cellular co-localization suggests that, for 74.6% of the kinase-pAS associations found, the pAS protein may play a role in the co-localization of the kinases and their corresponding sets of substrates. Finally, we analyzed the relationship between the association of different kinases with common AS proteins and the number of in vivo substrates shared by the kinases. Our results suggest that kinases with AS proteins in common do not share more in vivo substrates than would be expected by chance alone. In our opinion, this suggests that AS proteins may diminish the substrate cross-specificity of the kinases with which they associate.
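
As an illustration of the PSSM and information content (IC) concepts mentioned in the abstract, the following is a minimal Python sketch and not the procedure used in the thesis. It assumes a uniform background distribution, a pseudocount of 1, and IC computed as the relative entropy of each alignment column against the background. The function names (build_pssm, score_peptide) and the toy seed peptides are hypothetical and were chosen only for this example.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def build_pssm(sites, background=None, pseudocount=1.0):
    """Build a log2-odds PSSM and per-column information content (in bits).

    sites: equal-length peptides aligned on the phospho-acceptor residue.
    background: residue -> frequency; uniform if omitted (an assumption).
    """
    length = len(sites[0])
    if any(len(s) != length for s in sites):
        raise ValueError("all seed sequences must have the same length")
    if background is None:
        background = {aa: 1.0 / len(AMINO_ACIDS) for aa in AMINO_ACIDS}

    pssm, ic = [], []
    for pos in range(length):
        # Count standard residues in this column; anything else is ignored.
        counts = Counter(s[pos] for s in sites if s[pos] in background)
        total = sum(counts.values()) + pseudocount * len(AMINO_ACIDS)
        freqs = {aa: (counts[aa] + pseudocount) / total for aa in AMINO_ACIDS}
        # Log-odds score of each residue against the background.
        pssm.append({aa: math.log2(freqs[aa] / background[aa])
                     for aa in AMINO_ACIDS})
        # Relative entropy of the column against the background.
        ic.append(sum(f * math.log2(f / background[aa])
                      for aa, f in freqs.items()))
    return pssm, ic


def score_peptide(pssm, peptide):
    """Sum the column scores of a peptide; unknown residues contribute 0."""
    return sum(col.get(aa, 0.0) for col, aa in zip(pssm, peptide))


# Toy seed set (hypothetical), with the phospho-acceptor serine at index 3.
sites = ["RRASL", "RRGSV", "RKASL", "RRLSI"]
pssm, ic = build_pssm(sites)
for pos, bits in enumerate(ic):
    print(f"position {pos}: IC = {bits:.2f} bits")
print("score RRASL:", round(score_peptide(pssm, "RRASL"), 2))
```

With so few seed sequences the pseudocount dominates the column frequencies, so the IC values of a matrix like this one are only indicative; this is one reason the number of seed sequences matters when judging a PSSM.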

Keywords

Protein kinases; Phosphorylation

Subjects

577 - Biochemistry. Molecular biology. Biophysics

Knowledge Area

Health Sciences

Documents

MAAT_PhD_THESIS.pdf

4.636Mb

Rights

WARNING. Access to the contents of this doctoral thesis and its use must respect the rights of the author. It may be used for consultation or personal study, as well as in research and teaching activities or materials, under the terms established in Art. 32 of the Consolidated Text of the Intellectual Property Law (RDL 1/1996). Any other use requires the prior and express authorization of the author. In any case, when using its contents, the full name of the author and the title of the doctoral thesis must be clearly indicated. Its reproduction or other forms of exploitation for profit are not authorized, nor is its public communication from a site outside the TDX service. Presentation of its content in a window or frame external to TDX (framing) is also not authorized. This reservation of rights applies both to the contents of the thesis and to its abstracts and indexes.
