Deep Proteome Profiling Enabled Functional Annotation and Data-Independent Quantification of Proline Hydroxylation Targets
Proline hydroxylation (Hyp) regulates protein structure, stability and protein-protein interaction and is widely involved in diverse metabolic and physiological pathways in cells and diseases. To reveal functional features of the proline hydroxylation proteome, we integrated various data sources for deep proteome profiling of proline hydroxylation proteome in human and developed HypDB (https://www.HypDB.site), an annotated database and web server for proline hydroxylation proteome. HypDB provides site-specific evidence of modification based on extensive LC-MS analysis and literature mining with 15319 non-redundant Hyp sites and 8226 sites with high confidence on human proteins. Annotation analysis revealed significant enrichment of proline hydroxylation on key functional domains and tissue-specific distribution of Hyp abundance across 26 types of human organs and fluids and 6 cell lines. The network connectivity analysis further revealed a critical role of proline hydroxylation in mediating protein-protein interactions. Moreover, the spectral library generated by HypDB enabled data-independent analysis (DIA) of clinical tissues and the identification of novel Hyp biomarkers in lung cancer and kidney cancer. Taken together, our integrated analysis of human proteome with publicly accessible HypDB revealed functional diversity of Hyp substrates and provides a quantitative data source to characterize proline hydroxylation in pathways and diseases.