The HD-GYP domain, named after two of its conserved sequence motifs, was first described in 1999 as a specialized version of the widespread HD phosphohydrolase domain that had additional highly conserved amino acid residues. Domain associations of HD-GYP indicated its involvement in bacterial signal transduction and distribution patterns of this domain suggested that it could serve as a hydrolase of the bacterial second messenger c-di-GMP, in addition to or instead of the EAL domain. Subsequent studies confirmed the ability of various HD-GYP domains to hydrolyze c-di-GMP to linear pGpG and/or GMP. Certain HD-GYP-containing proteins hydrolyze another second messenger, cGAMP, and some HD-GYP domains participate in regulatory protein-protein interactions. The recently solved structures of HD-GYP domains from four distinct organisms clarified the mechanisms of c-di-GMP binding and metal-assisted hydrolysis. However, the HD-GYP domain is poorly represented in public domain databases, which causes certain confusion about its phylogenetic distribution, functions, and domain architectures. Here, we present a refined sequence model for the HD-GYP domain and describe the roles of its most conserved residues in metal and/or substrate binding. We also calculate the numbers of HD-GYPs encoded in various genomes and list the most common domain combinations involving HD-GYP, such as the RpfG (REC–HD-GYP), Bd1817 (DUF3391– HD-GYP), and PmGH (GAF–HD-GYP) protein families. We also provide the descriptions of six HD-GYP–associated domains, including four novel integral membrane sensor domains. This work is expected to stimulate studies of diverse HD-GYP-containing proteins, their N-terminal sensor domains and the signals to which they respond.
IMPORTANCE
The HD-GYP domain forms class II of c-di-GMP phosphodiesterases that control the cellular levels of the universal bacterial second messenger c-di-GMP and therefore affect flagellar and/or twitching motility, cell development, biofilm formation, and, often, virulence. Despite more than 20 years of research, HD-GYP domains are insufficiently characterized; they are often confused with ‘classical’ HD domains that are involved in various housekeeping activities and may participate in signaling, hydrolyzing (p)ppGpp and c-di-AMP. This work provides an updated description of the HD-GYP domain, including its sequence conservation, phylogenetic distribution, domain architectures, and the most widespread HD-GYP-containing protein families. This work shows that HD-GYP domains are widespread in many environmental bacteria and are predominant c-di-GMP hydrolases in many lineages, including clostridia and
deltaproteobacteria
.