Combining automated peak tracking in SAR by NMR with structure-based backbone assignment from 15N-NOESY
Abstract
Background: Chemical shift mapping is an important technique in NMR-based drug screening for identifying the atoms of a target protein that potentially bind to a drug molecule as the molecule is introduced in increasing concentrations. The goal is to obtain a mapping of peaks with known residue assignment from the reference spectrum of the unbound protein to peaks with unknown assignment in the target spectrum of the bound protein. Although a series of perturbed spectra helps to trace a path from reference peaks to target peaks, a one-to-one mapping is generally not possible, especially for large proteins, because of errors such as noise peaks, missing peaks, peaks that disappear and then reappear, overlapped peaks, and new peaks not associated with any peak in the reference. Because of these difficulties, the mapping is typically done manually or semi-automatically, which is not efficient for high-throughput drug screening.
Results: We present PeakWalker, a novel peak walking algorithm for fast-exchange systems that models the errors explicitly and performs many-to-one mapping. On three proteins (hBclXL, UbcH5B, and histone H1), it achieves an average accuracy of over 95% with fewer than 1.5 residues predicted per target peak. Given these mappings as input, we present PeakAssigner, a novel combined structure-based backbone resonance and NOE assignment algorithm that uses just 15N-NOESY, while avoiding TOCSY experiments and 13C-labeling, to resolve the ambiguities for a one-to-one mapping. On the three proteins, it achieves an average accuracy of 94% or better.
Conclusions: Our mathematical programming approach for modeling chemical shift mapping as a graph problem, while modeling the errors directly, is potentially a time- and cost-effective first step for high-throughput drug screening based on limited NMR data and homologous 3D structures. © 2012 Jang et al.; licensee BioMed Central Ltd.
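The peak-tracing idea in the abstract can be illustrated with a minimal sketch. This is not PeakWalker, which the abstract describes as a mathematical programming approach over a graph; it is a simple greedy nearest-neighbor walk, assuming each peak is a (1H, 15N) shift pair in ppm. The names `walk_peaks`, `N_SCALE`, and `TOL`, the 0.2 weighting of 15N shifts, and the 0.05 ppm step tolerance are all hypothetical choices made for illustration.

```python
import math

N_SCALE = 0.2  # assumed weight of 15N shift differences relative to 1H
TOL = 0.05     # assumed maximum step per titration point (scaled ppm)


def dist(p, q):
    """Weighted distance between two (1H, 15N) peaks."""
    return math.hypot(p[0] - q[0], N_SCALE * (p[1] - q[1]))


def walk_peaks(reference, spectra, tol=TOL):
    """Greedily trace each assigned reference peak through the spectra.

    reference: dict mapping residue name -> (h_ppm, n_ppm)
    spectra:   list of peak lists ordered by increasing ligand
               concentration; the last list is the target spectrum
    Returns a dict mapping target peak index -> sorted residue names.
    Several residues may land on one target peak (many-to-one).
    """
    position = dict(reference)  # last known position of each residue
    matched = {}
    for peaks in spectra:
        matched = {}  # only matches in the final spectrum are reported
        for res, pos in list(position.items()):
            if not peaks:
                continue
            j = min(range(len(peaks)), key=lambda k: dist(pos, peaks[k]))
            if dist(pos, peaks[j]) <= tol:
                position[res] = peaks[j]
                matched[res] = j
            # Otherwise the peak is treated as missing in this spectrum;
            # the residue keeps its last position and may reappear later.
    mapping = {}
    for res, j in matched.items():
        mapping.setdefault(j, []).append(res)
    return {j: sorted(names) for j, names in mapping.items()}


if __name__ == "__main__":
    reference = {"A10": (8.00, 120.0), "G11": (8.30, 110.0), "L12": (8.02, 120.1)}
    spectra = [
        [(8.02, 120.1), (8.30, 110.0)],
        [(8.04, 120.2), (8.31, 110.0), (9.50, 130.0)],  # last peak: unassigned new peak
    ]
    result = walk_peaks(reference, spectra)
    print(result)
    print(sum(len(v) for v in result.values()) / len(result))
```

In this toy input, two overlapped residues end up on the same target peak, which is the kind of ambiguity the abstract says PeakAssigner resolves using 15N-NOESY data. The last printed value is the number of residues predicted per matched target peak, in the spirit of the metric quoted in the abstract.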
Citation: Jang R, Gao X, Li M (2012) Combining automated peak tracking in SAR by NMR with structure-based backbone assignment from 15N-NOESY. BMC Bioinformatics 13: S4. doi:10.1186/1471-2105-13-S3-S4.
PubMed Central ID: PMC3402924
Except where otherwise noted, this item's license is described as follows: This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
- Protein NMR structure determination with automated NOE assignment using the new software CANDID and the torsion angle dynamics algorithm DYANA.
- Authors: Herrmann T, Güntert P, Wüthrich K
- Issue date: 2002 May 24
- Towards fully automated structure-based NMR resonance assignment of ¹⁵N-labeled proteins from automatically picked peaks.
- Authors: Jang R, Gao X, Li M
- Issue date: 2011 Mar
- Protein NMR structure determination with automated NOE-identification in the NOESY spectra using the new software ATNOS.
- Authors: Herrmann T, Güntert P, Wüthrich K
- Issue date: 2002 Nov
- Automated amino acid side-chain NMR assignment of proteins using ¹³C- and ¹⁵N-resolved 3D [¹H,¹H]-NOESY.
- Authors: Fiorito F, Herrmann T, Damberger FF, Wüthrich K
- Issue date: 2008 Sep
- A new algorithm for reliable and general NMR resonance assignment.
- Authors: Schmidt E, Güntert P
- Issue date: 2012 Aug 1
Showing items related by title, author, creator and subject.
Solution Structure of the Tandem Acyl Carrier Protein Domains from a Polyunsaturated Fatty Acid Synthase Reveals Beads-on-a-String Configuration
Trujillo, Uldaeliz; Vázquez-Rosa, Edwin; Oyola-Robles, Delise; Stagg, Loren J.; Vassallo, David A.; Vega, Irving E.; Arold, Stefan T.; Baerga-Ortiz, Abel (Public Library of Science (PLoS), 2013-02-28)
The polyunsaturated fatty acid (PUFA) synthases from deep-sea bacteria invariably contain multiple acyl carrier protein (ACP) domains in tandem. This conserved tandem arrangement has been implicated in both amplification of fatty acid production (additive effect) and in structural stabilization of the multidomain protein (synergistic effect). While the more accepted model is one in which domains act independently, recent reports suggest that ACP domains may form higher oligomers. Elucidating the three-dimensional structure of tandem arrangements may therefore give important insights into the functional relevance of these structures, and hence guide bioengineering strategies. In an effort to elucidate the three-dimensional structure of tandem repeats from deep-sea anaerobic bacteria, we have expressed and purified a fragment consisting of five tandem ACP domains from the PUFA synthase from Photobacterium profundum. Analysis of the tandem ACP fragment by analytical gel filtration chromatography showed a retention time suggestive of a multimeric protein. However, small angle X-ray scattering (SAXS) revealed that the multi-ACP fragment is an elongated monomer which does not form a globular unit. Stokes radii calculated from atomic monomeric SAXS models were comparable to those measured by analytical gel filtration chromatography, showing that in the gel filtration experiment, the molecular weight was overestimated due to the elongated protein shape. Thermal denaturation monitored by circular dichroism showed that unfolding of the tandem construct was not cooperative, and that the tandem arrangement did not stabilize the protein. Taken together, these data are consistent with an elongated beads-on-a-string arrangement of the tandem ACP domains in PUFA synthases, and speak against synergistic biocatalytic effects promoted by quaternary structuring. Thus, it is possible to envision bioengineering strategies which simply involve the artificial linking of multiple ACP domains for increasing the yield of fatty acids in bacterial cultures. © 2013 Trujillo et al.
Structural analysis and dimerization profile of the SCAN domain of the pluripotency factor Zfp206
Liang, Yu; Huimei Hong, Felicia; Ganesan, Pugalenthi; Jiang, Sizun; Jauch, Ralf; Stanton, Lawrence W.; Kolatkar, Prasanna R. (Oxford University Press (OUP), 2012-06-26)
Zfp206 (also named Zscan10) belongs to the subfamily of C2H2 zinc finger transcription factors, which is characterized by the N-terminal SCAN domain. The SCAN domain mediates self-association and association between the members of SCAN family transcription factors, but the structural basis and selectivity determinants for complex formation are unknown. Zfp206 is important for maintaining the pluripotency of embryonic stem cells, presumably by combinatorial assembly of itself or other SCAN family members on enhancer regions. To gain insights into the folding topology and selectivity determinants for SCAN dimerization, we solved the 1.85 Å crystal structure of the SCAN domain of Zfp206. In vitro binding studies using a panel of 20 SCAN proteins indicate that the SCAN domain of Zfp206 can selectively associate with other members of SCAN family transcription factors. Deletion mutations showed that the N-terminal helix 1 is critical for heterodimerization. Double mutations and multiple mutations based on the Zfp206SCAN-Zfp110SCAN model suggested that domain-swapped topology is a possible preference for the Zfp206SCAN-Zfp110SCAN heterodimer. Together, we demonstrate that the Zfp206SCAN constitutes a protein module that enables C2H2 transcription factor dimerization in a highly selective manner using a domain-swapped interface architecture and identify novel partners for Zfp206 during embryonal development. © 2012 The Author(s).
Simplified method to predict mutual interactions of human transcription factors based on their primary structure
Schmeier, Sebastian; Jankovic, Boris R.; Bajic, Vladimir B. (Public Library of Science (PLoS), 2011-07-05)
Background: Physical interactions between transcription factors (TFs) are necessary for forming regulatory protein complexes and thus play a crucial role in gene regulation. Currently, knowledge about the mechanisms of these TF interactions is incomplete and the number of known TF interactions is limited. Computational prediction of such interactions can help identify potential new TF interactions as well as contribute to better understanding the complex machinery involved in gene regulation.
Methodology: We propose here such a method for the prediction of TF interactions. The method uses only the primary sequence information of the interacting TFs, resulting in a much greater simplicity of the prediction algorithm. Through an advanced feature selection process, we determined a subset of 97 model features that constitute the optimized model in the subset we considered. The model, based on quadratic discriminant analysis, achieves a prediction accuracy of 85.39% on a blind set of interactions. This result is achieved despite the selection for the negative data set of only those TF from the same type of proteins, i.e. TFs that function in the same cellular compartment (nucleus) and in the same type of molecular process (transcription initiation). Such selection poses significant challenges for developing models with high specificity, but at the same time better reflects real-world problems.
Conclusions: The performance of our predictor compares well to those of much more complex approaches for predicting TF and general protein-protein interactions, particularly when taking the reduced complexity of model utilisation into account. © 2011 Schmeier et al.