Determination of binding affinity upon mutation for type I dockerin-cohesin complexes from Clostridium thermocellum and Clostridium cellulolyticum using deep sequencing.


Abstract

The comprehensive sequence determinants of binding affinity for type I cohesin toward dockerin from Clostridium thermocellum and Clostridium cellulolyticum were evaluated using deep mutational scanning coupled to yeast surface display. We measured the relative binding affinity to dockerin for 2970 and 2778 single point mutants of C. thermocellum and C. cellulolyticum, respectively, representing over 96% of all possible single point mutants. The interface ΔΔG for each variant was reconstructed from sequencing counts and compared with three independent experimental methods. This reconstruction results in a narrow dynamic range of -0.8 to 0.5 kcal/mol. The computational software packages FoldX and Rosetta were used to predict mutations that disrupt binding by more than 0.4 kcal/mol. The area under the receiver operating characteristic (ROC) curve was 0.82 for FoldX and 0.77 for Rosetta, showing reasonable agreement between predictions and experimental results. Destabilizing mutations at core and rim positions were predicted with higher accuracy than those at support positions. This benchmark dataset may be useful for developing new computational tools that predict the effect of mutations on binding affinity in protein-protein interactions. Experimental considerations to improve the precision and range of the reconstruction method are discussed.
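
The ROC evaluation described in the abstract can be illustrated with a minimal sketch. It assumes that a positive ΔΔG means weaker binding, that a variant counts as disruptive when its experimental ΔΔG exceeds the 0.4 kcal/mol threshold, and that a higher predicted ΔΔG should rank a variant as more likely disruptive. The function names and all numeric values are hypothetical and are not taken from the study data.

```python
# Sketch only: ROC AUC for "disruptive mutation" classification.
# Assumption: positive ddG = destabilized binding (sign convention not stated in the record).

THRESHOLD_KCAL_MOL = 0.4  # cutoff quoted in the abstract


def disruptive_labels(experimental_ddg, threshold=THRESHOLD_KCAL_MOL):
    """Label a variant True when its experimental ddG exceeds the threshold."""
    return [ddg > threshold for ddg in experimental_ddg]


def roc_auc(labels, scores):
    """AUC as the probability that a random positive outscores a random
    negative, with ties counted as one half (Mann-Whitney formulation)."""
    positives = [s for is_pos, s in zip(labels, scores) if is_pos]
    negatives = [s for is_pos, s in zip(labels, scores) if not is_pos]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative variant")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical toy values (kcal/mol), for illustration only.
experimental = [0.5, 0.45, 0.1, -0.2, 0.0, 0.42]
predicted = [2.1, 0.3, 0.5, -0.1, 0.2, 1.5]

labels = disruptive_labels(experimental)
auc = roc_auc(labels, predicted)
print(f"AUC = {auc:.3f}")
```

The pairwise formulation is quadratic in the number of variants, which is fine for a few thousand single point mutants; a rank-based implementation would be preferable for much larger sets.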

Submission Details

ID: WV8tmQRk

Submitter: Marie Ary

Submission Date: July 31, 2017, 11:46 a.m.

Version: 1

Publication Details
Kowalsky CA, Whitehead TA. Proteins (2016). Determination of binding affinity upon mutation for type I dockerin-cohesin complexes from Clostridium thermocellum and Clostridium cellulolyticum using deep sequencing. PMID: 27699856

Relevant UniProtKB Entries

Percent Identity   Matching Chains                             Protein Accession   Entry Name
100.0              Type I cohesin domain from C. thermocellum  Q06851              CIPA_CLOTH
90.5               Type I cohesin domain from C. thermocellum  Q01866              CIPB_CLOTM