Protein secondary structure from circular dichroism spectroscopy. Combining variable selection principle and cluster analysis with neural network, ridge regression and self-consistent methods

J Mol Biol. 1994 Sep 30;242(4):497-507. doi: 10.1006/jmbi.1994.1597.

Abstract

Different approaches to improve the analysis of protein secondary structure from circular dichroism spectra are compared. Grouping proteins based on the similarity of their circular dichroism spectra, using cluster analysis methods, was utilized as a new way of implementing variable selection. The performance of three basic methods (neural networks, ridge regression and singular value decomposition) was evaluated in combination with three approaches to improve the predictions; namely, variable selection, cluster analysis and the self-consistent method. Cluster analysis performed on the basis set proteins resulted in three clusters, subanalyses of which provide a new way of performing variable selection. The neural network with two hidden layers performed better than that with one hidden layer and was combined with variable selection. Inclusion of the variable selection principle improved the performance of all three basic methods. While the neural network method performed slightly better than the other two methods at the basic level, the inclusion of variable selection led to similar performance indices for all three methods.

Publication types

  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Circular Dichroism*
  • Cluster Analysis
  • Neural Networks, Computer
  • Protein Structure, Secondary*
  • Regression Analysis