Protein secondary structure from circular dichroism spectroscopy. Combining variable selection principle and cluster analysis with neural network, ridge regression and self-consistent methods

N Sreerama; R W Woody

doi:10.1006/jmbi.1994.1597

Protein secondary structure from circular dichroism spectroscopy. Combining variable selection principle and cluster analysis with neural network, ridge regression and self-consistent methods

J Mol Biol. 1994 Sep 30;242(4):497-507. doi: 10.1006/jmbi.1994.1597.

Authors

N Sreerama¹, R W Woody

Affiliation

¹ Department of Biochemistry and Molecular Biology, Colorado State University, Fort Collins 80523.

PMID: 7932706
DOI: 10.1006/jmbi.1994.1597

Abstract

Different approaches to improve the analysis of protein secondary structure from circular dichroism spectra are compared. Grouping proteins based on the similarity of their circular dichroism spectra, using cluster analysis methods, was utilized as a new way of implementing variable selection. The performance of three basic methods (neural networks, ridge regression and singular value decomposition) was evaluated in combination with three approaches to improve the predictions; namely, variable selection, cluster analysis and the self-consistent method. Cluster analysis performed on the basis set proteins resulted in three clusters, subanalyses of which provide a new way of performing variable selection. The neural network with two hidden layers performed better than that with one hidden layer and was combined with variable selection. Inclusion of the variable selection principle improved the performance of all three basic methods. While the neural network method performed slightly better than the other two methods at the basic level, the inclusion of variable selection led to similar performance indices for all three methods.

Publication types

Research Support, U.S. Gov't, P.H.S.

MeSH terms

Circular Dichroism*
Cluster Analysis
Neural Networks, Computer
Protein Structure, Secondary*
Regression Analysis

Grants and funding

GM22994/GM/NIGMS NIH HHS/United States