
# Learning theory applied to Sigmoid network classification of protein biological function using primary protein structure

Recently, Valiant’s Probably Approximately Correct (PAC) learning theory has been extended to learning from m-dependent data. With this extension, training set sizes for sigmoid neural networks have been bounded without assumptions about the underlying distribution of the training data. These extensions allow learning theory to be applied to training sets that are not independent samples of the complete input space. In our work, we are developing length-independent measures as training data for protein classification. This paper applies these learning theory methods to the problem of training a sigmoid neural network to recognize protein biological activity classes as a function of protein primary structure. Specifically, we explore the theoretical training set sizes for classifiers that use the full amino acid sequence of the protein as training data and for classifiers that use length-independent measures as training data. The results give bounds on training set size, under protein size limits, for full-sequence input and compare them with bounds for input that is independent of sequence length.
Mathematics Subject Classification: Primary: 58F15, 58F17, 58F11; Secondary: 53C35.
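
To make the comparison in the abstract concrete, the following is a minimal sketch, not the paper's method. It assumes a one-hot encoding of the full sequence (20 inputs per residue), amino acid composition (20 inputs in total) as a stand-in for a length-independent measure, and a fully connected network with one hidden layer. It uses the classical i.i.d. PAC bound of Blumer et al. (1989) rather than the m-dependent bounds the paper applies, and it uses the weight count as a crude proxy for network capacity. All function names and the layer sizes are hypothetical.

```python
import math

AMINO_ACIDS = 20  # size of the standard amino acid alphabet


def weight_count(n_inputs, n_hidden, n_outputs):
    """Weights plus biases in a fully connected one-hidden-layer network."""
    return (n_inputs + 1) * n_hidden + (n_hidden + 1) * n_outputs


def full_sequence_inputs(max_length):
    """Input width for a one-hot encoded sequence padded to max_length (assumed encoding)."""
    return AMINO_ACIDS * max_length


def composition_inputs():
    """Input width for amino acid composition, an assumed length-independent measure."""
    return AMINO_ACIDS


def blumer_sample_bound(capacity, epsilon, delta):
    """Classical i.i.d. PAC sample bound for a consistent learner (Blumer et al., 1989).

    capacity stands in for the VC dimension. This is NOT the m-dependent
    bound discussed in the paper; it only illustrates the scaling.
    """
    term_confidence = (4.0 / epsilon) * math.log2(2.0 / delta)
    term_capacity = (8.0 * capacity / epsilon) * math.log2(13.0 / epsilon)
    return math.ceil(max(term_confidence, term_capacity))


if __name__ == "__main__":
    hidden, classes = 10, 5      # hypothetical layer sizes
    epsilon, delta = 0.1, 0.05   # hypothetical accuracy and confidence targets

    w_comp = weight_count(composition_inputs(), hidden, classes)
    print("composition:", w_comp, blumer_sample_bound(w_comp, epsilon, delta))

    for max_len in (100, 500, 1000):
        w_full = weight_count(full_sequence_inputs(max_len), hidden, classes)
        print("full sequence, L =", max_len, ":", w_full,
              blumer_sample_bound(w_full, epsilon, delta))
```

Under these assumptions the capacity proxy for the full-sequence classifier grows linearly with the protein size limit, while the length-independent input keeps it fixed, which is the contrast the abstract describes. Known VC dimension bounds for sigmoid networks grow faster than the weight count, so the proxy likely understates the gap.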

Open Access Under a Creative Commons license
