CD-NuSS: A Web Server for the Automated Secondary Structural Characterization of the Nucleic Acids from Circular Dichroism Spectra Using Extreme Gradient Boosting Decision-Tree, Neural Network and Kohonen Algorithms

Sathyaseelan, Chakkarai and Vijayakumar, Vinothini and Rathinavelan, Thenmalarchelvi (2021) CD-NuSS: A Web Server for the Automated Secondary Structural Characterization of the Nucleic Acids from Circular Dichroism Spectra Using Extreme Gradient Boosting Decision-Tree, Neural Network and Kohonen Algorithms. Journal of Molecular Biology, 433 (11). p. 166629. ISSN 00222836

Full text not available from this repository.

Abstract

Nucleic acids exhibit a repertoire of conformational preferences depending on sequence and environment. Circular dichroism (CD) is a valuable tool for monitoring such secondary structural conformations of nucleic acids. Nonetheless, the CD spectral diversity associated with these structures makes it challenging to obtain quantitative information about the secondary structural content from a given CD spectrum. To this end, the extreme gradient boosting decision-tree (XGBoost), Kohonen and neural network (nnet) algorithms have been exploited here to predict the diverse secondary structures of nucleic acids. A curated library of 450 CD spectra corresponding to 16 different secondary structures of nucleic acids has been created and used as a training dataset. The hyper-parameters of these algorithms have been optimized using holdout and k-fold (here, k = 5) cross-validation methods. For a test dataset of 150 CD spectra, the nnet and XGBoost algorithms have exhibited similar prediction accuracies in the range of 85% to 87% (XGBoost being slightly more accurate). Thus, the nnet and XGBoost algorithms tested here can be employed for predicting hybrid nucleic acid topologies in the future. For the sake of accessibility, the entire process has been automated and implemented as a web server called CD-NuSS (CD to nucleic acids secondary structure), which is freely accessible at https://project.iith.ac.in/cdnuss.
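The abstract states that hyper-parameters were tuned with holdout and 5-fold cross-validation but does not give the procedure. The following is a minimal sketch of the 5-fold part only, under stated assumptions: the function names (`k_fold_indices`, `cross_val_accuracy`, `make_knn`, `select_hyperparameter`) are hypothetical, a simple k-nearest-neighbour classifier stands in for XGBoost and nnet, and the toy vectors are placeholders, not CD spectra. It is not the authors' implementation.

```python
import random
from collections import Counter
from statistics import mean


def k_fold_indices(n_samples, k=5, seed=0):
    """Yield (train_idx, val_idx) index lists for k-fold cross-validation."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]  # k near-equal, disjoint folds
    for i, val_idx in enumerate(folds):
        train_idx = [j for f, fold in enumerate(folds) if f != i for j in fold]
        yield train_idx, val_idx


def cross_val_accuracy(fit, predict, X, y, k=5, seed=0):
    """Mean validation accuracy of a classifier over the k folds."""
    scores = []
    for train_idx, val_idx in k_fold_indices(len(X), k, seed):
        model = fit([X[i] for i in train_idx], [y[i] for i in train_idx])
        preds = [predict(model, X[i]) for i in val_idx]
        correct = sum(p == y[i] for p, i in zip(preds, val_idx))
        scores.append(correct / len(val_idx))
    return mean(scores)


def make_knn(n_neighbors):
    """Stand-in classifier (assumption): plain k-nearest-neighbour voting."""
    def fit(X_train, y_train):
        return list(zip(X_train, y_train))  # k-NN only memorises the data

    def predict(model, x):
        nearest = sorted(
            model,
            key=lambda item: sum((a - b) ** 2 for a, b in zip(item[0], x)),
        )
        votes = Counter(label for _, label in nearest[:n_neighbors])
        return votes.most_common(1)[0][0]

    return fit, predict


def select_hyperparameter(candidates, X, y, k=5):
    """Return the candidate with the highest mean k-fold accuracy."""
    scores = {c: cross_val_accuracy(*make_knn(c), X, y, k=k) for c in candidates}
    best = max(scores, key=scores.get)
    return best, scores


# Toy placeholder data: 20 three-point vectors in two well-separated classes.
X_toy = [[c * 10 + i * 0.1, c * 10 - i * 0.1, float(c)]
         for c in (0, 1) for i in range(10)]
y_toy = [c for c in (0, 1) for _ in range(10)]

best, scores = select_hyperparameter([1, 3, 5], X_toy, y_toy)
print("best n_neighbors:", best, "scores:", scores)
```

With a training library of 450 spectra and k = 5, each fold would validate on 90 spectra and train on the remaining 360; the 150-spectrum test set reported in the abstract stays outside this loop.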

IITH Creators:
IITH Creators | ORCiD
Sathyaseelan, Chakkarai | UNSPECIFIED
Vijayakumar, Vinothini | UNSPECIFIED
Rathinavelan, Thenmalarchelvi | http://orcid.org/0000-0002-1142-0583
Item Type: Article
Uncontrolled Keywords: circular dichroism; Kohonen algorithm; nnet algorithm; nucleic acids secondary structure prediction; XGBoost algorithm
Subjects: Others > Biotechnology
Divisions: Department of Biotechnology
Depositing User: LibTrainee 2021
Date Deposited: 06 Jul 2021 07:13
Last Modified: 06 Jul 2021 07:13
URI: http://raiithold.iith.ac.in/id/eprint/8133
Publisher URL: http://doi.org/10.1016/j.jmb.2020.08.014
OA policy: https://v2.sherpa.ac.uk/id/publication/11379