CORAZON: a web server for data normalization and unsupervised clustering based on expression profiles
Author
dc.contributor.author
Ramos, Thaís
Author
dc.contributor.author
Maracajá Coutinho, Vinicius
Author
dc.contributor.author
Ortega, J. Miguel
Author
dc.contributor.author
Do Rego, Thais G.
Admission date
dc.date.accessioned
2021-08-17T18:41:17Z
Available date
dc.date.available
2021-08-17T18:41:17Z
Publication date
dc.date.issued
2020
Cita de ítem
dc.identifier.citation
BMC Res Notes (2020) 13:338
es_ES
Identifier
dc.identifier.other
10.1186/s13104-020-05171-6
Identifier
dc.identifier.uri
https://repositorio.uchile.cl/handle/2250/181291
Abstract
dc.description.abstract
Objective Data normalization and clustering are mandatory steps in gene expression and downstream analyses, respectively. However, user-friendly implementations of these methodologies are available exclusively under expensive licensing agreements, or in stand-alone scripts developed, reflecting on a great obstacle for users with less computational skills. Results We developed an online tool called CORAZON (Correlations Analyses Zipper Online), which implements three unsupervised learning methods to cluster gene expression datasets in a friendly environment. It allows the usage of eight gene expression normalization/transformation methodologies and the attribute's influence. The normalizations requiring the gene length only could be performed to RNA-seq, meanwhile the others can be used with microarray and/or NanoString data. Clustering methodologies performances were evaluated through five models with accuracies between 92 and 100%. We applied our tool to obtain functional insights of non-coding RNAs (ncRNAs) based on Gene Ontology enrichment of clusters in a dataset generated by the ENCODE project. The clusters where the majority of transcripts are coding genes were enriched in Cellular, Metabolic, Transports, and Systems Development categories. Meanwhile, the ncRNAs were enriched in the Detection of Stimulus, Sensory Perception, Immunological System, and Digestion categories. CORAZON source-code is freely available atand the web-server can be accessed at http://corazon.integrativebioinformatics.me.
es_ES
Patrocinador
dc.description.sponsorship
Fondecyt Iniciacion, Comision Nacional de Investigacion Cientifica y Tecnologica (CONICYT), Chile
11161020
Programa Nacional de Insercion de Capital Humano Avanzado en la Academia, PAI-CONICYT, Chile
PAI79170021
Fondo de Financiamiento de Centro de Investigacion en Areas Prioritarias (FONDAP), CONICYT
15130011
Programa de Bienes Publicos Estrategicos para la Competitividad, Corporacion de Fomento de la Produccion (CORFO), Chile
16BPE-62321
Subsidio Semilla de Asignacion Flexible (SSAF), CORFO
14-SSAF-27061-9
Programa Start-Up Chile, CORFO
SUP12-13791
Coordenacao de Aperfeicoamento de Pessoal de Nivel Superior (CAPES)