Show simple item record

Authordc.contributor.authorBáez, Pablo
Authordc.contributor.authorVillena, Fabián
Authordc.contributor.authorZúñiga, Karen
Authordc.contributor.authorJones, Natalia
Authordc.contributor.authorFernández, Gustavo
Authordc.contributor.authorDurán, Manuel
Authordc.contributor.authorDunstan Escudero, Jocelyn Mariel
Admission datedc.date.accessioned2022-05-03T16:35:55Z
Available datedc.date.available2022-05-03T16:35:55Z
Publication datedc.date.issued2021
Cita de ítemdc.identifier.citationRev Med Chile 2021; 149: 1014-1022es_ES
Identifierdc.identifier.issn0034-9887
Identifierdc.identifier.urihttps://repositorio.uchile.cl/handle/2250/185227
Abstractdc.description.abstractA significant proportion of the clinical record is in free text format, making it difficult to extract key information and make secondary use of patient data. Automatic detection of information within narratives initially requires humans, following specific protocols and rules, to identify medical entities of interest. Aim: To build a linguistic resource of annotated medical entities on texts produced in Chilean hospitals. Material and Methods: A clinical corpus was constructed using 150 referrals in public hospitals. Three annotators identified six medical entities: clinical findings, diagnoses, body parts, medications, abbreviations, and family members. An annotation scheme was designed, and an iterative approach to train the annotators was applied. The F1-Score metric was used to assess the progress of the annotator’s agreement during their training. Results: An average F1-Score of 0.73 was observed at the beginning of the project. After the training period, it increased to 0.87. Annotation of clinical findings and body parts showed significant discrepancy, while abbreviations, medications, and family members showed high agreement. Conclusions: A linguistic resource with annotated medical entities on texts produced in Chilean hospitals was built and made available, working with annotators related to medicine. The iterative annotation approach allowed us to improve performance metrics. The corpus and annotation protocols will be released to the research community.es_ES
Lenguagedc.language.isoeses_ES
Publisherdc.publisherSoc Medica Santiagoes_ES
Type of licensedc.rightsAttribution-NonCommercial-NoDerivs 3.0 United States*
Link to Licensedc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/us/*
Sourcedc.sourceRevista Médica de Chilees_ES
Keywordsdc.subjectData curationes_ES
Keywordsdc.subjectData mininges_ES
Keywordsdc.subjectMedical informaticses_ES
Keywordsdc.subjectNatural language processinges_ES
Keywordsdc.subjectSupervised machine learninges_ES
Títulodc.titleConstrucción de recursos de texto para la identificación automática de información clínica en narrativas no estructuradases_ES
Title in another languagedc.title.alternativeConstruction of text resources for automatic identification of clinical information in unstructured narrativeses_ES
Document typedc.typeArtículo de revistaes_ES
dc.description.versiondc.description.versionVersión publicada - versión final del editores_ES
dcterms.accessRightsdcterms.accessRightsAcceso abiertoes_ES
Catalogueruchile.catalogadorcfres_ES
Indexationuchile.indexArtículo de publícación WoSes_ES
Indexationuchile.indexArtículo de publicación SCOPUSes_ES
Indexationuchile.indexArtículo de publicación SCIELOes_ES


Files in this item

Icon

This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-NoDerivs 3.0 United States
Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 United States