Text Extraction and Standardization System Development for Pathological Records in the Korea Biobank Network

Soo Jeong Ko, Sunghyeon Park, Seol Whan Oh, Yun Seon Im, Surin Jung, Bo Yeon Choi, Jaeyoon Kim, Wona Choi, In Young Choi

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

In Korea, the Korea Centers for Disease Control and Prevention operates the Korea BioBank Network (KBN). The KBN holds pathological records collected in Korea, which form a useful dataset for research. In this study, we established a system that saves time and reduces errors by applying a step-by-step data extraction process to KBN pathological records. We tested the extraction process on 769 lung cancer cohorts and 1292 breast cancer cohorts, and the accuracy was 91%. We expect that this system can be used to efficiently process data from multiple institutions, including the Korea BioBank Network.

Original language: English
Title of host publication: MEDINFO 2023 - The Future is Accessible
Subtitle of host publication: Proceedings of the 19th World Congress on Medical and Health Informatics
Editors: Jen Bichel-Findlay, Paula Otero, Philip Scott, Elaine Huesing
Publisher: IOS Press BV
Pages: 1440-1441
Number of pages: 2
ISBN (Electronic): 9781643684567
State: Published - 25 Jan 2024
Event: 19th World Congress on Medical and Health Informatics, MedInfo 2023 - Sydney, Australia
Duration: 8 Jul 2023 to 12 Jul 2023

Publication series

Name: Studies in Health Technology and Informatics
Volume: 310
ISSN (Print): 0926-9630
ISSN (Electronic): 1879-8365

Conference

Conference: 19th World Congress on Medical and Health Informatics, MedInfo 2023
Country/Territory: Australia
City: Sydney
Period: 8/07/23 to 12/07/23

Bibliographical note

Publisher Copyright:
© 2024 International Medical Informatics Association (IMIA) and IOS Press.

Keywords

  • biobank system
  • NLP
