Multilingual RECIST classification of radiology reports using supervised learning

Mottin, Luc; Goldman, Jean-Philippe; Jäggli, Christoph; Achermann, Rita; Gobeill, Julien; Knafou, Julien; Ehrsam, Julien; Wicky, Alexandre; Gérard, Camille L.; Schwenk, Tanja; Charrier, Mélinda; Tsantoulis, Petros; Lovis, Christian; Leichtle, Alexander; Kiessling, Michael K.; Michielin, Olivier; Pradervand, Sylvain; Foufi, Vasiliki; Ruch, Patrick

doi:10.3389/fdgth.2023.1195017

2023

Abstract

Objectives: This study explores Artificial Intelligence and Natural Language Processing techniques to support the automatic assignment of the four Response Evaluation Criteria in Solid Tumors (RECIST) scales based on radiology reports. We also aim to evaluate how the languages and institutional specificities of Swiss teaching hospitals are likely to affect the quality of the classification in French and German. Methods: Seven machine learning methods were evaluated to establish a strong baseline. Robust models were then built, fine-tuned according to the language (French and German), and compared with the expert annotation. Results: The best strategies yield average F1-scores of 90% and 86% for the two-class (Progressive/Non-progressive) and four-class (Progressive Disease, Stable Disease, Partial Response, Complete Response) RECIST classification tasks, respectively. Conclusions: These results are competitive with manual labeling as measured by the Matthews correlation coefficient and Cohen’s Kappa (79% and 76%). On this basis, we confirm the capacity of specific models to generalize to new unseen data and we assess the impact of using Pre-trained Language Models (PLMs) on the accuracy of the classifiers.
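
The agreement measures named in the abstract (average F1-score, Matthews correlation coefficient, Cohen's Kappa) can be illustrated with a minimal sketch. This is not the authors' evaluation code. The label strings (`PD`, `SD`, `PR`, `CR`), the helper names, the choice of macro-averaging for F1, and the toy labels are all assumptions made for illustration; the abstract does not say which averaging scheme was used.

```python
from collections import Counter
from math import sqrt


def to_binary(label):
    """Collapse a four-class RECIST label into Progressive (P) / Non-progressive (NP)."""
    return "P" if label == "PD" else "NP"


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 scores (macro-averaging is an assumption)."""
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for k in labels:
        tp = sum(t == k and p == k for t, p in zip(y_true, y_pred))
        fp = sum(t != k and p == k for t, p in zip(y_true, y_pred))
        fn = sum(t == k and p != k for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


def matthews_corrcoef(y_true, y_pred):
    """Multiclass Matthews correlation coefficient computed from label counts."""
    n = len(y_true)
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    true_counts, pred_counts = Counter(y_true), Counter(y_pred)
    cov_tp = correct * n - sum(true_counts[k] * pred_counts[k] for k in true_counts)
    cov_tt = n * n - sum(v * v for v in true_counts.values())
    cov_pp = n * n - sum(v * v for v in pred_counts.values())
    denom = sqrt(cov_tt * cov_pp)
    return cov_tp / denom if denom else 0.0


def cohen_kappa(y_true, y_pred):
    """Cohen's Kappa: observed agreement corrected for chance agreement."""
    n = len(y_true)
    true_counts, pred_counts = Counter(y_true), Counter(y_pred)
    p_observed = sum(t == p for t, p in zip(y_true, y_pred)) / n
    p_chance = sum(true_counts[k] * pred_counts[k] for k in true_counts) / (n * n)
    return (p_observed - p_chance) / (1 - p_chance) if p_chance != 1 else 0.0


# Toy labels, invented for illustration only (not data from the study).
gold = ["PD", "PD", "PD", "SD", "SD", "PR", "PR", "CR"]
pred = ["PD", "PD", "SD", "SD", "SD", "PR", "SD", "CR"]

four_class_f1 = macro_f1(gold, pred)
two_class_f1 = macro_f1([to_binary(g) for g in gold], [to_binary(p) for p in pred])
mcc = matthews_corrcoef(gold, pred)
kappa = cohen_kappa(gold, pred)
print(f"4-class macro F1: {four_class_f1:.3f}")
print(f"2-class macro F1: {two_class_f1:.3f}")
print(f"MCC: {mcc:.3f}  Cohen's Kappa: {kappa:.3f}")
```

The two-class task is derived here by mapping every non-`PD` label to Non-progressive, which matches the class names given in the abstract but is otherwise an assumption about how the grouping was done.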

Details

Title Multilingual RECIST classification of radiology reports using supervised learning

Author(s) Mottin, Luc (Haute école de gestion de Genève, HES-SO Haute Ecole Spécialisée de Suisse Occidentale ; SIB Text Mining Group, Swiss Institute of Bioinformatics, Geneva, Switzerland)
Goldman, Jean-Philippe (Division of Medical Information Sciences, University Hospitals of Geneva, Geneva, Switzerland)
Jäggli, Christoph (Inselspital – Bern University Hospital and University of Bern, Bern, Switzerland)
Achermann, Rita (Department of Radiology, Clinic of Radiology & Nuclear Medicine, University Hospital Basel, University of Basel, Basel, Switzerland)
Gobeill, Julien (SIB Text Mining Group, Swiss Institute of Bioinformatics, Geneva, Switzerland)
Knafou, Julien (SIB Text Mining Group, Swiss Institute of Bioinformatics, Geneva, Switzerland)
Ehrsam, Julien (Department of Radiology and Medical Informatics, University of Geneva, Geneva, Switzerland)
Wicky, Alexandre (Precision Oncology Center, Oncology Department, Centre Hospitalier Universitaire Vaudois – CHUV, Lausanne, Switzerland)
Gérard, Camille L. (Precision Oncology Center, Oncology Department, Centre Hospitalier Universitaire Vaudois – CHUV, Lausanne, Switzerland)
Schwenk, Tanja (Department of Oncology, Kantonsspital Aarau, Aarau, Switzerland)
Charrier, Mélinda (Division of Medical Information Sciences, University Hospitals of Geneva, Geneva, Switzerland)
Tsantoulis, Petros (Division of Medical Information Sciences, University Hospitals of Geneva, Geneva, Switzerland ; Department of Radiology and Medical Informatics, University of Geneva, Geneva, Switzerland)
Lovis, Christian (Division of Medical Information Sciences, University Hospitals of Geneva, Geneva, Switzerland ; Department of Radiology and Medical Informatics, University of Geneva, Geneva, Switzerland)
Leichtle, Alexander (Inselspital – Bern University Hospital and University of Bern, Bern, Switzerland)
Kiessling, Michael K. (Department of Medical Oncology and Hematology, University Hospital Zurich, Zurich, Switzerland)
Michielin, Olivier (Precision Oncology Center, Oncology Department, Centre Hospitalier Universitaire Vaudois – CHUV, Lausanne, Switzerland)
Pradervand, Sylvain (Precision Oncology Center, Oncology Department, Centre Hospitalier Universitaire Vaudois – CHUV, Lausanne, Switzerland)
Foufi, Vasiliki (Division of Medical Information Sciences, University Hospitals of Geneva, Geneva, Switzerland)
Ruch, Patrick (SIB Text Mining Group, Swiss Institute of Bioinformatics, Geneva, Switzerland)

Date 2023-06

Published in Frontiers in Digital Health

Volume 2023, vol. 5

Pagination 10 p.

DOI https://doi.org/10.3389/fdgth.2023.1195017

ISSN 2673-253X

Keywords (free) supervised machine learning ; narrative text classification ; RECIST ; radiology reports ; language models

Type Scientific article

Domain Economie et Services

School HEG - Genève

Institute CRAG - Centre de Recherche Appliquée en Gestion

The document appears in Scientific articles
Global