Not so weak PICO : leveraging weak supervision for participants, interventions, and outcomes recognition for systematic review automation

Dhrangadhariya, Anjani; Müller, Henning

doi:10.1093/jamiaopen/ooac107

Dhrangadhariya, Anjani; Müller, Henning

2023

Télécharger

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Résumé

Objective: The aim of this study was to test the feasibility of PICO (participants, interventions, comparators, outcomes) entity extraction using weak supervision and natural language processing. Methodology: We re-purpose more than 127 medical and nonmedical ontologies and expert-generated rules to obtain multiple noisy labels for PICO entities in the evidence-based medicine (EBM)-PICO corpus. These noisy labels are aggregated using simple majority voting and generative modeling to get consensus labels. The resulting probabilistic labels are used as weak signals to train a weakly supervised (WS) discriminative model and observe performance changes. We explore mistakes in the EBM-PICO that could have led to inaccurate evaluation of previous automation methods. Results: In total, 4081 randomized clinical trials were weakly labeled to train the WS models and compared against full supervision. The models were separately trained for PICO entities and evaluated on the EBM-PICO test set. A WS approach combining ontologies and expert-generated rules outperformed full supervision for the participant entity by 1.71% macro-F1. Error analysis on the EBM-PICO subset revealed 18–23% erroneous token classifications. Discussion: Automatic PICO entity extraction accelerates the writing of clinical systematic reviews that commonly use PICO information to filter health evidence. However, PICO extends to more entities—PICOS (S—study type and design), PICOC (C—context), and PICOT (T—timeframe) for which labelled datasets are unavailable. In such cases, the ability to use weak supervision overcomes the expensive annotation bottleneck. Conclusions: We show the feasibility of WS PICO entity extraction using freely available ontologies and heuristics without manually annotated data. Weak supervision has encouraging performance compared to full supervision but requires careful design to outperform it.

Détails

Titre Not so weak PICO : leveraging weak supervision for participants, interventions, and outcomes recognition for systematic review automation

Auteur(s)/ trice(s) Dhrangadhariya, Anjani (School of Management, University of Applied Sciences and Arts Western Switzerland Valais ; University of Geneva (UNIGE), Geneva, Switzerland)
Müller, Henning (School of Management, University of Applied Sciences and Arts Western Switzerland Valais ; University of Geneva (UNIGE), Geneva, Switzerland)

Date 2023-04

Publié dans JAMIA Open

Volume April 2023, vol. 6, no. 1, ooac107

Pagination 10 p.

DOI https://doi.org/10.1093/jamiaopen/ooac107

ISSN 2574-2531

Mots-clés (libres) weak supervision ; machine learning ; information extraction ; evidence-based medicine

Type d'article scientifique

Domaine Economie et Services

Ecole HEG-VS

Institut Institut Informatique de gestion

Le document apparaît dans Articles scientifiques
Global

Résumé

Détails

Actions

PDF