Ongoing programme

FMCEB-CXR

A structured chest X-ray benchmark from a Nigerian tertiary hospital, labeled with a hybrid natural language pipeline and validated by consultant radiologists.

The dataset

FMCEB-CXR is built from 5,517 anonymized radiology reports from Federal Medical Centre Ebute-Metta in Lagos, covering 14 chest pathology categories. It is a single-site dataset today, designed to grow to further sites as the programme expands.

5,517 radiology reports

14 pathology categories

FMC Ebute-Metta, single site, expanding

Pathology distribution

The 14 categories span common chest findings. The full validated distribution will be published with the dataset. Validation is in progress.

Publications and outputs

The programme is producing a series of papers. The status of each is stated as it currently stands and is updated as the work progresses.

Submitted: MICCAI Open Data

Paper 1 — the dataset and its labeling pipeline

The dataset and its hybrid NLP labeling pipeline.

In preparation: MIRASOL Workshop, MICCAI 2026

Paper 2 — NLP labeling methodology

A deeper treatment of the NLP labeling methodology. MIRASOL is the Medical Image Computing in Resource-Constrained Settings workshop.

Planned: End of phase

Paper 3 — end of phase findings

Findings from the completed annotation and validation phase. Three or more papers are expected from this phase.

Methodology

Labels are extracted with a hybrid natural language pipeline. A bilingual medical dictionary of 447 phrases is combined with a clinical language model, BioClinicalBERT, that we fine-tuned for this task. Negation is detected explicitly, and radiologists validate every label before any model is trained.

Label quality comes first. Modeling follows validation.
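
To make the dictionary and negation steps concrete, here is a minimal sketch of how phrase matching with explicit negation detection might look. It is not the programme's pipeline: the four phrases, the negation cues, and the function name `label_report` are all hypothetical stand-ins, the real dictionary has 447 bilingual phrases, and the fine-tuned BioClinicalBERT stage is left out entirely.

```python
import re

# Hypothetical mini-dictionary mapping report phrases to pathology categories.
# Illustrative only; these are not entries from the project's 447-phrase dictionary.
PHRASES = {
    "cardiomegaly": "Cardiomegaly",
    "enlarged heart": "Cardiomegaly",
    "pleural effusion": "Pleural effusion",
    "consolidation": "Consolidation",
}

# Hypothetical negation cues; a real system would use a richer, validated list.
NEGATION_CUES = ("no", "without", "no evidence of", "free of")


def label_report(text):
    """Return {category: "present" | "negated"} for one report (rule-based sketch)."""
    labels = {}
    # Treat each sentence separately so a negation cue does not leak across sentences.
    for sentence in re.split(r"[.;\n]+", text.lower()):
        for phrase, category in PHRASES.items():
            match = re.search(r"\b" + re.escape(phrase) + r"\b", sentence)
            if match is None:
                continue
            # A mention counts as negated if any cue appears earlier in the same sentence.
            preceding = sentence[: match.start()]
            negated = any(
                re.search(r"\b" + re.escape(cue) + r"\b", preceding)
                for cue in NEGATION_CUES
            )
            # An affirmative mention anywhere in the report overrides a negated one.
            if labels.get(category) != "present":
                labels[category] = "negated" if negated else "present"
    return labels
```

For the text "Enlarged heart noted. No pleural effusion or consolidation seen.", this sketch marks Cardiomegaly as present and both Pleural effusion and Consolidation as negated. A rule of this kind is deliberately crude (any earlier cue in the sentence negates every later finding), which is one reason a pipeline would pair it with a language model and radiologist validation.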

Team and collaborators

Principal Investigator

Muhammad Toha Oyelakin

Co-Investigators

Dr. Babatunde Badmus Oduola-Owoo

Dr. Oyindamola Albert

Supervisors

Dr. Ramon Wahab

Consultant Radiologist, External Supervisor

Dr. Latifat Oduola-Owoo

Consultant Radiologist, Internal Supervisor

Dr. Amina Omolola Bello

Consultant Family Medicine, Clinical Supervisor, Primary Care Perspective

Annotation and Validation

Dr. Adebayo Alaka

Dr. Gerald Ochibili

Dr. Damilola Oluboyede

Dr. Dominic Umeh

Dr. Tolulope Dada

Technical and Research Support

Dr. Chukwudi Eke

ML/AI Engineer

Oyindamola Fijabi

Data Analyst

Mercy Akiri

Research Assistant