Have your say in the design of BIP - The UK Biobank Brain Imaging Pipeline!

Presented During:

Tuesday, June 25, 2024: 12:00 PM - 1:15 PM
COEX  
Room: Grand Ballroom 103  

Poster No:

2260 

Submission Type:

Abstract Submission 

Authors:

Fidel Alfaro Almagro1, Stephen Smith1

Institutions:

1University of Oxford, Oxford, Oxfordshire

First Author:

Fidel Alfaro Almagro  
University of Oxford
Oxford, Oxfordshire

Co-Author:

Stephen Smith  
University of Oxford
Oxford, Oxfordshire

Introduction:

The image processing pipeline for UK Biobank brain imaging (1), has been used to process more than 72k UKB datasets, and >500 papers have used its outputs. The pipeline works with T1, T2 FLAIR, swMRI, dMRI, rfMRI, tfMRI (and ASL in recent subjects) and consists of a mix of scripts in bash, Python and Matlab, with the underlying tools coming primarily from FSL and FreeSurfer (Fig 1). It applies image processing and QC, and generates thousands of Imaging-Derived Phenotypes (IDPs). The pipeline was optimised for UKB data and has been adapted for other studies with some effort (2, 3, 4).

By 2023, 72k brain imaging datasets had been acquired and processed. 63k were usable data from the first imaging visit and 5k from the second. The goal is 100k first-visit and 60k repeat-visit scans. Fig 2 shows pairwise associations between 4k brain IDPs and 27k non-imaging variables.
Supporting Image: Pipeline_Abstract.png
   ·Figure 1: UKB pipeline flowcharts from (Alfaro-Almagro et al, 2018)
Supporting Image: UKB-PersonHatOn.jpg
   ·Figure 2: Results from 108 million association tests between brain IDPs and other UKB variables.
 

Methods:

The pipeline has maintained almost perfect backwards compatibility so that all outputs (for new scans) are compatible with existing outputs for previous subjects/scans. Therefore, changes have been limited to adding new processing (e.g., the recent addition of QSM processing). However, maintaining backwards compatibility limits the ability to incorporate improved algorithms and software, and we are now planning a complete rewrite of the pipeline, named BIP (Brain Imaging Pipeline).

Improvements being considered include:

Pipeline framework: fully Python-based and using three main components:
- Fslpy (5): Python library to perform FSL calls through wrappers
- File-Tree (6): Python library to abstract out filenames and folder structure
- FSL-pipe (7): Easy (to use and scale) declarative pipelining Python library

Usage
- The pipeline should be easy to configure (with as few configuration files as possible) for datasets with configurations different from UKB (but not too different)

Registration
- Multi-modal (T1, T2-FLAIR, dMRI) volumetric registration using MMORF and multi-modal UKB-derived OMM templates (8,9)
- Surface registration using MSMAll (10)

Surface ("CIFTI" HCP) processing (note that this has now been added to the pipeline)
- fMRI projected onto the surface

fMRI
- PROFUMO as a complement/replacement for group-ICA+dual-regression (11)

Diffusion
- Better cleaning (Gibbs ringing correction, modelling Rician noise)
- Improved Eddy-current correction (within-volume motion, long-time eddy currents)
- Replace autoPtx with XTRACT (reference)
- Improved NODDI model

QC
- Fully automated (if possible), improved QC

Results:

We want to ask the neuroimaging community for their opinion on our plans and what else we can do with UKB data in BIP, starting with a set of surveys. We encourage suggestions for solidly established complementary/improved methods. We will engage experts in each area, with expert working groups to evaluate suggestions, including considering these criteria:
- Feasible: Given the amount of data to process, the tools must be easy to implement (and free to use) and not too memory/disk/processing intensive
- Testable: The method should be easily testable against similar methods with clear and measurable criteria to establish which is best for UKB (and other) data.
- Fit for framework: Once the framework has been decided, adding pieces that do not fit well with the framework would make the pipeline too complex.
- Aligned with UKB data and goals: We will consider suggestions sufficiently close to the UKB brain imaging protocol.

Conclusions:

UKB is an epidemiological study that needs a robust and fully automated pipeline to process all its data. Therefore, once BIP is considered "finalised", backwards compatibility must be maintained. Thus, if you want to have your say on this processing pipeline, the moment to do that is now!

Modeling and Analysis Methods:

Methods Development

Neuroinformatics and Data Sharing:

Databasing and Data Sharing
Workflows 1
Informatics Other 2

Keywords:

Data analysis
Design and Analysis
Informatics
MRI
Open-Source Code
Open-Source Software
Workflows
Other - UK Biobank

1|2Indicates the priority used for review

Provide references using author date format

(1) Alfaro-Almagro, F., Jenkinson, et al. 2018. Image processing and Quality Control for the first 10,000 brain imaging datasets from UK Biobank. Neuroimage, 166, pp.400-424.
(2) Duff, Eugene, Fernando Zelaya, Fidel Alfaro Almagro, et al. "Reliability of multi-site UK Biobank MRI brain phenotypes for the assessment of neuropsychiatric complications of SARS-CoV-2 infection: The COVID-CNS travelling heads study." Plos one 17, no. 9 (2022): e0273704.
​​(3) Baillie, J.K., Lone, N.I., et al, 2023. Multiorgan MRI findings after hospitalisation with COVID-19 in the UK (C-MORE): a prospective, multicentre, observational cohort study. The Lancet Respiratory Medicine.
(4) Griffanti, L., Gillis, G., et al. 2022. Adapting UK Biobank imaging for use in a routine memory clinic setting: The Oxford Brain Health Clinic. NeuroImage: Clinical, 36, p.103273.
(5) McCarthy, P., Cottaar, M., et al. (2020). fslpy (3.2.0). Zenodo. https://doi.org/10.5281/zenodo.3890969
(6) Cottaar, M. (2022). File-tree: define the content of a structured directory for visualisation or pipelines (1.0). Zenodo. https://doi.org/10.5281/zenodo.6576809
(7) Cottaar, M. (2022). FSL-pipe: declarative pipelines based on FileTrees (0.6). Zenodo. https://doi.org/10.5281/zenodo.7821244
(8) Lange, F.J., Arthofer, C., et al. 2023. MMORF-FSL's MultiMOdal Registration Framework. bioRxiv, pp.2023-09.
(9) Arthofer, C., Smith, S.M., et al. 2022. Age-dependent multimodal MRI template construction from UK Biobank, in: 28th Annual Meeting of The Organization for Human Brain Mapping.
(10) Robinson, E.C., Jbabdi, S., et al. 2014. MSM: a new flexible framework for multimodal surface matching. Neuroimage, 100, pp.414-426.
(11) Harrison, S.J., Bijsterbosch, J.D., et al. 2020. Modelling subject variability in the spatial and temporal characteristics of functional modes. NeuroImage, 222, p.117226.