ECTRIMS eLearning

Building a manual reference dataset for deep grey matter segmentation: volumetric versus anatomical bias
Author(s): ,
J Burggraaff
Affiliations:
Neurology, VU University Medical Center, Amsterdam, The Netherlands
,
J.C Prieto
Affiliations:
Department of Radiology, Center for Neurological Imaging, Brigham and Women's Hospital, Harvard Medical School, Boston, MA;Department of Psychiatry, University of North Carolina, Chapel Hill, NC, United States
,
J.P Simoes
Affiliations:
Radiology and Nuclear Medicine, VU University Medical Center, Amsterdam, The Netherlands
,
Y Liu
Affiliations:
Radiology and Nuclear Medicine, VU University Medical Center, Amsterdam, The Netherlands
,
S Ruggieri
Affiliations:
Unit of Neurology, San Camillo-Forlanini Hospital Rome;Department of Neurology and Psychiatry, University of Rome 'Sapienza', Rome, Italy
,
M Palotai
Affiliations:
Department of Radiology, Center for Neurological Imaging, Brigham and Women's Hospital, Harvard Medical School, Boston, MA
,
C Gasperini
Affiliations:
Unit of Neurology, San Camillo-Forlanini Hospital Rome
,
M.P Wattjes
Affiliations:
Radiology and Nuclear Medicine, VU University Medical Center, Amsterdam, The Netherlands
,
F Barkhof
Affiliations:
Radiology and Nuclear Medicine, VU University Medical Center, Amsterdam, The Netherlands
,
H Vrenken
Affiliations:
Radiology and Nuclear Medicine, VU University Medical Center, Amsterdam, The Netherlands;Physics and Medical Technology, VU University Medical Center, Amsterdam, The Netherlands
,
C.R.G Guttmann
Affiliations:
Department of Radiology, Center for Neurological Imaging, Brigham and Women's Hospital, Harvard Medical School, Boston, MA
MAGNIMS Study Group
MAGNIMS Study Group
Affiliations:
ECTRIMS Learn. Palotai M. 09/14/16; 145540; EP1444
Miklos Palotai
Miklos Palotai
Contributions
Abstract

Abstract: EP1444

Type: ePoster

Abstract Category: Pathology and pathogenesis of MS - Imaging

Background: Volumetric assessment of deep grey matter structures is of increasing relevance for the assessment of degenerative changes linked to MS progression. Automated segmentation methods are typically validated on manual outlines, however, defining standard operating procedures for anatomically accurate and reproducible outlining remains challenging and cumbersome to evaluate.

Objectives: To assess anatomical consistency and volumetric reproducibility of raters guided by a detailed segmentation protocol for outlining the caudate (C), putamen (P), and thalamus (T) using a new, web-based virtual laboratory.

Methods: Three independent raters traced C, P, and T on 3T T1-weighted images (multi-site) from MS patients and healthy controls following a consensus protocol with detailed instructions and illustrations to guide in the definition of anatomical boundaries. SPINE, a web-based “virtual laboratory” that enables tracing in a web-browser, as well as interactive exploratory analysis (numerical and visual inspection) was used. Intra- and inter-rater reliability was assessed using intraclass correlation [consistency-of-agreement (CA-ICC) and absolute-agreement (AA-ICC)]; Bland-and Altman analysis, spatial overlap (Dice coefficients), as well as interactive visual inspection of anatomical accuracy and consistency.

Results: Inter-rater CA-ICC (mean (95%CI)) was 0.98 (0.49-1.00) (T), 0.99 (0.91-1.00) (C) and 0.98 (0.81-1.00) (P). Inter-rater AA-ICC was 0.99 (0.81-1.00) (T), 0.76 (0.07-0.99) (C) and 0. 78 (0.09-0.99) (P).

Interestingly, the student"s, medical doctor"s, and senior Neurologist"s intra-rater bias according to Bland-Altman was 30.7%, -14.2%, and 5.7% (T), respectively; 17.1%, 14.5%, and -2.6 (P); 22.6%, 9.5%, and -1.2% (C), suggesting that intra-rater reproducibility was impacted by the degree of radiological experience. Raters were consistently more reproducible on C, followed by P, with T exhibiting the worst reproducibility.

Overlap analysis signalled good inter-rater reproducibility (Mean Dice coefficients: 0.82 (T), 0.81 (C), 0.81 (P).

Nevertheless, interactive visual inspection of pair-wise comparisons between raters revealed that outlining errors were not equally distributed around the structures of interest, and demonstrated significant anatomical inconsistencies in all three structures.

Conclusion: Reliance on ICC and overlap analysis is insufficient and potentially misleading in assessing accuracy and reproducibility of anatomical parcellation.

Disclosure: J. Burggraaff received funding for research from the ´Nauta Fonds´

J.C. Prieto has nothing to disclose

J.P. Simoes has nothing to disclose

Y. Liu was a recipient of the ECTRIMS-MAGNIMS fellowship

S. Ruggieri has nothing to disclose

M. Palotai is currently a recipient of the McDonald Fellowship from the Multiple Sclerosis International Federation

C. Gasperini received fee as speaker for Bayer-Schering Pharma, Sanofi-Aventis, Genzyme, Biogen, Teva, Novartis, Merck Serono. Received a grant for research by Teva.

M.P. Wattjes serves on the editorial boards of Neuroradiology, Journal of Neuroimaging, European Radiology, Frontiers of Neurology, and serves as a consultant for Roche, Novartis and Biogen.

F. Barkhof serves on the editorial boards of Brain, Neurology, Neuroradiology, Multiple Sclerosis Journal and Radiology, and serves as a consultant for Bayer-Schering Pharma, Sanofi-Aventis, Genzyme, Biogen, Teva, Novartis, Roche, Synthon BV and Jansen Research.

H. Vrenken has received research grants from Pfizer, MerckSerono, Novartis and Teva, and a speaker honorarium from Novartis. All funds were paid directly to his institution.

C.R.G. Guttmann has nothing to disclose that could constitute a potential conflict of interest for this work. Received a grant for research from Sanofi and from the National Multiple Sclerosis Society.

Abstract: EP1444

Type: ePoster

Abstract Category: Pathology and pathogenesis of MS - Imaging

Background: Volumetric assessment of deep grey matter structures is of increasing relevance for the assessment of degenerative changes linked to MS progression. Automated segmentation methods are typically validated on manual outlines, however, defining standard operating procedures for anatomically accurate and reproducible outlining remains challenging and cumbersome to evaluate.

Objectives: To assess anatomical consistency and volumetric reproducibility of raters guided by a detailed segmentation protocol for outlining the caudate (C), putamen (P), and thalamus (T) using a new, web-based virtual laboratory.

Methods: Three independent raters traced C, P, and T on 3T T1-weighted images (multi-site) from MS patients and healthy controls following a consensus protocol with detailed instructions and illustrations to guide in the definition of anatomical boundaries. SPINE, a web-based “virtual laboratory” that enables tracing in a web-browser, as well as interactive exploratory analysis (numerical and visual inspection) was used. Intra- and inter-rater reliability was assessed using intraclass correlation [consistency-of-agreement (CA-ICC) and absolute-agreement (AA-ICC)]; Bland-and Altman analysis, spatial overlap (Dice coefficients), as well as interactive visual inspection of anatomical accuracy and consistency.

Results: Inter-rater CA-ICC (mean (95%CI)) was 0.98 (0.49-1.00) (T), 0.99 (0.91-1.00) (C) and 0.98 (0.81-1.00) (P). Inter-rater AA-ICC was 0.99 (0.81-1.00) (T), 0.76 (0.07-0.99) (C) and 0. 78 (0.09-0.99) (P).

Interestingly, the student"s, medical doctor"s, and senior Neurologist"s intra-rater bias according to Bland-Altman was 30.7%, -14.2%, and 5.7% (T), respectively; 17.1%, 14.5%, and -2.6 (P); 22.6%, 9.5%, and -1.2% (C), suggesting that intra-rater reproducibility was impacted by the degree of radiological experience. Raters were consistently more reproducible on C, followed by P, with T exhibiting the worst reproducibility.

Overlap analysis signalled good inter-rater reproducibility (Mean Dice coefficients: 0.82 (T), 0.81 (C), 0.81 (P).

Nevertheless, interactive visual inspection of pair-wise comparisons between raters revealed that outlining errors were not equally distributed around the structures of interest, and demonstrated significant anatomical inconsistencies in all three structures.

Conclusion: Reliance on ICC and overlap analysis is insufficient and potentially misleading in assessing accuracy and reproducibility of anatomical parcellation.

Disclosure: J. Burggraaff received funding for research from the ´Nauta Fonds´

J.C. Prieto has nothing to disclose

J.P. Simoes has nothing to disclose

Y. Liu was a recipient of the ECTRIMS-MAGNIMS fellowship

S. Ruggieri has nothing to disclose

M. Palotai is currently a recipient of the McDonald Fellowship from the Multiple Sclerosis International Federation

C. Gasperini received fee as speaker for Bayer-Schering Pharma, Sanofi-Aventis, Genzyme, Biogen, Teva, Novartis, Merck Serono. Received a grant for research by Teva.

M.P. Wattjes serves on the editorial boards of Neuroradiology, Journal of Neuroimaging, European Radiology, Frontiers of Neurology, and serves as a consultant for Roche, Novartis and Biogen.

F. Barkhof serves on the editorial boards of Brain, Neurology, Neuroradiology, Multiple Sclerosis Journal and Radiology, and serves as a consultant for Bayer-Schering Pharma, Sanofi-Aventis, Genzyme, Biogen, Teva, Novartis, Roche, Synthon BV and Jansen Research.

H. Vrenken has received research grants from Pfizer, MerckSerono, Novartis and Teva, and a speaker honorarium from Novartis. All funds were paid directly to his institution.

C.R.G. Guttmann has nothing to disclose that could constitute a potential conflict of interest for this work. Received a grant for research from Sanofi and from the National Multiple Sclerosis Society.

By clicking “Accept Terms & all Cookies” or by continuing to browse, you agree to the storing of third-party cookies on your device to enhance your user experience and agree to the user terms and conditions of this learning management system (LMS).

Cookie Settings
Accept Terms & all Cookies