Overview of retrospective data harmonisation in the MINDMAP project: Process and results

Tina W. Wey*, Dany Doiron, Rita Wissa, Guillaume Fabre, Irina Motoc, J. Mark Noordzij, Milagros Ruiz, Erik Timmermans, Frank J. Van Lenthe, Martin Bobak, Basile Chaix, Steinar Krokstad, Parminder Raina, Erik Reidar Sund, Marielle A. Beenackers, Isabel Fortier

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

5 Citations (Scopus)
4 Downloads (Pure)


Background The MINDMAP project implemented a multinational data infrastructure to investigate the direct and interactive effects of urban environments and individual determinants of mental well-being and cognitive function in ageing populations. Using a rigorous process involving multiple teams of experts, longitudinal data from six cohort studies were harmonised to serve MINDMAP objectives. This article documents the retrospective data harmonisation process achieved based on the Maelstrom Research approach and provides a descriptive analysis of the harmonised data generated. Methods A list of core variables (the DataSchema) to be generated across cohorts was first defined, and the potential for cohort-specific data sets to generate the DataSchema variables was assessed. Where relevant, algorithms were developed to process cohort-specific data into DataSchema format, and information to be provided to data users was documented. Procedures and harmonisation decisions were thoroughly documented. Results The MINDMAP DataSchema (v2.0, April 2020) comprised a total of 2841 variables (993 on individual determinants and outcomes, 1848 on environmental exposures) distributed across up to seven data collection events. The harmonised data set included 220 621 participants from six cohorts (10 subpopulations). Harmonisation potential, participant distributions and missing values varied across data sets and variable domains. Conclusion The MINDMAP project implemented a collaborative and transparent process to generate a rich integrated data set for research in ageing, mental well-being and the urban environment. The harmonised data set supports a range of research activities and will continue to be updated to serve ongoing and future MINDMAP research needs.

Original languageEnglish
Pages (from-to)433-441
Number of pages9
JournalJournal of Epidemiology and Community Health
Issue number5
Publication statusPublished - 1 May 2021

Bibliographical note

Funding Information:
Funding MINDMAP is supported by the European Commission HORIZON 2020 Programme under grant agreement #667661. The article does not reflect the Commission’s views and in no way anticipates the Commission’s future policy in this area. MAB was funded by a Netherlands Organization for Scientific Research (NWO) VENI grant on ‘DenCityHealth: How to keep growing urban populations healthy?’ (grant number 09150161810158). Funding for the Canadian Longitudinal Study on Aging (CLSA) is provided by the Government of Canada through the Canadian Institutes of Health Research (CIHR) under grant reference: LSA 94473 and the Canada Foundation for Innovation. The opinions expressed in this manuscript are the authors’ own and do not reflect the views of the Canadian Longitudinal Study on Aging. PR holds the Raymond and Margaret Labarge Chair in Optimal Aging and Knowledge Application for Optimal Aging, is the Director of the McMaster Institute for Research on Aging and the Labarge Centre for Mobility in Aging, and holds a Tier 1 Canada Research Chair in Geroscience.

Publisher Copyright:


Dive into the research topics of 'Overview of retrospective data harmonisation in the MINDMAP project: Process and results'. Together they form a unique fingerprint.

Cite this