Datasets for benchmarking antimicrobial resistance genes in bacterial metagenomic and whole genome sequencing

Amogelang R. Raphenya, James Robertson, Casper Jamin, Leonardo de Oliveira Martins, Finlay Maguire, Andrew G. McArthur, John P. Hays*

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

6 Citations (Scopus)
22 Downloads (Pure)


Whole genome sequencing (WGS) is a key tool in identifying and characterising disease-associated bacteria across clinical, agricultural, and environmental contexts. One increasingly common use of genomic and metagenomic sequencing is in identifying the type and range of antimicrobial resistance (AMR) genes present in bacterial isolates in order to make predictions regarding their AMR phenotype. However, there are a large number of alternative bioinformatics software and pipelines available, which can lead to dissimilar results. It is, therefore, vital that researchers carefully evaluate their genomic and metagenomic AMR analysis methods using a common dataset. To this end, as part of the Microbial Bioinformatics Hackathon and Workshop 2021, a ‘gold standard’ reference genomic and simulated metagenomic dataset was generated containing raw sequence reads mapped against their corresponding reference genome from a range of 174 potentially pathogenic bacteria. These datasets and their accompanying metadata are freely available for use in benchmarking studies of bacteria and their antimicrobial resistance genes and will help improve tool development for the identification of AMR genes in complex samples.

Original languageEnglish
Article number341
JournalScientific data
Issue number1
Publication statusPublished - 15 Jun 2022

Bibliographical note

This work was made possible and supported by a collaboration between the Public Health Alliance for Genomic Epidemiology (PHA4GE -, the Joint Programming Initiative on Antimicrobial Resistance (JPIAMR - and the MRC Cloud Infrastructure for Microbial Bioinformatics (MRC CLIMB-BD - We would also like to thank Boas van der Putten (University of Amsterdam) for initial contributions to the work performed in this publication.

Publisher Copyright: © 2022, The Author(s).


Dive into the research topics of 'Datasets for benchmarking antimicrobial resistance genes in bacterial metagenomic and whole genome sequencing'. Together they form a unique fingerprint.

Cite this