Repository containing lists of all published ancient metagenomic (and related) samples and libraries
AncientMetagenomeDir is a community curated resource of lists of all published shotgun-sequenced ancient metagenome or microbial genome-level enriched samples and libraries.
It is primarily meant to act as a reference guide to help point researchers toward any relevant public data for comparative analysis. It is hoped it will help researchers track growth and development of the field of ancient metagenomics over time.
These lists contain information of ancient metagenomic samples such as publication, sample name, sample type, geographic location, dates and DOIs and other accession numbers to raw data in establised sequencing data databases.
We define here 'metagenome' in a broad sense, primarily focusing any data where the whole DNA content is analysed and explored. Examples for this are (but not limited to) ancient microbiomes (host associated metagenome), ancient sedimentary DNA (environmental) and also samples used for ancient pathogen screening (single genomes).
The lists also contain metadata about all sequencing libraries generated from the corresponding samples.
See the README of each list for more detailed descriptions of the contents of each list. Further modification and expansion of these categories can be proposed to the SPAAM community for further consideration.
The purpose of this repository is not meant to be a comprehensive set of all metadata about every ancient metagenome, but the aim is to have a lightweight set of minimal information but with sufficient links to original sources of detailed information. Keeping the repository simple and lightweight will promote continual updating of the lists over a long period of time. That said, we will design lists as far as possible to follow larger database metadata specifications e.g. MIxS for compatibility.
Finally, we aim to make releases of this at regular intervals, and assign each release a citeable DOI. Each release will also include a changelog to record all additions and any corrections or modifications. Therefore researchers are free to use the information in their own publications, as per the CC-BY 4.0 licence.
ℹ️ A table for ancient anthropogenic metagenomes was included up until release v21.09 (Taputapuātea). However this was dropped in release "Pyu Cities" due to lack of interest from the community. It may be reinstated in the future.
To download the latest stable release of an AncientMetagenomeDir sample list:
.zip
button.tsv
file in a spreadsheet program (e.g. LibreOffice Calc, Microsoft Excel) or your programming language of choice (e.g. R)To download the bleeding edge AncientMetagenomeDir sample list:
.tsv
..tsv
file in a spreadsheet program (e.g. LibreOffice Calc, Microsoft Excel) or your programming language of choice (e.g. R).⚠️ these TSV files in the repository represent the cutting-edge version, and are subject to change, please use the files in the 'releases' section or Zenodo archive for when using for reproducible science!
✅ For some spreadsheet software copy>paste of the raw format directly into a spreadsheet of the _contents* of the links above can work, if not, follow instructions above.
📖 For general use of the project, please use the publication citation below.
For a specific release citation (e.g. the version used for analysis), go to Releases and use the DOI in the badge of the given release's description, or check the corresponding Zenodo entry
Alongside the Zenodo ID of the release version, please also cite:
Fellows Yates, J. A., Andrades Valtueña, A., Vågene, Å. J., Cribdon, B., Velsko, I. M., Borry, M., Bravo-Lopez, M. J., Fernandez-Guerra, A., Green, E. J., Ramachandran, S. L., Heintzman, P. D., Spyrou, M. A., Hübner, A., Gancz, A. S., Hider, J., Allshouse, A. F., Zaro, V., & Warinner, C. (2021). Community-curated and standardised metadata of published ancient metagenomic samples with AncientMetagenomeDir. Scientific Data, 8(1), 31. DOI: 10.1038/s41597-021-00816-y
⚠️ You must also cite each paper you use data from, citing AncientMetagenomeDir is not sufficient on it's own!
ℹ️ The AncientMetagenomeDir community curates this data on a voluntary basis, and therefore provides no warranty or completeness or accuracy of the data. If in doubt, refer to the original publication of a given sample. Corrections are accepted, and thus later versions may have different values.
Vector (.svg
) versions of all images and interactive map (.geojson
) are available under assets/images/figures
. R code for all plots are freely available for modification in this repository under assets/analysis/
(Viewable on the AncientMetagenomeDir Website or here)
Legend: Green: Environmental Metagenome, Light Blue: Host Associated Metagenome, Pink: Host Associated Single Genome
If you are not familiar with Git(Hub), quick start step-by-step guides can be found in the following (note these may be out of date):
Samples must be from published studies, and should only include sample level information. Preprints often have not made data avaliable at this point, and we make the assumption that peer-review has correctly evaluated any submitted metagenome is valid. Included papers should generally have uploaded their own data, however in certain cases may see exceptions (such as retrieval of bacterial genomes from off-target reads from human population studies). If unsure, please consult the AncientMetagenomeDir community on slack.
Samples must also have been accessible on public databases (e.g. EBI ENA or NCBI SRA).
Important: we do not currently record sequencing data information, due to different uploading strategies by different groups.
Contributing a new publication's sample information consists of the following:
assets/enums
yourselfAll contributions are attributed through Git's in-built recording of the author of each commit and pull request. These will be displayed publically in the future.
In addition to researchers adding their own publications, we will also maintain lists of published but not-yet-added studies using github issues.
Therefore researchers who wish to contribute who do not yet have their own publications, can 'assign' themselves to specific 'issues' (papers), and reduce the risk of duplicates.
Contribution of these publications follow as above.
To ensure efficient and consistent searching, we include guidelines on how to record and format information. Adherence to these guidelines is enforced by 'pull request' review by members of the SPAAM community and other contributors. Furthermore, each category of metagenome will have their own 'special' fields, which will be defined in each specific context.