AIRR Data Commons#
The use of high-throughput sequencing for profiling B-cell and T-cell receptors has resulted in a rapid increase in data generation. It is timely, therefore, for the Adaptive Immune Receptor Repertoire (AIRR) community to establish a clear set of community-accepted data and metadata standards; analytical tools; and policies and practices for infrastructure to support data deposit, curation, storage, and use. Such actions are in accordance with international funder and journal policies that promote data deposition and data sharing – at a minimum, data on which scientific publications are based should be made available immediately on publication. Data deposit in publicly accessible databases ensures that published results may be validated. Such deposition also facilitates reuse of data for the generation of new hypotheses and new knowledge.
The AIRR Common Repository Working Group (CRWG) has developed a set of recommendations that promote the deposit, sharing, and use of AIRR sequence data. These recommendations were refined following community discussions at the AIRR 2016 and 2017 Community Meetings and were approved through a vote by the AIRR Community at the AIRR Community Meeting in December 2017. Updates to these recommendations have continued, with the latest set of Recommendations ratified at the AIRR Community meeting in May 2019.
In May 2020, the AIRR Community released the first verion of the AIRR Data Commons Application Programming Interface (ADC API), a specification for programmatic access to query and download AIRR-seq data from repositories that adhere to the AIRR Standards. We define the AIRR Data Commons as consisting of the set of repositories that:
adhere to the CRWG recommendations for promoting, sharing, and use of AIRR-seq data, and
that implement the ADC API as a programmatic mechanism to access that data.
This page provides a central location for the community to discover resources that belong to the AIRR Data Commons.
AIRR Data Commons Repositories#
The repositories that are part of the ADC are listed on the AIRR Community ADC Registry github site. In order to find data across all of the repositories in the ADC, it is necessary to query all of these repositories.
Querying the AIRR Data Commons#
Each of the repositories above can be queried directly using the ADC API. In addition, the following tools and platforms implement web based user interfaces that use the ADC API to query repositories in the AIRR Data Commons:
There are query and analysis use cases and a set of example queries available for the AIRR Data Commons and the ADC API.