Unique identifiers and lookup service for sequence collections.
Seqcol, or Sequence Collections, is a GA4GH-sponsored community effort to standardize unique identifiers for collections of sequences. Seqcol identifiers can be used to identify genomes, transcriptomes, or proteomes -- anything that can be represented as a collection of sequences. The seqcol protocol provides:
Read the complete specification
- implementations of an algorithm for computing sequence identifiers;
- a lookup service to retrieve sequences given a seqcol identifier
- programmatic approach to assessing compatibility among sequence collections.
Uniquely identify the sequences you use with persisent identifiers
Use Seqcol identifiers to embed persistent information in your tools about what genome was used in an analysis.