Labels for the pretraining dataset

In Section B (Table 11) of the paper, the pretraining dataset is described as multi-species. However, the version available for download appears to contain only the raw DNA sequences, without any labels indicating the species of origin for each sequence.
Is there a way to obtain the species labels for the pretraining sequences, or could a mapping between sequences and their source species be released? 

Labels for the pretraining dataset #153
