General Guide on ENA Data Retrieval

Welcome to the general guide for the European Nucleotide Archive data discovery and retrieval. Please take some time to view this introduction and explore all the options available for our data retrieval services.

Viewing and Exploring ENA Records

The table below summarises the domains of data held within ENA and example records that are archived within each domain and displayed within the ENA Browser. Please see our How To guide on exploring an ENA project for an example of how to navigate through an ENA Project in the browser:

How to Explore an ENA Project

Data Domain	Description	Example Records
Projects/Studies	Contains information on a biological research project. This holds all the data generated as part of this research	PRJEB1787 (ERP001736)
Samples	Represents biological samples collected and sequenced in real life	SAMEA2620084 (ERS488919)
Reads (Runs/Experiments)	Hold raw read files and sequencing methods	ERR1701760 ERX1772048
Analyses	Hold results files of analyses performed on sequencing data and analysis methods	ERZ1195979
Contig set	Hold contig sets generated as part of a genome or transcriptome assembly.	CABHOY010000000.1
Assemblies	Represents an entire genome assembly and holds any contig sets or sequence records generated as part of the assembly	GCA_000001405.28
Assembled/Annotated Sequences (*)	Any sequence records from coding or non-coding regions to full assembled chromosomes	CM000667.2
Taxon	The sequenced organism or metagenome of a sample	Taxon:9606
Sample Checklist	The checklist of metadata that the sample was registered with	ERC000013

* Assembled and annotated sequence records fall into different data classes. Read more about the different classes of sequences here.

Search and Retrieval

You can search across the ENA browser in a number of ways:

The advanced search in the browser provides a simple interface for building more complex search queries that can be saved and run again with Rulespace. See our step by step guide on how to use the advanced search for examples on how to build queries and how to use Rulespace:

How To Perform An Advanced Search

The ENA Browser also provides different means to download data from the archive whether its XML ENA records, a tabulated summary of metadata resulting from a search or sequencing data files submitted as part of a research project. See our guide on file download for details on how to use our data retrieval services to download data from the archive:

How to Download Data Files

Programmatic Access

When working with a large number of records or when developing an automated pipeline, it can be preferable to explore and interact with the programmatic services that ENA has to offer.

Once you are familiar with how ENA records are linked and what data are available associated with each record, please explore our more advanced guides foraccessing data from the archive programmatically:

How to Access ENA Programmatically