The Biological Magnetic Resonance Bank is developing a REST API to use for querying against the substantial BMRB databases. Most data in the macromolecule and metabolomics databases is accessible through the API.
While the API is free to use and doesn't require an API key, if you submit a large number of queries you may be rate limited.
The URL of the API is api.bmrb.io. If you navigate there you will see links to all active versions of the API.
To see an interactive Jupyter notebook that demonstrates using the BMRB and PDB APIs, please click .
New API releases will be available at a unique URL so that you can continue to rely on a given API version as we continue to improve and develop the API. We intend to keep each released version of the API live as long as is feasible.
Releases will be named with a major and minor version and optionally a revision number. For example, the current release version is v2.0. This means that it is the 2nd API version and there have been zero minor releases.
Major release version incrementing means that the API may have changed in a way that breaks existing queries. Therefore you should write your applications to use a specific major version and only change the URL you query once you are sure your application is up to date with any changes.
Minor release version incrementing means that new features may have been released but the API should still be perfectly backwards compatible. As a result our release URLs will not include the minor release version.
Revision numbers incrementing means that bugs have been patched but no other substantial changes to the API have been made.
From the root of the API server, first add the version you want to query. For example, api.bmrb.io/v2/ for the release version v2, or api.bmrb.io/current/ to ensure your query goes to the current API version, whatever that is at the time. It is suggested that you use the /current/ version for development and a fixed version for software releases. This will ensure that your deployed applications do not break if a new API version is released.
HTTPS is available, though due to the overhead in establishing a TLS session, slightly slower. It is only recommended if you are uploading private data to the server or calling methods on previously uploaded data.
Certain queries return results on an entry, saveframe, or loop in JSON format. To see how we convert our NMR-STAR entries, saveframes, and loops into JSON format please see the reference here.
We have a rate limit enforced in order to guarantee a responsive API server. It is unlikely that you will encounter the limit, but if you do you will receive a HTTP 403 error as a response to all requests. Please ensure to check for this error in your applications and wait before sending further queries.
If you are blacklisted simply wait at least 10 seconds before sending ANY queries and you will be removed from the blacklist.
Limits:
- Up to 50 queries per second per IP.
We reserve the right to increase or decrease these limits in the future without warning.
If you need to perform a lot of queries and the rate limit is a problem for you please contact us at [email protected] to get an exception.
If using the API in an application you distribute to others, please include the HTTP header 'Application' whose value is the name of your application, a space, and then the version number of your application. This allows us to track API usage more accurately and determine when we can end-of-life old API versions. Some examples:
Python:
import requests
requests.get("http://api.bmrb.io/v2/status", headers={"Application":"My Application"})
Curl:
curl "http://api.bmrb.io/v2/status" -H 'Application: Curl Script'
All queries return results in JSON format by default, and optionally also in text format, or other formats pertinent to the query. See the documentation for each method to determine which other formats are available.
The BMRB API has 3 databases. They are:
macromolecules
- The standard BMRB database. Contains macromolecules.metabolomics
- The metabolomics database.chemcomps
- The chemical compounds used by the PDB in NMR-STAR format. Note that these entries only have chemical component and entity sections.
Note that not all databases contain the same tables. In general, a search in a table that a given database doesn't contain will not produce an error; instead it will return that no results were found.
Queries use the macromolecules
database by default.
Query types that work on an entry-basis do not need to specify a database as all databases are searched for those query types.
/status
Returns the current status of the databases and server. This includes the number of entries in each database, the number of chemical shifts in each database, and the last time each database was updated. The available methods are also returned, as well as the version number of the API.
/list_entries[?database=$database]
Returns a list of all entries.
Example: List all entries
Example: List macromolecule entries
Example: List metabolomics entries
Example: List chemcomp entries
/entry/
When you access this URI you must also provide a NMR-STAR entry in text or JSON format as the body of the request. The entry will be parsed and stored in the database. You can then use all of the entry-based queries below on your saved entry. The response to this request will include two keys:
entry_id
: The unique key assigned to your submission. You can then use this key as theentry_id
for the queries below.expiration
: The unix time that the entry will be removed from the database. This is set as a week after upload. Uploading the same exact file will reset the expiration to a week from the present time. The expiration date will also be reset to a week from the present time each you make a request to the API that uses the providedentry_id
. (For example, fetching the entry with "/entry/entry_id
" will reset the expiration.)
Caution - Data you upload to the server is publicly accessible to anyone with
access to the assigned entry_id
you are provided. Therefore you should not
share this key with anyone who you do not intend to share the data with.
/entry/$entry_id[?format=$entry_format]
By default returns the given BMRB entry in JSON format. If
entry_format
is specified then return in that format instead.
The formats available are:
json
- The default format. Returns the entry in JSON format. Examplerawnmrstar
- The entry is returned in pure NMR-STAR format. There is no wrapping JSON. If you need to fetch a large number of NMR-STAR entries in text form you may be better served getting them from the FTP site. Examplezlib
- A zlib-compressed JSON representation of the entry is returned. This is how the entries are stored in the API database and is therefore the absolute fastest way to retrieve an entry. This format mainly exists to enable the PyNMR-STAR python library to fetch entries as quickly as possible and it is not expected you will benefit from the moderately faster entry access considering the additional code complexity required. Example
/entry/$entry_id?saveframe_category=$saveframe_category[&format=$entry_format]
Returns all saveframes of the given category for an entry in
JSON format by default. If entry_format
is specified then
return in that format instead.
Only json
and nmrstar
are currently allowed for entry_format
. These
formats are the same as described in the entry method.
You may provide the URL parameter saveframe_category=saveframe_category
multiple times
to retrieve multiple saveframes.
Example: Querying for the entry information saveframe
Example: Querying for the entry information and citation saveframes
/entry/$entry_id?saveframe_name=$saveframe_name[&format=$entry_format]
Returns the saveframe with the given name from an entry in
JSON format by default. If entry_format
is specified then
return in that format instead.
Only json
and nmrstar
are currently allowed for entry_format
. These
formats are the same as described in the entry method.
You may provide the URL parameter saveframe_name=saveframe_name
multiple times
to retrieve multiple saveframes.
Example: Querying for the entry information saveframe by name
Example: Querying for the saveframes with the name entry_information and citation_1
/entry/$entry_id?loop=$loop_category[&format=$entry_format]
Returns all loops of a given category for a given entry in
JSON format by default. If entry_format
is specified then
return in that format instead.
Only json
and nmrstar
are currently allowed for entry_format
. These
formats are the same as described in the entry method.
You may provide the URL parameter loop=loop_category
multiple times
to retrieve multiple saveframes.
Example: Query of the entry author loop
Example: Query of the entry author loop and the sample component loop
/entry/$entry_id?tag=$tag
Returns tags of the specified type(s) for a given entry.
Example: Fetching the entry title
Example: Fetching the entry title and citation title
/entry/$entry_id/citation
Returns the citation information for the entry. Citation information
is available in three formats. The default format is bibtex. To use one
of the other formats, specify one of the following values for the format
tag:
json-ld
- Schema.org JSON-LD format.bibtex
- BibTeX format.text
- Text citation format.
Examples:
/entry/$entry_id/experiments
Returns information about the NMR experiments for an entry. The information returned:
- Data files available and their location
- Sample component information
- Sample experimental condition information
- NMR spectrometer information
Example: Experiments for entry bmse000001
/enumerations/$tag_name[?term=$search_term]
Returns a list of values suggested for the tag in the values
key if there are
saved enumerations for the tag. In the type
key one of the following values will
appear:
common
- The tag values returned are common values but are not the only legal values for the tag.enumerations
- The tag values returned are the only legal values for the tag.null
- There are no saved enumerations for the tag.
Example: List of common NMR-STAR versions
You can narrow the results to those starting with the value you provide in the
term
parameter in the query string. This will return the results in a form
that can be used by JQuery's auto-complete.
Example: List of common NMR-STAR versions starting with 2
/instant?term=$search_term[&database=$database]
This URL powers the BMRB instant search tool. It queries all macromolecule and metabolomics entries based on a variety of commonly searched fields. It does exact searches on certain fields and fuzzy-matches on others depending on what is most appropriate for the field (for example, database matches must be exact but InChI matches may be similar). It returns matches sorted by what results it thinks are the most relevant. It should always begin sending results within 1 second to allow you to use it in interactive applications. A non-exhaustive list of the search fields:
- Title
- Citation Title
- Authors
- Entry ID
- Related Database Codes
- Organism name (common and scientific)
- InChI string and SMILES string (for metabolomics entries)
- Formula (for metabolomics entries)
- Sequence (for macromolecule entries)
- Citation DOI
- Additional data available (e.g. residual dipolar couplings)
You can use this endpoint to do a "general search" against the entire BMBR archive. Example link for "john markley mouse". It will return results that can be used by JQuery auto-complete with some additional fields provided. This means it returns a list of dictionaries, each of which corresponds to one matching entry. Entries are only listed once even if multiple fields matches. The entry dictionaries will always contain the following keys:
sub_date
- The date of submission of the entry. YYY-MM-DD format.value
- The unique BMRB entry ID. Could be a macromolecule or metabolomics ID. (e.g. 15000 or bmse000001)label
- The title of the entry.citation
- A list of citation titles for the entry.link
- A relative link (relative to the BMRB home page) pointing to the URL this entry can be accessed at. Always use this value (appended to https://bmrb.io) rather than constructing the URL yourself since some results returned link to non-entry summary pages. (For example, on-hold entries appear in the results if searched by ID. Their link points to the "on hold entries" page at BMRB.)authors
- The list of authors for this entry.
If the search matched one of the "additional" search fields (any field other than
ID, Title, Author and Citation) the key extra
will also exist and point to another
dictionary. That dictionary contains the following two keys:
termname
- The name of the field that the search matched. (e.g. organism scientific name)term
- The value of the matching field value from the BMRB entry.
Furthermore, if you perform a query against only the metabolomics database, the following values will also be returned:
average_mass
: The average mass of the compound in daltons. (Natural isotopic composition.)formula
: The chemical formula.inchi
: The InChI string for the compound.monoisotopic_mass
: Mono-isotopic formula mass calculated using the most abundant naturally occurring isotope for each atom.smiles
: The canonical SMILES string for the compound.
/search/chemical_shifts[?database=$database][...]
Returns all of the chemical shifts in the BMRB for the specified atom type. You
can omit the atom type to fetch all chemical shifts and you can use *
as a
wild card character. Optionally specify macromolecule
or metabolomics
for the
database argument to search a specific database. macromolecule
is the default.
In addition, the following parameters can be provided using the standard notation to limit the set of results to those that match the search parameters. All provided search parameters are combined with a logical AND.
atom_type
The value for this tag is a standard IUPAC abbreviation for an element (e.g. H,C,P).atom_id
The atom name (e.g. HB2, CB). You may specify this parameter multiple times and results that match any of the specified atom ids will be returned. You may use*
as a wildcard to match 0 or more characters. (e.g. HB* to get all HBx hydrogens. Don't use H* or C* - instead use atom_type=H or atom_type=C as they will be much faster.)comp_id
The residue as a 3-letter code. You may specify this parameter multiple times and results that match any of the specified residues will be returned.shift
A specific chemical shift to search for. Uses a default threshold of .03. You may specify this parameter multiple times and results that match any of the specified chemical shifts will be returned.threshold
Only has meaning in conjunction withshift
. Specifies the search threshold for a shift. Inclusive.ph
Specify one or more times to limit your search to chemical shifts measured at the specific ph(s).ph_threshold
Specify the threshold for the pH search. Default is 0 - an exact match.temperature
Specify one or more times to limit your search to chemical shifts measured at the specific temperature(s).temperature_threshold
Specify the threshold for the temperature search. Default is 0 - an exact match.database
The database to search. Macromolecules or metabolomics.dictionary_result
Set totrue
to get the results as a list of key->value mappings (a list of dictionaries) rather than as a list of keys and an array of values. This representation uses significantly more data, and will as a result be slower to fetch, but may be slightly easier to work with.conditions
Removed. Both the pH and the temperature are now included with the results and this parameter is no longer needed.
Examples:
- All CA chemical shifts
- All HB* chemical shifts
- All C1 chemical shifts from metabolomics database
- All asparagine C chemical shifts within .01 of 175.1 ppm
- All shifts within .03 of 103 or 130 in residue PHE or TRP
/serach/fasta/$sequence[?type=rna|dna|polymer][&e_val=$expectation_val]
Returns a list of FASTA matches from the BMRB database for the given query string.
Parameters:
type
- Leave blank for polymer. Optionally specify asdna
orrna
for DNA or RNA searches.e_val
- Expectation value. Optionally specify to set the FASTA expectation value.
/search/multiple_shift_search?shift=x.x[&shift=x.x][...][&database=$database]
Returns all entries that contain at least one of the queried shifts, as well as the list of shifts that matched. Results returned as a list of matching entries along with the matching shifts, solvent(s) in which the shifts were observed, number of shifts matched, and total offset of shifts, sorted by number of shifts matched and total offset.
The titles and links to the matched entries are also returned. Note that for a large number of peaks, you should use 's' rather than 'shift' in order to save free up extra characters in the URL.
Parameters:
shift
ors
Specify once for each shift you intend to query against.database
Which database to query. Metabolomics by default.cthresh
The threshold to use when matching carbon atoms. Default: .2 (ppm)nthresh
The threshold to use when matching nitrogen atoms. Default: .2 (ppm)hthresh
The threshold to use when matching protons. Default: .01 (ppm)solvent
Filter results to only include ones observed in the specified solvent.
Example: Search for peaks 2.075, 3.11, and 39.31
/search/get_id_by_tag_value/$tag_name/$tag_value[?database=$database]
Returns a list of BMRB entry IDs which contain the specified tag_value
for the
value of at least one instance of tag tag_name
. The search is done
case-insensitively. You may optionally specify a database if you want
to query the metabolomics or chemcomp database rather than the macromolecule one.
Example: All entries which used solid-state NMR
Note that you need the proper tag capitalization for this method. Use the dictionary for reference.
/search/get_all_values_for_tag/$tag_name[?database=$database]
Returns a dictionary for the specified dictionary where the keys are entry IDs and the values are lists of all of the values of the given tag in each entry. This allows you to get all of the values of a given tag in the BMRB archive for a given database.
Example: The citation titles for all entries in the macromolecule database
Example: The compound names for all compounds in the metabolomics database
Note that you need the proper tag capitalization for this method. Use the dictionary for reference.
/search/get_pdb_ids_from_bmrb_id/$bmrb_id
Returns a list of dictionaries, each containing three keys corresponding to PDB IDs associated with the specified BMRB ID, the association between the two, and any notes on the relationship if present.
The keys are pdb_id
, match_type
, and comment
.
The pdb_id
field will contain the PDB ID of the match.
The following match types are possible for match_type
:
Exact
- The entry is an exact match as tracked by the BMRB entry tracking system. There is a one-to-one correspondence between this queried entry and the provided PDB ID.BLAST Match
- The entry was found during a routine BLAST search. It is similar to the queried entry in sequence but no other correlation is implied.Author Provided
- If an author provided a "related entry" or "related assembly" during deposition it will appear here.
The comment
field will only be present if it has a non-null value. It will contain any recorded
notes on how the specific PDB ID is related to the queried BMRB ID if present.
Example: PDB IDs associated with BMRB ID 15000
/search/get_bmrb_ids_from_pdb_id/$pdb_id
Returns a list of dictionaries, each containing three keys corresponding to BMRB IDs associated with the specified PDB ID, the association between the two, and any notes on the relationship if present.
The keys are bmrb_id
, match_type
, and comment
.
The bmrb_id
field will contain the BMRB ID of the match.
The following match types are possible for match_type
:
Exact
- The entry is an exact match as tracked by the BMRB entry tracking system. There is a one-to-one correspondence between this queried entry and the provided BMRB ID.BLAST Match
- The entry was found during a routine BLAST search. It is similar to the queried entry in sequence but no other correlation is implied.Author Provided
- If an author provided a "related entry" or "related assembly" during deposition it will appear here.
The comment
field will usually be null
, but if not, it will contain any recorded
notes on how the specific BMRB ID is related to the queried PDB ID.
Example: BMRB IDs associated with PDB ID 2JM0
/mappings/bmrb/pdb[?format=$format][&match_type=$match_type]
/mappings/pdb/bmrb[?format=$format][&match_type=$match_type]
Returns a mapping of BMRB ID
<->PDB ID
.
Parameters:
format
- The format to return results in. Default isjson
buttext
is also supported.match_type
The type of match to use when generating the list. Allowed values:all
- PDB links from any of the other sources.exact
* - The entry is an exact match as tracked by the BMRB entry tracking system. There is a one-to-one correspondence between the entry and the provided BMRB ID. This is the default.author
- Entries supplied by the author as "related entries" during deposition.blast
- The entry was found during a routine BLAST search. It is similar to the queried entry in sequence but no other correlation is implied.
Examples:
Returns a mapping of BMRB ID
<->UniProt ID
.
/mappings/bmrb/uniprot[?format=$format][&match_type=$match_type]
/mappings/uniprot/bmrb[?format=$format][&match_type=$match_type]
Returns a mapping of BMRB ID
<->UniProt ID
or PDB ID
.
Parameters:
format
- The format to return results in. Default isjson
buttext
is also supported.match_type
The type of match to use when generating the list. Allowed values:all
* - UniProt links from any of the other sources. This is the default.author
- Entries supplied by the author as "related entries" during deposition. Returns matches for both entities and assembles with a database link.blast
- The entry was found linked with a routine BLAST search. It is similar to the queried entry in sequence but no other correlation is implied.pdb
- UniProt IDs derived via the matching PDB record for an entry. This is the most reliable match type.
Examples:
/software/
Returns a summary of all software packages used in BMRB entries.
Example: All software packages used
/entry/$entry_id/software
Returns a list of all software packages used by a given entry. Each item in the list of software will be a list with the following four values in order:
software_name
The software package name.software_version
The version of the software package used.software_task
The purpose of the software in this investigation.softare_vender
The software vendor.
/software/package/$software_package/[?database=$database]
Returns a list of all entries used by the specified software package. The search
is done case-insensitive and does not require perfect matches. For example,
SPARK
would match SPARKY
and NMRFAM_SPARY
.
You may optionally specify which database to use.
Example: Entries using SPARKY
/molprobity/$pdb_id/oneline
Returns the full one-line MolProbity results for the given PDB ID.
Example: PDB 2DOG
/molprobity/$pdb_id/residue[?r=$residue][&r=$residue][...]
Returns the full MolProbity residue results for the given PDB ID. You may optionally specify a list of residues to only get results for those residues.
Parameters:
r
Specify the residue to query. May be specified multiple times to get the results for multiple residues.
Example: PDB 2DOG residues 10-13
/protein/uniprot[/$uniprot_id]
Provides detailed information on linked BMRB IDs for the provided UniProt ID, or for all UniProt IDs. Information available in either json or hupo-psi-json format.
Parameters:
format
Specify the format for the returned results. Allowed options:json
orhupo-psi-id
.json
is the default.
Examples: