Eckher
EckherInsightsContact
Eckher
Your guide to what's next.
Let's explore the Nobel Prize dataset
Aug 15, 2020

Let's explore the Nobel Prize dataset

An overview of the official Nobel Prize Linked Data dataset with some example SPARQL queries.

Since 1901, the Nobel Prizes and the Prizes in Economic Sciences have been awarded 597 times. 950 people and organisations have received the award in the following categories: Physics, Chemistry, Physiology or Medicine, Literature, Peace, and Economic Sciences.

The official Nobel Prize Linked Data dataset is an authoritative source of information about Nobel Prizes and laureates. Importantly, the Nobel Prizes are often shared between multiple people, and the same person or organisation can receive multiple Nobel Prizes. RDF works really well for representing such relationships.

The RDF vocabulary for expressing Nobel Prizes as Linked Data

The Nobel Prize dataset both reuses classes and properties from existing vocabularies and utilises some custom classes and properties that are defined in the () namespace. For example, the class represents a person or organization that receives a Nobel Prize and is a subclass of :

The same data in JSON-LD:

The full ontology can be downloaded here.

The Nobel Prize and laureate data

In the Nobel Prize Linked Data dataset, the URIs of all instances begin with . For example, the fact that Wilhelm Röntgen was awarded the 1901 Nobel Prize in Physics is represented as follows:

The same data in JSON-LD:

The Nobel Prize data is available via the D2R Server running at http://data.nobelprize.org/. A SNORQL server is provided at http://data.nobelprize.org/snorql/ so that we can issue SPARQL queries directly in the browser.

Querying the data

You can run some interesting queries directly in NobelPrize.org's SNORQL browser. Some example queries are described below.

All Nobel Laureates who were born in New Zealand

This query will list all Nobel Laureates who were born in New Zealand:

Try this query in NobelPrize.org's SNORQL

Query result:

All Nobel Laureates who were born in New Zealand in Chinese

The Nobel Prize dataset not only uses existing established vocabularies but is also linked to other RDF datasets through . For example, Nobel Prize Linked Data contains the following statement about Wilhelm Röntgen:

Using SPARQL federation we can simultaneously query all other linked data sources. This makes it possible to issue a wide range of queries that go beyond the Nobel Prize dataset.

The Nobel Prize dataset only specifies the names of the laureates in English, but since it has links to DBPedia for each laureate, we can get get the names of all Nobel Laureates who were born in New Zealand in Chinese using a single query:

Try this query in NobelPrize.org's SNORQL

Query result:

People and organisations that received more than one Nobel Prize

To find out which Nobel laureates were honoured with the award multiple times, this query can be used:

Try this query in NobelPrize.org's SNORQL

Query result:

Youngest Nobel laureates

The youngest recipients of the award in all categories can be retrieved using the following query:

Try this query in NobelPrize.org's SNORQL

Query result:

Resources

See also
RDF* and the onset of Linked Data* and the Semantic Web*
The evolution of RDF and the related technologies fuelled by the need to make statements about statements.
A network of drugs: The New Zealand Medicines Terminology
An overview of New Zealand's drug vocabulary.
Linked data for the enterprise: Focus on Bayer's corporate asset register
An overview of COLID, the data asset management platform built using semantic technologies.
A beginner's guide to graph embeddings
Understanding what graph embeddings are and why they are important for graph analytics.
Document understanding: Modern techniques and real-world applications
How document understanding helps bring order to unstructured data.
Navigating unstructured data: The rise of question answering
Question answering technologies are key to efficiently dealing with overwhelming amounts of unstructured data.
Why federation is a game-changing feature of SPARQL
SPARQL federation is an incredibly useful feature for querying distributed RDF graphs.