Statistics

This page gives an overview of the information about works, people and organizations made available via DataCite Commons.
The is not a live dashboard, but updated on a regular basis. Please reach out to DataCite Support for questions or comments.

Data Sources

Last updated as of September 20, 2023

The following main data sources are used in DataCite Commons for a total of currently 1,106,541 records:

DataCite

316,630 Works
100% of identifiers and metadata.

Crossref

593,190 Works
0.40% of identifiers and metadata. Import is ongoing.

ORCID

91,500 People
100% of identifiers. Personal and employment metadata.

ROR

105,221 Organizations
100% of identifiers and metadata.
Additional information comes from these data sources:
  • Wikidata: inception year, geolocation and Twitter account for organizations
  • Unpaywall: download link for Open Access content via Crossref

Works

DataCite Commons currently includes 909,820 works, with identifiers and metadata provided by DataCite and Crossref. For the three major work types publication, dataset and software, the respective numbers by publication year are shown below.

644,852 Publications

137,649 Datasets

3,817 Software

302,412 out of all 910,543 (33.21%) works have been cited at least once, including 1.45% of works registered with DataCite, and 50.13% of works registered with Crossref.

300,703 (46.63%) Cited Publications

914 (0.66%) Cited Datasets

1 (0.03%) Cited Software

People

DataCite Commons includes all 91,500 ORCID identifiers, and personal and employment metadata. This information is retrieved live from the ORCID REST API, the respective numbers by registration year are shown below.

91,500 People

164,038 out of all 910,543 (18.02%) works have been claimed (connected) to at least one ORCID record, including 26.14% of works registered with DataCite, and 13.70% of works registered with Crossref.

87,140 (13.51%) Claimed Publications

6,987 (5.08%) Claimed Datasets

2,529 (66.26%) Claimed Software

Organizations

DataCite Commons includes all 105,221 Research Organization Registry (ROR) identifiers and metadata. This information is retrieved live from the ROR REST API, the respective numbers by registration year are shown below.

105,221 Organizations

209,763 out of all 910,543 (23.04%) works are connected with at least one organization via ROR ID or Crossref Funder ID, including 51.93% of works registered with DataCite, and 7.64% of works registered with Crossref.

68,691 (10.65%) Connected Publications

41,941 (30.47%) Connected Datasets

2,616 (68.54%) Connected Software