SemOpenAlex

GraphsCC0Introduced 2023-08-07

SemOpenAlex is an extensive RDF knowledge graph that contains over 26 billion triples about scientific publications and their associated entities, such as authors, institutions, journals, and concepts.

  • SemOpenAlex is licensed under CC0, providing free and open access to the data.
  • We offer the data through multiple channels, including RDF dump files, a SPARQL endpoint, and as a data source in the Linked Open Data cloud, complete with resolvable URIs and links to other data sources (ISNI, DOI, ORCID, ROR, Scopus, DOAJ, Wikidata,
  • Moreover, we provide embeddings for knowledge graph entities using high-performance computing.

SemOpenAlex enables a broad range of use-case scenarios, such as

  • exploratory semantic search via our website,
  • large-scale scientific impact quantification,
  • other forms of scholarly big data analytics within and across scientific disciplines.
  • enables academic recommender systems, such as recommending collaborators, publications, and venues, including explainability capabilities.
  • can serve for RDF query optimization benchmarks,
  • creating scholarly knowledge-guided language models,
  • as a hub for semantic scientific publishing.