Skip to content

Entities and Fields Reference by Source

This document details the retrievable objects and their attributes for each bibliographic source.

Table of Contents


Semantic Scholar

Base URL: https://api.semanticscholar.org/graph/v1

Available Entities

EntityEndpointsDescription
Paper/paper/{id}, /paper/search, /paper/batchScientific publications
Author/author/{id}, /author/search, /author/batchAuthor profiles
Citation/paper/{id}/citationsIncoming citations
Reference/paper/{id}/referencesOutgoing references
Snippet/snippet/searchText excerpts

Paper - Available Fields

FieldTypeDescriptionAlways Returned
paperIdstringUnique SHA identifier
titlestringArticle title
corpusIdintegerSecondary numeric identifier
externalIdsobjectDOI, ArXiv, MAG, ACL, PMID, PMCID, DBLP
urlstringSemantic Scholar page URL
abstractstringAbstract
venuestringPublication venue name
publicationVenueobjectVenue details (id, name, type, URLs)
yearintegerPublication year
publicationDatestringFull date (YYYY-MM-DD)
publicationTypesarrayClassification (Review, JournalArticle, Conference)
referenceCountintegerNumber of references
citationCountintegerNumber of citations
influentialCitationCountintegerHigh-impact citations
isOpenAccessbooleanOpen Access status
openAccessPdfobjectPDF URL, status, license
fieldsOfStudyarrayAcademic categories
s2FieldsOfStudyarrayDetailed classifications with sources
journalobjectName, volume, pages
citationStylesobjectBibTeX format
authorsarrayAuthor list
citationsarrayCiting articles
referencesarrayCited articles
embeddingobjectSPECTER vector (v1 or v2)
tldrobjectAI-generated summary

Author - Available Fields

FieldTypeDescriptionAlways Returned
authorIdstringUnique identifier
namestringFull name
externalIdsobjectORCID, DBLP
urlstringS2 profile URL
affiliationsarrayOrganizations
homepagestringPersonal website
paperCountintegerNumber of publications
citationCountintegerTotal citations
hIndexintegerh-index
papersarrayPublications

Supported Identifiers

paperId (SHA), CorpusId:<id>, DOI:<doi>, ARXIV:<id>, MAG:<id>,
ACL:<id>, PMID:<id>, PMCID:<id>, URL:<url>

PubMed / NCBI

Base URL: https://eutils.ncbi.nlm.nih.gov/entrez/eutils/

E-utilities Endpoints

EndpointFunctionOutput
einfo.fcgiDatabase metadataDatabase list, indexed fields
esearch.fcgiSearchUIDs, count, query_key
efetch.fcgiRetrievalComplete records
esummary.fcgiSummariesDocSums (lightweight metadata)
elink.fcgiLinksLinked UIDs, external links
epost.fcgiUpload UIDsSession History
espell.fcgiSpell checkSuggestions
ecitmatch.cgiCitation matchPMIDs

Main Databases

DatabaseContentRecords
pubmedBiomedical literature35M+
pmcOpen Access full text8M+
geneGenes-
proteinProteins-
nucleotideDNA/RNA sequences-
taxonomyTaxonomy-
clinvarClinical variants-

PubMed Article - Available Fields

FieldDescriptionVia
PMIDPubMed identifierefetch
TitleTitleefetch, esummary
AbstractAbstractefetch
AuthorListAuthors (name, affiliation, ORCID)efetch
JournalJournal title, ISSN, volume, issueefetch
PubDatePublication dateefetch
ArticleTypeType (Review, Research, etc.)efetch
MeshHeadingListMeSH termsefetch
KeywordListAuthor keywordsefetch
GrantListFundingefetch
ReferenceListCited referencesefetch
DOIDigital Object Identifierefetch
PMCPubMed Central IDelink

Output Formats

FormatParameterDescription
XML PubMedrettype=xmlComplete native format
MEDLINErettype=medlineBibliographic format
Abstractrettype=abstractPlain text
JSONretmode=jsonFor esearch, esummary

Europe PMC

Base URL: https://www.ebi.ac.uk/europepmc/webservices/rest/

Endpoints

EndpointDescription
/searchPublication search
/fieldsAvailable search fields
/{source}/{id}/citationsIncoming citations
/{source}/{id}/referencesReferences
/{source}/{id}/databaseLinksDatabase links
/{source}/{id}/textMinedTermsText-mining annotations
/{id}/fullTextXMLFull text XML
/{id}/supplementaryFilesSupplementary files

Article - Available Fields

FieldTypeDescription
idstringIdentifier (PMID or PMC)
sourcestringMED, PMC, PAT, etc.
titlestringTitle
authorStringstringFormatted authors
authorListarrayDetailed authors
journalTitlestringJournal title
pubYearintegerYear
abstractTextstringAbstract
doistringDOI
isOpenAccessbooleanOA status
inEPMCbooleanFull text in EPMC
citedByCountintegerCitation count
hasReferencesbooleanReferences available
grantsListarrayFunding
meshHeadingListarrayMeSH terms
chemicalListarrayChemical substances

Text-mining Annotations

TypeDescription
DISEASEDiseases
GENE_PROTEINGenes/Proteins
ORGANISMOrganisms
CHEMICALChemical compounds
GO_TERMGene Ontology

Unpaywall

Base URL: https://api.unpaywall.org/v2/

Single Endpoint

GET /{doi}?email=your@email.com

Work - Available Fields

FieldTypeDescription
doistringDOI
doi_urlstringResolved DOI URL
titlestringTitle
genrestringType (journal-article, book-chapter, etc.)
is_paratextbooleanEditorial content
published_datestringPublication date
yearintegerYear
journal_namestringJournal name
journal_issnsstringISSNs
journal_issn_lstringISSN-L
publisherstringPublisher
is_oabooleanOpen Access?
oa_statusstringgold, green, hybrid, bronze, closed
has_repository_copybooleanRepository copy
best_oa_locationobjectBest OA location
first_oa_locationobjectFirst OA location
oa_locationsarrayAll OA locations
oa_locations_embargoedarrayEmbargoed locations
updatedstringLast update
data_standardintegerData version
z_authorsarrayAuthors (via Crossref)

OA Location - Sub-object

FieldTypeDescription
urlstringPage URL
url_for_pdfstringDirect PDF URL
url_for_landing_pagestringLanding page
host_typestringpublisher, repository
licensestringCC-BY, CC-BY-NC, etc.
versionstringpublishedVersion, acceptedVersion, submittedVersion
evidencestringDetection source
pmh_idstringOAI-PMH ID
endpoint_idstringEndpoint ID
repository_institutionstringRepository institution

OpenCitations

Base URL: https://api.opencitations.net/index/v2

Endpoints

EndpointDescription
/citation/{oci}Citation by OCI
/citations/{id}Incoming citations
/references/{id}Outgoing references
/citation-count/{id}Citation count
/reference-count/{id}Reference count

Citation - Available Fields

FieldTypeDescription
ocistringOpen Citation Identifier
citingstringCiting article IDs
citedstringCited article IDs
creationstringCreation date (ISO 8601)
timespanstringTime between publication (PnYnMnD)
journal_scstringJournal self-citation (yes/no)
author_scstringAuthor self-citation (yes/no)

Supported Identifiers

DOI, PMID, PMCID, OMID (OpenCitations ID), ISSN (for venues)

DataCite

Base URL: https://api.datacite.org/

Main Endpoints

EndpointDescription
/doisDOI search
/dois/{doi}Specific DOI
/clientsMember organizations
/providersProviders

DOI Record - Available Fields

FieldTypeDescription
doistringDOI
prefixstringDOI prefix
suffixstringSuffix
identifiersarrayAlternative identifiers
creatorsarrayCreators (name, affiliation, ORCID, ROR)
titlesarrayTitles (main, alternative)
publisherstringPublisher
publicationYearintegerYear
resourceTypeobjectType (Dataset, Software, etc.)
subjectsarraySubjects
contributorsarrayContributors
datesarrayDates (created, issued, updated)
languagestringLanguage (ISO 639)
typesobjectResource types
relatedIdentifiersarrayLinks to other resources
sizesarraySizes
formatsarrayFile formats
versionstringVersion
rightsarrayLicenses
descriptionsarrayDescriptions
geoLocationsarrayGeographic locations
fundingReferencesarrayFunding
urlstringLanding page
contentUrlstringContent URL
xmlstringXML metadata (Base64)
viewCountintegerView count
downloadCountintegerDownloads
citationCountintegerCitations
statestringfindable, registered, draft
createdstringCreation date
registeredstringRegistration date
updatedstringLast update

DOAJ

Base URL: https://doaj.org/api/

Endpoints

EndpointDescription
/search/articles/{query}Article search
/search/journals/{query}Journal search
/articles/{id}Article by ID
/journals/{issn}Journal by ISSN
/bulk/articlesBatch article upload

Article (bibjson) - Available Fields

FieldTypeDescription
idstringDOAJ ID
bibjson.titlestringTitle
bibjson.identifierarrayDOI, other IDs
bibjson.journal.titlestringJournal title
bibjson.journal.issnsarrayISSNs
bibjson.journal.publisherstringPublisher
bibjson.journal.countrystringCountry
bibjson.authorarrayAuthors
bibjson.abstractstringAbstract
bibjson.keywordsarrayKeywords
bibjson.yearstringYear
bibjson.monthstringMonth
bibjson.start_pagestringStart page
bibjson.end_pagestringEnd page
bibjson.linkarrayLinks (PDF, HTML, ePUB, XML)
bibjson.subjectarraySubjects
created_datestringCreation date
last_updatedstringLast update

Journal (bibjson) - Available Fields

FieldTypeDescription
idstringDOAJ ID
bibjson.titlestringTitle
bibjson.alternative_titlestringAlternative title
bibjson.identifierarraypISSN, eISSN
bibjson.publisherobjectPublisher, country
bibjson.institutionobjectInstitution
bibjson.oa_startintegerOA start year
bibjson.apcobjectPublication fees
bibjson.licensearrayLicenses
bibjson.subjectarraySubjects
bibjson.languagearrayLanguages
bibjson.ref.aims_scopestringAims and scope URL
bibjson.ref.author_instructionsstringAuthor instructions URL

Zenodo

Base URL: https://zenodo.org/api/

Main Endpoints

EndpointDescription
/recordsPublished records search
/records/{id}Specific record
/deposit/depositionsDeposit management
/licensesAvailable licenses
/communitiesCommunities

Record - Available Fields

FieldTypeDescription
idintegerZenodo ID
doistringDOI
doi_urlstringDOI URL
conceptdoistringConcept DOI (all versions)
conceptrecidintegerConcept ID
createdstringCreation date
modifiedstringLast modification
metadata.titlestringTitle
metadata.descriptionstringDescription (HTML)
metadata.upload_typestringpublication, dataset, software, etc.
metadata.publication_typestringarticle, preprint, thesis, etc.
metadata.publication_datestringPublication date
metadata.creatorsarrayCreators (name, affiliation, orcid, gnd)
metadata.contributorsarrayContributors
metadata.keywordsarrayKeywords
metadata.subjectsarrayControlled subjects
metadata.related_identifiersarrayRelated identifiers
metadata.grantsarrayFunding
metadata.communitiesarrayCommunities
metadata.licenseobjectLicense
metadata.access_rightstringopen, embargoed, restricted, closed
metadata.embargo_datestringEmbargo end date
metadata.journal.titlestringJournal (if article)
metadata.journal.volumestringVolume
metadata.journal.issuestringIssue
metadata.journal.pagesstringPages
metadata.conference.titlestringConference
metadata.conference.datesstringConference dates
metadata.conference.placestringLocation
metadata.conference.urlstringConference URL
metadata.imprint.publisherstringPublisher
metadata.imprint.isbnstringISBN
metadata.thesis.universitystringUniversity
metadata.thesis.supervisorsarraySupervisors
metadata.versionstringVersion
metadata.languagestringLanguage (ISO 639-3)
metadata.locationsarrayGeo locations
metadata.datesarrayAdditional dates
metadata.methodstringMethodology
filesarrayFiles (id, filename, size, checksum)
ownersarrayOwners
statsobjectStatistics (views, downloads)

DBLP

Base URL: https://dblp.org/

Endpoints

EndpointDescription
/search/publ/apiPublication search
/search/author/apiAuthor search
/search/venue/apiVenue search
/pid/{pid}.xmlPublication by ID
/rec/{key}.xmlRecord by key

Publication - Available Fields

FieldTypeDescription
keystringUnique DBLP key
titlestringTitle
authorsarrayAuthors
venuestringConference/Journal
yearintegerYear
typestringarticle, inproceedings, book, etc.
doistringDOI
eestringElectronic URL
urlstringDBLP URL
pagesstringPages
volumestringVolume
numberstringIssue

Author - Available Fields

FieldTypeDescription
pidstringPerson ID
namestringName
aliasesarrayAlternative names
urlstringProfile URL
affiliationsarrayAffiliations
notesarrayNotes

Venue - Available Fields

FieldTypeDescription
venuestringFull name
acronymstringAcronym
typestringjournal, conference, workshop
urlstringDBLP URL

bioRxiv / medRxiv

Base URL: https://api.biorxiv.org/

Endpoints

EndpointDescription
/details/{server}/{interval}Metadata by period
/pubs/{server}/{interval}Published preprints
/pub/{interval}Publications (bioRxiv)
/publisher/{prefix}/{interval}By publisher
/funder/{server}/{interval}/{ror}By funder
/sum/{interval}Statistics
/usage/{interval}/{server}Usage metrics

Preprint - Available Fields

FieldTypeDescription
doistringPreprint DOI
titlestringTitle
authorsstringAuthors (text format)
author_correspondingstringCorresponding author
author_corresponding_institutionstringCorresponding institution
datestringPublication date
versionstringVersion (1, 2, etc.)
typestringPreprint type
licensestringLicense
categorystringScientific category
jatsxmlstringJATS XML URL
abstractstringAbstract
publishedstringPublished version DOI
serverstringbiorxiv or medrxiv

Published Preprint - Additional Fields

FieldTypeDescription
biorxiv_doistringPreprint DOI
published_doistringPublished DOI
published_journalstringPublication journal
published_datestringPublication date
preprint_platformstringOrigin platform

Funder Data - Additional Fields

FieldTypeDescription
funding.namestringFunder name
funding.idstringFunder ID
funding.id-typestringID type (ROR, Crossref)
funding.awardstringGrant number

Usage Statistics

FieldTypeDescription
monthstringMonth
abstract_viewsintegerAbstract views
full_text_viewsintegerFull text views
pdf_downloadsintegerPDF downloads
*_cumulativeintegerCumulative totals

CORE

Base URL: https://api.core.ac.uk/v3/

Endpoints

EndpointDescription
/search/worksWorks search (deduplicated)
/search/outputsOutputs search (raw)
/search/data-providersProvider search
/search/journalsJournal search
/works/{id}Specific work
/outputs/{id}Specific output
/outputs/{id}/downloadPDF download

Work - Available Fields

FieldTypeDescription
idintegerCORE ID
doistringDOI
titlestringTitle
abstractstringAbstract
authorsarrayAuthors
contributorsarrayContributors
publisherstringPublisher
journalsarrayJournals
yearPublishedintegerYear
publishedDatestringPublication date
acceptedDatestringAcceptance date
depositedDatestringDeposit date
documentTypestringDocument type
fullTextstringFull text
downloadUrlstringDownload URL
sourceFulltextUrlsarraySource URLs
citationCountintegerCitations
referencesarrayReferences
fieldOfStudyarrayFields
identifiersarrayAll identifiers
arxivIdstringArXiv ID
magIdstringMicrosoft Academic ID
pubmedIdstringPubMed ID
oaiIdsarrayOAI-PMH IDs
dataProvidersarrayProviders
outputsarrayRelated outputs
linksarrayLinks
createdDatestringCreation date
updatedDatestringLast update

Output - Additional Fields

FieldTypeDescription
repositoriesarraySource repositories
repositoryDocumentobjectRepository document
fulltextStatusstringFull text status
languagestringLanguage
licensestringLicense
subjectsarraySubjects
tagsarrayTags
sdgarraySustainable Development Goals
oaistringOAI identifier
setSpecsarrayOAI-PMH sets

Data Provider - Available Fields

FieldTypeDescription
idintegerCORE ID
namestringName
institutionNamestringInstitution
typestringType (repository, journal)
homepageUrlstringWebsite URL
oaiPmhUrlstringOAI-PMH URL
emailstringContact
locationobjectLocation
logostringLogo URL
rorIdstringROR ID
openDoarIdstringOpenDOAR ID
softwarestringSoftware (DSpace, EPrints)
metadataFormatstringMetadata format

Scopus

EntityKey Fields
DocumentEID, DOI, title, authors, abstract, affiliation, citedby_count, keywords, subject_areas
AuthorAuthor ID, name, affiliation, h-index, document_count, cited_by_count, orcid
AffiliationAffiliation ID, name, city, country, document_count

Web of Science

EntityKey Fields
DocumentUID, DOI, title, authors, source, year, keywords, times_cited
JournalISSN, title, impact_factor, category, publisher

IEEE Xplore

EntityKey Fields
DocumentArticle number, DOI, title, authors, abstract, publication_title, conference_dates, content_type
StandardStandard number, title, status, committee

Dimensions

EntityKey Fields
Publicationsid, doi, title, authors, abstract, journal, year, citations_count, altmetrics
Grantsid, title, funder, amount, start_year, investigators
Patentsid, title, inventors, assignees, filing_date, jurisdiction
Clinical Trialsid, title, phase, conditions, interventions, registry
Datasetsid, doi, title, repository, year
Policy Documentsid, title, publisher, year

Coverage Comparison

SourcePublicationsAuthorsCitationsFull TextFunding
Semantic Scholar✅ 200M+✅ Rich
PubMed✅ 35M+Via PMC
Europe PMC✅ 40M+✅ 10M+
Unpaywall✅ URLs
OpenCitations✅ 1.4B+
DataCite✅ 50M+
DOAJ✅ 9M+
Zenodo✅ 3M+
DBLP✅ 6M+
bioRxiv✅ 250k+
CORE✅ 300M+