There are several sources of bibliography data that can be retrieved from online. Many of these have unique identifiers that can be used retrieve specific versions of a document. A single document may have multiple unique identifiers under different namespaces. Examples of these kinds of namespaces include:
- ArXiV ID
- DOI
- PubMed ID
- https://en.wikipedia.org/wiki/Bibcode
Bibliography data may be stored in different kinds of metadata formats including:
- RIS
- BibTeX
- MARC
URI normalisation
- remove proxies
s/.ezprox.lib.uh.edu//
- link DOIs to what they redirect to
Controlled vocabulary
- ORCID: Open Researcher and Contributor ID
- GRID: Global Research Identifier Database
- https://orcid.org/blog/2016/03/09/organisations-missing-link
Schema
- - Open standards and software for bibliographies and cataloging
- lists several software packages and formats for storing bibliographic data.
Standards
- Dublin Core
- FRBR
- SKOS
- CrossRef API
Sources for data
- CiteSeer
- DBLP
- http://www.visionbib.com/bibliography/contents.html
- http://pages.cs.wisc.edu/~anhai/projects/cimple/ - originally DBLife
- http://www.aaai.org/Library/library.php
- http://www.aclweb.org/anthology/
- HCI Bibliography : Human-Computer Interaction Resources
- OpenAlex
http://www.loc.gov/cds/products/marcDist.php
Retractions: - https://retractionwatch.com/retraction-watch-database-user-guide/ - https://www.zotero.org/blog/retracted-item-notifications/