Bibliographic data can be retrieved from several online sources. Many of these sources assign unique identifiers that can be used to retrieve specific versions of a document. A single document may have multiple unique identifiers under different namespaces. Examples of these namespaces include:

  • arXiv ID
  • DOI
  • PubMed ID
  • Bibcode (https://en.wikipedia.org/wiki/Bibcode)
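
Because one document can carry identifiers from several of these namespaces, it can help to guess which namespace a bare string belongs to. The following is a minimal sketch, not a validator: the function name `guess_namespace` and the patterns are assumptions made for illustration, and they only check the rough shape of each identifier (for example, PubMed IDs are assumed to be at most 8 digits).

```python
import re
from typing import Optional

# Heuristic shapes only (assumptions for illustration, not full specifications).
PATTERNS = {
    # DOI: "10.", a 4-9 digit registrant code, "/", then a suffix.
    "doi": re.compile(r"^10\.\d{4,9}/\S+$"),
    # arXiv: new style "YYMM.NNNNN" or old style "archive/YYMMNNN", optional version.
    "arxiv": re.compile(
        r"^(?:\d{4}\.\d{4,5}|[a-z-]+(?:\.[A-Z]{2})?/\d{7})(?:v\d+)?$"
    ),
    # PubMed ID: a plain integer (assumed here to be at most 8 digits).
    "pmid": re.compile(r"^\d{1,8}$"),
    # Bibcode: 19 characters, starting with a 4-digit year.
    "bibcode": re.compile(r"^\d{4}[A-Za-z0-9&.]{15}$"),
}


def guess_namespace(identifier: str) -> Optional[str]:
    """Return the first namespace whose pattern matches, or None."""
    text = identifier.strip()
    for namespace, pattern in PATTERNS.items():
        if pattern.match(text):
            return namespace
    return None
```

For example, `guess_namespace("10.1000/182")` returns `"doi"`, while a string that fits none of the shapes returns `None`. A match only means the string looks like an identifier in that namespace, not that it resolves to a real record.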

Bibliographic data may be stored in different metadata formats, including:

  • RIS
  • BibTeX
  • MARC

URI normalisation

  • remove library proxy suffixes from hostnames, e.g. s/\.ezprox\.lib\.uh\.edu//
  • map DOIs to the URLs they redirect to
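
The two normalisation steps above can be sketched roughly as follows. This is an illustration under stated assumptions rather than a finished tool: the helper names (`strip_proxy`, `doi_url`, `resolve_doi`) are hypothetical, the proxy list contains only the suffix from the note above, and following the redirect through the https://doi.org/ resolver needs network access and may be refused by some publishers.

```python
import urllib.request
from urllib.parse import quote, urlsplit, urlunsplit

# Only the suffix mentioned in the notes; other institutions use other suffixes.
PROXY_SUFFIXES = (".ezprox.lib.uh.edu",)


def strip_proxy(url: str) -> str:
    """Remove a known library-proxy suffix from the host part of a URL.

    URLs without a known suffix are returned unchanged. Any user:password
    part of a proxied URL is dropped (a simplification in this sketch).
    """
    parts = urlsplit(url)
    host = parts.hostname or ""
    for suffix in PROXY_SUFFIXES:
        if host.endswith(suffix):
            host = host[: -len(suffix)]
            break
    else:
        return url
    # Keep an explicit port if the original URL had one.
    netloc = host if parts.port is None else f"{host}:{parts.port}"
    return urlunsplit(
        (parts.scheme, netloc, parts.path, parts.query, parts.fragment)
    )


def doi_url(doi: str) -> str:
    """Build the resolver URL for a DOI."""
    return "https://doi.org/" + quote(doi)


def resolve_doi(doi: str, timeout: float = 10.0) -> str:
    """Follow the resolver's redirects and return the final URL (needs network)."""
    with urllib.request.urlopen(doi_url(doi), timeout=timeout) as response:
        return response.geturl()
```

Working on the parsed hostname instead of running a substitution over the whole string avoids touching a path or query that happens to contain the proxy name.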

Controlled vocabulary

Schema

"Open standards and software for bibliographies and cataloging" lists several software packages and formats for storing bibliographic data.

Standards

Sources for data

http://www.loc.gov/cds/products/marcDist.php

Retractions

  • https://retractionwatch.com/retraction-watch-database-user-guide/
  • https://www.zotero.org/blog/retracted-item-notifications/