GithubHelp home page GithubHelp logo

panos-span / alexandria3k Goto Github PK

View Code? Open in Web Editor NEW

This project forked from dspinellis/alexandria3k

0.0 0.0 0.0 1.47 MB

Local relational access to openly-available publication data sets

License: GNU General Public License v3.0

Shell 1.41% Python 98.39% sed 0.21%

alexandria3k's Introduction

Alexandria3k CI

Alexandria3k

The alexandria3k package supplies a library and a command-line tool providing efficient relational query access to diverse publication open data sets. The largest one is the entire Crossref data set (157 GB compressed, 1 TB uncompressed). This contains publication metadata from about 134 million publications from all major international publishers with full citation data for 60 million of them. Alternatively, works can be selected from the PubMed data set (43 GB compressed, 327 GB uncompressed), which comprises more than 36 million citations for biomedical literature from MEDLINE, life science journals, and online books, with rich domain-specific metadata, such as MeSH indexing, funding, genetic, and chemical details. In addition, the Crossref and PubMed data sets can be linked with the ORCID summary data set (25 GB compressed, 435 GB uncompressed), containing about 78 million author records, the United States Patent Office issued patents (11 GB compressed, 115 GB uncompressed), containing about 5.4 million records, as well as data sets of funder bodies, journal names, open access journals, and research organizations.

The alexandria3k package installation contains all elements required to run it. It does not require the installation, configuration, and maintenance of a third party relational or graph database. It can therefore be used out-of-the-box for performing reproducible publication research on the desktop.

Documentation

The complete reference and use documentation for alexandria3k can be found here.

Major contributors

Publication

Details about the rationale, design, implementation, and use of this software can be found in the following paper.

Diomidis Spinellis. Open reproducible scientometric research with Alexandria3k. PLoS ONE 18(11): e0294946. November 2023. doi: 10.1371/journal.pone.0294946

alexandria3k's People

Contributors

dspinellis avatar aggelosmargkas avatar panos-span avatar basverlooy avatar dtgupta avatar austinjp avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.