
manuscripts-tracker

manuscripts-tracker is a set of functions that collect submissions from multiple online preprint platforms: arXiv, bioRxiv, medRxiv, EarthArXiv, SocArXiv, PsyArXiv, NBER, Preprints.org and F1000Research. The utilities also infer the gender of the authors and identify the submissions that are COVID-related.

Installation

The only required step is to install Firefox on your computer if you don't already have it, and to download geckodriver (https://github.com/mozilla/geckodriver/releases) into the tools/ directory.
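
Before the first run, it can help to confirm that the driver is where the code expects it. The following is a minimal sketch, not part of the repository: the helper name find_geckodriver and the binary names geckodriver / geckodriver.exe are assumptions based on the default names in the official release archives.

```python
from pathlib import Path
from typing import Optional


def find_geckodriver(tools_dir: str = "tools") -> Optional[Path]:
    """Return the path to a geckodriver binary in tools_dir, or None if absent.

    Hypothetical helper: the file names below are assumed, not taken
    from the repository.
    """
    for name in ("geckodriver", "geckodriver.exe"):
        candidate = Path(tools_dir) / name
        if candidate.is_file():
            return candidate
    return None


if __name__ == "__main__":
    driver = find_geckodriver()
    print(driver if driver else "geckodriver not found in tools/")
```

If the check reports that the driver is missing, download the release matching your operating system and unpack it into tools/.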

Usage

The first time you run the code, use the init option and pass it a start date in the format YYYY-mm-dd. The script collects manuscript metadata from that date onward.

python main_gender.py init 2019-01-01
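
The date argument must follow the YYYY-mm-dd pattern. As a rough illustration of what that format means in Python terms, the sketch below parses such a string with the standard library. The function name parse_start_date is hypothetical, and this is not necessarily how main_gender.py validates its input.

```python
from datetime import date, datetime


def parse_start_date(text: str) -> date:
    """Parse a YYYY-mm-dd string into a date.

    Raises ValueError if the string cannot be parsed with that pattern
    (for example a different separator or an impossible day).
    """
    return datetime.strptime(text, "%Y-%m-%d").date()


if __name__ == "__main__":
    print(parse_start_date("2019-01-01"))
```

A string such as 01/01/2019 or 2019-02-30 raises ValueError under this pattern, so it is worth checking the date before launching a long collection run.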

After that, you can update the files either by running the script in all mode whenever you want fresh data, or in periodic mode, which updates the data automatically every 24 hours. In both cases, the script only collects data for the days since the last collected date for each repository.

python main_gender.py all
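
The incremental behaviour described above can be pictured as computing, for each repository, the days between its last collected date and today. This is a sketch under stated assumptions, not the repository's code: the function name days_to_collect, the example dates, and the choice to include today in the range are all illustrative.

```python
from datetime import date, timedelta
from typing import Dict, List


def days_to_collect(last_collected: date, today: date) -> List[date]:
    """List each day after last_collected, up to and including today.

    Assumption: the last collected day is already complete, so
    collection resumes on the following day.
    """
    n_days = (today - last_collected).days
    return [last_collected + timedelta(days=i) for i in range(1, n_days + 1)]


if __name__ == "__main__":
    # Hypothetical per-repository state; real values would come from the saved files.
    last_seen: Dict[str, date] = {
        "arXiv": date(2020, 3, 1),
        "bioRxiv": date(2020, 3, 3),
    }
    today = date(2020, 3, 4)
    pending = {repo: days_to_collect(d, today) for repo, d in last_seen.items()}
    for repo, days in pending.items():
        print(repo, [d.isoformat() for d in days])
```

Tracking the last date per repository means a platform that was collected recently is not fetched again just because another platform has fallen behind.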

Contributors

lamvin
