dhimmel / biorxiv-licenses

The licensing of bioRxiv preprints

Home Page: http://blog.dhimmel.com/biorxiv-licenses/

License: Other

Jupyter Notebook 99.80% Python 0.19% Shell 0.01%
biorxiv preprints licensing creative-commons publishing science satoshi-village

Contributors: dhimmel, omnesres

biorxiv-licenses's Issues

Commit work back to GitHub from mybinder.org

I used the following workflow (will only work if you're me):

if [ ! -d .git ] ; then
  git init
  git remote add origin https://github.com/dhimmel/biorxiv-licenses.git
  git config --global user.email "[email protected]"
  git config --global user.name "Daniel Himmelstein"
fi

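# Add files you want and commit (path/to/file is a placeholder)
git add path/to/file
git commit --message "Your description"
# Fetch so that origin/master exists before setting it as upstream
git fetch origin
git branch --set-upstream-to=origin/master master

git pull --rebase
# If there's a conflict (during a rebase, --ours is the upstream side
# and --theirs is the commit being replayed, i.e. your own work)
git checkout --ours Dockerfile
git checkout --theirs path/to/file  # Files modified by your commits here
# Add files with desired changes
git add Dockerfile path/to/file
git rebase --continue

# Push to GitHub (enter username and password when prompted)
git push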

Note: I don't recommend this. Conflicts arise and are hard to resolve. Also, the binder can be out of date unless you recently rebuilt it.

Script biorxiv preprint dataset creation

@OmnesRes initially provided us with biorxiv_data.txt via Twitter (in 54db073). In a series of commits, I modified this file to create biorxiv-prepubmed.tsv, which uses a simpler format. It would be great to script the creation of biorxiv-prepubmed.tsv so we could rerun it in the future and make this study reproducible.

@OmnesRes, is it possible to generate this dataset via public APIs? Do you have the code that you used to create biorxiv_data.txt? I'm happy to help you create a pull request to add this functionality.

Analyzing article dates

I think it would be really interesting to look at license preferences over time, but dates with preprints are tricky.

The date in PrePubMed is not necessarily the date of the first version of the preprint. PrePubMed began indexing articles in April 2016. For any article added to bioRxiv after that date, the date in PrePubMed should be the date of the first version of the preprint. For articles posted before that date, PrePubMed indexed the newest version available at the time, not the earliest version. As a result, someone could theoretically have submitted a preprint in 2014 and chosen their license in 2014, but then updated their article in February 2016, and PrePubMed would have the date of February 2016. It is possible to reindex the dates to get the date of the first version, but this assumes that users do not change the license when they upload a new version.
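
The rule above can be sketched in Python as a rough illustration. It labels each PrePubMed date as either a probable first-version date or only an upper bound on it. The function name, the labels, and the cutoff constant are hypothetical. The text gives only "April 2016" as the indexing start, so the sketch assumes 1 May 2016 as the first clearly reliable day, which treats all of April as ambiguous.

```python
from datetime import date

# Assumption: the exact day in April 2016 when indexing began is not stated,
# so every date before 1 May 2016 is treated as potentially ambiguous.
FIRST_RELIABLE_DATE = date(2016, 5, 1)


def classify_prepubmed_date(prepubmed_date: date) -> str:
    """Label how a PrePubMed date relates to the preprint's first version.

    "first_version": the article was likely indexed when first posted.
    "upper_bound": the date may belong to a later version, so the first
    version was posted on or before this date.
    """
    if prepubmed_date >= FIRST_RELIABLE_DATE:
        return "first_version"
    return "upper_bound"


# The scenario from the text: a 2014 preprint updated in February 2016
# appears with a February 2016 date, which is only an upper bound.
examples = [date(2016, 2, 15), date(2016, 6, 1)]
labels = [classify_prepubmed_date(d) for d in examples]
```

An analysis of license preferences over time could then restrict itself to the "first_version" records, or reindex the "upper_bound" ones under the assumption stated above that licenses do not change between versions.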
