teonghan / scopus-extract-h-index

A simple web scraping script to extract the h-index, citation count, and number of publications from a Scopus Author Profile

License: GNU General Public License v3.0

Python 100.00%
python3 scopus

scopus-extract-h-index's Introduction

Scopus-Extract-h-Index

An input file (Excel .xlsx) must be prepared beforehand with at least the following column headers:

  1. Main reference ID: any reference ID that uniquely identifies a person in your records
  2. Scopus ID: the Scopus ID of that person; multiple Scopus IDs for the same person can be separated by ;[space]
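
As an illustration of the "Scopus ID" column format described above, here is a minimal sketch of how one cell could be split into individual IDs. It is not taken from the script itself: the helper name `split_scopus_ids`, the sample rows, and the ID values are made up for the example, and the rows are plain dictionaries standing in for what an Excel reader such as Pandas would return.

```python
def split_scopus_ids(cell):
    """Split one 'Scopus ID' cell into a list of ID strings (hypothetical helper)."""
    if cell is None:
        return []
    # Excel readers may hand back numeric cells as floats, and empty cells as NaN.
    if isinstance(cell, float):
        if cell != cell:  # NaN is the only value not equal to itself
            return []
        cell = int(cell)
    # IDs of the same person are separated by ';' followed by a space.
    return [part.strip() for part in str(cell).split(";") if part.strip()]


# Made-up sample rows using the two required headers.
rows = [
    {"Main reference ID": "A001", "Scopus ID": "12345678900; 98765432100"},
    {"Main reference ID": "A002", "Scopus ID": 55555555500},
]

ids_by_person = {
    row["Main reference ID"]: split_scopus_ids(row["Scopus ID"]) for row in rows
}
print(ids_by_person)
```

Converting every ID to a string keeps single-ID cells (which Excel tends to store as numbers) and multi-ID cells (stored as text) in the same shape.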

Requirements

Python 3, Pandas, Selenium

General Notes

  1. A more efficient approach is to use the Scopus API. For details, see https://dev.elsevier.com/
  2. The script is not perfect. Problems such as timeouts or JavaScript that has not finished rendering (possibly caused by the connection to Scopus.com) can result in some indicators being set to zero, because the script takes the easy route of catching every Exception and resetting the affected indicators to zero. After the run finishes, go through the output and fix these entries manually.
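
The catch-all behaviour described in note 2 can be sketched roughly as follows. This is an assumption-laden illustration, not the script's actual code: `extract_indicators`, the indicator names, and the stand-in fetch functions are hypothetical, and no Selenium call is shown. The sketch also records which indicators fell back to zero, which is one possible way to make the manual review easier.

```python
def extract_indicators(fetchers):
    """Run each fetcher; on any exception store 0 and remember the indicator name.

    `fetchers` maps an indicator name to a zero-argument callable that returns
    the scraped text (hypothetical interface for this sketch).
    """
    values, failed = {}, []
    for name, fetch in fetchers.items():
        try:
            values[name] = int(fetch())
        except Exception:
            # Same easy route as described above: any failure becomes zero.
            values[name] = 0
            failed.append(name)
    return values, failed


def broken():
    # Stand-in for a page element that never finished rendering.
    raise TimeoutError("page did not render in time")


values, failed = extract_indicators({
    "h_index": lambda: "12",
    "citations": broken,
    "documents": lambda: "34",
})
print(values, failed)
```

Keeping the list of failed indicator names next to each row would distinguish a genuine zero from a scraping failure when checking the output by hand.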
