This is only an overview. Check the manual for more details.
Platforms such as Google Scholar and Web of Science allow users to search for authors, search by paper/article, and see how many citations a given paper has. The goal of this project is to create a dashboard-style interface that displays activity and engagement surrounding scientific publications. Rather than focusing on numbers of citations, we will be displaying the number of interactions. The data describing these interactions is taken from the Crossref API.
Crossref is an official agency responsible for assigning DOIs (Document Object Identifier) to published research. They scrape the web across 13 different platforms (Twitter, Reddit, Wikipedia, and others) looking for interactions with scientific papers. These interactions can be in the form of tweets, reddit posts, or wikipedia references. See the Crossref documentation to learn about the platforms.
OpenCitations is an organization dedicated to publication of citation and bibliographic data. To learn more about OpenCitations data, click here.
We also have been given data from our friends at SciELO to add foreign publications to our records. To learn more about SciELO data, click here.
We are developing a website which will allow users to search our databases for DOI's, authors, paper titles, or journals. Users will be able to see how many Events, and of which type, a given paper or author has generated. The number of papers published per year for a given journal will also be shown.
Here are some screenshots taken with a small slice of the total data.
Software | License | Version |
---|---|---|
MySQL | GPL-2.0 License | 8.0.21 |
MySQL Python Connector | GPL-2.0 License | 8.0.21 |
Python | Python Software Foundation License | 3.8.0 |
Bootstrap | MIT License | 3.3.7 |
C3 | MIT License | 3.5.1 |
D3 | BSD-3-Clause License | 5.7.2 |
jQuery | MIT License | 0.7.20 |
MongoDB | SSPL v1.0 License | 4.4 |
Python Module | License | Version |
---|---|---|
schedule | MIT License | 0.6.0 |
crossrefapi | BSD-2-Clause License | 1.5.0 |
mysql-connector-python | GPL-2.0 License | 8.0.21 |
flask | BSD-3-Clause License | 1.1.2 |
flask-mysqldb | MIT License | 0.2.0 |
virtualenv | MIT License | 20.0.35 |
python-dateutil | Apache-2.0 License | 2.8.1 |
flask-paginate | BSD-3-Clause License | 0.7.1 |
pytz | MIT License | 2020.1 |
email-validator | CC0 1.0 Universal | 1.1.2 |
pandas | BSD-3-Clause License | 1.2.4 |
smtplib | PSF License Agreement | Python v3.9 |
ssl | PSF License Agreement | Python v3.9 |
EmailMessage | PSF License Agreement | Python v3.9 |
MIMEMultipart | PSF License Agreement | Python v3.9 |
MIMEApplication | PSF License Agreement | Python v3.9 |
MIMEText | PSF License Agreement | Python v3.9 |
OS: Windows 10
Browser: Google Chrome, Microsoft Edge
Time taken to ingest Metadata (for 1000 DOIs): 40 minutes
Time taken to ingest Events (for 1000 DOIs): 1 hour
Time taken to ingest Citations (for 1000 DOIs): 1 hour and 10 minutes
Name | ||
---|---|---|
Darpan Shah | https://www.linkedin.com/in/darpanshah-/ | [email protected] |
Rihat Rahman | https://www.linkedin.com/in/rihat-rahman/ | [email protected] |
Salsabil Bakth | https://www.linkedin.com/in/salsabil-bakth/ | [email protected] |
Mohammad Tahmid | https://www.linkedin.com/in/mtahmid/ | [email protected] |
Tabish Shaikh | https://www.linkedin.com/in/tabish-shaikh-0497011a4/ | [email protected] |
We would like to thank:
- Dr. Timothy Bowman for the project idea and guidance! ๐ก
- Saeid Mofrad and Minh Nguyen for assisting us with technical questions ๐ฌ
- Seyed Ziae Mousavi Mojab for teaching the class, of which this project was a part ๐