
medium-dataset's Introduction

An anonymized dataset of 1+ million users in the Medium social network

Publication:
F. Li, Y. Chen, R. Xie, F. Ben Abdesslem and A. Lindgren, "Understanding Service Integration of Online Social Networks: A Data-Driven Study," in Proceedings of IEEE International Conference on Pervasive Computing and Communications Workshops, 2018.
(BibTeX citation below)

Licensed under Creative Commons Attribution Share Alike 4.0.

Description

This dataset covers 1,075,983 users on Medium, including the social graph and profile/activity data for each user. Usernames are replaced with numerical IDs. The data was collected in August 2016.

Files

Download links:

  • nodes.txt: Node IDs (0 to 1,075,982) used to anonymize the dataset.
    Each line in nodes.txt is an ID for a user.

  • edges.txt: Edge list (30,026,896 edges) of the directed social graph.
    Each line in edges.txt is a directed edge. For example, "0 1" means user 0 follows user 1 on Medium.

  • user_data.csv: Profile/activity data, cross-site linking options and linked Twitter profile data for each user.
    Each row in user_data.csv holds the data of one user. (Data for several users is missing due to errors in the crawling process.)
    Descriptions for each column in user_data.csv:

    • ID: User's anonymized ID.
    • createdAt: Unix timestamp of the user's registration time.
    • followers: Number of followers.
    • following: Number of users this user follows.
    • bio: 1 if the user has a biography, 0 otherwise.
    • posts: Number of posts (articles).
    • responses: Number of responses.
    • recommends: Number of recommends (this function has been replaced by the "claps" function).
    • highlights: Number of highlights.
    • facebook: The cross-site linking option to the user's Facebook account. 1 if the user has enabled it, 0 otherwise.
    • twitter: The cross-site linking option to the user's Twitter account. 1 if the user has enabled it, 0 otherwise.
    • twitter_followers: Number of followers in the user's linked Twitter account.
    • twitter_followings: Number of accounts followed by the user's linked Twitter account.
    • twitter_bio: 1 if the user has a biography in the linked Twitter account, 0 otherwise.
    • twitter_tweets: Number of tweets in the user's linked Twitter account.
    • twitter_likes: Number of likes in the user's linked Twitter account.
    • twitter_lists: Number of lists in the user's linked Twitter account.
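
The file formats described above can be read with a few lines of Python. The following is a minimal sketch, not the authors' tooling: the helper names (`read_edges`, `degree_counts`, `to_datetime`, `read_users`) are hypothetical, and it assumes that edges.txt is whitespace-separated, that user_data.csv starts with a header row using the column names listed above, and that `createdAt` may be in either seconds or milliseconds (values above 1e11 are treated as milliseconds). None of these assumptions is confirmed by the dataset description.

```python
import csv
from collections import Counter
from datetime import datetime, timezone


def read_edges(lines):
    """Yield (follower, followee) ID pairs from lines such as "0 1"."""
    for line in lines:
        parts = line.split()
        if len(parts) != 2:
            continue  # skip blank or malformed lines
        yield int(parts[0]), int(parts[1])


def degree_counts(edges):
    """Return (following, followers) Counters keyed by user ID."""
    following = Counter()  # out-degree: how many users each ID follows
    followers = Counter()  # in-degree: how many users follow each ID
    for src, dst in edges:
        following[src] += 1
        followers[dst] += 1
    return following, followers


def to_datetime(created_at):
    """Convert a Unix timestamp to a timezone-aware UTC datetime.

    Assumption: values above 1e11 are milliseconds, otherwise seconds.
    """
    value = float(created_at)
    if value > 1e11:
        value /= 1000.0
    return datetime.fromtimestamp(value, tz=timezone.utc)


def read_users(stream):
    """Yield one dict per CSV row (assumes a header row is present)."""
    for row in csv.DictReader(stream):
        yield row


# Small in-memory example: user 0 follows users 1 and 2, user 2 follows user 1.
sample_edges = ["0 1", "0 2", "2 1"]
following, followers = degree_counts(read_edges(sample_edges))
```

To run this on the real files, pass open file objects instead of the in-memory sample, for example `read_edges(open("edges.txt"))` and `read_users(open("user_data.csv", newline=""))`. Because some users are missing from user_data.csv, looking up a node ID from the graph in the profile data should tolerate absent rows.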

BibTeX

@INPROCEEDINGS{Li_PerFoT18, 
author={Fei Li and Yang Chen and Rong Xie and Fehmi Ben Abdesslem and Anders Lindgren}, 
booktitle={2018 IEEE International Conference on Pervasive Computing and Communications Workshops (PerCom Workshops)}, 
title={Understanding Service Integration of Online Social Networks: A Data-Driven Study}, 
year={2018},
}
