Knowself

What can be learned about personality from writing samples? This project explores the question using Twitter streams, Facebook posts and stream-of-consciousness essays.

The Datasets:

Three datasets of writing samples with personality labels were found.

  1. Twitter posts from 152 unique people, 14,166 posts in total.
  2. Facebook posts from 250 unique people, 9,917 posts in total.
  3. Stream-of-consciousness essays from 2,468 unique people.

All three datasets are labeled with "Big 5" personality types. The Big 5 model of personality evolved from the lexical hypothesis: the range of human perspectives on the world must be encoded in the language that we use. The Big 5, or Five Factor, model claims that the personality characteristics represented by the terms in our language cluster into five groups, each corresponding to a scale on which personality can be measured.

The Application:

The plan is to use the model to predict personality from a person's Twitter stream. Given this goal, the model will probably be best developed using the first dataset, but it is also of interest to know how well a model performs when it is trained on the other sets and then applied to tweets. Once the tags, user mentions and links are filtered out, the remaining text from Twitter may be similar to the stream-of-consciousness essays or to similarly filtered Facebook posts.
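The filtering step described above could look roughly like the following. This is a minimal sketch, not the project's actual code: the function name `clean_tweet` and the regular expressions are assumptions made for illustration, and real tweets may need more careful handling (for example, email addresses or retweet markers).

```python
import re

# Hypothetical patterns; the project's real filtering rules are not documented here.
URL_RE = re.compile(r"https?://\S+|www\.\S+")  # links
MENTION_RE = re.compile(r"@\w+")               # user mentions such as @someone
HASHTAG_RE = re.compile(r"#\w+")               # tags such as #topic
WHITESPACE_RE = re.compile(r"\s+")


def clean_tweet(text: str) -> str:
    """Remove links, user mentions and tags, then collapse whitespace."""
    text = URL_RE.sub(" ", text)
    text = MENTION_RE.sub(" ", text)
    text = HASHTAG_RE.sub(" ", text)
    return WHITESPACE_RE.sub(" ", text).strip()


if __name__ == "__main__":
    sample = "@bob check this out https://example.com #wow so cool"
    print(clean_tweet(sample))  # prints "check this out so cool"
```

Applying the same function to Facebook posts would give the "similarly filtered" text mentioned above, so that a model trained on one source sees input in roughly the same form as the tweets it is later applied to.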

The application will also include a short personality test to gather additional data.
