
fiverr's Introduction

Data Collection

All the files below crawl data from Fiverr and store it in JSON format. They also contain the code to push the JSON files into a MongoDB database, where the data is easy to query. Fiverr uses different page templates for different categories, hence the separate files.

  • Main2.py This crawler was used for the Programming and Tech category and other similar categories.
  • Main_format2.py This crawler is used for extracting information from Advertising, Business and other similar categories. You can recognise which categories share a template by browsing Fiverr.
  • others.py This crawler was used for the 'Others' category.
  • username.py
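
The store-then-load step described above can be sketched as follows. This is a minimal sketch under assumptions, not the code in the crawlers: the function name `load_gigs` and the file layout (one JSON array of gig objects per file) are hypothetical. The `collection` argument is any object that exposes `insert_many`, as a pymongo collection does.

```python
import json


def load_gigs(json_path, collection):
    """Read crawled gigs from a JSON file and insert them into a collection.

    Assumes the file holds a JSON array of gig objects (hypothetical layout).
    Returns the number of documents inserted.
    """
    with open(json_path, encoding="utf-8") as handle:
        gigs = json.load(handle)
    if not gigs:
        # pymongo's insert_many raises on an empty list, so skip the call.
        return 0
    collection.insert_many(gigs)
    return len(gigs)
```

With pymongo installed and a MongoDB server running, a call might look like `load_gigs("gigs.json", MongoClient()["fiverr"]["gigs"])`, where `gigs.json`, `fiverr` and `gigs` are placeholder names.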

Processing

  • formatLIWC.py
    This code converts the dataset into a format that LIWC can read.

Case Specific Analysis

  • Multiple_review_check.py
    This script checks whether a single user has left multiple reviews on a particular gig. It runs on the existing collection of gigs and creates a new collection that holds, for each gig, the number of reviews made by each user who purchased it. It also stores the message text of those reviews.
  • Multiple_data_analyse.py
    This script works on the collection created by Multiple_review_check.py and computes statistics such as the "Average number of reviews given by a single user per gig" and the "Average number of users making multiple reviews per gig".
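
The two steps above can be sketched in plain Python. This is a minimal sketch, not the scripts themselves: the document layout (a `reviews` list whose entries carry `user` and `message` keys) and the function names are assumptions, and the first statistic is interpreted here as the mean, over gigs, of reviews per distinct reviewer.

```python
from collections import defaultdict


def reviews_per_user(gig):
    """Group review messages by reviewer for a single gig."""
    by_user = defaultdict(list)
    for review in gig["reviews"]:  # assumed keys: "user", "message"
        by_user[review["user"]].append(review["message"])
    return dict(by_user)


def summarise(gigs):
    """Return (avg reviews per user per gig, avg multi-reviewers per gig)."""
    per_gig_avg = []
    multi_counts = []
    for gig in gigs:
        by_user = reviews_per_user(gig)
        if not by_user:
            continue  # skip gigs with no reviews
        counts = [len(messages) for messages in by_user.values()]
        per_gig_avg.append(sum(counts) / len(counts))
        multi_counts.append(sum(1 for c in counts if c > 1))
    if not per_gig_avg:
        return 0.0, 0.0
    return (sum(per_gig_avg) / len(per_gig_avg),
            sum(multi_counts) / len(multi_counts))
```

Keeping the grouping step separate from the summary mirrors the split between the two scripts: the grouped data can be stored once and analysed repeatedly.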

Sentiment Analysis

  • senti_dict.py
    Performs sentiment analysis on the gig reviews using the SentiWordNet lexicon. Modifies the gig JSON to add the new positive, negative and objective scores.
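
A lexicon-based scorer of this kind can be sketched as below. This is a minimal sketch and not the logic of senti_dict.py: `TOY_LEXICON` is a made-up stand-in for SentiWordNet, the per-word averaging is one possible aggregation, and the `reviews`, `message` and `scores` keys are assumed. The one property taken from SentiWordNet is that the objective score equals 1 minus the sum of the positive and negative scores.

```python
# Toy stand-in for the SentiWordNet lexicon: word -> (positive, negative).
# These entries and their values are invented for illustration only.
TOY_LEXICON = {
    "great": (0.75, 0.0),
    "poor": (0.0, 0.625),
    "fast": (0.25, 0.0),
}


def score_review(text, lexicon=TOY_LEXICON):
    """Average positive, negative and objective scores over known words."""
    words = [w.strip(".,!?").lower() for w in text.split()]
    hits = [lexicon[w] for w in words if w in lexicon]
    if not hits:
        # No lexicon word found: treat the review as fully objective.
        return {"positive": 0.0, "negative": 0.0, "objective": 1.0}
    pos = sum(p for p, _ in hits) / len(hits)
    neg = sum(n for _, n in hits) / len(hits)
    return {"positive": pos, "negative": neg, "objective": 1.0 - pos - neg}


def add_scores(gig, lexicon=TOY_LEXICON):
    """Attach a scores dict to every review of a gig, in place."""
    for review in gig["reviews"]:  # assumed keys: "reviews", "message"
        review["scores"] = score_review(review["message"], lexicon)
    return gig
```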

Review Graph Implementation

Plotting

  • plot.py
    Plots of 'fraction of gigs vs reviews' (linear and log-log), 'fraction of gigs vs rating' (linear and log-log), and 'category-wise representation of average reviews, ratings and favourite count'.
  • pos_score_plot.py
    Plots of 'fraction of gigs vs positive score'.
  • plot_negative.py
    Plots of 'fraction of gigs vs negative score'.
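
The data behind a 'fraction of gigs vs X' plot can be computed as in the sketch below. It is a minimal illustration, not the plotting scripts: the function names are hypothetical, and the input is assumed to be one number per gig (for example its review count).

```python
import math
from collections import Counter


def fraction_of_gigs(values):
    """Map each distinct value to the fraction of gigs that have it."""
    counts = Counter(values)
    total = len(values)
    return {value: n / total for value, n in sorted(counts.items())}


def log_log_points(fractions):
    """Convert (value, fraction) pairs to log10 coordinates.

    Values that are zero or negative are skipped because their logarithm
    is undefined.
    """
    return [(math.log10(value), math.log10(frac))
            for value, frac in fractions.items() if value > 0]
```

If matplotlib is available, the dictionary's keys and values can be passed to `matplotlib.pyplot.plot` for the linear view or to `matplotlib.pyplot.loglog` for the log-log view.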

LIWC plotting

  • After processing the dataset with formatLIWC.py, obtain the LIWC scores and then run log_log.py. It produces all the value vs frequency plots in log-log scale in the directory it was run from.

Word-cloud plotting

  • fiverr_wordcloud.py
    Makes a word-cloud of all the words used in the product reviews given in the file specified.
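
A word-cloud is driven by word frequencies, which can be computed as in the sketch below. This is a minimal sketch, not fiverr_wordcloud.py: the function name, the tiny stopword set and the tokenising pattern are assumptions.

```python
import re
from collections import Counter


def word_frequencies(reviews, stopwords=frozenset({"the", "a", "and", "is"})):
    """Count lowercase words across review texts, ignoring stopwords."""
    counts = Counter()
    for text in reviews:
        for word in re.findall(r"[a-z']+", text.lower()):
            if word not in stopwords:
                counts[word] += 1
    return counts
```

If the third-party `wordcloud` package is installed, the resulting counts can be rendered with `WordCloud().generate_from_frequencies(counts)`.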

fiverr's People

Contributors

a1k24, abinashmeher999, djr-jsr, krishna95
