Discovr

GSOC Page - This repository was created during GSOC 2016. Because all commits for GSOC were made by the same user, this page is the best overview of the project. A summary of the GSOC project can be found at the website below.

http://vaionex.com/discovrWB/

Graphical clusters of a variety of statistical methods in R.

The discovr package provides a fast and efficient way to run the most common statistical analysis methods. It lets users quickly check method-specific indicators for the viability of a given method, which reduces the trial-and-error approach with which R users still approach most datasets.

During development we saw the possibility of a rich, interactive d3 interface to connect the different tests. We therefore created a customized D3 htmlwidget from scratch.

To give instant access to all data, the future package is integrated so that all statistical tests can be processed in parallel.
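
As a rough illustration of what parallel processing with the future package can look like, the sketch below starts one future per column and collects the results. It is not Discovr's internal code: the choice of columns, the number of workers, and the test being run are assumptions made only for this example.

```r
# Sketch only: not Discovr's internal code.
library(future)

# Evaluate futures in two background R sessions (worker count is an assumption).
plan(multisession, workers = 2)

df <- mtcars[, c("mpg", "hp", "wt")]

# Start one future per column; each one runs a Shapiro-Wilk test.
futures <- lapply(df, function(col) future(shapiro.test(col)$p.value))

# value() blocks until the corresponding future has finished.
p_values <- vapply(futures, value, numeric(1))
print(p_values)

# Return to sequential evaluation and release the background workers.
plan(sequential)
```

Creating all futures first and collecting the values afterwards lets the tests run concurrently instead of one after another.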


#### The htmlwidget d3 plot can be found here:

https://github.com/Kohze/Discovr_graphics

Overview

### The main module:

disc(x, method = "unPaired", preset = NULL)

Here x is a data.frame or data.table, and method is either "paired" or "unPaired".

The main module generates the interactive graphical cluster as a plot. Clicking a dot opens the bubble chart for the respective test. The colored lines indicate the output of each test and which test is the most preferable: green stands for preferable, grey for neutral, red for not applicable. Hovering over a dot displays the name of the related test in the area on the right.

[Image: interactive graphical cluster plot]

Here are three examples of bubble plots. The bubble size reflects the output of each test in the test set. Hovering over a bubble shows the exact value as a tooltip. The bubbles are generated with the D3 pack layout.

[Image: three example bubble plots]

### The data module:

disc.data(x, method = "unPaired", preset = NULL)

The data module extracts all the data frames that the graphical output of the main module relies on, and returns them as a list of data frames. The f.test is shown here as an example; all other tests are available as well.

[Image: example f.test data frame]
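
To make the idea of "a list of data frames, one per test" concrete, here is a minimal base R sketch. The helper `pairwise_p()` is hypothetical, and the pairwise column-by-column layout is an assumption; the actual structure returned by disc.data() may differ.

```r
# Hypothetical helper, not part of Discovr: p-values of a two-sample test
# for every pair of columns, returned as a data frame.
pairwise_p <- function(x, test) {
  cols <- names(x)
  p <- outer(
    seq_along(cols), seq_along(cols),
    Vectorize(function(i, j) test(x[[i]], x[[j]])$p.value)
  )
  dimnames(p) <- list(cols, cols)
  as.data.frame(p)
}

df <- mtcars[, c("mpg", "hp", "wt")]

# One data frame per test, collected in a named list (assumed layout).
tables <- list(
  f.test = pairwise_p(df, var.test),
  t.test = pairwise_p(df, t.test)
)

print(names(tables))
print(tables$f.test)
```

A named list keeps each test's table addressable by name, for example `tables$f.test`.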

### The shapiro module:

disc.normal(x)

The shapiro module applies the Shapiro-Wilk normality test to every column and adds a TRUE / FALSE flag next to each result.

[Image: example output of the shapiro module]
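
The same idea can be sketched in base R. The helper `normality_table()` below is hypothetical and is not the implementation of disc.normal(); the 0.05 threshold and the meaning of TRUE (normality not rejected) are assumptions for this example.

```r
# Hypothetical helper, not part of Discovr: Shapiro-Wilk test per column.
normality_table <- function(x, alpha = 0.05) {
  x <- as.data.frame(x)  # also accepts a data.table
  numeric_cols <- x[vapply(x, is.numeric, logical(1))]
  p <- vapply(numeric_cols, function(col) shapiro.test(col)$p.value, numeric(1))
  data.frame(
    column  = names(p),
    p_value = unname(p),
    normal  = unname(p > alpha),  # TRUE: normality not rejected at level alpha
    stringsAsFactors = FALSE
  )
}

result <- normality_table(mtcars)
print(result)
```

Note that shapiro.test() requires between 3 and 5000 non-missing values per column.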

Presets

Presets are available for the following methods:

  • FRAP
  • FACS

Statistical Tests

  • Welch's t-test
  • Student's t-test
  • Shapiro-Wilk test
  • Correlation test
  • ANOVA
  • Wilcoxon test
  • Principal component analysis
  • F-test
  • glm AIC summary
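
For orientation, each of the listed tests has a counterpart in base R's stats package. The sketch below runs them on mtcars. Which columns and groups Discovr actually compares, and the model formulas used here, are assumptions for illustration only.

```r
# Base R (stats) counterparts of the listed tests; the grouping is an assumption.
a <- mtcars$mpg[mtcars$am == 0]  # automatic transmission
b <- mtcars$mpg[mtcars$am == 1]  # manual transmission

results <- list(
  welch    = t.test(a, b, var.equal = FALSE)$p.value,  # Welch's t-test
  student  = t.test(a, b, var.equal = TRUE)$p.value,   # Student's t-test
  shapiro  = shapiro.test(mtcars$mpg)$p.value,         # Shapiro-Wilk test
  cor      = cor.test(mtcars$mpg, mtcars$wt)$p.value,  # correlation test
  anova    = summary(aov(mpg ~ factor(cyl), data = mtcars))[[1]][["Pr(>F)"]][1],
  wilcoxon = wilcox.test(a, b, exact = FALSE)$p.value, # Wilcoxon rank-sum test
  f_test   = var.test(a, b)$p.value                    # F-test of variances
)

# Principal component analysis on scaled columns.
pca <- prcomp(mtcars[, c("mpg", "hp", "wt")], scale. = TRUE)

# AIC of a generalized linear model (gaussian family by default).
glm_aic <- AIC(glm(mpg ~ wt + hp, data = mtcars))

print(results)
print(summary(pca))
print(glm_aic)
```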

Installation

if (!require("devtools")) install.packages("devtools")

install.packages("jsonlite")
install.packages("future")
install.packages("Rcpp")
devtools::install_github('ramnathv/htmlwidgets')
devtools::install_github("Kohze/Discovr_graphics")
devtools::install_github("Kohze/Discovr")

Example

After installing Discovr and loading it with library(), run:

disc(mtcars)

Coming Next

  • coloring for each bubble depending on significance level
  • adding other interactive plot ideas and features with d3

Known Issues

  • In RStudio on macOS, the htmlwidget is not yet displayed with the correct dimensions.
