GithubHelp home page GithubHelp logo

bymochimomo / fdh Goto Github PK

View Code? Open in Web Editor NEW

This project forked from asemchishen/fdh

0.0 0.0 0.0 6.09 MB

Fake Data Hub is an open project for creating real looking data sets and schemas generated based on desired statistics

License: GNU General Public License v3.0

R 68.25% PLSQL 31.75%

fdh's Introduction

Fake Data Hub

Fake Data Hub is an open project for creating real looking data sets and schemas generated based on desired statistics. All code made in R!

Preface

In the modern world, we have various powerful tools to analyze data. Many of them promise a lot of advanced functionality for automatic insights, data mining, and ML. A problem comes with data: while tools do claim that they are capable to find insights in data, real-world data does not obligatory contain these insights. There are a lot of reasons for that: either we have not enough data or the data we have does not represent the population or phenomenon we are looking for, or life appear to work not the way we expected:)

To overcome this issue I decided to start a little project - Fake Data Hub. The idea is as simple as that: to get something out of the data you first need to put something in! The ultimate goal is to be able to generate any reasonable amount of fake data that looks almost like real and statistically shaped according to our needs.

Contents

  1. Customers folder contains support datasets and generator code to produce people demographic and personal data. Also readydata subfolder contains generated ready-to-use data in .csv format.
  2. Products folder contains support datasets and generator code to produce retail product data (partnumber, prod. categories, name, price etc). Also readydata subfolder contains generated ready-to-use product data in .csv format.
  3. classification_binary folder contains genetator of binary descision Y/N to buy something based on customers data. Readydata subfolder contains ins_sales(1.5k) for 1.5k random customers and their classification based on Ins_buyers_logic.R genrator.

fdh's People

Contributors

asemchishen avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.