GithubHelp home page GithubHelp logo

oferg's Projects

ace icon ace

Python package for performing the Alternating Conditional Expectation (ACE) regression

amldataprepdocs icon amldataprepdocs

Documentation for Microsoft Azure Machine Learning Data Preparation

arrow icon arrow

Better dates & times for Python

auto_ml icon auto_ml

Automated machine learning for analytics & production

automl-gs icon automl-gs

Provide an input CSV and a target field to predict, generate a model + code to run it.

azure-tdsp-utilities icon azure-tdsp-utilities

Utilities and scripts developed as part of Microsoft's Team Data Science Process for productive data science

bigmatch_utilities icon bigmatch_utilities

BigMatch is a record linkage software that was developed for the US Census Bureau. BigMatch is a matching engine without a graphical user interface (GUI). It executes based on two parameter files, conventionally named parmf.txt and parmn.txt. After execution, the user must manually open the results files (possible matches and weights) and decide which possible matches to accept. The purpose of the bigmatch_utilities code repository is to provide a code base for GUI and shell script tools to make BigMatch more user-friendly and less error prone.

bowtie icon bowtie

:bowtie: Create a dashboard with python!

brew icon brew

brew: Python Ensemble Learning API

caravel icon caravel

Caravel is a data exploration platform designed to be visual, intuitive, and interactive

cmicot icon cmicot

Efficient feature selection method based on Conditional Mutual Information.

company-standard icon company-standard

The standardization is a process to make data compatible. This code is addressed to company names standardization. This is normally the first step before a linkage or deduplication records process.

dask-ec2 icon dask-ec2

Start a cluster in EC2 for dask.distributed

data-science-utils icon data-science-utils

Some wrappers around python modules for simplifying the data exploration process.

datacleaner icon datacleaner

A Python tool that automatically cleans data sets and readies them for analysis.

dataprep icon dataprep

Sandpit for exploring Microsoft's data preparation SDK.

deep_architect icon deep_architect

DeepArchitect: Automatically Designing and Training Deep Architectures

deltapy icon deltapy

DeltaPy - Tabular Data Augmentation

django-csvimport icon django-csvimport

A generic CSV import tool for django models, imports run via admin upload logging model or custom command

duke icon duke

Duke is a fast and flexible deduplication engine written in Java

facets icon facets

Visualizations for machine learning datasets

falconn icon falconn

FAst Lookups of Cosine and Other Nearest Neighbors

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.