GithubHelp home page GithubHelp logo

george-jiexiong / multi-armed-bandit Goto Github PK

View Code? Open in Web Editor NEW

This project forked from abdullahkhan93/multi-armed-bandit

0.0 0.0 0.0 6.05 MB

A basic implementation of techniques to solve the Multi-Armed bandit (MAB) problem from the context of a marketing strategy. A couple of techniques namely the Epsilon-Greedy Approach, Upper Confidence Bound (UCB), Gradient Ascent and Thompson Sampling have been used to analyze choosing the best website in terms of receiving a click.

Jupyter Notebook 100.00%

multi-armed-bandit's Introduction

Multi-Armed Bandit Project for Research Project in Data Science course 2018 at Aalto Univeristy

Introduction

Multiarmed Bandit (MAB) problems can be categorized as sequential resource allocation tasks, where one or more resources must be chosen wisely and efficiently allocated among competing projects. This must be typically performed in such a way so as to maximize the overall expected gain. The main dilemmain these particular problems is to either naturally choose between possible paths that yield instantly the maximum gain currently (exploitation) or sacrifice current gain over better future gains (exploration). Since strategies for these problems adequately represent a subsection of reinforcement learning methods, the ultimate objective is to achieve the most appropriate balance between exploration and exploitation, consequently maximizing the overall rewards.

Objective

In this project, the goal is to implement two basic MAB techniques namely the Epsilon-Greedy approach and Thompson Sampling and do a comparative analysis of how these techniques perform across multiple experiments. The problem is to choose a website among many which guarantees the best overall reward. Here, the reward refers to the total number of clicks gained by the website across all trials.

Author(s)

Muhammad Abdullah Khan

multi-armed-bandit's People

Contributors

abdullahkhan93 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.