This project forked from kavignon/ai-and-ml-made-functional

Create AI/ML systems in the cloud through Azure Notebooks, F# & .NET Core with MLOps environments

Home Page: https://dev.azure.com/kevinoavignon/AI-and-ML-made-functional

AI & ML made functional

FAQ

Why use F# for data science projects?

F# is a functional-first programming language. It is well suited to data science projects for the following reasons:

  • Efficient execution
  • Conciseness
  • Strict type system
  • Data access capabilities
  • Scalability

Over time, F# has proven to be a powerful tool, and Microsoft is now investing more time and effort to showcase it as an excellent choice for machine learning.

"Finally, we're also devoting significant time in developing a compelling offering for using F# to do machine learning. In addition to being supported on ML.NET, we're working towards a world-class experience when using F# and TensorFlow," Microsoft said.

What is the scope of the repository?

The scope of the repository is broad. It is aimed at both beginners and intermediate practitioners in the following areas:

  • Machine Learning
  • Functional Programming
  • DevOps

The purpose of the repository is to look at machine learning through a different lens. Building on skills you may already have from working with the .NET Framework (or .NET Core), you will learn enough of the data science space to handle tasks such as:

  • Feature engineering
  • Data normalization
  • Setting up your environment
  • Using a Jupyter Notebook
  • Evaluating your model
  • Training your model

What kind of projects can be consulted in the repository?

The repository draws on past and current machine learning competitions hosted on Kaggle. For those unfamiliar with it, Kaggle is the world's largest community of data scientists. It offers courses on data science and the chance to compete on machine learning projects, either to put your skills to the test or to reach the podium and win cash prizes.

You'll also find notes on machine learning courses that I take online through resources such as Coursera or Udacity. The vast majority of those courses use Python, but I'll do my best to refactor the code samples so that they showcase only F#.

Why should we go for automation in data science?

As in any field of software engineering, we want to get the most value for our effort. Here, that means avoiding repetitive tasks so you can spend your time on things that matter, rather than manually normalizing a portion of your data for the tenth time in the same day.
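As a rough illustration of turning that repeated normalization step into something reusable, here is a minimal sketch. It is written in Python for brevity (the repository itself targets F#), it uses min-max scaling as one assumed choice of normalization, and the function names `min_max_normalize` and `normalize_columns` are hypothetical rather than taken from the repository.

```python
from __future__ import annotations

from typing import Sequence


def min_max_normalize(values: Sequence[float]) -> list[float]:
    """Linearly rescale values into the range [0, 1]."""
    if not values:
        return []
    lo, hi = min(values), max(values)
    if hi == lo:
        # A constant column has no spread to rescale; map everything to 0.0.
        return [0.0 for _ in values]
    span = hi - lo
    return [(v - lo) / span for v in values]


def normalize_columns(table: dict[str, Sequence[float]]) -> dict[str, list[float]]:
    """Apply min-max scaling to every column of a column-oriented table."""
    return {name: min_max_normalize(column) for name, column in table.items()}


if __name__ == "__main__":
    # Hypothetical toy data, not from any Kaggle competition.
    data = {"age": [20.0, 30.0, 40.0], "height": [150.0, 175.0, 200.0]}
    print(normalize_columns(data))
```

Once the step lives in a function like this, a pipeline can call it on every run instead of someone repeating it by hand.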

How can you get started today for AI/ML projects with F# and Azure?

To get started, you can create a Jupyter Notebook environment in the cloud with Azure Notebooks and use F# as your programming language. Once you've created your first notebook, you can begin your first machine learning project.

To set up your automation and your pipeline(s), go to Azure DevOps, which this repository uses to set up the MLOps environment.

(As I become more familiar with MLOps setups, I will expand this readme to describe what a complete MLOps environment requires.)

What is MLOps and how does its vision differ from day-to-day DevOps?

MLOps empowers data scientists and app developers to help bring ML models to production. MLOps enables you to track / version / audit / certify / re-use every asset in your ML lifecycle and provides orchestration services to streamline managing this lifecycle.

  • Data/model versioning != code versioning - how to version data sets as the schema and origin data change
  • Digital audit trail requirements change when dealing with code + (potentially customer) data
  • Model reuse is different than software reuse, as models must be tuned based on input data / scenario.
  • To reuse a model you may need to fine-tune / transfer learn on it (meaning you need the training pipeline)
  • Models tend to decay over time & you need the ability to retrain them on demand to ensure they remain useful in a production context.

(This comes from here)
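To make the first point above more concrete, one possible way to version a data set separately from the code is to derive an identifier from its schema and contents, so that a change to either produces a new version. The sketch below is in Python for brevity (the repository itself targets F#). The function name `dataset_version` and the choice of a truncated SHA-256 digest are assumptions made for illustration, not something prescribed by the source or by any MLOps service.

```python
import hashlib
import json


def dataset_version(rows, schema):
    """Return a short, deterministic identifier for a data set.

    The identifier changes whenever the schema or any row changes,
    independently of the version of the code that processes the data.
    """
    # Canonical JSON (sorted keys, fixed separators) keeps the hash stable
    # across runs for the same logical content.
    payload = json.dumps(
        {"schema": schema, "rows": rows},
        sort_keys=True,
        separators=(",", ":"),
    )
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()[:12]


if __name__ == "__main__":
    # Hypothetical toy data set.
    schema = {"age": "int", "label": "str"}
    rows = [{"age": 31, "label": "yes"}, {"age": 45, "label": "no"}]
    print(dataset_version(rows, schema))
```

Recording such an identifier alongside each trained model is one simple way to start an audit trail of which data produced which model.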

Recommended resources
