
getting-started-github-apps's Introduction

Nichita Diaconu - Machine Learning Engineer

GitHub | LinkedIn | Google Scholar

Profile

I am a senior engineer with balanced experience in applied and theoretical machine learning. My focus is on designing and integrating robust ML solutions, with particular expertise in natural language processing, cloud technologies, and software engineering practices. I continuously look for ways to deepen my technical understanding, with the goal of specializing further in ML technologies.

Areas of Expertise

  • Languages and Tools: Python, Ruby, CUDA C, Git, Linux (Bash).
  • SQL & NoSQL: PostgreSQL, Elasticsearch, BigQuery.
  • Machine Learning: spaCy, PyTorch, scikit-learn, PyTorch Lightning, MLflow, Optuna, HuggingFace, Ray, PySpark.
  • LLM: LangChain, LlamaIndex, OpenAI, llama, pgvector, LangFuse.
  • Model Management and Deployment: VertexAI, BentoML, TorchServe.
  • Web Development / CI/CD: FastAPI, SQLAlchemy, Ruby on Rails, GitHub Actions.
  • Cloud Technologies and Infrastructure: GCP, Docker, Kubernetes, Cloud Run, Pulumi, HTTP REST, Message Queues.

Professional Experience

Senior Machine Learning Engineer

Klue | Vancouver, Canada (01/2023–Present)

  • Lead projects involving multiple teams, ensuring project goals align with business needs and identifying potential blockers early.
  • Build an LLM application with prompt management and Retrieval-Augmented Generation (RAG), improving user engagement by 25% and reducing the time to value by 49%.
  • Implement a news-grouping solution that reduces content overlap by 40%, developing a custom online clustering algorithm and ensuring data consistency across distributed systems.
  • Develop a subscription-based scraping service, reducing manual labor by 81%.
  • Create a pipeline for transcript annotation using WhisperX, cutting transcription costs by 46% and increasing accuracy by 5%.

Machine Learning Engineer

Klue | Amsterdam, Netherlands (01/2020–12/2022)

  • Designed and implemented the document pipeline, processing hundreds of thousands of news items daily for thousands of clients.
  • Developed a rule-based management & filtering system using boolean logic, providing configurable filtering rules.
  • Created a junk filtering solution based on a custom Named Entity Recognition model and a tree-based classification model. Compared labeling services, created custom datasets, and trained models, resulting in a 93% reduction in irrelevant content.
  • Built a topic tagging system using multi-class, multi-label deep learning models, increasing user retention by 21%.
  • Oversaw data handling, metrics definition, model tuning, and deployment, while monitoring performance across all projects.

Research Intern

Philips Lab | Amsterdam, Netherlands (02/2019–09/2019)

  • Improved parameter efficiency and robustness in vision models, culminating in the paper "Affine Self Convolution."

Computer Vision Intern

Bosch | Cluj-Napoca, Romania (02/2016–05/2016)

  • Improved the scale sensitivity of an existing object detection solution by implementing techniques inspired by the "Fastest Pedestrian Detector in the West" paper. Learned about scale invariance and object detection.

Publications

  • Learning to Convolve (Main Author): ICML 2019 - Amsterdam, Netherlands (02/2019)
    Developed a method to learn transformations in group convolutions, enabling filters to adapt to transformations and reducing sensitivity to input rotations.

  • Affine Self Convolution (Main Author): Philips Lab - Amsterdam, Netherlands (11/2019)
    Introduced Affine Self Convolution, which uses data-dependent filters for image processing, combining self-attention with convolution to improve accuracy and parameter efficiency.

Education

  • Master's Degree in Artificial Intelligence: University of Amsterdam - Amsterdam, Netherlands (2017–2019)
    Specialized in AI topics, including machine learning, deep learning, computer vision, and natural language processing.

  • Bachelor's Degree in Computer Science: Babes-Bolyai University - Cluj-Napoca, Romania (2014–2017)
    Acquired foundational knowledge in core computer science principles, algorithms, data structures, and data analysis.

Online Courses & Certifications

  • Machine Learning by Andrew Ng (2016) - Coursera
  • CNNs for Visual Recognition by Andrej Karpathy, CS231n (2016) - Stanford University
  • Heterogeneous Parallel Programming, CUDA Programming by Wen-mei Hwu (2016) - Coursera

Skills

  • General: data-oriented, problem-solving, research methodology, effective communication, agile.
  • Fundamental: data structures and algorithms, model development, model evaluation, feature engineering, computer vision, natural language processing, recommender systems, data preprocessing, online inference, batch processing, distributed computing, object-oriented programming, system design.
  • LLM Apps: prompt management, prompt engineering, retrieval-augmented generation (RAG), few-shot learning, zero-shot learning, semantic search, keyword search, transformers.


getting-started-github-apps's Issues

Getting started with GitHub Apps

Welcome to Getting started with GitHub Apps

We're going to work together to learn about GitHub Apps! We'll do this in this repository, which is a collection of resources - games to be specific. Before we start using real GitHub applications, let's get some foundations out of the way.

New to GitHub?

For this course, you'll need to know how to create a branch on GitHub, commit changes using Git, and open a pull request on GitHub. If you need a refresher on the GitHub flow, check out the Introduction to GitHub course.

Application vs GitHub Application

There are many types of applications, like the apps you might have on your phone. In this course, we'll focus on GitHub Applications, apps designed to work on GitHub.

GitHub Apps allow you to change GitHub's surface-level functionality through webhooks and GitHub's APIs. GitHub Apps act as top-layer tools that you can use to customize and automate routine processes.
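To make the webhook side a little more concrete, here is a minimal sketch of one thing an app's server usually does when GitHub delivers an event: checking that the delivery really came from GitHub. GitHub signs each webhook payload with HMAC-SHA256 using the secret configured for the app and sends the result in the `X-Hub-Signature-256` header as `sha256=<hex digest>`. The function names, the secret, and the payload below are made up for illustration; this is not code from any of the apps used in this course.

```python
import hashlib
import hmac


def sign_payload(secret: str, payload: bytes) -> str:
    """Return a signature string in the form GitHub uses: 'sha256=<hex digest>'."""
    digest = hmac.new(secret.encode("utf-8"), payload, hashlib.sha256).hexdigest()
    return f"sha256={digest}"


def is_valid_signature(secret: str, payload: bytes, signature_header: str) -> bool:
    """Check a received X-Hub-Signature-256 header against the raw request body."""
    expected = sign_payload(secret, payload)
    # compare_digest avoids leaking information through comparison timing.
    return hmac.compare_digest(expected, signature_header)


if __name__ == "__main__":
    # Hypothetical values: a real secret is chosen when the app is registered,
    # and a real payload is the raw body of the incoming HTTP request.
    secret = "example-webhook-secret"
    body = b'{"action": "closed", "issue": {"number": 1}}'

    header = sign_payload(secret, body)  # stands in for the header GitHub would send
    print(is_valid_signature(secret, body, header))         # untouched body
    print(is_valid_signature(secret, body + b" ", header))  # tampered body
```

The check has to run on the raw request bytes, before any JSON parsing, because re-serializing the payload can change whitespace and invalidate the signature.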

Please note that GitHub Apps are very different from OAuth Apps. The two have different installation flows and different rules for how they handle data and access. If you'd like to learn more, check out the documentation on the differences between GitHub and OAuth Apps.

Step 1: Get familiar with GitHub Apps

To learn more about GitHub Apps, you will install two on this repository. Make sure to install the apps on this repository only. Don't apply them to everything you have access to.

⌨️ Activity: Install your first apps 🎉

  1. Install the WIP app on this repository
  2. Install the Request Info app on this repository
  3. Close this issue

After installing, close this issue and look for my response below.

Sometimes I respond too fast for the page to update! If you perform an expected action and don't see a response from me, wait a few seconds and refresh the page for your next steps.
