GithubHelp home page GithubHelp logo

πŸ‘‹ Hi, I'm Annthomy GILLES

🌎 Location: Montréal, Québec, Canada
πŸ”— LinkedIn: https://www.linkedin.com/in/annthomygilles
πŸ“– Fundamentals of Data Engineering (∼50%)

πŸ“š About Me

Senior Data Consultant and Data Scientist with extensive experience in Information Management and Data Analytics. Adept at designing and implementing innovative solutions across various industries, including government, automotive, and IT consulting. Proficient in Python, R, Big Data technologies, and Machine Learning. Passionate about leveraging data to drive actionable insights, improve processes, and support decision-making.

My Articles

ChatGPT: Reflet du bullshit en entreprise
Comprendre les mΓ©tiers de la Data le temps d’une pause cafΓ©.
Is your organization TRULY data-driven? 12 questions to find out!
Le Temps GuΓ©rit Tout. ExceptΓ© Le Mauvais Code.

Current side projects

Dashboard Project - Private πŸ“ŠπŸ’»

  • Building a dashboard connected to a database using Flask, mySQL, and web scraping.
  • Implemented automatic notifications sent to Discord and Telegram.

WhatsApp Integration - PrivateπŸ“±πŸ’¬

  • Building a Python pipeline connected to a database using Flask,
  • MongoDB, and Docker. Implemented API integration with WhatsApp for automated messaging.

Weather Data Aggregation with Kafka - Public ☁️🌑️

  • Building a project to scrape weather data from different APIs.
  • Experimenting with Kafka to aggregate the data.
  • Integrating Spark for data analysis and processing.
  • Project is focused on learning Kafka and expanding knowledge of big data technologies.

πŸ’Ό Experience

πŸ‡¨πŸ‡¦ KPMG Canada (Oct 2022 - Present): Senior Consultant, Information Management & Data Analytics

πŸ‡§πŸ‡ͺ Belgian Government (Mar 2021 - Oct 2022): Data Scientist

  • Worked on a graph-based modelling project for COVID-19 infection spread and management.
  • Gained experience with Neo4j, ElasticSearch, PostgreSQL, MongoDB, Prefect, Dask, Python, Apache Airflow, Unit testing, CI/CD, JIRA, Agile, API, Pandas, and scikit-learn.

πŸ‡§πŸ‡ͺ Toyota Motor Corporation (Feb 2020 - Nov 2020): Data Scientist Consultant

  • Worked on a DataOps project to clean and prepare data from car sensors for R&D use cases.
  • Gained experience with AWS services, Dask, Python, multi Unit testing, CI/CD, JIRA, Agile, API, Pandas, and scikit-learn.

πŸ‡§πŸ‡ͺ Positive Thinking Company (Oct 2019 - Oct 2022): Data Scientist

  • Developed an automated tool for resume classification and summarization using NLP techniques.
  • Gained experience with Python, R, Shiny, MongoDB, TFIDF, word2vec, doc2vec, Random Forest, XGboost, and Docker.

πŸ‡«πŸ‡· Devoteam (Feb 2019 - May 2019): Data Consultant

  • Built a comprehensive web app dashboard for employee management and tracking.
  • Gained experience with Google Cloud Platform (GCP), Docker, web development, Firebase, R, JavaScript, MongoDB, Git, HTML, and CSS.

πŸ‡«πŸ‡· bioMerieux ( Sept. 2016 - Sept. 2018): Data Scientist

  • Worked on a decision support system for improving doctors prescribing behavior during infectious disease.
  • Gained experience with Python, R, inferential statistics, machine learning, dimensionality reduction, business intelligence, metagenomics, differential abundance analysis, nanopore technology, and SQL.

πŸ‡©πŸ‡ͺ Max Planck Institute (Mar. 2016 - Aug. 2016): Computational Biologist

  • Developed a differential gene expression analysis workflow using Python, shell, and R languages.
  • Gained experience with Tuxedo suite, DeSEQ2, MEME suite, GATK, Picards-tools, Stringtie, Go enrichment, variant calling, and differential expression.

πŸ‡«πŸ‡· Merial, a Sanofi Company (Mar. 2015 - Sept. 2015): Biological Engineer

  • Characterized virulence factors and vaccine targets of a bacterial canine pathogen.
  • Gained experience with cell culture techniques, flow cytometry, genetic engineering, northern and western blotting, fluorescent and confocal microscopy, and PCR.

Skills

Category Skills
Programming 🐍 Python, R, πŸ’» Shell/Bash/Command line
Databases πŸƒ MongoDB, πŸ—ƒοΈ SQL, πŸ”— Neo4J
Statistics & Machine Learning πŸ”¬ Inferential Statistics, πŸ“ˆ Hypothesis testing, πŸ“Š Regression methods, πŸ”„ Correlation, πŸ“‰ Descriptive Statistics, 🚦 Markov model, 🌐 Dimensionality reduction, 🧩 Clustering, 🌳 Decision tree, 🧠 KNN, πŸŽ„ SVM, 🌱 Random forest
Tools 🧰 Git, πŸ“Š Matplotlib, πŸ”’ Numpy, 🐼 Pandas, πŸƒ Pymongo, πŸ”¬ Scipy, πŸ€– Scikit-learn, 🌊 Seaborn, πŸ”— SQLalchemy
Web Development 🌐 HTML5/CSS3, πŸ’» Javascript, Typescript, NestJs, Prisma, 🌢️ Flask
Environment πŸ’» High Performance Computing, 🐧 Linux
Data Science πŸ› οΈ Data Engineering, πŸ§‘β€πŸ’Ό Data Governance, πŸ“ˆπŸ“‰πŸ“Š Big Data, πŸ€– Machine Learning, πŸ“Š Data Analytics, πŸƒMongoDB, 🐳 Docker, πŸ—ƒοΈ PostgreSQL, ☁️ Amazon Web Services (AWS), πŸ“ˆ JIRA, 🌐 Web Development, πŸ§‘β€πŸ”¬ NLP

🏫 Education

University of Rouen Normandie

Master in Bioinformatics and Statistics (2015 - 2018)

  • Three-year Research & Professional Master's Degree in Bioinformatics, Statistics and Mathematics.
  • Curriculum covers management, processing, and analysis of sequences and massive data.
  • Data science: supervised learning (Regression, Decision Tree, Random Forests, Markov Chains, SVM, KNN, Neural Network) and unsupervised learning (KNN, K-means, CAH)

University of Poitiers

Master's Degree in Bioengineering and Biomedical Engineering (2013 - 2015)

  • Interdisciplinary education in biomedical research and engineering program from various backgrounds including bioengineering, cell and molecular biology, oncology, pharmacology, genetics, and microbiology.

University of the French West Indies and Guiana

Bachelor's Degree (Licence) in Biochemistry and Biology (2010 - 2013)

  • Curriculum covers biochemistry, cellular & molecular biology, immunology, physiology, biological statistics, organic chemistry.

GILLES Annthomy's Projects

auto-gpt icon auto-gpt

An experimental open-source attempt to make GPT-4 fully autonomous.

doctorgpt icon doctorgpt

DoctorGPT is an LLM that can pass the US Medical Licensing Exam. It works offline, it's cross-platform, & your health data stays private.

handson-ml icon handson-ml

A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in python using Scikit-Learn and TensorFlow.

learning_nestjs icon learning_nestjs

I am leanring LearnJS, prisma and ReactJS (soon) by building an app

llms-in-production icon llms-in-production

Building an end-to-end production-ready LLM & RAG system using LLMOps best practices

privategpt icon privategpt

Interact privately with your documents using the power of GPT, 100% privately, no data leaks

rest_api_training icon rest_api_training

A Flask app in python having 2 endpoints: 1) GET method that returns the timestamp 2) POST method that performs calculation (sum, substract, divide, product) between 2 numbers.

rna-dna-seq icon rna-dna-seq

Repo hosts scripts for differential expression and variant calling analysis of high throughput sequencing data.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.