chrishayduk Goto Github PK

followers: 43.0 following: 11.0 repos: 46.0 gists: 2.0

Name: Chris Hayduk

Type: User

Location: NY

Blog: http://www.linkedin.com/in/chrishayduk

Hi there, I'm Chris 👋

I'm currently the Lead ML Engineer for Drug Discovery at Deloitte, where I work to develop, orchestrate, and deploy deep learning models to accelerate pharmaceutical research and development.

In my free time, I build large language model (LLM) powered tools, develop open source LLM models and datasets, and contribute to LLM research projects.

💻 Github Projects

I've worked on several LLM-focused projects featured on GitHub:

QLoRA for Masked Language Modeling - Updated QLoRA for use with the masked language modeling objective, enabling efficient finetuning of BERT-family models
Multi-GPU QLoRA - Updated QLoRA to allow for distributed data parallel finetuning, significantly accelerating finetuning workloads
Athena.ai (work in progress) - Leveraged GPT-4 and ChromaDB to create a personal knowledge management and chat tool
LLaMA Thought Cloning (work in progress) - A LLaMA-based repreoduction of "Thought Cloning: Learning to Think while Acting by Imitating Human Thinking", demonstrating that a single open source LLM can be used as a world model and reinforcement learning agent

🤗 HuggingFace Projects

I have also open sourced some of my LLM models and data on HuggingFace:

📈 Datasets

ChrisHayduk/Llama-2-SQL-and-Code-Dataset - Curated a SQL-focused code instruction set for LLaMA 2. The eval set includes dummy tables so that the trained model can be evaluated for SQL execution accuracy rather than token prediction accuracy. The dataset was processed in a number of ways, including introducing curriculum learning, fixing table inputs, and instruction filtering.

🚀 Models

ChrisHayduk/OpenGuanaco-13B - Created an open source recreation of Guanaco using OpenLLaMA.

📫 How to Reach Me

Twitter: https://twitter.com/chris_hayduk1

Chris Hayduk's Projects

advanced-calculus-ii

Homework assignments from my Advanced Calculus II course

advanced-mathematical-statistics

Completed assignments from my graduate Advanced Mathematical Statistics course

an-introduction-to-statistical-learning

an-introductory-course-in-computational-neuroscience

Completed tutorials and questions from the book "An Introductory Course in Computational Neuroscience" by Paul Miller

athena.ai

An AI-powered knowledge management and chat tool

bayesian-analysis

Assignments from my Bayesian Analysis independent study course

beacon-test

iPhone app to test the functionality and data analysis of an Estimote Beacon

bitcoin-true

Bitcoin fork with increased blocksize and ASIC resistance

:exclamation: This is a read-only mirror of the CRAN R package repository. ChannelAttribution — Markov Model for the Online Multi-Channel Attribution Problem. Homepage: http://www.slideshare.net/adavide1982/markov-model-for-the-multichannel-attribution-problem http://www.lunametrics.com/blog/2016/06/30/marketing-channel-attribution-markov-models-r/ http://analyzecore.com/2016/08/03/attribution-model-r-part-1/