GithubHelp home page GithubHelp logo

Hello, I'm MinWoo(Daniel) Park

Linkedin Badge Hits

  • I am a passionate developer adept at leveraging Machine Learning and Deep Learning technologies to address challenges across diverse domains.
  • My extensive experience and knowledge in fields such as LLM, Natural Language Processing, Computer Vision, and the medical and healthcare sectors have enabled me to connect the dots and craft sophisticated solutions for a broad range of industry problems.
  • Currently, I am focusing on advancing the field ofthe state-of-the-art LLM (Language Model) technology.
  • I believe that all the knowledge I have gained will converge at the end of my journey.


For more details

trophy

Contents

On 2023-11-07, I have decided to make the majority of repositories and Hugging Face models private.



Large Language Model

  • The code for LLM projects will remain private. Due to ethical issues, the model's performance will be disclosed once it is verified after development.
  • sLLM, Jindo: Jindo is a relatively small sLLM that includes various experiments. It aims to develop multi-modal and domain-specific highly personalized models, but it is not recommended for general use as it is primarily used for experiments.
  • GORANI: The project is actively underway. GORANI is being developed as an English language model for comparison with other LLMs and to assess its technical capabilities. It is planned to be distributed under a research-purpose license.
  • KORANI: KORANI is a Korean-specific LLM developed based on Jindo and GORANI's accumulated technology. It is based on the 13B Llama2 chat, transformed into an LLM, with the goal of making it available under a commercial license.

Huggingface

Link: https://huggingface.co/danielpark

Project Title Backbone Description
ko-llama-2-jindo-7b-instruct LLaMA2-7b Korean LLM model efficiently fine-tuned with QLoRA (Efficient Finetuning of Quantized LLMs)
ko-llama-2-jindo-13b-instruct LLaMA2-13b Korean LLM model efficiently fine-tuned with QLoRA
ko-llama-2-jindo-7b-instruct-ggml LLaMA2-7b Model weights transformed through GGML(Generic Graph Machine Learning) to efficiently perform inference using GPU and CPU.
ko-llama-2-jindo-7b-instruct-4bit-128g-gptq LLaMA2-7b Model weights using LLaMA2 as the backbone, one-shot weight quantized with GPTQ(Accurate Post-Training Quantization for Generative Pre-trained Transformers) to increase inference speed.

Projects

Project Description Repo
Bard API Interfaces with Google Bard API to retrieve responses. GitHub
Amazing Bard Prompts Includes curated Google Bard prompts for enhanced utilization. GitHub
ExceptNotifier Enriches try-except with comprehensive error messages. GitHub
Co Coder Python package that treamlines error debugging from Chat GPT and Google Bard. GitHub
GPT BERT Medical QA Chatbot Research repository focused on GPT 2 fine-tuning for medical domain. GitHub
Korean news topic classification using KO BERT Classifies Korean news articles into eight categories using fine-tuned Korean BERT. GitHub
Multi-objective recommender Recommendation system leveraging user behavior data for improved accuracy. GitHub

Packages

Pypi link: https://pypi.org/user/archi-park/

Package Description Repo
bardapi The python package that returns Response of Google Bard through API. GitHub
arxiv2text Converting PDF files to text, mainly with a focus on arXiv papers. GitHub
transllm LLMtranslator translates and generates text in multiple languages. GitHub
translang Translation Service API Module. GitHub
catchexception Nightly version of ExceptNotifier GitHub
googlebardapi The python package that returns Response of Google Bard through API. GitHub
cocoder Python package that treamlines error debugging from Chat GPT and Google Bard. GitHub
exceptnotifier With Python's try-except to receive notifications about Errors or Successes in your code through messenger app or email. GitHub
utilfunction The Python package utilfunction wraps and distributes useful functions in an easy-to-use way. GitHub
quickshow Quick-Show provides simply but powerful insight plots GitHub
googledriver The Python package google drive facilitates access to files uploaded to Google Drive. GitHub
youtuber Support tools including crawler, video editing, YouTube API, etc. GitHub
docfilter The Python package docfilter is used to detect and remove inappropriate information from text. GitHub
kmi2122 This dataset includes some macroeconomic indicators for South Korea in 2021-2022. GitHub
corpusshow Corpus-Show makes it easier and faster to visualize corpus through sentence embedding of corpus. GitHub
edanif EDA-NIf creates a dataframe containing meta information of NIfTi files and provides several useful features. GitHub

Dockerhub

Link: https://hub.docker.com/u/parkminwoo91


Work Experience

01 Internal Projects (2017 - 2022)

  • Inflow Analysis/Product Selection/Trend Analysis/Price Trend/Logistics Demand Prediction Model (2017-2018, Recommender System, Natural Language Processing)
  • Analysis of National Health Insurance Service (NHIS) Data and Development of Biological Age Calculation Algorithm, Disease Prevalence Prediction (2020, Machine Learning)
  • Detection of Overhead Wires using Big Data from Korea Electric Power Corporation (KEPCO) (2021, Computer Vision)
  • Development Planning of Food Ingredient Discovery and Characterization Platform (2021, Machine Learning, Natural Language Processing)
  • Software Development for Automating Protein Mechanisms, Interactions, and Molecular Structure Extraction and Analysis from Alzheimer's Disease Papers (2021, Natural Language Processing, Computer Vision)
  • Prediction of Diseases and Physical Vitality based on Animal Metabolite (Fur/Blood) Datasets (2021, Machine Learning, Natural Language Processing)
  • Anomaly Signs Prediction, Health Index Forecast, Gut Microbiome Data Analysis using National Health Insurance Data (2021, Machine Learning, Natural Language Processing)
  • Development Planning for Heat Efficiency and Energy Optimization Algorithms in Sihwabanwol Industrial Complex (2021, Optimization)
  • Development of Automatic Brain Structure Segmentation and Tumor Area Segmentation Model using MRI and CT Images and Skull Extraction Algorithms (2022, Computer Vision)
  • Development of Body Type Classification and 3D Body Shape Change Prediction Model based on Time-Series Korean Body Data Collection (2022, Computer Vision)
  • Algorithm and Deep Learning Model Development for Extracting Drawing Factors from Engineering Drawings (2022, Computer Vision)
  • Algorithm for Visualization and Analysis of Noise Sources, Automation Pipeline for Noise Source Localization and Clustering (2022, Computer Vision)

02 Personal Projects (2022 - 2023)

  • Bard-API: Unofficial Python Package for Fetching Responses from Google Bard (GitHub Star 5.4k, Downloads 379k, 2023, Python Development)
  • ExceptNotifier: Package for Sending Detailed Error Messages to Users via Messenger when Errors Occur in try-except Statements (Downloads 27k, 2023, Python Development)
  • All About LLM: Documentation of Papers and Projects on Large Language Models (2023, LLM)
  • Ko LLaMa2 Jindo: Project Focused on Creating a Korean Natural Language Model, Entire Pipeline Construction and Lightweighting (2023, LLM)
  • GORANI: Multipurpose Korean LLM Development Project based on LLaMA2 (2023, LLM)
  • HF Trans LLM: Translator Project for Multilingual Translation and Text Generation (2023, LLM, Python Development)
  • Korean Open LLM Datasets-chain: Project for Collecting/Processing Korean LLM Datasets (2023, Python Development, LLM)
  • Open LLM Datasets: Compilation of Datasets and Papers Used in Open LLM (2023, Python Development, LLM)
  • Open LLM Leaderboard-report: Visualization of Performance of Open Source LLMs based on Four Metrics for Performance Comparison (2023, LLM)
  • Medical QA Bert Chat GPT: Fine-tuning GPT-2 for Question-Answering in the Medical Domain (2023, LLM)
  • Translang: Translation Service Module Providing API for Language Translation (2023, Python Development)
  • Fine-tuned-korean-bert-news-article-classifier: Model Development for News Article Topic Classification, Comparing BERT Implementations in Various Frameworks (2023, LLM)
  • Multi Objective Recommender: Project to Build a Multi-Objective Recommendation System based on Real E-commerce Sessions (2023, Recommender System)
  • Co Coder: Python Package to Simplify Debugging using OpenAI Chat GPT and Google Bard (2023, Python Development)
  • EDA-Nif: Organizing Metadata of Medical AI Nifti Files and Providing Some Functions such as Image Registration and Arbitrary Slicing (2022, AI for Life)

MinWoo(Daniel) Park's Projects

all-about-llm icon all-about-llm

dsdanielpark's curation and categorization of resources on large language models, along with documentation.

alpaca-cot icon alpaca-cot

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible. 我们打造了方便研究人员上手和使用大模型等微调平台,我们欢迎开源爱好者发起任何有意义的pr!

arpl icon arpl

Automated Redpill Loader

arxiv2text icon arxiv2text

Converting PDF files to text, mainly with a focus on arXiv papers.

bard-api icon bard-api

The unofficial python package that returns response of Google Bard through cookie value.

blog icon blog

A static blog using notion database

co-coder icon co-coder

Co-Coder is a Python package that streamlines error debugging from Open AI chat GPT and Google Bard by providing hints, example code, and relevant Stack Overflow links.

datasets icon datasets

The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

dbrx icon dbrx

Code examples and resources for DBRX, a large language model developed by Databricks

dstack icon dstack

An open-source container orchestration engine for running AI workloads in any cloud or data center. https://discord.gg/u8SmfwPpMd

easylm icon easylm

Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

evals icon evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

exceptnotifier icon exceptnotifier

The Python package ExceptNotifier enhances the try-except statement, allowing you to receive detailed error messages via email or messenger apps.

fastapi icon fastapi

FastAPI framework, high performance, easy to learn, fast to code, ready for production

full-stack-fastapi-template icon full-stack-fastapi-template

Full stack, modern web application template. Using FastAPI, React, SQLModel, PostgreSQL, Docker, GitHub Actions, automatic HTTPS and more.

gemini-api icon gemini-api

The unofficial python package that returns response of Google Gemini through cookie values.

gpt2-bert-medical-qa-chat icon gpt2-bert-medical-qa-chat

Medical domain-focused GPT-2 fine-tuning, optimization, and lightweighting research repository (compared to GPT-4).

gymnasium icon gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

hf-chat-ui icon hf-chat-ui

Open source codebase powering the HuggingChat app

hf-transllm icon hf-transllm

LLMtranslator translates and generates text in multiple languages.

jax icon jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

llama-recipes icon llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

llama3 icon llama3

The official Meta Llama 3 GitHub site

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.