dsdanielpark,MinWoo(Daniel) Park,github

Hello, I'm MinWoo(Daniel) Park

I am a passionate developer adept at leveraging Machine Learning and Deep Learning technologies to address challenges across diverse domains.
My extensive experience and knowledge in fields such as LLM, Natural Language Processing, Computer Vision, and the medical and healthcare sectors have enabled me to connect the dots and craft sophisticated solutions for a broad range of industry problems.
Currently, I am focusing on advancing the field ofthe state-of-the-art LLM (Language Model) technology.
I believe that all the knowledge I have gained will converge at the end of my journey.

For more details

On 2023-11-07, I have decided to make the majority of repositories and Hugging Face models private.

Large Language Model
Huggingface
Projects
Packages
Dockerhub
Work Experience

Large Language Model

The code for LLM projects will remain private. Due to ethical issues, the model's performance will be disclosed once it is verified after development.
sLLM, Jindo: Jindo is a relatively small sLLM that includes various experiments. It aims to develop multi-modal and domain-specific highly personalized models, but it is not recommended for general use as it is primarily used for experiments.
GORANI: The project is actively underway. GORANI is being developed as an English language model for comparison with other LLMs and to assess its technical capabilities. It is planned to be distributed under a research-purpose license.
KORANI: KORANI is a Korean-specific LLM developed based on Jindo and GORANI's accumulated technology. It is based on the 13B Llama2 chat, transformed into an LLM, with the goal of making it available under a commercial license.

Huggingface

Link: https://huggingface.co/danielpark

Project Title	Backbone	Description
ko-llama-2-jindo-7b-instruct	LLaMA2-7b	Korean LLM model efficiently fine-tuned with QLoRA (Efficient Finetuning of Quantized LLMs)
ko-llama-2-jindo-13b-instruct	LLaMA2-13b	Korean LLM model efficiently fine-tuned with QLoRA
ko-llama-2-jindo-7b-instruct-ggml	LLaMA2-7b	Model weights transformed through GGML(Generic Graph Machine Learning) to efficiently perform inference using GPU and CPU.
ko-llama-2-jindo-7b-instruct-4bit-128g-gptq	LLaMA2-7b	Model weights using LLaMA2 as the backbone, one-shot weight quantized with GPTQ(Accurate Post-Training Quantization for Generative Pre-trained Transformers) to increase inference speed.

Projects

Project	Description	Repo
Bard API	Interfaces with Google Bard API to retrieve responses.	GitHub
Amazing Bard Prompts	Includes curated Google Bard prompts for enhanced utilization.	GitHub
ExceptNotifier	Enriches try-except with comprehensive error messages.	GitHub
Co Coder	Python package that treamlines error debugging from Chat GPT and Google Bard.	GitHub
GPT BERT Medical QA Chatbot	Research repository focused on GPT 2 fine-tuning for medical domain.	GitHub
Korean news topic classification using KO BERT	Classifies Korean news articles into eight categories using fine-tuned Korean BERT.	GitHub
Multi-objective recommender	Recommendation system leveraging user behavior data for improved accuracy.	GitHub

Packages

Pypi link: https://pypi.org/user/archi-park/

Package	Description	Repo
bardapi	The python package that returns Response of Google Bard through API.	GitHub
arxiv2text	Converting PDF files to text, mainly with a focus on arXiv papers.	GitHub
transllm	LLMtranslator translates and generates text in multiple languages.	GitHub
translang	Translation Service API Module.	GitHub
catchexception	Nightly version of ExceptNotifier	GitHub
googlebardapi	The python package that returns Response of Google Bard through API.	GitHub
cocoder	Python package that treamlines error debugging from Chat GPT and Google Bard.	GitHub
exceptnotifier	With Python's try-except to receive notifications about Errors or Successes in your code through messenger app or email.	GitHub
utilfunction	The Python package utilfunction wraps and distributes useful functions in an easy-to-use way.	GitHub
quickshow	Quick-Show provides simply but powerful insight plots	GitHub
googledriver	The Python package google drive facilitates access to files uploaded to Google Drive.	GitHub
youtuber	Support tools including crawler, video editing, YouTube API, etc.	GitHub
docfilter	The Python package docfilter is used to detect and remove inappropriate information from text.	GitHub
kmi2122	This dataset includes some macroeconomic indicators for South Korea in 2021-2022.	GitHub
corpusshow	Corpus-Show makes it easier and faster to visualize corpus through sentence embedding of corpus.	GitHub
edanif	EDA-NIf creates a dataframe containing meta information of NIfTi files and provides several useful features.	GitHub

Dockerhub

Link: https://hub.docker.com/u/parkminwoo91

Work Experience

01 Internal Projects (2017 - 2022)

Inflow Analysis/Product Selection/Trend Analysis/Price Trend/Logistics Demand Prediction Model (2017-2018, Recommender System, Natural Language Processing)
Analysis of National Health Insurance Service (NHIS) Data and Development of Biological Age Calculation Algorithm, Disease Prevalence Prediction (2020, Machine Learning)
Detection of Overhead Wires using Big Data from Korea Electric Power Corporation (KEPCO) (2021, Computer Vision)
Development Planning of Food Ingredient Discovery and Characterization Platform (2021, Machine Learning, Natural Language Processing)
Software Development for Automating Protein Mechanisms, Interactions, and Molecular Structure Extraction and Analysis from Alzheimer's Disease Papers (2021, Natural Language Processing, Computer Vision)
Prediction of Diseases and Physical Vitality based on Animal Metabolite (Fur/Blood) Datasets (2021, Machine Learning, Natural Language Processing)
Anomaly Signs Prediction, Health Index Forecast, Gut Microbiome Data Analysis using National Health Insurance Data (2021, Machine Learning, Natural Language Processing)
Development Planning for Heat Efficiency and Energy Optimization Algorithms in Sihwabanwol Industrial Complex (2021, Optimization)
Development of Automatic Brain Structure Segmentation and Tumor Area Segmentation Model using MRI and CT Images and Skull Extraction Algorithms (2022, Computer Vision)
Development of Body Type Classification and 3D Body Shape Change Prediction Model based on Time-Series Korean Body Data Collection (2022, Computer Vision)
Algorithm and Deep Learning Model Development for Extracting Drawing Factors from Engineering Drawings (2022, Computer Vision)
Algorithm for Visualization and Analysis of Noise Sources, Automation Pipeline for Noise Source Localization and Clustering (2022, Computer Vision)

02 Personal Projects (2022 - 2023)

Bard-API: Unofficial Python Package for Fetching Responses from Google Bard (GitHub Star 5.4k, Downloads 379k, 2023, Python Development)
ExceptNotifier: Package for Sending Detailed Error Messages to Users via Messenger when Errors Occur in try-except Statements (Downloads 27k, 2023, Python Development)
All About LLM: Documentation of Papers and Projects on Large Language Models (2023, LLM)
Ko LLaMa2 Jindo: Project Focused on Creating a Korean Natural Language Model, Entire Pipeline Construction and Lightweighting (2023, LLM)
GORANI: Multipurpose Korean LLM Development Project based on LLaMA2 (2023, LLM)
HF Trans LLM: Translator Project for Multilingual Translation and Text Generation (2023, LLM, Python Development)
Korean Open LLM Datasets-chain: Project for Collecting/Processing Korean LLM Datasets (2023, Python Development, LLM)
Open LLM Datasets: Compilation of Datasets and Papers Used in Open LLM (2023, Python Development, LLM)
Open LLM Leaderboard-report: Visualization of Performance of Open Source LLMs based on Four Metrics for Performance Comparison (2023, LLM)
Medical QA Bert Chat GPT: Fine-tuning GPT-2 for Question-Answering in the Medical Domain (2023, LLM)
Translang: Translation Service Module Providing API for Language Translation (2023, Python Development)
Fine-tuned-korean-bert-news-article-classifier: Model Development for News Article Topic Classification, Comparing BERT Implementations in Various Frameworks (2023, LLM)
Multi Objective Recommender: Project to Build a Multi-Objective Recommendation System based on Real E-commerce Sessions (2023, Recommender System)
Co Coder: Python Package to Simplify Debugging using OpenAI Chat GPT and Google Bard (2023, Python Development)
EDA-Nif: Organizing Metadata of Medical AI Nifti Files and Providing Some Functions such as Image Registration and Arbitrary Slicing (2022, AI for Life)

dsdanielpark Goto Github PK

Hello, I'm MinWoo(Daniel) Park

Contents

Large Language Model

Huggingface

Projects

Packages

Dockerhub

Work Experience

01 Internal Projects (2017 - 2022)

02 Personal Projects (2022 - 2023)

MinWoo(Daniel) Park's Projects

Recommend Projects

Recommend Topics

Recommend Org

Jobs