My primary research efforts have been devoted to developing fast algorithms. During my Ph.D. studies, I developed fast algorithms for graph isomorphism, graph isomorphism query processing, and multiple pattern Cartesian tree matching. At LG Electronics, I have developed an AI coding assistant using large language models (LLMs).
LG Electronics - Artificial Intelligence Lab (Senior Researcher)
- Aug. 2022 – Present: Fine-tuning large language models for AI coding assistance on AWS SageMaker, using PyTorch FSDP and DeepSpeed for full-parameter fine-tuning and LoRA for parameter-efficient fine-tuning. Development of data augmentation techniques for causal language models. Inference server development for large language models using NVIDIA Triton, FasterTransformer, and FastAPI. Inference optimization for transformer models using FlashAttention and PagedAttention.
- Apr. 2022 – Dec. 2022: Training a small language model (from scratch) for AI coding assistance using PyTorch. Web UI development using Python Streamlit. Inference server development for language models using Flask and ShannonAI/service-streamer.
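The parameter-efficient fine-tuning mentioned above relies on the LoRA idea: the pretrained weight stays frozen and only two small low-rank matrices are trained. A minimal NumPy sketch of that idea, with illustrative names and sizes (not taken from any specific model):

```python
import numpy as np

# LoRA sketch: the frozen pretrained weight W is adapted as
#   y = W x + (alpha / r) * B @ (A @ x),
# where only A (r x d_in) and B (d_out x r) are trained.
# All dimensions here are illustrative.
rng = np.random.default_rng(0)
d_in, d_out, r, alpha = 64, 64, 4, 8

W = rng.standard_normal((d_out, d_in))      # frozen pretrained weight
A = rng.standard_normal((r, d_in)) * 0.01   # trainable down-projection
B = np.zeros((d_out, r))                    # trainable up-projection
                                            # (zero init: adapter starts as a no-op)

def lora_forward(x):
    return W @ x + (alpha / r) * (B @ (A @ x))

x = rng.standard_normal(d_in)
y = lora_forward(x)

# With B initialized to zero, the adapted layer matches the frozen layer.
assert np.allclose(y, W @ x)

full_params = W.size            # parameters updated by full fine-tuning
lora_params = A.size + B.size   # parameters updated by LoRA
print(full_params, lora_params)  # 4096 512
```

In practice this low-rank update is attached to attention projections inside the transformer, so only a small fraction of parameters needs optimizer state, which is what makes single-node fine-tuning of large models feasible.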
Seoul National University – Institute of Computer Technology (Post-Doctoral Assistant)
- Jan. 2022 – Mar. 2022: Algorithm development for graph isomorphism query processing (Efficient Graph Isomorphism Query Processing using Degree Sequences and Color-Label Distributions, IEEE ICDE 2022).
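One building block behind degree-sequence-based isomorphism filtering: two graphs can be isomorphic only if their sorted degree sequences are identical, so mismatching sequences prune candidate pairs cheaply before any expensive isomorphism test. A hedged sketch of that necessary-condition filter (representation and helper names are illustrative, not from the paper):

```python
def degree_sequence(edges, n):
    """Sorted degree sequence of an undirected graph on n vertices."""
    deg = [0] * n
    for u, v in edges:
        deg[u] += 1
        deg[v] += 1
    return tuple(sorted(deg))

def may_be_isomorphic(g1, g2, n):
    """Necessary (not sufficient) condition: equal sorted degree sequences."""
    return degree_sequence(g1, n) == degree_sequence(g2, n)

# A triangle and a path on 3 vertices cannot be isomorphic.
triangle = [(0, 1), (1, 2), (2, 0)]
path = [(0, 1), (1, 2)]
print(may_be_isomorphic(triangle, path, 3))                      # False
# Two relabeled triangles pass the filter.
print(may_be_isomorphic(triangle, [(2, 1), (0, 2), (1, 0)], 3))  # True
```

Passing the filter does not prove isomorphism; in a query-processing pipeline such invariants only shrink the candidate set that a full isomorphism check must then verify.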
Competitive Programming
Programming Languages
Libraries
- PyTorch, TensorFlow, Triton (OpenAI), Seaborn, Pandas, PySpark, HuggingFace Transformers, DeepSpeed, NVIDIA Triton, NVIDIA FasterTransformer, FastAPI, GoogleTest (gtest)
Others
- AWS (SageMaker, EC2, Lustre, S3)