GithubHelp home page GithubHelp logo

Hello, I'm RESPWILL!

Welcome to my GitHub repository! Here, you will find a collection of projects that showcase my journey in data science and machine learning, demonstrating a strong foundation in data mining processes, model development, and deployment.

🌟 About Me

I am a passionate data scientist with hands-on experience in the end-to-end data mining process, including Exploratory Data Analysis (EDA), data preprocessing, model training, validation, and analysis. My expertise extends to deploying models into production using Azure Web Apps, FastAPI, and Docker, with an automated deployment pipeline via GitHub.

Despite facing challenges in aligning projects with direct business impacts, I have consistently sought to bridge the gap between technical AI model development and practical business applications. I am keen on projects that require deep understanding and alignment with production team needs, aiming for significant contributions beyond the realm of R&D.

🚀 Key Projects and Achievements

Weekly Shipping Volume Regression

Objective: Predict next week's shipping volume for the Busan-Qintao route.
Model & Tools: Linear Regression, feature selection, and engineering.
Insights: Overcame data scarcity with effective feature selection, achieving an RMSE of 56.9 and RMSPE of 10%.

Shipping Item Classifier

Objective: Classify shipping items into 96 categories using the HS system.
Model & Tools: Distil-BERT and T5 models, PyTorch, data preprocessing.
Insights: Achieved 94% accuracy despite challenges in data acquisition and label consistency.

Dangerous Item Classifier

Objective: Distinguish between dangerous and non-dangerous shipping cargo. Model & Tools: Distil-BERT, data sampling, text preprocessing. Insights: Attained an 85% recall rate and 93% precision, balancing dataset representation.

Shipping Route Finder

Objective: Optimize shipping volume with linear programming and reinforcement learning. Model & Tools: Reinforcement Learning with DDQN. Insights: Devised a complex reward system and environment for route optimization.

Recover Sparse Dataset

Objective: Recover missing data in sparse shipping volume arrays. Model & Tools: SVD method, creation of a rating matrix. Insights: Utilized item classification to generate a meaningful rating matrix for SVD application.

ChatGPT Prompt Engineering

Objective: Extract data from shipping invoices using prompt engineering. Model & Tools: Azure chatGPT, OCR, Python. Insights: Developed innovative prompt chaining techniques to process and interpret invoice data efficiently.

🔍 Current Focus

I am currently delving into ChatGPT Prompt Engineering, leveraging the power of AI to streamline data extraction from complex documents.
This work involves sophisticated prompt design, integration of OCR technologies, and strategic API usage to enhance data processing workflows.
Also I am very interested in RAG using vectorDB and function calling.

🌱 Looking Forward

I am eager to collaborate on projects that not only challenge my technical skills but also have a tangible impact on business processes and efficiency.
My goal is to develop solutions that are in perfect harmony with the actual needs of production teams, driving meaningful advancements in the field.

📫 How to Reach Me

Feel free to explore my projects and reach out for collaborations or discussions. You can contact me at [email protected]

Thank you for visiting my repository!

Feel free to customize the template with your personal information, contact details, and any additional projects or skills you wish to highlight. This introduction aims to present a comprehensive view of your capabilities and aspirations to potential collaborators or employers browsing your GitHub repository.

respwill's Projects

abalone_age_regression icon abalone_age_regression

Final project for DTSA 5509 Supervised Learning, Master of Data Science program at the University of Colorado Boulder.

ircot icon ircot

Repository for Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions, ACL23

langchain icon langchain

⚡ Building applications with LLMs through composability ⚡

sinokor_alpaca icon sinokor_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.