
shikharkunal / finetuning_gpt2


A pre-trained large language model (GPT-2), fine-tuned on a publicly available dataset to act as a question answering bot.

License: MIT License

Jupyter Notebook 100.00%

finetuning_gpt2's Introduction

Question Answering Model using GPT-2

Overview

This project is a Question Answering Model based on the GPT-2 language model by OpenAI. The model is implemented as a Jupyter Notebook, making it easy to explore, understand, and use for generating answers to user queries.

Features

  • GPT-2 Model: Utilizes the GPT-2 language model, known for its advanced natural language understanding and generation capabilities.

  • Question Answering: The notebook provides functionality for answering questions by interacting with the GPT-2 model.

  • Fine-tuned GPT-2: The model is fine-tuned on a publicly available dataset to act specifically as a question answering bot.
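
The notebook's exact training format is not described here. As a minimal sketch, assuming each dataset record has a `question` and an `answer` field (hypothetical names, as is the prompt template), fine-tuning a causal model such as GPT-2 for question answering usually starts by flattening each pair into one text string that ends with GPT-2's end-of-text token:

```python
# Sketch only: the field names and prompt template are assumptions,
# not taken from the notebook.
EOS_TOKEN = "<|endoftext|>"  # GPT-2's end-of-text token


def format_example(question: str, answer: str) -> str:
    """Flatten one question/answer pair into a single training string."""
    return f"Question: {question.strip()}\nAnswer: {answer.strip()}{EOS_TOKEN}"


def build_corpus(records: list[dict]) -> list[str]:
    """Format every record, skipping any with an empty question or answer."""
    return [
        format_example(r["question"], r["answer"])
        for r in records
        if r.get("question", "").strip() and r.get("answer", "").strip()
    ]


if __name__ == "__main__":
    sample = [{"question": "What is GPT-2?", "answer": "A language model by OpenAI."}]
    print(build_corpus(sample)[0])
```

Ending each example with the end-of-text token gives the model a signal for where an answer stops, which helps keep generated answers short at inference time.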

Getting Started

  1. Clone the Repository:

    git clone https://github.com/shikharkunal/finetuning_gpt2.git
    cd finetuning_gpt2

  2. Install Dependencies: Open the Jupyter Notebook and install any required dependencies specified in the notebook.

  3. Run the Notebook: Open the notebook in Google Colab and follow the instructions provided. Execute the cells to load the model and perform question answering.
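
The required dependencies are listed in the notebook itself, not here. As a rough sketch, assuming the notebook relies on the `transformers`, `torch`, and `datasets` packages (an assumption based on what GPT-2 fine-tuning typically needs), the following check reports which of them are missing from the current environment before you run any cells:

```python
# Sketch only: the package list is an assumption about what a GPT-2
# fine-tuning notebook typically needs.
from importlib.util import find_spec

ASSUMED_PACKAGES = ["transformers", "torch", "datasets"]


def missing_packages(names: list[str]) -> list[str]:
    """Return the top-level packages that cannot be found in this environment."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages(ASSUMED_PACKAGES)
    if missing:
        print("Install with: pip install " + " ".join(missing))
    else:
        print("All assumed dependencies are available.")
```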

Usage

  1. Open the notebook in Google Colab.

  2. Follow the step-by-step instructions provided in the notebook cells.

  3. Execute the cells to load the GPT-2 model and generate answers to questions.
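
For readers who want to see what the load-and-generate step might look like outside the notebook, here is a minimal sketch using the Hugging Face `transformers` library. The checkpoint name (`gpt2`, the base model, not this project's fine-tuned weights), the prompt template, and the generation settings are all assumptions; the notebook may do this differently.

```python
# Sketch only: the checkpoint name, prompt template, and generation
# settings are assumptions, not the notebook's actual values.

def build_prompt(question: str) -> str:
    """Wrap a question in the (assumed) template used during fine-tuning."""
    return f"Question: {question.strip()}\nAnswer:"


def extract_answer(generated: str, prompt: str) -> str:
    """Drop the echoed prompt and keep only the first line of the answer."""
    text = generated[len(prompt):] if generated.startswith(prompt) else generated
    text = text.strip()
    return text.splitlines()[0].strip() if text else ""


def answer_question(question: str, model_name: str = "gpt2") -> str:
    """Generate an answer with a GPT-2 checkpoint (base "gpt2" by default)."""
    # Imported lazily so the helpers above work without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForCausalLM.from_pretrained(model_name)
    prompt = build_prompt(question)
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(
        **inputs,
        max_new_tokens=50,
        pad_token_id=tokenizer.eos_token_id,  # GPT-2 has no pad token
    )
    generated = tokenizer.decode(output_ids[0], skip_special_tokens=True)
    return extract_answer(generated, prompt)
```

Calling `answer_question("What is the capital of France?")` downloads the checkpoint on first use. To try a fine-tuned model instead, pass the path of a saved checkpoint directory as `model_name`.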

Contributing

Contributions to the Notebook are welcome! If you have ideas for improvements or encounter issues, feel free to open an issue or submit a pull request.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Happy questioning!
