GithubHelp home page GithubHelp logo

dannyl1u / doppelganger Goto Github PK

View Code? Open in Web Editor NEW
8.0 2.0 1.0 27 KB

flags duplicate issues & PRs using embeddings

Python 97.64% Dockerfile 2.36%
embedding-search open-source rag similarity-score vector-database sentence-transformers

doppelganger's Introduction

doppelgänger

Problem: open-source maintainers spend a lot of time managing duplicate/related (doppelgänger) issues & pull requests
Solution: doppelgänger compares newly submitted issues/PRs against existing ones to automatically flag duplicate/related (doppelgänger) issues/PRs

Topics: vector db, github, open-source, embedding search, rag, similarity scores

Screen.Recording.2024-04-27.at.4.57.11.PM.mov

Setup

  1. Clone this repository to your local machine:

    git clone https://github.com/dannyl1u/doppelganger.git
    cd doppelganger
    
  2. Build Docker image and run:

    docker build -t doppelganger . && docker run --name doppelganger doppelganger
    

or

  1. Create a virtual environment and install dependencies:

    • python -m venv venv
    • source venv/bin/activate # Use venv\Scripts\activate on Windows
    • pip install -r requirements.txt
  2. Run the Flask server:

    python app.py
    
  3. Configure a GitHub Webhook:

    • Go to your GitHub repository settings
    • Navigate to "Webhooks" and click "Add webhook"
    • Enter the following details:
      • Payload URL: https://your-public-url/webhook
      • Content type: application/json
      • Which events would you like to trigger this webhook?: Select "Let me select individual events" and check "Issues" and "Pull requests"
    • Click "Add webhook"

Notes

  • To make your Flask server publicly accessible, consider using a tool like ngrok to expose it to the internet during development.
  • Ensure proper security measures for the webhook endpoint to avoid unauthorized access or potential attacks.

Star History

Star History Chart

doppelganger's People

Contributors

dannyl1u avatar amiicao avatar

Stargazers

Justin Hayes avatar Parsa Rajabi avatar  avatar Patrick Zhao avatar Kiaan Castillo avatar  avatar Evert De Spiegeleer avatar Timothy Jaeryang Baek avatar

Watchers

 avatar Justin Hayes avatar

Forkers

amiicao

doppelganger's Issues

feat: openai api compatible model presets (profiles)

Is your feature request related to a problem? Please describe.
As the modelfiles from the hub are pretty good, it would be nice if we could use also them with external models from openAI api compatible services.

Describe the solution you'd like
Option A:
When adding a modelfile, we could check that it should use a model from the external provider and select it.
Option B:
When selecting a modelfile, we should be able to exchange then the actual model to one from the external service.

Describe alternatives you've considered
Setting the default prompt according to the modelfile works for now, just the profile pick and name is wrong and the extra settings.

Additional context
My ollama instance has limited vram available. External services can provide bigger models. I think it would be good if the same characters would be usable when using an external service.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.