GithubHelp home page GithubHelp logo

Vladislav Sorokin's Projects

verba icon verba

Retrieval Augmented Generation (RAG) chatbot powered by Weaviate

verbum icon verbum

Verbum is a fully flexible text editor based on lexical framework.

versel-examples icon versel-examples

Enjoy our curated collection of examples and solutions. Use these patterns to build your own robust and scalable applications.

video-chatgpt icon video-chatgpt

Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation.

video-llama icon video-llama

Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

video-llava icon video-llava

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

video2music icon video2music

Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model

videocrafter icon videocrafter

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

videomamba icon videomamba

VideoMamba: State Space Model for Efficient Video Understanding

vim-plug icon vim-plug

:hibiscus: Minimalist Vim Plugin Manager

vimacs icon vimacs

Neovim Configuration heavily inspired by Emacs & JetBrains. Based on NvChad

vimgpt icon vimgpt

Browse the web with GPT-4V and Vimium

vin-decoder icon vin-decoder

Universal vin decoder to retrieve vehicle informations

viquae icon viquae

Source code and data used in the papers ViQuAE (Lerner et al., SIGIR'22) and Multimodal ICT (Lerner et al., ECIR'23)

visinger icon visinger

Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.

visionomicon icon visionomicon

A utility that leverages GPT-4V to rename image files based on their content

visual_anagrams icon visual_anagrams

Code for the paper "Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models"

vit.cpp icon vit.cpp

Inference Vision Transformer (ViT) in plain C/C++ with ggml

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.