sorokinvld Goto Github PK

followers: 0.0 following: 0.0 repos: 3.7K gists: 9.0

Name: Vladislav Sorokin

Type: User

Location: Russia

Vladislav Sorokin's Projects

verba

Retrieval Augmented Generation (RAG) chatbot powered by Weaviate

verbum

Verbum is a fully flexible text editor based on lexical framework.

versel-examples

Enjoy our curated collection of examples and solutions. Use these patterns to build your own robust and scalable applications.

video-based-infant-action-recognition

Challenges in Video-Based Infant Action Recognition: A Critical Examination of the State of the Art (WACVW'24)

Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation.

video-diffusion-webui

Video Diffusion WebUI: Text2Video + Image2Video + Video2Video WebUI

video-llama

Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

video-llava

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

video-mamba-suite

video-to-gif-telegram-bot

Want to create a GIF from one of your videos? Ask the bot to do so!

video2game

Code release of Video2Game

video2music

Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model

videocrafter

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

videocrafter-colab

videomamba

VideoMamba: State Space Model for Efficient Video Understanding

vim-plug

:hibiscus: Minimalist Vim Plugin Manager

vimacs

Neovim Configuration heavily inspired by Emacs & JetBrains. Based on NvChad

vimgpt

Browse the web with GPT-4V and Vimium

vin-decoder

Universal vin decoder to retrieve vehicle informations

vin_decoder

viquae

Source code and data used in the papers ViQuAE (Lerner et al., SIGIR'22) and Multimodal ICT (Lerner et al., ECIR'23)

visinger

Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.

visionomicon

A utility that leverages GPT-4V to rename image files based on their content

vision_audio_and_multimodal_projects

This repository includes all computer vision, audio, document AI, and multimodal projects.

vision_transformer

vistripformer

visualizer-examples

Examples how to integrate visualizer on web applications

visual_anagrams

Code for the paper "Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models"

vit.cpp

Inference Vision Transformer (ViT) in plain C/C++ with ggml

sorokinvld Goto Github PK

Vladislav Sorokin's Projects

Recommend Projects

Recommend Topics

Recommend Org

Jobs