Ikko Eltociear Ashimine's Projects
Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)
A tool for secrets management, encryption as a service, and privileged access management
OP Vault ChatGPT: Give ChatGPT long-term memory using the OP Stack (OpenAI + Pinecone Vector Database). Upload your own custom knowledge base files (PDF, txt, etc.) using a simple React frontend.
Unofficial Bitwarden compatible server written in Rust, formerly known as bitwarden_rs
This repo is no longer accepting PRs or new issues. Code questions? Try https://stackoverflow.com/questions/tagged/vba. Suggestions? Go to https://officespdev.uservoice.com. Need more help? Try https://docs.microsoft.com/office/vba/articles/feedback-support. Office VBA reference:
VirtualBox Git mirror
VCoder: Versatile Vision Encoders for Multimodal Large Language Models, arXiv 2023
NeurIPS 2023, Recaptured Raw Screen Image and Video Demoiréing via Channel and Spatial Modulations
💧 Instill VDP (Versatile Data Pipeline) is an open-source tool to seamlessly integrate AI to process unstructured data in the modern data stack
Utilities for converting deep representations (like sentence embeddings) back to text
Framework for benchmarking vector search engines
VMAS is a vectorized framework designed for efficient Multi-Agent Reinforcement Learning benchmarking. It comprises a vectorized 2D physics engine written in PyTorch and a set of challenging multi-robot scenarios. Additional scenarios can be implemented through a simple and modular interface.
Venom is the most complete JavaScript library for WhatsApp, 100% open source.
A new bootable USB solution.
Develop. Preview. Ship.
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, 2022
The open big data serving engine. https://vespa.ai
Official PyTorch codes for the paper: "ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation"
[NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale
Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation.
Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
Video.js - open source HTML5 & Flash video player
Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model
A Toolkit for Text-to-Video Generation and Editing
The official Vim repository