Topic: flash-attention-2
Something interesting about flash-attention-2
flash-attention-2,Triton implementation of FlashAttention2 that adds Custom Masks.
User: alexzhang13
flash-attention-2,Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app.
User: arihanv
Home Page: https://shush.arihanv.com
flash-attention-2,Uses the WhisperS2T and CTranslate2 libraries to batch-transcribe multiple files.
User: bbc-esq
flash-attention-2,Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios.
User: bruce-lee-ly
flash-attention-2,📖 A curated list of awesome LLM inference papers with code: TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, continuous batching, FlashAttention, PagedAttention, etc.
User: deftruth
Home Page: https://github.com/DefTruth/Awesome-LLM-Inference
flash-attention-2,🎉 CUDA/C++ notes / hand-written CUDA kernels for large models / technical blog, updated irregularly: flash_attn, sgemm, sgemv, warp reduce, block reduce, dot product, elementwise, softmax, layernorm, rmsnorm, hist, etc.
User: deftruth
Home Page: https://github.com/DefTruth/cuda-learn-notes
flash-attention-2,Poplar implementation of FlashAttention for IPU
Organization: graphcore-research
flash-attention-2,Transcribe audio in minutes with OpenAI's WhisperV3 and Flash Attention v2 + Transformers without relying on third-party providers and APIs. Host it yourself or try it out.
User: lalitdotdev
Home Page: https://transcribex.vercel.app
flash-attention-2,A fast, lightweight, parallel inference server for Llama LLMs.
User: nickpotafiy