canbaba0517 Goto Github PK
Type: User
Type: User
Script for converting the pretrained VGGish model provided with AudioSet from TensorFlow to PyTorch, along with a basic smoke test.
[AAAI 2024] AVSegFormer: Audio-Visual Segmentation with Transformer
Code release for "BoxeR: Box-Attention for 2D and 3D Transformers"
Code for the ECCV22 paper "Bottom Up Top Down Detection Transformers for Language Grounding in Images and Point Clouds"
[NeurIPS2022] This is the official code of "CAGroup3D: Class-Aware Grouping for 3D Object Detection on Point Clouds".
Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复
The implementation of DeBERTa
The code for 'Intriguing Findings of Frequency Selection for Image Deblurring' and 'Deep Residual Fourier Transformation for Single Image Deblurring'
Deformable DETR: Deformable Transformers for End-to-End Object Detection.
Boosting 3D Object Detection via Object-Focused Image Fusion
Pytorch implementation of DiffusionNet for fast and robust learning on 3D surfaces like meshes or point clouds.
[ECCV 2022] Official pytorch implementation of the paper, "PointMixer: MLP-Mixer for Point Cloud Understanding"
[CVPR 2023] EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding
FcaNet: Frequency Channel Attention Networks
Free-form Description-guided 3D Visual Graph Networks for Object Grounding in Point Cloud
Implementation of 1D, 2D, and 3D FFT convolutions in PyTorch. Much faster than direct convolutions for large kernel sizes.
Unofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms
Official implementation for "Frequency and Spatial Dual Guidance for Image Dehazing" [ECCV 2022]
[NeurIPS 2021] [T-PAMI] Global Filter Networks for Image Classification
PC DFT
[ICCV 2021] InstanceRefer: Cooperative Holistic Understanding for Visual Grounding on Point Clouds through Instance Multi-level Contextual Referring
Code for LAVSS: Location-Guided Audio-Visual Spatial Audio Separation
An All-MLP solution for Vision, from Google AI
Multi-View Transformer for 3D Visual Grounding [CVPR 2022]
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.