Comments (12)
Hi, this seems very related to this issue: #169
Maybe also here, try to pip uninstall mamba-ssm and/or causal-conv1d and pip install them again making sure to prevent using the pip cache by adding the
--no-cache-dir
argument.
from mamba.
I have been successfully run. Environment follows:
cuda 11.8
python 3.10.13
pytorch 2.1.1
causal_conv1d 1.1.1
mamba-ssm 1.2.0.post1
pip install torch==2.1.1 torchvision==0.16.1 torchaudio==2.1.1 --index-url https://download.pytorch.org/whl/cu118
pip install causal_conv1d==1.1.1
pip install mamba-ssm==1.2.0.post1
from mamba.
Any solution on this now?
I find torch 2.2 will have this problem, after switch it to torch 2.1, I solve it
I was using the PyTorch 2.3 and it was not working. On the contrary, I downgrade to PyTorch 2.2. it is working currently.
and below are the configurations of 'causal-conv1d' and 'mamba-ssm '
causal-conv1d 1.1.3.post1 pypi_0 pypi
mamba-ssm 1.2.0.post1 pypi_0 pypi
from mamba.
Any suggestions on this till now?
from mamba.
Hi @FloMru,
Would this issue help you?
From what I understand, it's required that both torch and causal_conv1d use the same version of CUDA.
from mamba.
I had the same issue, recompiled both causal-conv1d and mamba from source, then cp -r build/lib.linux-x86_64-cpython-310/* ../venv/lib/python3.10/site-packages/ for both packages. Mind the ../venv - replace with the path to your venv.
from mamba.
Any solution on this now?
from mamba.
any update?
from mamba.
I think the issue has to do with cuda. Generally, Pytorch and causal_conv1d need to be compiled with the same version of cuda.
Something that worked for me on a Linux cluster is Pytorch 2.0.0 with cuda 11.8 (later versions should also work), causal conv1d 1.1.0 (not earlier versions) and mamba ssm 1.1.0.
If you check out the pip pages for the packages, make sure whatever verrsion you select for causalconv1d works for mamba-ssm
from mamba.
Hi, this seems very related to this issue: #169
Maybe also here, try to pip uninstall mamba-ssm and/or causal-conv1d and pip install them again making sure to prevent using the pip cache by adding the
--no-cache-dir
argument.
Hi, I was having same problem with another repo, and followed what you said. Now it is working, thanks :)
from mamba.
Any solution on this now?
I find torch 2.2 will have this problem, after switch it to torch 2.1, I solve it
from mamba.
Well, all you need to do is download the corresponding .whl from https://github.com/Dao-AILab/causal-conv1d/releases/tag/v1.0.0 and https://github.com/state-spaces/mamba/releases/tag/v1.0.1 based on your cuda, python as well as torch version and pip install the two .whl file.
from mamba.
Related Issues (20)
- > > 大家好!我专注于 mamba,也许你可以添加我的微信,我们可以一起学习实现我们的目标
- Mamba2 interpretability HOT 1
- CPU Memory Problem
- Flexible creating of parameters in Training? HOT 2
- mamba-2 error HOT 1
- mamba2 error HOT 1
- AMD GPU: AttributeError: 'HIPDriver' object has no attribute 'get_current_device' HOT 2
- Losses decline slowly
- mamba-ssm-2.2.0 has an error HOT 1
- How to estimate training time using the LLM model with the Mamba block? HOT 5
- The function of conv1d in Mamba block HOT 1
- Parameter number problem of Mamba2
- AssertionError. libcuda.so not found!
- When using "nn. Conv1d" to implement causal convolution in mamba2, the output is not truncated
- mamba2 triton.runtime.autotuner.outofresources: out of resource: shared memory, required: 294912, hardware limit: 49152. reducing block sizes or `num_stages` may help HOT 1
- On the small model, the actual GPU memory usage of Mamba2 is much higher than that of Mamba1. HOT 6
- ImportError: libtorch_cuda_cu.so: cannot open shared object file: No such file or directory
- support training on near-infinite sequence lengths?
- When to use ngroups in mamba-v2? HOT 6
- Understanding Pass State Associative Scan HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from mamba.