Comments (3)
Hi,
I am not entirely sure why this happens, but let me take a stab. It is most likely related to the --ipaddr flag and to line 884 of pretrain_nmt.py, which is "os.environ['MASTER_PORT'] = '26023'".
The default value of --ipaddr, localhost, may be a problem inside Docker. Or port 26023 may already be in use. Either way, the process seems to be blocked waiting for something, so changing the --ipaddr value or the port may help.
Other than that, I can suggest trying it outside a Docker environment.
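To test the "port already in use" theory, something like the following could be run inside the container before launching training. This is only a minimal sketch: the helper names (port_is_free, find_free_port, choose_master_port) are made up for illustration and are not part of yanmtt, and it assumes the rendezvous happens on 127.0.0.1. It only checks the port and sets the environment variables. Since pretrain_nmt.py hardcodes the port at line 884, that line would still need to be edited to use the value found here.

```python
import os
import socket


def port_is_free(port, host="127.0.0.1"):
    """Return True if nothing is currently bound to (host, port)."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        try:
            sock.bind((host, port))
        except OSError:
            # Typically "address already in use".
            return False
    return True


def find_free_port(host="127.0.0.1"):
    """Ask the OS for an unused TCP port by binding to port 0."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        sock.bind((host, 0))
        return sock.getsockname()[1]


def choose_master_port(preferred=26023, host="127.0.0.1"):
    """Keep the preferred port if it is free, otherwise pick another one."""
    if port_is_free(preferred, host):
        return preferred
    return find_free_port(host)


# 26023 is the value hardcoded in pretrain_nmt.py; fall back if it is taken.
port = choose_master_port()
os.environ["MASTER_ADDR"] = "127.0.0.1"
os.environ["MASTER_PORT"] = str(port)
print("MASTER_PORT =", os.environ["MASTER_PORT"])
```

If this prints something other than 26023, the default port was occupied at the time of the check, which would be consistent with the hang.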
Hope this helps.
from yanmtt.
This issue seems to have been caused by incompatibilities between my CUDA version and the versions of TensorFlow and/or PyTorch in requirements.txt. I have it working now with Python 3.6.8, PyTorch 1.10.1 and TensorFlow 2.4.3.
In case this is useful to someone else, this is the relevant part of my current Dockerfile:
FROM nvcr.io/nvidia/pytorch:20.12-py3
# System tools; update and install in one layer so the package index is never stale
RUN apt-get update && apt-get install -y wget tmux && rm -rf /var/lib/apt/lists/*
WORKDIR /app
# Dedicated conda environment (-y avoids interactive prompts during the build)
RUN conda update -y conda
RUN conda create -y -n yanmtt python=3.6.8
# Run the following RUN steps inside the yanmtt environment
SHELL ["conda", "run", "-n", "yanmtt", "/bin/bash", "-c"]
RUN git clone https://github.com/prajdabre/yanmtt
WORKDIR yanmtt
RUN pip install -r requirements.txt
# Install the transformers copy that ships in the yanmtt repository
WORKDIR transformers
RUN python setup.py install
RUN pip install tensorflow==2.4.3
SHELL ["/bin/bash", "-c"]
ENV PYTHONPATH=$PYTHONPATH:/app/yanmtt/transformers
# PyTorch build that targets CUDA 11.3
RUN conda install -y -n yanmtt pytorch==1.10.1 torchvision==0.11.2 torchaudio==0.10.1 cudatoolkit=11.3 -c pytorch -c conda-forge
# Build and install sentencepiece v0.1.95 from source
WORKDIR /setup
RUN git clone --branch v0.1.95 https://github.com/google/sentencepiece.git
RUN mkdir sentencepiece/build
WORKDIR sentencepiece/build
RUN cmake .. && make -j 4
RUN make install && ldconfig -v
# Activate the yanmtt environment in interactive shells
RUN echo 'eval "$(conda shell.bash hook)"' >>~/.bashrc && echo 'conda activate yanmtt' >>~/.bashrc
WORKDIR /app
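Since the root cause was a version mismatch, a quick check like the one below can confirm what actually ended up in the image. It is a rough sketch, not part of yanmtt: installed_version and environment_report are hypothetical helper names, and the module list is an assumption based on the packages mentioned above. It only uses importlib plus torch.version.cuda and torch.cuda.is_available(), and it works on Python 3.6.

```python
import importlib


def installed_version(module_name):
    """Return module.__version__, or None if the module cannot be imported."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        return None
    return getattr(module, "__version__", "unknown")


def environment_report(module_names=("torch", "tensorflow", "sentencepiece")):
    """Collect package versions and, if torch is present, its CUDA status."""
    report = {name: installed_version(name) for name in module_names}
    if report.get("torch") is not None:
        import torch

        # CUDA version torch was built against (None for CPU-only builds).
        report["torch_cuda_build"] = torch.version.cuda
        # Whether torch can actually see a GPU in this container.
        report["cuda_available"] = torch.cuda.is_available()
    return report
```

Running `for key, value in environment_report().items(): print(key, value)` inside the container (started with GPU access) lists the versions. If cuda_available comes back False while the host driver is fine, the PyTorch build and the CUDA toolkit most likely do not match.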
from yanmtt.
Oh fantastic. Could you make a contrib folder in the examples folder and write down these points and then send a pull request? It would really help people.
from yanmtt.