Comments (2)
If you navigate to the checkpoint of your choice
@ https://console.cloud.google.com/storage/browser/sf-ctrl/ ,
you can download the individual files in the checkpoint using their public links (The link can be found under the public access column).
For instance, for the seqlen256_v1.ckpt
, the links are:
https://storage.googleapis.com/sf-ctrl/seqlen256_v1.ckpt/checkpoint
https://storage.googleapis.com/sf-ctrl/seqlen256_v1.ckpt/model.ckpt-413000.data-00000-of-00001
https://storage.googleapis.com/sf-ctrl/seqlen256_v1.ckpt/model.ckpt-413000.index
https://storage.googleapis.com/sf-ctrl/seqlen256_v1.ckpt/model.ckpt-413000.meta
You can mkdir -p seqlen256_v1.ckpt
and then wget
the above files to the same effect as the gsutil
route.
Closing for now, reopen as necessary.
from ctrl.
Sure, I'll look into doing this.
In the meantime, does this help: https://twitter.com/gwern/status/1171869124613591040?s=20
from ctrl.
Related Issues (20)
- Using ctrl for summarization HOT 2
- TPU configuration - fine tuning
- Is that a way to do "general" generation? HOT 1
- Source attribution - Cannot replicate results
- why set "seq_length = min(args.generate_num, 256)" HOT 1
- training curriculum used
- repeats the last word on AWS HOT 1
- License for pre-trained model
- Sampling method used for translation
- tips and scripts related to data collection HOT 1
- Are control codes required for finetuning?
- Altering the tone of the output
- Will BERT+transformer-decoder better than tensor2tensor for text-generation?
- control code not recognised HOT 2
- Issues with pytorch_generation.py when running the Colab exercise
- CTRL model can not work in huggingface transformers HOT 2
- 12 layer (huggingface gpt-2 equivalent) ctrl model?
- Cuda out of memory issue.
- A transformer decoer-based model or seq2seq model?
- control code
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ctrl.