gpt-2-colab's People

Contributors

ak9250, milesoai

gpt-2-colab's Issues

Required memory?

Hello,
How much memory is required to run this model? I get out-of-memory errors on a graphics card with 6 GB of memory.
Is that simply not enough memory, or is it a problem with my local setup?
Thanks,
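
As a rough back-of-the-envelope aid for this question, the sketch below estimates the memory needed just for weights, gradients and optimizer state when fine-tuning. It assumes 32-bit floats and an Adam-style optimizer that keeps two extra buffers per parameter, and it ignores activations, which add more on top. The function name and the 345M parameter count (the model size named in another issue on this page) are illustrative assumptions, not measurements from this repository.

```python
def training_state_gb(num_params, bytes_per_value=4, copies=4):
    """Estimate memory for weights + gradients + two optimizer buffers.

    copies=4 assumes: 1 copy of weights, 1 of gradients, and 2 Adam
    moment buffers. Activations and framework overhead are NOT included.
    The result is in decimal gigabytes (1e9 bytes).
    """
    return num_params * bytes_per_value * copies / 1e9


if __name__ == "__main__":
    # 345M is the model size mentioned elsewhere on this page; it is
    # treated here as exactly 345e6 parameters for illustration.
    print(f"{training_state_gb(345_000_000):.2f} GB before activations")
```

Under these assumptions the state alone comes to about 5.5 GB, close to the 6 GB the card offers, which would be consistent with the out-of-memory errors, though the actual cause may differ.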

Which is better: GPT or RelGAN for text generation?

Based on my understanding, GPT and GPT-2 are trained with a language-model loss and generate text without any GAN component.

So which is better: GPT or RelGAN/LeakGAN/SeqGAN/TextGAN?

I am quite confused about this question. Thank you very much.

Installing toposort is now redundant

Since toposort has been recently added to the requirements.txt of the nshepperd project, we can safely remove the line used to install it in the Colab notebook.

cd command

Works great, thanks.
I was just confused by the missing % before the cd commands ...
(brand new to Colab ...)
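
For readers who hit the same confusion: in a notebook, a cd run through ! executes in a child shell and does not change the notebook's own working directory, while the %cd magic does. The sketch below reproduces that difference in plain Python, using a child shell for the first case and os.chdir for the second. The helper names are hypothetical and only illustrate the behaviour.

```python
import os
import shlex
import subprocess


def cwd_after_subshell_cd(target):
    """Run `cd target` in a child shell, then report OUR working directory.

    The child shell changes its own directory and exits, so the parent
    process is unaffected (this mirrors `!cd` in a notebook).
    """
    subprocess.run(["sh", "-c", "cd " + shlex.quote(target)], check=True)
    return os.getcwd()


def cwd_after_chdir(target):
    """Change the directory of the current process (this mirrors `%cd`)."""
    os.chdir(target)
    return os.getcwd()
```

Calling cwd_after_subshell_cd leaves the working directory where it was, whereas cwd_after_chdir moves it to the target.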

No module named 'tensorflow.contrib'

I am new to TensorFlow, but something seems to have changed. I get this when I try to train:

Traceback (most recent call last):
  File "./train.py", line 14, in <module>
    import model, sample, encoder
  File "/content/gpt-2/src/model.py", line 3, in <module>
    from tensorflow.contrib.training import HParams
ModuleNotFoundError: No module named 'tensorflow.contrib'

I even tried %tensorflow_version 1.x.
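
The tensorflow.contrib package was removed in TensorFlow 2.0, so this error usually indicates that the runtime is importing a 2.x release. A minimal sketch of a version check follows. contrib_expected is a hypothetical helper, and in a notebook the version string would come from tensorflow.__version__.

```python
def contrib_expected(tf_version: str) -> bool:
    """Return True if a TensorFlow version string is from the 1.x line.

    Assumption: tf_version looks like "1.15.2" or "2.3.0", as reported by
    tensorflow.__version__. tensorflow.contrib only exists before 2.0.
    """
    major = int(tf_version.split(".")[0])
    return major < 2


if __name__ == "__main__":
    for version in ("1.15.2", "2.3.0"):
        print(version, "->", contrib_expected(version))
```

If the check returns False after selecting 1.x, the selection may have been made after TensorFlow was already imported, in which case restarting the runtime is worth trying.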

How to prepare the data for text generation task. Thank you very much.

First, I'm not sure whether the model includes the encoder during training.

EOS means end-of-sentence. The encoder and decoder are parts of the Transformer network.

Without an encoder, at training time:

target: [E, F, G, H, EOS]
decoder input: [0, E, F, G, H]

Without an encoder, at test time:

decoder input: [0]

With an encoder, at training time:

encoder input: [A, B, C, D]
target: [E, F, G, H, EOS]
decoder input: [0, E, F, G, H]

With an encoder, at test time:

encoder input: [A, B, C, D]
decoder input: [0]

Is this exactly right?

I know this is beyond the scope of this project, but I hope you can help.
Thank you very much.
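
The shifted layout described in the post can be sketched as below: the decoder input is the sequence with a start token prepended, and the target is the same sequence with EOS appended. GPT-2 itself is a decoder-only model, so the without-encoder case is the relevant one. The helper name and the token values are illustrative only and follow the notation used in the post, not this project's actual preprocessing.

```python
BOS = 0        # start token, written as 0 in the post
EOS = "EOS"    # end marker, as named in the post


def make_training_pair(tokens):
    """Build (decoder_input, target) so that target[i] follows decoder_input[i].

    decoder_input = [BOS, t1, ..., tn]
    target        = [t1, ..., tn, EOS]
    """
    tokens = list(tokens)
    decoder_input = [BOS] + tokens
    target = tokens + [EOS]
    return decoder_input, target


if __name__ == "__main__":
    dec_in, tgt = make_training_pair(["E", "F", "G", "H"])
    print("decoder input:", dec_in)
    print("target:       ", tgt)
```

At test time only the start token (plus any prompt) is supplied, and each generated token is appended to the decoder input before the next step.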

Trained model spits out exact text from trained text

Hi,
I'm quite new to this, but I put this on my own server and let it train on my own text data.

After the average loss got close to 0.0, it started outputting exact paragraphs from my training data.

Did I overtrain it or set some flags wrong?
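
One way to quantify the behaviour described above is to measure how long the longest verbatim span shared by a generated sample and the training text is. The sketch below does this with Python's standard difflib module. The function name is hypothetical, and the approach is only a rough check that is practical for short texts, not a feature of this project.

```python
from difflib import SequenceMatcher


def longest_shared_span(training_text, sample_text):
    """Return the longest substring that appears verbatim in both texts.

    autojunk=False disables difflib's heuristic that ignores very
    frequent characters, so the match is exact.
    """
    matcher = SequenceMatcher(None, training_text, sample_text, autojunk=False)
    match = matcher.find_longest_match(0, len(training_text), 0, len(sample_text))
    return training_text[match.a:match.a + match.size]


if __name__ == "__main__":
    train = "the quick brown fox jumps over the lazy dog"
    sample = "a quick brown fox appeared"
    print(repr(longest_shared_span(train, sample)))
```

Shared spans that run to whole paragraphs, together with a training loss near zero, suggest the model has memorized a small dataset.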

Getting this error ZeroDivisionError: integer division or modulo by zero while training

I'm getting an error while training on this cell:
!PYTHONPATH=src ./train.py --dataset /content/gpt-2/goblet_book.txt --model_name '345M'
Traceback (most recent call last):
  File "./train.py", line 266, in <module>
    main()
  File "./train.py", line 244, in main
    feed_dict={context: sample_batch()})
  File "./train.py", line 220, in sample_batch
    return [data_sampler.sample(1024) for _ in range(args.batch_size)]
  File "./train.py", line 220, in <listcomp>
    return [data_sampler.sample(1024) for _ in range(args.batch_size)]
  File "/content/gpt-2/src/load_dataset.py", line 74, in sample
    self.chunks
ZeroDivisionError: integer division or modulo by zero
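
One possible cause, which the traceback alone does not confirm, is that the dataset path yielded no data, leaving a count of zero somewhere in the sampler. A minimal pre-flight check is sketched below. dataset_looks_usable is a hypothetical helper and not part of train.py.

```python
import os


def dataset_looks_usable(path):
    """Return True if path is an existing, non-empty regular file.

    This only guards against a missing or empty dataset file; it does not
    validate the contents or the encoding.
    """
    return os.path.isfile(path) and os.path.getsize(path) > 0


if __name__ == "__main__":
    # Path taken from the command in the report above.
    print(dataset_looks_usable("/content/gpt-2/goblet_book.txt"))
```

If the check returns False, the file may not have been uploaded to that location, or the upload may have produced an empty file.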
