GithubHelp home page GithubHelp logo

Comments (2)

williamSYSU avatar williamSYSU commented on May 28, 2024 3

I have added the LeakGAN instructor leakgan_instructor.py in `instructor/real_data', please refer to it.

For a custom dataset, the steps should be as follows:

  1. Preprocess dataset files, one sentence per line, ending with '\n'. Put them into dataset folder, training data should be placed in the root of dataset and testing data should be placed in the testdata.

  2. Please refer to the function init_dict() in utils/text_process.py, similarly add the code to load your custom dataset files. (This function will automatically initialing dictionary based on dataset args in the future.)

  3. Now you can design your own Instructor. The main differences of instructors between oracle data and real data are:

    • self.oracle_data should load training data (config.train_data) and add a self.test_data loading testing data (config.test_data).

    • Add BLEU metrics.

    • The positive samples for discriminator should be `self.oracle_data.target' now.

    • Remove the evaluation process of discriminator since there is no validation data.

    • Change the way of saving generator's samples. Before changing write_tensor to write_tokens, use tensor_to_tokens() to transform Tensor to word tokens.

See 'instructor/real_data/leakgan_instructor.py' for more details.

Please don't worry that these questions will bother me. These also help me to improve my code.

from textgan-pytorch.

ishalyminov avatar ishalyminov commented on May 28, 2024

Perfect, thanks a lot! 🙏

from textgan-pytorch.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.