GithubHelp home page GithubHelp logo

Comments (5)

kevinykuo avatar kevinykuo commented on May 11, 2024

Sampling is cheap so you could just sample a bunch of records then filter for the ones you're interested in.

from ctgan.

csala avatar csala commented on May 11, 2024

Sampling is cheap so you could just sample a bunch of records then filter for the ones you're interested in.

Exactly what @kevinykuo says: right now there is no way to fix variables when sampling, so the only option is to discard the invalid rows and resample until you have as many as you need.

Another option is, is you know what you need in advance, to create a subset of the original dataset with only the rows that contain the value(s) that you need and then fitting a specific CTGAN instance with that subset of rows so all the newly generated ones have those values fixed.

from ctgan.

pawelkuk avatar pawelkuk commented on May 11, 2024

Sampling is cheap so you could just sample a bunch of records then filter for the ones you're interested in.

Exactly what @kevinykuo says: right now there is no way to fix variables when sampling, so the only option is to discard the invalid rows and resample until you have as many as you need.

Another option is, is you know what you need in advance, to create a subset of the original dataset with only the rows that contain the value(s) that you need and then fitting a specific CTGAN instance with that subset of rows so all the newly generated ones have those values fixed.

What about setting condvec by hand while sampling? Shouldn't we then get the desired class most of the time?

from ctgan.

onacrame avatar onacrame commented on May 11, 2024

This is exactly the feature I’m looking for. Would be a great addition. More specifically, would like the ability to fix any value, discrete or continuous. I’m in financial services where certain combination of high level parameters in a given month (eg interest rate) we would want to fix but have other variables generated such as credit profile of the customer. We may have a high level plan for lending but the loans that get written will vary significantly due to variation in customer profiles.

I can’t just generate synthetic credit profiles because only certain financial products would accept certain credit profiles and it would be inefficient to build a model for each combination of features I want to treat as fixed.

I would think this could be handled by the conditioning vector?

from ctgan.

csala avatar csala commented on May 11, 2024

This feature has been covered in the PR #68

from ctgan.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.