
Comments (7)

brucearctor commented on June 16, 2024

@shahar1 sounds like you've got a decent idea/design in mind, which could be supported.

Are you interested in contributing? Feel free to start and include me on PRs, if that's the case.

from beam.

shahar1 commented on June 16, 2024

.take-issue


brucearctor commented on June 16, 2024

@shahar1 this level of customization might make sense.

Let's explore your specific concern for a moment. I might have others, but I imagine it's worth understanding your needs/use case:

  1. What quotas are getting hit that are problematic? Or, what are the specific billing charges you are looking to attribute elsewhere?
  2. Are you running on Dataflow? Or something else? [ not critical, but curious ]
  3. What read method? [ BQ Storage Read API? ]
  4. You want to run the compute in one GCP project, but use BQ from another? If this unloads, writes to GCS, and then reads into Dataflow [ that is another way that can occur ], do you intend to specify which project [ and bucket within it ] the data is written to?


brucearctor commented on June 16, 2024

Also, I wonder whether implementing this issue would help with #30747.


shahar1 commented on June 16, 2024

> @shahar1 this level of customization might make sense.
>
> Let's explore your specific concern for a moment. I might have others, but I imagine it's worth understanding your needs/use case:
>
>   1. What quotas are getting hit that are problematic? Or, what are the specific billing charges you are looking to attribute elsewhere?
>   2. Are you running on Dataflow? Or something else? [ not critical, but curious ]
>   3. What read method? [ BQ Storage Read API? ]
>   4. You want to run the compute in one GCP project, but use BQ from another? If this unloads, writes to GCS, and then reads into Dataflow [ that is another way that can occur ], do you intend to specify which project [ and bucket within it ] the data is written to?

Thanks for your response! Here are the answers to your questions:

  1. I'd like to attribute query execution to another project. In my case, BigQuery is in project A, and beam.io.ReadFromBigQuery runs in project B - I'd like to bill project B for the queries (for that matter, it could also be project C).
  2. We use Dataflow and the direct runner (when implementing, it should preferably be a general solution rather than a Dataflow-specific one).
  3. I use both methods - if I'm not mistaken, in both cases you could set the quota_project_id via ClientOptions.
  4. Yup, you got the idea correctly :)
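
To illustrate point 3, here is a minimal sketch of passing a quota project to a BigQuery client through client options. The helper names `quota_client_options` and `make_bigquery_client` and the project IDs are hypothetical, and this only covers the plain `google-cloud-bigquery` client; how `ReadFromBigQuery` would expose such a setting is still an open design question:

```python
from typing import Optional


def quota_client_options(quota_project_id: Optional[str] = None) -> dict:
    """Build a client_options mapping (hypothetical helper).

    The key mirrors the `quota_project_id` attribute of
    google.api_core.client_options.ClientOptions.
    """
    options = {}
    if quota_project_id:
        options["quota_project_id"] = quota_project_id
    return options


def make_bigquery_client(project: str, quota_project_id: Optional[str] = None):
    """Create a BigQuery client with an optional quota project (hypothetical helper).

    Assumes google-cloud-bigquery is installed and default credentials are
    available; the import is local so the helper above works without it.
    """
    from google.cloud import bigquery

    return bigquery.Client(
        project=project,
        client_options=quota_client_options(quota_project_id),
    )


if __name__ == "__main__":
    # Only builds the options mapping; no network call or credentials needed.
    print(quota_client_options("project-b"))  # {'quota_project_id': 'project-b'}
```

With credentials in place, `make_bigquery_client("project-a", quota_project_id="project-b")` would be the call matching the project A / project B setup described in point 1, assuming the credentials are allowed to use project B as a quota project.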

As for #30747 - it is related, but the implementation might differ somewhat, as GCS is a project's resource rather than a service.


shahar1 commented on June 16, 2024

> @shahar1 sounds like you've got a decent idea/design in mind, which could be supported.
>
> Are you interested in contributing? Feel free to start and include me on PRs, if that's the case.

I'd be happy to try!
First I need to learn how development works here (I'm coming from the Airflow community).


brucearctor commented on June 16, 2024

This should be pretty good --> https://github.com/apache/beam/blob/master/CONTRIBUTING.md

If you find a problem [ or something that is outdated ], let's work through it and fix the docs along the way.

