GithubHelp home page GithubHelp logo

Comments (9)

rhshadrach avatar rhshadrach commented on July 30, 2024

I like family. I also like flavor, no more or less. Some more alternatives I don't like (but others might): system, set, base, strain.

from pandas.

lithomas1 avatar lithomas1 commented on July 30, 2024

I've seen some user confusion [citation needed] stemming from the term "backend" in the "dtype_backend" parameter. It gives the incorrect impression that behaviors are the same across backends, just with different implementations or performance characteristics.

Is there evidence that users would not be confused if it was called e.g. dtype_family?

I feel like this is something that would happen eventually as long as the numpy/arrow dtypes shared names (e.g. "int64" vs "int64[pyarrow]").

from pandas.

jbrockmendel avatar jbrockmendel commented on July 30, 2024

Is there evidence that users would not be confused if it was called e.g. dtype_family?

I don't understand the question. We haven't used any other terms... "backend" has connotations of swappability and an invariant frontend that wouldn't apply to other terms.

from pandas.

lithomas1 avatar lithomas1 commented on July 30, 2024

I'm asking since renaming a parameter causes a lot of code churn.

For me, personally, it is not clear what a dtype family or flavor is, while dtype backend gives me the understanding that the underlying arrays backing my Series/DataFrame is arrow/numpy/whatever. So, IMO, dtype_backend is more clear than the other terms.

I've seen some user confusion [citation needed] stemming from the term "backend" in the "dtype_backend" parameter. It gives the incorrect impression that behaviors are the same across backends, just with different implementations or performance characteristics.

I guess the [citation needed] part was what I was asking for in my previous question. If you could dig that up, that'd be really helpful.

from pandas.

jbrockmendel avatar jbrockmendel commented on July 30, 2024

I'm asking since renaming a parameter causes a lot of code churn.

Totally reasonable concern. My thought is that ATM this is used relatively little, so is easier to change than it would be after #58141 and related.

I guess the [citation needed] part was what I was asking for in my previous question. If you could dig that up, that'd be really helpful.

Also fair. I think there was a lot of confusion surfaced in https://www.reddit.com/r/Python/comments/11fio85/we_are_the_developers_behind_pandas_currently/ about what "backend" means. I remember other things on hackernews that I'm not inclined to dig up. Searching our issues for "backend" i see #53154 has a user expecting identical behavior. I'll update this as I find more of these, as I think "incorrectly expecting identical behavior" is a common complaint.

from pandas.

mroeschke avatar mroeschke commented on July 30, 2024

Is there evidence that users would not be confused if it was called e.g. dtype_family?

I also initially agree with @lithomas1's question here. I'm not fully convinced (yet) that renaming a keyword argument would be able to convey "pick a dtype implementation that is not fully equivalent to the other options". I am open to there being a better term though.

from pandas.

jbrockmendel avatar jbrockmendel commented on July 30, 2024

#58307 another case of incorrectly expecting identical behavior

from pandas.

jorisvandenbossche avatar jorisvandenbossche commented on July 30, 2024

It gives the incorrect impression that behaviors are the same across backends, just with different implementations or performance characteristics.

Personally, I think this is actually the correct impression. It's how I think most users should think about the backends (so in that sense I don't have a problem with the current naming).

I know that in practice this of course not correct in all cases right now, but it could be what we want it to be eventually. And so whenever we get a report about different behaviours, it might be something we should fix.

It's something that we should discuss and spell out, tough, what we generally think the expectations should be about those different backends (maybe as part of the PDEP discussion in #58455)

from pandas.

jbrockmendel avatar jbrockmendel commented on July 30, 2024

Reading the room, I'm going to learn to live with users continuing to be confused by this name. Closing.

from pandas.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.