Comments (7)
take
from pandas.
This seems to be a behavior change in the default C engine only.
The Python engine is also incorrect (it returns int64), but it was already doing that in v2.2.2.
thanks for the report - we should probably first do a git bisect to see which commit introduced the bug
running it now: https://www.kaggle.com/code/marcogorelli/pandas-regression-example?scriptVersionId=187955574
@MarcoGorelli Hi Marco, I tried rerunning the regression example with just data = io.StringIO("345.5 519.5 0)
and it always lands at the latest commit, which is incorrect. Can you help take a look?
thanks @xouyang1 - I was missing an escape character in the script, and so it wasn't running properly
having tried again, I'm getting #57943 as the commit that introduced it, but I don't know if that seems reasonable, haven't looked closer yet
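For anyone repeating the bisect: a script that crashes for an unrelated reason (such as the escaping problem above) can send `git bisect run` to the wrong commit. A minimal sketch of one way to guard against that, assuming the reproducer is wrapped in a callable; `classify` and the callable are hypothetical names, not something from the linked notebook. The only facts relied on are the exit codes `git bisect run` understands: 0 for good, 1 to 127 (except 125) for bad, and 125 for "skip this commit".

```python
# Exit codes understood by `git bisect run`.
GOOD, BAD, SKIP = 0, 1, 125


def classify(check) -> int:
    """Map the outcome of a reproducer to a `git bisect run` exit code.

    `check` is a hypothetical zero-argument callable that returns True
    when the behavior is correct and False when the bug is present.
    """
    try:
        return GOOD if check() else BAD
    except Exception:
        # The reproducer itself failed (typo, build problem, ...), so this
        # commit says nothing about the bug: ask git to skip it instead of
        # recording a misleading good/bad verdict.
        return SKIP


# In a real bisect script this would end with something like:
#     import sys; sys.exit(classify(my_reproducer))
```

Separating "bug present" from "script broken" this way means a faulty script shows up as a run of skipped commits rather than as a confident but wrong answer.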
It looks like the right commit, specifically this change to pandas/core/indexes/base.py ded256d#diff-c34a28314fc8cb12f0d2aa710f1c15f06cdfe3e48f03e658f01f99a43d4f5d09
- Before: does not convert to range (np_sequence.dtype.kind is 'u' for uint, so it returns the sequence as is): pandas/pandas/core/indexes/base.py, line 7164 in cf40e56
- After: converts to range, which drops the dtype association: pandas/pandas/core/indexes/base.py, line 7170 in ded256d
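To illustrate the before/after difference, here is a rough sketch of the kind of guard described above. It is not the pandas implementation: `to_range_if_signed_int` is a hypothetical helper written for this comment, and it only assumes that NumPy reports `dtype.kind` as 'i' for signed and 'u' for unsigned integers.

```python
import numpy as np


def to_range_if_signed_int(values: np.ndarray):
    """Hypothetical guard: only evenly spaced *signed* ints become a range."""
    # Anything that is not a signed integer array (including uint, kind 'u')
    # is returned untouched, so its dtype is preserved.
    if values.dtype.kind != "i" or len(values) < 2:
        return values
    step = values[1] - values[0]
    if step == 0 or not np.all(np.diff(values) == step):
        return values
    return range(int(values[0]), int(values[-1]) + int(step), int(step))


unsigned = np.array([0, 1, 2], dtype="uint64")
signed = np.array([0, 1, 2], dtype="int64")

print(to_range_if_signed_int(unsigned).dtype)  # uint64, returned as is
print(to_range_if_signed_int(signed))          # range(0, 3)

# A range carries no dtype of its own; turning it back into an array
# yields a signed integer dtype, which is how uint information gets lost
# if unsigned input is allowed through the same conversion.
print(np.asarray(range(3)).dtype.kind)         # i
```

If the guard is relaxed to accept kind 'u' as well, the unsigned input would also come back as a plain range, which matches the "After" behavior described in the list above.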
cool, thanks for checking (cc @mroeschke just fyi, no blame)