Comments (2)
Yes, if no processing is done on the strings then uppercase and lowercase characters are essentially treated as different characters and that is why you'd get different results. (Levenshtein distance does not really consider these as being the same character)
For this reason default pre-processing (which transforms them to lowercase) is performed on input strings on all the ratios except the partial/simple ratios (which are the ones you are using) in case someone desires this functionality. I will consider adding an option though for simple / partial ratios to have pre-processing as well in the next version just to be consistent.
Thanks for the idea!
from fuzzywuzzy.
Thanks for the explanation! I am not that familiar with Levenshtein distance (adding to my reading list ;) so I did not know that capitalization matters haha! I do not think any code changes are necessary and a "Note: capitalization matters!" would suffice. Thanks again for the great work!
from fuzzywuzzy.
Related Issues (20)
- Could not find library in gradle HOT 2
- Difference in PartialScore between Java and Python Implementations
- module me.xdrop.fuzzywuzzy cannot be resolved to a module
- Is there somewhere I can find out what the different methods do? HOT 2
- Wrong score in Partial Ratio
- v1.3.0 I can't find the .pom file HOT 10
- Difference in extractOne results compared to Python version HOT 2
- Inconsistent results from extractOne and extractTop
- FuzzyWuzzy MIT? HOT 11
- Is there a security scanning performed on this project?
- PartialRatio issue HOT 4
- Convert codes to apex class HOT 4
- Bug in search
- NoClassDefFoundError
- Using custom object Instead of String would lead to performance issue? HOT 2
- How to set the scorer like the python fuzzywuzzy? HOT 6
- Can we priortize results to push first appears over top HOT 1
- Incompatibility with the Python version in handling underscores HOT 3
- Still Incompatibility with the Python Version HOT 4
- Difference between java and python implementation: Spoiler, the problem is the round HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from fuzzywuzzy.