-
Notifications
You must be signed in to change notification settings - Fork 68
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
TFIDF min_similarity not applied #49
Comments
You are using the library correctly but it seems that the
|
I will try that, thanks for the quick response! |
Hello Maarten, Thanks, |
@nitindabadghav Could you provide a bit more information? What version do you use? Can you share your code? Have you tried the answer I provided above? Etc. |
When using the
TFIDF
model themin_similiary
parameter seems not to be applied to the results.Minimal Example that reproduces the problem (polyfuzz 0.4.0):
When running the code the following output is generated, but the rows 4 and 7 should have a Similarity score of 0, if I understand the documentation correctly.
I would expect the rows with a Similarity of < 0.9 to have a Similarity of 0 and a
To
value of None.Output:
In case I'm using the library wrong, how would I be able to get only results with a similarity higher than
0.9
?The text was updated successfully, but these errors were encountered: