statisticianinstilettos / recmetrics Goto Github PK

View Code? Open in Web Editor NEW

561.0 561.0 100.0 5.85 MB

A library of metrics for evaluating recommender systems

License: MIT License

Jupyter Notebook 98.56% Python 1.39% Makefile 0.03% Dockerfile 0.02%

recmetrics's People

Contributors

Stargazers

Watchers

Forkers

sibyllalee1688 42eric huytu7 rizplate altaha sctjgz siripinyoch 406185 danielray54 abhishekshingadiya martinthoma itsoum n00b001 higgsboson1209 silvrduck jbdatascience diogoflorencio aqqin helma-t swaroop-p yufenlin i-hun hsanchez frandis ddalgrande yz3007 luna275 rohitroposo hongzhili ellcrane mpstewart1 ncarrasco88 gregwchase scitator t-kabaya rutujajadhav marcelomedel ll3351 pro-ady phoitack kiminh j-greer sandy4321 meikuam ml-d vvandriichuk a-k-coder peterisfar jehilm spencerjsmall mridul98 shainaraza qobuz hrymar smerkme nikhilsingh13 francisdinh tom-cat-god suhail-shaik-zz remanitinpo hegdemohan yejiahaoye ibuda vinayasathyanarayana margaretnm auberthh maggie860326 celina-07 nejood2019 arita37 sab-6 longshen931 izenish hadryan zhipeng-luo claudiawestby rafaelaraujo-hotmart domagic elizabethzq tselmeg-c mehra-rohit techsupportmx1 kineticcookie lijinfeng0713 mrkaye97 john-james-ai noopurdhawan aashokanuno kenho211 tvkpz shelly77 chrisjkuch alex-bujorianu ashishshrivastava14 nasibasevar laowangzi mosthumble sanders003

recmetrics's Issues

any method for AP@k or AP@K calculation?

Is there any method for AP@k or AP@K calculation?

Coverage over 100%

In the example bellow, the coverage measured exceeds 100%, which does not make sense.

This happens when items that are not listed on the catalog are recommended.

> from rcmetrics import prediction_coverage
> prediction_coverage([['x', 'y'], ['w', 'z']], catalog=['w', 'x', 'y'])
133.33

Integration with Deep Learning Based Frameworks

Is there any way to integrate this with recommender system frameworks that involve more deep learning-based algorithms such as PyTorch etc.? Sci-Kit Learn's with Surprise doesn't really support such algorithms

Is surprise really required?

First of all: this package looks great! It's exactly what I need for some small projects, so thanks for putting it out there!

I'm looking at the setup.py, and it lists surprise as a requirement. I don't see it imported anywhere in the package though, so I'm wondering if it can be removed? I get that it's useful for the example notebook, but that wouldn't be included in the pip install anyway. (I might suggest making surprise an extras_require if you want to keep it in there for demo purposes.)

If you're open to some packaging changes along these lines, I'd be happy to send a PR your way.

Installation issues

Hi! Have been trying to install recmetrics with "pip install recmetrcis", keep getting an error "ERROR: Could not build wheels for scikit-learn, which is required to install pyproject.toml-based projects". I'm using Windows, Python version 3.9.7, pip all upgraded. pip freeze shows that scikit-learn is actually already installed: "scikit-learn==0.24.2". I've also tried installing with pip from git, same result. Any ideas what I could still try?

mapk shouldn't require actual and predicted have the same length

This assertion check is incorrect. The actual parameter as used in _apk is expecting a list of true items and the predicted parameter is expecting a list of predicted items that can be true or false. See an example below where only A-C are true items and the prediction can be longer than the true list because it can contain false items.

recmetrics/recmetrics/metrics.py

Lines 236 to 237 in b21222d

 if len(actual) != len(predicted): 

 raise AssertionError("Length mismatched")

true_items = ["A","B","C"]
prediction = ["A","Z","B","X"]
metrics.mapk(actual=true_items, predicted=prediction, k = 3)

License

This is missing a license. You can use https://tldrlegal.com/ for an overview. The top-3 are MIT, BSD and GPL (see my analysis).

The simplest way to add it is in the setup.py as license='MIT' or similar.

Is this library actively maintained

Dear @statisticianinstilettos,

Is this library actively maintained?

Bests,

Benedek

Unused Requirement

Surprise is listed as a module dependency but is not used in metrics or plots. Might be worth removing the dependency - especially since it requires additional built tools (Visual C++) and thus may throw unnecessary errors.

TypeError on class_separation_plot of example notebook

I attached the error below

---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
<ipython-input-30-05160122655c> in <module>
----> 1 recmetrics.class_separation_plot(pred_df, n_bins=45, class0_label="True class 0", class1_label="True class 1")

TypeError: class_separation_plot() got an unexpected keyword argument 'class0_label'

ImportError: cannot import name 'signature'

Importing the repository does not work. I am getting the following error ImportError: cannot import name 'signature'

pip install recmetrics
import recmetrics as re

The problem is this import from sklearn.utils.fixes import signature.

Implement MAP@k

MAP@k implementation linked in the documentation (https://github.com/benhamner/Metrics) has not been updated for 7 years and has bugs in MAP@k implementation (e.g. benhamner/Metrics#51, benhamner/Metrics#57). It would be really useful to have MAP@k implementation in recmetrics. Would it be possible to implement it? It would be almost identical to the existing mark() function.

module 'recmetrics' has no attribute 'mapk'

As described in the title.

ImportError: cannot import name 'signature'

Getting following error when importing recmetrics.

Cross-referencing other libraries

There are dead libraries that might have useful features, it would be good to take a peak.

Slows down even if x_labels=False

First, thank you for providing this great libary! I faced an issue on rather large data sets, in particular when option x_labels is set to False. Suggests to insert an
if x_labels == True: before, similar as it is done on bottom of function. Because, whe I don't want to plot labels, why should plt.xticks(x) be executed?

recmetrics/recmetrics/plots.py

Line 44 in 082536a

plt.xticks(x)

module 'recmetrics' has no attribute 'prediction_coverage'

Hi there
I am trying to run example notebook. But I am getting 'module 'recmetrics' has no attribute 'prediction_coverage'' and "attribute error: module 'recmetrics' has no attribute 'catalog_coverage'"

any pointer or suggestion.

Thanks in advance

Catalog coverage is changed when using the exact same input arguments

personalization() has explosive memory requirements due to pairwise comparison

On my system (16gb ram), a list of 10k recommendations will run. A list of 50k will crash out. I'd like to try to understand the personalization score across my entire hypothetical customer base 250k+.

Is there a way to chunk the scipy.sparse.csr_matrix and iteratively calculate the cosine similarity to avoid holding the whole thing in memory?

Metric: familiarity

A new metric familiarity would be a nice addition.
Paper:https://link.springer.com/article/10.1007%2Fs11257-011-9115-7

I am working on my thesis and wouldn't mind adding the feature to this project.

Personalization metric calculation optimization

Hi @statisticianinstilettos,

kudos for a great tool!
I would like to propose an optimization for calculating Personalization Metric here:

#get indicies for upper right triangle w/o diagonal
upper_right = np.triu_indices(similarity.shape[0], k=1)

#calculate average similarity
personalization = np.mean(similarity[upper_right])
return 1-personalization

There is no need to get the upper triangle indices, as the cosine similarity is a symmetric distance.
I will follow up with a pull request for this.

dev dependencies breaking installation

» poetry add recmetrics   
Using version ^0.1.5 for recmetrics

Updating dependencies
Resolving dependencies... (0.2s)

Because no versions of recmetrics match >0.1.5,<0.2.0
 and recmetrics (0.1.5) depends on pytest-cov (>=2.10.1,<3.0.0), recmetrics (>=0.1.5,<0.2.0) requires pytest-cov (>=2.10.1,<3.0.0).
So, because jewel-ml-models depends on both recmetrics (^0.1.5) and pytest-cov (^4.0.0), version solving failed.

pytest-cov is a development dependency, it shouldn't break like this. You can easily solve this by installing pytest-cov as development dependency. There are others as well... ipython maybe? Jupyter and twine too

Unable to import recmetrics

I am working on a recommendation engine using collaborative filtering and wanted to try the metrics provided by recmetrics. Here, the error I get trying to import the package (version 0.0.12).

---------------------------------------------------------------------------
ImportError                               Traceback (most recent call last)
<ipython-input-309-301854677c00> in <module>
----> 1 import recmetrics
      2 
      3 recmetrics.long_tail_plot()

~/.virtualenvs/py3/lib/python3.6/site-packages/recmetrics/__init__.py in <module>
----> 1 from .plots import long_tail_plot, mark_plot, mapk_plot, coverage_plot, class_separation_plot, roc_plot, precision_recall_plot
      2 from .metrics import mark, coverage, personalization, intra_list_similarity, rmse, mse, make_confusion_matrix, recommender_precision, recommender_recall

~/.virtualenvs/py3/lib/python3.6/site-packages/recmetrics/plots.py in <module>
      5 from matplotlib.lines import Line2D
      6 from sklearn.metrics import roc_curve, auc, precision_recall_curve, average_precision_score
----> 7 from sklearn.utils.fixes import signature
      8 
      9 

ImportError: cannot import name 'signature'

remove from sklearn.utils.fixes import signature

deprecated

Update PyPI pacakge

The current version needs to be updated as the packages depends on deprecated/removed functionality from different dependencies.

	if len(actual) != len(predicted):
	raise AssertionError("Length mismatched")

statisticianinstilettos / recmetrics Goto Github PK

recmetrics's People

Contributors

Stargazers

Watchers

Forkers

recmetrics's Issues

Recommend Projects

Recommend Topics

Recommend Org

Jobs