pola-rs / valves



general functions for your data .pipe()-lines.

Makefile 3.26% Python 96.74%

data-science data-structures etl-pipeline


valves's Issues

Naive Bayes

Just for the heck of it: can we build a text-based Naive Bayes classifier in dataframes? That would mean splitting the text on whitespace to generate tokens.
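A minimal sketch of what this could look like, using pandas as a stand-in dataframe library (an assumption; the issue does not say which library or API to use). The function names `fit_token_table` and `predict`, the column names `text` and `label`, and the Laplace smoothing parameter `alpha` are all hypothetical.

```python
import numpy as np
import pandas as pd


def fit_token_table(df, text_col="text", label_col="label", alpha=1.0):
    """Return (log p(token | label), log p(label)) built with dataframe operations."""
    # Split on whitespace and put every token on its own row.
    tokens = (
        df.assign(token=df[text_col].str.lower().str.split())
        .explode("token")
        .dropna(subset=["token"])
    )
    # Count tokens per label; rows are labels, columns are tokens.
    counts = tokens.groupby([label_col, "token"]).size().unstack(fill_value=0)
    # Laplace smoothing so unseen (label, token) pairs keep a non-zero probability.
    smoothed = counts + alpha
    log_likelihood = np.log(smoothed.div(smoothed.sum(axis=1), axis=0))
    log_prior = np.log(df[label_col].value_counts(normalize=True))
    return log_likelihood, log_prior


def predict(text, log_likelihood, log_prior):
    """Pick the label with the highest log prior plus summed token log-likelihoods."""
    # Tokens that never appeared in training are ignored.
    tokens = [t for t in text.lower().split() if t in log_likelihood.columns]
    scores = log_prior.reindex(log_likelihood.index) + log_likelihood[tokens].sum(axis=1)
    return scores.idxmax()


train = pd.DataFrame(
    {
        "text": ["win money now", "win a prize", "meeting at noon", "lunch at noon"],
        "label": ["spam", "spam", "ham", "ham"],
    }
)
log_likelihood, log_prior = fit_token_table(train)
print(predict("win money", log_likelihood, log_prior))
```

The whole fit is a split, an explode, and a grouped count, which is the kind of pipeline that should translate to other dataframe libraries.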

user_item and item_item recommender tables

Given a log of weighted user-item interactions, can we generate an item-item recommendation table and a user-item recommendation table?

Kind of! We can calculate p(item_a | item_b) and p(item_a), which can be reweighted into a table with recommendations. We can also do something similar for users. After all, a user that interacted with items a, b and c will have a score for item x defined via:

p(item_x | user) = p(item_x | item_a, item_b, item_c)
                 \propto p(item_x | item_a) p(item_x| item_b) p(item_x|item_c)
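The two tables can be sketched as follows, again with pandas as a stand-in (an assumption). The column names `user`, `item`, `weight`, the function names, the choice to weight a co-occurrence by the product of the two interaction weights, and the `eps` floor for unseen pairs are all hypothetical. The sketch also assumes one row per (user, item) pair.

```python
import numpy as np
import pandas as pd


def item_item_table(log):
    """Estimate p(item_a | item_b) via a self-join of the log on user."""
    pairs = log.merge(log, on="user", suffixes=("_a", "_b"))
    pairs = pairs[pairs["item_a"] != pairs["item_b"]]
    # Assumption: a co-occurrence counts as the product of both weights.
    pairs = pairs.assign(w=pairs["weight_a"] * pairs["weight_b"])
    cooc = pairs.groupby(["item_b", "item_a"], as_index=False)["w"].sum()
    # Normalise within each conditioning item so the rows for item_b sum to 1.
    cooc["p_a_given_b"] = cooc["w"] / cooc.groupby("item_b")["w"].transform("sum")
    return cooc[["item_b", "item_a", "p_a_given_b"]]


def user_item_scores(log, table, eps=1e-6):
    """Score item_x per user as the sum of log p(item_x | item) over the user's items."""
    seen = log[["user", "item"]].drop_duplicates().rename(columns={"item": "item_b"})
    items = pd.DataFrame({"item_a": sorted(log["item"].unique())})
    # Every (user, seen item, candidate item) combination, via a constant join key.
    grid = seen.assign(_k=1).merge(items.assign(_k=1), on="_k").drop(columns="_k")
    grid = grid.merge(table, on=["item_b", "item_a"], how="left")
    # eps keeps never-co-occurring pairs finite in log space.
    grid["logp"] = np.log(grid["p_a_given_b"].fillna(0.0) + eps)
    scores = (
        grid.groupby(["user", "item_a"], as_index=False)["logp"]
        .sum()
        .rename(columns={"logp": "score"})
    )
    # Drop items the user already interacted with.
    already = seen.rename(columns={"item_b": "item_a"}).assign(seen=True)
    scores = scores.merge(already, on=["user", "item_a"], how="left")
    return scores[scores["seen"].isna()].drop(columns="seen")


log = pd.DataFrame(
    {
        "user": ["u1", "u1", "u2", "u2", "u3", "u3", "u4"],
        "item": ["a", "b", "a", "b", "a", "c", "b"],
        "weight": [1.0] * 7,
    }
)
table = item_item_table(log)
scores = user_item_scores(log, table)
print(scores.sort_values(["user", "score"], ascending=[True, False]))
```

Working in log space turns the product of conditionals into a grouped sum, which is easier to express as a dataframe aggregation.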

Benchmarks

As we compare different tools here, it would be cool to run benchmarks from this repo.

Maybe in CI, and later maybe even a dedicated runner.

These could then be shown on the website. I am already assuming here that polars does great. 😄

Weighted aggregation.

Things like weighted mean/sum/std might be good to support.
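One possible shape for this, sketched with pandas as a stand-in (an assumption). The function name `weighted_agg` and the output column names are hypothetical, and the standard deviation shown is the population (biased) form; other conventions exist.

```python
import numpy as np
import pandas as pd


def weighted_agg(df, by, value, weight):
    """Weighted sum, mean and (population) std of `value` per group of `by`."""
    tmp = pd.DataFrame(
        {
            by: df[by],
            "w": df[weight],
            "wx": df[weight] * df[value],
            "wxx": df[weight] * df[value] ** 2,
        }
    )
    # Plain grouped sums are enough; the weighted statistics follow from them.
    sums = tmp.groupby(by)[["w", "wx", "wxx"]].sum()
    mean = sums["wx"] / sums["w"]
    # E[x^2] - E[x]^2, clipped at 0 to guard against rounding error.
    var = (sums["wxx"] / sums["w"] - mean**2).clip(lower=0.0)
    return pd.DataFrame(
        {
            "weighted_sum": sums["wx"],
            "weighted_mean": mean,
            "weighted_std": np.sqrt(var),
        }
    )


df = pd.DataFrame(
    {"g": ["a", "a", "b"], "x": [1.0, 3.0, 10.0], "w": [1.0, 3.0, 2.0]}
)
out = weighted_agg(df, "g", "x", "w")
print(out)
```

Because everything reduces to grouped sums of precomputed columns, the same idea should carry over to any dataframe library with a group-by.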

Exponentially weighted functions

Someone asked if we could support this: https://pandas.pydata.org/pandas-docs/stable/user_guide/window.html#window-exponentially-weighted

I haven't looked at it much, but it seems like this should be possible with some cumulative expression kung fu.
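A rough sketch of the cumulative trick, using pandas as a stand-in so the result can be checked against pandas' own `ewm` (the function name `ewm_mean_cumulative` is hypothetical). It covers only the mean with `adjust=True` and no missing values: with decay d = 1 - alpha, the weighted mean at step t equals cumsum(x_j * d^(-j)) / cumsum(d^(-j)).

```python
import numpy as np
import pandas as pd


def ewm_mean_cumulative(x: pd.Series, alpha: float) -> pd.Series:
    """Exponentially weighted mean (adjust=True form) from two cumulative sums."""
    decay = 1.0 - alpha
    t = np.arange(len(x), dtype=float)
    # Weight d^(-j) for row j; the common factor d^t cancels in the ratio.
    w = pd.Series(decay ** (-t), index=x.index)
    return (x * w).cumsum() / w.cumsum()


x = pd.Series([1.0, 2.0, 3.0, 4.0])
ours = ewm_mean_cumulative(x, alpha=0.5)
reference = x.ewm(alpha=0.5, adjust=True).mean()
print(pd.DataFrame({"cumulative": ours, "pandas_ewm": reference}))
```

The weights d^(-j) grow without bound, so this form will overflow on long series; a practical version would need rescaling or a recursive formulation.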

Archive this project?

@koaning this project has not been updated in a while.

There are now other solutions available, like Polars plugins and projects like polars-ds.

Do you have any plans with this project? If not, I propose we archive it.

pypi: valve is taken

Looking at pypi, it seems valve is taken.

@ritchie46 Maybe valves instead?

Also, would we want to host this package on pypi?


valves's Issues

Switch subtitle of project

Docs, any preference?

Group-Based Sampling
