Comments (3)
对了,作者。这个是environment.py中load_data()的部分函数
from mesa.
你好,感谢你对我们工作的关注~
是的,默认是少数类为正类,imbalance ratio是期望多数类比少数类。
我有几点建议:
- imbalanced ensemble里面提供的集成学习方法默认使用决策树作为基分类器,正则化项也都是sklearn的默认设置(max_depth=None,boosting类方法max_depth=1),如果所有方法效果都不好的话,你可能需要先确定决策树适不适合你的分类任务。换句话说,要先找到一个合理的分类器,再去加对于类别不平衡的处理。可能的选项有:sklearn里的各种分类器,xgboost,lightgbm,catboost等。
- 除了集成方法,还可以尝试其他package里提供的预处理方法,比如imbalanced learn和smote-variants
- lightgbm处理离散特征很方便,并且有内部的优化,可以尝试一下
一切算法的基础都是数据,我还是建议先搞清楚自己的任务数据的特点,再确定重点需要解决的问题是什么:数据不平衡?噪声?类别重叠?特征稀疏?特征冗余?等等。问题常常会出现在没考虑到的地方。
链接:
imbalanced learn: https://imbalanced-learn.org/stable/
smote-variants: https://smote-variants.readthedocs.io/en/latest/getting_started.html
from mesa.
你好,感谢你对我们工作的关注~
是的,默认是少数类为正类,imbalance ratio是期望多数类比少数类。
我有几点建议:
- imbalanced ensemble里面提供的集成学习方法默认使用决策树作为基分类器,正则化项也都是sklearn的默认设置(max_depth=None,boosting类方法max_depth=1),如果所有方法效果都不好的话,你可能需要先确定决策树适不适合你的分类任务。换句话说,要先找到一个合理的分类器,再去加对于类别不平衡的处理。可能的选项有:sklearn里的各种分类器,xgboost,lightgbm,catboost等。
- 除了集成方法,还可以尝试其他package里提供的预处理方法,比如imbalanced learn和smote-variants
- lightgbm处理离散特征很方便,并且有内部的优化,可以尝试一下
一切算法的基础都是数据,我还是建议先搞清楚自己的任务数据的特点,再确定重点需要解决的问题是什么:数据不平衡?噪声?类别重叠?特征稀疏?特征冗余?等等。问题常常会出现在没考虑到的地方。
链接: imbalanced learn: https://imbalanced-learn.org/stable/ smote-variants: https://smote-variants.readthedocs.io/en/latest/getting_started.html
感谢回复!的确我有很多问题没有考虑到,对于您的建议我好好学习一下!
from mesa.
Related Issues (9)
- Could you please release the paper? HOT 1
- 请求修改代码以适配torch新的版本
- error : mesa.predict_proba
- Not able to run the model HOT 5
- 在多分类任务下报错 HOT 2
- Issue running model HOT 3
- Add Contributor HOT 4
- MESA训练集与其他模型的训练集不一样 HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from mesa.