datawhalechina / team-learning-data-mining

Stores the materials for the "data mining / machine learning" track of Datawhale's team learning program.

Jupyter Notebook 99.99% Python 0.01% Shell 0.01%

team-learning-data-mining's Introduction

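Introduction

This project stores the materials for the "data mining / machine learning" track of Datawhale's team learning program.

It mainly includes:

Notes

Open-source content related to team learning

  • team-learning: presents Datawhale's team learning schedule.
  • team-learning-program: stores the materials for the "programming, data structures and algorithms" track of Datawhale's team learning program.
  • team-learning-data-mining: stores the materials for the "data mining / machine learning" track of Datawhale's team learning program.
  • team-learning-nlp: stores the materials for the "natural language processing" track of Datawhale's team learning program.
  • team-learning-cv: stores the materials for the "computer vision" track of Datawhale's team learning program.
  • team-learning-rs: stores the materials for the "recommender systems" track of Datawhale's team learning program.
  • team-learning-rl: stores the materials for the "reinforcement learning" track of Datawhale's team learning program.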

team-learning-data-mining's Issues

Ensemble learning case study 1: a code error

Error location
# Handle some features by filling with the mode (after dropping missing values)
for i in range(1,9+1):
    data.loc[data['public_service_'+str(i)]<0,'public_service_'+str(i)] = data['public_service_'+str(i)].dropna().mode().values
for i in range(1,13+1):
    data.loc[data['trust_'+str(i)]<0,'trust_'+str(i)] = data['trust_'+str(i)].dropna().mode().values
Solutions

  1. Remove .values, e.g. data['public_service_'+str(i)].dropna().mode()
  2. Append [0], e.g. data['public_service_'+str(i)].dropna().mode().values[0]
  3. Convert to int, e.g. int(data['public_service_'+str(i)].dropna().mode().values)
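
As a minimal sketch of the scalar approach (option 2), the snippet below fills negative codes with a single mode value, so the assignment does not depend on how pandas handles an array on the right-hand side. The toy `data` frame and the helper `fill_negative_with_mode` are hypothetical and not part of the course material. Computing the mode over non-negative values only is also an assumption made here; the original code takes the mode of the whole column after `dropna()`.

```python
import pandas as pd

# Hypothetical toy data standing in for the survey table;
# negative codes mark invalid answers, as in the issue.
data = pd.DataFrame({
    "public_service_1": [60, 60, -2, 80, 60],
    "trust_1": [3, -8, 3, 4, 3],
})


def fill_negative_with_mode(df, column):
    """Replace negative codes in `column` with the mode of the valid values."""
    # Assumption: only non-negative, non-missing values count as valid.
    valid = df.loc[df[column] >= 0, column].dropna()
    # mode() returns a Series; take the first entry to get a scalar.
    mode_value = valid.mode().iloc[0]
    df.loc[df[column] < 0, column] = mode_value
    return df


for col in ["public_service_1", "trust_1"]:
    fill_negative_with_mode(data, col)
```

In the real notebook the same helper could be called inside the two `range` loops with `'public_service_'+str(i)` and `'trust_'+str(i)` as the column name.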

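Markdown syntax error prevents images from displaying

Location of the problem
image
From section 2.3 onward, every image reference in the Markdown uses the wrong path, so the images do not display.

For example, the first image in section 2.3 should be
![dscreteoverall](tupian/discreteoverall.PNG)
rather than
![dscreteoverall](discreteoverall.PNG)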
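
If many references need the same correction, a small script can apply it in bulk. The following is a sketch only: the function name `prefix_image_paths` and the regular expression are illustrative, and it assumes every bare image filename belongs in the `tupian/` folder shown in the example above.

```python
import re

# Matches Markdown image references of the form ![alt](path).
IMAGE_LINK = re.compile(r"!\[([^\]]*)\]\(([^)\s]+)\)")


def prefix_image_paths(markdown_text, folder="tupian"):
    """Prefix bare image filenames with `folder`; leave other paths alone."""
    def fix(match):
        alt, path = match.group(1), match.group(2)
        # Paths that already contain a directory (or are URLs) stay unchanged.
        if "/" in path:
            return match.group(0)
        return f"![{alt}]({folder}/{path})"

    return IMAGE_LINK.sub(fix, markdown_text)


broken = "![dscreteoverall](discreteoverall.PNG)"
fixed = prefix_image_paths(broken)
```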

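Boxplots in the feature engineering section do not match

Hello, my plots for the first two subsections match yours, but the boxplots in the feature engineering section are completely different from yours.
Even with your code and this dataset, the results still differ. Did you preprocess the data in some way before the feature engineering step?
image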

PermissionError: [Errno 13] Permission denied: 'result.zip'

When I run the following function, it raises the error stated in the title.

import os
import zipfile

def make_zip(source_dir='./result/', output_filename = 'result.zip'):
    zipf = zipfile.ZipFile(output_filename, 'w')
    pre_len = len(os.path.dirname(source_dir))
    source_dirs = os.walk(source_dir)
    print(source_dirs)
    for parent, dirnames, filenames in source_dirs:
        print(parent, dirnames)
        for filename in filenames:
            if '.npy' not in filename:
                continue
            pathfile = os.path.join(parent, filename)
            arcname = pathfile[pre_len:].strip(os.path.sep)  # relative path
            zipf.write(pathfile, arcname)
    zipf.close()

make_zip()
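
The issue does not say what caused the error. `[Errno 13]` on opening a file for writing usually means the target cannot be written: for example, the working directory is read-only, `result.zip` is open in another program, or a directory with that name already exists. The sketch below is one possible variant, not the course's code. It checks that the output directory is writable, uses a context manager so the archive is always closed, and is demonstrated in a temporary directory. The demo file names are made up, and it matches on the `.npy` suffix rather than on the substring as the original does.

```python
import os
import tempfile
import zipfile


def make_zip(source_dir, output_filename):
    """Zip every .npy file under source_dir and return the archive path."""
    out_dir = os.path.dirname(os.path.abspath(output_filename))
    # Fail early with a clear message if the target directory is not writable.
    if not os.access(out_dir, os.W_OK):
        raise PermissionError(f"Cannot write to directory: {out_dir}")
    # The context manager closes the archive even if an error occurs.
    with zipfile.ZipFile(output_filename, "w") as zipf:
        for parent, _dirnames, filenames in os.walk(source_dir):
            for filename in filenames:
                if not filename.endswith(".npy"):
                    continue
                pathfile = os.path.join(parent, filename)
                # Store paths relative to source_dir inside the archive.
                arcname = os.path.relpath(pathfile, source_dir)
                zipf.write(pathfile, arcname)
    return output_filename


# Demo in a temporary directory that is known to be writable.
with tempfile.TemporaryDirectory() as tmp:
    source = os.path.join(tmp, "result")
    os.makedirs(source)
    with open(os.path.join(source, "a.npy"), "wb") as f:
        f.write(b"demo")
    with open(os.path.join(source, "notes.txt"), "w") as f:
        f.write("skipped")
    archive = make_zip(source, os.path.join(tmp, "result.zip"))
    with zipfile.ZipFile(archive) as zf:
        names = zf.namelist()
```

If the variant works in a temporary directory but the original call still fails, the problem is likely the location or state of `result.zip` rather than the zipping logic.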
