Comments (5)
kataku sih buang
from dmc-2016.
Aku mau propose untuk coba impute productGroup dengan semi-supervised. http://scikit-learn.org/stable/modules/label_propagation.html
Caranya: returnQuantity di-drop sementara. productGroup yang missing ditandain -1, terus productGroup dijadiin y/label
Kalau emang ga ngefek, yaudah sekalian drop kolom productGroup. Kalau bisa, nanti kita semua jangan drop rows ya (takut pengaruh ke kolom prior probability, ya nggak?), kecuali kalau yakin emang aman untuk drop rows.
Ada yang mau ambil ini?
from dmc-2016.
tapi dipikir-pikir, untuk menebak productGroup, parameternya apa ya?
- price?
- rrp, asal nggak missing aja
- colorCode, tapi menurutku colorCode sangat tak terstruktur, kaya cenderung beda sendiri per item
parameter yang lain seperti orderID, customerID dkk technically sebagai parameter untuk menebak productGroup ya bisa-bisa aja. tapi agak dipertanyakan sih, takut bikin kacau
tapi benda seperti kemeja dan sabuk harganya bisa bervariasi sih. mungkin aja ada sabuk seharga kemeja
yaudahlah drop aja kali ya? atau coba impute dulu deh. bebas deh
from dmc-2016.
lho bukan articleID ya malah? namanya juga product group.
Entah deng pendapatku doang ._.
tapi gimana sih caranya ngimpute-- misal parameternya buat nebak productGroup, trs gmn caranya dia tau articleID x productGroupnya apa?
from dmc-2016.
udah fixed buang aja
from dmc-2016.
Related Issues (20)
- Papan Mading HOT 5
- voucherID: Missing value HOT 1
- rrp: missing values HOT 4
- n_estimators HOT 2
- selidiki colorCode HOT 4
- selidiki PolynomialFeatures HOT 5
- Tambahan feature extraction? HOT 1
- mean, variance, skewness, kurtosis HOT 1
- TPOT & for-in model2
- Ekstraksi Fitur HOT 2
- Seleksi Fitur Polynomial
- Return Probabilities HOT 1
- Transformasi Fitur
- Make make_datasets.py work
- Payment Method: Binarize atau probability? HOT 2
- average_article_price: harga rerata item, dan apakah harga item lebih rendah/tinggi dari biasanya HOT 1
- Benerin probabilty biar jadi kumulatif? HOT 1
- Apakah perlu membuat fitur sebanyak-banyaknya? (Lalu di-reduce)
- Pembelian produk mahal pada selasa/rabu
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from dmc-2016.