Comments (13)
hi, @GrahamMThomas and @Kamalesh3112 - I've been at a conference and wasn't able to reply earlier, but thank you, @GrahamMThomas for raising this issue. It's a very important fix to be made to the lesson! I also spoke to @shwars about it and we would indeed like to fix it as soon as possible. I'd recommend that we start by raising a PR to fix both the lesson text in English and the solution notebook. Then, I need to contact all translators to get the edits done across the board.
One thing to note is that it's ok to show the 'wrong way' of going about doing things as an example (in this case, converting to numeric values without using One Hot Encoding). But we should follow on immediately by showing the better way to solve the problem. So in this case we can add a lesson element about using this technique for much better results, so that the lesson can flow well.
We also can rethink the part at the bottom to give a better exercise.
thank you again!
from ml-for-beginners.
Hi ,
This is Kamalesh . I would like to work and solve this issue .Can you assign the issue to me and assist me in solving it ?
from ml-for-beginners.
I am unable to assign the issue to you, as I don't have the permissions. But I'm down to help as much as I can!
from ml-for-beginners.
Oh that's fine then . Could you be able to help me in this issue for further resolving it and can I start solving?
from ml-for-beginners.
I would like to work on this issue if it's not assigned, should I start working on this issue?
from ml-for-beginners.
@jlooper So shall I start solving this issue ?
from ml-for-beginners.
hi, I'm not assigning the issue to anyone, if someone would like to raise a PR I will review it
from ml-for-beginners.
@Kamalesh3112 @Stapan17 @GrahamMThomas
We had a discussion with @jlooper and agreed to change the lesson plan as follows:
- Building a linear regression based on date/month - this is likely to give low accuracy, because relationship is inherently non-linear. This will heavily build on code and graphs discussed in the previous lesson.
- Building a polynomial regression based on date/month - this should give better accuracy
- Showing how to add non-numeric features using one-hot encoding
I was planning to start working on this next week, but I guess if someone wants to start working right away and make a PR - you are more than welcome to!
from ml-for-beginners.
Maybe we can break up the work and start on the third bullet point Dmitry notes above, to make it easier on everyone (I'm concerned about propagating this to translations in particular so it will take some organization), then address the date/month topic
from ml-for-beginners.
Building a linear regression based on date/month - this is likely to give low accuracy, because relationship is inherently non-linear. This will heavily build on code and graphs discussed in the previous lesson.
- Building a polynomial regression based on date/month - this should give better accuracy
I implemented the two first bullet points on my side to see how much of an improvement that change can bring. I don't know if the accuracy of the model can be dependent of my computer or any other variable but here is what I got:
For the Linear Regression, I obtain a Model Accuracy of 0.019, which in my opinion makes sense because as you said, the relationship is non-linear. However, I expected a much larger accuracy with the Polynomial Regression. I got a Model Accuracy of 0.028 (an increase of 147%) but I think that this accuracy is a too low. As seen with @GrahamMThomas message, even the linear regression Package/Price gives a better model accuracy (0.35).
There is still a possibility that my code is incorrect so here is a Gist: https://gist.github.com/booleans-oss/ead3513b89c505132732c63975eea86c
After seeing the results, I tried to reproduce the lesson and do the regression for Package/Price. I got the following results:
This result reveals that there might be something different with either my code or my computer as we can see a bias since my results are ~50/60 smaller than the expected value. But this bias should not contradict the results of the regression Month/Price as it would not be a drastic change (due to the small nature of the values)
from ml-for-beginners.
@jlooper Can I work on this issue? It would be very helpful if you could point out which file need to be changed?
from ml-for-beginners.
@abetpal thanks for offering help! A fix for this issue has been proposed already, you can have a look once it's merged and see if you have any further suggestions.
from ml-for-beginners.
I'm going to close this issue as Dmitry has posted a fix which I will be merging shortly. thank you everyone!
from ml-for-beginners.
Related Issues (20)
- Lesson 1 HOT 1
- Graph ML HOT 6
- Beginner friendly ML from scratch HOT 2
- Machine l
- ML for beginners
- Build Web App with Gradio HOT 1
- R if in for loop error - how can I save selected model? HOT 5
- [TRANSLATIONS] Hindi HOT 21
- [TRANSLATIONS] Tamil HOT 1
- hi-telugu added
- Titles Guidelines HOT 2
- Pranjay OS
- outdated link in 1-Introduction/4-techniques-of-ML/README.md HOT 1
- Guide
- 1-Tools/notebook.ipynb is an empty file HOT 1
- Machine language for beginners
- Shall I add DBSCAN and Hierarchical Clustering Algo? HOT 3
- [TRANSLATIONS]
- Microsoft/ML-For-Beginners HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ml-for-beginners.