GithubHelp home page GithubHelp logo

jailukanna / 20-news-group-dataset Goto Github PK

View Code? Open in Web Editor NEW

This project forked from ajay6140/20-news-group-dataset

0.0 0.0 0.0 594 KB

For this project we will use the “The 20 Newsgroups” data set available at: http://qwone.com/~jason/20Newsgroups/ To simplify your project you are allowed to use the version of the “20 Newsgroups” data set that comes preloaded with scikit-learn library. To load is you can use the following code: You can learn more from the scikit-learn documentation at: https://scikit- learn.org/0.19/modules/generated/sklearn.datasets.fetch_20newsgroups.html#sklearn.datasets.fetch_ 20newsgroups Project Description The goal of the project is to train different classifier and test the outcomes trying to achieve as high accuracy as you can. Please make sure you are clear on which metrics you are using to measure how good is your classification. You can use as many techniques as you have learned in class. Especially may be helpful preparing and transforming the data. Selecting the appropriate classification algorithm with the optimal parameters will give you different results. Final deliverables: You can work as a group but must submit each individually the following project deliverable: 1. Project report (Word or PDF) that describes in detail your project activities, what specific steps you did, what challenges you encountered and how you resolved them. a. Describe the different models you ran and the different results that you achieved with each model b. Describe the different types of data preparation and transformation you performed and how the different data steps affected the outcomes #Loading the 20 Newsgroup data set. Example loading the training data. from sklearn.datasets import fetch_20newsgroups mydata_train = fetch_20newsgroups(subset='train', shuffle=True) c. Finally describe your best model and outcome, show the results you achieve and comment on your insights and learnings from the project 2. Jupyter Notebook in Python 3.6 or higher that can be executed without errors a. The code must be very well documented explaining and self-explanatory from the notebook b. The code must run without errors c. The code must contain at least three (3) different models – trained and tested – showing the results d. Any interesting data transformations must be highlighted and their impact on the outcome must be described and explained 3. Configuration and setup file with user instruction a. The file must briefly describe how I should set up and execute your notebook. The easier it is for me to run and evaluate your code, but better your project will be accepted

20-news-group-dataset's Introduction

BINDU G

Hi there 👋

I am BINDU G. My main interest lies in Machine Learning,NLP,Computer vision,Deep Learning,AI,Robotics. I am always curious to learn something new.

  • 🔭 I’m currently working on AI projects.

  • 🌱 I’m currently learning Deep Learning and Robotics.

  • 👯 I’m looking to collaborate on any Machine Learning,Deep Learning,NLP,Computer vision,RL,DRL,AI,Robotics projects.

  • 🤔 I’m looking for help with Research And Development(R&D).

  • 💬 Ask me about ML,NLP,OpenCV,chatbot,Tensorflow,Keras,Pytorch,Algo Trading,ROS.

  • 📫 How to reach me: Contact me directly on this mail [email protected]

  • 😄 Pronouns: You can give one.

  • ⚡ Fun fact: Playing Games.

20-news-group-dataset's People

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.