This project uses Python to predict the category of an Indonesian news article from its headline, using a supervised machine learning method (a support vector machine, SVM) with an accuracy of 88.23%. There are three .py files in this repository:
- Data scraping.py fetches the dataset from the CNN Indonesia website and exports the result (news headlines and their categories) to a CSV file.
- Trainig.py trains an SVM model on the data and reports its accuracy. It also exports the model and the TF-IDF data, which are used in the testing process.
- Testing.py takes input data and predicts its news category using the previously exported model.
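
The training and testing steps described above can be sketched roughly as follows. This is a minimal illustration and not the code from the repository: it assumes scikit-learn's `TfidfVectorizer` and `LinearSVC` (the actual scripts may use a different SVM variant or settings), uses `pickle` for the export, and replaces the scraped CSV with a tiny made-up dataset. The names `HEADLINES`, `CATEGORIES`, `train`, and `predict` are hypothetical.

```python
import pickle

from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.svm import LinearSVC

# Hypothetical toy data; the real project reads scraped headlines from a CSV file.
HEADLINES = [
    "Timnas Indonesia menang di laga final",
    "Pemain bulu tangkis raih medali emas",
    "Rupiah melemah terhadap dolar AS",
    "Harga saham bank naik tajam",
    "Peluncuran ponsel pintar terbaru",
    "Aplikasi kecerdasan buatan makin populer",
]
CATEGORIES = [
    "olahraga", "olahraga",
    "ekonomi", "ekonomi",
    "teknologi", "teknologi",
]


def train(headlines, categories):
    """Fit a TF-IDF vectorizer and a linear SVM on labeled headlines."""
    vectorizer = TfidfVectorizer()
    features = vectorizer.fit_transform(headlines)
    model = LinearSVC(random_state=0)  # assumed SVM variant
    model.fit(features, categories)
    return vectorizer, model


def predict(vectorizer, model, headline):
    """Predict the category of one headline with a fitted vectorizer and model."""
    return model.predict(vectorizer.transform([headline]))[0]


# Training step: fit, then serialize the vectorizer and model together.
vectorizer, model = train(HEADLINES, CATEGORIES)
saved = pickle.dumps((vectorizer, model))

# Testing step: restore both objects and classify a new headline.
loaded_vectorizer, loaded_model = pickle.loads(saved)
print(predict(loaded_vectorizer, loaded_model, "Timnas raih medali emas"))
```

The vectorizer is saved alongside the model because a new headline must be transformed with the same vocabulary and IDF weights that were fitted during training.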