GithubHelp home page GithubHelp logo

ayush-raj13 / webscrapideo Goto Github PK

View Code? Open in Web Editor NEW
3.0 2.0 3.0 423 KB

Text Summarizer, Flipkart Web Scraper and Online Video Downloader

Home Page: https://web-scrapideo.vercel.app

License: MIT License

HTML 2.76% JavaScript 69.79% CSS 1.95% Python 10.92% Jupyter Notebook 14.58%
beautifulsoup flipkart-scraper flipkart-scraper-python online-video-downloader react sumy sumy-api text-summarization text-summarizer text-summary video-downloader web-scraper web-scraping web-scraping-python youtube-dl yt-dlp yt-dlp-gui yt-dlp-wrapper hacktoberfest hacktoberfest-accepted

webscrapideo's Introduction

WebScrapideo

WebScrapideo is a React-based web application that provides various web scraping and data extraction features. It simplifies the process of extracting information from HTML pages, summarizing text, scraping data from Flipkart, and downloading online videos. The application utilizes popular libraries such as Tailwind CSS for styling, Framer Motion for animations, Sumy for text summarization, BeautifulSoup for web scraping, and yt-dlp for video downloading.

Demo Link

Click here

Features

The WebScrapideo website offers the following key features:

  1. Text Summarizer: Summarizes text and HTML pages using various methods such as LSA, TextRank, LexRank, Edmundson, Luhn, KL-Sum, and Reduction. The feature condenses lengthy documents and articles into concise summaries.

  2. Flipkart Web Scraping: Scrapes data from Flipkart based on user-defined selections. Users can choose specific data to scrape, preview the result, and download it in CSV format. The feature supports competitive pricing analysis, product research, market trends analysis, and more.

  3. Online Video Downloader: Allows users to download online videos from supported websites. The feature utilizes the yt-dlp library to retrieve download links for videos from platforms like YouTube, Vimeo, Twitter, Twitch, Udemy, and more.

Installation

To run the client locally, follow these steps:

  1. Clone the repository: git clone https://github.com/ayush-raj13/WebScrapideo.git
  2. Navigate to the project directory: cd client
  3. Install dependencies: npm install
  4. Start the development server: npm start
  5. Open the website in your browser: http://localhost:3000

To run the flask server locally, follow these steps:

  1. Clone the repository: git clone https://github.com/ayush-raj13/WebScrapideo.git
  2. Navigate to the project directory: cd flipkart-scraper (Same for text-summarizer and online-video-downloader)
  3. Install dependencies:
    1. pip install virtualenv
    2. virtualenv env (If you get error running virtualenv env, then run Set-ExecutionPolicy unrestricted in cmd with admin privileges.)
    3. .\env\Scripts\activate.bat
    4. pip install -r requirements.txt
  4. Create .flaskenv with following contents.
    FLASK_APP=app
    FLASK_ENV=development
    FLASK_RUN_PORT=3001
    FLASK_DEBUG=on
  5. Start the development server: flask run
  6. Open the website in your browser: http://localhost:3001

Technologies Used

The WebScrapideo website is built using the following technologies and libraries:

  • React: A JavaScript library for building user interfaces.
  • Tailwind CSS: A utility-first CSS framework for rapid UI development.
  • Framer Motion: A library for creating smooth animations and transitions.
  • Sumy: A library for text summarization using various methods.
  • BeautifulSoup: A Python library for web scraping and extracting data from HTML pages.
  • yt-dlp: A command-line program and library for downloading videos from various websites.

Contributing

Contributions to the WebScrapideo project are welcome! If you have any suggestions, bug reports, or feature requests, please open an issue or submit a pull request.

License

The WebScrapideo project is licensed under the MIT License. You can find more details in the LICENSE file.

Contact

For any inquiries or questions, please contact the project maintainer at [email protected].

webscrapideo's People

Contributors

amrahs02 avatar anubhav1206 avatar asryan11 avatar ayush-raj13 avatar

Stargazers

 avatar  avatar  avatar

Watchers

 avatar  avatar

webscrapideo's Issues

UI improvement

  • Currently the UI is very basic. Also, currently, it is not responsive. Before you start working on UI, make sure to remove any unnecessary line from index.html that are forcing user to view only in landscape mode in mobiles

Request for Local Storage-Persisted Dark Mode and System Default Integration

  1. Add local storage support for maintaining user-selected dark mode preferences across page refresh.
  2. Integrate an option to automatically follow the browser's system default theme for a seamless user experience.
  3. Enhance user interaction and satisfaction by implementing these features.

Can I work on this?
Please assign hacktoberfest-accepted ,.... labels

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.