bbhunter / succinct

This project forked from codingo/succinct

A tool to create a text summary of a provided list of websites

License: GNU General Public License v3.0

Go 100.00%

Succinct

This Go script takes a list of URLs and outputs the most common words for each URL along with a summary paragraph describing the website. The script uses the github.com/JesusIslam/tldr library for summarization.
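
The word-counting step described above can be sketched roughly as follows. This is a minimal illustration using only the standard library, not the project's actual code: the function name `topWords`, the tokenization rule (split on anything that is not a letter or digit) and the alphabetical tie-break are all assumptions made for the example.

```go
package main

import (
	"fmt"
	"sort"
	"strings"
	"unicode"
)

// topWords returns the n most frequent words in text, skipping any word
// present in exclude. Hypothetical helper, not taken from the project.
func topWords(text string, exclude map[string]bool, n int) []string {
	// Lowercase, then split on every rune that is not a letter or digit.
	words := strings.FieldsFunc(strings.ToLower(text), func(r rune) bool {
		return !unicode.IsLetter(r) && !unicode.IsDigit(r)
	})

	// Count occurrences of each word that is not excluded.
	counts := make(map[string]int)
	for _, w := range words {
		if !exclude[w] {
			counts[w]++
		}
	}

	unique := make([]string, 0, len(counts))
	for w := range counts {
		unique = append(unique, w)
	}

	// Most frequent first; ties are broken alphabetically for stable output.
	sort.Slice(unique, func(i, j int) bool {
		if counts[unique[i]] != counts[unique[j]] {
			return counts[unique[i]] > counts[unique[j]]
		}
		return unique[i] < unique[j]
	})

	if n < 0 {
		n = 0
	}
	if n < len(unique) {
		unique = unique[:n]
	}
	return unique
}

func main() {
	exclude := map[string]bool{"the": true, "and": true}
	text := "The cat and the dog. The cat sleeps; the dog barks and the cat purrs."
	fmt.Println(topWords(text, exclude, 2)) // prints [cat dog]
}
```

In the real tool the text would come from the fetched page rather than a literal string.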

Usage

  1. Install the required libraries:

    go get github.com/JesusIslam/tldr
    go get github.com/PuerkitoBio/goquery
    
  2. Build the script:

    go build main.go
    
  3. Run the script with the required flags:

    ./main -t <targets-file> [-e <exclude-file>] [-n <number-of-common-words>] [--threads <number-of-threads>] [-s <number-of-summary-sentences>]
    
    • -t or --targets: Targets file, one URL per line (required)
    • -e or --exclude: Exclude file, one word per line (optional)
    • -n: Number of most common words to output (optional, default 10)
    • --threads: Number of threads to use (optional, default 10)
    • -s: Number of sentences in the summary (optional, default 3)
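
For readers curious how flags like these might be wired up, here is a minimal sketch using Go's standard `flag` package. Whether the project uses this package is an assumption; the `options` struct and `parseOptions` function are hypothetical names. The short and long forms are registered as two flags bound to the same variable, and the defaults match the list above.

```go
package main

import (
	"errors"
	"flag"
	"fmt"
)

// options mirrors the documented flags. Hypothetical type for illustration.
type options struct {
	targets   string
	exclude   string
	topN      int
	threads   int
	sentences int
}

// parseOptions parses args (without the program name) into options.
func parseOptions(args []string) (options, error) {
	var o options
	fs := flag.NewFlagSet("succinct", flag.ContinueOnError)

	// Short and long names share one variable, so either form works.
	fs.StringVar(&o.targets, "t", "", "targets file, one URL per line")
	fs.StringVar(&o.targets, "targets", "", "targets file, one URL per line")
	fs.StringVar(&o.exclude, "e", "", "exclude file, one word per line")
	fs.StringVar(&o.exclude, "exclude", "", "exclude file, one word per line")
	fs.IntVar(&o.topN, "n", 10, "number of most common words to output")
	fs.IntVar(&o.threads, "threads", 10, "number of threads to use")
	fs.IntVar(&o.sentences, "s", 3, "number of sentences in the summary")

	if err := fs.Parse(args); err != nil {
		return o, err
	}
	if o.targets == "" {
		return o, errors.New("a targets file is required (-t or --targets)")
	}
	return o, nil
}

func main() {
	// A fixed argument list keeps the example reproducible; a real program
	// would pass os.Args[1:] instead.
	opts, err := parseOptions([]string{"-t", "targets.txt", "--threads", "4"})
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Printf("%+v\n", opts)
}
```

The standard `flag` package treats `-threads` and `--threads` identically, which is why a single registration covers both spellings in this sketch.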

Example

Create a targets.txt file with a list of URLs to process:

https://example.com
https://example.org

Create an exclude.txt file with a list of words to exclude:

the
and
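
Both files use the same one-entry-per-line layout, so a single helper could load either. The sketch below is an assumption about how such parsing might look, not the project's code: `parseLines` and `toSet` are hypothetical names, and trimming whitespace and skipping blank lines are choices made for the example.

```go
package main

import (
	"fmt"
	"strings"
)

// parseLines splits file content into trimmed, non-empty lines.
// Trimming also removes a trailing "\r" from Windows line endings.
func parseLines(content string) []string {
	var lines []string
	for _, line := range strings.Split(content, "\n") {
		line = strings.TrimSpace(line)
		if line != "" {
			lines = append(lines, line)
		}
	}
	return lines
}

// toSet turns a word list into a set for constant-time exclusion checks.
func toSet(words []string) map[string]bool {
	set := make(map[string]bool, len(words))
	for _, w := range words {
		set[w] = true
	}
	return set
}

func main() {
	// In a real program the content would come from os.ReadFile("exclude.txt").
	words := parseLines("the\nand\n\n")
	fmt.Println(words) // prints [the and]

	exclude := toSet(words)
	fmt.Println(exclude["the"], exclude["cat"]) // prints true false
}
```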

Run the script:

./main -t targets.txt -e exclude.txt -n 10 --threads 10 -s 3
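
The threads option presumably caps how many URLs are processed at once. A bounded worker pool is one common way to do that in Go; the sketch below illustrates the pattern under that assumption and is not the project's implementation. `processAll` is a hypothetical name, and the `process` callback stands in for the real fetch-and-summarize step.

```go
package main

import (
	"fmt"
	"sync"
)

// processAll applies process to every target using at most `threads`
// goroutines and returns the results in the same order as the input.
func processAll(targets []string, threads int, process func(string) string) []string {
	if threads < 1 {
		threads = 1
	}
	results := make([]string, len(targets))
	jobs := make(chan int)

	var wg sync.WaitGroup
	for w := 0; w < threads; w++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			// Each index is handed to exactly one worker, so writes to
			// results never touch the same element twice.
			for i := range jobs {
				results[i] = process(targets[i])
			}
		}()
	}

	for i := range targets {
		jobs <- i
	}
	close(jobs)
	wg.Wait()
	return results
}

func main() {
	targets := []string{"https://example.com", "https://example.org"}
	out := processAll(targets, 10, func(u string) string {
		return "processed " + u // placeholder for fetching and summarizing
	})
	for _, line := range out {
		fmt.Println(line)
	}
}
```

Writing results by index keeps the output order tied to the targets file rather than to whichever request finishes first.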

Contributors

codingo