GithubHelp home page GithubHelp logo

mendel5 / most-visited-websites Goto Github PK

View Code? Open in Web Editor NEW
1.0 2.0 0.0 41 KB

How to create a list of the most visited websites based on the browsing history of Mozilla Firefox

License: GNU Affero General Public License v3.0

browser extract firefox most-popular most-viewed most-visited sort websites

most-visited-websites's Introduction

most-visited-websites

How to get a list of the most visited websites when using Mozilla Firefox

Code

  • Open the browsing history in Mozilla Firefox by pressing Ctrl + Shift + H
  • Select the full history of Mozilla Firefox by pressing Ctrl + A, copy it with Ctrl + C and paste it to a .txt file, e.g. my_history.txt, with Ctrl + V
  • Make sure to get the full history, not just the past 7 days or so
  • Use one of the following commands on a GNU/Linux system:
cat my_history.txt | cut -d'/' -f3 | sort | uniq -cd | sort -nr > ./ranked-websites-no-protocol.txt
# Returns the domain names without their protocol
# Example:
# https://en.wikipedia.org/wiki/Linux
# is returned as
# en.wikipedia.org
cat my_history.txt | cut -d'/' -f1,2,3 | sort | uniq -cd | sort -nr > ./ranked-websites-with-protocol.txt
# Returns the domain names with their protocol
# Example:
# https://en.wikipedia.org/wiki/Linux
# is returned as
# https://en.wikipedia.org
cat my_history.txt | cut -d'/' -f3 | grep -ive "www\." | grep -e "\..*\." | sort | uniq -cd | sort -nr
# Returns the domain names (without their protocol) that have a subdomain and this subdomain is not "www."
# Example:
# https://www.wikipedia.org
# is not returned because its subdomain is "www."
# https://en.wikipedia.org/wiki/Linux
# is returned as "en.wikipedia.org" because its subdomain is "en."

Links

most-visited-websites's People

Contributors

mendel5 avatar

Stargazers

 avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.