GithubHelp home page GithubHelp logo

jibe-b / webmemex Goto Github PK

View Code? Open in Web Editor NEW

This project forked from worldbrain/memex

0.0 2.0 0.0 2.25 MB

The refactored version of the (Re)search-Engine

Home Page: https://worldbrain.io

License: Other

Makefile 0.41% JavaScript 90.27% CSS 8.82% HTML 0.50%

webmemex's Introduction

WorldBrain's WebMemex

A browser extension that lets you create, search and share your personal web of knowledge.

  • Full-Text Search pages and you visited and bookmarked (and later) also all the apps you use to organise your knowledge.
  • Manually and automatically create links between your content to organise & search your web by your associations
  • Share content recommendations, associations and qualifying metadata with your network and followers

Project status

Sorry to disappoint, but we do not have the promised solution complete yet. 😒

In its current released version, the you can full-text search the browsing history & bookmarks. Here you can download it: worldbrain.io/download

At the moment we are in the process of refactoring/porting the features of the WorldBrain (Re)search-Engine into the WebMemex, a project started by @treora

This project is in full development. You are most welcome to contribute! See Hacking below about how to build and run it from source and teach it new tricks.

Approach

As it stands now, it is a WebExtension (thus should work on most modern browsers), bundled by browserify with some babel ES6–7→ES5 compilation, that logs and stores visited pages in PouchDB, and provides a viewer for this data based on React+Redux. See Code Anatomy below for a full tour du code.

The project strategy is to combine and integrate features from other projects, and to factor out developed functionality into separate modules wherever it seems sensible. This extension could then be regarded as a bunch of different (but related) features a browser ought to have, bundled together for quick installation.

See our initial blog post for the feature roadmap of the next few months.

Contribute

We are happy about any kind of feedback, bug fixes, new feature ideas and tips? Give a shout. 📢

Hacking

Like playing with ES6, WebExtension browser APIs, React, Redux, PouchDB? Come play along! 🎉

Build and run it

  1. Clone this repo.
  2. Get Node/NPM and yarn (npm install -g yarn).
  3. Run yarn to install dependencies.
  4. Compile the source files using yarn watch for incremental builds or yarn build for standard dev builds.
  5. Load it in Firefox or Chromium/Chrome:
    • In Firefox (≥49): run yarn run firefox (or run [web-ext] directly for more control). Alternatively, go to about:debugging, choose 'Load Temporary Add-on', and pick extension/manifest.json from this repo.
    • In Chromium/Chrome: go to Tools→Extensions (chrome://extensions), enable 'Developer mode', click 'Load unpacked extension...', and pick the extension/ folder from this repo.

Automatic recompilation

If the steps above worked, running yarn run watch will trigger a quick recompilation every time a source file has been modified.

If you are testing in Firefox through yarn run firefox, the extension should also reload automatically. Otherwise, manually press the reload button in the extension list.

  • If your edits affected only the overview interface, just refresh/reopen it.
  • However, if you changed the background script, you have will to reload the extension: find it back in the list of temporary add-ons/extensions and click Reload.
  • If you changed the 'content_script', it seems browser-dependent whether newly opened pages will get the new version. Better reload the extension to be sure.

Code Anatomy

WebExtension parts

To comply with the anatomy of a WebExtension, it consists of three main parts (found in extension/ after compilation):

  • background.js always runs, in an 'empty invisible tab', listening for messages and events.
  • content_script.js is loaded into every web page that is visited. It is invisible from that web page's own scripts, and can talk to the background script.
  • overview/overview.html, with the resources in that folder, provides the main user interface.

The parts communicate in two ways:

  • Messaging through browser.sendMessage, usually done implicitly by using a remote procedure call (util/webextensionRPC.js).
  • Through the in-browser PouchDB database, they get to see the same data, and can react to changes made by other parts.

Besides these parts, browser-polyfill.js provides the promise-based browser API, that simply wraps Chromium/Chrome's callback-based chrome API, in order to make the same code run in different browsers (and to structure the callback mess).

Source organisation

To keep things modular, the source code in src/ is not split in exactly those the three parts of the extension, but is rather grouped by functionality. Some folders may end up being factored out into separate repos later on, or at some point perhaps even into separate but interacting browser extensions.

src/activity-logger/: activity logger

This logs every page visit in PouchDB. Soon it should also watch for user interactions, for example to remember which parts of a page you have read.

Currently, for every visit, a new page object is created in the database, to represent the visited page itself. This object should soon be deduplicated when the same page is visited multiple times. After creating a new page object, the next module is triggered to start analysing the page.

src/page-analysis: page analysis

This extracts and stores information about the page in a given tab, such as:

  • The plain text of the page, mainly for the full-text search index.
  • Metadata, such as its author, publication date, etcetera.
  • A screenshot for visual recognition.

src/overview/: overview

The overview is the user interface that opens in a tab of its own. It is built with React and Redux, which create a somewhat complex but nicely organised application structure.

See src/overview/Readme.md for more details.

src/search: document search

Functions for finding relevant knowledge in the user's memory. Currently provides a simple full-text keyword search through visited pages using pouchdb-quick-search. This can be improved in many ways, because we are searching through a person's memory, not just some arbitrary document collection. For example, we can use created assocations and browsing paths to better understand what one is looking for.

src/dev/: development tools

Tools to help during development. They are not used in production builds.

src/util/: utilities

Contains small generic things, stuff that is not project-specific. Things that could perhaps be packaged and published as an NPM module some day.

...: other stuff

The build process is based on yarn, that runs some npm commands specified in package.json, which in turn start the corresponding tasks in gulpfile.babel.js (transpiled by settings in .babelrc).

So much for the code tour. 💤 Any questions? ☝️

webmemex's People

Contributors

treora avatar poltak avatar urbancvek avatar reficul31 avatar aquibm avatar bohrium272 avatar jessicaml avatar blackforestboi avatar the-fallen avatar rohanhacker avatar chaitya62 avatar mukeshkharita avatar rajpratim21 avatar swissums avatar alxmiron avatar gastonche avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.