ubclaunchpad / sleuth

:shipit: Domain-specific search engine service backed by Django, Apache Solr, and Scrapy

License: MIT License

Python 97.77% Shell 2.23%
django scrapy search solr

sleuth's People

Contributors

bfbachmann, bobheadxi

sleuth's Issues

GenericPage parser

Keep improving our generic page parser, particularly its versatility and the quality of the data we keep.

Generate search suggestions for a given query

This ticket covers implementing something that uses Solr (or something better?) to return a list of suggestions for a given query. In certain cases we should return suggestions as part of the search results.
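As a starting point, here is a minimal sketch of how the backend might talk to Solr's suggester component. It assumes a suggest handler is configured at `/suggest` on the core; the core name `sleuth`, the dictionary name `defaultSuggester`, and both function names are hypothetical. The response shape handled below is the one we would expect from the suggester, so check it against the actual Solr configuration before relying on it.

```python
from urllib.parse import urlencode


def build_suggest_url(base_url, query, dictionary="defaultSuggester", count=5):
    """Build a request URL for a Solr suggest handler (assumed to live at /suggest)."""
    params = {
        "suggest": "true",
        "suggest.q": query,
        "suggest.dictionary": dictionary,  # hypothetical suggester name
        "suggest.count": count,
        "wt": "json",
    }
    return f"{base_url.rstrip('/')}/suggest?{urlencode(params)}"


def parse_suggestions(response, dictionary="defaultSuggester"):
    """Pull the suggested terms out of a decoded JSON suggest response."""
    by_query = response.get("suggest", {}).get(dictionary, {})
    terms = []
    for entry in by_query.values():
        for suggestion in entry.get("suggestions", []):
            terms.append(suggestion["term"])
    return terms
```

Keeping URL construction and response parsing separate from the HTTP call makes both easy to test without a running Solr instance.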

Display search results as graph view

Extract the set of results from the search API response and display them on the front end.

Ideally we should display the results as a tree, but if we don't have time to implement this by the end of this sprint, we can do a simple list view like Google.
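To make the tree idea concrete, here is a rough sketch of turning a flat result list into a nested tree, with a flattening helper for the list-view fallback. The field names `url`, `title`, and `links` are assumptions about what the search API returns, not its actual schema, and the sketch is written in Python for illustration even though the real view would live in the front end.

```python
def build_tree(results, root_url):
    """Nest results under root_url by following each result's outgoing links."""
    by_url = {r["url"]: r for r in results}
    seen = set()  # guards against cycles and duplicate nodes

    def visit(url):
        seen.add(url)
        node = {"url": url, "title": by_url[url].get("title", ""), "children": []}
        for child in by_url[url].get("links", []):
            # Skip links that point outside the result set or were already placed.
            if child in by_url and child not in seen:
                node["children"].append(visit(child))
        return node

    return visit(root_url)


def flatten(node):
    """Depth-first list of URLs, usable as the simple list-view fallback."""
    urls = [node["url"]]
    for child in node["children"]:
        urls.extend(flatten(child))
    return urls
```

Each result appears once, under the first parent that reaches it, so pages that link to each other do not cause infinite recursion.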

Provide a given list of search suggestions to the user dynamically as they type

This should use the existing suggestion functionality to stream search suggestions to the client as they type their query into the search input field. It might be a good idea to do this over websockets, since the query will be changing quickly as the user types, but there's also a good chance it won't be necessary.
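Whichever transport we pick, responses for older keystrokes can arrive after newer ones because the query changes so quickly. One way to handle that, sketched below under the assumption that each request carries an increasing id, is to accept only the response for the most recent request. The class and method names are hypothetical.

```python
class SuggestionSession:
    """Tracks the latest suggestion request and ignores stale responses."""

    def __init__(self):
        self._latest = 0
        self.suggestions = []

    def next_request_id(self):
        """Call once per keystroke-triggered request; returns the id to send with it."""
        self._latest += 1
        return self._latest

    def accept(self, request_id, suggestions):
        """Store suggestions only if they answer the most recent request."""
        if request_id != self._latest:
            return False  # an older query's response arrived late; drop it
        self.suggestions = list(suggestions)
        return True
```

The same check works over WebSockets or plain HTTP, so it does not commit us to either.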

Front end design

  • Results show up as bubbles, each connected to all of its linked children
  • Results can be deleted, and their children disappear with them
  • A child can be "focused" to pull up all of its children
  • Hovering over a result shows a blurb and other details
    IMG_1164.JPG
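The delete and focus behaviours above can be sketched as operations on a small parent/child structure. This is only a model of the interaction, written in Python for illustration rather than as front-end code. It assumes each result has at most one parent, and the `ResultGraph` name and its methods are hypothetical.

```python
class ResultGraph:
    """Minimal model of result bubbles and their linked children."""

    def __init__(self):
        self.children = {}  # result id -> list of child ids
        self.parent = {}    # result id -> parent id

    def add(self, result_id, parent_id=None):
        self.children.setdefault(result_id, [])
        if parent_id is not None:
            self.children.setdefault(parent_id, []).append(result_id)
            self.parent[result_id] = parent_id

    def delete(self, result_id):
        """Remove a result and, recursively, every child beneath it."""
        for child in list(self.children.get(result_id, [])):
            self.delete(child)
        parent_id = self.parent.pop(result_id, None)
        if parent_id in self.children:
            self.children[parent_id].remove(result_id)
        self.children.pop(result_id, None)

    def focus(self, result_id):
        """Return the ids of the children to pull up for a focused result."""
        return list(self.children.get(result_id, []))
```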

Backend design

  • Initialize Django project (this will probably be the first commit to our repo)
  • Come up with a few simple use cases for the backend
  • Create high level design for backend service

Front end setup

  • Come up with a few simple user stories for what the front end should do
  • Initialize front end framework (React)
  • Research React and create a high level design for how we'll modularize/develop the front end

Research Web Crawlers and come up with detailed Crawler Architecture

We want the crawler to be able to:

  • Connect to a generic DB (like Postgres) and insert web page info/content into the DB
  • Access web pages and download their content
  • Jump from one page to another
  • Connect to Solr and insert content
  • Maybe process data and connect directly to the DB, or just give raw data straight to the backend
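The core loop behind these requirements (download a page, store its content, follow its links) can be sketched with the standard library alone. This is not the Scrapy-based crawler the project describes, just an illustration of the flow. `fetch` is an assumed callable that returns a page's HTML or `None`, and the `store` dict stands in for the DB or Solr.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Collects the href of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(start_url, fetch, store, max_pages=100):
    """Breadth-first crawl: store each page's content and queue its links."""
    queue = deque([start_url])
    seen = {start_url}
    while queue and len(store) < max_pages:
        url = queue.popleft()
        html = fetch(url)
        if html is None:  # page could not be downloaded
            continue
        store[url] = html  # stand-in for inserting into the DB or Solr
        parser = LinkExtractor()
        parser.feed(html)
        for href in parser.links:
            target = urljoin(url, href)  # resolve relative links
            if target not in seen:
                seen.add(target)
                queue.append(target)
    return store
```

Passing `fetch` and `store` in as arguments keeps the download and storage decisions open, which matches the last bullet's question of whether the crawler writes to the DB itself or hands raw data to the backend.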
