im-bhatman / inverted-index-series

Code and references for the Inverted Index blog series on dev.to.

Home Page: https://dev.to/bhatman/inverted-index-the-story-begins-4l60

License: Apache License 2.0

Java 100.00%
search-engine invertedindex reverse-indexes

inverted-index-series

Reverse/Inverted Index - The Story begins

To read the original post, please visit this link on dev.to: https://dev.to/bhatman/inverted-index-the-story-begins-4l60

Almost 2 years ago I came across Elasticsearch and was astonished by the complex functionality it provides out of the box. Since then I have been digging into books, articles, and company tech blogs to find out how such search/IR systems work.

While going through many articles I came across "Reverse Indexes", and my love for them has grown every day since.

The resources available on the internet about inverted indexes and search systems are very limited. So I thought I would give my 5 cents back to the tech community through these articles, which explain the core concepts of reverse indexes and their implementation.

Only when we understand the underlying technology to the core can we really appreciate the "out of the box" functionality that we get from our frameworks/DBs. ✌️

We will cover the following topics in this series, one per post, with code examples wherever required:

Topics:

  • Introduction to Inverted Indexes
  • Inverted Index Component 1 - Dictionaries
  • Inverted Index Component 2 - Posting Lists
  • Tokenization
  • Structure of the Inverted Index
  • Creation of the Reverse Index
  • Storing Index Data on Disk
  • Retrieval and Ranking - Vector Space Model
  • Inverted Index Compression
  • Distributed Index Creation
  • Dynamic Index Creation
  • And More...
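
As a rough preview of the first few topics (dictionaries, posting lists, and tokenization), here is a minimal in-memory sketch. It is not the code from this repository: the class name `InvertedIndexSketch`, the naive tokenization rule (lowercase, split on non-alphanumeric characters), and the use of a `TreeMap` as the dictionary are all assumptions made only for illustration.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical illustration class, not part of the repository.
public class InvertedIndexSketch {
    // Dictionary: maps each term to its posting list (document IDs in ascending order).
    private final Map<String, List<Integer>> dictionary = new TreeMap<>();

    // Naive tokenization (an assumption): lowercase, then split on non-alphanumerics.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String token : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (!token.isEmpty()) {
                tokens.add(token);
            }
        }
        return tokens;
    }

    // Adds a document. Documents are assumed to arrive in increasing docId order,
    // so each posting list stays sorted and duplicate IDs are easy to skip.
    void add(int docId, String text) {
        for (String term : tokenize(text)) {
            List<Integer> postings = dictionary.computeIfAbsent(term, k -> new ArrayList<>());
            if (postings.isEmpty() || postings.get(postings.size() - 1) != docId) {
                postings.add(docId);
            }
        }
    }

    // Returns the posting list for a single term, or an empty list if the term is unknown.
    List<Integer> search(String term) {
        return Collections.unmodifiableList(
                dictionary.getOrDefault(term.toLowerCase(Locale.ROOT), Collections.emptyList()));
    }

    public static void main(String[] args) {
        InvertedIndexSketch index = new InvertedIndexSketch();
        index.add(1, "The quick brown fox");
        index.add(2, "The lazy dog");
        index.add(3, "Quick thinking, quick results");
        System.out.println(index.search("quick")); // [1, 3]
        System.out.println(index.search("dog"));   // [2]
    }
}
```

The point of the structure is that a lookup goes from a term straight to the documents containing it, instead of scanning every document. Real systems layer ranking, compression, and on-disk storage on top of this, which is what the later topics in the list are about.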

I will be writing these articles one by one over the coming month, before 6th July 2020, and will attach links to the articles in the list above.

Your reviews and feedback are most welcome.

TODO (code improvements): Use Java modules instead of a single src folder. I am new to Java, so I will iterate on the code implementations to make them better.

Stay Tuned. Stay Safe.
