๐ Audiobook Search Engine
This project is about creating a neural search engine that allows to discover new audio-books based on title the user already likes.
๐ Search Engine
Neural search engine is created using the Jina framework.
๐๏ธ Data
The data used for this project are scrapped from Audible.
Data is scrapped using a Breadth First Search strategy: from a starting point (random audiobook), all recommended audibooks are enqueued and explored turn in turn.
๐ Embedding
Embeddings are created using a link prediction model based on already existing Audible recommendations.
A recommendation graph of books is extracted from scrapping. A model is then trained to predict whenever two nodes of that graph are linked. During the training, embeddings are tuned and can finally be used during neural search.
The link prediction model has been created using PyG.