archersama / inttower Goto Github PK

Source code of CIKM 2022 and DLP-KDD workshop 2022 Best Paper: IntTower-“ IntTower: the Next Generation of Two-Tower Model for Pre-ranking System”

License: Apache License 2.0

Python 100.00%

inttower's Introduction

👋 Hi, I’m @archersama , HuaWei Noah Ark Recommendation&Search Lab Researcher
✨ Welcome to join us！Now, we need school graduates and interns. Resume can be sent to me directly.

Requirements:1. Graduated from Top School OR 2. At least one computer top conference paper published
👀 I’m interested in information retrieval and nature language processing. Recently, I focus on LLM for recommendation and RAG.
📫 How to reach me [email protected]

inttower's People

Contributors

Stargazers

Watchers

Forkers

dibyendumandal xiaoqingwang kingleao cshaoping mertgurkan0 karndeepsingh weiucas vincentami seven-xu

inttower's Issues

How to deploy in real recommender systems

I have several questions:

As I known, faiss does not support 'max' operation.
Fot i-th layer user representaion, we will compute each head pairwise to get the similarity score, So we need to retrieve H^2 times？If there are L layers, eventually we need to retrieve L* H^2 times?

Thanks!

Question about serving the model

Hello!

First of all thanks a lot for your great article and for opening the code base.
I have a question regarding the model serving:
I understand that you create Faiss indices based on the multi-head latent representation of the items but how do you query them? Do you use the multi-head latent representation of the last layer of the user tower? And after retrieving the top K items, do you compute the Fe score to rerank the candidates?

CUDA out of memory

RuntimeError: CUDA out of memory. Tried to allocate 32.00 MiB (GPU 0; 23.99 GiB total capacity; 37.27 GiB already allocated; 0 bytes free; 37.33 GiB reserved in total by PyTorch)

I run this code on 24G GPU, this error always happened after epoch 2 whatever batch_size I set, is there anything wrong with my environment?

多目标serving时的融合

您好，请教一下，如果我的粗排有多个目标，比如，ctr,cvr, 想问一下在预测时如何进行融合，目前我想到的，
1、使用multi-head分别对ctr塔和cvr塔的顶部进行提取，将提取得到的ctr embedding以及cvr embedding分别和multi-head提取的user embedding过一遍fe_score函数，然后将 ctr的fe_score和cvr的fe_score 以一定权重进行融合，得到最终的score
2、使用multi-head分别对ctr塔和cvr塔的顶部进行提取，将提取得到的ctr embedding以及cvr embedding以一定权重进行融合，将融合后的embedding 和multi-head提取的user embedding过一遍fe_score函数，得到最终的score
谢谢~

is CIR contrastive loss removed?

有个小问题想请假下哈

这个地方最后是没有用CIR 的 contrastive loss 吗

IntTower/model/base_tower.py

Lines 175 to 178 in a234d89

# total_loss = loss + reg_loss + self.aux_loss + contras

total_loss = loss + reg_loss + self.aux_loss

# print(total_loss, contras, loss)

2.contrastive loss 这个地方为啥用y 去作为索引选择cos_sim score呢, 比如batch_size 256，那岂不是都选到前两个的score了后面254个的都选不到，另外一般这种不是只包含正例这里面应该是正负的label 都有？

IntTower/preprocessing/utils.py

Lines 74 to 76 in a234d89

 # Compute the loss 

 loss = torch.log(exp_scores.sum(dim=1)) - scores[range(scores.shape[0]), y] 

 loss = loss.mean()

archersama / inttower Goto Github PK

inttower's Introduction

inttower's People

Contributors

Stargazers

Watchers

Forkers

inttower's Issues

How to deploy in real recommender systems

Question about serving the model

CUDA out of memory

多目标serving时的融合

is CIR contrastive loss removed?

Could you share the serving code?

矩阵相乘求相似度

Could you provide the training script for the dataset of Amazon and Alibaba?

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs


	# total_loss = loss + reg_loss + self.aux_loss + contras
	total_loss = loss + reg_loss + self.aux_loss
	# print(total_loss, contras, loss)

	# Compute the loss
	loss = torch.log(exp_scores.sum(dim=1)) - scores[range(scores.shape[0]), y]
	loss = loss.mean()