GithubHelp home page GithubHelp logo

Comments (6)

NEUYuYang avatar NEUYuYang commented on May 28, 2024

请你的lightgcn训练样本数和验证样本数正常吗,我跑lightgcn的时候,训练集的数量远小于验证机和测试集的数量

from recbole.

zzzZHANGYIXUAN avatar zzzZHANGYIXUAN commented on May 28, 2024

请你的lightgcn训练样本数和验证样本数正常吗,我跑lightgcn的时候,训练集的数量远小于验证机和测试集的数量

你是否设置了相关参数来划分训练集、验证集和测试集?

from recbole.

NEUYuYang avatar NEUYuYang commented on May 28, 2024

model config

embedding_size: 32

dataset config

field_separator: "\t" #指定数据集field的分隔符
seq_separator: " " #指定数据集中token_seq或者float_seq域里的分隔符
USER_ID_FIELD: user_id #指定用户id域
ITEM_ID_FIELD: item_id #指定物品id域
RATING_FIELD: rating #指定打分rating域
TIME_FIELD: timestamp #指定时间域
NEG_PREFIX: neg_ #指定负采样前缀
#指定从什么文件里读什么列,这里就是从ml-1m.inter里面读取user_id, item_id, rating, timestamp这四列
load_col:
inter: [user_id, item_id, rating, timestamp]

training settings

epochs: 500 #训练的最大轮数
train_batch_size: 4096 #训练的batch_size
learner: adam #使用的pytorch内置优化器
learning_rate: 0.001 #学习率
training_neg_sample_num: 1 #负采样数目
eval_step: 1 #每次训练后做evalaution的次数
stopping_step: 10 #控制训练收敛的步骤数,在该步骤数内若选取的评测标准没有什么变化,就可以提前停止了
eval_args:
split: {'RS':[0.8, 0.1, 0.1]} #对数据随机重排,设置按比例划分数据集
group_by: ~ #是否将一个user的记录划到一个组里
mode: full
order: RO
metrics: ["Recall","NDCG"] #评测标准
topk: [10] #评测标准使用topk,设置成10评测标准就是["Recall@10", "MRR@10", "NDCG@10", "Hit@10", "Precision@10"]
valid_metric: Recall@10 #选取哪个评测标准作为作为提前停止训练的标准
eval_batch_size: 4096 #评测的batch_size

val_interval:
rating: "[3,inf)"
unused_col:
inter: [rating]
user_inter_num_interval: "[10,inf)"
item_inter_num_interval: "[10,inf)"

这是我的参数,请大佬指教一下

from recbole.

NEUYuYang avatar NEUYuYang commented on May 28, 2024

1712577188634
这是我的图片,训练集只有201,验证集有6033

from recbole.

NEUYuYang avatar NEUYuYang commented on May 28, 2024

您是这样的吗

from recbole.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.