castellanzhang / alphafm Goto Github PK

View Code? Open in Web Editor NEW

876.0 32.0 274.0 32 KB

Multi-thread implementation of Factorization Machines with FTRL for binary-class classification problem.

License: MIT License

Makefile 0.78% C++ 99.22%

alphafm's Introduction

alphaFM

前言：

alphaFM是Factorization Machines的一个单机多线程版本实现，用于解决二分类问题，比如CTR预估，优化算法采用了FTRL。 FTRL是一种online learning算法，在Google于2013年给出的论文中用于解决LR的优化，但其实FTRL是一种通用的优化算法，同样可以用于FM。
算法原理见我的博客文章：http://castellanzhang.github.io/2016/10/16/fm_ftrl_softmax/
在最早写此代码时，正值alphaGo完虐人类，便随手给其取名曰alphaFM。
实现alphaFM的初衷是解决大规模数据的FM训练，在我们真实的业务数据中，训练样本数常常是千万到亿级别，特征维度是百万到千万级别甚至上亿，这样规模的数据完全加载到内存训练已经不太现实，甚至下载到本地硬盘都很困难，一般都是经过spark生成样本直接存储在hdfs上。
alphaFM用于解决这样的问题特别适合，一边从hdfs下载，一边计算，一个典型的使用方法是这样：
训练：10个线程计算，factorization的维度是8，最后得到模型文件fm_model.txt
hadoop fs -cat train_data_hdfs_path | ./fm_train -core 10 -dim 1,1,8 -m fm_model.txt
测试：10个线程计算，factorization的维度是8，加载模型文件fm_model.txt，最后输出预测结果文件fm_pre.txt
hadoop fs -cat test_data_hdfs_path | ./fm_predict -core 10 -dim 8 -m fm_model.txt -out fm_pre.txt
当然，如果样本文件不大，也可以先下载到本地，然后再运行alphaFM。
由于采用了FTRL，调好参数后，训练样本只需过一遍即可收敛，无需多次迭代，因此alphaFM读取训练样本采用了管道的方式，这样的好处除了节省内存，还可以通过管道对输入数据做各种中间过程的转换，比如采样、格式变换等，无需重新生成训练样本，方便灵活做实验。
alphaFM还支持加载上次的模型，继续在新数据上训练，理论上可以一直这样增量式进行下去。
FTRL的好处之一是可以得到稀疏解，在LR上非常有效，但对于FM，模型参数v是个向量，对于每一个特征，必须w为0且v的每一维都为0才算稀疏解，但这通常很难满足，所以加了一个force_v_sparse的参数，在训练过程中，每当w变成0时，就强制将对应的v变成0向量。这样就可以得到很好的稀疏效果，且在我的实验中，发现最终对test样本的logloss没有什么影响。
当将dim参数设置为1,1,0时，alphaFM就退化成标准的LR的FTRL训练工具。
当前版本在v1.0.0基础上做了如下优化，具体见：http://castellanzhang.github.io/2018/09/01/memory_optimization_for_alphafm/ ，注意：当前版本只在Linux x86_64系统上通过g++编译测试过，其他环境不保证能够成功编译以及成功执行：
- 内存优化，相比v1.0.0，fm_train在不改变任何运行参数的情况下内存占用能降到1/3左右，具体降幅取决于特征数据以及-dim参数；fm_predict内存占用降幅更为明显。内存优化带来的益处是显著的，比如对于我们一个典型的应用场景：单机128G内存训练LR模型，可以从原来支持3亿左右的特征维度提升到支持10亿左右。
- 增加-mnt参数，可以指定内存中模型参数的类型为double还是float，当指定为float时能够进一步降低内存占用，但可能对模型效果有一定影响，谨慎使用。
- 模型文件可以选择指定为二进制格式，模型加载和输出的速度可以带来10倍量级的提升。
- 增加模型格式转换工具model_bin_tool，可以输出二进制模型相关信息，可以相互转换二进制模型和文本模型，从二进制转为文本格式时可以选择只保留非0特征。

安装方法：

直接在根目录make即可，编译后会在bin目录下生成三个可执行文件。如果编译失败，请升级gcc版本。

输入文件格式：

类似于libsvm格式，但更加灵活：特征编号不局限于整数也可以是字符串；特征值可以是整数或浮点数（特征值最好做归一化处理，否则可能会导致结果为nan），特征值为0的项可以省略不写；正负label可以是1/0或者1/-1。举例如下：
1 sex:1 age:0.3 f1:1 f3:0.9
0 sex:0 age:0.7 f2:0.4 f5:0.8 f8:1
...

txt模型文件格式：

第一行是bias的参数：
bias w w_n w_z
其他行的格式为：
feature_name w v1 v2 ... vf w_n w_z v_n1 v_n2 ... v_nf v_z1 v_z2 ... v_zf

预测结果格式：

label score
其中label为1或-1，score等于预测为正样本的概率值。

参数说明：

fm_train的参数：

-m <model_path>: 设置模型文件的输出路径。
-mf <model_format>: 设置模型文件的输出格式，txt（文本）或bin（二进制）。 default:txt
-dim <k0,k1,k2>: k0为1表示使用偏置w0参数，0表示不使用；k1为1表示使用w参数，为0表示不使用；k2为v的维度，可以是0。 default:1,1,8
-init_stdev <stdev>: v的初始化使用均值为0的高斯分布，stdev为标准差。 default:0.1
-w_alpha <w_alpha>: w0和w的FTRL超参数alpha。 default:0.05
-w_beta <w_beta>: w0和w的FTRL超参数beta。 default:1.0
-w_l1 <w_L1_reg>: w0和w的L1正则。 default:0.1
-w_l2 <w_L2_reg>: w0和w的L2正则。 default:5.0
-v_alpha <v_alpha>: v的FTRL超参数alpha。 default:0.05
-v_beta <v_beta>: v的FTRL超参数beta。 default:1.0
-v_l1 <v_L1_reg>: v的L1正则。 default:0.1
-v_l2 <v_L2_reg>: v的L2正则。 default:5.0
-core <threads_num>: 计算线程数。 default:1
-im <initial_model_path>: 上次模型的路径，用于初始化模型参数。如果是第一次训练则不用设置此参数。
-imf <initial_model_format>: 初始化模型文件的格式，txt（文本）或bin（二进制）。 default:txt
-fvs <force_v_sparse>: 为了获得更好的稀疏解。当fvs值为1, 则训练中每当wi = 0，即令vi = 0；当fvs为0时关闭此功能。 default:0
-mnt <model_number_type>: 模型参数在内存中和二进制文件中的类型，double或float。 default:double

fm_predict的参数：

-m <model_path>: 模型文件路径。
-mf <model_format>: 模型文件格式，txt（文本）或bin（二进制）。 default:txt
-dim <factor_num>: v的维度。 default:8
-core <threads_num>: 计算线程数。 default:1
-out <predict_path>: 输出文件路径。
-mnt <model_number_type>: 模型参数在内存中和二进制文件中的类型，double或float。 default:double

model_bin_tool的参数：

-task <task_type>: 1-输出模型信息；2-格式转换，bin到txt；3-格式转换，bin到txt，只保留非零特征；4-格式转换，txt到bin。
-im <input_model_path>: 输入模型路径。
-om <output_model_path>: 输出模型路径，用于task 2、3和4。task 4必须指定，task 2和3若不指定则默认为标准输出。
-dim <factor_num>: v的维度，用于task 4。
-mnt <model_number_type>: 模型参数在二进制文件中的类型，用于task 4，double或float。 default:double

计算速度：

我的实验结果：

本地1000万的样本，200万的特征维度，2.10GHz的CPU，开10个线程，非缺省参数如下：
-dim 1,1,2 -w_l1 0.05 -v_l1 0.05 -init_stdev 0.001 -w_alpha 0.01 -v_alpha 0.01 -core 10
训练时间只需要10多分钟。若指定模型文件为二进制格式，速度会更快。

alphafm's People

Contributors

Stargazers

Watchers

Forkers

gucasbrg frankfqchen vincentami kelvict sunmingze bidai541 ck8275411 lukebelieves arthur503 mawbhkdg jimberxin gubobo irwenqiang alvis-huang frankiegu wanesta allensmile starsnet83 liu-jin sendlerlee xuanhan863 huangpingchun berli seven-xu yxzf sdd031215 poseidon1214 mllearn lilonghua1987 karcylee 466152112 10183308 casywang zhouyonglong chenmoshushi fulquan peizhe shenleiz btbujiangjun zgcgreat y-lan ytjia iamsecure maogeng chimingyu wangke19910912 jz3707 iwii0425 killallkill dukeyuan zzyangchn xiaokai-wang danxiangjie leavingseason sdzr wjth07 woisnow baokunguo david082 crazylook fengyuan777 curryyang andong0323 defaultrobot dmz0907 zorospace dotrado shenbai zhouliang1979 princeon redfriday enyun wonderlzy a907471325 zhanyueluoxuanwan mejihero zhu2856061 tandychao huanyingbazhe cugwhp abelard223 nkuhyx jerrylu5683 yu3401 skytodinfi ks838 xiaobocser gavinljj wuhh suanec lawlietzh yifengchen9 uranuszs zhangzee danglei912 slover2000 matricer liguoyu1 bjjacking shuoranly

alphafm's Issues

no member named '_Hash_node_base' in namespace 'std::1::detail'

MacOS

g++ -O3 fm_train.cpp src/Frame/pc_frame.cpp src/Utils/utils.cpp -I . -std=c++11 -o bin/fm_train -lpthread
In file included from fm_train.cpp:5:
In file included from ./src/FTRL/ftrl_trainer.h:5:
In file included from ./src/FTRL/ftrl_model.h:14:
./src/FTRL/../Mem/my_allocator.h:42:58: error: expected expression
if(typeid(T) == typeid(__detail::_Hash_node_base*))
^
./src/FTRL/../Mem/my_allocator.h:42:42: error: no member named '_Hash_node_base' in namespace 'std::__1::__detail'
if(typeid(T) == typeid(__detail::_Hash_node_base*))

./src/FTRL/../Mem/my_allocator.h:53:32: error: no template named '_Hash_node' in namespace 'std::__1::__detail'; did you mean '__hash_node'?
if(typeid(T) != typeid(std::__detail::_Hash_node<std::pair<const char* const, MODEL_UNIT >, false>))
^~~~~~~~~~~~~~~~~~~~~~~~~
__hash_node
/Library/Developer/CommandLineTools/usr/include/c++/v1/__hash_table:95:8: note: '__hash_node' declared here
struct __hash_node
^
In file included from fm_train.cpp:5:
In file included from ./src/FTRL/ftrl_trainer.h:5:
In file included from ./src/FTRL/ftrl_model.h:14:
./src/FTRL/../Mem/my_allocator.h:53:104: error: template argument for template type parameter must be a type
if(typeid(T) != typeid(std::__detail::_Hash_node<std::pair<const char* const, MODEL_UNIT >, false>))
^~~~~
/Library/Developer/CommandLineTools/usr/include/c++/v1/__hash_table:94:28: note: template parameter is declared here
template <class _Tp, class _VoidPtr>
^
In file included from fm_train.cpp:5:
In file included from ./src/FTRL/ftrl_trainer.h:5:
In file included from ./src/FTRL/ftrl_model.h:14:
./src/FTRL/../Mem/my_allocator.h:64:58: error: expected expression
if(typeid(T) == typeid(__detail::_Hash_node_base*))
^
./src/FTRL/../Mem/my_allocator.h:64:42: error: no member named '_Hash_node_base' in namespace 'std::__1::__detail'
if(typeid(T) == typeid(__detail::_Hash_node_base*))
~~~~~~~~~~^
./src/FTRL/../Mem/my_allocator.h:82:16: error: use of undeclared identifier '_Hash_impl'
return _Hash_impl::hash(key, strlen(key));
^
In file included from fm_train.cpp:5:
In file included from ./src/FTRL/ftrl_trainer.h:5:
./src/FTRL/ftrl_model.h:44:27: error: no template named '_Hash_node' in namespace 'std::__1::__detail'; did you mean '__hash_node'?
using node_type = std::__detail::_Hash_node<std::pair<const char* const, ftrl_model_unit >, false>;
^~~~~~~~~~~~~~~~~~~~~~~~~
__hash_node
/Library/Developer/CommandLineTools/usr/include/c++/v1/__hash_table:95:8: note: '__hash_node' declared here
struct __hash_node
^
In file included from fm_train.cpp:5:
In file included from ./src/FTRL/ftrl_trainer.h:5:
./src/FTRL/ftrl_model.h:44:104: error: template argument for template type parameter must be a type
using node_type = std::__detail::_Hash_node<std::pair<const char* const, ftrl_model_unit >, false>;
^~~~~
/Library/Developer/CommandLineTools/usr/include/c++/v1/__hash_table:94:28: note: template parameter is declared here
template <class _Tp, class _VoidPtr>
^
In file included from fm_train.cpp:5:
In file included from ./src/FTRL/ftrl_trainer.h:5:
./src/FTRL/ftrl_model.h:45:61: error: use of undeclared identifier 'node_type'
size_t offset_this = get_value_offset_in_Hash_node((node_type*)NULL);
^
./src/FTRL/ftrl_model.h:45:71: error: expected expression
size_t offset_this = get_value_offset_in_Hash_node((node_type*)NULL);
^
./src/FTRL/ftrl_model.h:46:33: error: unknown type name 'node_type'; did you mean 'true_type'?
size_t padding = sizeof(node_type) - offset_this - class_size;
^~~~~~~~~
true_type
/Library/Developer/CommandLineTools/usr/include/c++/v1/type_traits:540:38: note: 'true_type' declared here
typedef _LIBCPP_BOOL_CONSTANT(true) true_type;
^
In file included from fm_train.cpp:5:
In file included from ./src/FTRL/ftrl_trainer.h:5:
In file included from ./src/FTRL/ftrl_model.h:4:
/Library/Developer/CommandLineTools/usr/include/c++/v1/unordered_map:826:5: error: static_assert failed due to requirement 'is_same<value_type, typename
allocator_type::value_type>::value' "Invalid allocator::value_type"
static_assert((is_same<value_type, typename allocator_type::value_type>::value),
^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
./src/FTRL/ftrl_model.h:204:20: note: in instantiation of template class 'std::__1::unordered_map<const char *, ftrl_model_unit, my_hash, my_equal,
my_allocator<std::__1::pair<const char *, ftrl_model_unit >, float, ftrl_model_unit> >' requested here
my_hash_map muMap;
^
./src/FTRL/ftrl_trainer.h:191:18: note: in instantiation of template class 'ftrl_model' requested here
pModel = new ftrl_model(opt.factor_num, opt.init_mean, opt.init_stdev);
^
fm_train.cpp:40:21: note: in instantiation of member function 'ftrl_trainer::ftrl_trainer' requested here
ftrl_trainer trainer(opt);
^
fm_train.cpp:87:16: note: in instantiation of function template specialization 'train' requested here
return train(opt);
^
In file included from fm_train.cpp:5:
In file included from ./src/FTRL/ftrl_trainer.h:5:
In file included from ./src/FTRL/ftrl_model.h:4:
/Library/Developer/CommandLineTools/usr/include/c++/v1/unordered_map:826:5: error: static_assert failed due to requirement 'is_same<value_type, typename
allocator_type::value_type>::value' "Invalid allocator::value_type"
static_assert((is_same<value_type, typename allocator_type::value_type>::value),
^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
./src/FTRL/ftrl_model.h:204:20: note: in instantiation of template class 'std::__1::unordered_map<const char *, ftrl_model_unit, my_hash, my_equal,
my_allocator<std::__1::pair<const char *, ftrl_model_unit >, double, ftrl_model_unit> >' requested here
my_hash_map muMap;
^
./src/FTRL/ftrl_trainer.h:191:18: note: in instantiation of template class 'ftrl_model' requested here
pModel = new ftrl_model(opt.factor_num, opt.init_mean, opt.init_stdev);
^
fm_train.cpp:40:21: note: in instantiation of member function 'ftrl_trainer::ftrl_trainer' requested here
ftrl_trainer trainer(opt);
^
fm_train.cpp:89:12: note: in instantiation of function template specialization 'train' requested here
return train(opt);
^
14 errors generated.
src/Frame/pc_frame.cpp:9:5: warning: 'sem_init' is deprecated [-Wdeprecated-declarations]
sem_init(&semPro, 0, 1);
^
/Library/Developer/CommandLineTools/SDKs/MacOSX10.14.sdk/usr/include/sys/semaphore.h:55:42: note: 'sem_init' has been explicitly marked deprecated here
int sem_init(sem_t *, int, unsigned int) __deprecated;
^
/Library/Developer/CommandLineTools/SDKs/MacOSX10.14.sdk/usr/include/sys/cdefs.h:176:40: note: expanded from macro '__deprecated'
#define __deprecated attribute((deprecated))
^
src/Frame/pc_frame.cpp:10:5: warning: 'sem_init' is deprecated [-Wdeprecated-declarations]
sem_init(&semCon, 0, 0);
^
/Library/Developer/CommandLineTools/SDKs/MacOSX10.14.sdk/usr/include/sys/semaphore.h:55:42: note: 'sem_init' has been explicitly marked deprecated here
int sem_init(sem_t *, int, unsigned int) __deprecated;
^
/Library/Developer/CommandLineTools/SDKs/MacOSX10.14.sdk/usr/include/sys/cdefs.h:176:40: note: expanded from macro '__deprecated'
#define __deprecated attribute((deprecated))
^
2 warnings generated.
make: *** [all] Error 1

删除频次太少的特征

大佬，我想让模型训练的时候忽略出现次数太少的特征，直到它出现次数够多了以后再纳入模型。
是不是就在model_unit里面增加一个记录出现次数的变量cnt，然后把src/FTRL/ftrl_trainer.h里面的
if(fabs(mu.w_zi) <= w_l1)
改成
if(fabs(mu.w_zi) <= w_l1 || mu.cnt <= cntThre)
就好了

谢谢大佬

求联系，讨论FM

CastellanZhang，你好：
我目前也在做fm和与其有关的算法，对于你的项目很感兴趣，比如如何通过openMP进一步提高单机执行效率，使用FTRL的时候到底读取几次数据等问题希望向你多多请教，我的邮箱：[email protected] ，请问你的是？希望和你建立联系

load model中的bug

您好，我在使用过程中遇到一个bug，即：
我在训练时设置了不同的dim参数，如1,1,2，但是在predict时抛出了异常。
具体原因是output model时，没有写factor num这个参数
void ftrl_model::outputModel(ofstream& out) { out << "bias " << *muBias << endl; for(unordered_map<string, ftrl_model_unit*>::iterator iter = muMap.begin(); iter != muMap.end(); ++iter) { out << iter->first << " " << *(iter->second) << endl; } }

而在load model中，factor采用的是默认的factor num，所以抛出异常
while(getline(in, line)) { strVec.clear(); utils::splitString(line, ' ', &strVec); if(strVec.size() != 3 * factor_num + 4) { return false; } string& index = strVec[0]; ftrl_model_unit* pMU = new ftrl_model_unit(factor_num, strVec); muMap[index] = pMU; }

我想请教一下为什么我用alphaFM跑出来的结果auc表现远不如LR

在预测文件中，第一个是label1/-1，后面跟着的是概率，在计算auc的时候，我通过如下的方式来获取模型预测的概率

如果label是1（在fm_pre.txt文件中），那么模型预测为1的概率为prob
如果label是-1，那么模型预测为1 的概率为1-prob

如上准备的概率和标签一起进入BinaryClassificationMetrics，最后表现的auc不超过1%，而LR接近90%，请问可能在哪里出现了问题？

我想咨询一下您bert多GPU项目的一点小问题，您方便给个联系方式吗？

有类似问题的么？同样1亿语料，单线程训练比多线程，auc高0.02

同样的输入文件，一亿语料，用相同100w测试
当core = 1时，auc是0.68
当core = 10时，训练速度提升很大，但是auc是0.65

训练语料按时间顺序和用户，双key排序。

force_v_sparse

force_v_sparse是在w=0时，强制令v=0, 这个操作有什么理论依据吗？
举例来说，当年龄对预测结果没有影响，但是年龄性别组合到一起可能就有影响了。
如果年龄的w=0, 就令年龄的v=0, 那么会导致跟年龄组合的其它组合特征都为0了？

关于模型跨平台问题

你好，楼主，请问有对应java版本的库吗？如果用c++训练完的模型，保存后，用java来加载，有什么好的方式吗？

随着模型在线训练的进行，w_z 逐渐变大，导致稀疏效果不好，AUC降低

您好，我在使用 alphaFM 在线训练一个 LR 模型。在持续训练的过程中，随着在线训练的进行，每个特征出现的次数逐渐增多，w_z 的绝对值也逐渐变大。当 w_l1 > abs(w_z) 时，特征权重为0 ，由于 w_z 变大，导致权重不为 0 的特征暴增，AUC 也逐渐降低。看了 FTRL 的公式，感觉这个问题是避免不了的。请问怎么解决这个问题呢？随着训练的进行，逐渐增大 w_l1 吗？

考虑支持一下group lasso？

预估模型报错 load model error

cat /data/zhangbo_ret/data/test_data/ptr_train_libsvm_data_alpha_test_off.txt | /root/work/FM_Recall/alphaFM/bin/fm_predict -core 15 -dim 8 -m /data/zhangbo_ret/data/test_data/alpha_fm_ptr_param_train.txt -mf txt -out /data/zhangbo_ret/data/test_data/alpha_fm_ptr_predict_result.txt

我训练的模型参数文件是 alpha_fm_ptr_param_train.txt ，是8维的，但是按照官网介绍的这样取预测样本时，一直报错 load model... load model error! 。
请问这是怎么回事啊/？我加了 -mf 参数还是不行。你们有遇到过吗？

得到的模型文件如何部署到线上用有案例吗

在线训练时，载入模型后，卡在 start！ init end！上，并不开始模型训练

以下是我的训练输出，从 2:40：45 输出 init end! 之后，八个小时都未开始训练。机器有 40 个核，用 13 个线程肯定没问题用 top 命令看了内存使用情况，没有其他使用内存的大程序。这种情况会有哪些可能的原因呢？
the train command is : hadoop fs -cat hdfs://xxxxx/20190124/22/* | /home/stat/alphaFM/bin/fm_train -imf txt -im model_test.txt -init_stdev 0 -core 13 -w_l1 9.91 -w_alpha 0.01 -dim 1,1,0 -mf txt -m model_test.txt load model... model loading finished [2019-01-25 02:40:45] start! [2019-01-25 02:40:45] init end!

fm_predict, multi-core, random order

when using fm_predict with -core 30, the output order is not the same as input.
It is ok if you only need auc. But when you need gauc, you can't associate label/score with uid, because the fm_predict output is random order.
You can only use -core 1 to persist order, but it is very slow.

Any improvements?

模型txt转bin的时候提示：read file error

命令：
./model_bin_tool -task 4 -im ./models/click_model_20200510_filter -om ./models/click_model_20200510_filter_bin -dim 8 -mnt float
模型：
F0 -0.264242
weekday_6 0.0173578 -0.0102694 0 -0.022693 0.00254934 0.0396216 0 0 -0.00407052
dist_interval_4000_4500 -0.00691738 0 0.00269233 0.000255495 0 -0.00661624 0.00172624 -0.0268299 -0.00641555
rec 0.00342799 0 0 -0.0272095 0.0139266 -0.0251638 0 -0.00760475 0.000963799
dist_interval_3500_4000 0.0140954 0.00791469 -0.0205997 0.0250574 0 0.041754 -0.0493869 0.00497005 -0.0204439

请问多值特征如何配置

假设有一个特征是用户看过的视频列表，那么是否可以这样配置 watch_video_ids:101,102,103

about exp/log

I use FTRL to optimize logistic regression，and find it slow than newGLMNET；
now，i wander why，because compute loss need exp ？

请问代码中使用的loss function是什么？

我看您在计算gi时是这样算的：
double mult = y * (1 / (1 + exp(-p * y)) - 1);
double w_gi = mult * xi;

不应该是p - y吗，为什么是mult那样算的呢？
还请不吝赐教！

是否考虑加入CV的模式？

非常感谢作者的无私的奉献，比如线上的引擎是java的请问FM的predict有考虑提供不同语言的load接口？

如何看到训练过程中的损失函数？logloss

请教一下，多个consumer之间看起来是不会串行的，那么使用多个consumer的意义是什么？

尝试打印了一下consumer开始&结束消费的时间

训练完模型进行预测得时候，发现，同一个样本，每次预测出来得分，会不一样，这是为啥？

性能跑不上去

发现从 hdfs上用 –text 同步下来样本，用 pipe的方式，作为ftrl的输入，开始的时候，处理的很快，cpu能达到core数(–core 48，48个thread 开始能cpu跑满)。但是跑几分钟后，cpu就只能达到 (1400%左右)，接下来就一直上不去，大伙了解可能是啥原因吗

感觉FTRL里面train加锁有问题

多线程训练的情况下，有的线程更新参数g,有的线程读了参数s, 虽然有锁程序运行没问题，但感觉可能出现一条样本对参数的跟新不一致的问题
280 mu.mtx.lock();
个人感觉应该加在循环体外

尝试过adam优化器吗

hi，大神。没错，又是我。

（1）个人在做实验的时候，修改mult为标准的h(x)-y,并将sample的标签小于0的置为0，发现auc提升了一些（千分之二吧）；

（2）目前个人在尝试adam替换ftrl，不知道大神以前做过这个实验不？

祝好~

          total        used        free      shared  buff/cache   available

Mem: 125G 101G 6.2G 8.0M 18G 18G

PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
22797 algorit+ 20 0 0.098t 0.097t 1544 S 50.2 79.4 1280:04 fm_train

模型预估的分数都是0

请问下，模型预估的分数都是0，这个可能是什么原因呢