Comments (6)
ernie训练使用的是applications/neural_search/ranking/ernie_matching这个
from paddlenlp.
请问你之前用的是什么模型融合的方法.
Paddlenlp目前没有开源模型融合的技术,欢迎开发者贡献!
from paddlenlp.
请问你之前用的是什么模型融合的方法.
Paddlenlp目前没有开源模型融合的技术,欢迎开发者贡献!
使用的是BAAI的LM_Cocktail对bge进行合并,那请问您这边有遇到过我这种程序突然终止的情况吗?没有报错信息,本来是在正常训练的,然后突然进程就结束了,只能看到一个pod failed,然后exit code不是-9就是-15,这种情况一直没能解决就有一段时间没有训练ernie了,数据量小的话是没问题的,我用的微软的mMarco就不行
from paddlenlp.
目前模型融合没有相应的开发计划,可以使用python的pdb打断点进行调试,或者提供一下最小复现代码。
from paddlenlp.
目前模型融合没有相应的开发计划,可以使用python的pdb打断点进行调试,或者提供一下最小复现代码。
代码几乎没有什么改动,只是把学习率策略改成了cos这个
lr_scheduler = CosineDecayWithWarmup( learning_rate=args.learning_rate, warmup=warmup_step, total_steps=num_training_steps, with_hard_restarts=True, num_cycles=100.0, last_epoch=-1, verbose=False )
但是我试过不做任何修改训练数据量大的时候也会出现这个问题,镜像也是从docker hub拉取的paddlepaddle的镜像
from paddlenlp.
可以看一下您的显存或者内存是否足够,如果是这个原因,可以调小batch_size,使用轻量化的小模型来解决
如果还有问题,则需要提供最小复现的代码和数据,方便我们定位原因
from paddlenlp.
Related Issues (20)
- [Question]: 使用develop版本的paddlepaddle cpu 版本执行retrieval_based的export_to_serving报错 HOT 1
- [Question]: NPU下支持的功能有哪些 HOT 1
- [Question]: 进行实体抽取时,能否获得各个schema的bbox HOT 1
- [Question]: paddleNLP-uie 是否可以在移动端上使用 HOT 1
- [Question]: pipelines的faiss如何根据条件删除向量 HOT 1
- Taskflow默认的最大序列长度怎么看?FastDeploy UIE中最长序列长度怎么设置? HOT 12
- [Question]: 2.8版本使用LLM工作流报错缺少fused_ln HOT 2
- [Bug]: pipelines中语义检索系统,启动运行后,上传扫描式PDF文件 无法解析 HOT 1
- [Bug]: TaskFlow zero_shot_text_classification HOT 3
- [Bug]: get_rank_by_dim_and_process_id 函数未实现
- [Question]: paddle.distributed.launch 启动多进程训练结束后Loading best model from checkpoint 报错 HOT 7
- 如何对长文本进行抽取 HOT 3
- uie可以做嵌套抽取吗? HOT 3
- 文档公式有误 HOT 5
- [Question]: 请问文档智能任务有用自己数据集微调的教程吗? HOT 1
- [Bug]: ImportError: DLL load failed while importing libpaddle: 找不到指定的程序。
- [Question]: 分布式
- [Question]: Data annotation and pre processing for Relation Extraction
- [Bug]: paddle的nansum不支持empty的求和
- [Bug]: Taskflow("document_intelligence"): Illegal instruction (core dumped) HOT 7
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from paddlenlp.