Comments (9)
2.跟英文的差不多,效果缩水一点
from pretrained-language-model.
tinybert通用蒸馏模型可以发布出来吗?或者是共享一下?
from pretrained-language-model.
简单任务的话性能和论文里差不多,复杂任务有所缩水
from pretrained-language-model.
同问。
from pretrained-language-model.
- 官方最近有更新中文版模型计划吗?
- 有人自己训练蒸馏中文版模型吗?蒸馏模型做下游任务效果如何?
请问你搞来了中文蒸馏模型了吗?
from pretrained-language-model.
2.跟英文的差不多,效果缩水一点
请问如何自己训练中文版蒸馏模型
from pretrained-language-model.
直接用textbrewer吧
from pretrained-language-model.
@wykdg 老哥,你是怎么做的能给个教程不?
from pretrained-language-model.
您好,如果有相关开源计划我们会第一时间告知,多谢关注~
from pretrained-language-model.
Related Issues (20)
- For BinaryBERT, Is 2 epochs used to fine tune both the ternary BERT and binary BERT with data augmentation?
- For BinaryBERT, what is the value of config.dyna_hidden_size? HOT 2
- CeMAT预训练模型下载? HOT 3
- 想问下tinybert Task-specific Distillation第一步中间层蒸馏的评价指标 HOT 2
- TinyBERT's Google resource 404 HOT 3
- wmt数据下载 HOT 1
- 使用nezha_base_www模型,得到的嵌入向量为nan
- tinybert 在mnli任务不能复现 HOT 4
- TinyBERT实验到底用哪个enwiki-latest-pages-articles数据集?
- When to use DFL loss
- How does BinaryBERT store the 1 bit weight?
- 复现TinyBERT需要pre-train的wiki语料,另是否开源tinybert-cased模型
- TernaryBERT如何实现模型size降低的
- CeMAT 加载预训练 ckpt 报错 HOT 1
- 你好,请问能否分享一下pytorch版nezha的NEZHA-Base-WWM/model.ckpt文件,谢谢
- 请教一下大家,4层tinybert和6层tinybert加载到显存中,占用了多大的显存
- 运行WuKong的官方测试代码
- Where is the half-width fine-tuned full-precision DynaBERT? HOT 1
- Is the TinyBERT_General_4L_312D published on huggingface pure English LM or En-CN-Mixed LM?
- When will the source code of bert2BERT be released?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from pretrained-language-model.