Light

yang-collect / bert-bigru-crf Goto Github PK

View Code? Open in Web Editor NEW

5.0 1.0 0.0 539 KB

Python 100.00%

bert-bigru-crf's Introduction

数据介绍

数据来源于百度飞桨的paddlenlp内置数据集，数据间的分隔符为\002，链接为：https://paddlenlp.bj.bcebos.com/paddlenlp/datasets/waybill.tar.gz 该任务属于ner任务，标签体系为BIO，具体如下表所示：

标签	定义
P-B	姓名起始位置
P-I	姓名中间位置或结束位置
T-B	电话起始位置
T-I	电话中间位置或结束位置
A1-B	省份起始位置
A1-I	省份中间位置或结束位置
A2-B	城市起始位置
A2-I	城市中间位置或结束位置
A3-B	县区起始位置
A3-I	县区中间位置或结束位置
A4-B	详细地址起始位置
A4-I	详细地址中间位置或结束位置
O	无关字符

模型

ner任务属于token classifer任务，目前比较好的解决方案是采取词向量+bilstm-crf ，预训练的词向量可以引入很多先验信息，也在一定程度上缓解oov词的问题，双向lstm层用于学习输入数据双向的编码表示，crf则用于解决：lstm的当前时刻输出没有考虑上一时刻的输出的问题。词向量可以选择word2vec这类的静态词向量，也可以选择基于bert进行微调的动态词向量。

本文是基于hugging face transformers实现的bert-bigru-crf ,参考 https://github.com/HandsomeCao/Bert-BiLSTM-CRF-pytorch 的实现，这里的bert采用的是bert-base-chinese是基于字的模型。

将其中的pytorch_pretrained_bert和crf部分替换为transfomers和pytorch-crf

模型在经过15个epoch之后在测试集上的loss为：7.310302734375

/src/eval.py 验证结果如下：

这里取出训练集的第一条进行验证，

/src/predict.py 输出如下：

/src/server.py 在postman上测试结果如下：

bert-bigru-crf's People

Contributors

Stargazers

Watchers

bert-bigru-crf's Issues

可以提供一下tag_map.json么

新手小白找不到tag_map.json文件

没有tag_map.json

请问tag_map.json里面的内容是什么？

跑通后loss为512

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.

Jobs