GithubHelp home page GithubHelp logo

nlp_related_projects's Introduction

导航

So far:

GNN

|--RGCN (link prediction) (关系图卷积神经网络,可用于已知图谱的链路预测,完成图谱补全)

Bert

|--bert_sim (text similarity) (完成文本相似度任务,可用于基于知识库的问答模型)

Attention

|--Multi-Head Self-Attention (text classification demo)(完成文本分类demo,包含位置编码)

NER

|--BiLSTM + CRF (完成实体识别模型,主要实体包括食物,营养素,人群等)

RE

|--BIGRU + Attention (完成关系抽取模型)

ESIM

| -- BiGRU + local inference model(文本匹配模型)

nlp_related_projects's People

Contributors

lixuanhng avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

nlp_related_projects's Issues

dataset

请问RGCN链路预测这个数据集是什么呀?可以发出来嘛

相似性结果没有改变

为什么我用自己预训练得到的模型来做英文的文本相似性分析时,不管怎么改变两个用来预测的句子,最后的结果都很接近0.5。而且最后两个句子预测得到的label好像不是计算得到的,就是一开始设置的那个label啊?

GNN data

作者能分享下RGCN的输入数据吗,想用你的数据尝试跑下程序

problem

在load_vocab时出现编码错误

UnicodeDecodeError: 'utf-8' codec can't decode byte 0xd5 in position 144: invalid continuation byte

请问这个如何解决
Traceback (most recent call last):
File "D:/down/NLP_related_projects-master/BERT/Bert_sim/run_similarity.py", line 716, in
sim = BertSim()
File "D:/down/NLP_related_projects-master/BERT/Bert_sim/run_similarity.py", line 141, in init
self.tokenizer = tokenization.FullTokenizer(vocab_file=cf.vocab_file, do_lower_case=True)
File "D:\down\NLP_related_projects-master\BERT\Bert_sim\bert_model\tokenization.py", line 165, in init
self.vocab = load_vocab(vocab_file)
File "D:\down\NLP_related_projects-master\BERT\Bert_sim\bert_model\tokenization.py", line 127, in load_vocab
token = convert_to_unicode(reader.readline())
File "D:\Anaconda3\envs\tf2\lib\site-packages\tensorflow\python\lib\io\file_io.py", line 169, in readline
self._preread_check()
File "D:\Anaconda3\envs\tf2\lib\site-packages\tensorflow\python\lib\io\file_io.py", line 79, in _preread_check
self.__name, 1024 * 512)
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xd5 in position 144: invalid continuation byte

Process finished with exit code 1

bert模型文件好像没有

bert模型文件好像没有

from bert_dir.bert.bert import modeling
from bert_dir.bert.bert import tokenization
from bert_dir.bert.bert import optimization

bert_dir不存在

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.