Comments (1)
首先,我不是作者,不过想说一下自己的想法。
-
首先是因为在HMM计算初始状态概率矩阵的时候,需要考虑每个sentence起始位置的标签,如果全部读成一个list,就只有一个sentence,那么对于起始标签来说,只有一种可能。也就是说把整篇文章当做一句话,这样就人为减少了训练集的数量,得到的结果会非常不准确,至少对于初始状态概率矩阵来说。
-
LSTM的输入时间序列的格式要求就是(sentence, word, emb_size),这里的sentence也可以理解成batch_size,但是肯定得是多个sentence;如果只封装在一个list中,每个元素都是一个word,这样就只有一个sentence了。
以上是两点个人想法,如果错误还请指教!
from named_entity_recognition.
Related Issues (20)
- potential fix in build_corpus
- RuntimeError: Attempting to deserialize object on a CUDA device but torch.cuda.is_available() is False. If you are running on a CPU-only machine, please use torch.load with map_location=torch.device('cpu') to map your storages to the CPU.
- 使用BIO标注是否也能运用该代码 HOT 5
- 数据问题
- bilistm_crf模型中的为啥要sort_by_lengths(word_lists, tag_lists),作用是啥。 HOT 1
- recall
- 数据集问题
- 数据问题 HOT 2
- BiLstm-CRF
- 请问batch_size是在哪里设置的
- 想问一下bilstm+crf做推理的时候,为什么还要加入tag呢?
- 环境配置不成功 HOT 3
- 请问数据集有哪些需要注意的点吗
- HMM模型训练时状态转移矩阵,观测概率矩阵的问题
- 模型运行速度/调参
- 如何添加Bert模型
- emission.unsqueeze(2).expand(-1, -1, out_size, -1) + self.transition.unsqueeze(0)
- cuDNN error: CUDNN_STATUS_EXECUTION_FAILED
- UnicodeEncodeError: 'ascii' codec can't encode characters in position 9-10: ordinal not in range(128)为什么在训练CRF模型时会一直报错? HOT 1
- 训练数据特征过于明显
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from named_entity_recognition.