Light

关于list和lists的问题 about named_entity_recognition HOT 1 OPEN

luopeixiang commented on June 27, 2024

关于list和lists的问题

from named_entity_recognition.

Comments (1)

Zessay commented on June 27, 2024

首先，我不是作者，不过想说一下自己的想法。

首先是因为在HMM计算初始状态概率矩阵的时候，需要考虑每个sentence起始位置的标签，如果全部读成一个list，就只有一个sentence，那么对于起始标签来说，只有一种可能。也就是说把整篇文章当做一句话，这样就人为减少了训练集的数量，得到的结果会非常不准确，至少对于初始状态概率矩阵来说。
LSTM的输入时间序列的格式要求就是(sentence, word, emb_size)，这里的sentence也可以理解成batch_size，但是肯定得是多个sentence；如果只封装在一个list中，每个元素都是一个word，这样就只有一个sentence了。

以上是两点个人想法，如果错误还请指教！

from named_entity_recognition.

Related Issues (20)

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.

Jobs