ray075hl / attention-ocr-toy-example Goto Github PK

View Code? Open in Web Editor NEW

31.0 31.0 9.0 6.4 MB

License: Apache License 2.0

Python 100.00%

attention-mechanism ocr tensorflow

attention-ocr-toy-example's People

Stargazers

Watchers

Forkers

fendaq peternara aileader xianfengju gds101054108 zhouleidcc comnamu18 whqchina dhlrd

attention-ocr-toy-example's Issues

deocde训练阶段的输入

代码里训练阶段使用TrainingHelper，这样decode的输入就是target，但带来一个问题，就是测试的时候是没有target的，这样训练与测试的环境差的很大。实际我发现，训练阶段准确率很高，但测试的时候准确率很低。所以能不能在训练的时候用decode的输出当做输入？而不是用target当做输入

数据生成的疑问

在生成attention数据时，train_output和target_output不太理解为什么要这么定义，还有就是为什么一开始定义Y,YY都是-2，然后最终又都要+3，图片的标签不应该时图片上是什么数字，标签就是什么数字吗，Y，YY用意何在？，刚开始接触attention，不是很清楚。

Have you compared CTC and attention results using ctc&attention model? In the related study,CTC was found to play an auxiliary role.
but...In actual results, CTC results were better than attention results. Do you have an opinion on this?

ps.
related paper : JOINT CTC-ATTENTION BASED END-TO-END SPEECH RECOGNITION USING MULTI-TASK LEARNING

请问joint ctc&attention训练是固定长度吗，效果怎样

你指的是输入图片的长度还是 label的长度？

attention代码的疑问？

attention 机制应该需要使用encoder的hide states，但是我看代码里enc_state没有返回，decoder也没有用enc_state初始化？这里是不是有问题？

CTC_model feature_length

您好，请问下CTC_model里面的feature_length为什么是固定的29呢，我替换为实际的label sequence的长度反而会报错。还有就是是否CTC_model的project_out用np.argmax(project_out, axis = 1)就应该是预测的结果？我在训练的时候输出的loss降低的很快，但是按照上述方式解码出来看上去只有少数两三个字符顺序是正确的。
期待您解惑，非常感谢！

关于代码的一些疑问

tf.contrib.seq2seq.LuongAttention(num_units=cfg.RNN_UNITS, memory=memory)这里的memory是什么意思？我看代码是把encode的输出当作memory的，另外我把tf.contrib.seq2seq.BasicDecoder(
cell=attn_cell, helper=helper,
initial_state=attn_cell.zero_state(dtype=tf.float32, batch_size=cfg.BATCH_SIZE).clone(cell_state=enc_state[0]),
output_layer=output_layer)
里面的state从0改为encode的状态，效果好了很多，这是为什么？

i have a problem

hi
..
File "../model.py", line 228, in _att_decode
att_outputs, _. _ = tf.contrib.seq2seq.dynamic_decode( decoder=decoder, output_time_major=False, impute_finished=True, maximum_iterations=self.params.attention_iteration)
..
ValueError: Shape must be rank 1 but is rank 2 for '..../while/BasicDecoderStep/decoder/attention_wrapper/concat' (op: 'ConcatV2') with input shapes: [64], [64,256], [].

do you kow this problem?
thanks.

attention是不是对sequence长度比较敏感？

楼主你好，我测试了一下您提供的代码，在3位数图像精度比较高，但是4位数和5位数就比较差了。
这个问题有什么好的解决办法吗？

ray075hl / attention-ocr-toy-example Goto Github PK

attention-ocr-toy-example's People

Stargazers

Watchers

Forkers

attention-ocr-toy-example's Issues

deocde训练阶段的输入

数据生成的疑问

about ctc & attention model

请问joint ctc&attention训练是固定长度吗，效果怎样

attention代码的疑问？

CTC_model feature_length

关于代码的一些疑问

i have a problem

attention是不是对sequence长度比较敏感？

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs