GithubHelp home page GithubHelp logo

Deep undertanding about tutorials HOT 5 CLOSED

morvanzhou avatar morvanzhou commented on May 19, 2024
Deep undertanding

from tutorials.

Comments (5)

MorvanZhou avatar MorvanZhou commented on May 19, 2024

Hi Gisselle,

Your understanding is right, there are three Weight matrixes all the time.
The weights['in'] you have seen in the example is W_hx, weights['out'] is
W_yh. The third one W_hh is in the cell, we just need to define the hidden
units size for the cell instead of defining W_hh.

Hope this would help you.

Regards,
Morvan

On 28 October 2016 at 11:45, Gissella Bejarano [email protected]
wrote:

Hi,
Can you recommend any source to try to understand the real architecture
done with tensorflow in the classification of RNN (MNIST). I thought that
the simplest RNN would work with 3 Weight matrixes. W_hx, W_hh and W_yh but
I just saw two in the tutorial and I don't understand very well why.

Regards,
Gissella B.


You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
#19, or mute the thread
https://github.com/notifications/unsubscribe-auth/ASgmNEi3BByT-mLPOOsCKV-mBGvCavzXks5q4UXBgaJpZM4Ki-Qd
.

from tutorials.

gissemari avatar gissemari commented on May 19, 2024

Oh, I see..thanks for the explanation!
So BasicLSTMCell() would create a hidden recurrent layer which would create the W_hh matrix? One more question. I have a testSet of different size of a batch size... is there a way to change the batchsize we used here: lstm_cell.zero_state(batch_size, dtype=tf.float32)?

Regards,
Gissella B.

from tutorials.

MorvanZhou avatar MorvanZhou commented on May 19, 2024

BasicLSTMCell() would create a hidden recurrent layer which would create
the W_hh matrix?

Answer: Yes, it will.

I have a testSet of different size of a batch size... is there a way to
change the batchsize we used here: lstm_cell.zero_state(batch_size,
dtype=tf.float32)?

Answer: I also faced this issue (my issue is the different time_steps in
training and testing), and my solution is to create a test-RNN with new
parameters and restore the Weights and biases from a saved Train-RNN. I
don't think this is a good solution but I don't see other better answers.

On 29 October 2016 at 01:07, Gissella Bejarano [email protected]
wrote:

Oh, I see..thanks for the explanation!
So BasicLSTMCell() would create a hidden recurrent layer which would
create the W_hh matrix? One more question. I have a testSet of different
size of a batch size... is there a way to change the batchsize we used
here: lstm_cell.zero_state(batch_size, dtype=tf.float32)?

Regards,
Gissella B.


You are receiving this because you commented.
Reply to this email directly, view it on GitHub
#19 (comment),
or mute the thread
https://github.com/notifications/unsubscribe-auth/ASgmNKc5kHMWrllKjhC3QqjPZBxnFi2hks5q4gG4gaJpZM4Ki-Qd
.

from tutorials.

gissemari avatar gissemari commented on May 19, 2024

Thank you. I managed to fixed the time_steps of all my instances and I am doing cross validation so I can have equal-size training batchs and test batch. I'm still looking for an example to manage different time_steps, there should be a more formal solution than the one you mention right? What if instances in the same training set has different time_steps?

from tutorials.

MorvanZhou avatar MorvanZhou commented on May 19, 2024

I figured out a new method to deal with the different time steps by using variable_scope: https://github.com/MorvanZhou/tutorials/blob/master/tensorflowTUT/tf22_scope/tf22_RNN_scope.py

from tutorials.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.