This project forked from mohammadfneish7/keras_lstm_diagram

Understanding Keras Recurrent Nets' structure and data flow (mainly LSTM) in a single diagram.

License: GNU General Public License v3.0

Keras_LSTM_Diagram

While working to understand how Recurrent Neural Networks really work, and what makes these architectures so powerful and efficient on sequence data, I struggled to get the whole concept clear in my mind. I was already very familiar with Convolutional Neural Networks and fully connected feed-forward architectures, having worked with them for a long time, and I would describe them as a piece of cake. With recurrent nets (e.g. LSTMs, GRUs, …), however, things get much more complex.

So when it came to building my own LSTM model for the first time, I decided to use the Keras framework (because I was very familiar with it). I then realized that the hardest part was understanding how to prepare and transform my input data to match what the Keras model expects as input, and how to transform my training and validation labels to match the network's output for validation and testing. In addition, LSTMs come with many layer-specific parameters that you must understand and set carefully before you can get started.
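To make the data-preparation step concrete, here is a minimal sketch in plain Python. It assumes the common case of next-step prediction on a univariate series; the helper name `make_windows` and the sample values are hypothetical and do not come from the diagram. It relies on the fact that Keras recurrent layers take 3D input of shape (samples, timesteps, features) by default.

```python
def make_windows(series, timesteps):
    """Split a univariate series into (X, y) pairs for next-step prediction.

    X is a nested list with shape (samples, timesteps, features), features == 1.
    y is a flat list with shape (samples,).
    """
    if timesteps < 1 or timesteps >= len(series):
        raise ValueError("timesteps must be in [1, len(series) - 1]")
    X, y = [], []
    for start in range(len(series) - timesteps):
        window = series[start:start + timesteps]
        X.append([[value] for value in window])  # one feature per time step
        y.append(series[start + timesteps])      # label: the value right after the window
    return X, y


series = [10, 20, 30, 40, 50, 60]  # hypothetical toy data
X, y = make_windows(series, timesteps=3)

shape = (len(X), len(X[0]), len(X[0][0]))
print(shape)  # (3, 3, 1)
print(y)      # [40, 50, 60]
```

Converting `X` and `y` to NumPy arrays should give arrays of shape (3, 3, 1) and (3,), which is the layout an LSTM layer configured for 3 time steps and 1 feature would accept. Other setups, such as multivariate input or sequence-to-sequence labels, need a different windowing scheme.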

Finally, once I had a clear view of the problem, and because of everything I went through to get there, I decided to summarize the main flow of Keras LSTM networks in this diagram for the public benefit.

Please note that the following diagram describes Keras LSTM layers. The same diagram applies as-is to GRUs, with a minor difference in the internal structure of the processing units (nodes).

For a Keras LSTM code example, I think Jason Brownlee's blog is well worth checking.

Diagram

Please read the diagram from bottom to top; the flow is easier to follow that way.

License

This project is licensed under the GNU General Public License v3.0.
