neuralegion / shainet

SHAInet - a pure Crystal machine learning library

License: MIT License

Crystal 100.00%
neural-network machine-learning deep-learning deep-neural-networks crystal convolutional-neural-networks

shainet's Introduction

shainet


SHAInet stands for Super Human Artificial Intelligence network: a neural network written in pure Crystal.

This is a free-time project, happily hosted by NeuraLegion, that was created as part of some internal research. We started it with research rather than production in mind, and just kept going, thanks in part to members of the community.

We wanted to bring some inspiration from the biological world into this project. In addition, we wanted to try an approach to NNs using object-oriented modeling instead of matrices. The main reason was to experiment with new types of neurons, aiming for more robust learning (if possible), or at least more fine-tuned control over the manipulation of each neuron (which is difficult using a matrix-driven approach).

The Roadmap shows what we plan to add to the network as the project progresses.

Installation

Add this to your application's shard.yml:

dependencies:
  shainet:
    github: NeuraLegion/shainet
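
Then run shards install to fetch the dependency.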

Usage

More usage examples can be found in the specs

Standard training on XOR example

require "shainet"

training_data = [
  [[0, 0], [0]],
  [[1, 0], [1]],
  [[0, 1], [1]],
  [[1, 1], [0]],
]
# Initialize a new network
xor = SHAInet::Network.new
# Add a new layer of the input type with 2 neurons and classic neuron type (memory)
xor.add_layer(:input, 2, :memory, SHAInet.sigmoid)
# Add a new layer of the hidden type with 2 neurons and classic neuron type (memory)
xor.add_layer(:hidden, 2, :memory, SHAInet.sigmoid)
# Add a new layer of the output type with 1 neuron and classic neuron type (memory)
xor.add_layer(:output, 1, :memory, SHAInet.sigmoid)
# Fully connect the network layers
xor.fully_connect

# Adjust network parameters
xor.learning_rate = 0.7
xor.momentum = 0.3

# Train the network (data, training_type, cost_function, epochs, error_threshold (sum of errors), log_each)
xor.train(
      data: training_data,
      training_type: :sgdm,
      cost_function: :mse,
      epochs: 5000,
      error_threshold: 0.000001,
      log_each: 1000)

# Run the trained network
xor.run([0, 0])
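
After training, the outputs should approach the XOR targets; for example (exact values vary from run to run):

puts xor.run([0, 0]) # => values close to 0, e.g. [0.03]
puts xor.run([0, 1]) # => values close to 1, e.g. [0.97]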

Batch training on the iris dataset using adam

# Create a new Data object based on a CSV
data = SHAInet::Data.new_with_csv_input_target("iris.csv", 0..3, 4)

# Split the data in a training set and a test set
training_set, test_set = data.split(0.67)

# Initialize a new network
iris = SHAInet::Network.new

# Add layers
iris.add_layer(:input, 4, :memory, SHAInet.sigmoid)
iris.add_layer(:hidden, 5, :memory, SHAInet.sigmoid)
iris.add_layer(:output, 3, :memory, SHAInet.sigmoid)
iris.fully_connect

# Adjust network parameters
iris.learning_rate = 0.7
iris.momentum = 0.3

# Train the network
iris.train_batch(
      data: training_set.data.shuffle,
      training_type: :adam,
      cost_function: :mse,
      epochs: 20000,
      error_threshold: 0.000001,
      log_each: 1000)

# Test the network's performance
iris.test(test_set)
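
Trained networks can also be saved to disk and loaded back (see Save/load under Basic Features); a short sketch, assuming the save_to_file / load_from_file method names:

# Persist the trained weights, then restore them into a fresh network
iris.save_to_file("iris_net.nn")

loaded = SHAInet::Network.new
loaded.load_from_file("iris_net.nn")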

Using a convolutional network

require "shainet"
require "csv"

# Load training data (partial dataset)
raw_data = Array(Array(Float64)).new
csv = CSV.new(File.read(__DIR__ + "/test_data/mnist_train.csv"))
10000.times do
  # CSV.each_row(File.read(__DIR__ + "/test_data/mnist_train.csv")) do |row|
  csv.next
  new_row = Array(Float64).new
  csv.row.to_a.each { |value| new_row << value.to_f64 }
  raw_data << new_row
end
raw_input_data = Array(Array(Float64)).new
raw_output_data = Array(Array(Float64)).new

raw_data.each do |row|
  raw_input_data << row[1..-1]
  raw_output_data << [row[0]]
end

training_data = SHAInet::CNNData.new(raw_input_data, raw_output_data)
training_data.for_mnist_conv
training_data.data_pairs.shuffle!

# Load test data (partial dataset)
raw_data = Array(Array(Float64)).new
csv = CSV.new(File.read(__DIR__ + "/test_data/mnist_test.csv"))
1000.times do
  csv.next
  new_row = Array(Float64).new
  csv.row.to_a.each { |value| new_row << value.to_f64 }
  raw_data << new_row
end

raw_input_data = Array(Array(Float64)).new
raw_output_data = Array(Array(Float64)).new

raw_data.each do |row|
  raw_input_data << row[1..-1]
  raw_output_data << [row[0]]
end

# Load data to a CNNData helper class
test_data = SHAInet::CNNData.new(raw_input_data, raw_output_data)
test_data.for_mnist_conv # Normalize and make labels into 'one-hot' vectors

# Initialize a convolutional network
cnn = SHAInet::CNN.new

# Add layers to the model
cnn.add_input([height = 28, width = 28, channels = 1]) # Output shape = 28x28x1
cnn.add_conv(
  filters_num: 20,
  window_size: 5,
  stride: 1,
  padding: 2,
  activation_function: SHAInet.none)  # Output shape = 28x28x20
cnn.add_relu(0.01)                    # Output shape = 28x28x20
cnn.add_maxpool(pool: 2, stride: 2)   # Output shape = 14x14x20
cnn.add_conv(
  filters_num: 20,
  window_size: 5,
  stride: 1,
  padding: 2,
  activation_function: SHAInet.none)  # Output shape = 14x14x20
cnn.add_maxpool(pool: 2, stride: 2)   # Output shape = 7x7x20
cnn.add_fconnect(l_size: 10, activation_function: SHAInet.sigmoid)
cnn.add_fconnect(l_size: 10, activation_function: SHAInet.sigmoid)
cnn.add_softmax
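
# Note on the "Output shape" comments above (illustrative arithmetic, not SHAInet output):
#   conv output per spatial dimension = (input + 2*padding - window_size) / stride + 1
#   first conv:  (28 + 2*2 - 5) / 1 + 1 = 28, so the shape stays 28x28 (one map per filter)
#   each max-pool (pool: 2, stride: 2) halves the spatial size: 28 -> 14, then 14 -> 7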

cnn.learning_rate = 0.005
cnn.momentum = 0.02

# Train the model on the training-set
cnn.train_batch(
  data: training_data.data_pairs,
  training_type: :sgdm,
  cost_function: :mse,
  epochs: 3,
  error_threshold: 0.0001,
  log_each: 1,
  mini_batch_size: 50)

# Evaluate accuracy on the test-set
correct_answers = 0
test_data.data_pairs.each do |data_point|
  result = cnn.run(data_point[:input], stealth: true)
  if (result.index(result.max) == data_point[:output].index(data_point[:output].max))
    correct_answers += 1
  end
end

# Print the layer activations
cnn.inspect("activations")
puts "We managed #{correct_answers} out of #{test_data.data_pairs.size} total"
puts "Cnn output: #{cnn.output}"

Evolutionary optimizer example:

require "shainet"
require "csv"

label = {
  "setosa"     => [0.to_f64, 0.to_f64, 1.to_f64],
  "versicolor" => [0.to_f64, 1.to_f64, 0.to_f64],
  "virginica"  => [1.to_f64, 0.to_f64, 0.to_f64],
}

iris = SHAInet::Network.new
iris.add_layer(:input, 4, :memory, SHAInet.sigmoid)
iris.add_layer(:hidden, 4, :memory, SHAInet.sigmoid)
iris.add_layer(:output, 3, :memory, SHAInet.sigmoid)
iris.fully_connect

# Get data from a local file
outputs = Array(Array(Float64)).new
inputs = Array(Array(Float64)).new
CSV.each_row(File.read(__DIR__ + "/test_data/iris.csv")) do |row|
  row_arr = Array(Float64).new
  row[0..-2].each do |num|
    row_arr << num.to_f64
  end
  inputs << row_arr
  outputs << label[row[-1]]
end
data = SHAInet::TrainingData.new(inputs, outputs)
data.normalize_min_max

training_data, test_data = data.split(0.9)

iris.train_es(
  data: training_data,
  pool_size: 50,
  learning_rate: 0.5,
  sigma: 0.1,
  cost_function: :c_ent,
  epochs: 500,
  mini_batch_size: 15,
  error_threshold: 0.00000001,
  log_each: 100,
  show_slice: true)

# Test the trained model
correct = 0
test_data.data.each do |data_point|
  result = iris.run(data_point[0], stealth: true)
  expected = data_point[1]
  # puts "result: \t#{result.map { |x| x.round(5) }}"
  # puts "expected: \t#{expected}"
  error_sum = 0.0
  result.size.times do |i|
    error_sum += (result[i] - expected[i]).abs
  end
  correct += 1 if error_sum < 0.3
end
puts "Correct answers: (#{correct} / #{test_data.size})"
(correct > 10).should eq(true)

Development

Basic Features

  • Train network
  • Save/load
  • Activation functions:
    • Sigmoid
    • Bipolar sigmoid
    • log-sigmoid
    • Tanh
    • ReLU
    • Leaky ReLU
    • Softmax
  • Cost functions:
    • Quadratic
    • Cross-entropy
  • Gradient optimizers
    • SGD + momentum
    • iRprop+
    • ADAM
    • ES (evolutionary strategy, non-backprop)
  • Autosave during training

Advanced Features

  • Support activation functions as Proc
  • Support cost functions as Proc
  • Convolutional Neural Net.
  • Add support for multiple neuron types.
  • Bind and use CUDA (GPU acceleration)
  • Graphic printout of network architecture.

Possible Future Features

  • RNN (recurrent neural network)
  • LSTM (long short-term memory)
  • GNG (growing neural gas)
  • SOM (self-organizing maps)
  • DBN (deep belief network)

Contributing

  1. Fork it ( https://github.com/NeuraLegion/shainet/fork )
  2. Create your feature branch (git checkout -b my-new-feature)
  3. Commit your changes (git commit -am 'Add some feature')
  4. Push to the branch (git push origin my-new-feature)
  5. Create a new Pull Request

shainet's People

Contributors: artlinkov, bararchy, drujensen, hugoabonizio, psikoz, rmarronnier

shainet's Issues

save then load gives less accurate results

I have created a model for the Kaggle Titanic competition and ran into an issue where saving the model to a file and loading it back generates different results.

The first thing I wanted to try was to create an NN that predicts the ages of the passengers whose age is unknown. Before I saved the model, I was getting decent results and the ages were fairly close. The accuracy figure below counts only exact age matches, but I also manually verified that the predicted ages were close.

Training size: 714
----------------------
T: 55 | F: 659
----------------------
Accuracy: 0.07703081232492998

After saving and loading the model and then running on the same test data, I am getting less accurate results and the ages are not close. They all come back as just a subset of possible ages.

Training size: 714
----------------------
T: 21 | F: 693
----------------------
Accuracy: 0.029411764705882353

Feedback on design

Hello team,

Like you, I'm writing my own neural network library for a super-niche language, Nim. I'm always interested in what devs of other niche languages are doing in the NN domain (D, Elm, Elixir, Rust, OCaml, Clojure ...), and I see that you are taking a completely different approach from most of them and from the state of the art (TensorFlow, PyTorch, Caffe, MXNet), which I find interesting but also questionable.

Let's start with the interesting part.

I find your neurons/synapses approach interesting; is there any research or documentation that highlights the benefits of this modeling? I know there was research on modeling AI like a brain (visual cortex, sound, memory and a thinking part kept separate ...) that was largely put aside with the advance of gradient-descent techniques, but I have a hard time getting my hands on it.

In particular, I'm looking at NEURON_TYPES = ["memory", "eraser", "amplifier", "fader", "sensor"] and the Synapse type: what future applications would be eased by that approach?

Questionable design

While I find the neurons/synapses approach interesting, I have several reservations about your current implementation; some are fixable, but others might require a complete rewrite:

  1. No matrix, ndarray, or tensor type. You probably want to define custom matrix types with common functions instead of having each layer define its own loops.

  2. Storing one neuron at a time is inefficient. I don't think the current architecture can scale to networks with millions of parameters. I don't know how Crystal classes work, but if they allocate on the heap, each access to a neuron will require a pointer dereference. The main bottleneck in NNs is memory access, and this makes it much worse.
    Furthermore, it cannot be mapped to BLAS, CUDA or OpenCL for efficient computation.

  3. Synapse will be a performance bottleneck; the connections between two fully connected/dense/linear layers can simply be represented as a matrix multiplication, both for the forward pass and for the gradient (see the sketch below).
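
A minimal sketch (plain Crystal, not SHAInet code) of how a dense layer's forward pass collapses into a matrix-vector product y = W * x + b; the backward pass reuses the same weights transposed.

def dense_forward(w : Array(Array(Float64)), b : Array(Float64), x : Array(Float64)) : Array(Float64)
  # y[i] = b[i] + sum over j of w[i][j] * x[j]
  w.map_with_index do |row, i|
    sum = b[i]
    row.each_with_index { |w_ij, j| sum += w_ij * x[j] }
    sum
  end
end

# Gradients, for reference:
#   grad_w[i][j] = grad_y[i] * x[j]
#   grad_x[j]    = sum over i of w[i][j] * grad_y[i]   (i.e. W^T * grad_y)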

On the roadmap?

Depending on your use case (research vs production), you might want to add a way to slice the data.

Using a Crystal workbook fails to create a SHAInet model

I was trying to play with SHAInet using crystal play, creating a workbook similar to Jupyter:

require "shainet"

diabetes = SHAInet::Network.new

I'm getting a JSON parsing error and wanted to know if this is something you have encountered before or if I'm doing something wrong.

Exception

Error in line 2: instantiating 'Crystal::Playground::Agent#i(Int32)'

in /usr/local/Cellar/crystal-lang/0.23.1_3/src/compiler/crystal/tools/playground/agent.cr:25: instantiating 'send(String)'

    send "value" do |json|
    ^~~~

in /usr/local/Cellar/crystal-lang/0.23.1_3/src/compiler/crystal/tools/playground/agent.cr:53: instantiating 'JSON:Module#build()'

    message = JSON.build do |json|
                   ^~~~~

in /usr/local/Cellar/crystal-lang/0.23.1_3/src/json/builder.cr:356: instantiating 'String:Class#build()'

    String.build do |str|
           ^~~~~

in /usr/local/Cellar/crystal-lang/0.23.1_3/src/string.cr:269: instantiating 'String::Builder:Class#build(Int32)'

    String::Builder.build(capacity) do |builder|
                    ^~~~~

in /usr/local/Cellar/crystal-lang/0.23.1_3/src/string.cr:269: instantiating 'String::Builder:Class#build(Int32)'

    String::Builder.build(capacity) do |builder|
                    ^~~~~

in /usr/local/Cellar/crystal-lang/0.23.1_3/src/json/builder.cr:356: instantiating 'String:Class#build()'

    String.build do |str|
           ^~~~~

in /usr/local/Cellar/crystal-lang/0.23.1_3/src/json/builder.cr:357: instantiating 'build(String::Builder, Nil)'

      build(str, indent) do |json|
      ^~~~~

in /usr/local/Cellar/crystal-lang/0.23.1_3/src/json/builder.cr:367: instantiating 'JSON::Builder#document()'

    builder.document do
            ^~~~~~~~

in /usr/local/Cellar/crystal-lang/0.23.1_3/src/json/builder.cr:367: instantiating 'JSON::Builder#document()'

    builder.document do
            ^~~~~~~~

in /usr/local/Cellar/crystal-lang/0.23.1_3/src/json/builder.cr:357: instantiating 'build(String::Builder, Nil)'

      build(str, indent) do |json|
      ^~~~~

in /usr/local/Cellar/crystal-lang/0.23.1_3/src/compiler/crystal/tools/playground/agent.cr:53: instantiating 'JSON:Module#build()'

    message = JSON.build do |json|
                   ^~~~~

in /usr/local/Cellar/crystal-lang/0.23.1_3/src/compiler/crystal/tools/playground/agent.cr:54: instantiating 'JSON::Builder#object()'

      json.object do
           ^~~~~~

in /usr/local/Cellar/crystal-lang/0.23.1_3/src/compiler/crystal/tools/playground/agent.cr:54: instantiating 'JSON::Builder#object()'

      json.object do
           ^~~~~~

in /usr/local/Cellar/crystal-lang/0.23.1_3/src/compiler/crystal/tools/playground/agent.cr:25: instantiating 'send(String)'

    send "value" do |json|
    ^~~~

in /usr/local/Cellar/crystal-lang/0.23.1_3/src/compiler/crystal/tools/playground/agent.cr:27: instantiating 'JSON::Builder#field(String, (File::PReader | HTTP::Server::Response::Output | IO::FileDescriptor | Int32 | OpenSSL::SSL::Socket | String | Nil))'

      json.field "value", safe_to_value(value)
           ^~~~~

in /usr/local/Cellar/crystal-lang/0.23.1_3/src/json/builder.cr:226: no overload matches 'JSON::Builder#scalar' with type (File::PReader | HTTP::Server::Response::Output | IO::FileDescriptor | Int32 | OpenSSL::SSL::Socket | String | Nil)
Overloads are:
 - JSON::Builder#scalar(value : Nil)
 - JSON::Builder#scalar(value : Bool)
 - JSON::Builder#scalar(value : Int | Float)
 - JSON::Builder#scalar(value : String)
 - JSON::Builder#scalar(string = false, &block)
Couldn't find overloads for these types:
 - JSON::Builder#scalar(File::PReader)
 - JSON::Builder#scalar(HTTP::Server::Response::Output)
 - JSON::Builder#scalar(IO::FileDescriptor)
 - JSON::Builder#scalar(OpenSSL::SSL::Socket::Client)
 - JSON::Builder#scalar(OpenSSL::SSL::Socket::Server)

    scalar(value)
    ^~~~~~

If I run the same code using crystal run, I do not encounter the error.

Add a way to convert the neuron representation to a matrix

So @ArtLinkov had a great idea for a way to combine the uniqueness of each neuron while still being able to represent them as a matrix.

The idea is to create a matrix of pointers, where each pointer references the bias or one of the weights of a neuron.

This will give us the ability to add GPU and multi-threading support while also keeping what we love about SHAInet.
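
A minimal sketch of the idea in plain Crystal (illustrative only, not existing SHAInet code): keep the neuron objects, but collect raw pointers to their parameters so they can be updated in bulk.

class ToyNeuron
  property bias : Float64 = 0.0

  # Expose a raw pointer to the bias so external code can update it in place
  def bias_ptr : Pointer(Float64)
    pointerof(@bias)
  end
end

neurons = Array(ToyNeuron).new(4) { ToyNeuron.new }

# A "matrix" (here just a flat array) of pointers into the neurons' own storage
bias_ptrs = neurons.map(&.bias_ptr)

# Writing through the pointers updates the neurons themselves
bias_ptrs.each { |ptr| ptr.value += 0.1 }
puts neurons.first.bias # => 0.1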

Are there any benchmarks?

Hi all,

I've looked around a bit and can't find any benchmarks comparing shainet with deep learning frameworks such as Torch, TensorFlow, etc.

I know benchmarks are generally not that important, but out of curiosity, are any publicly available?

Add save/load for the CNN

Admittedly, it took a while until we got some time on our hands to work on this... :)
We should have it done by the end of the week. For now, save/load to JSON will be supported.

Autosave option while training

It would be good to have an autosave option while training, for example every N epochs.
This could prove very useful in case something crashes, or if at some point a nasty NaN finds its way into the model (it may also be good to perform a NaN check before saving the model). A rough sketch of the idea is below.
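
A rough illustration (hypothetical helper, not a SHAInet API; save_to_file is assumed to exist, per the Save/load feature):

# Save a snapshot every `every` epochs, skipping snapshots whose error is NaN
def autosave(net, epoch : Int32, every : Int32, error : Float64, path : String)
  return unless (epoch % every).zero?
  if error.nan?
    puts "Skipping autosave at epoch #{epoch}: error is NaN"
  else
    net.save_to_file("#{path}.epoch#{epoch}.nn")
  end
end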

Replace ES example?

Reading through the ES strategy here

Quote: Note on supervised learning. It is also important to note that supervised learning problems (e.g. image classification, speech recognition, or most other tasks in the industry), where one can compute the exact gradient of the loss function with backpropagation, are not directly impacted by these findings. For example, in our preliminary experiments we found that using ES to estimate the gradient on the MNIST digit recognition task can be as much as 1,000 times slower than using backpropagation. It is only in RL settings, where one has to estimate the gradient of the expected reward by sampling, where ES becomes competitive.

It seems that ES can be 1,000 times slower for supervised learning. The example in the readme does exactly this, and I'm wondering if we should provide a better example, or at least document that this is not a recommended use of the strategy.

Logo design for shainet

Hello, I want to make a logo contribution for you. I designed a logo. Please tell me what you think.

  • brain: learn
  • crystal

shainet

Help building Cats vs Dogs CNN network

I have been playing with the CNN network, trying to get any results, but I keep hitting roadblocks and I'm hoping to get some help.

I first converted the images to 48x48x1 greyscale to try and keep things as simple as possible.

I built a network as follows:

Dimensions
layers x width x height x channels
==================================
SHAInet::InputLayer
1 x 48 x 48 x 1
----------------------------------
SHAInet::ConvLayer
20 x 48 x 48 x 1
----------------------------------
SHAInet::MaxPoolLayer
20 x 24 x 24 x 1
----------------------------------
SHAInet::ConvLayer
20 x 24 x 24 x 20
----------------------------------
SHAInet::MaxPoolLayer
20 x 12 x 12 x 1
----------------------------------
SHAInet::FullyConnectedLayer
1 x 12 x 1 x 1
----------------------------------

I have tried many different configurations for training the model but all seem to error out:

model.train_batch(
  data: training.data_pairs,
  training_type: :sgdm,
  cost_function: :mse,
  epochs: 25,
  error_threshold: 0.0001,
  log_each: 100,
  mini_batch_size: 32)

No matter what I try, I get:

I, [2019-11-02 09:35:25 -07:00 #56315]  INFO -- : Epoch: 0, Total error: 1.0, MSE: 1.0

Here is the project: https://github.com/drujensen/cats_dogs

Any ideas?

Object Oriented - move to leverage inheritance

Looking through the code, I have a couple suggested changes to make it more OO:

NEURON_TYPES should use inheritance: the base class would be Neuron, and MemoryNeuron would inherit from it.

The learn/training functions (sgd, rprop, adam) should be extracted out of the Network class. Each should become its own class (Adam, SGD, ...) with a base class Learn or something similar.

Cost functions should also be extracted out of the Network class. The base class would be Cost, with each subclass implementing evaluate(input, expected).

Activation functions should be their own classes. ...

WDYT?
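
For illustration, a minimal sketch of what that hierarchy could look like (hypothetical names, not existing SHAInet classes):

abstract class Cost
  abstract def evaluate(input : Array(Float64), expected : Array(Float64)) : Float64
end

class QuadraticCost < Cost
  def evaluate(input : Array(Float64), expected : Array(Float64)) : Float64
    # 0.5 * sum of squared differences between actual and expected outputs
    input.zip(expected).sum { |a, e| 0.5 * (a - e)**2 }
  end
end

abstract class Neuron
end

class MemoryNeuron < Neuron
end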

Create layer class

Add a class for layer creation to allow maximum control of topology and various types of neurons in each layer

overfitting - BatchNormalization and Dropout

I have been playing around with the Titanic data and even submitted to Kaggle, but my results are not so good; I'm ranked quite low (9511).

I believe my models are overfitting. I'm using sgdm and I have lowered the learning_rate and momentum to try to avoid this, but the error and MSE start to rise and don't come back down.

Is the eraser layer similar to a Dropout layer? If so, how do I use it?

Is there a way to create a BatchNormalization layer?
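
For reference, dropout simply zeroes each activation with some probability during training and rescales the survivors; a plain-Crystal sketch of the idea (not an existing SHAInet layer):

# Inverted dropout: drop each activation with probability p, scale the rest by 1/(1 - p)
def dropout(activations : Array(Float64), p : Float64) : Array(Float64)
  keep = 1.0 - p
  activations.map { |a| rand < keep ? a / keep : 0.0 }
end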

OCR: recognising an image within an image

How do I create a captcha breaker?
For example:
aaaaa

  1. How do I create more than one output? I need 8 outputs.
  2. Maybe by finding data within data, similar to YOLO?

NaN in Total error and MSE

While using cross-entropy in batch training, the errors sometimes become NaN.
This doesn't stop the network from training, but it does prevent knowing how well the training is going.
A possible cause is a 0/0 division in the cross_entropy_cost_derivative function.
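
A common fix for that 0/0 (a sketch of the general technique, not necessarily how SHAInet implements it) is to clamp the prediction away from 0 and 1 before dividing:

EPS = 1e-12

# Binary cross-entropy derivative dC/da = (a - y) / (a * (1 - a)), with a clamped so
# that neither factor of the denominator can be zero.
def cross_entropy_derivative(expected : Float64, actual : Float64) : Float64
  a = actual.clamp(EPS, 1.0 - EPS)
  (a - expected) / (a * (1.0 - a))
end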
