Comments (6)
There's no reason behind picking stddev=0.3
at the moment, and using Xavier initialization would be nice. Note that the learnable filter isn't just an (in_channels, out_channels)
matrix, as needed for the Xavier initializer, but a (filter_width, in_channels, out_channels)
tensor.
Maybe stacking together filter_width
Xavier initializers would work well.
from tensorflow-wavenet.
Right,well there's also tf.contrib.layers.xavier_initializer_conv2d. But no tf.contrib.layers.xavier_initializer_conv1d. But the only difference is amonst these is how they compute fan-in and fan-out; after that they compute the normal-std-dev or uniform-width from that. Should be straightforward to roll our own.
from tensorflow-wavenet.
@jyegerlehner: I didn't actually know about xavier_initializer_conv2d
. Looking at the code, I can see that it only accesses the last 2 dimensions of the tensor, so we can just use it for our rank 3 tensors.
This example just works:
import tensorflow as tf
initializer = tf.contrib.layers.xavier_initializer_conv2d()
variable = tf.get_variable('x', shape=[2, 32, 32], initializer=initializer)
sess = tf.Session()
sess.run(tf.initialize_all_variables())
print(sess.run(variable))
from tensorflow-wavenet.
OK I'll try to get to this in the not-too-distant future. If anyone else wants to pick it up please speak up so we don't end up duplicating effort.
from tensorflow-wavenet.
@jyegerlehner: It would make sense to define a _create_variable(name, shape)
method that defines a new variable and initializes it using the Xavier initializer, so that it becomes easier to make the initializer configurable.
from tensorflow-wavenet.
@ibab Sounds like a good idea.
from tensorflow-wavenet.
Related Issues (20)
- how dialated convolution actually work ?
- How to stop and resume training HOT 2
- Problem on runing it on colab HOT 2
- generate.py very slow with GPU HOT 1
- TypeError: cast() missing 1 required positional argument: 'dtype'
- tensorboard result: the generated audio of generate.py is 0 seconds
- Understanding convolution kernels in dilation layers HOT 4
- TypeError: Value passed to parameter 'indices' has DataType float32 not in list of allowed values: uint8, int32, int64 HOT 1
- I failed to download the dataset, how should I resolve the voice HOT 1
- My loss function fluctuates like crazy.
- Colab problem: continue previous training HOT 4
- problem on generate only noise HOT 5
- testing much worse than training?
- QUESTION How long does it take to generate one sample? HOT 1
- Module 'tensorflow' has no attribute 'placeholder' HOT 8
- Why is there no activation function applied to the 1x1 conv that produces the dense output?
- ModuleNotFoundError: No module named 'tensorflow.contrib' HOT 1
- about loading VCTK_Corpus dataset?
- Project dependencies may have API risk issues
- Training wavenet to rap?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from tensorflow-wavenet.