andreas12345 / cdr-dereverb
Coherence-based Dereverberation for Speech Enhancement
License: BSD 2-Clause "Simplified" License
***************************************************************************
Description
***************************************************************************

This archive contains MATLAB code for CDR-based dereverberation as described in [1].
See demo_cdr_dereverb.m to get started.

[1] Andreas Schwarz, Walter Kellermann, "Coherent-to-Diffuse Power Ratio Estimation for Dereverberation", IEEE/ACM Trans. on Audio, Speech and Lang. Proc., 2015 (under review); preprint available: arXiv:1502.03784
PDF: http://arxiv.org/pdf/1502.03784

***************************************************************************
Changes
***************************************************************************

1.0 - 2014-10-29: Initial version
1.1 - 2015
  - The magnitude of the coherence is now limited in the estimators in order to prevent numerical problems.
  - Changed reference to journal paper.

***************************************************************************
License
***************************************************************************

Copyright (c) 2014, Chair of Multimedia Communications and Signal Processing, Friedrich-Alexander-Universitaet Erlangen-Nuernberg (FAU)

Redistribution and use in source and binary forms, with or without modification, are permitted provided that the following conditions are met:

1. Redistributions of source code must retain the above copyright notice, this list of conditions and the following disclaimer.
2. Redistributions in binary form must reproduce the above copyright notice, this list of conditions and the following disclaimer in the documentation and/or other materials provided with the distribution.
3. Neither the name of the copyright holder nor the names of its contributors may be used to endorse or promote products derived from this software without specific prior written permission.

THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
Thank you for the great work. I would like to ask about the call p=IterLSDesign(cfg.Lp,cfg.K,cfg.N); the function IterLSDesign is not included in the code, so how did you obtain the value of p? Also, do I need to recompute p if I use a different wave file, or is it OK to reuse the p value from the code for other audio files?
Dear Andreas,
Thanks for your work; it has helped me a lot with dereverberation and CDR estimation.
I have two questions about CDR and your paper.
1. The noise coherence in your paper is fixed, which is an ideal case. In reality, I believe the noise coherence changes with the environment. Are there methods for handling that?
2. The CDR estimation in the paper covers only two microphones. For a multichannel setup (more than two microphones, for example eight), is there a better mathematical way to compute the CDR?
Regards
Yongyu
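A note on the first question, as a minimal sketch rather than the repository's code: the paper models the coherence of the microphone signals as a mixture, Γ_x = (CDR·Γ_s + Γ_n)/(CDR + 1), of the direct-signal coherence Γ_s and the noise coherence Γ_n. When both are known, this can be inverted directly as CDR = (Γ_n − Γ_x)/(Γ_x − Γ_s). The Python snippet below uses that inversion to show how an assumed Γ_n that differs from the actual one biases the result. The function names, the 8 cm spacing, the TDOA, and the mismatched value 0.4 are all illustrative assumptions, and this direct inversion is not necessarily any of the estimators implemented in the MATLAB code.

```python
import cmath
import math


def diffuse_coherence(freq_hz, spacing_m, c=343.0):
    """Coherence of an ideal spherically isotropic (diffuse) field: sin(x)/x."""
    x = 2.0 * math.pi * freq_hz * spacing_m / c
    return 1.0 if x == 0.0 else math.sin(x) / x


def mixture_coherence(cdr, gamma_s, gamma_n):
    """Coherence of a coherent-plus-diffuse mixture with linear power ratio cdr."""
    return (cdr * gamma_s + gamma_n) / (cdr + 1.0)


def cdr_from_coherence(gamma_x, gamma_s, gamma_n):
    """Invert the mixture model; keep the real part, since a power ratio is real."""
    return ((gamma_n - gamma_x) / (gamma_x - gamma_s)).real


# Illustrative values only (assumptions, not taken from the repository).
freq_hz = 1000.0    # frequency bin under consideration
spacing_m = 0.08    # microphone spacing
tdoa_s = 1e-4       # time difference of arrival of the direct sound
true_cdr = 2.0      # linear CDR used to synthesize the mixture

gamma_s = cmath.exp(2j * math.pi * freq_hz * tdoa_s)
gamma_n = diffuse_coherence(freq_hz, spacing_m)
gamma_x = mixture_coherence(true_cdr, gamma_s, gamma_n)

# Inversion with the matching noise coherence, then with a mismatched one.
cdr_matched = cdr_from_coherence(gamma_x, gamma_s, gamma_n)
cdr_mismatched = cdr_from_coherence(gamma_x, gamma_s, 0.4)

print("matched noise coherence:   ", cdr_matched)
print("mismatched noise coherence:", cdr_mismatched)
```

With the matching Γ_n the inversion returns the CDR that was put in; with the mismatched value it does not, which is the sensitivity the question is about.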
Dear Andreas
According to Fig. 10 of the paper "Coherent-to-Diffuse Power Ratio Estimation for Dereverberation", the CDR estimators do not dereverberate equally well at low and high frequencies: apparently there is less dereverberation at low frequencies. If we reduce the microphone spacing to, say, 0.5 cm or 1 cm, the diffuse coherence model will be different. Will we see the same effect at low frequencies, and will the dereverberation performance of the CDR estimators therefore be worse?
I hope I have phrased this question clearly.
Thanks
David
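To make the spacing question concrete, here is a small Python sketch, assuming the standard model for an ideal spherically isotropic (diffuse) field, Γ_n(f) = sin(2πfd/c)/(2πfd/c). It tabulates the model for an 8 cm spacing and for the 1 cm and 0.5 cm spacings mentioned above. The function names, the 8 cm reference spacing, the frequency grid, and c = 343 m/s are illustrative assumptions, not values taken from the MATLAB code.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed


def diffuse_coherence(freq_hz, spacing_m, c=SPEED_OF_SOUND):
    """Coherence of an ideal diffuse field between two omnidirectional mics."""
    x = 2.0 * math.pi * freq_hz * spacing_m / c
    return 1.0 if x == 0.0 else math.sin(x) / x


def first_zero_hz(spacing_m, c=SPEED_OF_SOUND):
    """Frequency of the first zero of the sinc model (argument equal to pi)."""
    return c / (2.0 * spacing_m)


freqs = [250.0, 500.0, 1000.0, 2000.0, 4000.0]
spacings = [0.08, 0.01, 0.005]

# One row of model coherence values per spacing.
table = {d: [diffuse_coherence(f, d) for f in freqs] for d in spacings}

for d in spacings:
    row = "  ".join(f"{g:6.3f}" for g in table[d])
    print(f"d = {d * 100:4.1f} cm | {row} | first zero at {first_zero_hz(d):8.1f} Hz")
```

Under this model, the smaller the spacing, the closer the diffuse coherence stays to 1 over the whole band, so diffuse and coherent components become harder to tell apart by coherence alone. That suggests the low-frequency effect would extend to higher frequencies for very small spacings, although only the model is evaluated here, not the estimators themselves.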