mylink's Introduction

1. Datasets

Speech enhancement datasets (sorted by usage frequency in papers)

English

| Name | Source | Hours (seconds) |
| --- | --- | --- |
| Dataset by University of Edinburgh | https://datashare.ed.ac.uk | 39h (141165.816338) |
| VCTK(2009) | https://datashare.ed.ac.uk | 82h (297552.223038) |
| LibriSpeech | http://www.openslr.org | 983h (3539995.721648) |
| Common Voice | https://commonvoice.mozilla.org | 1837h (6615925.423006) |
| The VoxCeleb1 Dataset | https://www.robots.ox.ac.uk | x |
| The VoxCeleb2 Dataset | https://www.robots.ox.ac.uk | x |

German

| Name | Source | Hours (seconds) |
| --- | --- | --- |
| Common Voice | https://commonvoice.mozilla.org/ | 832h (2995609.871861) |

Augmentation noise sources (sorted by usage frequency in papers)

| Name | Source | Duration (seconds) |
| --- | --- | --- |
| DEMAND | https://zenodo.org | 8h (28800.384000) |
| 100 Noise | http://web.cse.ohio-state.edu | 293s (293.299375) |
| RIRS_NOISES | https://www.openslr.org | 27h (97661.178407) |
| QUT-NOISE | https://research.qut.edu.au | 27h (98262.946746) |
| MUSAN | https://www.openslr.org | 48h (175827.483386) |
| Deep Noise Suppression (DNS) Challenge - Interspeech 2020 | https://github.com/breizhn/DNS-Challenge | |
| Deep Noise Suppression (DNS) Challenge - Interspeech 2022 | https://github.com/microsoft/DNS-Challenge | |
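
The figures in parentheses read as total audio duration in seconds (for example, 141165.816338 s / 3600 is roughly 39 h). The source does not say how they were measured. The following is a minimal sketch of one way such a total could be computed for a folder of recordings, assuming the files are uncompressed PCM WAV. The function names `total_duration_seconds` and `format_hours` are hypothetical, not taken from any of the linked projects.

```python
import wave
from pathlib import Path


def total_duration_seconds(root):
    """Sum the duration of every .wav file found under root (recursively)."""
    total = 0.0
    for path in sorted(Path(root).rglob("*.wav")):
        with wave.open(str(path), "rb") as wav:
            # Duration of one file = number of frames / frames per second.
            total += wav.getnframes() / wav.getframerate()
    return total


def format_hours(seconds):
    """Render a duration as whole hours followed by the exact seconds."""
    return f"{int(seconds // 3600)}h ({seconds:.6f})"


if __name__ == "__main__":
    # Reformat one of the figures listed in the tables.
    print(format_hours(141165.816338))
```

Datasets shipped as FLAC or MP3 (LibriSpeech and Common Voice, for instance) would need a decoder library instead of the standard `wave` module.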

Audio data augmentation

| Link | Language | Description |
| --- | --- | --- |
| Data simulation | Python | Adds reverberation or noise, or mixes speakers. |
| audio-SNR | Python | Mixes an audio file with a noise file at any signal-to-noise ratio (SNR). |
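
Mixing at a chosen SNR comes down to scaling the noise so that the ratio of speech power to noise power matches the target in decibels. The sketch below illustrates that arithmetic on plain lists of float samples. It is not the implementation used by audio-SNR or the data simulation tool; the names `mean_power` and `mix_at_snr` are hypothetical, and reading or writing audio files and clipping protection are left out.

```python
import math


def mean_power(samples):
    """Average power of a signal given as a sequence of float samples."""
    return sum(x * x for x in samples) / len(samples)


def mix_at_snr(speech, noise, snr_db):
    """Add noise to speech so the speech-to-noise power ratio equals snr_db.

    Returns (mixture, scaled_noise), both as lists the length of speech.
    """
    speech = list(speech)
    noise = list(noise)
    if not speech or not noise:
        raise ValueError("speech and noise must be non-empty")

    # Loop the noise if it is shorter than the speech, then trim to length.
    repeats = -(-len(speech) // len(noise))  # ceiling division
    noise = (noise * repeats)[: len(speech)]

    p_speech = mean_power(speech)
    p_noise = mean_power(noise)
    if p_noise == 0.0:
        raise ValueError("noise is silent, so no gain can reach the target SNR")

    # Solve snr_db = 10 * log10(p_speech / (gain**2 * p_noise)) for gain.
    gain = math.sqrt(p_speech / (p_noise * 10 ** (snr_db / 10)))

    scaled_noise = [gain * n for n in noise]
    mixture = [s + n for s, n in zip(speech, scaled_noise)]
    return mixture, scaled_noise


if __name__ == "__main__":
    # Synthetic stand-ins: a 440 Hz tone as "speech", a short pattern as "noise".
    tone = [math.sin(2 * math.pi * 440 * t / 16000) for t in range(1600)]
    pattern = [0.5, -0.5, 0.25, -0.25]
    mixed, scaled = mix_at_snr(tone, pattern, snr_db=5.0)
    print(10 * math.log10(mean_power(tone) / mean_power(scaled)))
```

Scaling the noise rather than the speech keeps the speech level unchanged across SNR values. The mixture can exceed the range [-1.0, 1.0] at low SNR, so a real pipeline would typically renormalize before writing integer PCM.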

2. Papers

  1. Transformer

  2. Speech Enhancement

  3. Data Augmentation

  4. End to End

  5. Awesome Speech Enhancement

1. Speech Processing

1. Blogs

  1. Speech Processing for Machine Learning: Filter banks, Mel-Frequency Cepstral Coefficients (MFCCs) and What's In-Between
  2. RNN transducer
  3. What is translation equivariance, and why do we use convolutions to get it?
  4. TensorFlow Lite signature analysis
  5. TFLite Quantization
  6. Optimizing data transformation with a TensorFlow Custom Op

2. Tensorflow

  1. tf.data: Build TensorFlow input pipelines
  2. Better performance with the tf.data API
  3. Distributed training
  4. Training checkpoints
  5. Better performance with tf.function

3. Audio recognition using Tensorflow Lite

  1. tflite-support

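  2. Integrate audio classifiers -> Integrate audio classifiers 2

  3. Audio data preparation and augmentation

  4. Model conversion

  5. Model optimization

  6. Transfer learning for the audio domain with TensorFlow Lite Model Maker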

  7. Tensorflow and Tensorflow Lite code in the context of audio processing (MFCC, RNN)

  8. Audio recognition using Tensorflow Lite in Flutter applications

  9. Udacity - TensorFlow Lite e-learning course

  10. Coursera - Device-based Models with TensorFlow Lite

  11. A Definitive Guide for Audio Processing in Android with TensorFlow Lite Models

  12. Use a custom TensorFlow Lite model on Android

  13. Tensorflow lite

  1. Quantization

import android.graphics.Bitmap;
import android.graphics.Color;
import java.nio.ByteBuffer;
import java.nio.ByteOrder;

// yourInputImage (a Bitmap) and interpreter (a TensorFlow Lite Interpreter)
// are assumed to be created elsewhere.

// Scale the image to the 224x224 size used as model input here.
Bitmap bitmap = Bitmap.createScaledBitmap(yourInputImage, 224, 224, true);
// 224 x 224 pixels, 3 channels per pixel, 4 bytes per float.
ByteBuffer input = ByteBuffer.allocateDirect(224 * 224 * 3 * 4).order(ByteOrder.nativeOrder());
for (int y = 0; y < 224; y++) {
    for (int x = 0; x < 224; x++) {
        int px = bitmap.getPixel(x, y);

        // Get channel values (0 to 255) from the pixel value.
        int r = Color.red(px);
        int g = Color.green(px);
        int b = Color.blue(px);

        // Normalize channel values to [-1.0, 1.0]. This requirement depends
        // on the model. For example, some models might require values to be
        // normalized to the range [0.0, 1.0] instead.
        float rf = (r - 127.5f) / 127.5f;
        float gf = (g - 127.5f) / 127.5f;
        float bf = (b - 127.5f) / 127.5f;

        input.putFloat(rf);
        input.putFloat(gf);
        input.putFloat(bf);
    }
}
// Output buffer sized for 1000 float values (4 bytes each).
int bufferSize = 1000 * java.lang.Float.SIZE / java.lang.Byte.SIZE;
ByteBuffer modelOutput = ByteBuffer.allocateDirect(bufferSize).order(ByteOrder.nativeOrder());
interpreter.run(input, modelOutput);
...

  1. Recognize Flowers with TensorFlow Lite on Android

  2. GitHub:

4. Voice Filter

  1. VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition

5. Tensorflow

  1. tf.data: Build TensorFlow input pipelines
  2. Summary of Keras loss functions

2. Speech Recognition

1. Toolkits

  1. End-to-End Speech Processing Toolkit
  2. Openspeech
  3. DeepSpeech

Challenge

1. Deep Noise Suppression Challenge

mylink's People

Contributors

starmkk
