road2018 Goto Github PK

followers: 15.0 following: 35.0 repos: 1.1K gists: 0.0

Type: User

road2018's Projects

opus

Modern audio compression for the internet.

p.563

ITU P.563 code with minor modifications to make it run on Mac

This is an open-source implementation of the ITU P.808 standard for "Subjective evaluation of speech quality with a crowdsourcing approach" (see https://www.itu.int/rec/T-REC-P.808/en). It uses Amazon Mechanical Turk as the crowdsourcing platform. It includes implementations for Absolute Category Rating (ACR), Degradation Category Rating (DCR), and Comparison Category Rating (CCR).

p2fa_mandarin_py3

Modified Python3 P2FA for Mandarin

p2fa_py3

Penn Phonetics Lab Forced Aligner Toolkit (P2FA) for Python3

p2fa_state_aligner

P2FA-based HMM state-level forced aligner

packet-loss-notification

The project provides a helper class for detection and notification about lost unreliable packets.

paddlespeech

Easy-to-use Speech Toolkit including SOTA/Streaming ASR with punctuation, influential TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

pannstensorflow

Tensorflow implementation of Qiuqiang Kong's PANNs (https://github.com/qiuqiangkong/audioset_tagging_cnn/tree/keras_waveform)

pb_chime5

Speech enhancement system for the CHiME-5 dinner party scenario

percepnet

(Work In Progress) Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech

percepnet-keras

percepnet implemented using Keras, still need to be optimized and tuned.

performanceevaluationdnnforbss

pesq

Perceptual Evaluation of Speech Quality

phasen

A unofficial Pytorch implementation of Microsoft's PHASEN

phkit

phoneme toolkit. 好用的音素处理工具箱，包含中文音素、英文音素、文本转拼音、文本正则化等模块。

phone_distance_grader

Use phone distances to assess a speakers pronunciation for fluency

pidtln

Apply machine learning model DTLN for noise suppression and acoustic echo cancellation on Raspberry Pi

pitch-tracking

Pitch tracking in real-time with the Kalman filter

plc-challenge

This repo contains required files for the INTERSPEECH 2022 Audio Deep Packet Loss Concealment (PLC) Challenge.

plcify

Run packet loss concealment algorithm on wavs

pocketfft

FFT implementation based on FFTPack, but with several improvements, cloned from

pocketsphinx

PocketSphinx is a lightweight speech recognition engine, specifically tuned for handheld and mobile devices, though it works equally well on the desktop