Light

marlousnijman / asr-speaker-estimation Goto Github PK

View Code? Open in Web Editor NEW

0.0 1.0 0.0 411 KB

Jupyter Notebook 94.67% Python 5.33%

asr-speaker-estimation's Introduction

ASR Speaker Estimation

Number of Speaker Estimation in Real-Life Conversations

Number of speaker estimation from single channel speech audio is an ASR task that is often used as starting point for other related task, such as speech diarization, but could also be used for audiosurveillance. In this work the robustness and generalizability of CountNet, a neural network for number of speaker estimation, trained on an artificially generated dataset was evaluated. The model was trained on a dataset generated from single speaker audio book recordings and tested on audio recordings of realistic conversations, which were manually labelled. Results show that model performance does generalize as well to these realistic datasets, and the number of simultaneous speakers are often overestimated, possibly as a result of reverberance, environmental characteristics, and speaking style.

Original Dataset	Real-life Dataset

asr-speaker-estimation's People

Contributors

Watchers

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.

Jobs