GithubHelp home page GithubHelp logo

aryaaftab / light-sernet Goto Github PK

View Code? Open in Web Editor NEW
63.0 63.0 22.0 328 KB

Light-SERNet: A lightweight fully convolutional neural network for speech emotion recognition

Python 95.21% Shell 1.57% Jupyter Notebook 3.22%
deep-learning fully-convolutional-networks lightweight speech-emotion-recognition tensorflow2 tflite

light-sernet's Introduction

Arya Aftab ๐Ÿ‘‹

Arya's LinkedIn Arya's LinkedIn

As a Data Scientist and Machine Learning Engineer, I worked independently and as a team member for several projects, including voice command, speech emotion recognition, speech enhancement, etc. My desire to learn more about Machine Learning, Deep Learning, their applications and deploy them into real-world models.

Hello, My name is Arya!

  • ๐Ÿ”ญ Iโ€™m currently working on Meta-learning and Self-supervised Learning
  • ๐Ÿ“‹ Previously worked on Speech Recognition, Speech Emotion Recognition, and Tiny Machine Learning
  • ๐ŸŽ“ Master of Electrical Engineering (Communication Systems), Sharif University of Technology
  • ๐ŸŽ“ Bachelor of Biomedical Engineering (Bioelectric), Amirkabir University of Technology

GitHub Stats

Top Langs

Arya's github stats

light-sernet's People

Contributors

aryaaftab avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

light-sernet's Issues

InvalidArgumentError: Cannot batch tensors with different shapes in component 0.

Hello! Good job! But I have an error. I want to test the model with my audio files. I have created a folder my_test_3.0s_Segmented in date where the audio is tagged by emotion. Everything goes well, but I always get an error at the moment: list(test_dataset.as_numpy_iterator())
InvalidArgumentError: Cannot batch tensors with different shapes in component 0. First element had shape [103,40,1] and element 1 had shape [92,40,1]. [Op:IteratorGetNext]
This prevents me from testing. I used my code on test data generated while training the model. The code works and I get the result. How can I fix it?

cannot run the IEMOCAP dataset on windows

Hello, could you show the data folder architecture so I understand the way you organised the dataset.
I kept getting errors to segment the data.
I extracted the IEMOCAP_full_release in the data folder the renamed it as IEMOCAP, however, I kept getting errors of files not found.

MFCC hop size problem.

"Good job on the paper. However, there seems to be a discrepancy regarding the frame overlaps and hop size between your text and the provided code. In your paper, it's stated that a Hamming window is used to split the audio signal into 64-ms frames with 16-ms overlaps, which are considered as quasi-stationary segments. From this, it would logically follow that the hop size is 48 ms.

However, in the hyperparameters.py file, it's stated "FRAME_STEP = 256". Given a sampling rate (fs) of 16 kHz, this implies a hop size of 16 ms, not 48 ms. Could you please clarify if there's a typographical error in the paper, or if there's a specific reason for this inconsistency?"

function cleaning_directory_filename()

I think the function cleaning_directory_filename() breaks the speaker independence in the paper, i.e., 10-fold cross-validation, causing speaker overlap in the training and test sets. Removing this function, I get an 8% drop in WA. Could you explain my confusion.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.