Multimodal emotion detector

Web app in Django for multimodal emotion recognition using neural networks.

This web is an online live emotion detector in Spanish language. It uses and combines three different modalities in order to predict the emotional state of the user.

Components

The modalities used are:

Facial expressions.
Prosodic characteristics of the voice (how are you speaking).
The spoken message (what are you saying).

Combination

In order to combine these models, a weighted sum of the emotion probabilities given by the facial and the vocal models is made. If the result is unclear and there is some kind of doubt between two or more emotions, the message model is used to clarify this doubt.

Language

The vocal and textual models are trained with Spanish data, so this classifier may not work properly if used in other languages.

Emotion model

The emotional model used for this classifier is the classic discrete Paul Ekman's classification of 7 basic emotions:

Anger
Disgust
Fear
Happiness
Sadness
Surprise
Neutral

Neural networks

The neural networks corresponding to the three used modalities are private. The ones available publicly in this repository are deteriorated versions of the original ones.

Additional functionalities

The app also allows to record live sessions of emotion detection and download the video afterwards.

In order to make it more visual and identify the emotions easily, each emotion is related in the app with one color and one emoji:

Anger -> Red
Disgust -> Green
Fear -> Purple
Happiness -> Yellow
Sadness -> Blue
Surprise -> Orange
Neutral -> White

Available app

A lite version of this app is available at: https://tfm-emotions.herokuapp.com/emotions/ As heroku free plan limits the memory available, this public version only includes the facial emotion, which is in fact the most representative one.

Screenshots

The screenshots show the texts of the app in Spanish as the app was initially fully designed in Spanish. Right now these texts are transcribed and the texts of the available app and the code in this github are fully in English.

Here is a screenshot of the full app.

Credits

Some external tools were used to develop this recognizer.

The face detector that it is used to locate and crop the face in the image is the one available at https://github.com/auduno/clmtrackr

The voice transcriptor used to get the spoken message from the voice is the one available at https://github.com/Uberi/speech_recognition

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
embeddings		embeddings
emotions		emotions
models		models
mysite		mysite
screenshots		screenshots
.gitattributes		.gitattributes
.gitignore		.gitignore
Procfile		Procfile
README.md		README.md
db.sqlite3		db.sqlite3
log.log		log.log
manage.py		manage.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multimodal emotion detector

Components

Combination

Language

Emotion model

Neural networks

Additional functionalities

Available app

Screenshots

Credits

About

Releases

Packages

Languages

jofuelo/multimodal_emotion_detector

Folders and files

Latest commit

History

Repository files navigation

Multimodal emotion detector

Components

Combination

Language

Emotion model

Neural networks

Additional functionalities

Available app

Screenshots

Credits

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages