Overview
The EmoTa Tamil Emotional Speech Dataset is a collection of recordings in Sri Lankan Tamil, representing distinct dialects from the northern, eastern, western, and central provinces. It is designed for research in speech and emotion recognition.
Key Features
- Speakers: 22 native Tamil speakers (11 male, 11 female)
- Emotions: Anger, Happiness, Sadness, Fear, Neutrality
- Sentences: 19 semantically neutral sentences
- Recording Quality: Captured in a soundproof environment
- Total Duration: Approx. 48 minutes
Dataset Structure
EmoTa/ ├── happy/ ├── sad/ ├── angry/ ├── fear/ └── neutral/ └──_ _ .wav