A list of papers (with available code), tutorials, and surveys on recent AI for emotion recognition (AI4ER)
- Emotion Recognition and Detection Methods: A Comprehensive Survey paper
- A systematic review on affective computing: emotion models, databases, and recent advances paper
- Multimodal Emotion Recognition using Deep Learning paper
- Emotion recognition using multi-modal data and machine learning techniques: A tutorial and review paper
- Deep learning-based multimodal emotion recognition from audio, visual, and text modalities: A systematic review of recent advancements and future prospects paper
- Emotion recognition from unimodal to multimodal analysis: A review
- Deep Multimodal Emotion Recognition on Human Speech: A Review paper
- Speech emotion recognition: Emotional models, databases, features, preprocessing methods, supporting modalities, and classifiers paper
- A Comprehensive Review of Speech Emotion Recognition Systems paper
- Deep Learning Techniques for Speech Emotion Recognition, from Databases to Models paper
- A Review on Speech Emotion Recognition Using Deep Learning and Attention Mechanism paper
- TRUST-SER: On The Trustworthiness Of Fine-Tuning Pre-Trained Speech Embeddings For Speech Emotion Recognition paper code
- EmoHRNet: High-Resolution Neural Network Based Speech Emotion Recognition paper
- Parameter Efficient Finetuning for Speech Emotion Recognition and Domain Adaptation paper
- Investigating Salient Representations and Label Variance in Dimensional Speech Emotion Analysis paper
- Adaptive Speech Emotion Representation Learning Based On Dynamic Graph paper
- Leveraging Speech PTM, Text LLM, And Emotional TTS For Speech Emotion Recognition paper
- Enhancing Two-Stage Finetuning for Speech Emotion Recognition Using Adapters paper
- Frame-Level Emotional State Alignment Method for Speech Emotion Recognition paper code
- Gradient-Based Dimensionality Reduction for Speech Emotion Recognition Using Deep Networks paper code
- Speech Swin-Transformer: Exploring a Hierarchical Transformer with Shifted Windows for Speech Emotion Recognition paper
- Disentanglement Network: Disentangle the Emotional Features from Acoustic Features for Speech Emotion Recognition paper
- Improving Speaker-Independent Speech Emotion Recognition using Dynamic Joint Distribution Adaptation paper
- Comparing Data-Driven and Handcrafted Features for Dimensional Emotion Recognition paper
- Emotion-Aware Contrastive Adaptation Network for Source-Free Cross-Corpus Speech Emotion Recognition paper
- Balancing Speaker-Rater Fairness for Gender-Neutral Speech Emotion Recognition paper
- Prompting Audios Using Acoustic Properties for Emotion Representation paper
- Speech Emotion Recognition with Distilled Prosodic and Linguistic Affect Representations paper
- Generalization of Self-Supervised Learning-Based Representations for Cross-Domain Speech Emotion Recognition paper
- Dynamic Speech Emotion Recognition Using A Conditional Neural Process paper
- Revealing Emotional Clusters in Speaker Embeddings: A Contrastive Learning Strategy for Speech Emotion Recognition paper
- Foundation Model Assisted Automatic Speech Emotion Recognition: Transcribing, Annotating, and Augmenting paper
- MS-SENet: Enhancing Speech Emotion Recognition Through Multi-Scale Feature Fusion with Squeeze-and-Excitation Blocks paper
- Cubic Knowledge Distillation for Speech Emotion Recognition paper code
- Improving Speech Emotion Recognition with Unsupervised Speaking Style Transfer paper
- Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition paper code
- Multi-Source Unsupervised Transfer Components Learning for Cross-Domain Speech Emotion Recognition paper
- Self-Supervised Domain Exploration with an Optimal Transport Regularization for Open Set Cross-Domain Speech Emotion Recognition paper
- Towards Improving Speech Emotion Recognition Using Synthetic Data Augmentation from Emotion Conversion paper
- ESIHGNN: Event-State Interactions Infused Heterogeneous Graph Neural Network for Conversational Emotion Recognition paper
- MCM-CSD: Multi-Granularity Context Modeling with Contrastive Speaker Detection for Emotion Recognition in Real-Time Conversation paper code
- SERC-GCN: Speech Emotion Recognition In Conversation Using Graph Convolutional Networks paper
- Conversation Clique-Based Model for Emotion Recognition In Conversation paper
- Speaker-Centric Multimodal Fusion Networks for Emotion Recognition in Conversations paper
- Large Language Model-Based Emotional Speech Annotation Using Context and Acoustic Feature for Speech Emotion Recognition paper
- MF-AED-AEC: Speech Emotion Recognition by Leveraging Multimodal Fusion, ASR Error Detection, and ASR Error Correction paper
- GEmo-CLAP: Gender-Attribute-Enhanced Contrastive Language-Audio Pretraining for Accurate Speech Emotion Recognition paper
- Fine-Grained Disentangled Representation Learning For Multimodal Emotion Recognition paper
- Improving Multi-Modal Emotion Recognition Using Entropy-Based Fusion and Pruning-Based Network Architecture Optimization paper
- Multi-Grained Multimodal Interaction Network for Sentiment Analysis paper
- Fusing Modality-Specific Representations and Decisions for Multimodal Emotion Recognition paper
- AttA-NET: Attention Aggregation Network for Audio-Visual Emotion Recognition paper code
- MMRBN: Rule-Based Network for Multimodal Emotion Recognition paper
- Inter-Modality and Intra-Sample Alignment for Multi-Modal Emotion Recognition paper
- RL-EMO: A Reinforcement Learning Framework for Multimodal Emotion Recognition paper code
- Multi-Modal Emotion Recognition Using Multiple Acoustic Features and Dual Cross-Modal Transformer paper
- AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models paper
- Learning Language-guided Adaptive Hyper-modality Representation for Multimodal Sentiment Analysis paper code
- LanSER: Language-Model Supported Speech Emotion Recognition paper
- Fine-tuned RoBERTa Model with a CNN-LSTM Network for Conversational Emotion Recognition paper
- Emotion Label Encoding Using Word Embeddings for Speech Emotion Recognition paper
- Discrimination of the Different Intents Carried by the Same Text Through Integrating Multimodal Information paper
- Meta-domain Adversarial Contrastive Learning for Alleviating Individual Bias in Self-sentiment Predictions paper
- SWRR: Feature Map Classifier Based on Sliding Window Attention and High-Response Feature Reuse for Multimodal Emotion Recognition paper
- Focus-attention-enhanced Crossmodal Transformer with Metric Learning for Multimodal Speech Emotion Recognition paper
- Learning Emotional Representations from Imbalanced Speech Data for Speech Emotion Recognition and Emotional Text-to-Speech paper
- MMER: Multimodal Multi-task Learning for Speech Emotion Recognition paper
- A Dual Attention-based Modality-Collaborative Fusion Network for Emotion Recognition paper [code](https://github.com/zxiaohen/Speech-emotion-recognition-MCFN)
- Speaker-aware Cross-modal Fusion Architecture for Conversational Emotion Recognition paper
- Emotion Prompting for Speech Emotion Recognition paper
- EmotionNAS: Two-stream Neural Architecture Search for Speech Emotion Recognition paper *
- Leveraging Semantic Information for Efficient Self-Supervised Emotion Recognition with Audio-Textual Distilled Models paper
- Leveraging Label Information for Multimodal Emotion Recognition paper
- Improving Joint Speech and Emotion Recognition Using Global Style Tokens paper
- Dual Memory Fusion for Multimodal Speech Emotion Recognition paper *
- Multi-Scale Temporal Transformer For Speech Emotion Recognition paper *
- Speech Emotion Recognition by Estimating Emotional Label Sequences with Phoneme Class Attribute paper
- Unsupervised Transfer Components Learning for Cross-Domain Speech Emotion Recognition paper
- Speech Emotion Recognition using Decomposed Speech via Multi-task Learning paper
- Cross-Lingual Cross-Age Adaptation for Low-Resource Elderly Speech Emotion Recognition paper
- MetricAug: A Distortion Metric-Lead Augmentation Strategy for Training Noise-Robust Speech Emotion Recognizer paper code *
- Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for Conversations paper
- Two-stage Finetuning of Wav2vec 2.0 for Speech Emotion Recognition with ASR and Gender Pretraining paper
- Diverse Feature Mapping and Fusion via Multitask Learning for Multilingual Speech Emotion Recognition paper
- Hybrid Dataset for Speech Emotion Recognition in Russian Language paper
- Exploring Complementary Features in Multi-Modal Speech Emotion Recognition paper
- Cross-Modal Fusion Techniques for Utterance-Level Emotion Recognition from Text and Speech paper
- Using Auxiliary Tasks In Multimodal Fusion of Wav2vec 2.0 And BERT for Multimodal Emotion Recognition paper
- Robust multi-modal speech emotion recognition with ASR error adaptation paper
- Multilevel Transformer for Multimodal Emotion Recognition paper
- MGAT: Multi-Granularity Attention Based Transformers for Multi-Modal Emotion Recognition paper
- Knowledge-Aware Bayesian Co-Attention for Multimodal Emotion Recognition paper
- Exploiting Modality-Invariant Feature for Robust Multimodal Emotion Recognition with Missing Modalities paper code
- Multimodal Emotion Recognition Based on Deep Temporal Features Using Cross-Modal Transformer and Self-Attention paper code
- Exploring Attention Mechanisms for Multimodal Emotion Recognition in an Emergency Call Center Corpus paper
- Recursive Joint Attention for Audio-Visual Fusion in Regression Based Emotion Recognition paper code
- Learning Cross-Modal Audiovisual Representations with Ladder Networks for Emotion Recognition paper
- DST: Deformable Speech Transformer for Emotion Recognition paper
- Multiple Acoustic Features Speech Emotion Recognition Using Cross-Attention Transformer paper
- Speech Emotion Recognition Via Two-Stream Pooling Attention With Discriminative Channel Weighting paper
- Speech Emotion Recognition via Heterogeneous Feature Learning paper
- Pre-Trained Model Representations and Their Robustness Against Noise for Speech Emotion Analysis paper
- Learning Robust Self-Attention Features for Speech Emotion Recognition with Label-Adaptive Mixup paper code
- Hierarchical Network with Decoupled Knowledge Distillation for Speech Emotion Recognition paper
- Adapting a Self-Supervised Speech Representation for Noisy Speech Emotion Recognition by Using Contrastive Teacher-Student Learning paper
- Fast Yet Effective Speech Emotion Recognition with Self-Distillation paper code
- General or Specific? Investigating Effective Privacy Protection in Federated Learning for Speech Emotion Recognition paper
- Speech-Based Emotion Recognition with Self-Supervised Models Using Attentive Channel-Wise Correlations and Label Smoothing paper code
- Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech Emotion Recognition paper code
- Phonetic Anchor-Based Transfer Learning to Facilitate Unsupervised Cross-Lingual Speech Emotion Recognition paper
- Knowledge Transfer for on-Device Speech Emotion Recognition With Neural Structured Learning paper code
- Speech Emotion Recognition Based on Low-Level Auto-Extracted Time-Frequency Features paper
- Role of Lexical Boundary Information in Chunk-Level Segmentation for Speech Emotion Recognition paper
- Zero-Shot Speech Emotion Recognition Using Generative Learning with Reconstructed Prototypes paper
- A Generalized Subspace Distribution Adaptation Framework for Cross-Corpus Speech Emotion Recognition paper
- Exploring Wav2vec 2.0 Fine Tuning for Improved Speech Emotion Recognition paper code
- DWFormer: Dynamic Window Transformer for Speech Emotion Recognition paper code
- Deep Implicit Distribution Alignment Networks for Cross-Corpus Speech Emotion Recognition paper
- Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-Trained Representations paper code
- Designing and Evaluating Speech Emotion Recognition Systems: A Reality Check Case Study with IEMOCAP paper
- EMIX: A Data Augmentation Method for Speech Emotion Recognition paper
- Multi-View Learning for Speech Emotion Recognition with Categorical Emotion, Categorical Sentiment, and Dimensional Scores paper
- An Empirical Study and Improvement for Speech Emotion Recognition paper
- Towards Learning Emotion Information from Short Segments of Speech paper
- Knowledge-Aware Graph Convolutional Network with Utterance-Specific Window Search for Emotion Recognition In Conversations paper
- Multi-Scale Receptive Field Graph Model for Emotion Recognition in Conversations paper
- SDTN: Speaker Dynamics Tracking Network for Emotion Recognition in Conversation paper
- Emotion Recognition in Conversation from Variable-Length Context paper
- Ensemble Knowledge Distillation of Self-Supervised Speech Models paper
- Domain Adaptation without Catastrophic Forgetting on a Small-Scale Partially-Labeled Corpus for Speech Emotion Recognition paper
- ShuffleAugment: A Data Augmentation Method Using Time Shuffling paper
- Achieving Fair Speech Emotion Recognition via Perceptual Fairness paper
- Unsupervised Domain Adaptation for Preference Learning Based Speech Emotion Recognition paper
- Using Emotion Embeddings to Transfer Knowledge between Emotions, Languages, and Annotation Formats paper code
- QI-TTS: Questioning Intonation Control for Emotional Speech Synthesis paper
- A Hierarchical Regression Chain Framework for Affective Vocal Burst Recognition paper
- Mimicking the Thinking Process for Emotion Recognition in Conversation with Prompts and Paraphrasing paper
- SKIER: A Symbolic Knowledge Integrated Model for Conversational Emotion Recognition paper
- BERT-ERC: Fine-Tuning BERT Is Enough for Emotion Recognition in Conversation paper
- Feature Normalization and Cartography-Based Demonstrations for Prompt-Based Fine-Tuning on Emotion-Related Tasks paper
- Layer-wise Fusion with Modality Independence Modeling for Multi-modal Emotion Recognition paper code
- ConKI: Contrastive Knowledge Injection for Multimodal Sentiment Analysis paper
- ConFEDE: Contrastive Feature Decomposition for Multimodal Sentiment Analysis paper
- Topic and Style-aware Transformer for Multimodal Emotion Recognition paper
- Self-adaptive Context and Modal-interaction Modeling For Multimodal Emotion Recognition paper
- QAP: A Quantum-Inspired Adaptive-Priority-Learning Model for Multimodal Emotion Recognition paper
- Supervised Adversarial Contrastive Learning for Emotion Recognition in Conversations paper code
- MultiEMO: An Attention-Based Correlation-Aware Multimodal Fusion Framework for Emotion Recognition in Conversations paper
- DualGATs: Dual Graph Attention Networks for Emotion Recognition in Conversations paper
- A Cross-Modality Context Fusion and Semantic Refinement Network for Emotion Recognition in Conversation paper
- A Facial Expression-Aware Multimodal Multi-task Learning Framework for Emotion Recognition in Multi-party Conversations paper
- Label-Aware Hyperbolic Embeddings for Fine-grained Emotion Classification paper code
- Estimating the Uncertainty in Emotion Attributes using Deep Evidential Regression paper
- Tailor Versatile Multi-Modal Learning for Multi-Label Emotion Recognition paper
- Hybrid Curriculum Learning for Emotion Recognition in Conversation paper
- Contrast and Generation Make BART a Good Dialogue Emotion Recognizer paper
- Is Discourse Role Important for Emotion Recognition in Conversation? paper
- CTL-MTNet: A Novel CapsNet and Transfer Learning-Based Mixed Task Net for Single-Corpus and Cross-Corpus Speech Emotion Recognition paper code
- Speaker-Guided Encoder-Decoder Framework for Emotion Recognition in Conversation paper
- CauAIN: Causal Aware Interaction Network for Emotion Recognition in Conversations paper
- Online ECG Emotion Recognition for Unknown Subjects via Hypergraph-Based Transfer Learning paper
- Counterfactual Reasoning for Out-of-distribution Multimodal Sentiment Analysis paper
- Unified Multi-modal Pre-training for Few-shot Sentiment Analysis with Prompt-based Learning paper
- Towards Unbiased Visual Emotion Recognition via Causal Intervention paper
- Unsupervised Domain Adaptation Integrating Transformer and Mutual Information for Cross-Corpus Speech Emotion Recognition paper
- M3ER: Multiplicative Multimodal Emotion Recognition using Facial, Textual, and Speech Cues paper