π¬ From Screen to Heart β How Do Emotions Shape the Cinematic Landscape?
π EPFL β Master in Data Science, Year 1 (2024)
π₯ Team: Sofia Taouhid, Lina Sadgal, Yasmine Tligui, Guillaume Vitalis
π Award: Selected among the Top 10 Projects (out of 100+ teams)
π Project Website: Demo
π Code Repository: GitHub
This project investigates the emotional dimensions of storytelling using a large corpus of over 11,000 movie summaries enriched with 450,000 IMDb reviews.
We used HuggingFaceβs Emotion DistilRoBERTa model to extract primary emotions β joy, sadness, fear, anger, surprise, disgust, neutrality β then applied statistical correlations, PCA, and K-Means clustering to map emotional trends across genres and narratives.
Inspired by Inside Out, our project features a visual storytelling approach, integrating emotion-based visualizations like word clouds and heatmaps to offer an engaging exploration of genre-emotion relationships.
π Tools & Libraries:
- Python
- Pandas
- Scikit-learn
- HuggingFace Transformers
- Plotly
- Jekyll
- HTML/CSS
π§ Techniques:
- Natural Language Processing (NLP)
- Sentiment Analysis
- Clustering
- Interactive Visualization