🍿 Spark Movie Recommender

📍 EPFL – Master in Data Science, Year 2 (2026) 🔗 Code Repository: GitHub


This project focuses on implementing scale-out data processing pipelines over Apache Spark to power a video streaming application’s recommendation engine. Using a dataset based on MovieLens, the system pre-computes statistics and serves personalized movie recommendations to end-users.

The architecture is divided into four core processing milestones:


đź›  Tools & Libraries:

đź§  Techniques: