eyupatakanozkan

Advanced Analysis with Power BI

Film Statistics

ABOUT PROJECT

Previously, I created a structured movie dataset by collecting my watch history from Letterboxd and enriching it with additional information from the IMDb API. The detailed steps for building the dataset and table are documented here, and the gallery I published on my social site can be viewed here.

After preparing the dataset, I used Power BI to conduct a more in-depth analysis, which gave me several insights into my personal movie-watching habits.

You can access the Power BI file by clicking the GitHub logo below.

Power BI Analyses

  • Overall Statistics

    • The total number of films I have watched

    • The number of unique directors, languages, and countries represented in my watch history

    • The cumulative total watch time in days

    • A comparison of my average ratings versus the average IMDb ratings

  • Country-Based Analysis

    • Movies were grouped by country of origin.

    • To keep the averages from being skewed by one or two films, only countries with at least five films watched were included in the charts.

    • Based on the left chart, I was able to rank the countries whose films I personally enjoyed the most.

    • Additionally, this allowed me to compare my movie preferences with those of IMDb users.

  • Language-Based Analysis

    • Films were categorized by their original language.

    • This provided a distribution of the languages I most frequently encountered in my movie-watching history.

  • Director-Based Analysis

    • A director was included in the charts only if I had watched at least three of their movies.

    • This filtering ensured that average values were meaningful.

    • The results revealed my favorite directors based on the ratings I assigned to their works.

    • A treemap was also built to show each director and the number of their films I have watched.

  • Decade-Based Analysis

    • A timeline visualization was created to show the cumulative number of films I watched by decade.

    • The chart also illustrated how my average ratings evolved over time, allowing for generational comparisons in cinema appreciation.
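The thresholds and aggregates described above could be reproduced outside Power BI roughly as follows. This is a minimal Python sketch, not the measures or queries used in the actual report. The field names (`country`, `director`, `year`, `runtime_min`, `my_rating`, `imdb_rating`) and the helper functions are hypothetical, the sample rows are made up, and both ratings are assumed to be on the same scale.

```python
from collections import Counter, defaultdict
from statistics import mean


def total_watch_days(films):
    """Sum runtimes (assumed to be in minutes) and convert the total to days."""
    return sum(f["runtime_min"] for f in films) / (60 * 24)


def group_averages(films, key, min_films):
    """Average both ratings per group, keeping groups with at least min_films films."""
    groups = defaultdict(list)
    for film in films:
        groups[film[key]].append(film)
    return {
        name: {
            "films": len(rows),
            "my_avg": mean(r["my_rating"] for r in rows),
            "imdb_avg": mean(r["imdb_rating"] for r in rows),
        }
        for name, rows in groups.items()
        if len(rows) >= min_films
    }


def decade_of(year):
    """Map a year to the first year of its decade."""
    return year // 10 * 10


def films_per_decade(films):
    """Count films per decade, sorted chronologically."""
    counts = Counter(decade_of(f["year"]) for f in films)
    return dict(sorted(counts.items()))


# Hypothetical sample rows, not taken from the real dataset.
sample = [
    {"country": "France", "director": "Director A", "year": 1994,
     "runtime_min": 110, "my_rating": 8.0, "imdb_rating": 7.5},
    {"country": "France", "director": "Director A", "year": 1999,
     "runtime_min": 95, "my_rating": 7.0, "imdb_rating": 7.9},
    {"country": "Japan", "director": "Director B", "year": 2003,
     "runtime_min": 125, "my_rating": 9.0, "imdb_rating": 8.1},
]

# The write-up uses min_films=5 for countries and min_films=3 for directors;
# a lower threshold is used here only because the sample is tiny.
print(group_averages(sample, "country", min_films=2))
print(films_per_decade(sample))
print(total_watch_days(sample))
```

Groups below the threshold are dropped before averaging is reported, so a single highly rated film cannot put a country or director at the top of the ranking.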


GitHub Integration

Since the website includes a GitHub link, the following items were uploaded to a public repository:

You can access the full repository from the GitHub logo below.