Top 100 Profitable Directors In Movie Industry

INSIGHT FOUR - Top 100 Profitable Directors In Movie Industry
Description & Findings

The Bubble Scatterplot data visualization above give an overview of which director make most profitable movie in movie industry. As you can see that I have included few filtering option such as filter byGross Earning, Profit & ROI and all these factor can also be filtered by Total or Average when you click on the radio button switch. While Steven Spielberg is in the top leaderboard of either gross earning or profitable director but when you click on ROI, we can find out that Oren Peli is the director that has highest ROI, which is the director of Paranormal Activity.


Interactivity
  • Filter: You can select different filtering option by click on the regular button or switching radio button based on what you would like to explore in this data visualization.
  • Top 100 Profitable Actors In Movie Industry

    INSIGHT FIVE - Top 100 Profitable Actors In Movie Industry
    Description & Findings

    The Bubble Scatterplot data visualization above give is quite similar to the previous one, it gives an overview of which actor make most profitable movie in movie industry. As you can see that the filtering option is available here so you can explore more different option in details. It is interesting to find out that actor Gloria Stuart is the most profitable actor based on average gross earning and profit, while not surprisingly, the actor in Paranormal Activity has highest ROI such as Ashley Palmer.


    Interactivity
  • Filter: You can select different filtering option by click on the regular button or switching radio button based on what you would like to explore in this data visualization.
  • language

    Correlation Analysis on IMDb Dataset

    Correlation Matrix

    Scatter Plot

    INSIGHT SIX - Correlation Analysis on IMDb Dataset
    Description

    I've used Correlation Matrixdata visualization to find out the mutual relationship or connection between two properties in this dataset. There are two color coding here, green represent the positive correlation and red represent the negative correlation.


    Interactivity
  • Hover: You can hover different cell and it will pop out a scatterplot based on the two hovered properties.
  • Click: After you hover, you can click on that cell, then move to the scatterplot to reveal more information or explore the data you would like to know. Remember unclick the cell to resume back to hover mode. This logic is set explicitly so both hover and click effect won't contradict each other.

  • Interesting Fact
  • It is interesting to find out that the movie IMDb score and # of Voted Users has a positive coefficient of 0.47 which means that they are both related to each other and has some kind of relationship. In order to have good IMDb score, the number of voted users play a key factor.
  • We can also see that the profit and budget have -0.47 coefficient, this means that throwing or investing large amount of budget for a movie doesn’t guarantee that we will have a good profit return.
  • It is interesting to find out that The director's facebook likes has greatly affect the profit of a movies. As we can see that, the director with lowest facebook like have a negative loss in profit in the scatterplot on the right side.
  • I'm also curious in finding out how number of faces in poster affect IMDb score. As we can see that with correlation coefficient of -0.07, and by seeing the trend in the scatterplot, we know that when the poster has a lot of faces, the IMDb score start dropping to lower score.
  • I also observed that the duration and IMDb score has 0.36 correlation coefficient, which is a postive correlation. As we can see from the scatterplot that most of the movie has duration in between 50~150 minutes, with IMDb score ranging from highest 9.3 to lowest 1.6. While increasing the duration, the movie rating start falling downward, which in my opinion that movie with too long or too short duration will have negative effect on viewers, they either get too bored when duration is long or haven't watching enough for a movie if the duration is short. Which will then affect the IMDb score when they rate on the movie.