hypothesis is that DC and Marvel movies have distinguishable styles and discuss different topics. If so, they create different experiences for the audience, and audience reviews should reflect those differences.
Essentially, this is a binary classification problem. The dependent variable is the studio (DC or Marvel), and the independent variables are features derived from the audience reviews. That is, we predict whether a movie is from DC or Marvel based on its reviews. The task can also be framed as a clustering problem: assuming the two studios' movies discuss different topics, such as democracy, feminism, self-growth, and collaboration, topic models may be able to separate DC reviews from Marvel reviews.
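
As a rough illustration of the classification framing, the sketch below trains a bag-of-words multinomial Naive Bayes classifier that predicts the studio from review text. The choice of Naive Bayes, the function names (tokenize, train_nb, predict_nb), and the four short "reviews" are all assumptions made for illustration; the reviews are hypothetical placeholders, not real data, and the text above does not commit to any particular classifier.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase a review and keep only purely alphabetic tokens."""
    return [w for w in text.lower().split() if w.isalpha()]


def train_nb(docs, labels):
    """Fit a multinomial Naive Bayes model on (review, studio) pairs."""
    class_docs = Counter(labels)
    word_counts = {c: Counter() for c in class_docs}
    for doc, label in zip(docs, labels):
        word_counts[label].update(tokenize(doc))
    vocab = {w for counts in word_counts.values() for w in counts}
    log_prior = {c: math.log(n / len(labels)) for c, n in class_docs.items()}
    return log_prior, word_counts, vocab


def predict_nb(model, text):
    """Return the studio with the highest posterior log-score."""
    log_prior, word_counts, vocab = model
    scores = {}
    for c in log_prior:
        total = sum(word_counts[c].values())
        score = log_prior[c]
        for w in tokenize(text):
            if w in vocab:  # ignore words never seen in training
                # Laplace (add-one) smoothing avoids log(0)
                score += math.log((word_counts[c][w] + 1) / (total + len(vocab)))
        scores[c] = score
    return max(scores, key=scores.get)


# Hypothetical placeholder reviews (NOT real data), one label per review.
train_reviews = [
    "dark brooding tone and a grim serious hero",
    "grim atmosphere with a dark serious plot",
    "funny banter and a colorful lighthearted team",
    "lighthearted humor with funny colorful action",
]
train_labels = ["DC", "DC", "Marvel", "Marvel"]

model = train_nb(train_reviews, train_labels)
prediction = predict_nb(model, "a funny and colorful adventure")
print(prediction)
```

With real reviews, the same setup would be evaluated on a held-out test set; accuracy clearly above chance would be evidence for the hypothesis that reviews reflect differences between the studios.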

Data 

Methods