Recommender System is essential in E-commerce. I am very interested in the recommender system. Hence I created a game recommender system based on the dataset provided by steam.
To identify outliers in the dataset, I carried out exploratory analysis on it. After I removed anomalies, I plotted two bar charts regarding users’ games’ play time and purchase records to analyze the top ten favorite games. From the bar charts, we can see the popular games in the scale of play time are also got the most purchases.
After analyzing the top 10 popular games, I plotted the distribution of users’ games’ play time. Most users are playing a game less than 100 hours, while a few users have longer game-playing time. To better show the distribution, I made a log transformation of the playing time.
To quantify users preference for the games, I used quantiles of users’ game playing time, which divided all playing time into five grades as a rating. Then I applied collaborative filtering to make recommendations for users using pandas and numpy. According to the rating matrix, the program can recommend the highest score game for users.