Cat Sofa Customer Review Analysis
Customer Reviews and Ratings Analysis + Visualization
Link to project repository on GitHub

I am always looking for ways to spoil my cats. While browsing on Amazon I came across this super cool cat couch.
For this project I gathered the data on the customer reviews. Once I have cleaned the data I will create a simple dashboard to display the informtation.
- Web scraping (Python)
In python I used BeautifulSoup and Selenium to login to Amazon, parse through the review pages and collect the revivew information. I also added a sentiment column based on the review text. -
Data cleaning (Excel) (Link takes you to cleaned dataset)
I cleaned the dataset from Part 1 in Excel and it now has the following information:- Reviewer Name: The username of the person leaving the review
- Title: The title of the review
- Colour Name: The colour the cat couch was purchased in
- Rating: The rating given by the reviewer (out of 5 stars)
- Review Location: The country where the reviewer is located
- Review Date: Date the review was posted (yyyy-mm-dd)
- Review Text: The text description of the review
- Sentiment: The sentiment of the review text
-
Data Visualization (Tableau)
I have created a simple dashboard in Tableau.
-
Bonus Part: Simple EDA on Text Sentiment (R)
Simple EDA on text and sentiment. This is similar to my EDA from my UFO project.
Summary of Findings
Rating
- The average rating began to slowly decrease around August/September 2023, hitting its lowest point around November 2023. After that, it gradually increased, reaching its highest point after April 2024.
- Ratings predominantly cluster around 4.5 stars, but a significant number of ratings are also around 1 star.
- The green couch has the highest average rating at approximately 4.7 stars. The blue and yellow couches have the lowest average ratings at 3.6 stars.
- Mexico and Canada have high average ratings (5.0 stars and 4.1 stars, respectively), while the United States has a lower average rating (3.6 stars). However, it's important to note that Mexico only has one review, so this may not accurately reflect overall customer satisfaction.
Sentiment
- Most reviews are positive, followed by neutral, and then negative.
- The blue couch has the highest percentage of negative reviews compared to total reviews for that color.
- The yellow couch has the highest percentage of neutral reviews compared to total reviews for that color.
- The black couch technically has the highest percentage of positive reviews compared to total reviews for that color, but it also has the fewest reviews (only 4), which may skew the results. The green couch has the second-highest percentage of positive reviews.
- Positive reviews steadily increased from late August 2023 to January 2024, then slowly decreased.
- Neutral reviews peaked in December 2023, decreased, and then increased again in April 2024.
- Negative reviews remained relatively low but began to increase just before April 2024.
Number of Reviews
- Review counts remained low until November 2023 (when ratings hit their lowest point), then remained high until March 2024. After that, the count decreased until it started increasing again after April 2024.
- Canada has the most reviews, while Mexico has the fewest.
- The green couch has the most reviews, and the black couch has the fewest.
Overall
- The green couch is the most popular (most reviews) and has the highest customer satisfaction in terms of positive reviews and ratings.
- The blue couch has the lowest customer satisfaction in terms of negative reviews and ratings.
- The black couch is the least popular in terms of the number of reviews.
- Canada has the highest customer satisfaction in terms of ratings (76% positive reviews).
- The United States has the lowest customer satisfaction in terms of ratings (25% negative reviews).
Recommended Next Steps
- Compare sentiment trends over time to see if customer satisfaction shifts with changes in the product or its marketing.
- Investigate if there are any patterns in reviews based on the location, gender, or age of the reviewer.
- Conduct a time series analysis to identify trends and seasonal patterns in reviews.
- Predict future review trends based on historical data.
My Cats
For those who are interested, here are photos of my cats who inspired this project. Their names are Chevy and Bunny and they are sisters. They are very nosy, and they love belly rubs.