Analysis of Korean food restaurants in NT

As a student living in the New Territories who is fond of Korean food, I chose the Korean restaurants in this area as my study subjects.

1. Data scraping
After filtering the restaurants by “Korean” and “New Territories”, I obtained 162 search results. I scraped all 162 records, including the name, price, types of dishes, number of collects, and the numbers of good and bad comments. Afterwards, 85 records remained.

2. Data cleaning
On OpenRice, every collects number over 1000 is displayed in the form XX K. To make ranking easier in the later analysis, I removed the K from these values and multiplied them by 1000 so that they appear as plain numbers.
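A minimal sketch of this conversion in plain Python. The function name `parse_collects` is hypothetical, and the sketch assumes the scraped values are strings such as "856" or "1.2K"; the exact text shown on OpenRice may differ.

```python
def parse_collects(text):
    """Convert a scraped collects string to an integer.

    Assumed input formats (not confirmed by the site): "856", "1.2K", "3 K".
    """
    cleaned = text.strip().upper().replace(",", "")
    if cleaned.endswith("K"):
        # Drop the trailing K and scale by 1000; round() guards against
        # floating-point noise in the multiplication.
        return round(float(cleaned[:-1]) * 1000)
    return int(cleaned)


raw_values = ["856", "1.2K", "3 K"]
numbers = [parse_collects(value) for value in raw_values]
print(numbers)
```

Once the values are integers, they can be sorted and compared directly in the ranking steps.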
Some restaurants also have many branches. I merged their names and deleted the duplicates, keeping the one with the highest collects number.
Finally, I deleted the restaurants whose number of good comments is null.
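The two steps above could be sketched as follows. This is only an illustration: the field names `name`, `collects`, and `good` are hypothetical, the sample rows are made up, and the sketch assumes that branch names have already been merged into one common name.

```python
def dedupe_and_drop_null(records):
    """Keep one row per restaurant name, then drop rows without good comments."""
    best = {}
    for row in records:
        name = row["name"]
        # Keep the branch with the highest collects number for each name.
        if name not in best or row["collects"] > best[name]["collects"]:
            best[name] = row
    # Remove restaurants whose number of good comments is missing.
    return [row for row in best.values() if row["good"] is not None]


# Made-up sample rows, for illustration only.
sample = [
    {"name": "A", "collects": 1200, "good": 150},
    {"name": "A", "collects": 300, "good": 40},
    {"name": "B", "collects": 500, "good": None},
    {"name": "C", "collects": 90, "good": 12},
]
cleaned = dedupe_and_drop_null(sample)
print([row["name"] for row in cleaned])
```

The order follows the text: duplicates are removed first, and null rows are dropped afterwards.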

3. Data analysis
I analyzed three aspects:
1. Top 10 restaurants that users want to pay a visit to
2. Top 10 Korean fried chicken restaurants
3. Users’ attitude towards the overall performance of Korean restaurants in NT
All three questions relate to users’ attitudes and willingness, so I picked the numbers of collects and good comments as the main measurements.
In addition, some restaurants have only a few comments, or even a single good comment in total, and so reach a 100% satisfaction rate, which is not persuasive. I therefore excluded all restaurants whose total number of comments is below 100.
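A rough sketch of the filter and the ranking, under some assumptions: the field names are hypothetical, the sample rows are made up, the total number of comments is taken as good plus bad, and the satisfaction rate is taken as good divided by that total.

```python
def satisfaction_rate(row):
    """Share of good comments among all comments (assumed: good + bad)."""
    return row["good"] / (row["good"] + row["bad"])


def top_n(records, key, n=10, min_comments=100):
    """Drop restaurants with too few comments, then rank by the given key."""
    eligible = [r for r in records if r["good"] + r["bad"] >= min_comments]
    return sorted(eligible, key=key, reverse=True)[:n]


# Made-up sample rows, for illustration only.
sample = [
    {"name": "A", "collects": 1200, "good": 150, "bad": 10},
    {"name": "B", "collects": 5000, "good": 1, "bad": 0},
    {"name": "C", "collects": 800, "good": 90, "bad": 30},
]
by_collects = top_n(sample, key=lambda r: r["collects"])
by_rate = top_n(sample, key=satisfaction_rate)
print([r["name"] for r in by_collects])
print([r["name"] for r in by_rate])
```

Restaurant B has a 100% satisfaction rate from a single comment, so the threshold removes it from both rankings.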
The measures themselves may not be hard to come up with, but completing the calculation in Python was a struggle because, to be honest, I have not mastered it well. Still, after finishing this project and finally getting the code to work, I did improve.
You can view the website here.
