Shatin’s best restaurant
In this assignment, according to my food preference, I chose to research the Chinese restaurants in Shatin.(LINK is here)
I firstly used Parsehub to scrap the information, including the URL, the number of good and bad review, the number of comments, costs, cuisine, and their signature dishes. I’ve got data on over 100 restaurants.
I then used openrefine to refine and refine the data. Furthermore, I removed the hotels that had moved and were being renovated and converted the price ranges into specific numbers. In this way, I have data for 104 restaurants.
In the case of null results, I decide to convert them all to zeros.
After that, I put it into DB browser and used python for data analysis. After measuring the number of reviews, positive reviews and so on, I decided to use the information I had to create an index to measure how good the restaurant was, giving each parameter a different weight. According to the results, Sha Tin 18 is the best Chinese restaurant in Sha Tin. In the end, because I love fish, I went on to choose some restaurants that specialize in fish.
In the process of using python, I tried various functions of panda and felt that it is a very powerful tool. However, I think I still have a lot to learn in data visualization and further processing of data.