A simple Python scraper/web crawler that walks through every entry on every Yelp results page for each NYC borough and counts the total number of boba spots in that borough.
The location part of a search returns results "near [borough]", which introduces two problems:
- Repeated values (all boroughs are near each other, so the same spot can show up in more than one search).
- Locations have to be verified (e.g. a Manhattan search will include Brooklyn results).
Respective solutions:
- OOP solution: restaurant entry objects with '==' implemented. Maintain a global list to keep track of entries we have already seen (and counted).
- Google Maps Geocoding API to verify an entry's borough before counting it.
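
A minimal sketch of how the two solutions could fit together, not the actual code in this repo. The names `RestaurantEntry`, `borough_from_geocode`, and `count_by_borough` are hypothetical, and so is the `geocode` callable, which stands in for whatever function sends an address to the geocoding API and returns one result as a dict. The sketch assumes the borough shows up as an address component typed `sublocality_level_1`, and it uses a set instead of a global list (same idea, faster lookups).

```python
from collections import Counter
from dataclasses import dataclass

BOROUGHS = {"Bronx", "Brooklyn", "Manhattan", "Queens", "Staten Island"}


@dataclass(frozen=True)
class RestaurantEntry:
    """One search result. The frozen dataclass generates __eq__ and
    __hash__, so two entries with the same name and address compare equal."""
    name: str
    address: str


def borough_from_geocode(result):
    """Return the borough named in one geocoding result dict, or None.

    Assumption: the borough appears as an address component whose
    "types" list contains "sublocality_level_1".
    """
    if not result:
        return None
    for component in result.get("address_components", []):
        if "sublocality_level_1" in component.get("types", []):
            name = component.get("long_name", "")
            # Defensive normalization in case the name comes back as "The Bronx".
            if name.startswith("The "):
                name = name[len("The "):]
            return name if name in BOROUGHS else None
    return None


def count_by_borough(entries, geocode):
    """Count unique entries per verified borough.

    `geocode` is a hypothetical callable: address string -> result dict.
    """
    seen = set()        # entries already counted (or rejected)
    counts = Counter()  # borough name -> number of unique spots
    for entry in entries:
        if entry in seen:
            continue  # duplicate from an overlapping "near [borough]" search
        seen.add(entry)
        borough = borough_from_geocode(geocode(entry.address))
        if borough is not None:
            counts[borough] += 1
    return counts
```

Because the count is keyed by the verified borough rather than by the search that produced the entry, a Brooklyn spot that turns up in the Manhattan search is still counted once, under Brooklyn.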
Here's the information I have gathered:
Bronx: 9
Brooklyn: 95
Manhattan: 163
Queens: 107
Staten Island: 16