Webscraping for Grocery App

Published:

Wanted to create a grocery app that contained all the groceries from all the major online grocery vendors. Allowing users to choose the groceries from each store individually.

For this I created a configuration of AWS servers that scraped data from each vendor, matched products with slightly different descriptions. And created a unified database.

The nuance was that products differed based on the zipcode.

So I created several EC2 instances each with a different zipcode and set them to scrape the data of each vendor from each zipcode.

Whenever an IP address got blocked, the server would automatically delete itself and a new one would be created to continue scraping data.