Webscraping for Grocery App

Published:

The goal was to create a grocery app that listed the products of all the major online grocery vendors, allowing users to choose groceries from each store individually.

For this, a set of AWS servers was configured to scrape data from each vendor, match products whose descriptions differed slightly between vendors, and build a unified database.
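
The matching step can be illustrated with a minimal sketch. The post does not say how products were actually matched, so everything here is an assumption: the function names, the 0.8 threshold, and the choice of Python's standard-library difflib for fuzzy string comparison are all hypothetical.

```python
import re
from difflib import SequenceMatcher


def normalize(description: str) -> str:
    """Lowercase, drop punctuation, and sort the words so word order does not matter."""
    words = re.findall(r"[a-z0-9]+", description.lower())
    return " ".join(sorted(words))


def similarity(a: str, b: str) -> float:
    """Return a score in [0, 1]; 1.0 means the normalized descriptions are identical."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def match_products(vendor_a, vendor_b, threshold=0.8):
    """Pair each description in vendor_a with its best match in vendor_b.

    A pair is kept only if its similarity clears the (assumed) threshold.
    """
    matches = {}
    for a in vendor_a:
        best = max(vendor_b, key=lambda b: similarity(a, b), default=None)
        if best is not None and similarity(a, best) >= threshold:
            matches[a] = best
    return matches
```

Calling match_products with one list of descriptions per vendor returns a dictionary from the first vendor's descriptions to their counterparts in the second. A real pipeline would likely also compare brand, size, and unit fields rather than rely on description text alone.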

The complication was that each vendor's product selection differed by zip code.

Several EC2 instances were created, each assigned a different zip code and set to scrape every vendor's data for that zip code.
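
A rough sketch of that assignment follows, assuming one scraping job per zip code that covers every vendor. The ScrapeJob type, the plan_jobs helper, and the example vendor names are hypothetical, and the step that actually launches an EC2 instance for each job is left out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ScrapeJob:
    zipcode: str    # the zip code this instance is responsible for
    vendors: tuple  # every vendor is scraped for that zip code


def plan_jobs(zipcodes, vendors):
    """Return one job per distinct zip code, in first-seen order."""
    unique = list(dict.fromkeys(zipcodes))  # de-duplicate while keeping order
    return [ScrapeJob(zipcode=z, vendors=tuple(vendors)) for z in unique]


if __name__ == "__main__":
    # Placeholder zip codes and vendor names, for illustration only.
    for job in plan_jobs(["10001", "94105"], ["vendor_a", "vendor_b"]):
        print(job.zipcode, job.vendors)
```

Each resulting job would then be handed to its own instance, for example through the instance's startup configuration.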

Whenever an instance's IP address got blocked, that instance would automatically terminate itself and a new one would be launched to continue scraping where it left off.
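
The replace-on-block behavior can be sketched as a simplified loop. This is not the original implementation: the Blocked exception, the fetch and replace_instance callables, and the max_replacements limit are all hypothetical, and a single loop stands in for the hand-off between the terminated server and its replacement.

```python
class Blocked(Exception):
    """Raised by a fetch function when the vendor refuses the request."""


def scrape_with_replacement(urls, fetch, replace_instance, max_replacements=3):
    """Fetch every URL, swapping in a fresh instance whenever a block is detected.

    fetch(url) returns page content or raises Blocked.
    replace_instance() is a hypothetical hook that would terminate the
    current server and launch a new one with a different IP address.
    """
    results = {}
    replacements = 0
    pending = list(urls)
    while pending:
        url = pending[0]
        try:
            results[url] = fetch(url)
            pending.pop(0)  # only drop the URL once it has been scraped
        except Blocked:
            if replacements >= max_replacements:
                raise  # give up rather than replace instances forever
            replace_instance()
            replacements += 1  # the blocked URL stays pending and is retried
    return results
```

Keeping the list of pending URLs separate from the instance is the design point: whichever server picks up the work only needs to know what has not been scraped yet.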