I scraped business listings across the U.S., analyzed their reviews to estimate customer volume and revenue, and identified which booking software each business uses by checking subdomains and site elements. After linking each business to its software provider, I estimated MRR, ARR, and potential exit multiples.
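To make the subdomain and site-element check concrete, here is a minimal sketch of how that kind of detection could work. It is not the code from the repo: the provider names, domains, and HTML markers are hypothetical placeholders, and `detect_provider` is a name I made up for illustration.

```python
from urllib.parse import urlparse

# Hypothetical signatures: provider name -> (booking domain, HTML marker).
# These are placeholders, not real vendor fingerprints.
SIGNATURES = {
    "ProviderA": ("provider-a.example", "provider-a-widget"),
    "ProviderB": ("provider-b.example", "data-provider-b"),
}


def detect_provider(booking_url, html):
    """Return the first provider whose domain or HTML marker matches, else None."""
    host = (urlparse(booking_url).hostname or "").lower()
    page = html.lower()
    for name, (domain, marker) in SIGNATURES.items():
        # Subdomain check: e.g. joes-barbers.provider-a.example
        if host == domain or host.endswith("." + domain):
            return name
        # Site-element check: an embedded widget class or attribute
        if marker in page:
            return name
    return None
```

Calling `detect_provider("https://joes-barbers.provider-a.example/book", "")` would return `"ProviderA"` under these placeholder signatures. A real version would need one verified signature per provider and should expect false positives from generic markers.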
This gave a solid market overview, showing which booking software is most common, how tech-savvy businesses are, and rough revenue estimates based on customer activity.
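For anyone curious about the arithmetic behind those estimates, here is a rough sketch of one way to do it. Everything in it is an assumption for illustration rather than the repo's actual method: the review rate (share of customers who leave a review), the average ticket, the per-business subscription prices, and the function names are all hypothetical.

```python
from collections import Counter


def estimate_monthly_revenue(reviews_per_month, review_rate, avg_ticket):
    """Business revenue: scale reviews up to customers, then multiply by ticket size."""
    customers = reviews_per_month / review_rate  # assumed share of customers who review
    return customers * avg_ticket


def provider_mrr(provider_by_business, monthly_price):
    """Provider MRR: number of detected businesses times an assumed monthly price."""
    counts = Counter(p for p in provider_by_business.values() if p)
    return {p: n * monthly_price.get(p, 0.0) for p, n in counts.items()}


def arr_from_mrr(mrr):
    """ARR is simply twelve months of MRR."""
    return mrr * 12


def exit_value(arr, multiple):
    """Rough exit estimate: ARR times an assumed revenue multiple."""
    return arr * multiple
```

For example, `provider_mrr({"shop1": "ProviderA", "shop2": "ProviderA", "shop3": None}, {"ProviderA": 50.0})` counts two ProviderA businesses at an assumed $50 per month each. The outputs are only as good as the assumed rates and prices, so treat them as order-of-magnitude figures.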
I put everything (scraper code, SQL tables, and sample data) on GitHub if you want to check it out or improve it.
GitHub Repo: github.com/alexdelbosque/data-crawler-smbs
Has anyone else done something like this? How would you make it better? Let’s swap ideas!