I live in Pakistan and my government censors websites that contain "immoral" content. Sometimes random websites get caught in the censors, e.g pastebin.com
I wanted to build a portal that kept check of which websites are blocked, but I'm unsure about how to go about this:
1. I could download the zonefiles for all the popular tlds and test them, but this will exclude unpopular tlds
2. I could crawl the web but this will turn into a more expensive side project than I have the funds for.
3... any suggestions?