http://iwfchecker.lightning-bolt.net/ has a crowd-sourced list of sites that are being filtered by Cleanfeed. Some entries come from the IWF list and some from court orders (e.g. The Pirate Bay).
Cleanfeed has two filters. The first is IP/domain based: if the site is on the list, all traffic to it goes through a proxy or deep packet inspection, which checks each URL accessed against the blocklist. Detecting or sourcing what is on the second level is much harder.
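To make the two levels concrete, here is a rough sketch of the decision logic in Python. It is only a toy model of the idea described above, not how Cleanfeed is actually implemented; the host and URL entries and the function names (`first_stage`, `second_stage`, `classify`) are made up for illustration.

```python
from urllib.parse import urlsplit

# Hypothetical example data, not real blocklist entries.
SUSPECT_HOSTS = {"blocked.example"}                      # first level: coarse host list
BLOCKED_URLS = {"http://blocked.example/bad/page.html"}  # second level: exact URLs


def first_stage(url):
    """Coarse check: is the host on the list of sites whose traffic gets diverted?"""
    return urlsplit(url).hostname in SUSPECT_HOSTS


def second_stage(url):
    """Fine check, done only for diverted traffic: is this exact URL listed?"""
    return url in BLOCKED_URLS


def classify(url):
    """Return 'direct', 'proxied' or 'blocked' for a requested URL."""
    if not first_stage(url):
        return "direct"    # never touches the proxy
    if second_stage(url):
        return "blocked"   # diverted and matched against the URL blocklist
    return "proxied"       # diverted, but the URL itself is not listed


if __name__ == "__main__":
    for u in ("http://other.example/",
              "http://blocked.example/ok.html",
              "http://blocked.example/bad/page.html"):
        print(u, "->", classify(u))
```

In this toy model a "proxied" request still returns the page, just like a "direct" one, so only the exact listed URLs behave differently. That is one way to see why the second level is harder to map from outside.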