Web Scraping on Common Crawl dataset using Mapreduce

Crawl
Play