Apache Hadoop Ecosystem

Hadoop EcoSystem 1. Large data on the web. 2. Nutch built to crawl this web data. 3. Large volume of data had to saved – HDFS introduced. 4. How to use this data? Report. 5. MapReduce Framework built for coding & running analytics. 6. Unstructured data – Weblogs, click streams, Apache logs. Server l