MLlib

Apache Hadoop for Data Scientists

Data Science at Big Data scale is powerful but challenging to build. We at Cloudera are ever focused on bridging the gap between the tools on Hadoop and the tools on your laptop. Today, we announced a number of new…
Read more

Spark is the New Workhorse of Data Processing on Hadoop

If you are a big data practitioner, let me confirm something you have strongly suspected: Apache Spark will replace MapReduce as the general purpose data processing engine for Apache Hadoop. Spark’s success is due to the combination of dramatic speed…
Read more