Python

Apache Spark – Welcome to the CDH family

The neatest part about being part of our market is the rapid rate of innovation we experience. Ideas from a variety of sources – industry, academia and sometimes industry spawned from academia (in the case of our partner Databricks) –…

Waves of Adoption – Evolution of Hadoop Users

Over the last eight years of Hadoop’s existence, we have seen what can be described as “waves of adoption” – there are distinguishable groups of users who adopted Hadoop at similar times and under similar circumstances. Each wave of users…

Thoughts on Joining Cloudera

Originally posted by Wes McKinney on October 6, 2014. After some unanticipated media leaks (here and here), I was very excited to finally share that my team and I are joining Cloudera. You can find out all the concrete details…

Apache Hadoop for Data Scientists

Data Science at Big Data scale is powerful but challenging to build. We at Cloudera are ever focused on bridging the gap between the tools on Hadoop and the tools on your laptop. Today, we announced a number of new…

Python and Apache Hadoop: A State of the Union

Over the last five years, the rapid growth of Python’s open source data tools has made it a tool of choice for a wide variety of data engineering and data science needs. Hugely successful projects that we now take for…