Kafka

New Capabilities for Apache Spark Users

  In September 2015, Cloudera launched the One Platform Initiative to make Apache Spark the default engine for Cloudera’s modern data platform. At the time, we had about 150 customers using Spark, many of them for simple ETL and data…
Read more

Cross-component Lineage for Apache Hadoop

Apache Hadoop® exists within a broader ecosystem of enterprise analytical packages. This includes ETL tools, ERP and CRM systems, enterprise data warehouses, data marts and others. Modern workloads flow from these various traditional analytical sources into Hadoop and then often back…
Read more

opening up a port on centos 7 firewall (using firewalld)

Blog post edited by Lester Martin There I was on an AWS hosted node trying to access port 2181 and 9092 on another AWS node where I just followed the instructions at http://kafka.apache.org/documentation/#quickstart to get a stand-alone instance of Kafka…
Read more

Cross-component Lineage for Apache Hadoop

Apache Hadoop® exists within a broader ecosystem of enterprise analytical packages. This includes ETL tools, ERP and CRM systems, enterprise data warehouses, data marts and others. Modern workloads flow from these various traditional analytical sources into Hadoop and then often back…
Read more