bigdatatutorial
-
Updated
Sep 13, 2018 - Shell
bigdatatutorial
Bigdata Pipeline
Life-cycle: Internal working of HDFS, SQOOP, HIVE, SPARK, HBASE, KAFKA with code.
Project Fortis is a data ingestion, analysis and visualization pipeline.
Hands-on workshop with Apache Iceberg
Hands-on workshop with Iceberg, Redpanda, Debezium and Kafka-Connect
🚢 Docker image for Twitter Sentiment analysis with Spark MLlib
Developing and deploying Spark Streaming on Kubernetes!
Master's Final Degree Project on Artificial Intelligence and Big Data
local kubernetes-based ml setup
🏆 Spark4You Design patterns
A rudimentary command line utility for contrasting Apache Spark event logs.
DCL-700: Big Data Essentials
A bash script to install and configure Hadoop DFS, YARN, MapReduce, Apache Hive and Spark on CentOS.
Docker App for services including kafka, spark and cassandra
Creating gcloud dataproc cluster with this github action
Add a description, image, and links to the spark-streaming topic page so that developers can more easily learn about it.
To associate your repository with the spark-streaming topic, visit your repo's landing page and select "manage topics."