-
Updated
Jul 24, 2019 - Python
data-lake
Here are 238 public repositories matching this topic...
-
Updated
May 2, 2020 - Jupyter Notebook
Python package for data warehouse, data lake and storage related tasks
-
Updated
Dec 22, 2021 - Python
Board Game Dataset Model is aggregated from two major board game data sources including BoardGameGeek and Board Game Atlas
-
Updated
Dec 26, 2022 - Jupyter Notebook
Streamdata.io Stack Exchange Questions Streaming to Amazon S3 Data Lake Using Lambda
-
Updated
Jul 14, 2018 - JavaScript
User activity insights | PySpark, AWS S3 & EMR
-
Updated
Jan 6, 2020 - Jupyter Notebook
Capstone project using US I94 Immigrations dataset for Udacity Data Engineering Nanodegree.
-
Updated
Nov 11, 2022 - Jupyter Notebook
Data Lake with Apache Spark
-
Updated
May 16, 2020 - Python
Cloud Data Engineering Technologies FER labs
-
Updated
Dec 18, 2023 - Go
ETL process applied on covid-19 dataset of European countries using Azure services such as databricks, keyvault, sql database, data factory etc. Finally power bi dashbaord was also made.
-
Updated
Sep 13, 2023 - Python
Data Engineer (Udacity): Project 4 Data Lakes with Spark on Amazon Web Service (AWS)
-
Updated
Oct 19, 2020 - Python
Apache Hive and Apache Druid performance testing for MIND Foods HUB Data Lake
-
Updated
Apr 8, 2022
A project using AWS to combine and transform streaming data for analytics.
-
Updated
Aug 23, 2023 - Python
Improve this page
Add a description, image, and links to the data-lake topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the data-lake topic, visit your repo's landing page and select "manage topics."