Skip to content
#

data-orchestration

Here are 27 public repositories matching this topic...

Skytrax-Data-Warehouse

A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for cloud data warehouse and Metabase to serve the needs of data visualizations such as analytical dashboards.

  • Updated Apr 18, 2020
  • Python

This repo contains a dataset, exercises, and sample code for an end-to-end SAP BTP data-to-value bootcamp covering SAP HANA Cloud, SAP Data Warehouse Cloud, SAP Data Intelligence Cloud, and SAP Analytics Cloud.

  • Updated Mar 14, 2023
  • Jupyter Notebook

Develop a real-time data ingestion pipeline using Kafka and Spark. Collect minute-level stock data from Yahoo Finance, ingest it into Kafka, and process it with Spark Streaming, storing the results in Cassandra. Orchestrated the workflow using Airflow deployed on Docker.

  • Updated Nov 29, 2024
  • Python

Improve this page

Add a description, image, and links to the data-orchestration topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-orchestration topic, visit your repo's landing page and select "manage topics."

Learn more