ETL
Projects with this topic
-
Configuration and data workflows for an instance of Apache Airflow for the DDR ecosystem
Updated -
Simulation of a real hospital scenario with a ML model in production
Updated -
Crawl and extract Home Depot's schema.org/Products.
Updated -
Official weather data ETL for wind energy project evaluation in El Calafate, Argentina.
Updated -
Live data source that can be used for data engineering, data warehouse and etl development.
UpdatedUpdated -
Live data source that can be used for data engineering, data warehouse and etl development.
UpdatedUpdated -
Live data source that can be used for data engineering, data warehouse and etl development.
UpdatedUpdated -
Moved to: https://github.com/atviriduomenys/spinta
Archived 3Updated -
A Python extract, transform, load (ETL) pipeline for the CityPulse Smart City dataset.
Updated -
Amaxa is a new data loader and ETL (extract-transform-load) tool for Salesforce, designed to support the extraction and loading of complex networks of records in a single operation.
Primary repo now on GitHub: https://github.com/davidmreed/amaxa
Updated -
Stack Exchange releases "data dumps" of all its publicly available content roughly every three months via archive.org.
This project is an example and a framework for building ETL for this data with Apache Spark and Java.
Updated -