A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀
-
Updated
Apr 20, 2022
A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀
🦀 A smart, persistent key-value store in Rust for managing data with a lifecycle. Features atomic TTLs, frequency counting, and is built on sled for performance and durability.
A coursera course on Machine Learning in Production
Alchemist is an intelligent data transformation engine that turns raw, messy data into clean, curated, and governed datasets. It automates and simplifies the data engineering lifecycle with a focus on quality, lineage, and operational excellence across the modern data ecosystem.
The objective of this project was to develop a revenue forecasting solution for a digital wallet company. The primary goal was to analyze historical transaction data, create a predictive model to forecast future revenue, and present these insights through an interactive dashboard.
Práctica 1. Tipología y Ciclo de Vida de los Datos. Caso práctico de Web Scraping orientado a aprender a identificar los datos relevantes por un proyecto analítico y usar las herramientas de extracción de datos.
“A full-stack data lifecycle project for stock market data using Python, MySQL, Feature Engineering, and EDA, focused on FAANG companies.”
RDM course with tactical solutions for researchers and data professionals alike
Add a description, image, and links to the data-lifecycle topic page so that developers can more easily learn about it.
To associate your repository with the data-lifecycle topic, visit your repo's landing page and select "manage topics."