Streaming Data Pipelines with Apache Kafka and Delta Live TablesHow to extract features and prep your ML data with autoscaling, declarative, and low-latency data pipelinesAug 19, 2022Aug 19, 2022
Published inGeek CultureData Engineering at Data and AI Summit 2022What role do data pipelines and workflows play in modern data engineering? What is the easiest way to deploy them? How about streaming data…Jul 23, 2022Jul 23, 2022
The Data Lakehouse BookHow can I get an overview of the Lakehouse platform, including Delta Lake, Governance, Data Science, Data Engineering, Streaming Data, and…Jul 20, 20221Jul 20, 20221
Published inGoogle Cloud - CommunityWorkflows for the Data lakehouseDatabricks Workflows on GCPMay 19, 2022May 19, 2022
Published inGoogle Cloud - CommunityData Quality in the LakehouseData Engineering is all about Data Quality ManagementApr 29, 2022Apr 29, 2022
What is Data Governance?“Data governance is the oversight to ensure data brings value and supports the business strategy. Data governance is more than just a tool…Mar 22, 2022Mar 22, 2022
Published inGeek CultureHow I passed the dbt Fundamentals certification with Databrickstl;dr dbt is an open source project for ELT. It enables analytics engineers to transform data by writing SQL in a re-usable way. This…Mar 21, 20222Mar 21, 20222
Published inGeek CultureOn Sharing Scientific Data with Open Source SoftwareWhy sharing data matters: An introduction to open source Delta Sharing and how the world changed in just five days.Jan 5, 2022Jan 5, 2022
Published inGoogle Cloud - CommunityFrom Zero to Hero with Databricks on Google CloudDatabricks turned into the favorite platform for many data engineers, data scientists, and ML experts. It combines data, analytics, and AI…Nov 22, 2021Nov 22, 2021