InGeek CulturebyFrank MunzPredicate Pushdown for Apache Spark with Google BigQueryDoes predicate pushdown for Databricks on Google Cloud with BQ work? It does! And here is how to verify it.Apr 14, 20212Apr 14, 20212
InGoogle Cloud - CommunitybyFrank MunzBI on the LakehouseHow to Connect Google Looker to Databricks Delta LakeApr 17, 2021Apr 17, 2021
InGeek CulturebyFrank MunzData and AI Summit 2021 — The 7 Biggest AnnouncementsHighlights from the Data and AI Summit (former Apache Spark Summit) for the Busy IT ProfessionalMay 27, 2021May 27, 2021
InGeek CulturebyFrank MunzUsing Delta Sharing with Google ColabDelta Sharing is the industry’s first open protocol for secure data sharing, making it simple to securely share massive amounts of data…Aug 12, 2021Aug 12, 2021
InGeek CulturebyFrank MunzWhy Three out of Four Data Sharing Technologies Don’t Cut It (anymore)An increasing number of digital-native startups champion data as a strategic asset and create financial value from sharing data. Use cases…Oct 12, 2021Oct 12, 2021
Frank MunzMy Lakehouse and Delta Lake Session at ODSCODSC session about sharing huge amounts of data from a Lakehouse with Delta Sharing.Nov 4, 2021Nov 4, 2021
InGoogle Cloud - CommunitybyFrank MunzFrom Zero to Hero with Databricks on Google CloudDatabricks turned into the favorite platform for many data engineers, data scientists, and ML experts. It combines data, analytics, and AI…Nov 22, 2021Nov 22, 2021
InGeek CulturebyFrank MunzOn Sharing Scientific Data with Open Source SoftwareWhy sharing data matters: An introduction to open source Delta Sharing and how the world changed in just five days.Jan 5, 2022Jan 5, 2022
InGeek CulturebyFrank MunzHow I passed the dbt Fundamentals certification with Databrickstl;dr dbt is an open source project for ELT. It enables analytics engineers to transform data by writing SQL in a re-usable way. This…Mar 21, 20222Mar 21, 20222
Frank MunzWhat is Data Governance?“Data governance is the oversight to ensure data brings value and supports the business strategy. Data governance is more than just a tool…Mar 22, 2022Mar 22, 2022
InGoogle Cloud - CommunitybyFrank MunzData Quality in the LakehouseData Engineering is all about Data Quality ManagementApr 29, 2022Apr 29, 2022
InGoogle Cloud - CommunitybyFrank MunzWorkflows for the Data lakehouseDatabricks Workflows on GCPMay 19, 2022May 19, 2022
Frank MunzThe Data Lakehouse BookHow can I get an overview of the Lakehouse platform, including Delta Lake, Governance, Data Science, Data Engineering, Streaming Data, and…Jul 20, 20221Jul 20, 20221
InGeek CulturebyFrank MunzData Engineering at Data and AI Summit 2022What role do data pipelines and workflows play in modern data engineering? What is the easiest way to deploy them? How about streaming data…Jul 23, 2022Jul 23, 2022
Frank MunzStreaming Data Pipelines with Apache Kafka and Delta Live TablesHow to extract features and prep your ML data with autoscaling, declarative, and low-latency data pipelinesAug 19, 2022Aug 19, 2022