Podcast: DataNation – Podcast for Data Engineers, Analysts and Scientists
-
44 – Multi-Table Versioning and why Abstractions Matter
There is a reason the Git-for-Data paradigm of Nessie catalogs is so essential: not only the versioning features it provides, but also the level of abstraction at which it provides them. In this episode, I discuss this in more detail.
-
43 – Building a Data Lakehouse on your Laptop
In just a few commands, you can have everything you need to practice ingestion and querying with popular data software. Just install Docker and then run the commands in the image. You can also follow the directions in this blog: https://lnkd.in/eDiC8fc6 Also try out this video series: https://lnkd.in/gp843ErM
-
42 – Window Functions and Apache Iceberg Metadata Tables
Alex Merced describes what window functions are and how they can be applied to Apache Iceberg metadata tables.
-
41 – Databricks’ “Open” Problem and the Need for an Agnostic Intermediate Data Lakehouse Table Format
Alex Merced discusses some of the fallout from Databricks’ UNIFormat announcement and the innovation the industry needs to unlock the data lakehouse. Follow me on Twitter @amdatalakehouse
-
40 – Big Announcements for Apache Iceberg, Delta Lake and Apache Hudi from Snowflake and Databricks
Alex Merced discusses some of the big announcements from this week’s conferences. Make sure to check out Gnarly Data Waves on your favorite podcast app.
-
39 – What are Dremio’s Data Reflections and why are they so cool!
Alex Merced explains what Dremio reflections are and how they bring you speed and reduce storage costs while keeping things easy for your end users. Follow Alex on Twitter @amdatalakehouse
-
37 – Dremio, Data Lakehouses and Generative AI
Alex Merced discusses Dremio’s new generative AI features and the future of data lakehouses. Follow Alex on Twitter @amdatalakehouse
-
36 – ELT & ETL: The Good, The Bad and the Ugly
Alex Merced reflects on a recent article from Lauren Balik on the topic of ELT. Here is the article: https://medium.com/@laurengreerbalik/how-fivetran-dbt-actually-fail-3a20083b2506 Lauren’s Twitter: @laurenbalik My Twitter handle: @amdatalakehouse
-
35 – Data Lakehouse Statistics (Understanding Parquet and Iceberg)
Alex Merced explains how statistics are collected and used when working with Parquet files and Apache Iceberg tables. Follow Alex on Twitter @amdatalakehouse