Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.
…
continue reading
Join Kostas and Nitay as they speak with amazingly smart people who are building the next generation of technology, from hardware to cloud compute. Tech on the Rocks is for people who are curious about the foundations of the tech industry. Recorded primarily from our offices and homes, but one day we hope to record in a bar somewhere. Cheers!
…
continue reading
1
How Denormalized is Building ‘DuckDB for Streaming’ with Apache DataFusion
1:02:01
1:02:01
Αναπαραγωγή αργότερα
Αναπαραγωγή αργότερα
Λίστες
Like
Liked
1:02:01
Summary In this episode, Kostas and Nitay are joined by Amey Chaugule and Matt Green, co-founders of Denormalized. They delve into how Denormalized is building an embedded stream processing engine—think “DuckDB for streaming”—to simplify real-time data workloads. Drawing from their extensive backgrounds at companies like Uber, Lyft, Stripe, and Coi…
…
continue reading
1
206: Reviving Old-School Customer Experiences Through Modern Data Strategies, Featuring Edward Chenard, Seasoned Data Leader and Analytics Officer
48:26
Highlights from this week’s conversation include: Edward's Background and Journey in Data (0:44) P&L Ownership Discussion (1:15) Challenges in Profit Ownership (3:38) Data Team Dynamics (5:52) Role Clarity Between CFO and CDO (7:31) Nuances of Data Leadership (11:24) Focus on Relevance in Data Work (14:05) Best Buy's Personalization Project (18:39)…
…
continue reading
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps …
…
continue reading
Highlights from this week’s conversation include: Nicolay’s Background and Journey in AI (0:39) Milestones in LLMs (4:30) Barriers to Effective Use of LLMs (6:39) Data-Centric AI Approach (10:17) Importance of Data Over Model Tuning (12:20) Capabilities of LLMs (15:08) Challenges in Structuring Data (18:28) JSON Generation Techniques (20:28) Utiliz…
…
continue reading
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps …
…
continue reading
1
Unifying structured and unstructured data for AI: Rethinking ML infrastructure with Nikhil Simha and Varant Zanoyan
1:01:45
1:01:45
Αναπαραγωγή αργότερα
Αναπαραγωγή αργότερα
Λίστες
Like
Liked
1:01:45
Summary In this episode, we dive deep into the future of data infrastructure for AI and ML with Nikhil Simha and Varant Zanoyan, two seasoned engineers from Airbnb and Facebook. Nikhil and Varant share their journey from building real-time data systems and ML infrastructure at tech giants to launching their own venture. The conversation explores th…
…
continue reading
1
204: Will a Duck DB-Like Excel Emerge by 2075? And Is Data Every Company’s Most Valuable Asset? Featuring Benn Stancil of Mode
53:36
Highlights from this week’s conversation include: Benn's Background and Journey in Data (0:48) Reflection on Strategy and Vision (2:10) The Importance of Doing It Your Way (4:10) Early Experiences and Blogging (6:27) Self-Imposed Pressure in Startups (8:24) The Challenge of Decision-Making (12:11) Key Decisions in a Startup's Trajectory (15:48) Und…
…
continue reading
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps …
…
continue reading
Overview In this episode, we chat with Chris Riccomini about the evolution of stream processing and the challenges in building applications on streaming systems. We also chat about leaky abstractions, good and bad API designs, what Chris loves and hates about Rust and finally about his exciting new project that involves object storage and LSMs. Con…
…
continue reading
Highlights from this week’s conversation include: Spencer's Background at Braze (1:54) The Early Days of Braze (2:41) Finding Product-Market Fit (4:44) First Major Customer (6:33) Unique Aspects of Braze's Growth Team (8:07) Startup Culture Experience (10:40) Data and Marketing Perspectives (12:50) Common Marketing Data Challenges (15:50) Changing …
…
continue reading
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps …
…
continue reading
1
202: Predicting the Impact of Competitive Entrants With Synthetic Controls with Evan Wimpey of Elder Research
50:01
Highlights from this week’s conversation include: Evan's Background and Journey in Data (0:40) Discussion on Synthetic Controls (1:04) Evan's Educational Journey and Marine Corps Experience (2:54) Joining Elder Research (4:38) Synthetic Controls Explained (6:54) Measuring Impact with Synthetic Controls (9:05) Building the Control Group (12:54) Qual…
…
continue reading
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps …
…
continue reading
1
201: AI Real-Talk: Uncovering the Good, Bad and Ugly Through Prototyping with Eric, John, and Matt
1:03:19
1:03:19
Αναπαραγωγή αργότερα
Αναπαραγωγή αργότερα
Λίστες
Like
Liked
1:03:19
Highlights from this week’s conversation include: Current State of LLMs (1:12) Historical Analogy to the iPhone (3:32) Limitations of Early iPhones (5:02) Comparing LLMs to Historical Technologies (6:08) Skepticism About LLM Capabilities (9:11) Broad Nature of AI Innovations (10:12) User Input Challenges (14:32) Transcription and Unstructured Data …
…
continue reading
1
The PRQL: AI Roundtable: Putting AI in Historical Context and Real-Life Learnings Through Prototyping, with Eric, John, and Matt
2:00
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps …
…
continue reading
1
200: Data Team Struggles: Telling Stakeholders the Truth vs. What They Want to Hear (How to Tell The Truth, Tactfully)
29:01
Highlights from this week’s conversation include: Lightning Round Discussion (1:21) Data Team's Truthfulness (2:21) Culture as a Blocker (9:10) Misconceptions about Data Jobs (10:32) Cultural and Technological Influences (11:51) Challenges in Data Science Projects (15:19) Embracing the Process (17:23) Barriers to Entry (19:36) Hiring Data Leaders (…
…
continue reading
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps …
…
continue reading
1
199: How To Use Data Analytics and AI To Increase Profitability With Smarter Procurement, Featuring Cameron Jagoe of ProcureVue
49:29
Highlights from this week’s conversation include: Cameron's Background and Journey in Data (1:49) Running a Bakery (3:03) Applying Analytics to Bakery Operations (7:07) Reevaluating Business Operations (18:08) Optimizing for Profitability (19:09) Working at Newell Rubbermaid (20:11) Value Engineering Projects (22:11) Starting a Center of Excellence…
…
continue reading
1
The PRQL: Better Analytics, Smarter Purchasing, and Improved Profitability with Cameron Jagoe of ProcureVue
2:39
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps …
…
continue reading
Highlights from this week’s conversation include: Jesse’s background and work in data (0:35) E-commerce Application for Search (1:23) Ph.D. in Physics Experience Then Working in Data (2:27) Early Machine Learning Journey (4:35) Machine Learning at Stitch Fix (7:28) Machine Learning at Amazon (10:39) Myths and Realities of AI (13:49) Bolt-On AI vs. …
…
continue reading
1
The PRQL: Exploring the Evolution of AI and ML in E-commerce Search Optimization with Jesse Clark of Marqo.ai
1:50
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps …
…
continue reading
1
197: Deep Dive: How to Build AI Features and Why it is So Dang Hard with Barry McCardle of Hex
1:03:35
1:03:35
Αναπαραγωγή αργότερα
Αναπαραγωγή αργότερα
Λίστες
Like
Liked
1:03:35
Highlights from this week’s conversation include: Overview of Hex and its Purpose (0:51) Discussion on AI and Data Collaboration (1:42) Product Updates in Hex (2:14) Challenges of Building AI Features (13:29) Magic Features and AI Context (15:22) Chatbots and UI (17:31) Benchmarking AI Models (19:06) AI as a Judge Pattern (23:32) Challenges in AI D…
…
continue reading
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps …
…
continue reading
1
196: Why Big Query Was a Big Deal, Observability AI, and How AI is Like a Guy at the Bar, Featuring David Wynn of Edge Delta
49:21
Highlights from this week’s conversation include: David’s Background and Career (0:49) Econometrics Work at UPS (3:14) Challenges with Time Series Data and Tools (7:15) Working at Google Cloud (11:28) BigQuery's Significance (13:51) Comparison of Data Warehouse Products (17:23) Learning different cloud platforms (20:17) Coherence in GCP (23:04) Obs…
…
continue reading
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps …
…
continue reading
1
195: Supply Chain Data Stacks and Snowflake Optimization Pro Tips with Jeff Skoldberg of Green Mountain Data Solutions
48:51
Highlights from this week’s conversation include: Jeff's Background and Transition to Independent Consulting (0:03) Working at Keurig and Business Model Changes (2:16) Tech Stack Evolution and SAP HANA Implementation (7:33) Adoption of Tableau and Data Pipelines (11:21) Supply Chain Analytics and Timeless Data Modeling (15:49) Impact of Cloud Compu…
…
continue reading
1
The PRQL: Breaking down Keurig’s Supply Chain Data Stack with Jeff Skoldberg of Green Mountain Data Solutions
2:21
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps …
…
continue reading
Highlights from this week’s conversation include: Clint’s Background and Journey in Data (0:51) Starting a Data Career (2:01) Transition to Startup SaaS World (4:27) Clint’s Connection to a Federal Reserve Database (5:31) Challenges in Predictive Modeling (10:27) Data Input Challenges (15:50) Marketers' Workflow and Data Integration (18:29) Soft RO…
…
continue reading
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps …
…
continue reading
Highlights from this week’s conversation include: Introducing a special edition of the show with the cynical data guy (0:19) Metadata and LLMs (2:32) Data-driven culture (8:44) No-code orchestration tools (17:09) No Code vs. Low Code (19:58) Enterprise Challenges with No Code Solutions (20:08) No Code Tools for Small Companies (21:40) Inappropriate…
…
continue reading
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps …
…
continue reading
1
192: Business Logic As Code: A New LLM-Powered Operating System for Business Automation with Binny Gill of Kognitos
47:57
Highlights from this week’s conversation include: The history of computer science and AI inflection point (1:23) Binny's early programming experiences and the constraints of technology (2:14) Getting interested in computer programming (5:02) The experiment that impacted the starting of Kognitos (8:23) Challenges in traditional computer science (16:…
…
continue reading
1
The PRQL: From Programming Tic Tac Toe to Building an Operating System for Natural Language Programs With Binny Gill of Kognitos
2:59
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps …
…
continue reading
1
191: From Amazon to Consulting: Time Series Forecasting and How to Communicate Data Analytics Insights with David McCandless of McCandless Consulting
49:26
Highlights from this week’s conversation include: David's Background and Journey in Data (0:30) Transition to Time Series Forecasting (2:03) Working on Time Series Forecasting at Amazon (2:55) Challenges and Experience in Time Series Forecasting (4:32) Transitioning to a New Role at Amazon (5:52) Tools and Methods for Time Series Forecasting (8:17)…
…
continue reading
1
The PRQL: Practical Applications for Time Series Forecasting with David McCandless of McCandless Consulting
2:45
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps …
…
continue reading
1
190: Aligning Data Teams and Data Tools With Business Needs Featuring Ben Rogojan, the Seattle Data Guy
52:19
Highlights from this week’s conversation include: Ben’s background and journey in data (0:18) Relating data to business outcomes (2:33) Facebook's approach to data-driven business outcomes (4:43) Subjectivity and data-driven business outcomes (8:43) Infrastructure and data collection at Facebook (12:04) The importance of first-party data and the de…
…
continue reading
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps …
…
continue reading
1
189: Customer Data Modeling, The Data Warehouse, Reverse ETL, and Data Activation with Ryan McCrary of RudderStack
1:03:52
1:03:52
Αναπαραγωγή αργότερα
Αναπαραγωγή αργότερα
Λίστες
Like
Liked
1:03:52
Highlights from this week’s conversation include: Ryan's Background and Roles in Data (0:05) Data Activation and Dashboard Staleness (1:27) Profiles and Data Activation (2:54) Customer-Facing Experience and Product Management (3:40) Profiles Product Overview (5:10) Use Cases for Profiles (6:44) Challenges with Data Projects (9:19) Entity Management…
…
continue reading
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps …
…
continue reading
1
188: How To Invest in Data Infrastructure and Data Projects That Create Business Value with Matthew Kelliher-Gibson of Rudderstack
56:10
Highlights from this week’s conversation include: Matt KG’s Background in Data (0:35) Challenges in purchasing data tools (1:28) Early experiences in data analysis (9:51) Matt’s Transition to a subprime auto loan company (13:19 Transition to RudderStack and software purchase decisions (17:36) Tech Problems: People and Process (22:02) Challenges in …
…
continue reading
1
The PRQL: Navigating the Procurement Process for Data Infrastructure Tooling With Matthew Kelliher-Gibson of Rudderstack
2:43
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps …
…
continue reading
Highlights from this week’s conversation include: Kostas Passes the Baton as Co-Host of the Podcast (0:24) Reflecting on the Podcast (2:56) New Co-Host John Wessel and His Background in Data (4:34) Kostas Journey in Data (10:55) Rudderstack's Explosive Growth (21:28) The Podcast's Inception and Marketing Activities (24:19) Evolution of the podcast …
…
continue reading
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps …
…
continue reading
Highlights from this week’s conversation include: The Evolution of Data Systems (0:47) The Role of Open Source Software (2:39) Challenges of Time Series Data (6:38) Architecting InfluxDB (9:34) High Cardinality Concepts (11:36) Trade-Offs in Time Series Databases (15:35) High Cardinality Data (18:24) Evolution to InfluxDB 3.0 (21:06) Modern Data St…
…
continue reading
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps …
…
continue reading
Highlights from this week’s conversation include: Pete’s background and the origin story of Data Council (1:04) Reflecting on 10 years of Data Council (2:07) Impact of the pandemic on conferences (5:25) Rebuilding after the pandemic (7:42) Evolution of Data Council (10:33) Balancing content and sponsorship (16:17) Selecting speakers and content at …
…
continue reading
1
Data Council Week: AI Isn’t Just Hype - How To Successfully Apply LLMs Today with Tristan Zajonc of Continual
35:33
Highlights from this week’s conversation include: Tristan's Background and Journey into Data (1:14) Evolution of Machine Learning and AI (3:13) Impact of Generative AI (6:33) MLOps and Challenges in Early Data Science (8:48) Success and Applications of AI Today (11:34) Continual AI Copilot Platform (18:04) Challenges in building remarkable AI assis…
…
continue reading
1
Data Council Week: How To Do Self-Service Data Analytics and Business Intelligence Right with Ryan Dolley of GoodData
42:08
Highlights from this week’s conversation include: Ryan’s background in data (0:58) Transition from Performing Arts to Data (2:23) Understanding End Users in Data Projects (6:08) Learning from Failures in Data Projects (8:07) The self-service era (19:50) Struggles of self-service (21:23) The disillusion with dashboards (26:23) GoodData's approach (3…
…
continue reading
1
185: The Evolution of Data Processing, Data Formats, and Data Sharing with Ryan Blue of Tabular
1:29:43
1:29:43
Αναπαραγωγή αργότερα
Αναπαραγωγή αργότερα
Λίστες
Like
Liked
1:29:43
Highlights from this week’s conversation include: The Evolution of Data Processing (2:36) Ryan’s Background and Journey in Data (4:52) Challenges in Transitioning to S3 (8:47) Impact of Latency on Query Performance (11:43) Challenges with Table Representation (15:26) Designing a New Metadata Format (21:36) Integration with Existing Tools and Open S…
…
continue reading
1
The PRQL: The Two Parallel Tracks of Development In Data Processing with Ryan Blue of Tabular
4:48
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps …
…
continue reading