Understanding ETL
Data Pipelines for Modern Data Architectures


Book Details
Author | Matt Palmer |
Publisher | O'Reilly Media |
Published | 2024 |
Edition | 1st |
Paperback | 107 pages |
Language | English |
ISBN-13 | 9781098159238, 9781098159252 |
ISBN-10 | 1098159233, 109815925X |
License | Compliments of Databricks |
Book Description
Extract, transform, load (ETL) is at the center of every application of data, from business intelligence to AI. Constant shifts in the data landscape, including the adoption of lakehouse architectures and the growing importance of high-scale real-time data, mean that today's data practitioners must approach ETL a bit differently.
This updated technical guide offers data engineers, engineering managers, and architects an overview of the modern ETL process, along with the challenges you're likely to face and the strategic patterns that will help you overcome them. You'll come away equipped to make informed decisions when implementing ETL and confident about choosing the technology stack that will help you succeed.
- Discover what ETL looks like in the new world of data lakehouses
- Learn how to deal with real-time data
- Explore low-code ETL tools
- Understand how to best achieve scale, performance, and observability
This book is published open access, which means it is freely available to read, download, and share without restrictions.
If you enjoyed the book and would like to support the author, you can purchase a printed copy (hardcover or paperback) from official retailers.