
Building Scalable Data Systems with Apache Spark 4.x

Architect, Optimize, and Operate Distributed Pipelines with SQL, PySpark, and Modern Lakehouse Technologies

Language: English
Format: Paperback
Author: Kevin R. Auguste
Libristo code: 52265738
Publisher: Independently published, May 2026
Condition: New
Price: 23.39 €
Availability: In stock at the supplier; ships in 9-15 days

Returns: up to 30 days to return goods

Are your data pipelines slowing down, breaking under scale, or becoming too complex to maintain?

Modern data systems demand more than scripts that "just work." They require reliability, performance, and the ability to evolve without constant rewrites. Yet many engineers and analysts struggle with inefficient Spark jobs, unpredictable execution, and rising infrastructure costs.

This book addresses that gap.

Building Scalable Data Systems with Apache Spark 4.x is a practical guide to designing, optimizing, and operating distributed data pipelines using Apache Spark, PySpark, SQL, and lakehouse technologies. It focuses on how Spark actually behaves at scale, so you can build systems that are not only functional but also fast, stable, and production-ready.

You won't just learn how to write Spark code; you'll learn how to think like a data systems engineer.

Inside, you will learn how to:

  • Design end-to-end pipelines from ingestion to output using PySpark and SQL
  • Understand execution internals like DAGs, jobs, stages, and Catalyst optimization
  • Optimize performance through partitioning, Adaptive Query Execution (AQE), and efficient joins
  • Build reliable streaming systems with Structured Streaming and exactly-once semantics
  • Work with modern storage systems like Delta Lake and Apache Iceberg
  • Deploy and operate Spark workloads using Kubernetes, monitoring, and resource tuning

Each chapter builds practical intuition, connecting code to execution so you can diagnose bottlenecks, reduce cost, and scale confidently.

If you work as a data engineer, data analyst, backend developer, or data scientist, this book equips you with the skills to move beyond trial-and-error and build systems that perform consistently in real-world environments.

Your data is growing. Your systems should keep up.

Get your copy today and start building data pipelines that scale, perform, and last.


Book information

Full title: Building Scalable Data Systems with Apache Spark 4.x
Language: English
Binding: Book - Paperback
Publication date: 2026
Number of pages: 242
EAN: 9798195327088
Libristo code: 52265738
Publisher: Independently published
Weight: 428
Dimensions: 178 x 254 x 13