LIBRISTO
LIBROAMANTO
povinné
Staňte sa súčasťou komunity milovníkov kníh z celého sveta a získajte hromadu výhod. Založiť účet zdarma
0
Doprava zadarmo s Packetou nad 59.99 €
Kuriér DPD 2.99 Zberné miesto GLS 2.49 SPS 3.99 SPS Parcel Shop 2.99 Packeta kurýr 3.99 Pošta 3.99 Zberné miesto DPD 2.99 Kuriér GLS 3.99 Packeta 2.99

Doprava zdarma pre objednávky nad 59,99 € s Packetou a SPS Boxmi.

THE LLM ENGINEER

Building Transformer Language Models with Python and PyTorch

Jazyk AngličtinaAngličtina
Kniha Brožovaná
Kniha THE LLM ENGINEER Lachlan James
Libristo kód: 51946507
Nakladateľstvo Independently published, apríl 2026
Most developers working with large language models today are flying blind. They understand the inter... Celý popis
? points 106 b Nové Nové
43.77
Skladom u dodávateľa Odosielame za 9-15 dní

30 dní na vrátenie tovaru

Most developers working with large language models today are flying blind. They understand the interface but they don't really understand the machine. And there lies the gap - between knowing how to use a model and knowing how to build, debug, and adapt one - is where real engineering capability lives.

The LLM Engineer is a hands-on implementation guide that closes that gap completely. Starting from a single sentence - predict the next token - and ending with a quantized model serving requests through an OpenAI-compatible API, this book walks you through every layer of the modern LLM stack. No black boxes. No magic. Every concept is introduced once, explained precisely, and immediately followed by complete, runnable code.

The architecture you build matches the design of Llama 3, Mistral, and Gemma at the blueprint level - rotary position embeddings, grouped-query attention, SwiGLU activations, RMSNorm - not a toy approximation, but the real thing at teachable scale.

Inside this book, you'll learn how to:

  • Implement byte-pair encoding from scratch and understand the tokenizer quirks that cause real production bugs
  • Build scaled dot-product attention, multi-head attention, and grouped-query attention (GQA) - the memory-efficient variant used by every major open-weight model since 2022
  • Construct a complete transformer block using pre-norm RMSNorm, SwiGLU feed-forward layers, RoPE positional encodings, and residual connections
  • Design and run a full training pipeline: packed sequences, AdamW with parameter-group weight decay, cosine warmup scheduling, gradient clipping, mixed-precision bfloat16, and distributed data parallelism
  • Fine-tune models efficiently using LoRA and QLoRA - implemented entirely from scratch, not just called from a library
  • Train for human preference alignment using Direct Preference Optimization (DPO), the technique that replaced PPO-based RLHF in most production pipelines
  • Quantize models to INT8 and 4-bit precision using GPTQ, AWQ, and GGUF for CPU deployment with llama.cpp
  • Serve models at scale using vLLM with PagedAttention and continuous batching, and expose an OpenAI-compatible API

Along the way, you'll build:

  • A byte-pair encoding tokenizer that handles Unicode, byte-level encoding, and the edge cases that break naive implementations
  • A complete GPT-style transformer language model - architecturally identical to Llama 3 - trained from scratch on real text data
  • A full training loop with Weights & Biases experiment tracking, checkpointing, and distributed GPU support via PyTorch DDP
  • An inference engine with greedy decoding, temperature sampling, top-k, top-p, repetition penalties, speculative decoding, and structured output generation
  • LoRA and QLoRA adapters injected and merged into a pre-trained model, reducing trainable parameters by over 99%
  • A DPO-aligned instruct model trained on preference pairs, starting from an SFT checkpoint
  • A production-ready serving stack: quantized model exported to GGUF, served locally via Ollama, and deployed at scale with vLLM

Every chapter includes working, runnable code, common bug sections drawn from real implementation failures, and exercises that push your understanding beyond what the text alone can teach.

Herečka & Polyglotka
EWA KASP pre
Prehrať video
Ewa Kasp
Libristo má najväčší výber cudzojazyčnej literatúry. Preto si knihy kupujem tu.

Informácie o knihe

Celý názov THE LLM ENGINEER
Jazyk Angličtina
Väzba Kniha - Brožovaná
Dátum vydania 2026
Počet strán 246
EAN 9798255965939
Libristo kód 51946507
Nakladateľstvo Independently published
Váha 431
Rozmery 191 x 235 x 13
Darujte túto knihu ešte dnes
Je to jednoduché
1 Pridajte knihu do košíka a vyberte možnosť doručiť ako darček 2 Obratom Vám zašleme poukaz 3 Knihu zašleme na adresu obdarovaného

Prihlásenie

Prihláste sa k svojmu účtu. Ešte nemáte Libristo účet? Vytvorte si ho teraz!

 
povinné
povinné

Nemáte účet? Získajte výhody Libristo účtu!

Vďaka Libristo účtu budete mať všetko pod kontrolou.

Vytvoriť Libristo účet
Knižný radca Libroamiko
Ahoj, som Libroamiko, môžem pomôcť?