Build a Large Language Model (From Scratch)

0h 0m 0s
English
Paid

"Creating a Large Language Model from Scratch" is a practical guide that will teach you step-by-step how to create, train, and fine-tune large language models (LLMs).

In the book, you will go through the entire process—from model design to pre-training on a general data corpus and fine-tuning for specific tasks. Each stage is accompanied by clear explanations, diagrams, and examples.

Read more about the course

What You Will Learn:

  • Plan and program all parts of LLM
  • Prepare datasets for model training
  • Fine-tune LLM for text classification and custom data
  • Use feedback to improve instruction execution
  • Load pre-trained weights

The book will guide you through the internal mechanisms of generative AI, allowing you to not only understand how LLMs work but also learn how to adapt them. All developed examples can be run on a regular laptop.

About the Technology

Following the principle "I cannot understand what I cannot build," you will learn the process of creating a GPT-style LLM from scratch. Without using pre-built libraries, you will design a basic model, configure it for text classification, and eventually create a chatbot that can follow your instructions.

About the Book

"Building a Large Language Model from Scratch" is an engaging practical course on the fundamentals of generative AI. At each stage, you will delve into theory, reinforcing your knowledge with real development, giving you a deep understanding of the workings and limitations of LLMs.

Who the Book is For:

The book is suitable for readers with an intermediate level of Python and basic knowledge of machine learning. All developed models will work on any modern laptop, with the possibility of using a GPU.

Read Book Build a Large Language Model (From Scratch)

#Title
1Build a Large Language Model (From Scratch)

Similar courses to Build a Large Language Model (From Scratch)

Case Study in A/B Testing

Case Study in A/B TestingLunarTech

Category: Data processing and analysis
Duration 1 hour 56 minutes 17 seconds
PyTorch for Deep Learning

PyTorch for Deep Learningzerotomastery.io

Category: Data processing and analysis
Duration 52 hours 27 seconds
The Data Science Course: Complete Data Science Bootcamp 2023

The Data Science Course: Complete Data Science Bootcamp 2023udemy

Category: Data processing and analysis
Duration 31 hours 14 minutes 14 seconds
Apache Airflow Workflow Orchestration

Apache Airflow Workflow OrchestrationAndreas Kretz

Category: Other (Tools), Data processing and analysis
Duration 1 hour 18 minutes 41 seconds
Machine Learning with Python : COMPLETE COURSE FOR BEGINNERS

Machine Learning with Python : COMPLETE COURSE FOR BEGINNERSudemy

Category: Python, Data processing and analysis
Duration 13 hours 12 minutes 31 seconds
dbt for Data Engineers

dbt for Data EngineersAndreas Kretz

Category: Data processing and analysis
Duration 1 hour 52 minutes 55 seconds
Time Series Analysis, Forecasting, and Machine Learning

Time Series Analysis, Forecasting, and Machine Learningudemy

Category: Python, Data processing and analysis
Duration 22 hours 47 minutes 45 seconds
The Data Engineering Bootcamp: Zero to Mastery

The Data Engineering Bootcamp: Zero to Masteryzerotomastery.io

Category: Data processing and analysis
Duration 13 hours 23 minutes 15 seconds
Learning Apache Spark

Learning Apache SparkAndreas Kretz

Category: Data processing and analysis
Duration 1 hour 44 minutes 4 seconds
PyTorch for Deep Learning and Computer Vision

PyTorch for Deep Learning and Computer Visionudemy

Category: Data processing and analysis
Duration 10 hours 20 minutes 51 seconds