Skip to main content
CF

Rearchitecting LLMs

0h 0m 0s
English
Paid

Rearchitecting LLMs is a self-paced course by Pere Martra. Large general-purpose language models are not optimized by default for specific industries and business tasks.

Course facts

Lessons
0
Duration
self-paced
Level
All levels
Language
English
Updated
Instructor
Pere Martra
Price
Premium

Large general-purpose language models are not optimized by default for specific industries and business tasks. By using methods such as specialized fine-tuning, pruning unnecessary neural components, and knowledge distillation, models can be rearchitected to work faster, be cheaper to operate, and provide more accurate results.

The book "Rearchitecting LLM: Structural Methods for Creating Efficient Models" translates ideas from the latest AI research into practical approaches for optimizing models for specific tasks. Working with this practical book, you will perform "surgical" tuning of popular open-source models—such as Llama-3, Gemma, and Qwen—to create cost-effective local small language models (SLM).

As you study the material, you will learn to combine behavioral analysis of models with structural changes in architecture: identify and remove components that do not contribute to the model's goals, and apply "fair pruning" methods to reduce model bias at the level of individual neurons.

What's inside the book:

  • universal methods for model architecture tuning
  • end-to-end model rearchitecture pipelines
  • improving explainability and reducing bias through model "cleaning"
  • replacing external LLMs with local SLMs

Who teaches Rearchitecting LLMs? Pere Martra

Pere Martra thumbnail

Pere Martra is an ML Research Engineer specializing in post-training, compression, and alignment of large language models (LLM). His work aims to bridge the gap between academic research and engineering practice. The main goal of his research is to create specialized small language models (SLM) capable of achieving high performance with significantly fewer computational resources.

He is currently working on a book, "Rearchitecting LLMs," for Manning Publications (MEAP - Q1 2026, publication - Q2 2026). The book focuses on advanced optimization methods for language models that go beyond traditional fine-tuning. Previously, he published the book "Large Language Models Projects" (Apress, 2024), dedicated to the practical application of LLM.

His key technical areas include model architecture optimization, the development of efficient pipelines for creating specialized models, and research on structural pruning. In his work (preprint, December 2025), he explores systematic pruning in GLU architectures, demonstrating how structural optimization can enhance the models' abilities, such as following instructions.

He is also the creator of OptiPfair, an open-source library for detecting and reducing bias at the level of individual neuron components in models.

With over twenty years of experience in technical leadership, he is currently focused on the development of efficient and ethical AI systems. Pere Martra is actively engaged in the professional community: he is the author of the "Large Language Model Notebooks" course (over 1,800 stars on GitHub) and participates in the Hugging Face and Google Gemini ecosystems.

He is open to collaboration on research and engineering projects related to model efficiency, compression, and the responsible development of artificial intelligence.

Books

Read Book Rearchitecting LLMs

#TitleTypeOpen
1Rearchitecting LLMs v2 MEAP PDF
2Rearchitecting LLMs v3 MEAP PDF
3Rearchitecting LLMs v4 MEAP PDF

What courses are similar to Rearchitecting LLMs?

  • Building Solo thumbnailNew

    Building Solo

    By: Aaron Francis
    Explore the creation of the SOLO application: manage development in one place. A unified interface for npm, composer, and servers, with support for multiple pro
    2h 3m
  • Auditing Your Code thumbnailNew

    Auditing Your Code

    By: Aaron Francis
    Learn code audit skills to analyze and improve the codebase. The course covers tools for systematic analysis, project cleanup, and working with reports.
    30m
  • Building Loadout thumbnailNew

    Building Loadout

    By: Aaron Francis
    Take the course and create a desktop application called Loadout to manage AI tools using modern technologies such as Rust and React.
    11m
  • AI Voice Agents with AWS thumbnailUpdated 1mo ago

    AI Voice Agents with AWS

    By: Zero To Mastery
    Study the creation of voice AI agents using AWS and Python. Develop an assistant with real functionalities and a deep understanding of the architecture.
    3h 1m5/5
  • faster. | Learn AI-Assisted Development thumbnailUpdated 1mo ago

    faster. | Learn AI-Assisted Development

    By: Aaron Francis
    Practical course on AI development for engineers. Learn reproducible processes and improve your code with artificial intelligence.
    5/5
  • Building faster.dev thumbnailNew

    Building faster.dev

    By: Aaron Francis
    Master the development of the AI-native platform faster.dev using modern technologies, including Laravel and React, to create efficient SaaS products.
    22m
  • n8n for AI Workflows and AI Agents thumbnailUpdated 1mo ago

    n8n for AI Workflows and AI Agents

    By: Academind Pro (Maximilian Schwarzmüller)
    Learn how to create robust automations with n8n and AI. This includes AI agents, email processing, content generation, and image generation.
  • GOLANG A.I. Masterclass - Build 6 A.I. Projects with GO (Advanced Course) thumbnailNew

    GOLANG A.I. Masterclass - Build 6 A.I. Projects with GO (Advanced Course)

    By: Akhil Sharma
    Take a 26-hour course on AI project development in Golang. Create 6 advanced projects, enhancing your skills in building scalable solutions.
    25h 50m0/5

Frequently asked questions

What is Rearchitecting LLMs about?
Large general-purpose language models are not optimized by default for specific industries and business tasks. By using methods such as specialized fine-tuning, pruning unnecessary neural components, and knowledge distillation, models can…
Who teaches this course?
It is taught by Pere Martra. You can find more courses by this instructor on the corresponding source page.
How long is the course?
It is delivered as a self-paced online course on CourseFlix.
Is it free to watch?
It is part of CourseFlix's premium catalog. A subscription unlocks the full video player; the course description, table of contents, and preview information are available to everyone.
Where can I watch it online?
The course is available to watch online on CourseFlix at https://courseflix.net/course/rearchitecting-llms. The page hosts every lesson with the integrated video player; no download is required.