Dive into a 3-hour coding workshop on Large Language Models (LLMs) to learn how to implement, train, and use them. The video presentation includes chapters covering an introduction to LLMs, understanding the input data, coding an LLM architecture, pretraining, loading pretrained weights, instruction fine-tuning, benchmark evaluation, and evaluating conversational performance, with clickable chapter marks for easy navigation. This departure from the usual text-based content was well received last time, so here is another video edition. Check out the workshop video, the accompanying materials, and the additional resources on GitHub for a comprehensive learning experience. Happy viewing!
https://magazine.sebastianraschka.com/p/building-llms-from-the-ground-up
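For readers who want a feel for the "coding an LLM architecture" portion before watching, here is a minimal, hypothetical sketch of a GPT-style decoder-only model in PyTorch. The class names, layer sizes, and hyperparameters below are illustrative assumptions, not the workshop's actual code; see the linked video and GitHub materials for the real implementation.

```python
# Minimal, illustrative GPT-style decoder-only model in PyTorch.
# NOTE: names and hyperparameters are assumptions for demonstration,
# not the workshop's actual implementation.
import torch
import torch.nn as nn


class CausalSelfAttention(nn.Module):
    def __init__(self, emb_dim: int, num_heads: int, context_len: int):
        super().__init__()
        self.attn = nn.MultiheadAttention(emb_dim, num_heads, batch_first=True)
        # Causal mask: each token may only attend to itself and earlier tokens.
        mask = torch.triu(torch.ones(context_len, context_len, dtype=torch.bool), diagonal=1)
        self.register_buffer("mask", mask)

    def forward(self, x):
        seq_len = x.size(1)
        out, _ = self.attn(x, x, x, attn_mask=self.mask[:seq_len, :seq_len])
        return out


class TransformerBlock(nn.Module):
    def __init__(self, emb_dim: int, num_heads: int, context_len: int):
        super().__init__()
        self.norm1 = nn.LayerNorm(emb_dim)
        self.attn = CausalSelfAttention(emb_dim, num_heads, context_len)
        self.norm2 = nn.LayerNorm(emb_dim)
        self.ff = nn.Sequential(            # position-wise feed-forward network
            nn.Linear(emb_dim, 4 * emb_dim),
            nn.GELU(),
            nn.Linear(4 * emb_dim, emb_dim),
        )

    def forward(self, x):
        x = x + self.attn(self.norm1(x))    # pre-norm residual attention
        x = x + self.ff(self.norm2(x))      # pre-norm residual feed-forward
        return x


class MiniGPT(nn.Module):
    """Tiny GPT-style model: token + position embeddings, decoder blocks, LM head."""
    def __init__(self, vocab_size=50257, emb_dim=256, num_heads=4,
                 num_layers=4, context_len=128):
        super().__init__()
        self.tok_emb = nn.Embedding(vocab_size, emb_dim)
        self.pos_emb = nn.Embedding(context_len, emb_dim)
        self.blocks = nn.Sequential(
            *[TransformerBlock(emb_dim, num_heads, context_len) for _ in range(num_layers)]
        )
        self.final_norm = nn.LayerNorm(emb_dim)
        self.lm_head = nn.Linear(emb_dim, vocab_size, bias=False)

    def forward(self, token_ids):           # token_ids: (batch, seq_len)
        seq_len = token_ids.size(1)
        pos = torch.arange(seq_len, device=token_ids.device)
        x = self.tok_emb(token_ids) + self.pos_emb(pos)
        x = self.blocks(x)
        return self.lm_head(self.final_norm(x))   # logits: (batch, seq_len, vocab_size)


if __name__ == "__main__":
    model = MiniGPT()
    dummy = torch.randint(0, 50257, (2, 16))       # batch of 2 sequences, 16 tokens each
    print(model(dummy).shape)                      # torch.Size([2, 16, 50257])
```

The sketch only covers the architecture chapter; pretraining, loading pretrained weights, instruction fine-tuning, and evaluation are separate steps that the workshop walks through in the remaining chapters.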