- Build LLM from scratch in Python using the `createllm` package - https://pythonscholar.com/build-a-large-language-model-from-scratch/
- Private LLM using `databricks-dolly-15k` (approximately 15,000 instruction/response fine-tuning records) - https://www.leewayhertz.com/build-private-llm/
- The Emergence Of Large Language Model (LLM) API Build Frameworks - https://cobusgreyling.medium.com/the-emergence-of-large-language-model-llm-api-build-frameworks-78d83d68eeda
- Corpus size and LLM - https://genai.stackexchange.com/q/613/2269
- Restrict LLM responses to specific dataset - https://genai.stackexchange.com/q/167/2269
- Fine tuning the LLaVA Vision LLM on AWS - https://medium.com/@mr.sean.ryan/fine-tuning-the-llava-vision-llm-on-aws-2ba46b7dcec9
- Time-LLM: Reprogram an LLM for Time Series Forecasting - https://towardsdatascience.com/time-llm-reprogram-an-llm-for-time-series-forecasting-e2558087b8ac
- Running Your Very Own Local LLM - https://yc.prosetech.com/running-your-very-own-local-llm-6d4db99c0611
- What are 1-bit LLMs? - https://medium.com/data-science-in-your-pocket/what-are-1-bit-llms-3f2ae4b40fdf
- Implementing the Transformer Encoder from Scratch in TensorFlow and Keras - https://machinelearningmastery.com/implementing-the-transformer-encoder-from-scratch-in-tensorflow-and-keras/
- Unleashing the Power of Language Models: A Deep Dive into Language Foundation Model Tuning Strategies - https://ai.plainenglish.io/unleashing-the-power-of-language-models-a-deep-dive-into-language-foundation-model-tuning-4f1e96be7ddf
- Understanding Large Language Models - Words vs Tokens - https://kelvin.legal/understanding-large-language-models-words-versus-tokens/
- Hands-On LangChain for LLM Applications Development: Output Parsing - https://pub.towardsai.net/hands-on-langchain-for-llm-applications-development-output-parsing-876354434462
- What are foundation models? - https://research.ibm.com/blog/what-are-foundation-models
- Your Guide to the LLM Ecosystem - https://pub.aimind.so/your-guide-to-the-llm-ecosystem-f67826c84be8
- Large Language Model (LLM) Stack — Version 5 - https://cobusgreyling.medium.com/large-language-model-llm-stack-version-5-5a9306870e7f
- Building a Million-Parameter LLM from Scratch Using Python - https://levelup.gitconnected.com/building-a-million-parameter-llm-from-scratch-using-python-f612398f06c2
- Deploy your own Open-Source Language Model: A Comprehensive Guide - https://blog.zhaw.ch/artificial-intelligence/2023/04/20/deploy-your-own-open-source-language-model/
- Best practices for building LLMs - https://stackoverflow.blog/2024/02/07/best-practices-for-building-llms/
- Understanding Encoder And Decoder LLMs - https://magazine.sebastianraschka.com/p/understanding-encoder-and-decoder
- How does the (decoder-only) transformer architecture work? - https://ai.stackexchange.com/q/40179/75530
- LLM Agents - https://www.promptingguide.ai/research/llm-agents
- LangChain SQL use cases - https://python.langchain.com/docs/use_cases/sql/
- Building an LLM Stack, Part 1: Implementing Encoders and Decoders - https://deepgram.com/learn/building-an-llm-stack-1-implementing-encoders-and-decoders
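The encoder/decoder and transformer-architecture links above all revolve around the same core operation. As a rough, dependency-free sketch (pure Python with toy numbers, not the optimized implementations those articles describe), scaled dot-product self-attention looks like this:

```python
import math

def softmax(xs):
    # Subtract the max for numerical stability, then normalize to sum to 1.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def self_attention(queries, keys, values):
    """Scaled dot-product attention over plain Python lists.

    queries/keys/values: lists of equal-length vectors (one per token).
    Returns one output vector per query token.
    """
    d_k = len(keys[0])
    outputs = []
    for q in queries:
        # Similarity of this query with every key, scaled by sqrt(d_k).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k)
                  for k in keys]
        weights = softmax(scores)
        # Each output is a weighted sum of the value vectors.
        out = [sum(w * v[i] for w, v in zip(weights, values))
               for i in range(len(values[0]))]
        outputs.append(out)
    return outputs

# Three 4-dimensional token embeddings (hypothetical numbers).
x = [[1.0, 0.0, 1.0, 0.0],
     [0.0, 1.0, 0.0, 1.0],
     [1.0, 1.0, 1.0, 1.0]]
# In a real transformer, Q, K, V come from learned linear projections of x;
# here we reuse x directly to keep the sketch short.
out = self_attention(x, x, x)
```

Because each output row is a convex combination of the value vectors, every component stays within the range of the inputs; the learned projections (omitted here) are what give real models their expressive power.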
- The Secrets of Large Language Models Parameters
- Model Cards for Model Reporting
- A simple way to create ML Model Cards in Python
- HF Model Card writing tool
- Model Card Guidebook
- How to choose your LLM - https://community.aws/posts/how-to-choose-your-llm
- How I selected my GenAI and Large Language Model (LLM) Platform - https://medium.com/@nayan.j.paul/how-i-selected-my-genai-and-large-language-model-llm-platform-cfe6da358b25
- LLM Evaluation: Benchmarking Performance and Metrics - https://aisera.com/blog/llm-evaluation/
- LLM Evaluation: Everything You Need To Run, Benchmark LLM Evals - https://arize.com/blog-course/llm-evaluation-the-definitive-guide/
- W&B Prompts - https://docs.wandb.ai/guides/prompts
- Evaluating Large Language Model (LLM) systems: Metrics, challenges, and best practices - https://medium.com/data-science-at-microsoft/evaluating-llm-systems-metrics-challenges-and-best-practices-664ac25be7e5
- Decoding LLM Performance: A Guide to Evaluating LLM Applications - https://amagastya.medium.com/decoding-llm-performance-a-guide-to-evaluating-llm-applications-e8d7939cafce
- Amazon Bedrock Model Evaluation - https://docs.aws.amazon.com/bedrock/latest/userguide/model-evaluation.html
- How to Evaluate LLMs: A Complete Metric Framework - https://www.microsoft.com/en-us/research/group/experimentation-platform-exp/articles/how-to-evaluate-llms-a-complete-metric-framework/
- MLflow LLM Evaluate - https://mlflow.org/docs/latest/llms/llm-evaluate/index.html
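Many of the evaluation guides above lean on a few simple reference-based metrics. As a hedged illustration (exact match and token-level F1 are standard metric names; the helper functions and normalizer below are my own simplifications), they can be computed like this:

```python
def normalize(text):
    # Deliberately simple normalizer: lowercase and split on whitespace.
    return text.lower().split()

def exact_match(prediction, reference):
    # 1.0 if the normalized token sequences are identical, else 0.0.
    return float(normalize(prediction) == normalize(reference))

def token_f1(prediction, reference):
    # Token-overlap F1, in the spirit of QA benchmarks such as SQuAD.
    pred, ref = normalize(prediction), normalize(reference)
    ref_counts = {}
    for t in ref:
        ref_counts[t] = ref_counts.get(t, 0) + 1
    common = 0
    for t in pred:
        if ref_counts.get(t, 0) > 0:
            ref_counts[t] -= 1
            common += 1
    if common == 0:
        return 0.0
    precision = common / len(pred)
    recall = common / len(ref)
    return 2 * precision * recall / (precision + recall)

score = token_f1("the cat sat", "the cat sat on the mat")
```

These string-overlap metrics only suit tasks with short, well-defined answers; for open-ended generation the guides above recommend human review or model-graded evaluation instead.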
- Comprehensive Guide for Finetuning - https://genai.stackexchange.com/q/564/2269
- Fine tuning pipeline for open-source LLMs - https://paulabartabajo.substack.com/p/lets-fine-tune-an-open-source-llm
- When Should You Fine-Tune LLMs? - https://towardsdatascience.com/when-should-you-fine-tune-llms-2dddc09a404a
- Large Language Models: to Fine-tune or not to Fine-tune? - https://www.ml6.eu/blogpost/fine-tuning-large-language-models
- Fine tuning: what is it good for? - https://community.openai.com/t/fine-tuning-what-is-it-good-for/428080?u=nsubrahm
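The fine-tuning articles above generally assume instruction/response records in the `databricks-dolly-15k` style. A minimal sketch of turning such records into training prompts (the template text below is a common convention I have adopted for illustration, not an official format):

```python
# A dolly-15k-style record: instruction, optional context, and response.
record = {
    "instruction": "Summarize the text in one sentence.",
    "context": "LLMs are trained to predict the next token in a sequence of text.",
    "response": "LLMs learn language by next-token prediction.",
}

PROMPT_WITH_CONTEXT = (
    "Below is an instruction paired with context. Write a suitable response.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Context:\n{context}\n\n"
    "### Response:\n{response}"
)

PROMPT_NO_CONTEXT = (
    "Below is an instruction. Write a suitable response.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Response:\n{response}"
)

def format_record(rec):
    # Choose a template depending on whether the record carries context;
    # str.format ignores keys the template does not reference.
    template = PROMPT_WITH_CONTEXT if rec.get("context") else PROMPT_NO_CONTEXT
    return template.format(**rec)

prompt = format_record(record)
```

Whatever template you choose, the key point from the articles above is consistency: the same layout must be used at fine-tuning time and at inference time.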
- What is RAG? - https://aws.amazon.com/what-is/retrieval-augmented-generation/
- Retrieval Augmented Generation (RAG) - https://docs.aws.amazon.com/sagemaker/latest/dg/jumpstart-foundation-models-customize-rag.html
- Retrieval augmented generation: Keeping LLMs relevant and current - https://stackoverflow.blog/2023/10/18/retrieval-augmented-generation-keeping-llms-relevant-and-current/
- Retrieval Augmented Generation (RAG) - https://www.promptingguide.ai/techniques/rag
- Build Industry-Specific LLMs Using Retrieval Augmented Generation - https://towardsdatascience.com/build-industry-specific-llms-using-retrieval-augmented-generation-af9e98bb6f68
- RAG Vs Fine tuning Vs Both - https://medium.com/@ramprasathsee/rag-vs-fine-tuning-vs-both-3cb25857d921
- GraphRAG: Unlocking LLM discovery on narrative private data - https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
- What is a Vector Database? - https://zilliz.com/learn/what-is-vector-database
- Knowledge Bases for Amazon Bedrock - https://aws.amazon.com/bedrock/knowledge-bases/
- Retrieval Augmented Generation (RAG) Architecture based on AWS - https://shabarish033.medium.com/retrieval-augmented-generation-rag-architecture-based-on-aws-fc449b708b04
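The core retrieval step behind the RAG material above can be sketched without any vector database: embed the documents somehow (here a toy bag-of-words vector; real systems use learned dense embeddings), rank them by cosine similarity to the query, and stuff the best match into the prompt. The documents and query are made up for illustration:

```python
import math
from collections import Counter

def embed(text):
    # Toy bag-of-words "embedding"; real RAG uses learned dense embeddings.
    return Counter(text.lower().split())

def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a if t in b)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

documents = [
    "Invoices are processed within 30 days of receipt.",
    "Employees accrue 1.5 vacation days per month.",
    "The API rate limit is 100 requests per minute.",
]

def retrieve(query, docs, k=1):
    # Rank documents by similarity to the query and return the top k.
    ranked = sorted(docs, key=lambda d: cosine(embed(query), embed(d)),
                    reverse=True)
    return ranked[:k]

query = "How many vacation days do employees get?"
context = retrieve(query, documents)[0]
prompt = f"Answer using only this context:\n{context}\n\nQuestion: {query}"
```

A vector database replaces the linear scan in `retrieve` with approximate nearest-neighbor search, which is what makes the same pattern work over millions of documents.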
- What every investor should know about the GenAI tech stack - https://raphaelledornano.medium.com/what-every-investor-should-know-about-the-genai-tech-stack-813cc04a5249
- Understanding the GenAI Tech Stack : Part 3 — Foundation Models - https://raphaelledornano.medium.com/understanding-the-genai-tech-stack-part-3-foundation-models-8c3a9ad2c49b
- Understanding the GenAI Tech Stack : Part 4 — Application Models - https://raphaelledornano.medium.com/understanding-the-genai-tech-stack-part-4-application-models-8a92fc30e3ef
- GenAI Dev Stack, LLMOps & Vector Databases! - https://www.linkedin.com/pulse/genai-dev-stack-llmops-vector-databases-pavan-belagatti-wmcmc/
- GenAI Stack Walkthrough: Behind the Scenes With Neo4j, LangChain, and Ollama in Docker - https://neo4j.com/developer-blog/genai-app-how-to-build/
- Generative AI Tech Stack: A Complete Guide - https://flyaps.com/blog/generative-ai-tech-stack-a-complete-guide/
- Understanding Generative AI: A Tech Stack Breakdown - https://www.orioninc.com/blog/understanding-generative-ai-a-tech-stack-breakdown/
- The LLM App Stack — 2024 - https://medium.com/plain-simple-software/the-llm-app-stack-2024-eac28b9dc1e7
- What is Prompt Engineering? - https://aws.amazon.com/what-is/prompt-engineering/
- Prompt engineering for foundation models - https://docs.aws.amazon.com/sagemaker/latest/dg/jumpstart-foundation-models-customize-prompt-engineering.html
- What is prompt engineering? - https://docs.aws.amazon.com/bedrock/latest/userguide/what-is-prompt-engineering.html
- Prompt engineering guidelines - https://docs.aws.amazon.com/bedrock/latest/userguide/prompt-engineering-guidelines.html
- Best Practices for Prompt Engineering with Amazon CodeWhisperer - https://aws.amazon.com/blogs/devops/best-practices-for-prompt-engineering-with-amazon-codewhisperer/
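Much of the prompt-engineering guidance linked above boils down to structuring the task description, worked examples, and the actual input consistently. A minimal few-shot prompt builder (the layout is a common pattern, not tied to any specific provider; the example task and strings are invented):

```python
def build_few_shot_prompt(task, examples, query):
    """Assemble a few-shot prompt: task description, worked examples, query.

    examples: list of (input, output) pairs demonstrating the task.
    """
    parts = [task, ""]
    for inp, out in examples:
        parts.append(f"Input: {inp}")
        parts.append(f"Output: {out}")
        parts.append("")
    # End with the real query and an open "Output:" for the model to complete.
    parts.append(f"Input: {query}")
    parts.append("Output:")
    return "\n".join(parts)

prompt = build_few_shot_prompt(
    task="Classify the sentiment of each input as positive or negative.",
    examples=[
        ("I loved this product!", "positive"),
        ("Terrible customer service.", "negative"),
    ],
    query="The delivery was fast and the packaging was great.",
)
```

Ending the prompt with a bare `Output:` nudges the model to continue the established pattern, which is the essence of few-shot prompting.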
- Total noob’s intro to Hugging Face Transformers
- Hugging Face Transformers - Hugging Face Transformers is an open-source Python library that provides access to thousands of pre-trained Transformer models for natural language processing (NLP), computer vision, audio tasks, and more. It simplifies working with Transformer models by abstracting away the complexity of training and deploying them in lower-level ML frameworks such as PyTorch, TensorFlow, and JAX.
- Hugging Face Hub - The Hugging Face Hub is a collaboration platform that hosts a huge collection of open-source models and datasets for machine learning; think of it as GitHub for ML. The Hub facilitates sharing and collaboration by making it easy to discover, learn from, and interact with useful ML assets from the open-source community. It integrates with, and is used in conjunction with, the Transformers library: models deployed via Transformers are downloaded from the Hub.
- Hugging Face Spaces - Spaces is a service on the Hugging Face Hub that provides an easy-to-use GUI for building and deploying web-hosted ML demos and apps. It lets you quickly build ML demos, upload your own apps to be hosted, or deploy one of a number of pre-configured ML applications instantly.
According to Alireza Goudarzi, senior researcher of machine learning (ML) for GitHub Copilot: “LLMs are not trained to reason. They’re not trying to understand science, literature, code, or anything else. They’re simply trained to predict the next token in the text.” (Source)
For top-tier prompts, see www.godtierprompts.com.