Skip to content

Instantly share code, notes, and snippets.

View awdemos's full-sized avatar
🏠
Working from home

Drew the AI Guy awdemos

🏠
Working from home
View GitHub Profile
@awdemos
awdemos / grpo_demo.py
Created August 6, 2025 03:31 — forked from NickyDark1/grpo_demo.py
a grpo modifaction for deepspeed in multigpu from https://gist.github.com/willccbb/4676755236bb08cab5f4e54a0475d6fb
# train_grpo.py
from typing import *
import re
import torch
from datasets import load_dataset, Dataset, load_from_disk
from transformers import AutoTokenizer, AutoModelForCausalLM, TrainingArguments
from peft import LoraConfig
from trl import GRPOConfig, GRPOTrainer, TrlParser
from dataclasses import dataclass, field
@hughdbrown
hughdbrown / setup-aie5.sh
Created February 27, 2025 21:34
Setup AIE5 with upstream/main and origin/main
#!/bin/sh
set -eux -o pipefail
export GIT_USER=hughdbrown
export GIT_REPO_NAME_USER=AIE5 # What the user is going to call the repo
export GIT_REPO_NAME_AI_MAKER=AIE5 # What the repo is already called by AI_MAKER
mkdir -p ~/workspace/${GIT_USER}/maven.com/${GIT_REPO_NAME_USER}
cd ~/workspace/${GIT_USER}/maven.com/${GIT_REPO_NAME_USER}
@NickyDark1
NickyDark1 / grpo_demo.py
Created February 7, 2025 14:24 — forked from cgpeter96/grpo_demo.py
a grpo modifaction for deepspeed in multigpu from https://gist.github.com/willccbb/4676755236bb08cab5f4e54a0475d6fb
# train_grpo.py
from typing import *
import re
import torch
from datasets import load_dataset, Dataset, load_from_disk
from transformers import AutoTokenizer, AutoModelForCausalLM, TrainingArguments
from peft import LoraConfig
from trl import GRPOConfig, GRPOTrainer, TrlParser
from dataclasses import dataclass, field
@awdemos
awdemos / The Evolution of Large Language Models: From Transformers to DeepSeek-R1.md
Last active January 31, 2025 00:57
The Evolution of Large Language Models: From Transformers to DeepSeek-R1

The Evolution of Large Language Models: From Transformers to DeepSeek-R1

The field of artificial intelligence has seen remarkable progress in recent years, particularly in the domain of large language models (LLMs). This article explores the journey from the foundational Transformer architecture to the cutting-edge DeepSeek-R1 model, highlighting key developments and breakthroughs along the way.

Transformer Architecture: The Foundation of Modern LLMs

The Transformer architecture, introduced in 2017, revolutionized natural language processing. Its attention mechanism allowed for more efficient processing of sequential data, paving the way for larger and more capable language models1.

@awdemos
awdemos / DeepSeekR1AssistantAICareerPathDev.md
Last active February 3, 2025 16:10
DeepSeek-R1 Mastery: Build Your Ultimate AI Assistant

Revolutionize Your Workflow: Build Your Own AI Assistant with DeepSeek-R1 and Browser-Use

Unleash the Power of Local AI for Seamless Web Automation and Deep Analysis

🚀 Ready to supercharge your productivity? Dive into the world of AI with DeepSeek-R1 and Browser-Use, a game-changing combination that turns your computer into a personal AI powerhouse. This comprehensive guide will transform you from a curious tech enthusiast to an AI maestro, capable of creating an assistant that browses the web, gathers data, and delivers insightful analysis—all while keeping your data 100% private and under your control.

🎯 Master These AI Superpowers

By the end of this guide, you'll be able to:

@willccbb
willccbb / grpo_demo.py
Last active October 24, 2025 07:12
GRPO Llama-1B
# train_grpo.py
#
# See https://github.com/willccbb/verifiers for ongoing developments
#
"""
citation:
@misc{brown2025grpodemo,
title={Granular Format Rewards for Eliciting Mathematical Reasoning Capabilities in Small Language Models},
author={Brown, William},
@Artefact2
Artefact2 / README.md
Last active October 21, 2025 02:42
GGUF quantizations overview

Which GGUF is right for me? (Opinionated)

Good question! I am collecting human data on how quantization affects outputs. See here for more information: ggml-org/llama.cpp#5962

In the meantime, use the largest that fully fits in your GPU. If you can comfortably fit Q4_K_S, try using a model with more parameters.

llama.cpp feature matrix

See the wiki upstream: https://github.com/ggerganov/llama.cpp/wiki/Feature-matrix