Drew the AI Guy awdemos

🏠

Working from home

Founder of aicareerpath.dev, I am an AI consultant, teacher, and advocate who accelerates AI and robotics transformation for enterprise organizations and teams.

18 followers · 30 following

AICareerpath
Abilene, TX
https://aicareerpath.dev/

View GitHub Profile

Recently created

Least recently created

Recently updated

Least recently updated

awdemos / grpo_demo.py

Created March 6, 2025 13:48 — forked from willccbb/grpo_demo.py

GRPO Llama-1B

	# train_grpo.py
	#
	# See https://github.com/willccbb/verifiers for ongoing developments
	#
	import re
	import torch
	from datasets import load_dataset, Dataset
	from transformers import AutoTokenizer, AutoModelForCausalLM
	from peft import LoraConfig
	from trl import GRPOConfig, GRPOTrainer

awdemos / prompt.txt

Created March 2, 2025 19:17 — forked from dakshvar22/prompt.txt

Deepseek R1 for self-improvement of CALM's prompts

	I am building a prompt for an LLM (gpt-4o) while building a conversational assistant. The LLM is expected to predict one of the available commands based on the instructions given. Here is the current prompt -

	```
	Your task is to analyze the current conversation context and generate a list of actions to start new business processes that we call flows, to extract slots, or respond to small talk and knowledge requests.

	These are the flows that can be started, with their description and slots:

	transfer_money: send money to friends and family
	slot: transfer_money_recipient (the name of a person)
	slot: transfer_money_amount_of_money (the amount of money without any currency designation)