@sozercan
Created May 27, 2024 04:00
#syntax=ghcr.io/sozercan/aikit:latest
apiVersion: v1alpha1
baseModel: unsloth/llama-3-8b-bnb-4bit
datasets:
  - source: "yahma/alpaca-cleaned"
    type: alpaca
config:
  unsloth:
    packing: false
    maxSeqLength: 2048
    loadIn4bit: true
    batchSize: 2
    gradientAccumulationSteps: 4
    warmupSteps: 10
    maxSteps: 60
    learningRate: 0.0002
    loggingSteps: 1
    optimizer: adamw_8bit
    weightDecay: 0.01
    lrSchedulerType: linear
    seed: 42
output:
  quantize: q4_k_m
  name: model
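
The training values in this config interact: batchSize and gradientAccumulationSteps together set how many examples feed each optimizer update, while warmupSteps, maxSteps, and learningRate shape the learning-rate curve. The Python sketch below works through that arithmetic. It assumes a single device and a schedule that ramps linearly from 0 to the peak rate during warmup and then decays linearly to 0 at maxSteps (the same shape as Hugging Face's linear schedule with warmup). Whether the trainer behind this config follows exactly that curve is an assumption, and the function names are hypothetical helpers, not part of aikit or unsloth.

```python
# Values copied from the config; everything else is illustrative.
BATCH_SIZE = 2
GRAD_ACCUM_STEPS = 4
WARMUP_STEPS = 10
MAX_STEPS = 60
PEAK_LR = 0.0002


def effective_batch_size(batch_size: int, grad_accum_steps: int) -> int:
    """Examples contributing to one optimizer update (single device assumed)."""
    return batch_size * grad_accum_steps


def linear_lr(step: int, peak_lr: float, warmup_steps: int, max_steps: int) -> float:
    """Assumed schedule: linear warmup from 0, then linear decay to 0."""
    if step < warmup_steps:
        return peak_lr * step / max(1, warmup_steps)
    remaining = max(0, max_steps - step)
    return peak_lr * remaining / max(1, max_steps - warmup_steps)


if __name__ == "__main__":
    per_update = effective_batch_size(BATCH_SIZE, GRAD_ACCUM_STEPS)
    print("examples per optimizer update:", per_update)
    print("examples seen over the run:", per_update * MAX_STEPS)
    for step in (0, 5, 10, 35, 60):
        print(step, linear_lr(step, PEAK_LR, WARMUP_STEPS, MAX_STEPS))
```

Under these assumptions each update averages gradients over 8 examples, so the 60-step run touches 480 examples, a small slice of the dataset; maxSteps is the value to raise for a longer run.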