@eonist
Last active February 21, 2026 23:55
How the AI Split Works.md

AI Split

For larger refactors with many file changes, the Auto Split feature uses AI to group changes into logical commits — "100 file changes → 12 commits", and so on. It combines mostly recursive algorithms with AI to find the best possible split for the given changes.

Because local models like Apple Foundation Models (AFM) are constrained to tiny context windows (~4K tokens), the system is built on a highly parallel, adaptive "Divide and Conquer" pipeline.

eonist commented Feb 21, 2026

The Split Algorithm: From Chaos to Clean Commits

This is where the real engineering challenge lives. Currently, Omni uses a fixed batch size of 3 files when autoSplit is enabled:[^2]

```typescript
const batchSize = autoSplit ? 3 : Math.max(1, selected.length);
const totalCommits = Math.max(1, Math.ceil(selected.length / batchSize));
```
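For a concrete sense of the math, here is the fixed batching applied to 47 files — an illustrative rerun of the snippet above, not additional Omni code:

```typescript
// Illustrative only: the fixed-batch split for a 47-file change set.
const selected: string[] = Array.from({ length: 47 }, (_, i) => `file${i}.ts`);
const autoSplit = true;

const batchSize = autoSplit ? 3 : Math.max(1, selected.length);
const totalCommits = Math.max(1, Math.ceil(selected.length / batchSize));

console.log(batchSize, totalCommits); // 3 16
```

Every commit gets exactly 3 files (the last one gets the remainder), regardless of whether those files have anything to do with each other.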

This works, but it has no semantic understanding of which files belong together logically. The proposed AI-enhanced approach replaces this with a three-phase hybrid architecture:[^2]

Phase 1: Static Analysis (instant, no AI)

Before any model is called, Omni extracts metadata from the full file — not just the diff. This is the key insight: the diff tells you what changed; the full file tells you why it matters.[^2]

```typescript
interface FileDossier {
  path: string;
  diff: string;
  imports: string[];       // What it depends on
  exports: string[];       // What it exposes
  fileType: 'component' | 'hook' | 'util' | 'test' | 'config' | 'type';
  detectedDomain: string;  // From path: "user", "auth", "checkout"
}
```

From this, Omni builds a relationship graph deterministically — import chains, test-file pairing (UserService.test.ts ↔ UserService.ts), directory clustering. This handles 80% of grouping decisions with zero AI cost.[^2]
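A minimal sketch of what such a deterministic pass could look like; the function name and the two heuristics shown (test-file pairing, directory clustering) are illustrative assumptions, not Omni's actual implementation:

```typescript
// Hypothetical sketch: pair *.test.ts files with their implementations,
// then cluster the remaining files by directory. Not Omni's real code.
function clusterFiles(paths: string[]): string[][] {
  const clusters: string[][] = [];
  const used = new Set<string>();

  // Test-file pairing: Foo.test.ts ↔ Foo.ts
  for (const p of paths) {
    const m = p.match(/^(.*)\.test\.(tsx?|jsx?)$/);
    if (!m) continue;
    const impl = paths.find((q) => q === `${m[1]}.${m[2]}`);
    if (impl) {
      clusters.push([impl, p]);
      used.add(impl);
      used.add(p);
    }
  }

  // Directory clustering for everything not already paired
  const byDir = new Map<string, string[]>();
  for (const p of paths) {
    if (used.has(p)) continue;
    const dir = p.split("/").slice(0, -1).join("/");
    byDir.set(dir, [...(byDir.get(dir) ?? []), p]);
  }
  return [...clusters, ...byDir.values()];
}

const groups = clusterFiles([
  "src/user/UserService.ts",
  "src/user/UserService.test.ts",
  "src/auth/login.ts",
]);
// groups: the UserService pair in one cluster, login.ts in another
```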

Phase 2: AI Refinement (fast, small prompts)

The LLM only answers questions that require judgment: Is this a feature or a fix? Do these two unrelated-looking files actually belong together? For each candidate cluster: "Should this be 1 or 2 commits?" Cross-cluster: "Do any files belong elsewhere?"[^2]

Phase 3: Message Generation (current approach)

Generate title and body per commit using the enriched context from all six sources.[^2]

Adaptive Model Strategy

The architecture scales up gracefully across model tiers:[^2]

| Model tier | Examples | Context | Strategy |
| --- | --- | --- | --- |
| Tiny | Apple AFM | 4K tokens | Many micro-calls, compressed structured prompts |
| Medium | Ollama (Llama, Mistral) | 8–32K | Batched parallel calls |
| Large | GPT 5.2, Opus 4.5, Gemini 3 | 128K–1M+ | Single-pass — send everything at once |

For AFM's tight 4K window, every token counts. Instead of natural language prompts, Omni uses structured formats:[^2]

```
[src/hooks/useUserData.ts]
+cache.get +cache.set -fetchUser direct
?type ?domain ?sum
→ feat user cache-user-fetch
```

This lets 3–4 files fit per call. For 47 files, the full pipeline completes in ~7 seconds. For 100 files, ~12 seconds. With GPT 5.2 or Opus 4.5, the same 47 files resolve in a single call in ~3 seconds.[^2]
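One way to picture how a dossier compresses into that terse format — a hypothetical encoder; the `MiniDossier` fields are assumptions, not Omni's actual types:

```typescript
// Hypothetical compressor: turns a small dossier into the structured
// prompt format shown above. "+" marks symbols added, "-" symbols removed,
// and the trailing "?" line asks the model for type, domain, and summary.
interface MiniDossier {
  path: string;
  added: string[];   // symbols introduced by the diff
  removed: string[]; // symbols dropped by the diff
}

function encode(d: MiniDossier): string {
  const changes = [
    ...d.added.map((s) => `+${s}`),
    ...d.removed.map((s) => `-${s}`),
  ].join(" ");
  return `[${d.path}]\n${changes}\n?type ?domain ?sum`;
}

const prompt = encode({
  path: "src/hooks/useUserData.ts",
  added: ["cache.get", "cache.set"],
  removed: ["fetchUser"],
});
console.log(prompt);
```

Each file costs only a few dozen tokens this way, which is what makes 3–4 files per 4K-token call feasible.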

The design philosophy: build for AFM first, because small models force clean decomposition. Larger models then just collapse multiple passes into one — the micro-pipeline isn't a compromise, it's the foundation.[^2]

AI Rebase: Decluttering the Commit Frenzy

On the flip side of splitting, there's squashing. The AI Rebase feature lets you select a range of messy commits, and Omni's AI groups them into streamlined logical commits — "vibe code generates commit bonanza, Omni declutters the frenzy to a neat stack of commits by removing the commit pollution" (issue #177).

eonist commented Feb 21, 2026

Phase 1: Static Graph Pre-Clustering (Zero AI Cost)

Before invoking AI, Omni builds a deterministic relationship graph. By parsing ASTs locally and regex-matching import statements (e.g., detecting that UserProfile.tsx imports useUserData.ts), it groups obvious dependencies and automatically pairs test files with their implementations (e.g., grouping UserService.ts with UserService.test.ts). This instantaneous step resolves 80% of file relationships for free.

Phase 2: AI Refinement (The Micro-Pipeline)

For ambiguous files, Omni creates a lightweight "File Dossier" (path, diff summary, change type, domain) and feeds it to the LLM. To bypass the 4K token limit of local models, Omni runs an Ultra-Lean Pipeline:

Classify: Many parallel micro-calls to classify each file (feature, fix, chore) using heavily structured, non-natural language prompts (e.g., feat user cache-user-fetch).

Cluster: Batching 25 tiny summaries per call to let the AI determine which files belong together semantically.

Merge: Reconciling groups across batches.
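The Classify → Cluster → Merge steps above can be sketched as follows; `clusterInBatches` and its signature are hypothetical, with only the batch size of 25 taken from the text:

```typescript
// Hypothetical: split file summaries into batches of 25, run the
// per-batch cluster calls in parallel, then merge the results.
async function clusterInBatches(
  summaries: string[],
  callModel: (batch: string[]) => Promise<string[][]>, // one LLM call per batch
): Promise<string[][]> {
  const BATCH = 25; // from the text: "25 tiny summaries per call"
  const batches: string[][] = [];
  for (let i = 0; i < summaries.length; i += BATCH) {
    batches.push(summaries.slice(i, i + BATCH));
  }
  // Cluster: all batch calls run in parallel
  const results = await Promise.all(batches.map(callModel));
  // Merge: a real implementation would reconcile groups across batches;
  // here we simply concatenate them.
  return results.flat();
}
```

With 60 summaries this yields three parallel calls (25 + 25 + 10), which is how a 4K-token model can cover an arbitrarily large change set.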

Adaptive Scaling for Cloud Models

While built to survive on 4K-token local models, the architecture detects capabilities dynamically. If a user provides an API key for a massive-context model like Gemini 3.1 Pro or GPT-5.2, Omni pivots from the "Micro-Pipeline" strategy to a "Single-Pass" strategy — dumping all 47 file dossiers into a single call for superior, instantaneous relationship mapping.
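A sketch of how such capability detection might pick a strategy; the function and the exact thresholds are assumptions that mirror the tier table earlier in the document:

```typescript
// Hypothetical strategy picker keyed on the model's context-window size.
type Strategy = "micro-pipeline" | "batched-parallel" | "single-pass";

function pickStrategy(contextTokens: number): Strategy {
  if (contextTokens >= 128_000) return "single-pass";    // GPT-5.2, Opus, Gemini class
  if (contextTokens >= 8_000) return "batched-parallel"; // Ollama-class local models
  return "micro-pipeline";                               // Apple AFM (~4K tokens)
}

console.log(pickStrategy(4_000));     // "micro-pipeline"
console.log(pickStrategy(1_000_000)); // "single-pass"
```

The point is that the strategy is a runtime decision, so adding a bigger model upgrades the pipeline without changing its structure.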
