Use hybrid system prompt with claude_code base and workspace context #539

jeremyeder · 2026-01-28T05:07:11Z

Summary

Changes the Claude Code runner to use a hybrid system prompt approach that combines the built-in claude_code prompt from the Claude Agent SDK with workspace-specific context.

Changes Made

Code Changes:

Modified adapter.py (lines 508-511) to use array-based system_prompt_config
- First element: "claude_code" (built-in SDK prompt with standard capabilities)
- Second element: workspace context (repos, workflows, artifacts, branches, etc.)

Documentation:

Added comprehensive README.md for the claude-code-runner component
Documented system prompt configuration approach and rationale
Included complete environment variables reference
Added development, testing, and API documentation
Documented recent changes section for historical context

Benefits

✅ Better SDK Alignment: Uses the official Claude Code system prompt as intended
✅ Maintains Context: Preserves workspace-specific information (repos, workflows, branches)
✅ No Breaking Changes: Existing functionality continues to work
✅ Improved Clarity: Clear separation between standard and custom instructions

Before

system_prompt_config = {"type": "text", "text": workspace_prompt}

After

system_prompt_config = [
    "claude_code",
    {"type": "text", "text": workspace_prompt}
]

What Claude Now Receives

Base Claude Code Prompt: Standard instructions, tool definitions, and behavioral guidelines from the SDK
Workspace Context: Session-specific information including:
- Repository locations and structure
- Active workflow details
- Artifacts and upload directories
- Git branch information and auto-push instructions
- MCP integration setup guidance
- Custom workflow instructions from ambient.json

Test Plan

Verify adapter.py changes don't break existing tests
Test that Claude receives both system prompts correctly
Confirm workspace context is still available to Claude
Validate no breaking changes in session behavior
Check that documentation builds correctly

Files Changed

components/runners/claude-code-runner/adapter.py (4 lines modified)
components/runners/claude-code-runner/README.md (245 lines added - new file)

🤖 Generated with Claude Code

Changes the Claude Code runner to use a hybrid system prompt approach that combines the built-in "claude_code" prompt with workspace-specific context. **Changes:** - Modified adapter.py to use array-based system_prompt_config - First element: "claude_code" (built-in SDK prompt) - Second element: workspace context (repos, workflows, artifacts, etc.) **Benefits:** - Leverages standard Claude Code instructions and capabilities - Maintains workspace-specific context for session awareness - Better alignment with Claude Agent SDK best practices - No breaking changes to existing functionality **Documentation:** - Added comprehensive README.md for claude-code-runner component - Documented system prompt configuration approach - Included environment variables reference - Added development and testing guidelines Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

jeremyeder · 2026-01-28T07:15:23Z

I don't think this is a solved problem.
We don't have the experience to build our own system prompt right now.
Not including the claude_code system_prompt was likely because the SDK changed its capabilities or defaults along the way.

Some more context from Jan 2026:

## The Problem Apple Silicon users running `make local-up` experienced: - Slow builds (15-20 minutes instead of 4-6 minutes) - Frequent crashes with: `qemu: uncaught target signal 11 (Segmentation fault)` - Next.js build failures ## Root Cause The `_build-and-load` Makefile target (used by `make local-up`) was **missing the `$(PLATFORM_FLAG)` parameter** in its build commands. **Before (BROKEN):** ```makefile _build-and-load: @$(CONTAINER_ENGINE) build -t $(BACKEND_IMAGE) components/backend @$(CONTAINER_ENGINE) build -t $(FRONTEND_IMAGE) components/frontend @$(CONTAINER_ENGINE) build -t $(OPERATOR_IMAGE) components/operator @$(CONTAINER_ENGINE) build -t $(RUNNER_IMAGE) -f components/runners/... ``` This caused: 1. Images built without explicit platform → defaulted to `amd64` 2. On Apple Silicon (arm64), amd64 images run via QEMU emulation 3. QEMU emulation is 4-6x slower and crashes during heavy builds like Next.js **Why other targets worked:** The public build targets (`build-frontend`, `build-backend`, etc.) correctly included `$(PLATFORM_FLAG)`, so running `make build-all PLATFORM=linux/arm64` worked fine. Only `make local-up` was broken. ## The Fix **After (FIXED):** ```makefile _build-and-load: @$(CONTAINER_ENGINE) build $(PLATFORM_FLAG) -t $(BACKEND_IMAGE) components/backend @$(CONTAINER_ENGINE) build $(PLATFORM_FLAG) -t $(FRONTEND_IMAGE) components/frontend @$(CONTAINER_ENGINE) build $(PLATFORM_FLAG) -t $(OPERATOR_IMAGE) components/operator @$(CONTAINER_ENGINE) build $(PLATFORM_FLAG) -t $(RUNNER_IMAGE) -f components/runners/... ``` **That's it!** Just added `$(PLATFORM_FLAG)` to 4 build commands. ## Additional Improvements While fixing the bug, we also added auto-detection so users don't have to manually set `PLATFORM`: 1. **Auto-detect architecture** (Makefile lines 18-38): - Apple Silicon → `PLATFORM=linux/arm64` (default) - Intel/AMD → `PLATFORM=linux/amd64` (default) 2. **Diagnostic tool**: ```bash make check-architecture # Shows detected vs active platform ``` 3. **Documentation**: Added troubleshooting guide in `docs/developer/local-development/kind.md` ## Impact **Before:** - Apple Silicon users: 15-20 min builds with frequent crashes - Had to manually set `PLATFORM=linux/arm64` (and even then `local-up` ignored it!) **After:** - Apple Silicon users: 4-6 min builds, no crashes - Auto-detects native architecture - Manual override still supported: `make local-up PLATFORM=linux/amd64` ## Files Changed - **Makefile** (lines 18-38, 669-675): Auto-detect + fix bug - **docs/developer/local-development/kind.md**: Troubleshooting guide - **e2e/scripts/load-images.sh**: Architecture validation ## Testing ```bash # Verify native builds make check-architecture # Should show: ✓ Using native architecture # Clean start make local-clean make local-up # Should complete in 4-6 minutes without crashes ``` Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

github-actions · 2026-01-28T08:07:07Z

Claude Code Review

Summary

This PR implements a hybrid system prompt approach for the Claude Code runner, combining the SDK's built-in claude_code prompt with workspace-specific context. The changes are minimal, focused, and well-documented. Additionally, the PR includes Makefile improvements for architecture auto-detection and a new comprehensive README for the runner component.

Overall Assessment: ✅ Strong PR - Well-executed change with excellent documentation and useful infrastructure improvements.

Issues by Severity

🚫 Blocker Issues

None - No blocking issues found.

🔴 Critical Issues

None - No critical issues found.

🟡 Major Issues

None - No major issues found.

🔵 Minor Issues

1. Missing Test Coverage for System Prompt Change

File: components/runners/claude-code-runner/adapter.py:508-511

Issue: The system prompt configuration change has no corresponding unit test to verify the hybrid array structure is correctly passed to the SDK.

Current state:

Existing tests cover auto-push, observability, privacy masking, etc.
No test validates system_prompt_config structure

Recommendation:
Add a unit test like:

# tests/test_system_prompt.py
def test_hybrid_system_prompt_structure(mocker):
    """Verify system prompt uses hybrid array with claude_code + workspace context."""
    # Mock dependencies
    adapter = ClaudeCodeRunnerAdapter(...)
    
    # Trigger initialization that builds system_prompt_config
    # ...
    
    # Assert structure
    assert isinstance(system_prompt_config, list)
    assert len(system_prompt_config) == 2
    assert system_prompt_config[0] == "claude_code"
    assert system_prompt_config[1]["type"] == "text"
    assert "Workspace Structure" in system_prompt_config[1]["text"]

Priority: Medium - Tests prevent regressions

2. Version Hardcoding in Minikube Manifest

File: components/manifests/minikube/frontend-deployment.yaml:44

Issue: The VTEAM_VERSION environment variable is hardcoded to a specific git hash (v0.0.19-8-g6d9251e), which will become stale.

Current:

- name: VTEAM_VERSION
  value: "v0.0.19-8-g6d9251e"  # Hardcoded

Recommendation:

Use a templating approach (Kustomize, Helm) for version injection
Or document that this file is for testing only and may have stale versions
Or add a CI check to warn when this diverges from git tags

Impact: Low - This is a minikube dev manifest, not production

Priority: Low - Nice-to-have cleanup

Positive Highlights

✅ Excellent Documentation

The new README.md (245 lines) is comprehensive and well-structured:

Clear architecture overview
Complete environment variables reference
System prompt rationale with before/after examples
API documentation
Security considerations
Development and testing guide

This sets a great standard for component documentation.

✅ Proper Architecture Auto-Detection

The Makefile improvements (lines 18-38) are thoughtful and user-friendly:

Auto-detects host architecture (arm64, amd64)
Provides clear warnings for cross-compilation
Allows manual override with PLATFORM=...
Includes make check-architecture for diagnostics

This will save developers from cryptic QEMU segfaults.

✅ Minimal, Focused Code Change

The core change is just 4 lines (adapter.py:508-511):

system_prompt_config = [
    "claude_code",
    {"type": "text", "text": workspace_prompt}
]

This is the ideal PR size for a system prompt change - easy to review, low risk, clear intent.

✅ Follows Python Configuration Standards

From CLAUDE.md:

✅ Uses proper formatting (consistent with existing code)
✅ No any types introduced
✅ Maintains existing patterns

✅ Security-Conscious Infrastructure Changes

The e2e/scripts/load-images.sh improvements:

Validates images exist before loading
Warns on architecture mismatches (doesn't block)
Provides actionable error messages

This prevents confusing failures during CI/local testing.

Recommendations

1. Add Unit Test for System Prompt Structure (Priority: Medium)

See Minor Issue #1 above. This ensures the hybrid prompt structure is validated.

2. Verify SDK Compatibility (Priority: High - But Likely Done)

Question for reviewer/author: Has this been tested with the actual Claude Agent SDK to confirm:

The array format ["claude_code", {...}] is supported?
The workspace context is correctly appended (not overriding)?
Claude receives both prompts as expected?

If yes: Document in PR description or test plan.
If no: Run a manual integration test before merging.

3. Consider Adding E2E Test Case (Priority: Low)

While not required, an E2E test could verify:

Claude responds to workspace-specific context (e.g., recognizes repo names)
Standard Claude Code capabilities still work (e.g., can use tools)

This could be a follow-up PR.

4. Update CHANGELOG or Release Notes (Priority: Medium)

This is a behavior change (albeit non-breaking). Consider adding to:

docs/decisions.md (lightweight decision log)
Component CHANGELOG (if one exists)
Next release notes

Rationale from CLAUDE.md:

"Decision Log: Lightweight chronological record of major decisions"

Architecture & Pattern Compliance

✅ Follows CLAUDE.md Guidelines

Python standards: Uses proper formatting, no any types
Documentation standards: New README is comprehensive and colocated
Configuration standards: Environment variables properly documented
Error handling: No changes to error patterns (N/A for this PR)

✅ No Security Issues

Token Handling: No token-related code changed ✅
Input Validation: No user input handling changed ✅
Container Security: No container config changed ✅
Secret Management: No secrets exposed in logs ✅

✅ No Performance Concerns

System prompt is built once per session (not per turn)
Array structure has negligible memory overhead
No new loops or expensive operations

Testing Checklist (From PR Description)

Reviewing the test plan:

Verify adapter.py changes don't break existing tests
⚠️ Action Required: Run pytest in components/runners/claude-code-runner/ to confirm
Test that Claude receives both system prompts correctly
⚠️ Action Required: Manual integration test or add unit test
Confirm workspace context is still available to Claude
⚠️ Action Required: Test with a real session, verify Claude knows repo structure
Validate no breaking changes in session behavior
⚠️ Action Required: Run E2E tests or manual smoke test
Check that documentation builds correctly
✅ README is Markdown (no build needed), but verify mkdocs if included in docs/

Files Changed Assessment

1. `Makefile` (+49, -11) ✅ Excellent

Changes:

Architecture auto-detection
check-architecture target
Platform display in help

Assessment: Well-implemented, user-friendly, follows established patterns.

2. `components/runners/claude-code-runner/adapter.py` (+4, -1) ✅ Excellent

Changes:

System prompt from dict → array

Assessment: Minimal, focused, clear intent. Exactly what a good PR looks like.

3. `components/runners/claude-code-runner/README.md` (+245, -0) ✅ Excellent

Assessment: Comprehensive, well-structured, sets a great standard. Only minor issue is it references line numbers (508-511, 1500-1575) which could become stale.

Suggestion: Use relative references like "in the _initialize_agent method" instead of line numbers.

4. `docs/developer/local-development/kind.md` (+32, -0) ✅ Good

Changes:

Documents architecture support
Troubleshooting for segfaults

Assessment: Helpful addition, prevents common developer issues.

5. `e2e/scripts/load-images.sh` (+20, -1) ✅ Good

Changes:

Image existence check
Architecture mismatch warnings

Assessment: Defensive programming, prevents cryptic failures.

6. `components/manifests/minikube/frontend-deployment.yaml` (+1, -1) 🟡 Minor Issue

Change: Version string update

Assessment: See Minor Issue #2 above - consider templating or documenting as test-only.

Final Verdict

✅ Approve with Minor Suggestions

Strengths:

Minimal, focused code change (4 lines)
Excellent documentation (README is exemplary)
Thoughtful infrastructure improvements (architecture detection)
Follows all CLAUDE.md standards
No security, performance, or architectural concerns

Minor Improvements:

Add unit test for system prompt structure
Verify SDK compatibility (likely already done)
Document in decision log

This PR demonstrates best practices:

Small, incremental changes
Comprehensive documentation
Developer experience improvements
No over-engineering

🎉 Great work! This is how to do a focused feature PR right.

Meta: Review Coverage

Loaded context files:

✅ CLAUDE.md
✅ .claude/context/backend-development.md (N/A - no backend changes)
✅ .claude/context/frontend-development.md (N/A - no frontend changes)
✅ .claude/context/security-standards.md
✅ .claude/patterns/k8s-client-usage.md (N/A - no K8s client changes)
✅ .claude/patterns/error-handling.md (N/A - no error handling changes)
✅ .claude/patterns/react-query-usage.md (N/A - no React changes)

Review focus areas:

✅ Code Quality - Excellent (minimal, focused change)
✅ Security - No issues
✅ Performance - No concerns
✅ Testing - Could add unit test
✅ Architecture - Follows patterns
✅ Error Handling - N/A (no changes)

Files reviewed: 6/6 (100% coverage)

🔍 View AI decision process (logs available for 90 days)

📋 View memory system files loaded (click to expand)

What Amber Loaded for Code Review

Amber automatically loaded these repository standards from the memory system:

CLAUDE.md - Master project instructions, development standards
backend-development.md - Go backend, K8s integration patterns
frontend-development.md - NextJS, Shadcn UI, React Query patterns
security-standards.md - Auth, RBAC, token handling
k8s-client-usage.md - User token vs service account patterns
error-handling.md - Consistent error patterns
react-query-usage.md - Data fetching patterns

Impact: This review used your repository's specific code quality standards, security patterns, and best practices from the memory system (PRs #359, #360) - not just generic code review guidelines.

This comment has been minimized.

Sign in to view

jeremyeder requested a review from Gkrumbach07 January 28, 2026 07:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use hybrid system prompt with claude_code base and workspace context #539

Use hybrid system prompt with claude_code base and workspace context #539

Uh oh!

jeremyeder commented Jan 28, 2026

Uh oh!

This comment has been minimized.

jeremyeder commented Jan 28, 2026

Uh oh!

github-actions bot commented Jan 28, 2026 •

edited

Loading

What Amber Loaded for Code Review

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Use hybrid system prompt with claude_code base and workspace context #539

Are you sure you want to change the base?

Use hybrid system prompt with claude_code base and workspace context #539

Uh oh!

Conversation

jeremyeder commented Jan 28, 2026

Summary

Changes Made

Benefits

Before

After

What Claude Now Receives

Test Plan

Files Changed

Uh oh!

This comment has been minimized.

jeremyeder commented Jan 28, 2026

Uh oh!

github-actions bot commented Jan 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Claude Code Review

Summary

Issues by Severity

🚫 Blocker Issues

🔴 Critical Issues

🟡 Major Issues

🔵 Minor Issues

1. Missing Test Coverage for System Prompt Change

2. Version Hardcoding in Minikube Manifest

Positive Highlights

✅ Excellent Documentation

✅ Proper Architecture Auto-Detection

✅ Minimal, Focused Code Change

✅ Follows Python Configuration Standards

✅ Security-Conscious Infrastructure Changes

Recommendations

1. Add Unit Test for System Prompt Structure (Priority: Medium)

2. Verify SDK Compatibility (Priority: High - But Likely Done)

3. Consider Adding E2E Test Case (Priority: Low)

4. Update CHANGELOG or Release Notes (Priority: Medium)

Architecture & Pattern Compliance

✅ Follows CLAUDE.md Guidelines

✅ No Security Issues

✅ No Performance Concerns

Testing Checklist (From PR Description)

Files Changed Assessment

1. Makefile (+49, -11) ✅ Excellent

2. components/runners/claude-code-runner/adapter.py (+4, -1) ✅ Excellent

3. components/runners/claude-code-runner/README.md (+245, -0) ✅ Excellent

4. docs/developer/local-development/kind.md (+32, -0) ✅ Good

5. e2e/scripts/load-images.sh (+20, -1) ✅ Good

6. components/manifests/minikube/frontend-deployment.yaml (+1, -1) 🟡 Minor Issue

Final Verdict

✅ Approve with Minor Suggestions

Meta: Review Coverage

What Amber Loaded for Code Review

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

github-actions bot commented Jan 28, 2026 •

edited

Loading

1. `Makefile` (+49, -11) ✅ Excellent

2. `components/runners/claude-code-runner/adapter.py` (+4, -1) ✅ Excellent

3. `components/runners/claude-code-runner/README.md` (+245, -0) ✅ Excellent

4. `docs/developer/local-development/kind.md` (+32, -0) ✅ Good

5. `e2e/scripts/load-images.sh` (+20, -1) ✅ Good

6. `components/manifests/minikube/frontend-deployment.yaml` (+1, -1) 🟡 Minor Issue