A CLI-driven client using a simple configuration system. This application provides a lightweight, efficient interface for interacting with the OpenAI API through command-line commands.
```
go build -o cli-agent
./cli-agent
```

Output:

```
No configuration file specified. Use --config or -c to specify a config file.

Available commands:
  chat <prompt>      - Send a chat completion request
  models --list      - List all available models
  models --get <id>  - Get details for a specific model
  parse <text>       - Extract bash command from text using <do>...</do> tags
  execute <command>  - Execute a bash command
  agent <prompt>     - Run autonomous agent with chat, parse, and execute loop
```

```
./cli-agent --config example.yaml
```

Or using the short flag:
```
./cli-agent -c example.yaml
```

Extract bash commands from text using `<do>...</do>` tags. This is useful for parsing LLM responses that contain executable commands.
```
./cli-agent parse "Here's the command: <do>ls -la</do>"
```

Output:
```
ls -la
```

You can also pipe input from stdin:
```
echo "Run this: <do>echo 'Hello World'</do>" | ./cli-agent parse
```

The parser supports multi-line commands and complex bash syntax:
```
./cli-agent parse "<do>
for file in *.txt; do
  echo \"Processing: \$file\"
done
</do>"
```

The parser provides clear error messages for common issues:
- No bash action found: the input doesn't contain `<do>...</do>` tags
- Multiple bash actions: the input contains more than one `<do>...</do>` block
- Empty bash action: the `<do>...</do>` tags contain no command
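The extraction rules above can be sketched in a few lines of Go. This is an illustrative sketch, not the actual `internal/parser` code; the function name and error messages are assumptions:

```go
package main

import (
	"errors"
	"fmt"
	"regexp"
	"strings"
)

// (?s) makes "." match newlines, so multi-line commands are supported.
var doTag = regexp.MustCompile(`(?s)<do>(.*?)</do>`)

// extractBashAction returns the single command wrapped in <do>...</do> tags,
// mirroring the three error cases documented above.
func extractBashAction(text string) (string, error) {
	matches := doTag.FindAllStringSubmatch(text, -1)
	switch {
	case len(matches) == 0:
		return "", errors.New("no bash action found")
	case len(matches) > 1:
		return "", errors.New("multiple bash actions found")
	}
	cmd := strings.TrimSpace(matches[0][1])
	if cmd == "" {
		return "", errors.New("empty bash action")
	}
	return cmd, nil
}

func main() {
	cmd, err := extractBashAction("Here's the command: <do>ls -la</do>")
	fmt.Println(cmd, err) // ls -la <nil>
}
```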
Execute bash commands with configurable engine support. This command requires a configuration file to be specified.
```
./cli-agent -c example.yaml execute "ls -la"
```

You can also pipe commands from stdin:
```
echo "echo 'Hello World'" | ./cli-agent -c example.yaml execute
```

The execute command uses the execution configuration from your config file:
- Local execution (default): Commands run directly on your system
- Docker execution: Commands run inside a Docker container
- Podman execution: Commands run inside a Podman container
- Custom wrappers: Commands run with custom prefixes
The execute command displays:
- stdout: Standard output from the command
- stderr: Standard error output (printed to stderr)
- Exit code: The command's exit code
- Duration: Execution time
Example output:

```
$ ./cli-agent -c example.yaml execute "echo 'Hello World'"
Hello World
Exit code: 0
Duration: 5.2ms
```

You can combine parse and execute using pipes:
```
./cli-agent parse "Run this: <do>echo 'Hello World'</do>" | ./cli-agent -c example.yaml execute
```

This allows you to extract commands from LLM responses and execute them in a single workflow.
Run an autonomous agent that combines chat, parse, and execute in a loop. The agent manages conversation history and can execute bash commands extracted from LLM responses.
```
./cli-agent -c example.yaml agent "Create a simple Python script that prints 'Hello World'"
```

The agent will:
- Send your prompt to the OpenAI API
- Parse the response for bash commands wrapped in `<do>...</do>` tags
- Execute any found commands
- Feed the execution results back to the LLM
- Repeat until no more commands are found or the maximum number of turns is reached
The agent behavior is controlled by the `agent` section in your configuration file:
```yaml
agent:
  system: "You are a helpful coding assistant. Use <do>...</do> tags to wrap bash commands you want to execute."
  max_turns: 10
```

- `agent.system`: System message sent at the beginning of each agent session to set context (default: `""`)
- `agent.max_turns`: Maximum number of agent turns before stopping (default: 10)
The agent follows this workflow:

- Initialization: send the system message (if configured) to establish context
- Turn loop:
  - Send the current prompt/conversation to the LLM
  - Parse the response for bash commands using `<do>...</do>` tags
  - If a command is found:
    - Execute the command using the configured execution engine
    - Capture stdout, stderr, exit code, and duration
    - Feed the execution results back to the LLM as context
    - Increment the turn counter
  - If no command is found or max turns is reached, terminate the loop
- Output: display the final LLM response
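The turn loop above can be sketched in Go. This is a simplified sketch, not the CLI's actual implementation: the chat, parse, and execute stages are passed in as functions (stubbed in `main`), and all names are illustrative:

```go
package main

import "fmt"

// chatFn abstracts the LLM call so the loop can be sketched without API
// access; in the real CLI this stage calls the OpenAI chat endpoint.
type chatFn func(history []string) string

// runAgent runs the documented loop: chat, parse for a <do> command,
// execute, feed results back, until no command is found or maxTurns
// is reached. It returns the final LLM reply.
func runAgent(chat chatFn, parse func(string) (string, bool),
	execute func(string) string, prompt string, maxTurns int) string {
	history := []string{prompt}
	var reply string
	for turn := 0; turn < maxTurns; turn++ {
		reply = chat(history)
		history = append(history, reply)
		cmd, found := parse(reply)
		if !found {
			break // termination: no command in the response
		}
		result := execute(cmd)
		history = append(history, result) // feed results back as context
	}
	return reply
}

func main() {
	// Stub LLM: issues one command on the first turn, then stops.
	calls := 0
	chat := func([]string) string {
		calls++
		if calls == 1 {
			return "<do>echo hi</do>"
		}
		return "done"
	}
	parse := func(s string) (string, bool) { return "echo hi", s == "<do>echo hi</do>" }
	execute := func(string) string { return "hi" }
	fmt.Println(runAgent(chat, parse, execute, "task", 10)) // done
}
```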
The agent mode is particularly useful for:
- Automated coding tasks: Let the agent write, test, and iterate on code
- File operations: Create, read, modify files through natural language
- System administration: Execute commands and analyze results
- Debugging: Run tests, check logs, and fix issues iteratively
- Exploratory analysis: Run commands and analyze results in a loop
The agent includes several safety features:
- Turn limit: Prevents infinite loops with configurable max turns
- Command validation: All commands are parsed and validated before execution
- Execution timeout: Commands are terminated if they exceed the configured timeout
- Error handling: Execution errors are captured and fed back to the LLM for recovery
Send a chat completion request to the OpenAI API:
./cli-agent -c example.yaml chat "What is the capital of France?"With custom parameters:
./cli-agent -c example.yaml chat --model gpt-4o --temperature 0.5 --max-tokens 1000 "Explain quantum computing"With a system message:
./cli-agent -c example.yaml chat --system "You are a helpful assistant." "Hello!"-m, --model: Model to use for chat completion (default: from config)-t, --temperature: Sampling temperature (0-2, default: from config)-n, --max-tokens: Maximum tokens to generate (default: from config)-p, --top-p: Nucleus sampling threshold (0-1, default: from config)-s, --system: System message to set context
List all available models:
```
./cli-agent -c example.yaml models --list
```

Get details for a specific model:

```
./cli-agent -c example.yaml models --get gpt-4o
```

Flags:

- `-l, --list`: List all available models
- `-g, --get`: Get details for a specific model
The CLI agent supports the following configuration file formats:
- YAML (`.yaml`, `.yml`)
- JSON (`.json`)
- TOML (`.toml`)
An example configuration file is provided at `example.yaml`:
```yaml
name: "CLI Agent OpenAI Client"
version: "1.0.0"

settings:
  debug: false
  verbose: true

openai:
  # Required: OpenAI API base URL
  base_url: "https://api.openai.com/v1"
  # Required: Your OpenAI API key
  api_key: "sk-..."
  # HTTP client configuration
  http_client:
    timeout: 120       # Request timeout in seconds
    max_retries: 3     # Maximum number of retries
    retry_delay: 1000  # Delay between retries in milliseconds
  # Default parameters for API requests
  defaults:
    model: "gpt-4o"
    temperature: 0.7
    max_tokens: 2048
    top_p: 1.0
```

- `name`: Application name
- `version`: Application version
- `settings.debug`: Enable debug logging
- `settings.verbose`: Enable verbose logging
- `openai.base_url`: OpenAI API base URL (required)
- `openai.api_key`: Your OpenAI API key (required)
- `openai.http_client.timeout`: Request timeout in seconds (default: 120)
- `openai.http_client.max_retries`: Maximum number of retries (default: 3)
- `openai.http_client.retry_delay`: Delay between retries in milliseconds (default: 1000)
- `openai.defaults.model`: Default model to use (default: `gpt-4o`)
- `openai.defaults.temperature`: Default sampling temperature (default: 0.7)
- `openai.defaults.max_tokens`: Default maximum tokens (default: 2048)
- `openai.defaults.top_p`: Default nucleus sampling threshold (default: 1.0)
The agent configuration controls the autonomous agent behavior:

- `agent.system`: System message for agent context (default: `""`)
  - Sent at the beginning of each agent session to set context
  - Use this to define the agent's role, behavior, and constraints
  - Example: "You are a helpful coding assistant. Use `<do>...</do>` tags to wrap bash commands you want to execute."
- `agent.max_turns`: Maximum number of agent turns (default: 10)
  - Prevents infinite loops in case the agent doesn't terminate
  - Each turn consists of: chat → parse → execute → feedback
  - Set it higher for complex tasks, lower for quick operations
The execution configuration allows you to customize how bash commands are executed. This is particularly useful for running commands in different environments (Docker, Podman, etc.) or with custom wrappers.
- `execution.engine`: Command prefix prepended to bash commands (default: empty for local execution)
- `execution.timeout`: Command execution timeout in seconds (default: 30)
Execution Engine Examples:

- Local execution (default):

  ```yaml
  execution:
    engine: ""
    timeout: 30
  ```

  The command `ls -la` executes as: `bash -c "ls -la"`

- Docker execution:

  ```yaml
  execution:
    engine: "docker run --rm -v $(pwd):/workspace -w /workspace ubuntu:latest bash -c"
    timeout: 30
  ```

  The command `ls -la` executes as: `docker run --rm -v $(pwd):/workspace -w /workspace ubuntu:latest bash -c "ls -la"`

- Podman execution:

  ```yaml
  execution:
    engine: "podman run --rm --userns keep-id alpine:latest sh -c"
    timeout: 30
  ```

  The command `ls -la` executes as: `podman run --rm --userns keep-id alpine:latest sh -c "ls -la"`

- Docker Compose execution:

  ```yaml
  execution:
    engine: "docker compose exec -T app bash -c"
    timeout: 30
  ```

  The command `ls -la` executes as: `docker compose exec -T app bash -c "ls -la"`

- Custom wrapper:

  ```yaml
  execution:
    engine: "my-wrapper --timeout 30 --verbose bash -c"
    timeout: 30
  ```

  The command `ls -la` executes as: `my-wrapper --timeout 30 --verbose bash -c "ls -la"`
The execution engine provides maximum flexibility for running commands in different environments without requiring code changes.
```
CLI_Agent/
├── cmd/                      # Command-line interface entry points
│   ├── root.go               # Main CLI command definition
│   ├── chat.go               # Chat completions command
│   ├── models.go             # Models API command
│   ├── parse.go              # Bash command parser command
│   ├── execute.go            # Command execution command
│   └── agent.go              # Autonomous agent command
├── internal/                 # Private application code
│   ├── config/               # Configuration parsing and validation
│   │   ├── config.go         # Config loading logic
│   │   └── config_test.go    # Config unit tests
│   ├── executor/             # Command execution engine
│   │   ├── executor.go       # Executor interface and implementation
│   │   └── executor_test.go  # Executor unit tests
│   ├── openai/               # OpenAI client wrapper
│   │   ├── client.go         # Client initialization
│   │   ├── chat.go           # Chat completions API
│   │   ├── models.go         # Models API
│   │   ├── errors.go         # Error handling
│   │   ├── client_test.go    # Client unit tests
│   │   └── chat_test.go      # Chat completion unit tests
│   ├── logger/               # Verbose logging utilities
│   │   └── logger.go         # Structured logging implementation
│   ├── parser/               # Bash command parser
│   │   ├── parser.go         # Command extraction from LLM responses
│   │   └── parser_test.go    # Parser unit tests
│   └── mocks/                # Generated mock clients for testing
│       └── client_mock.go    # GoMock generated mock
├── tests/                    # Integration tests and test helpers
│   ├── helpers.go            # Test helper functions
│   └── integration/          # Integration tests
│       ├── chat_test.go      # Chat integration tests
│       └── models_test.go    # Models integration tests
├── testdata/                 # Test fixtures and constants
│   ├── config/               # Test configuration files
│   │   ├── valid.yaml        # Valid YAML config
│   │   ├── valid.json        # Valid JSON config
│   │   ├── valid.toml        # Valid TOML config
│   │   └── invalid.yaml      # Invalid config for testing
│   └── test_constants/       # Test constants
│       └── constants.go      # Shared test constants
├── example.yaml              # Example configuration file
├── go.mod                    # Go module definition
└── main.go                   # Application entry point
```

Dependencies:

- viper - Configuration management
- pflag - POSIX-compliant command-line flag parsing
- openai-go/v3 - OpenAI API client library
- GoMock - Mock generation framework for testing
To add new configuration fields:

1. Update the `Config` struct in `internal/config/config.go`
2. Add validation logic in the `ValidateAndSetDefaults()` method
3. Update the example configuration file
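As a sketch of those steps, a new field and its default might look like the following. The field names here are inferred from the example config, and the tags assume viper's `mapstructure` convention; the real struct in `internal/config/config.go` may differ:

```go
package main

import "fmt"

// HTTPClient mirrors the http_client section of the example config.
type HTTPClient struct {
	Timeout    int `mapstructure:"timeout"`
	MaxRetries int `mapstructure:"max_retries"`
	RetryDelay int `mapstructure:"retry_delay"`
}

// Config is an illustrative subset of the full configuration.
type Config struct {
	Name   string `mapstructure:"name"`
	OpenAI struct {
		BaseURL    string     `mapstructure:"base_url"`
		APIKey     string     `mapstructure:"api_key"`
		HTTPClient HTTPClient `mapstructure:"http_client"`
	} `mapstructure:"openai"`
}

// ValidateAndSetDefaults rejects missing required fields and fills in the
// documented defaults for optional ones.
func (c *Config) ValidateAndSetDefaults() error {
	if c.OpenAI.BaseURL == "" {
		return fmt.Errorf("openai.base_url is required")
	}
	if c.OpenAI.APIKey == "" {
		return fmt.Errorf("openai.api_key is required")
	}
	if c.OpenAI.HTTPClient.Timeout == 0 {
		c.OpenAI.HTTPClient.Timeout = 120 // documented default
	}
	return nil
}

func main() {
	var c Config
	c.OpenAI.BaseURL = "https://api.openai.com/v1"
	c.OpenAI.APIKey = "sk-..."
	fmt.Println(c.ValidateAndSetDefaults(), c.OpenAI.HTTPClient.Timeout) // <nil> 120
}
```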
The application provides comprehensive logging at multiple levels:

- INFO: General information about application state
- VERBOSE: Detailed information about operations (enabled by `verbose: true`)
- DEBUG: Debug-level information for troubleshooting (enabled by `debug: true`)
- ERROR: Error messages and stack traces
All logs include timestamps and log levels for easy filtering and analysis.
The application includes comprehensive error handling:
- Configuration validation errors with clear messages
- API errors wrapped with custom error types
- Network error handling with retry logic
- Detailed error logging in verbose mode
The project includes comprehensive unit and integration tests to ensure code quality and functionality.
Run all tests:

```
go test ./...
```

Run unit tests only:

```
go test ./internal/... ./cmd/...
```

Integration tests require valid API credentials and are tagged with the `integration` build tag:

```
go test -tags=integration ./tests/...
```

```
PROJECT_ROOT="$(pwd)" go test ./tests/integration -tags=integration
```

Run with coverage:

```
go test -cover ./...
go test -coverprofile=coverage.out ./...
go tool cover -html=coverage.out
```

Run a specific test:

```
go test -run TestLoadValidYAML ./internal/config
```

Run with verbose output:

```
go test -v ./...
```

Test file layout:

```
CLI_Agent/
├── internal/
│   ├── config/
│   │   └── config_test.go    # Config loading and validation tests
│   ├── executor/
│   │   └── executor_test.go  # Executor unit tests
│   ├── openai/
│   │   ├── client_test.go    # Client initialization tests
│   │   └── chat_test.go      # Chat completion tests
│   ├── parser/
│   │   └── parser_test.go    # Parser unit tests
│   └── mocks/
│       └── client_mock.go    # GoMock generated mock client
├── testdata/
│   ├── config/
│   │   ├── valid.yaml        # Valid YAML config
│   │   ├── valid.json        # Valid JSON config
│   │   ├── valid.toml        # Valid TOML config
│   │   └── invalid.yaml      # Invalid config for testing
│   └── test_constants/
│       └── constants.go      # Shared test constants
└── tests/
    ├── helpers.go            # Test helper functions
    └── integration/
        ├── chat_test.go      # Chat integration tests
        └── models_test.go    # Models integration tests
```

Unit tests cover individual components in isolation:
- Config Package: Tests configuration loading, validation, and default value setting
- Executor Package: Tests command execution with different engines, timeout handling, and error scenarios
- OpenAI Package: Tests client initialization, request building, and error handling
- Parser Package: Tests bash command extraction from LLM responses with comprehensive edge case coverage
- Agent Package: Tests autonomous agent loop, conversation history management, and turn counting
Unit tests use GoMock to test functionality without requiring API calls:

- Mock Client: `internal/mocks/client_mock.go` provides a GoMock-generated test double for the OpenAI client
- Test Constants: `testdata/test_constants/constants.go` provides shared test constants
Example of using the mock client in tests:

```go
// Create mock controller and client
ctrl := gomock.NewController(t)
mockClient := mocks.NewMockCLIClient(ctrl)

// Set up expected behavior
mockClient.EXPECT().
	NewCompletion(gomock.Any(), gomock.Any()).
	Return(MockChatCompletionResponse("Test response content"), nil)

// Create request
req := &ChatCompletionRequest{
	Model:       "gpt-4",
	Messages:    []openaiapi.ChatCompletionMessageParamUnion{openaiapi.UserMessage("test")},
	Temperature: f64(0.7),
	MaxTokens:   intP(2048),
	TopP:        f64(1.0),
}

// Execute with mock response
resp, err := CreateChatCompletion(mockClient, context.Background(), req)

// Verify response
if err != nil {
	t.Fatalf("Expected no error, got: %v", err)
}
if resp.Choices[0].Message.Content != "Test response content" {
	t.Errorf("Expected content 'Test response content', got '%s'", resp.Choices[0].Message.Content)
}
```

Benefits of mock-based testing:
- No API Dependencies: Tests run without requiring valid API credentials
- Faster Tests: Mock responses eliminate network latency
- Consistent Test Data: Using test constants ensures consistent mock data
- Better Test Coverage: Can test error scenarios that are hard to reproduce with real API
Integration tests exercise the complete CLI workflow with actual API calls. These tests require:
- A valid configuration file with API credentials
- The CLI binary to be built
1. Create or update your configuration file (e.g., `config.yaml`) with valid API credentials:

   ```yaml
   openai:
     base_url: "https://your-api-endpoint/v1"
     api_key: "your-api-key"
     # ... other settings
   ```

2. Set the `PROJECT_ROOT` environment variable to point to your project root:

   ```
   export PROJECT_ROOT=$(pwd)
   ```

3. Run integration tests:

   ```
   go test -tags=integration ./tests/...
   ```
The integration tests cover:

- Models List: tests `./cli-agent -c config.yaml models --list`
  - Verifies successful execution
  - Checks for the "Owned By" pattern in the output
- Models Get: tests `./cli-agent -c config.yaml models --get <model-id>`
  - Verifies successful execution
  - Checks for "ID:" in the output
  - Validates that model details are displayed
- Chat: tests `./cli-agent -c config.yaml chat "What is the capital of France?"`
  - Verifies successful execution
  - Validates that a response is received
  - Tests with invalid configuration
- Executor: tests command execution with parser integration
  - Verifies command extraction from LLM responses
  - Tests different execution engines (local, custom)
  - Validates timeout behavior and error handling
  - Tests stdout/stderr capture
  - Validates the end-to-end workflow
- Agent: tests the autonomous agent workflow
  - Verifies conversation history management across turns
  - Tests turn counting and max turns enforcement
  - Validates system message configuration
  - Tests termination conditions (no command, max turns)
  - Validates error handling and recovery
  - Tests integration with the chat, parse, and execute commands
Test fixtures are provided in the `testdata/` directory:
- Configuration Files: Valid and invalid configurations for testing different scenarios
- Test Constants: Shared test constants for consistent test data
The `testdata/test_constants/constants.go` file provides the following test constants:

- `TestBaseURL`: Test API base URL
- `TestAPIKey`: Test API key
- `TestConfigName`: Test configuration name
- `TestVersion`: Test version
- `TestModel`: Test model identifier
- `TestTemperature`: Test temperature value
- `TestMaxTokens`: Test max tokens value
- `TestTopP`: Test top_p value
- `TestTimeout`: Test timeout value
- `TestMaxRetries`: Test max retries value
- `TestRetryDelay`: Test retry delay value
- `TestResponseContent`: Test response content
These constants provide consistent test data across all tests, ensuring reproducibility and reducing test maintenance.
The `tests/helpers.go` file provides utility functions for testing:

- `ConfigPathIfExisting`: Returns the config path if the file exists
- `RunCLICommand`: Executes CLI commands and captures output
- `GetRootAndCLIAgent`: Gets the project root and CLI agent binary path
- `GetFirstModel`: Extracts the first model from the models list output
Integration tests can be skipped by using the short test flag:
```
go test -short ./internal/...
```

- API Key Protection
  - Never log API keys
  - Consider using environment variables for sensitive data
  - Do not commit configuration files with API keys to version control
- HTTPS Only
  - Ensure `base_url` uses HTTPS
  - The application will warn about insecure HTTP connections
- Input Validation
  - All user inputs are validated
  - Prompts are sanitized before sending to the API
  - Request sizes are limited to prevent abuse