Model Selection

The Right Model for Every Task

Siya supports a range of AI models, from cloud-hosted options to local models that keep data on your machine. Understanding each model’s strengths helps you choose one that balances performance, cost, and capabilities for the task at hand.

Model Overview

Cloud Models

Cutting-Edge AI at Your Fingertips

Access the most powerful AI models through secure API connections

Claude Models
OpenAI Models
Google Models

Anthropic's Claude Family

Industry-leading models known for reasoning, safety, and helpful responses

Claude Opus 4

The Flagship Model

Claude Opus 4 represents the pinnacle of AI capability, excelling at:

Complex reasoning and analysis
Architecture design and planning
Code generation and debugging
Creative problem solving
Multi-step task execution

Specifications

Context: 200K tokens
Strengths: Deep reasoning
Speed: 20-30 tokens/sec
Best for: Complex tasks
Cost: Premium tier

Choose Opus 4 when accuracy and reasoning depth matter more than speed

Claude Sonnet 4

The Balanced Performer

Sonnet 4 offers an optimal balance of capability and efficiency:

Fast response times
Excellent code understanding
Strong analytical skills
Cost-effective for most tasks
Great for iterative development

Specifications

Context: 200K tokens
Strengths: Balance
Speed: 40-50 tokens/sec
Best for: Daily coding
Cost: Standard tier

Sonnet 4 is recommended for 80% of development tasks
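
To make the speed figures above concrete, the following sketch estimates how long a response of a given length would take at the listed generation rates. The helper name and the 1,000-token example are illustrative assumptions; only the tokens/sec ranges come from the specifications above, and real latency also depends on network and prompt-processing time.

```python
# Generation speeds (tokens/sec) as listed in the specifications above.
SPEED_RANGES = {
    "Claude Opus 4": (20, 30),
    "Claude Sonnet 4": (40, 50),
}


def estimate_seconds(model: str, output_tokens: int) -> tuple[float, float]:
    """Return (best_case, worst_case) generation time in seconds.

    Hypothetical helper: it ignores network latency and prompt processing.
    """
    slow, fast = SPEED_RANGES[model]
    return output_tokens / fast, output_tokens / slow


for name in SPEED_RANGES:
    best, worst = estimate_seconds(name, 1000)
    print(f"{name}: {best:.0f}-{worst:.0f} s for 1,000 output tokens")
```

Under these assumptions, a 1,000-token answer takes roughly 33 to 50 seconds on Opus 4 and 20 to 25 seconds on Sonnet 4, which is one way to read the "Fast enough for iteration" trade-off.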

Specialized Models

Purpose-Built for Specific Domains

Models optimized for particular use cases and languages

DeepSeek Coder

The Code Specialist

Trained specifically for programming tasks:

Superior code completion
Bug detection and fixing
Code explanation
Refactoring suggestions
Multi-language support

Supported Languages: Python, JavaScript, TypeScript, Java, C++, Go, Rust, and more

Performance Metrics

Use DeepSeek for intensive coding sessions and complex debugging

Qwen 2.5

Multilingual Excellence

Outstanding performance across languages:

Native multilingual training
Strong in Asian languages
Excellent reasoning
Cultural awareness
Technical accuracy

Language Support: English, Chinese, Japanese, Korean, Spanish, French, German, and 50+ more

Use Cases

International projects
Localization tasks
Cross-cultural apps
Global documentation
Multi-region deployment

Local Models

Privacy-First AI on Your Machine

Run powerful models locally without an internet connection

Setup Guide
Available Models
Performance

Download Model

Model Sources

Download llamafile models from:

Install Model

# Create models directory
mkdir -p ~/.siya/local-models

# Move downloaded model
mv ~/Downloads/model.llamafile ~/.siya/local-models/

# Make executable
chmod +x ~/.siya/local-models/model.llamafile

Configure Siya

The model appears automatically in Siya’s model selector

First Run

The first run may take a while as the model loads into memory

Local models require 8-64GB RAM depending on model size
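
As a quick sanity check after the install steps, the sketch below lists the `.llamafile` files in a directory and reports whether each one is executable. The helper name `list_local_models` is hypothetical and not part of Siya; the directory is the `~/.siya/local-models` path used in the install commands.

```python
import os
from pathlib import Path


def list_local_models(models_dir: Path) -> list[tuple[str, bool]]:
    """Return (file name, is_executable) for each .llamafile in models_dir."""
    if not models_dir.is_dir():
        return []
    return [
        (path.name, os.access(path, os.X_OK))
        for path in sorted(models_dir.glob("*.llamafile"))
    ]


if __name__ == "__main__":
    models_dir = Path.home() / ".siya" / "local-models"
    for name, executable in list_local_models(models_dir):
        status = "ready" if executable else "needs chmod +x"
        print(f"{name}: {status}")
```

A model reported as "needs chmod +x" has been moved into place but skipped the final install command.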

Local Model Performance

Performance depends on your hardware configuration

Hardware Requirements

Minimum Specs

CPU: 8-core modern processor
RAM: 16GB (8GB models)
Storage: 50GB free
OS: macOS 12+, Windows 10+

Recommended Specs

CPU: Apple M2/M3 or equivalent
RAM: 32-64GB
GPU: Optional but helpful
Storage: 100GB+ SSD
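
The two spec tiers can be read as a simple threshold check on installed memory. This is a minimal sketch: the function name and tier labels are illustrative, and the thresholds are the 16GB minimum and the 32GB lower bound of the recommended range listed above.

```python
MIN_RAM_GB = 16          # "Minimum Specs" above
RECOMMENDED_RAM_GB = 32  # lower bound of "Recommended Specs" above


def ram_tier(installed_gb: float) -> str:
    """Classify installed RAM against the documented tiers (illustrative)."""
    if installed_gb >= RECOMMENDED_RAM_GB:
        return "recommended"
    if installed_gb >= MIN_RAM_GB:
        return "minimum"
    return "below minimum"


for gb in (8, 16, 64):
    print(f"{gb}GB -> {ram_tier(gb)}")
```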

Optimization Tips

Enable GPU Acceleration

export LLAMAFILE_GPU=1
export CUDA_VISIBLE_DEVICES=0

Adjust Context Size

export LLAMAFILE_CTX=4096  # Reduce for speed

Use Quantization

Choose Q4 or Q5 quantized models for better performance
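
As a rough illustration of why quantization helps, weight memory scales with the number of bits stored per weight. The sketch below uses the approximation size ≈ parameters × bits / 8. The 7-billion-parameter model is a hypothetical example; real Q4 and Q5 files are somewhat larger because they store scaling metadata alongside the weights, and runtime memory also includes the context cache.

```python
def weight_size_gb(num_params: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights in gigabytes (10^9 bytes)."""
    return num_params * bits_per_weight / 8 / 1e9


PARAMS = 7e9  # hypothetical 7-billion-parameter model

for label, bits in (("16-bit", 16), ("Q5", 5), ("Q4", 4)):
    print(f"{label}: ~{weight_size_gb(PARAMS, bits):.1f} GB")
```

By this estimate the same model needs about 14 GB of weights at 16-bit but only about 3.5 GB at Q4, which is why quantized models fit on machines near the minimum specs.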

Model Selection Strategy

Choose the Right Model Every Time

Strategic guidance for optimal model selection

By Task Type
By Project Size
By Constraints

Software Architecture

Recommended: Claude Opus 4

Why Opus 4?

Deep reasoning for design decisions
Understands complex systems
Excellent at trade-off analysis
Can model entire architectures

Alternative: GPT-5 for broader perspective

Code Implementation

Recommended: DeepSeek Coder

Why DeepSeek?

Specialized for code generation
Superior completion accuracy
Understands language idioms
Fast and efficient

Alternative: Claude Sonnet 4 for general coding

Debugging & Analysis

Recommended: Claude Sonnet 4

Why Sonnet 4?

Balanced analysis capabilities
Good at finding subtle bugs
Clear explanations
Fast enough for iteration

Alternative: Local Qwen for privacy-sensitive code

Documentation

Recommended: GPT-4 Turbo

Why GPT-4 Turbo?

Natural writing style
Good at examples
Consistent formatting
Cost-effective

Alternative: Gemini Pro for large docs
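
The recommendations in this section can be summarized as a lookup table. The sketch below only restates the pairings above in code; the dictionary keys and the `pick_model` helper are illustrative names, not Siya settings.

```python
# Task type -> (recommended model, alternative), as listed above.
RECOMMENDATIONS = {
    "architecture": ("Claude Opus 4", "GPT-5"),
    "implementation": ("DeepSeek Coder", "Claude Sonnet 4"),
    "debugging": ("Claude Sonnet 4", "Local Qwen"),
    "documentation": ("GPT-4 Turbo", "Gemini Pro"),
}


def pick_model(task_type: str, use_alternative: bool = False) -> str:
    """Return the recommended (or alternative) model for a task type."""
    recommended, alternative = RECOMMENDATIONS[task_type]
    return alternative if use_alternative else recommended


print(pick_model("debugging"))
print(pick_model("debugging", use_alternative=True))
```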

Model Comparison

Side-by-Side Model Comparison

Detailed comparison to help you make informed decisions

Switching Models

Dynamic Model Switching

Switch models on the fly based on your needs

Manual Switching
Automatic Switching

Quick Switch

Press ⌘ + M to open the model selector

Menu Bar

Click the model name in the status bar

Command

Type “switch to GPT-5” in chat

Settings

Change the default model in preferences

Smart Model Selection

Enable automatic model switching based on task

{
  "modelSelection": {
    "autoSwitch": true,
    "rules": [
      {
        "pattern": "debug|fix|error",
        "model": "claude-sonnet-4"
      },
      {
        "pattern": "architect|design|plan",
        "model": "claude-opus-4"
      },
      {
        "pattern": "quick|simple|fast",
        "model": "gemini-flash"
      },
      {
        "complexity": "> 8",
        "model": "claude-opus-4"
      }
    ]
  }
}

Siya learns your preferences and improves selection over time
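
How Siya evaluates these rules internally is not documented here. The sketch below shows one plausible interpretation, assuming rules are checked in order, `pattern` is a case-insensitive regular expression searched against the prompt, `complexity` is a comparison against a numeric score supplied by the caller, and the first matching rule wins. The `select_model` function, its `complexity` argument, and the default model are all hypothetical.

```python
import operator
import re

# Same rules as the configuration above.
RULES = [
    {"pattern": "debug|fix|error", "model": "claude-sonnet-4"},
    {"pattern": "architect|design|plan", "model": "claude-opus-4"},
    {"pattern": "quick|simple|fast", "model": "gemini-flash"},
    {"complexity": "> 8", "model": "claude-opus-4"},
]

COMPARATORS = {
    ">": operator.gt,
    ">=": operator.ge,
    "<": operator.lt,
    "<=": operator.le,
}


def complexity_matches(expression: str, score: float) -> bool:
    """Evaluate an expression such as "> 8" against a numeric score."""
    symbol, threshold = expression.split()
    return COMPARATORS[symbol](score, float(threshold))


def select_model(prompt: str, complexity: float = 0,
                 default: str = "claude-sonnet-4") -> str:
    """Return the model of the first rule that matches (assumed semantics)."""
    for rule in RULES:
        if "pattern" in rule and re.search(rule["pattern"], prompt, re.IGNORECASE):
            return rule["model"]
        if "complexity" in rule and complexity_matches(rule["complexity"], complexity):
            return rule["model"]
    return default


print(select_model("Fix the login error"))
print(select_model("Design the storage layer"))
print(select_model("Rename this variable", complexity=9))
```

Because the patterns are unanchored, a prompt containing "prefix" would also match the first rule under this interpretation; adding word boundaries (`\b`) to the patterns would tighten the match.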

Cost Optimization

Manage Your AI Budget Effectively

Strategies to optimize costs while maintaining quality

Cost-Saving Strategies

Use Appropriate Models

Don’t use Opus 4 for simple tasks; Sonnet 4 or Gemini Flash handle them well

Enable Auto-Switching

Let Siya choose cost-effective models automatically

Leverage Local Models

Use local models for repetitive or sensitive tasks

Optimize Context

Clear unnecessary context to reduce token usage

Batch Operations

Group similar tasks to reduce overhead
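
The "Optimize Context" strategy can be sketched as dropping the oldest messages until the conversation fits a token budget. This is an illustration only: the four-characters-per-token estimate is a crude heuristic rather than a real tokenizer, and `trim_history` is a hypothetical helper, not a Siya API.

```python
def estimate_tokens(text: str) -> int:
    """Crude token estimate: roughly four characters per token (assumption)."""
    return max(1, len(text) // 4)


def trim_history(messages: list[str], budget: int) -> list[str]:
    """Keep the most recent messages whose estimated tokens fit the budget."""
    kept: list[str] = []
    used = 0
    # Walk from newest to oldest so recent context is preserved first.
    for message in reversed(messages):
        cost = estimate_tokens(message)
        if used + cost > budget:
            break
        kept.append(message)
        used += cost
    kept.reverse()  # restore chronological order
    return kept


history = ["a" * 400, "b" * 200, "c" * 100]  # ~100, ~50, ~25 tokens
print(len(trim_history(history, budget=80)))
```

With a budget of 80 estimated tokens, the two most recent messages are kept and the oldest is dropped.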

Usage Monitoring

"Show my token usage"
"What's my API spend today?"
"Which model am I using most?"
"Show cost breakdown"

Best Practices

Model Selection Excellence

Expert tips for getting the most from each model

General Guidelines

Do's

Start with faster models
Upgrade when needed
Use local for privacy
Monitor costs
Learn model strengths
Experiment freely

Don'ts

Always use the most expensive model
Ignore context limits
Share API keys
Neglect local options
Stick to one model
Waste tokens

Model-Specific Tips

Future Models

Coming Soon to Siya

Upcoming models and capabilities on our roadmap

In Development

Coming Soon

Claude Haiku 3
GPT-4 Vision
Llama 4
Custom fine-tuned models
Mixtral MOE

Future Vision

On the Horizon

Multi-model ensemble
Custom training
Specialized verticals
Real-time models
Edge deployment

Summary

Choose Wisely, Code Brilliantly

With Siya’s model selection, you can match the model to the task: the reasoning depth of Claude Opus 4, the speed of Gemini Flash, specialized coding models, or privacy-preserving local options. Understanding each model’s strengths helps you work faster and more efficiently.

Get Started

Start using Siya with your chosen model

API Setup

Configure your API keys

The right model. The right time. The right results.
