Skip to main content

The Right Model for Every Task

Siya supports a comprehensive range of cutting-edge AI models, from powerful cloud-based options to privacy-focused local models. Understanding each model’s strengths helps you choose the perfect one for your specific needs, balancing performance, cost, and capabilities.

Model Overview

Siya's multi-model ecosystem provides flexibility and choice

Cloud Models

Cutting-Edge AI at Your Fingertips

Access the most powerful AI models through secure API connections
  • Claude Models
  • OpenAI Models
  • Google Models

Anthropic's Claude Family

Industry-leading models known for reasoning, safety, and helpful responses

Claude Opus 4

The Flagship ModelClaude Opus 4 represents the pinnacle of AI capability, excelling at:
  • Complex reasoning and analysis
  • Architecture design and planning
  • Code generation and debugging
  • Creative problem solving
  • Multi-step task execution

Specifications

  • Context: 200K tokens
  • Strengths: Deep reasoning
  • Speed: 20-30 tokens/sec
  • Best for: Complex tasks
  • Cost: Premium tier

Claude Opus 4 performance across different tasks

Choose Opus 4 when accuracy and reasoning depth matter more than speed
The Balanced PerformerSonnet 4 offers an optimal balance of capability and efficiency:
  • Fast response times
  • Excellent code understanding
  • Strong analytical skills
  • Cost-effective for most tasks
  • Great for iterative development

Specifications

  • Context: 200K tokens
  • Strengths: Balance
  • Speed: 40-50 tokens/sec
  • Best for: Daily coding
  • Cost: Standard tier
Sonnet 4 is recommended for 80% of development tasks

Specialized Models

Purpose-Built for Specific Domains

Models optimized for particular use cases and languages

DeepSeek Coder

The Code SpecialistTrained specifically for programming tasks:
  • Superior code completion
  • Bug detection and fixing
  • Code explanation
  • Refactoring suggestions
  • Multi-language support
Supported Languages: Python, JavaScript, TypeScript, Java, C++, Go, Rust, and more

Performance Metrics

Use DeepSeek for intensive coding sessions and complex debugging
Multilingual ExcellenceOutstanding performance across languages:
  • Native multilingual training
  • Strong in Asian languages
  • Excellent reasoning
  • Cultural awareness
  • Technical accuracy
Language Support: English, Chinese, Japanese, Korean, Spanish, French, German, and 50+ more

Use Cases

  • International projects
  • Localization tasks
  • Cross-cultural apps
  • Global documentation
  • Multi-region deployment

Local Models

Privacy-First AI on Your Machine

Run powerful models locally without internet connection
  • Setup Guide
  • Available Models
  • Performance
1

Download Model

Model Sources

Download llamafile models from:
2

Install Model

# Create models directory
mkdir -p ~/.siya/local-models

# Move downloaded model
mv ~/Downloads/model.llamafile ~/.siya/local-models/

# Make executable
chmod +x ~/.siya/local-models/model.llamafile
3

Configure Siya

Model appears automatically in Siya’s model selector
4

First Run

First run may take time to load model into memory
Local models require 8-64GB RAM depending on model size

Model Selection Strategy

Choose the Right Model Every Time

Strategic guidance for optimal model selection

Model selection decision tree

  • By Task Type
  • By Project Size
  • By Constraints

Software Architecture

Recommended: Claude Opus 4

Why Opus 4?
  • Deep reasoning for design decisions
  • Understands complex systems
  • Excellent at trade-off analysis
  • Can model entire architectures
Alternative: GPT-5 for broader perspective

Recommended: DeepSeek Coder

Why DeepSeek?
  • Specialized for code generation
  • Superior completion accuracy
  • Understands language idioms
  • Fast and efficient
Alternative: Claude Sonnet 4 for general coding

Recommended: Claude Sonnet 4

Why Sonnet 4?
  • Balanced analysis capabilities
  • Good at finding subtle bugs
  • Clear explanations
  • Fast enough for iteration
Alternative: Local Qwen for privacy-sensitive code

Recommended: GPT-4 Turbo

Why GPT-4 Turbo?
  • Natural writing style
  • Good at examples
  • Consistent formatting
  • Cost-effective
Alternative: Gemini Pro for large docs

Model Comparison

Side-by-Side Model Comparison

Detailed comparison to help you make informed decisions

Switching Models

Dynamic Model Switching

Switch models on the fly based on your needs
  • Manual Switching
  • Automatic Switching
1

Quick Switch

Press ⌘ + M to open model selector
2

Menu Bar

Click model name in status bar
3

Command

Type “switch to GPT-5” in chat
4

Settings

Change default in preferences

Cost Optimization

Manage Your AI Budget Effectively

Strategies to optimize costs while maintaining quality

Cost-Saving Strategies

1

Use Appropriate Models

Don’t use Opus 4 for simple tasks - Sonnet 4 or Flash work great
2

Enable Auto-Switching

Let Siya choose cost-effective models automatically
3

Leverage Local Models

Use local models for repetitive or sensitive tasks
4

Optimize Context

Clear unnecessary context to reduce token usage
5

Batch Operations

Group similar tasks to reduce overhead
"Show my token usage"
"What's my API spend today?"
"Which model am I using most?"
"Show cost breakdown"

Best Practices

Model Selection Excellence

Expert tips for getting the most from each model

General Guidelines

Do's

  • Start with faster models
  • Upgrade when needed
  • Use local for privacy
  • Monitor costs
  • Learn model strengths
  • Experiment freely

Don'ts

  • Always use most expensive
  • Ignore context limits
  • Share API keys
  • Neglect local options
  • Stick to one model
  • Waste tokens

Future Models

Coming Soon to Siya

Upcoming models and capabilities on our roadmap

In Development

Coming Soon
  • Claude Haiku 3
  • GPT-4 Vision
  • Llama 4
  • Custom fine-tuned models
  • Mixtral MOE

Future Vision

On the Horizon
  • Multi-model ensemble
  • Custom training
  • Specialized verticals
  • Real-time models
  • Edge deployment

Summary

Choose Wisely, Code Brilliantly

With Siya’s comprehensive model selection, you have access to the perfect AI for every task. From the reasoning depth of Claude Opus 4 to the lightning speed of Gemini Flash, from specialized coding models to privacy-preserving local options - the right model is always at your fingertips. Understanding each model’s strengths empowers you to work smarter, faster, and more efficiently.

The right model. The right time. The right results.