This guide documents a highly productive, AI-driven development stack using a combination of cloud-based LLMs, terminal tools, IDEs, and mobile access. It is designed for developers who want persistent workflows, AI-powered coding assistance, and flexible access from multiple devices.
┌──────────┐
│ iPhone │
└──────────┘
│
Tailscale VPN (on-demand)
│
▼
┌─────────────┐
│ MacBook Air │
│ (tmux / │
│ zellij + │
│ caffeinate) │
└─────────────┘
│
┌─────────────────────┼──────────────────┐
│ │ │
▼ ▼ ▼
┌────────────┐ ┌────────────┐ ┌───────────────┐
│ VS Code │ │ Terminus │ │ Cursor AI │
│ + Cline │ │ (SSH) │ │ (Primary IDE) │
│ + Continue │ └────────────┘ └───────────────┘
└────────────┘ │
│ ▼
▼ ┌──────────────────────────────┐
┌────────────────┐ │ AI Coding Agents & Tools │
│ Test / Scripts │ │------------------------------│
│ (AI-driven) │ │ • Cline │
└────────────────┘ │ • Continue.dev │
│ • Claude Code │
│ • Ollama (local models) │
│ • Aider │
│ • Windsurf (Cascade) │
│ • Gemini CLI │
│ • GitHub Copilot │
└──────────────────────────────┘
│
▼
┌────────────────────────────────────┐
│ LLM Routing & Orchestration │
│------------------------------------│
│ • OpenRouter / LiteLLM │
│ • Multi-model routing │
│ • Cost / speed optimisation │
│ • IDE integrations │
└────────────────────────────────────┘
│
▼
┌────────────────────────────────────────────────────┐
│ Cloud / Web LLM Interfaces │
│----------------------------------------------------│
│ ChatGPT Web / Codex → Research / general │
│ Perplexity Web → Cited research │
│ Claude Code (terminal) → Heavy coding │
│ Claude Web + Desktop → General usage │
│ Grok / SuperGrok → Images + real-time │
└────────────────────────────────────────────────────┘
Component Descriptions
iPhone
- Purpose: Mobile access to coding sessions, remote terminals, and LLM interfaces.
- Tools: Termius (SSH), web browsers for cloud LLMs.
- Connection: Via Tailscale VPN for secure, private access to your MacBook.
Tailscale VPN
- Purpose: Secure, on-demand VPN that allows devices to communicate without exposing ports publicly.
- Use case: Connect iPhone or other devices to your MacBook seamlessly.
MacBook
- Purpose: Primary development machine.
- Enhancements:
- tmux / zellij: Terminal multiplexers for persistent, multi-pane sessions.
- caffeinate / Amphetamine: Prevent sleep to maintain long-running processes or server tasks.
IDEs and Terminals
- VS Code + Cline + Continue.dev
- Modern IDE with AI-powered code suggestions and integrations.
- Terminus
- SSH client for remote server access and workflow continuity.
- Cursor AI
- AI-focused primary IDE for code generation and testing.
AI Coding Agents & Tools
- Purpose: Automate coding, testing, and documentation.
- Key Tools:
- Cline: Terminal-based AI coding assistant.
- Continue.dev: AI automation for workflows.
- Claude Code: Heavy-duty coding in CLI.
- Aider / Windsurf / Gemini CLI / GitHub Copilot: Multi-purpose AI agents integrated with IDEs.
LLM Routing & Orchestration
- Purpose: Optimize which AI model handles which task.
- Features:
- Multi-model routing based on task type (research, coding, testing, image generation).
- Speed and cost optimization for different models.
- Seamless IDE integration for on-the-fly AI assistance.
Cloud / Web LLM Interfaces
- Purpose: External interfaces for research, code generation, and real-time AI support.
- Examples:
- ChatGPT Web / Codex: General-purpose research and code generation.
- Perplexity Web: Citation-focused research.
- Claude Web + Desktop / Claude Code: Heavy coding workflows.
- Grok / SuperGrok: Image processing, real-time AI applications.
Workflow Example
-
Start your MacBook environment:
tmux new -s dev caffeinate -dims & -
Connect iPhone via Tailscale VPN
- Open Termius to SSH into the MacBook.
- Access web LLM interfaces via browser.
-
Code & Test:
- Open VS Code or Cursor AI.
- Use AI agents for code suggestions (
Claude Code,Aider). - Run tests and scripts locally or via AI automation.
-
Route AI Tasks via LLM Orchestration:
- Route heavy coding to Claude Code CLI.
- Use Perplexity Web for research and citations.
- Use ChatGPT for general guidance.
-
Deploy / Integrate:
- Final code merges via VS Code.
- Push code to GitHub or deploy with scripts automated by Continue.dev.
Tips & Best Practices
- Keep tmux/zellij sessions persistent to avoid losing long-running AI tasks.
- Use Tailscale for mobile-first remote access; no port forwarding required.
- Use LLM orchestration to reduce latency and cost by assigning tasks to the most efficient model.
- Combine terminal-based and web-based AI tools for optimal workflow flexibility.
- Regularly update your AI agents and IDE plugins for maximum compatibility.
This stack allows seamless integration of mobile access, persistent development environments, AI coding assistants, and multi-model orchestration, ideal for solo developers or small teams who heavily rely on AI-driven productivity.