Tag

AI Models

A collection of 22 posts

Gemini

Gemini 3.5 Live Translate: A Developer's Guide

Google's Gemini 3.5 Live Translate is a new audio model for continuous speech-to-speech translation in 70+ languages. Here's how it works, where it ships, and how to build with it.

· 7 min read
AI Models

Kimi K2.6 vs GPT-5.5 vs Claude Opus 4.8 (2026)

A practical 2026 comparison of Kimi K2.6, GPT-5.5, and Claude Opus 4.8 on coding benchmarks, reasoning, pricing, and self-host economics — plus which to pick by use case.

· 7 min read
Speech to Text

faster-whisper vs whisper.cpp vs OpenAI Whisper (2026)

A practical 2026 comparison of faster-whisper, whisper.cpp, and OpenAI's reference Whisper — speed, VRAM, accuracy, and which local speech-to-text runtime to pick for your hardware.

· 7 min read
Whisper

Run Whisper Large Locally: Setup Guide (2026)

Install and run OpenAI Whisper's largest model locally for private, offline transcription — VRAM requirements, pip and Apple Silicon setup, faster-whisper, and quantization.

· 7 min read
AI

Anthropic Mythos: Complete Guide (2026)

Anthropic Mythos is the frontier preview model unveiled April 7, 2026: stronger than Opus 4.7 on math and security, withheld from public release, shipped only via Project Glasswing to ~50 defensive-security partners.

· 11 min read
AI

Claude Mythos vs Opus 4.7 vs GPT-5.5 (2026)

Claude Mythos, Opus 4.7, and GPT-5.5 shipped within three weeks of each other in April 2026. We break down which frontier model wins on coding, reasoning, vision, cost, and which one your team should actually pick.

· 10 min read
AI

Qwen 3.7 Max: Alibaba's May 2026 Flagship Guide

Alibaba's Qwen 3.7 Max launched May 20, 2026 with a 1M-token context, native extended-thinking mode, and benchmark wins on SWE-Pro and Terminal-Bench. Here's how it compares to Claude Opus 4.7, GPT-5.5, Gemini 3.5 Flash and DeepSeek V4, what it costs on DashScope, and when to pick it.

· 10 min read
AI

Gemini 3.5 Flash + Gemini Spark: Google I/O 2026

Google dropped Gemini 3.5 Flash and Gemini Spark at I/O 2026. A frontier-grade Flash model that outruns 3.1 Pro, and a persistent personal agent built on top of it. Here's what shipped, what's rumored, and where it fits next to Claude Opus 4.7 and GPT-5.5.

· 9 min read
AI

AI Model Releases — May 2026 Roundup

A practitioner's roundup of every AI model release that mattered in May 2026 — Anthropic Mythos, Gemini 3.5 Flash, Qwen 3.7 Max, Mistral Medium 3.5, ERNIE 5.1, and Subquadratic's 12M-token SubQ. Benchmarks, pricing, availability, and what to actually use.

· 14 min read
AI

Baidu ERNIE 5.1: Chinese LLM Cracks Global Top 5

Baidu's ERNIE 5.1, released May 8 2026, became the first Chinese LLM in the global Search Arena top 5. Here's what it does, how it compares to DeepSeek V4 and Qwen, and when teams outside China should actually use it.

· 8 min read
AI Models

AI Models Released in May 2026 — Complete Roundup

Every major LLM and AI tooling release in May 2026 — Qwen 3.7-Max, DeepSeek V4-Pro permanent pricing, Gemini 3.5 Flash, Composer 2.5, Grok Build, Cherry Studio 1.9.6, Ollama 0.24, and what's still rumored.

· 9 min read