Reference · Supporting source

Typhoon-2 Inference Cost Discount vs GPT-4o

~30-50% lower per-token cost (Thai-language workloads)

As of: Q4 2025 release; 2026 deployment · Sources: 3 · Supporting

SCB 10X Typhoon-2 (released December 2025) reaches parity with GPT-4o on Thai-language tasks across M3Exam Thai, ThaiSum, and the Typhoon eval suite, at roughly 30 to 50 percent lower per-token inference cost when served from the AWS Bangkok region, the Google Cloud Bangkok region, or NSTDA Lanta-class infrastructure. The cost differential, combined with the latency advantage of avoiding a Singapore round-trip, flips the deployment decision for net-new purchases covering Thai-language regulated workloads during 2026-27: from foundation-vendor wrappers to Thai LLM fine-tunes.
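To make the 30 to 50 percent band concrete, the following is a minimal arithmetic sketch, not a pricing model. The function name `discounted_cost_range` and the baseline spend of 1,000 units are hypothetical, chosen only for illustration; the source gives neither actual GPT-4o prices nor token volumes, so only the discount band itself comes from the figure.

```python
def discounted_cost_range(baseline_cost, low_discount=0.30, high_discount=0.50):
    """Apply a discount band to a baseline spend.

    baseline_cost: spend on the comparison model, in any currency unit
                   (hypothetical input, not taken from the source).
    low_discount / high_discount: the 30-50% band quoted in the figure.
    Returns (lowest_cost, highest_cost) after the discount.
    """
    if baseline_cost < 0:
        raise ValueError("baseline_cost must be non-negative")
    if not 0 <= low_discount <= high_discount <= 1:
        raise ValueError("discounts must satisfy 0 <= low <= high <= 1")
    lowest_cost = baseline_cost * (1 - high_discount)   # largest discount applied
    highest_cost = baseline_cost * (1 - low_discount)   # smallest discount applied
    return lowest_cost, highest_cost


# Hypothetical example: a workload that costs 1,000 units on the baseline model.
low, high = discounted_cost_range(1000.0)
print(f"Estimated spend after discount: {low:.2f} to {high:.2f} units")
```

Because the discount is quoted per token, the same band applies to any spend that scales linearly with token volume. Fixed costs such as fine-tuning or hosting commitments would fall outside this sketch.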

Time scope

Q4 2025 release; 2026 deployment

Source basis

Supporting source

Interpretation notes

What this tells you

Typhoon-2 matches GPT-4o on Thai-language benchmarks at roughly 30 to 50 percent lower per-token inference cost when served in-country. Together with the latency gained by avoiding a Singapore round-trip, this favors Thai LLM fine-tunes over foundation-vendor wrappers for net-new Thai-language regulated workloads during 2026-27.

What not to do with it

Do not quote this figure on its own. Use the linked report for interpretation, and keep basis differences explicit when comparing it with other numbers.
