DeepSeek Launches Sparse Attention Model, Halves API Costs

Tuesday, 30 June

Tuesday, 30 June, 2026

DeepSeek Launches Sparse Attention Model, Halves API Costs

By Isha

DeepSeek has unveiled V3.2-exp, a sparse-attention model using a “lightning indexer” and fine-grained token selection to trim inference expenses. In long-context applications, this architecture can cut per-call API costs by up to 50%. The model is open-weight and publicly available on Hugging Face, enabling further third-party validation and adoption.

Read full story at TechCrunch

Tags:api adoption DeepSeek

Download TechShots

IT Trends Move Fast. Stay Faster.

Android iOS

Share your insights

Create Content

Categories

DeepSeek Launches Sparse Attention Model, Halves API Costs

Also Read

Defeating the Digital Snoops: How to Stop Public Wi-Fi From Exposing Your Data

"Hundred-Year Flood": Elon Musk and Tim Cook Sound Alarm Over Record AI Chip Costs

NASA in the Neighborhood: The $25,000 Luxury Lunar Rover for Earth

Pocket Revolution: The Device That Swallowed the 21st Century Turns 19

Italy Probes Microsoft Over Sneaky, Expensive AI Upgrades

The Future of Tech Work: 92% of Leaders Say AI Management is Non-Negotiable

Premium Power, Mid-Range Price: Motorola Drops Moto Pad 70 Pro

Beat the Ads: Vi Bundles 3 Months of Free Spotify Premium

Feeding the Machine: iPhone 18 Slated for RAM Boost to Power Advanced AI

Download TechShots

Share your insights

Subscribe To Our Newsletter.