China's DeepSeek has launched NSA, a hardware-aligned and natively trainable sparse attention mechanism designed for ultra-fast long-context training and inference. NSA uses a dynamic hierarchical sparse strategy that combines coarse-grained token compression with fine-grained token selection. The China-based AI company said NSA speeds up inference and reduces pre-training costs without compromising performance, and that it matches or outperforms Full Attention models on a range of benchmarks.
DeepSeek Launched NSA Mechanism for Faster Inference, Lower Training Costs
🚀 Introducing NSA: A Hardware-Aligned and Natively Trainable Sparse Attention mechanism for ultra-fast long-context training & inference!
Core components of NSA:
• Dynamic hierarchical sparse strategy
• Coarse-grained token compression
• Fine-grained token selection
💡 With… pic.twitter.com/zjXuBzzDCp
— DeepSeek (@deepseek_ai) February 18, 2025
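The post above names three components: coarse-grained token compression (attending to block-level summaries), fine-grained token selection (attending only to the most relevant individual tokens), and a dynamic hierarchical strategy that combines them. The snippet below is a minimal, illustrative sketch of those two ideas in plain NumPy and is not DeepSeek's released implementation; the block size, the top-k value, and the mean-pooling compressor are assumptions made purely for this example.

```python
# Illustrative sketch only -- NOT DeepSeek's NSA implementation.
# Coarse branch: attend to compressed block summaries.
# Fine branch: attend to the top-k most relevant individual tokens.
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def sparse_attention(q, k, v, block_size=4, top_k=8):
    """q: (d,) single query; k, v: (n, d) context keys/values."""
    n, d = k.shape
    scale = 1.0 / np.sqrt(d)

    # Coarse-grained compression: summarize each block of tokens with one
    # key/value pair (mean pooling here; a real mechanism would learn this).
    n_blocks = (n + block_size - 1) // block_size
    k_blocks = np.stack([k[i * block_size:(i + 1) * block_size].mean(axis=0)
                         for i in range(n_blocks)])
    v_blocks = np.stack([v[i * block_size:(i + 1) * block_size].mean(axis=0)
                         for i in range(n_blocks)])
    coarse_scores = q @ k_blocks.T * scale

    # Fine-grained selection: keep only the top-k individual tokens by
    # similarity to the query and attend to them at full resolution.
    token_scores = q @ k.T * scale
    top_idx = np.argsort(token_scores)[-top_k:]

    # One softmax over the reduced candidate set, so the cost scales with
    # (n_blocks + top_k) instead of the full context length n.
    scores = np.concatenate([coarse_scores, token_scores[top_idx]])
    weights = softmax(scores)
    values = np.concatenate([v_blocks, v[top_idx]], axis=0)
    return weights @ values

# Tiny usage example with random data.
rng = np.random.default_rng(0)
q = rng.standard_normal(16)
k = rng.standard_normal((64, 16))
v = rng.standard_normal((64, 16))
print(sparse_attention(q, k, v).shape)  # (16,)
```

Because each query scores only the block summaries plus a handful of selected tokens rather than every token in the context, the attention cost grows with the reduced candidate set instead of the full sequence length, which is the intuition behind the claimed long-context training and inference speedups.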
(SocialLY brings you all the latest breaking news, viral trends and information from the world of social media, including Twitter (X), Instagram and YouTube. The above post is embedded directly from the user's social media account and LatestLY staff may not have modified or edited the content. The views and facts appearing in the social media post do not reflect the opinions of LatestLY, nor does LatestLY assume any responsibility or liability for the same.)