Artificial Intelligence Race: After ChatGPT, Microsoft Introduces Kosmos-1, a New AI Model That Responds to Visual Cues

As the war over artificial intelligence (AI) chatbots heats up, Microsoft has unveiled Kosmos-1, a new AI model that can respond to visual cues or images as well as text prompts or messages.

Technology | IANS | Mar 03, 2023 08:00 PM IST


New Delhi, March 3: As the war over artificial intelligence (AI) chatbots heats up, Microsoft has unveiled Kosmos-1, a new AI model that can respond to visual cues or images as well as text prompts or messages. The multimodal large language model (MLLM) can help with an array of new tasks, including image captioning, visual question answering and more.

Kosmos-1 can pave the way for the next stage beyond ChatGPT's text prompts. "A big convergence of language, multimodal perception, action, and world modeling is a key step toward artificial general intelligence. In this work, we introduce Kosmos-1, a Multimodal Large Language Model (MLLM) that can perceive general modalities, learn in context and follow instructions," said Microsoft's AI researchers in a paper.

The paper suggests that multimodal perception, or knowledge acquisition and "grounding" in the real world, is needed to move beyond ChatGPT-like capabilities to artificial general intelligence (AGI), reports ZDNet.

"More importantly, unlocking multimodal input greatly widens the applications of language models to more high-value areas, such as multimodal machine learning, document intelligence, and robotics," the paper read.

The goal is to align perception with LLMs, so that the models are able to see and talk. Experimental results showed that Kosmos-1 achieves impressive performance on language understanding, generation, and even OCR-free NLP, in which the model is fed document images directly.

It also showed good results in perception-language tasks, including multimodal dialogue, image captioning and visual question answering, as well as in vision tasks such as image recognition with descriptions (specifying classification via text instructions).

"We also show that MLLMs can benefit from cross-modal transfer, i.e., transfer knowledge from language to multimodal, and from multimodal to language. In addition, we introduce a dataset of Raven IQ test, which diagnoses the nonverbal reasoning capability of MLLMs," said the team.

(The above story first appeared on LatestLY on Mar 03, 2023 08:00 PM IST. For more news and updates on politics, world, sports, entertainment and lifestyle, log on to our website latestly.com).
