What is CoDi, Microsoft's AI to generate text, images, audio, videos all at once?

Microsoft's CoDi is presented as the first model able to process and generate several types of media simultaneously and keep the outputs consistent with one another.

DNA Web Team

Updated : Jul 04, 2023, 08:13 PM IST

Microsoft has developed an all-at-once AI model called CoDi (Composable Diffusion) in an effort to expand what AI can do. CoDi can process and generate several media types concurrently, including text, images, video, and audio, which could change how people interact with computers.

CoDi is based on a novel approach that builds a shared multimodal space, allowing synchronised generation of related modalities such as temporally aligned video and audio.

This capability addresses earlier concerns about the consistency of unimodal streams that are generated independently and then combined. Latent diffusion models (LDMs) were first trained individually for each modality, giving strong single-modality generation performance. The input modalities were then projected into a shared feature space, enabling each modality's LDM to be conditioned on any combination of simultaneous inputs.
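
To make the idea of conditioning on "any combination of inputs" concrete, here is a minimal Python sketch. It assumes each modality has already been encoded into a vector of the same length in one shared space, and it combines them with a weighted average. The function name `combine_conditions`, the weights, and the averaging rule are illustrative assumptions, not details given in this article or Microsoft's actual code.

```python
from typing import Dict, List


def combine_conditions(embeddings: Dict[str, List[float]],
                       weights: Dict[str, float]) -> List[float]:
    """Blend per-modality embeddings that live in one shared space.

    Hypothetical helper: the weighted average is an assumed combination
    rule used only to illustrate the idea.
    """
    if not embeddings:
        raise ValueError("need at least one modality")
    dims = {len(vec) for vec in embeddings.values()}
    if len(dims) != 1:
        raise ValueError("all embeddings must have the same dimension")
    total = sum(weights[name] for name in embeddings)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    dim = dims.pop()
    # Weighted average, computed one coordinate at a time.
    return [
        sum(weights[name] * embeddings[name][i] for name in embeddings) / total
        for i in range(dim)
    ]


# Toy example: a text prompt and an audio prompt, weighted equally.
condition = combine_conditions(
    {"text": [1.0, 0.0], "audio": [0.0, 1.0]},
    {"text": 1.0, "audio": 1.0},
)
```

Because every input ends up as a vector in the same space, the same function works for one prompt or several, which is the property the paragraph above describes.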

CoDi's ability to handle many-to-many generation, producing any mix of output modalities at the same time, is its key innovation. It accomplishes this without needing to train on every possible combination of modalities by adding a cross-attention module to each generator, together with an environment encoder that maps their latent representations into a shared space.
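
The cross-attention mechanism mentioned above can be illustrated with a short Python sketch of generic scaled dot-product cross-attention, in which one modality's latent vectors (the queries) read from another modality's latent vectors (the keys and values). This is a simplified textbook version with no learned projections. The function names and toy numbers are assumptions for illustration and do not reproduce CoDi's actual layers.

```python
import math
from typing import List

Vector = List[float]


def softmax(scores: Vector) -> Vector:
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attend(queries: List[Vector],
                 keys: List[Vector],
                 values: List[Vector]) -> List[Vector]:
    """Each query mixes the values, weighted by its similarity to the keys."""
    scale = math.sqrt(len(keys[0]))
    outputs = []
    for q in queries:
        # Similarity of this query to every key (scaled dot product).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / scale for k in keys]
        attn = softmax(scores)
        # Weighted sum of the value vectors, one coordinate at a time.
        outputs.append([
            sum(w * v[d] for w, v in zip(attn, values))
            for d in range(len(values[0]))
        ])
    return outputs


# Toy example: one "video" latent attending over two "audio" latents.
video_latents = [[0.0, 0.0]]
audio_keys = [[1.0, 0.0], [0.0, 1.0]]
audio_values = [[2.0, 0.0], [0.0, 2.0]]
mixed = cross_attend(video_latents, audio_keys, audio_values)
```

In this toy case the query is equally similar to both keys, so the output is the plain average of the two value vectors.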

Capabilities of CoDi

CoDi demonstrated this by producing synchronised video and audio output from a combination of text, audio, and image prompts. The result shows its ability to combine information from several sources and produce coherent, aligned output.

These capabilities open up a wide range of practical uses, particularly in accessible technology and education. CoDi could provide dynamic, engaging content that supports different learning styles and offers more accessible options for people with disabilities. It is expected to improve human-computer interaction significantly and is seen as a step towards a new era of generative AI.
