
Meta Outlines its Latest Image Recognition Advances, Which Could Facilitate


Meta’s working towards the next stage of generative AI, which could eventually enable the creation of immersive VR environments via simple directions and prompts.

Its latest development on this front is its updated DINO image recognition model, which is now able to better identify individual objects within image and video frames, based on self-supervised learning, as opposed to requiring human annotation for each element.

As you can see in this example, DINOv2 is able to understand the context of visual inputs and separate out individual elements, which will better enable Meta to build new models with an advanced understanding of not only what an item might look like, but also where it should be placed within a setting.

Meta published the first version of its DINO system back in 2021, which was a significant advance in what’s possible via image recognition. The new version builds upon this, and could have a range of potential use cases.

As explained by Meta:

“In recent years, image-text pre-training has been the standard approach for many computer vision tasks. But because the method relies on handwritten captions to learn the semantic content of an image, it ignores important information that typically isn’t explicitly mentioned in those text descriptions. For instance, a caption of a picture of a chair in a vast purple room might read ‘single oak chair’. Yet, the caption misses important information about the background, such as where the chair is spatially located in the purple room.”

DINOv2 is able to build in more of this context, without requiring manual intervention, which could have specific value for VR development.

It could also facilitate more immediately accessible elements, like improved digital backgrounds in video chats, or tagging products within video content. It could further enable all-new types of AR and visual tools that could lead to more immersive Facebook functions.

“Going forward, the team plans to integrate this model, which can function as a building block, in a larger, more complex AI system that could interact with large language models. A visual backbone providing rich information on images will allow complex AI systems to reason on images in a deeper way than describing them with a single text sentence. Models trained with text supervision are ultimately limited by the image captions. With DINOv2, there is no such built-in limitation.”

That, as noted, could also enable the development of AI-generated VR worlds, so that you’d eventually be able to speak entire, interactive virtual environments into existence.

That’s a long way off, and Meta’s hesitant to make too many references to the metaverse at this stage. But that’s where this technology could truly come into its own, via AI systems that can understand more about what’s in a scene, and where, contextually, things should be placed.

It’s another step in that direction – and while many have cooled on the prospects for Meta’s metaverse vision, it still could become the next big thing, once Meta’s ready to share more of its next-level vision.

It’ll likely be more cautious about doing so, given the negative coverage it’s seen thus far. But it is coming, so don’t be surprised when Meta eventually wins the generative AI race with a totally new, totally different experience.

You can read more about DINOv2 here.




