Apple Launches Groundbreaking Depth Pro Open Source AI Model for Monocular Depth Estimation

Monday, 7 October 2024, 03:35

Apple has released the Depth Pro open-source AI model, revolutionizing monocular depth estimation. This innovative technology enables the generation of accurate depth maps for 3D applications using advanced AI techniques, eliminating the need for multiple cameras. Depth Pro highlights Apple’s commitment to enhancing AI capabilities in various fields, including augmented reality.
Gadgets360
Apple Launches Groundbreaking Depth Pro Open Source AI Model for Monocular Depth Estimation

Apple Releases Depth Pro AI Model

Depth estimation is a critical process in 3D modelling, augmented reality (AR), autonomous systems, and robotics. The human eye excels in gauging depth with a single perspective, yet traditional cameras struggle, capturing only two-dimensional images. Apple's Depth Pro addresses this challenge, leveraging advanced AI to produce accurate depth maps from single images.

How Depth Pro AI Model Generates Depth Maps

Researchers employed a Vision Transformer-based (ViT) architecture to create the Depth Pro AI model. The model operates at a resolution of 384 x 384 for outputs while maintaining 1536 x 1536 for inputs, providing a comprehensive detail analysis. In a recently published paper, Apple demonstrated Depth Pro's proficiency in accurately mapping depth for visually intricate objects, achieving results in under a second. The open-source model's weights are available on GitHub, enabling users to implement it on a single GPU.


This article was prepared using information from open sources in accordance with the principles of Ethical Policy. The editorial team is not responsible for absolute accuracy, as it relies on data from the sources referenced.


Related posts


Newsletter

Subscribe to our newsletter for the most reliable and up-to-date tech news. Stay informed and elevate your tech expertise effortlessly.

Subscribe