Apple Launches Groundbreaking Depth Pro Open-Source AI Model for Monocular Depth Estimation
Apple Releases Depth Pro AI Model
Depth estimation is a critical step in 3D modelling, augmented reality (AR), autonomous systems, and robotics. Humans can judge depth reasonably well even from a single viewpoint, but a conventional camera records only a two-dimensional image with no explicit distance information. Apple's Depth Pro addresses this challenge by using a neural network to predict an accurate depth map from a single image.
How Depth Pro AI Model Generates Depth Maps
Researchers built Depth Pro on a Vision Transformer (ViT) architecture. The network runs at a fixed resolution of 1536 x 1536, which it processes as 384 x 384 patches at several scales, so it can capture both the overall scene and fine detail. In a recently published paper, Apple reports that Depth Pro maps depth accurately even for visually intricate objects and produces a 2.25-megapixel depth map in about 0.3 seconds on a standard GPU. The code and model weights are openly available through GitHub, and the model can run on a single GPU.
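The two resolutions quoted above are related: 1536 is exactly four times 384, so a 1536 x 1536 image can be covered by a grid of 384 x 384 patches. The sketch below only illustrates that arithmetic in plain Python. The function names `tile_origins` and `tile_boxes` are hypothetical, and the grid sizes (4 per side without overlap, 5 per side with overlap) are assumptions chosen for illustration, not a description of the tiling Apple's code actually uses.

```python
def tile_origins(image_size: int, patch_size: int, tiles_per_side: int) -> list[int]:
    """Return evenly spaced patch start offsets along one image axis.

    The first patch starts at 0 and the last one ends exactly at image_size.
    With more tiles than strictly needed, neighbouring patches overlap.
    """
    if tiles_per_side < 1 or patch_size > image_size:
        raise ValueError("need at least one tile and patch_size <= image_size")
    if tiles_per_side == 1:
        return [0]
    span = image_size - patch_size
    if span % (tiles_per_side - 1) != 0:
        raise ValueError("tiles do not land on whole-pixel offsets")
    stride = span // (tiles_per_side - 1)
    return [i * stride for i in range(tiles_per_side)]


def tile_boxes(image_size: int, patch_size: int, tiles_per_side: int):
    """Return (left, top, right, bottom) boxes for a square grid of patches."""
    origins = tile_origins(image_size, patch_size, tiles_per_side)
    return [
        (x, y, x + patch_size, y + patch_size)
        for y in origins
        for x in origins
    ]


if __name__ == "__main__":
    # Illustrative grids only: 4 per side tiles the image without overlap,
    # 5 per side makes neighbouring patches overlap.
    for tiles in (4, 5):
        boxes = tile_boxes(1536, 384, tiles)
        print(tiles, "per side ->", len(boxes), "patches, first:", boxes[0], "last:", boxes[-1])
```

Each box could then be cropped from the image and passed to a 384 x 384 ViT encoder. Overlapping patches are a common way to reduce visible seams when per-patch results are merged back together.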