Apple's FastVLM AI Model Offers Instant Video Captioning

September 02, 2025
Apple has introduced FastVLM, a new AI model for video captioning that provides near-instant image processing, available for Apple Silicon devices.

Apple Inc. has launched FastVLM, a new AI model designed for video captioning, offering users near-instant high-resolution image processing. This model is part of Apple's ongoing efforts to enhance AI capabilities on its devices, specifically optimized for Apple Silicon.

FastVLM leverages a Visual Language Model (VLM) and the MLX open framework, making it 85 times faster and three times smaller than similar models in the market. Users can try out this technology via the Hugging Face repository or through a lighter web-based version, FastVLM 0.5B, which runs directly in the browser.

The AI model requires the camera to focus on an object for processing, providing live captions with prompts such as "Describe what you see in one sentence" or "Identify any text or written content visible." While the technology is designed for Apple Silicon, it may take some time to load on certain devices, even those with advanced specifications.

This development aligns with Apple's broader strategy to integrate AI into its products, potentially paving the way for future applications in wearables and assistive technology. The FastVLM model is available on platforms like Hugging Face, where users can explore its capabilities further.

We hope you enjoyed this article.

Consider subscribing to one of our newsletters like Daily AI Brief.

Also, consider following us on social media:

Subscribe to Daily AI Brief

Daily report covering major AI developments and industry news, with both top stories and complete market updates

Whitepaper

Stanford HAI’s 2025 AI Index Reveals Record Growth in AI Capabilities, Investment, and Regulation

The 2025 AI Index by Stanford HAI provides a comprehensive overview of the global state of artificial intelligence, highlighting significant advancements in AI capabilities, investment, and regulation. The report details improvements in AI performance, increased adoption in various sectors, and the growing global optimism towards AI, despite ongoing challenges in reasoning and trust. It serves as a critical resource for policymakers, researchers, and industry leaders to understand AI's rapid evolution and its implications.

Read more