What question did this study set out to answer?

The aim is to enhance real-time visual prediction for mobile AR applications using VFMs while addressing computational constraints.

January 24, 2026

手の中のVFM：モバイル拡張現実のリアルタイムシーン理解の最適化

Key Points

The aim is to enhance real-time visual prediction for mobile AR applications using VFMs while addressing computational constraints.
Developed ARIA, the first system for on-device VFM inference acceleration.
Utilized a parallel and selective inference scheme leveraging mobile processor heterogeneity.
Full-frame predictions are offloaded to GPUs, while dynamic region updates are handled by NPUs.
Achieved significant improvements in prediction accuracy in real-world mobile AR scenarios.
Increased the deadline success rate for real-time applications.

Abstract

モバイル拡張現実（AR）アプリケーションは、没入型でコンテキスト認識可能なユーザー体験を可能にするために、ピクセルレベルの深度やセマンティクスを含む高品質でリアルタイムの視覚予測を必要とします。近年、Vision Foundation Models（VFM）は多様で未見のデータに対する高い汎化能力を示し、スケーラブルなモバイルAR体験をサポートしています。しかし、モバイルデバイス上でのVFMの展開は計算資源の制約により困難であり、特に予測精度とリアルタイム性能の両立が課題です。本記事では、VFMのデバイス内推論加速を可能にする初のシステムであるARIA 3を紹介します。ARIAはモバイルプロセッサの異種性を活かした並列かつ選択的な推論スキームを採用しており、高並列性のあるGPUのようなプロセッサに対して全フレーム予測を周期的にオフロードし、動的領域に対してはNPUのような専門アクセラレータで低レイテンシの更新を実施します。モバイルデバイスで実装・評価した結果、ARIAは実際のモバイルARシナリオにおいて精度と期限成功率で大幅な改善を達成しました。

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Social Feed

Authors

Jeho Lee

C.R. Jung

Gunjoong Kim

Journals

GetMobile Mobile Computing and Communications

Actions

Institutions

Uppsala University

Yonsei University

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Leeら（Mon,）はこの問題を研究しました。

www.synapsesocial.com/papers/697460acbb9d90c67120a8d2 — DOI: https://doi.org/10.1145/3793236.3793246

Also consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

Mobile Foundation Model as Firmware· 2024 · 31 citations
WiP: Efficient LLM Prefilling with Mobile NPU· 2024 · 3 citations
Pantheon: Preemptible Multi-DNN Inference on Mobile Edge GPUs· 2024 · 19 citations
Fast On-device LLM Inference with NPUs· 2025 · 16 citations
Traveling Salesman Problem· 2013 · 297 citations

手の中のVFM：モバイル拡張現実のリアルタイムシーン理解の最適化

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Social Feed

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study

Also consider