On-device AI

Creator: Seonglae Cho
Created: 2023 Jul 19 16:5
Edited: 2025 Jun 2 14:58

Mobile AI, Edge AI

Edge AI Tools

Mobile LLMs below 1B parameters (or SmolLM2, LLaMA-1B, Qwen 1B, Hymba)

OmniVision-968M: World's Smallest Vision Language Model
Pocket-size multimodal model with 9x token reduction for on-device deployment
MobileLLM - a facebook Collection
Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905
HuggingFaceTB/SmolLM2-1.7B-Instruct · Hugging Face
Vision models (256M, 500M, 2.2B)
ds4sd/SmolDocling-256M-preview · Hugging Face
SmolVLM - small yet mighty Vision Language Model
SmolVLM Grows Smaller – Introducing the 256M & 500M Models!
SmolTalk dataset
HuggingFaceTB/smoltalk · Datasets at Hugging Face

Is a LoRA-like adaptation technique the solution?
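As a rough illustration of why a LoRA-style adapter is attractive on small devices, the sketch below keeps a weight matrix W frozen and learns only a low-rank update B·A, so the effective weight is W + (alpha / r)·B·A. The matrix sizes, rank, and scaling values are hypothetical and chosen only for illustration; they are not taken from any of the models linked on this page.

```python
# Minimal LoRA-style sketch in pure Python (no ML framework assumed).
# All sizes and values below are hypothetical, chosen only for illustration.

def matvec(m, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]

def lora_forward(w, a, b, x, alpha, r):
    """Return W x + (alpha / r) * B (A x); W stays frozen, only A and B train."""
    base = matvec(w, x)
    update = matvec(b, matvec(a, x))
    scale = alpha / r
    return [y + scale * u for y, u in zip(base, update)]

def param_counts(d_out, d_in, r):
    """Trainable parameters: full fine-tuning vs. a rank-r adapter."""
    return d_out * d_in, r * (d_in + d_out)

# Tiny worked example: 2x3 frozen weight, rank-1 adapter.
W = [[1.0, 0.0, 0.0],
     [0.0, 1.0, 0.0]]
A = [[1.0, 1.0, 1.0]]      # shape (r, d_in)
B = [[0.5], [0.0]]         # shape (d_out, r)
x = [1.0, 2.0, 3.0]

y = lora_forward(W, A, B, x, alpha=2.0, r=1)
full, adapter = param_counts(d_out=2048, d_in=2048, r=16)
```

With these hypothetical sizes, the adapter trains 65,536 parameters against 4,194,304 for the full matrix, so several task-specific adapters can be stored next to a single frozen base model at little extra cost.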

Introducing Apple’s On-Device and Server Foundation Models
At the 2024 Worldwide Developers Conference, we introduced Apple Intelligence, a personal intelligence system integrated deeply into…
Meta and Qualcomm team up to run big A.I. models on phones
Large language models (LLMs) are the technology that underpins applications like OpenAI's ChatGPT, which can produce text that resembles human output.

The cognitive core could be extremely small

No Priors Ep. 80 | With Andrej Karpathy from OpenAI and Tesla
Andrej Karpathy joins Sarah and Elad on this week's No Priors. Andrej, who was a founding team member of OpenAI and the former Tesla Autopilot leader, needs no introduction. In this episode, Andrej discusses the evolution of self-driving cars, comparing Tesla's and Waymo's approaches, and the technical challenges ahead. They also cover Tesla's Optimus humanoid robot, the bottlenecks of AI development today, and how AI capabilities could be further integrated with human cognition. Andrej shares more about his new mission Eureka Labs and his insights into AI-driven education and what young people should study to prepare for the reality ahead.
Show Notes:
0:00 Introduction
0:33 Evolution of self-driving cars
2:23 The Tesla vs. Waymo approach to self-driving
6:32 Training Optimus with automotive models
10:26 Reasoning behind the humanoid form factor
13:22 Existing challenges in robotics
16:12 Bottlenecks of AI progress
20:27 Parallels between human cognition and AI models
22:12 Merging human cognition with AI capabilities
27:10 Building high performance small models
30:33 Andrej's current work in AI-enabled education
36:17 How AI-driven education reshapes knowledge networks and status
41:26 Eureka Labs
42:25 What young people study to prepare for the future
 
 
