NexaAI Releases OmniNeural-4B for On-Device AI
The new 4-billion-parameter model is designed for 'any-to-any' multimodal tasks and optimized to run efficiently on mobile hardware.

NexaAI has introduced OmniNeural-4B, a new 4-billion-parameter model designed to bring versatile AI capabilities directly to consumer devices. Its compact size and specific optimizations target the growing demand for powerful, locally-run artificial intelligence.
Unlike models that handle specific input-output pairs like text-to-image, OmniNeural-4B is an 'any-to-any' multimodal system. This architecture allows it to process and generate a wide combination of data types, which could enable more complex and integrated on-device applications.
Designed for the Edge
The core focus of this release is performance on edge devices. NexaAI highlights that OmniNeural-4B is specifically optimized for the Neural Processing Units (NPUs) found in modern Snapdragon chips and Android hardware. This approach bypasses the need for cloud servers, potentially offering faster, more private, and offline-capable AI experiences directly on a user's phone.
The model is available on Hugging Face under a permissive Creative Commons BY 4.0 license, allowing developers and researchers to freely experiment with building sophisticated multimodal features into mobile applications.
Sources
- Visit
NexaAI/OmniNeural-4B
Hugging Face
0 comments
No comments yet. Be the first to weigh in.
More in Any-to-Any

MiniMax Releases M3, a Multimodal MoE Model
The new open-weight model from MiniMax AI combines vision, coding, and reasoning using a Mixture-of-Experts architecture.
Google Releases Gemma 4 12B Multimodal Model
The new 12-billion-parameter open model from DeepMind introduces a unified 'any-to-any' architecture for advanced multimodal tasks.
Google Releases Gemma 4, a 12B 'Any-to-Any' Model
The new 12-billion-parameter model from Google DeepMind is designed to handle a flexible mix of data types, moving beyond traditional text and image inputs.