EfficientViT: Lightweight Multi-Scale Attention for On-Device Semantic Segmentation
Paper
•
2205.14756
•
Published
•
1
EfficientViT is a new family of high-resolution vision models with novel multi-scale linear attention. As such, EfficientViT delivers remarkable performance gains over previous state-of-the-art models with significant speedup on diverse hardware platforms, including mobile CPU, edge GPU, and cloud GPU.
Original paper: EfficientViT: Multi-Scale Linear Attention for High-Resolution Dense Prediction
EfficientViT is a new family of vision models for efficient high-resolution dense prediction. The core building block of EfficientViT is a new lightweight multi-scale linear attention module that achieves global receptive field and multi-scale learning with only hardware-efficient operations.
Model Configuration:
| Model | Device | Model Link |
|---|---|---|
| EfficientViT-L2 | N1-655 | Model_Link |
| EfficientViT-L2 | CV72 | Model_Link |
| EfficientViT-L2 | CV75 | Model_Link |