#VLA
19 notes
- VLAS: Vision-Language-Action Model With Speech Instructions For Customized Robot Manipulation
- MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation
- DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution
- Accelerating Vision-Language-Action Model Integrated with Action Chunking via Parallel Decoding
- SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics
- GR00T N1: An Open Foundation Model for Generalist Humanoid Robots
- HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model
- CogACT: A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation
- Real-World Robot Applications of Foundation Models: A Review
- Improving Vision-Language-Action Model with Online Reinforcement Learning
- Knowledge Insulating Vision-Language-Action Models: Train Fast, Run Fast, Generalize Better
- OpenHelix: A Short Survey, Empirical Analysis, and Open-Source Dual-System VLA Model for Robotic Manipulation
- $\pi_{0.6}$: A VLA That Learns From Experience
- Diffusion-VLA: Scaling Robot Foundation Models via Unified Diffusion and Autoregression
- TraceVLA: Visual Trace Prompting Enhances Spatial-Temporal Awareness for Generalist Robotic Policies
- $\pi_0$: A Vision-Language-Action Flow Model for General Robot Control
- Scaling Proprioceptive-Visual Learning with Heterogeneous Pre-trained Transformers
- Vision-Language-Action Models
- From LLMs to Actions: Latent Codes as Bridges in Hierarchical Robot Control