Papers covered:
AI Agents that Matter
Kyutai Moshi
Meta 3D Gen
Open-TeleVision: Teleoperation with Immersive Active Visual Feedback
PathAlign: A vision-language model for whole slide images in histopathology
LLaRA: Supercharging Robot Learning Data for Vision-Language Policy
Consistency Flow Matching: Defining Straight Flows with Velocity Consistency
Share this post