
Humyn Labs Boosts Development of Physical AI with $20 Million Investment
Detailed Analysis
Humyn Labs Commits $20 Million to Expand AI Data Collection Operations
Artificial intelligence firm Humyn Labs has committed $20 million to expand its data collection operations across India, Southeast Asia, Latin America, and the West Asia. The capital will finance the infrastructure required to train physical artificial intelligence systems (robots) and voice models. Co-founders Manish Agarwal and Ishank Gupta channeled the investment to organize and validate human intelligence.
Humyn Labs is leveraging its revenue to invest $20 million into building high-quality datasets for physical AI, focusing on egocentric and conversational voice data. Egocentric in artificial intelligence refers to data from a first-person view captured by a human or agent while interacting with its surroundings. The firm focuses on source-first data collection, recording first-person human activity, visuals, and movements within commercial, agricultural, and residential environments.
The datasets capture how humans navigate and physically interact with surroundings to train physical AI systems. Humyn Labs is expanding its voice data infrastructure to encompass 33 languages, dialects, accents, and code-switching patterns. This expansion addresses the utilization of voice for real-world commands and human-robot interactions.
| Expansion | Current | Projected | | --- | --- | --- | | Languages, dialects, accents, and code-switching patterns | 0 | 33 | | Global AI training dataset market size (2025) | - | $3.59 billion | | Global AI training dataset market size (2026) | - | $4.44 billion | | Global AI training dataset market size (2034) | - | $23.18 billion | | India market (2026) | - | $190 million |
Humyn Labs will establish robotics labs to construct simulation environments and world models. This unit integrates real-world data with training frameworks to deploy physical AI systems. In AI systems, a world model is an internal representation of how the real world works. For robotics labs, these world models are typically used to let robots learn in simulation before actual deployment.
The company is utilizing a decentralized network across the global south to source data. The current pipeline is driven by roughly 15–18 customers. While Manish Agarwal did not disclose any client names, he said that it targets top-tier labs where successful proof-of-concepts can quickly scale into $10–$15 million contracts.
Humyn AI's growth is currently uneven. The company has delivered approximately $2 million in revenue over the last few months, operating at an annualized run rate of around $4–5 million. Manish Agarwal said that Humyn has a sales pipeline of around $45–50 million. Based on current execution, the company expects to reach a $100 million Annual Recurring Revenue (ARR) by the end of December 2026.
Investor Takeaway
Investors should consider Humyn Labs as a potential player in the AI development space.
More in General

Anthropic Deploys Novel AI Model with Limited Cybersecurity Capabilities Compared to Mythos
AvenuesAI Develops On-Premise Small Language Models in Response to Increasing Data Privacy Concerns
