Key Highlights
- Nebius (NBIS) has entered into an agreement to purchase model optimization and inference specialist Eigen AI for roughly $643 million through a combination of cash and Class A shares.
- The acquisition will bring Eigen AI’s optimization capabilities into Nebius Token Factory, the company’s enterprise-focused managed inference solution.
- MIT HAN Lab researchers who founded Eigen AI will launch Nebius’s inaugural Bay Area engineering center.
- Co-developed model implementations from both organizations have already achieved top rankings on Artificial Analysis performance benchmarks.
- NBIS shares climbed 8.51% following the announcement, reaching $150.00, after declining 6.07% in the previous week.
On May 1, 2026, Nebius (NBIS) revealed it has reached an agreement to purchase Eigen AI in a transaction valued at approximately $643 million. The consideration will consist of both cash and Nebius Class A shares, calculated using the company’s 30-day volume-weighted average share price at the time of signing. The stock responded positively, gaining 8.51% to reach $150.00.
The deal is anticipated to finalize in the coming weeks, subject to antitrust approval and customary closing requirements.
Eigen AI specializes in inference optimization and model efficiency for artificial intelligence applications. The company’s solutions enable AI development teams to deploy open-source models with enhanced speed and reduced costs in production environments, eliminating the need for internal optimization infrastructure.
Nebius intends to integrate Eigen AI’s technology seamlessly into Token Factory, its managed inference offering. Token Factory delivers autoscaling API endpoints and fine-tuning capabilities for leading open-source models such as Llama, DeepSeek, Qwen, Gemma, and additional frameworks.
The organizations have previously collaborated on technical initiatives. Prior to the acquisition announcement, they jointly engineered optimized model versions that achieved leading positions on Artificial Analysis, a prominent AI performance benchmarking service.
Eigen AI’s Technical Capabilities and Team
Eigen AI emerged from MIT’s HAN Lab, founded by leading researchers in AI optimization. Co-founders Ryan Hanrui Wang and Wei-Chen Wang have developed two highly influential methodologies in production AI systems.
Ryan’s Sparse Attention research (SpAtten) stands as the most-referenced HPCA publication since 2020. Wei-Chen’s Activation-aware Weight Quantization (AWQ) earned the MLSys 2024 Best Paper Award and has become the industry-standard method for 4-bit model deployment.
Co-founder Di Jin earned his PhD from MIT CSAIL and played a key role in developing Meta’s Llama 3 and Llama 4 post-training processes. He also contributed to the CGPO reinforcement learning from human feedback methodology.
After the transaction closes, the founding team will move to the San Francisco Bay Area to establish Nebius’s first United States-based engineering and research facility.
Market Dynamics in AI Inference
Inference has emerged as the most rapidly expanding segment within the AI compute ecosystem. Industry projections indicate it will account for approximately two-thirds of overall AI computational requirements in 2026.
Efficient inference deployment presents significant technical hurdles. The process encompasses model representation, GPU kernel optimization, and dynamic workload management — capabilities that most organizations lack internally.
Open-source models compound these challenges, as they’re generally released without optimization. Emerging architectures including Mixture-of-Experts and Compressed Sparse Attention create additional complexity around memory management and computational efficiency that demand specialized expertise.
Eigen AI’s comprehensive optimization methodology addresses post-training, fine-tuning, and production inference across all major open-source frameworks. The company’s kernel-level and model-level techniques are engineered to maximize performance from existing hardware infrastructure without requiring additional development resources.





