AsianFin -- China’s Houmo Intelligence unveiled a new high-efficiency AI chip tailored for large models on edge devices at the 2025 World Artificial Intelligence Conference , as the startup sharpens its pivot from intelligent driving to next-gen edge computing amid intensifying demand for localized AI inference.
The Houmo Manjie M50, launched in Shanghai during the conference, is China’s first AI chip to integrate memory and computation in a single architecture specifically designed for large-scale inference at the edge. With 160 TOPS of INT8 compute and 100 TFLOPS of bFP16 performance, plus up to 48GB of memory and a bandwidth of 153.6 GB/s, the M50 runs models from 1.5 billion to 70 billion parameters—all within 10 watts of power. The chip is aimed at devices such as PCs, smart speakers, and robots, offering plug-and-play large model capabilities.
“The M50 is just the beginning,” said CEO Wu Qiang at a media briefing. “Our vision is to make AI computing power as accessible as electricity—embedded in every device, across every industry.”
The company also rolled out complementary products including the Liqing-series M.2 cards, Limou-series accelerator cards, and compute boxes, targeting everything from consumer devices to intelligent industrial terminals. It’s the latest sign that China’s edge AI race is heating up as generative models move off the cloud and into everyday products.
Founded in 2020, Houmo initially focused on AI chips for intelligent driving. But by late 2023, Wu concluded the sector was overcrowded and stagnating. “The industry was obsessed with cost competition, and no one believed in L3 autonomy anymore,” Wu said. “We had a chip with strong performance, but the market didn’t want it.”
Instead, the company saw promise in compute-in-memory —a chip design that breaks the traditional von Neumann architecture by embedding computation directly into memory arrays. This reduces data movement and energy consumption, addressing bottlenecks in bandwidth and latency—particularly relevant for large AI models.
Houmo re-engineered its first-generation product in under a year, launching the M30 chip for edge large model inference in early 2024. A key vote of confidence came from China Mobile, which used the chip to run a 60-billion-parameter model. In July, the company secured strategic funding from China Mobile’s digital economy funds in Beijing and Shanghai.
“People questioned why I pivoted,” Wu said. “But survival outweighed pride. Autonomous driving was a dead end. Edge AI is a new frontier—and there’s still space to lead.”
Houmo’s new accelerator lineup includes the Limou LM5050 and LM5070 cards, equipped with two and four M50 chips respectively, offering up to 640 TOPS of performance for ultra-large model inference. The company’s transition from SRAM-based CIM to DRAM-based PIM further boosts its hardware’s efficiency and scalability.
Wu says the firm is already developing its next-generation DRAM-PIM AI chip, expected as early as 2026. The chip aims to triple current energy efficiency and support widespread local deployment of models with tens of billions of parameters on devices like tablets and PCs.
“DRAM-PIM is the next step,” said Wu. “It tightens the bond between memory and compute, unlocking real-time intelligence at the edge.”
The WAIC 2025 event, themed “Intelligent Era, Shared Future,” featured more than 1,200 global guests—including 12 Turing and Nobel laureates—and showcased over 3,000 innovations. The exhibition space surpassed 70,000 square meters for the first time, drawing more than 800 companies and revealing more than 100 world or China-first product debuts.
As generative AI evolves, China is increasingly positioning edge AI as a key national focus. Public data suggests China’s computing-in-memory chip market could exceed 110 billion yuan by 2030.
Houmo’s investors include Sequoia Capital China, Qiming Venture Partners, Matrix Partners China, Lenovo Capital, Walden International, and China Mobile.
“Edge intelligence is where the future is headed,” Wu said. “We’re not just building chips—we’re building the infrastructure for the intelligent era.”
消息,OKX 行情数据显示,ETH 跌破 2300 USDT,现报 2299.68 USDT,24H 跌幅 4.55%。...
2 嘉信理财与城堡证券考虑进军预测市场4月19日消息,传统金融巨头嘉信理财和城堡证券都在考虑进军预测市场。 嘉信理财首席执行官...
3 Uniswap 治理页面封锁部分腾讯云 IP 段,使消息,4 月 19 日,据社区用户消息,近日无法访问 Uniswap 治理页面(,向 Uniswap 官方询问后获...
4 男子两日投资575美元变百万消息,据链上分析平台Lookonchain发推称:一名投资者在两天内将 575 美元变成超过 100 万美元,...
5 某地址持有80.2亿枚ASTEROID,浮盈达260万美消息,据 Lookonchain 监测,一地址持有约 80.2 亿枚 ASTEROID,持仓超过 580 天,目前未实现盈利约...
6 Aave遭黑客攻击引发54亿美元资产撤离消息,据链上分析师余烬发推称:超过 54 亿美元资产因安全担忧从 Aave 协议紧急撤离,孙宇晨...
7 DeFi 协议 Ether.fi 暂停 weETH 与 eETH 跨链及存消息,DeFi 协议 Ether.fi 表示,因 Kelp rsETH 事件根本原因尚待查明,已预防性暂停 weETH 与 eETH 的...
8 4.95亿USDT巨额转账消息,据Whale Alert发推称:4.95 亿枚 USDT从一个未知钱包转移至另一个未知钱包。...
9 Google Cloud A4X Max裸金属实例支持5万GPU集群消息,4 月 19 日,Google Cloud宣布其A4X Max裸金属实例可支持高达50,000个GPU的集群,网络带宽是前...
10 专家:美伊均未达“政治临界点”,战争消息,美伊双方均未达到政治临界点,战争可能持续一段时间。伊朗再次关闭霍尔木兹海峡,...
成都来彰科技 蜀ICP备2025134723号-1
资讯来源互联网,如有版权问题请联系管理员删除。