当前位置:主页 > 快讯 >

Chinas Houmo AI Debuts Edge AI Chip for Large Models at WAIC

时间:2025-07-28 11:36:27

  
 

  
AsianFin -- China’s Houmo Intelligence unveiled a new high-efficiency AI chip tailored for large models on edge devices at the 2025 World Artificial Intelligence Conference , as the startup sharpens its pivot from intelligent driving to next-gen edge computing amid intensifying demand for localized AI inference.

  The Houmo Manjie M50, launched in Shanghai during the conference, is China’s first AI chip to integrate memory and computation in a single architecture specifically designed for large-scale inference at the edge. With 160 TOPS of INT8 compute and 100 TFLOPS of bFP16 performance, plus up to 48GB of memory and a bandwidth of 153.6 GB/s, the M50 runs models from 1.5 billion to 70 billion parameters—all within 10 watts of power. The chip is aimed at devices such as PCs, smart speakers, and robots, offering plug-and-play large model capabilities.

  “The M50 is just the beginning,” said CEO Wu Qiang at a media briefing. “Our vision is to make AI computing power as accessible as electricity—embedded in every device, across every industry.”

  The company also rolled out complementary products including the Liqing-series M.2 cards, Limou-series accelerator cards, and compute boxes, targeting everything from consumer devices to intelligent industrial terminals. It’s the latest sign that China’s edge AI race is heating up as generative models move off the cloud and into everyday products.

  Founded in 2020, Houmo initially focused on AI chips for intelligent driving. But by late 2023, Wu concluded the sector was overcrowded and stagnating. “The industry was obsessed with cost competition, and no one believed in L3 autonomy anymore,” Wu said. “We had a chip with strong performance, but the market didn’t want it.”

  Instead, the company saw promise in compute-in-memory —a chip design that breaks the traditional von Neumann architecture by embedding computation directly into memory arrays. This reduces data movement and energy consumption, addressing bottlenecks in bandwidth and latency—particularly relevant for large AI models.

  Houmo re-engineered its first-generation product in under a year, launching the M30 chip for edge large model inference in early 2024. A key vote of confidence came from China Mobile, which used the chip to run a 60-billion-parameter model. In July, the company secured strategic funding from China Mobile’s digital economy funds in Beijing and Shanghai.

  “People questioned why I pivoted,” Wu said. “But survival outweighed pride. Autonomous driving was a dead end. Edge AI is a new frontier—and there’s still space to lead.”

  Houmo’s new accelerator lineup includes the Limou LM5050 and LM5070 cards, equipped with two and four M50 chips respectively, offering up to 640 TOPS of performance for ultra-large model inference. The company’s transition from SRAM-based CIM to DRAM-based PIM further boosts its hardware’s efficiency and scalability.

  Wu says the firm is already developing its next-generation DRAM-PIM AI chip, expected as early as 2026. The chip aims to triple current energy efficiency and support widespread local deployment of models with tens of billions of parameters on devices like tablets and PCs.

  “DRAM-PIM is the next step,” said Wu. “It tightens the bond between memory and compute, unlocking real-time intelligence at the edge.”

  The WAIC 2025 event, themed “Intelligent Era, Shared Future,” featured more than 1,200 global guests—including 12 Turing and Nobel laureates—and showcased over 3,000 innovations. The exhibition space surpassed 70,000 square meters for the first time, drawing more than 800 companies and revealing more than 100 world or China-first product debuts.

  As generative AI evolves, China is increasingly positioning edge AI as a key national focus. Public data suggests China’s computing-in-memory chip market could exceed 110 billion yuan by 2030.

  Houmo’s investors include Sequoia Capital China, Qiming Venture Partners, Matrix Partners China, Lenovo Capital, Walden International, and China Mobile.

  “Edge intelligence is where the future is headed,” Wu said. “We’re not just building chips—we’re building the infrastructure for the intelligent era.”

热点推荐
1 有研粉材:应用于昇腾芯片的新型散热铜

消息,有研粉材在互动平台表示,应用于昇腾芯片的新型散热铜粉出货量目前较为稳定。针对...

2 美国现货比特币ETF单日净流出6.35亿美元

消息,美国现货比特币ETF在5月13日的交易中录得总计6.35亿美元的净流出,其中黑石的IBIT基金单...

3 印度总理莫迪将短暂访问阿联酋,寻求更

消息,印度总理莫迪将于周五对阿联酋进行短暂访问,期间他很可能探讨长期能源供应协议,...

4 纽约法院暂停AAVE 7100万美元以太坊恢复请

消息,纽约联邦法官已暂停对AAVE请求释放与Kelp DAO相关的7100万美元冻结以太坊的裁决,要求双...

5 两艘驶往印度的LPG运输船成功穿越霍尔木

消息,两艘满载烹饪燃料的液化石油气运输船已成功穿越霍尔木兹海峡,成为最新一批无视美...

6 OpenAI报告称仅有有限凭证被窃取

消息,OpenAI报告称,只有有限的凭证从代码库中被窃取,并未产生更广泛的影响。...

7 0x082e...ca88:HYPE最大多头回吐740万美元浮

消息,HYPE最大多头在近一周内回吐浮盈约740万美元,当前价格已跌破39美元,回落至其持仓成...

8 与a16z相关钱包地址再买入50,168枚HYPE

消息,据Lookonchain监测,与a16z相关联的钱包地址0xb5e4于8小时前再次买入50,168枚HYPE,价值约19...

9 黑石ETH ETF净流出9361枚ETH,交易量达4.2亿

消息,黑石在5月13日的ETH ETF净流出9361枚ETH,价值约2110万美元,交易量为4.2亿美元。同时,黑...

10 ZEC最大空头:SP500多单增持72.14枚,持仓规

消息,SP500多单增持72.14枚,约合1,003,689.63美元,持仓规模达到46,651,875.00美元,均价从7,012.41美...

成都来彰科技 蜀ICP备2025134723号-1

资讯来源互联网,如有版权问题请联系管理员删除。