当前位置:主页 > 要闻 >

ByteDances Volcano Engine Supercharges AI Offerings With Major Model Upgrad...

时间:2025-08-01 15:13:07

  
 

  AsianFin -- ByteDance’s Volcano Engine is accelerating its AI ambitions with a sweeping upgrade of its Doubao large model suite, underscoring the company’s intensifying push into enterprise AI and digital agent solutions amid China’s increasingly competitive cloud landscape.

  On July 30, Volcano Engine launched several new offerings, including the Doubao Image Editing Model 3.0, Doubao Simultaneous Interpretation Model 2.0, and a fully upgraded Doubao Large Model 1.6 series. The upgrades come alongside a broader effort to bolster its AI-native infrastructure and cement its lead in China’s rapidly growing cloud-based large model services market.

  Doubao’s meteoric rise is backed by strong data: Daily token usage surged to 16.4 trillion as of May, representing a 137-fold increase since its debut in May 2024. According to an IDC report, Doubao now leads China’s public cloud large model service market by a wide margin, commanding a 46.4% market share—more than Baidu AI Cloud and Alibaba Cloud combined.

  Volcano Engine, ByteDance’s enterprise tech arm, is aggressively monetizing that growth. In 2024, it generated over RMB 12 billion in revenue and is targeting more than RMB 25 billion in 2025—positioning it to potentially surpass Baidu Cloud’s full-year top line.

  “AI is no longer just a tool—it’s becoming the agent,” said Tan Dai, President of Volcano Engine. “Software is now executing tasks, not just enabling them.”

  At the center of the latest upgrade is Doubao·Image Editing Model 3.0 , which allows complex visual manipulations—like background removal, lighting adjustments, and pose alterations—through natural language prompts. The model is designed for commercial use in advertising, content creation, and e-commerce, and is available to enterprise users via Volcano Ark and to consumers via ByteDance apps like Jimeng and Doubao.

  The new Doubao·Simultaneous Interpretation Model 2.0 slashes latency from 8–10 seconds to 2–3 seconds, thanks to a full-duplex system. It also supports zero-shot voice cloning, allowing for foreign language speech generation in the user’s own voice without prior training data—opening up use cases in international business, media, and education.

  Meanwhile, the flagship Doubao-Seed-1.6-flash model delivers stronger performance in code, math, and reasoning tasks with latency as low as 10ms per token. Token pricing has also been aggressively cut: RMB 0.15 per million input tokens, and RMB 1.5 per million output tokens, slashing costs by up to 70% in enterprise trials.

  Also notable is the multimodal Seed 1.6-Embedding model, which enables joint retrieval across text, image, and video. It currently tops the MMEB_v2 image leaderboard, outperforming rival models including Alibaba’s Qwen2 7B by 5.6 points.

  Volcano Engine is doubling down on open-source as part of its strategy to build a broader ecosystem around AI agents. The core capabilities of its Coze platform—including visual development tool Coze Studio and management suite Coze Loop—were recently open-sourced. Within three days, Coze Studio had amassed over 10,000 GitHub stars.

  To support intelligent agent deployment, the company rolled out a new Responses API with native context management and multimodal support, cutting development time for AI assistants from two days to just one hour. Code requirements have been reduced by 87%, according to internal benchmarks.

  Volcano Engine has also launched HiAgent, a “digital employee” workspace platform that acts as a centralized task hub. It enables personalized interfaces tailored to job roles—sales, HR, operations—integrating enterprise systems and streamlining workflows. The platform is already in deployment at clients including Guangjiao Digital Technology and Xiamen University.

  Zhang Xin, Volcano Engine’s VP, highlighted how HiAgent addresses three key productivity bottlenecks: repetitive rule-based tasks, system switching disruptions, and decision-making blind spots. “The goal is not to replace people, but to help them do more of what matters,” he said.

  Tan Dai sees the current AI wave as the third major computing platform shift, following the PC and mobile eras. He likens Volcano Engine’s journey to a marathon—and the company is only “500 meters in.”

  Looking ahead, ByteDance’s enterprise arm is targeting RMB 100 billion in annual revenue by 2030, provided macroeconomic conditions remain favorable. That growth hinges on converting its massive scale, technical edge, and early-mover advantage into long-term, defensible commercial value.

  “Every link in the chain has to be strong,” Tan said. “In cloud computing, customer needs vary drastically. But in AI, we must do everything better—from the large model, to native infrastructure, to agent deployment.”

  Volcano Engine’s rapid model iteration and open ecosystem approach appear designed to do just that. Whether it can maintain this breakneck pace as competition heats up from rivals like Baidu, Alibaba, and Tencent remains to be seen.

  But for now, ByteDance is making a strong claim to be China’s AI infrastructure leader—not just building large models, but translating them into agents that work.

热点推荐
1 特朗普政府推进170亿美元关税退款门户建

消息,据彭博社发推称:特朗普政府表示,正在推进建设一个基于网络的平台,以处理因美国...

2 以太坊金库 BitMine、Ark Invest 和 Kraken 参与

区块链和人工智能公司 Eightco 的股价在获得来自 BitMine Immersion Technologies、Ark Invest和 Kraken 母公...

3 美联储或考虑罕见会议间降息

消息,据美联储传声筒Nick Timiraos发推称:美国总统表示希望美联储在非例行会议上降息,上一...

4 美企比特币周购2万枚或重塑市场格局

消息,据BitcoinTreasuries发推称:美国上市公司正通过普通股和优先股增发筹集资金,预计很快将...

5 以太坊财资公司还在运营吗?它们最近都

尽管市场低迷,以太坊金库公司 Bitmine 和 Sharplink 仍在推进其 ETH 战略。Bitmine 上周购入 60,976...

6 特斯拉获准将xAI投资转为SpaceX股份

消息,据彭博社发推称:特斯拉获政府批准,将投资马斯克旗下xAI的股份转换为SpaceX少量股权...

7 DTC的2026年代币化计划或将链上结算引入美

DTCC 的核心子公司存托信托公司 已获得美国证券交易委员会 的监管批准,将为其托管的特定资...

8 「BTC OG内幕巨鲸」代理人:油价突破,风

消息,3 月 12 日,「BTC OG 内幕巨鲸」代理人 Garrett Jin 在 X 平台发文表示,「油价已经突破。美...

9 美能源部长称为错误帖子承担全部责任

3月12日消息,美国能源部长赖特今日表示,将为错误的X帖子承担全部责任,会亲自审核以后的...

10 伊朗新最高领袖首次讲话总结:必要时开

消息,伊朗新任最高领袖穆杰塔巴哈梅内伊 12 日通过国家电视台发表首次公开讲话。讲话内容...

成都来彰科技 蜀ICP备2025134723号-1

资讯来源互联网,如有版权问题请联系管理员删除。