Bendi新闻
>
Chinese Startup Unveils AI Video Software to Rival OpenAI’s Sora

Chinese Startup Unveils AI Video Software to Rival OpenAI’s Sora

7月前

Shengshu-AI claims its Vidu software can develop high-quality videos lasting up to 16 seconds, far surpassing previous Chinese text-to-video models.

A Chinese startup has unveiled an artificial intelligence-powered system capable of generating high-definition videos lasting up to 16 seconds, marking a major breakthrough for China’s AI industry as it races to catch up with the United States’ leading firms.

Shengshu-AI, a Beijing-based startup that was founded only last year, presented the new system — which it has named Vidu — at the Zhongguancun Forum in Beijing on Saturday, describing it as China’s “first long-duration, high-consistency, and highly dynamic video generation model.”

Many in China have been quick to dub Vidu China’s answer to Sora, the text-to-video model created by OpenAI that sent shockwaves around the world when it was unveiled in February.

For now, it appears that Vidu is still some way from matching Sora’s capabilities. According to Shengshu-AI, Vidu can generate high-definition videos lasting up to 16 seconds, whereas Sora can generate 60-second clips.

A GIF from a video clip powered by Vidu.

But this would still put Vidu at the very cutting edge of the rapidly evolving AI-generated content field. Most of the leading text-t0-video models, including Pika and Gen-2, only produce clips lasting up to 4 seconds.

Unlike those models, Vidu is not yet publicly available, and Shengshu-AI has yet to confirm when it will be formally launched. But the company performed a live demonstration of the system at the forum and said it was open to working with partners to further fine-tune its technology.

Shengshu-AI is one of many startups to have emerged during the frenzy of AI-related investment in China since the release of OpenAI’s ChatGPT in late 2022.

The firm was founded in March 2023 with Zhu Jun, a leading AI researcher at Beijing’s prestigious Tsinghua University, joining as chief scientist. It has since raised over 100 million yuan ($14 million) from investors, including the Chinese tech giants Ant Group and Baidu.

At the Zhongguancun Forum, Zhu said that Vidu was capable of generating scenes that are consistent with the laws of physics and contain rich details, such as realistic shadow effects and facial expressions.

In another nod to Shengshu-AI’s ambitions to rival OpenAI, the live demonstration of Vidu that followed featured a video almost identical to the one used to launch Sora — a clip of a car driving along a mountain road.

A GIF from a video clip powered by Vidu.

A GIF from a video clip powered by Sora.

The primary technology underpinning Vidu is the Universal Vision Transformer, which combines two AI models: Transformer and Diffusion. It is similar to Sora’s Diversity in Transformation architecture, but Shengshu-AI claims that its research team developed its system before OpenAI, releasing a related paper in September 2022.

“After Sora’s release in February, we found that our technical roadmaps are highly aligned, and we became even more determined to press forward with our own research,” Zhu said at the forum.

The release of Sora earlier this year astonished many in China, as the technical challenges involved in generating AI video far surpass those involved in creating text and still images. The hashtag “Sora” received over 100 million views on the Chinese microblogging platform Weibo within a week of the product’s launch.

Within China’s AI industry, there were fears that the launch of Sora showed that the gap between Silicon Valley and China was widening. But Shengshu-AI has been bullish about its ability to catch up with the U.S.’s market leaders.

As recently as February, Vidu was reportedly only capable of generating 4-second clips, but that has increased fourfold in just a few months. In March, Shengshu-AI’s CEO, Tang Jiayu, told domestic media: “It’s certain that the model can reach Sora’s level this year, though it’s difficult to say whether it will take three months or six months.”

A GIF from video clips powered by Vidu.

With its demonstration of Vidu, Shengshu-AI has proved itself a leader in China’s AI sector, Chen Chen, a partner at consultancy Analysys, told domestic media. Yet Sora remains far ahead in terms of the duration, diversity, and richness of its videos, Chen added.

China’s tech industry continues to invest heavily in AI content generation. Major AI models including ChatGPT, Stable Diffusion, and Midjourney are unavailable in China, leaving a large hole in the market for domestic firms to fill.

In recent months, major tech firms including ByteDance, Kuaishou, Tencent, and SenseTime, as well as a host of smaller players, have reported progress in developing text-to-video AI tools. However, several have stressed that their products remain in their infancy.

According to market researchers iResearch, the value of China’s AI-generated content market is predicted to grow at 87% annually for the remainder of the decade, twice the speed of the global market.

(Header image: Shengshu Technology and Tsinghua University launch Vidu, a text-to-video model, at the 2024 Zhongguancun Forum in Beijing, April 27, 2024. CNS)


Download the new Sixth Tone app at the App Store or Google Play
APK file for Android:
https://image4.sixthtone.com/pkg/sixthtone.apk
(Copy URL and open in browser)

微信扫码关注该文公众号作者

来源:Sixth Tone

相关新闻

For Chinese Journalists, An Uphill Battle to Scrutinize AITesla to Supply Cars to Chinese Local Gov’t for First TimeMy Child Spent a Fortune on a Chinese Video Game. What Now?智谱AI版Sora来了!人人免费不限次,有手机就能玩,API也开放了Going for Gold: The Chinese Athletes to Watch at Paris 2024英伟达布局AI视频,Sora风头快被抢完了谷歌2小时疯狂复仇,终极杀器硬刚GPT-4o!Gemini颠覆搜索,视频AI震破Sora炸裂AI技术Sora背后:奥特曼清单法OpenAI甩出AI模型Sora,做视频的我又要失业了……领先Sora的三家AI视频公司,能让你的图片变大片!让四郎开口唱“朕EMO啦”,硬刚Sora的国产AI视频工具爆红!Chinese Parents Turn to ‘Magic Potions’ to Help Kids Run FasterStable Diffusion核心团队被曝集体离职;微软利用AI Agent复现Sora丨AIGC日报Sora大火!AI成各顶级名校新晋爆火专业!下一代留学生该如何选择专业?Chinese Hit the Slopes to Escape Brutal Summer Heat谢赛宁对话Sora 负责人:AI 视觉的基础是对压缩图像的学习Chinese Parents Falling Prey to Dubious Myopia ‘Miracle Cures’Haitang, the Lost Chinese Whale, Finally Swims to Freedom常见中国签证类型,应该如何选择?Common Chinese Visa Types: How to Choose?How Does Chinese Media Write About AI?消息称软银拟投资近10亿美元强化生成式AI运算能力;Sora进一步引爆全球算力需求,A股公司加速布局丨AIGC日报走向全民的疫苗,以及后 Sora 时代的「AI 换脸术」|每周播报The Chinese Factory Tearing Love to ShredsSora横空出世,AI创业者和投资人们一夜无眠
logo
联系我们隐私协议©2024 bendi.news
Bendi新闻
Bendi.news刊载任何文章,不代表同意其说法或描述,仅为提供更多信息,也不构成任何建议。文章信息的合法性及真实性由其作者负责,与Bendi.news及其运营公司无关。欢迎投稿,如发现稿件侵权,或作者不愿在本网发表文章,请版权拥有者通知本网处理。