Note: The article was first published on TechNode China written by Evan Huang and translated by Zinan Zhang.
In the dynamic AI era, venture capital is increasingly attuned to the transformative potential of this technology. As generative AI advances in creating text, images, and videos, a plethora of opportunities and challenges are emerging. This article explores the pivotal role of the Scaling Law, the emergence of super apps, and the promising future of AI-driven innovations. Highlighting insights from industry leaders, it underscores the potential for AI to revolutionize various sectors and entrepreneurial ventures, providing valuable directions for future venture capital investments.
The training and inference stages of large models demand substantial computational resources. The Scaling Law suggests that significant advancements in intelligence are achieved through consistent investment in vast amounts of data and powerful computing, provided the algorithmic architecture remains stable.
OpenAI, a strong proponent of the Scaling Law, has showcased the potential of generative AI across various fields by leveraging transformer architecture, extensive training data, and considerable computational resources.
Recently, Kevin Scott, Microsoft CTO, mentioned in an interview with Pat Grady and Bill Coughran of Sequoia Capital that they have yet to observe diminishing returns from scaling. He announced that the next generation of OpenAI models would soon be available, offering cheaper, more powerful solutions capable of tackling more complex problems. “This is the story with each generation of models as we scale up,” he remarked.
On May 18, Yang Zhilin, founder of Moonshot AI, discussed the computational aspects of the Scaling Law. He noted that initial improvements in model performance are driven by enhanced computational power and efficiency. However, further advancements require increased computational investment and ensuring that this investment effectively translates into intelligence. “This involves two issues: sustaining computational investment and maximizing the intelligence output of each computation unit,” he explained.
In an interview with TechNode, Wu Yunsheng, vice-president of Tencent Cloud, shared his perspective. “Currently, there are different viewpoints, including realistic and idealistic views. Some believe the Scaling Law has reached a plateau, where continued investment yields diminishing returns. Others argue it is still in a phase of rapid development.” He emphasized that the Scaling Law remains significant, citing rapid progress in multimodal research over the past year. “In this field, various capabilities improve significantly with added data or computing power. We will continue to explore and observe its development and changes across different scenarios and technologies,” he added.
As of March 28, 2024, there are 117 large models registered with the Cyberspace Administration of China, including Baidu’s ERNIE Bot, Alibaba’s Tongyi Qianwen, and the open-source ChatGLM. The rapid development of AI large models is becoming a key driver of innovation and breakthroughs in super applications.
As these large model technologies mature and improve, they are gradually permeating various industries, sparking a range of entrepreneurial opportunities. From healthcare to fintech, from smart manufacturing to cultural creativity, the application potential of AI is limitless.
Zhou Zhifeng, Managing Partner of Qiming Venture Partners, pointed out at the World Artificial Intelligence Conference in Shanghai that compared to the timeline of application deployment during the internet wave, he predicts that the explosion of applications in the current AI wave will occur significantly earlier. Currently, generative AI is gaining substantial user favor in three “C fields” — Copilot, Creativity, and Companionship — showing a development trajectory similar to internet applications and transitioning from efficiency-enhancing applications to those aimed at providing enjoyment. He noted that the internet reduced the marginal cost of information distribution to almost zero, while the core of generative AI is to reduce the marginal cost of digital content creation to nearly zero, indicating that AI technology is bound to release enormous value.
When discussing the future of AI-driven super apps, Zhang Fan, COO of Zhipu AI, expressed optimism, arguing that although creating super apps is not easy, the AI era will see many unimaginable applications emerge. This process requires advancements in computing power, networks, hardware levels, and user habits, following the principle of gradual development from small-scale applications. Zhang emphasizes that by embracing and utilizing existing AI technologies to gradually transform current applications and products, the future will undoubtedly usher in super apps in the AI era.
Regarding the challenges of implementing generative AI applications, Zhou Zhifeng believes that reducing the cost of model usage necessary for the widespread adoption of generative AI, improving the effectiveness of large models, and enhancing user retention rates of generative AI applications are crucial. Since the growth period from zero to one for generative AI application companies is longer than in other fields, they need to overcome both TPF (Technology-Product Fit) and PMF (Product-Market Fit) challenges simultaneously. Therefore, the founding team needs greater patience, determination, and understanding of the technology, the product, and the world.
There were 45 intelligent robots, including 25 humanoid robots, showcased at WAIC this year. A video of a humanoid robot walking on the Great Wall was repeatedly played at the event. The humanoid robot L2 in the video has successfully conquered the steep slopes of the famous structure, achieving steady walking on it.
At the recent Huawei Developer Conference 2024, Zhang Ping’an, Executive Director and CEO of Huawei Cloud, unveiled the Pangu Model 5.0. During the introduction of the Pangu model for embodied AI, he showcased the broad potential of the KUAVO humanoid robot, equipped with the Pangu model, in both industrial and household scenarios, attracting widespread attention.
Chen Jianyu, an assistant professor at Tsinghua University and founder of the humanoid robot company Robot Era, believes that humanoid robots will be the ultimate form of general-purpose robots. This is not only because the pure humanoid form with two legs and two arms is more compatible with existing environments, but also because it’s easier to transfer training data from the human world. Technically, an end-to-end integration of the brain and cerebellum will be a crucial research direction in the future. Using human language as the interface between the brain and cerebellum is limited, and it is better to borrow from the end-to-end joint training process of autonomous driving, where physical layer data is directly fed back to the text and image models, significantly enhancing overall model performance.
Last week, Tencent, in collaboration with Shanghai Jiao Tong University, released the Top Ten Trends of Large Models 2024: Entering the Era of ‘Machine External Brain’ report, which pointed out that the combination of robot technology and large models provides a “body” for the machine’s external brain. In the future, humanoid robots will not only be able to perform physical tasks but also interact with humans more naturally and intuitively, endowing physical products with intelligent “brains”.
The report states that the development of humanoid robots relies on two major technical pillars: motion control and task training. The application of large models has greatly improved the robots’ learning efficiency and ability to execute complex tasks. The integration of these technologies not only drives technological innovation in humanoid robots but also opens possibilities for their widespread deployment in practical applications. This also heralds a future of human-machine symbiosis, where humanoid robots will play increasingly important roles in various industries, from household services to high-risk industrial operations, showcasing their efficiency and safety. Through continuous technological innovation and application expansion, humanoid robots will play a key role in improving the quality of life and work efficiency, further integrating into human daily life as indispensable assistants and the ultimate carriers of artificial intelligence.
In conclusion, the era of AI is not just a technological revolution but a transformative force that is redefining the landscape of innovation and investment. As we look to the future, the challenges of implementing generative AI applications remain significant. The need to reduce costs, improve effectiveness, and enhance user retention rates is crucial for the widespread adoption of these technologies. However, the potential rewards are immense, offering a glimpse into a world where AI is not just a tool but an integral part of our daily lives, from household services to high-risk industrial operations.
In summary, the dynamic AI era presents a wealth of opportunities for venture capital and entrepreneurial ventures. As we continue to explore and invest in AI-driven innovations, the future holds huge promise for transforming industries, enhancing human-machine interactions, and ultimately, improving the quality of life for all.
]]>Autotech startup Nullmax said on Tuesday that its latest generation of autonomous driving hardware and software package, allowing cars to navigate complex urban environments autonomously with features such as lane changing, will cost users as little as “several thousand RMB.”
Why it matters: Shanghai and Fremont-based Nullmax is among the few players in the self-driving vehicle space claiming that cars will be able to function by themselves in urban scenarios without maps and lidar. Instead, the company said artificial intelligence models can be used to enable cars to navigate from points A to B.
Details: Xu told a press conference that his company is advocating a “pure vision” and “end-to-end” approach, as Tesla has been doing and many are following its lead, which involves deep neural networks, using cameras only to perform autonomous driving functions (our translation).
Context: Chinese EV startups led by NIO, Xpeng Motors, and Li Auto have been ramping up efforts to transition from “rule-based” designs to an “end-to-end” autonomous driving method. Meanwhile, traditional car manufacturers are tapping into the power of AI by working with tech giants such as Huawei and NVIDIA, as well as startup unicorns like Horizon Robotics and Momenta.
READ MORE: Former Tesla engineer shares thoughts on end-to-end autonomous driving at WAIC 2024
Editor’s note: ‘Landing AI’ is a series of special reports focusing on the field of Artificial Intelligence curated by TechNode. By investigating the development of AI landing in China and the behind-the-scenes stories of the industry, we’re going to dive deeper into everything that’s possible under the new wave of AI.
]]>On July 12, the Bilibili World 2024 event opened at the National Exhibition and Convention Center, attracting ACG (Anime, Comic, Game) fans from across China who traveled to Shanghai to become part of the Chinese streaming platform’s annual celebration.
The three-day event this year featured over 700 exhibitors and 800 ACG-content creators. The Lenovo booth showcased two new AI products: the YOGA Air 14c AI PC and the Legion Y9000P AI PC.
Why it matters: Bilibili World 2024 draws in ACG fans to celebrate shared interests, which meant the AI-powered Lenovo PCs garnered significant attention.
Details: The Lenovo YOGA Air 14c AI PC is designed for work, while the Lenovo Legion Y9000P AI PC is principally marketed to gamers.
Context: Held since 2017, Bilibili World is a community-focused annual large-scale in-person event organized by Bilibili, a leading Chinese video-sharing website. On June 29, the first round of the ACG event’s ticket sales saw 27,000 VIP tickets sell out within 30 seconds, with 100,000 general admission tickets gone within one minute, according to the company’s own app.
Kuaishou, one of the main rivals to TikTok’s China sibling Douyin, showcased several fresh features for its text-to-video model Kling AI at the World Artificial Intelligence Conference (WAIC) in Shanghai last week, including the ability to generate videos up to 10 seconds.
At WAIC, visitors queued to experience the Sora-like tool that is currently available by invitation only. Users sent simple prompts to generate videos, such as “a panda eating salmon” and “the Mona Lisa putting her glasses on with her hands”, with the resulting clips demonstrating Kling AI’s ability to render the inputs almost perfectly.
AI-generated videos have subsequently flooded the Chinese internet, with Kling AI being used to create clips featuring characters from historical films undertaking modern day tasks and spawning multiple memes.
A video featuring Rong momo, a character from “My Fair Princess” who has become a well-known internet meme in China, feeding Princess Ziwei a chicken drumstick has gone viral on social platforms these days. The AI-generated video is based on the drama’s most famous scene in which Rong momo tortures Ziwei by repeatedly stabbing her with a needle.
Why it matters: Kuaishou will be hoping that its suite of self-developed large model series, including language model KwaiYii, image-focused Kolors, and video-centered Kling, will give it an edge as it continues to challenge to ByteDance’s Douyin and TikTok.
Details: More than 500,000 users have applied to help beta test Kling, senior vice president of Kuaishou Gai Kun revealed last weekend at a WAIC forum, with the number of videos generated reaching 7 million as of now. The Sora rival’s hype is such that English language posts teaching users outside of China how to apply for a Kling AI trial can be found on X, formerly known as Twitter.
Context: Kuaishou, China’s second-largest short video company, launched its AI strategy in 2023, according to CEO Cheng Yixiao, who said that generative AI has a “very rich combination of business scenarios and huge value potential” for the content platform.
Editor’s note: ‘Landing AI’ is a series of special reports focusing on the field of Artificial Intelligence curated by TechNode. By investigating the development of AI landing in China and the behind-the-scenes stories of the industry, we’re going to dive deeper into everything that’s possible under the new wave of AI.
]]>