China's Baidu Releases Its AI Code to Foster Innovation

China's Baidu Releases Its AI Code
Baidu, often referred to as "China's Google," has joined the ranks of U.S. tech giants by releasing some of its proprietary AI code. This move signifies a growing trend in the tech industry where leading companies are sharing their advancements in artificial intelligence to foster public research and encourage innovation.
Baidu's Deep Learning Initiatives
Baidu is a major player in the field of deep learning, a powerful machine-learning technology that enables computers to perform complex tasks such as understanding speech and recognizing objects in images with remarkable accuracy. The company's commitment to this technology is evident in its recent release of key code that optimizes the performance of its AI software.
Deep Speech 2: A Breakthrough in Speech Recognition
One of Baidu's most notable achievements in deep learning is the development of Deep Speech 2, an advanced speech-recognition system. This system has demonstrated capabilities that rival human performance in recognizing short sentences, a significant feat in the realm of AI. For Baidu, this technology is particularly valuable as it enhances user accessibility to its services, especially on mobile devices. Given the complexities of typing Chinese characters on smartphones, voice-based interaction offers a more efficient and user-friendly alternative for searching the web and sending messages.
Understanding Deep Learning
Deep learning operates by feeding vast amounts of data into large, simulated neural networks. These networks learn to identify patterns and make predictions, allowing computers to perform tasks like transcribing voice or recognizing objects in images with near-perfect accuracy. The process involves presenting the network with input, such as audio recordings or images, and allowing it to learn from these examples over time.
Warp-CTC: Accelerating AI Development
Baidu's released code, known as Warp-CTC, is an optimized implementation of a deep-learning algorithm. Its primary advantage lies in its ability to run efficiently on modern computer chips, significantly speeding up the development and deployment of AI applications. Startups like Nervana, which provides deep-learning frameworks to other companies, are already integrating Warp-CTC into their technology offerings.
The Open Source AI Movement
Baidu's decision to make its deep-learning code publicly available aligns with a broader industry movement. Companies like Facebook and Google have also been releasing their AI technologies, driven by a desire to benefit from public research and to foster a collaborative ecosystem. By sharing their code, these tech giants aim to encourage researchers and startups to develop compatible machine-learning systems, potentially leading to future acquisitions and partnerships.
The Role of Data in AI
While code is a crucial component of AI development, the availability of large datasets is equally important. Companies like Facebook, Google, and Baidu possess vast amounts of data that are essential for training AI models. There is a growing trend towards releasing some of this data to the public, as exemplified by Yahoo's data dump aimed at improving computer understanding of user preferences. This open approach to data sharing further fuels the advancement of AI technologies.
Conclusion
Baidu's release of its AI code, particularly Warp-CTC, marks a significant contribution to the open-source AI community. It underscores the company's commitment to advancing deep learning and its recognition of the benefits of collaborative development. As more tech giants embrace open-source practices, the pace of innovation in artificial intelligence is expected to accelerate, leading to more sophisticated and accessible AI applications in the future.
Original article available at: https://www.technologyreview.com/2016/01/14/163873/chinas-baidu-releases-its-ai-code/