The Future of Data in AI Development

On March 7, Yao Qian, the director of the Science and Technology Supervision Bureau of the CSRC, wrote in China Finance that it was recommended to focus on the…

The Future of Data in AI Development

On March 7, Yao Qian, the director of the Science and Technology Supervision Bureau of the CSRC, wrote in China Finance that it was recommended to focus on the development of synthetic data industry based on AIGC technology. With higher efficiency, lower cost and higher quality as the “incremental expansion” of the data element market, it helps to create data advantages for the future development of AI. In terms of strengthening the high-quality supply of data elements, we should make overall plans for self-reliance and opening-up. Consider establishing filtered domestic mirror sites for specific data sources such as Wikipedia and Reddit for use by domestic data processors.

Yao Qian, Director of the Science and Technology Regulatory Bureau of the CSRC: Focus on the development of synthetic data industry based on AIGC technology

Interpret the above information:


Yao Qian, the director of the Science and Technology Supervision Bureau of the CSRC, recently emphasized the importance of synthetic data industry based on AIGC technology. According to Qian, this technology can provide higher efficiency, lower costs, and higher quality of data for AI development. He also highlighted that the development of the synthetic data industry can serve as an incremental expansion of the data element market, further benefiting the future development of AI.

In strengthening the high-quality supply of data elements for AI, Qian advises considering overall plans for self-reliance and opening-up. Specifically, in terms of expanding data sources, the establishment of filtered domestic mirror sites for specific data sources such as Wikipedia and Reddit for use by domestic data processors is recommended.

The message succinctly highlights the benefits of AIGC technology for AI development. The use of synthetic data in AI modeling has already proven to be an effective way to reduce the time and cost of data collection without compromising quality. By expanding the synthetic data industry based on AIGC technology, data processors can greatly enhance the development of AI.

Moreover, the message emphasizes the importance of strengthening data self-reliance while promoting international data sharing. Establishing filtered domestic mirror sites for commonly used data sources is a practical step to maintain data security and privacy without sacrificing access to important data according to Qian.

In conclusion, the message presents a forward-looking vision of data use in AI development. The use of AIGC technology and synthetic data has the potential to revolutionize AI development by providing better-quality data at a lower cost, thereby expanding the data market further. At the same time, it also highlights the importance of domestic self-reliance in data collection and encourages specific action to achieve this goal.

This article and pictures are from the Internet and do not represent SipPop's position. If you infringe, please contact us to delete:https://www.sippop.com/9608.htm

It is strongly recommended that you study, review, analyze and verify the content independently, use the relevant data and content carefully, and bear all risks arising therefrom.