CONSPECTUS:The data-driven paradigm,represented by the famous machine learning paradigm,is revolutionizing the way materials are discovered.The inductive nature of the data-driven approach gives it great speed of pred...CONSPECTUS:The data-driven paradigm,represented by the famous machine learning paradigm,is revolutionizing the way materials are discovered.The inductive nature of the data-driven approach gives it great speed of prediction but also brings with it a heavy reliance on material data.However,unlike its success with text and images,which are supported by big data,materials data tend to be small data.Building a large database of materials is a good solution but not a permanent one.The cost of materials data is much higher than that of text or images,and the size of the materials database at this stage is far from sufficient.We will continue to face a shortage of materials data for a long time to come,making small data approaches necessary for machine learning based materials discovery.展开更多
基金supported by the National Key Research and Development Program of China(2021YFA1500703,2022YFA1503103,2022YFB3807200)Natural Science Foundation of China(22033002,T2321002,22373013)+2 种基金Natural Science Foundation of Jiangsu Province,Major Project(BK20232012,BK20222007)Jiangsu Provincial Scientific Research Center of Applied Mathematics(BK20233002)the Fundamental Research Funds for the Central Universities.
文摘CONSPECTUS:The data-driven paradigm,represented by the famous machine learning paradigm,is revolutionizing the way materials are discovered.The inductive nature of the data-driven approach gives it great speed of prediction but also brings with it a heavy reliance on material data.However,unlike its success with text and images,which are supported by big data,materials data tend to be small data.Building a large database of materials is a good solution but not a permanent one.The cost of materials data is much higher than that of text or images,and the size of the materials database at this stage is far from sufficient.We will continue to face a shortage of materials data for a long time to come,making small data approaches necessary for machine learning based materials discovery.