# Machine learning for complete intersection Calabi-Yau manifolds: a methodological study

We revisit the question of predicting both Hodge numbers h^1,1 and h^2,1 of complete intersection Calabi-Yau (CICY) 3-folds using machine learning (ML), considering both the old and new datasets built respectively by Candelas-Dale-Lutken-Schimmrigk / Green-Hübsch-Lutken and by Anderson-Gao-Gray-Lee. In real world applications, implementing a ML system rarely reduces to feed the brute data to the algorithm. Instead, the typical workflow starts with an exploratory data analysis (EDA) which aims at understanding better the input data and finding an optimal representation. It is followed by the design of a validation procedure and a baseline model. Finally, several ML models are compared and combined, often involving neural networks with a topology more complicated than the sequential models typically used in physics. By following this procedure, we improve the accuracy of ML computations for Hodge numbers with respect to the existing literature. First, we obtain 97 inspired by the Inception model for the old dataset, using only 30 of the data for training. For the new one, a simple linear regression leads to almost 100 h^2,1 is less successful as we manage to reach only 50 datasets, but this is still better than the 16 network (SVM with Gaussian kernel and feature engineering and sequential convolutional network reach at best 36 that neural networks can be valuable to study the properties of geometries appearing in string theory.

READ FULL TEXT
Comments

There are no comments yet.