Estimating oil recovery factor using machine learning: Applications of XGBoost classification

10/28/2022
by Alireza Roustazadeh, et al.

In petroleum engineering, it is essential to determine the ultimate recovery factor (RF), particularly before exploitation and exploration. However, accurately estimating RF requires data that are not necessarily available or measured at the early stages of reservoir development. We therefore applied machine learning (ML), using readily available features, to estimate oil RF for ten classes defined in this study. To construct the ML models, we applied the XGBoost classification algorithm. Classification was chosen because the recovery factor is bounded between 0 and 1, much like a probability. Three databases were merged, leaving us with four different combinations with which to first train and test the ML models and then further evaluate them using an independent database of unseen data. Ten-fold cross-validation was applied to the training datasets to assess the effectiveness of the models. To evaluate the accuracy and reliability of the models, we determined the accuracy, the neighborhood accuracy, and the macro-averaged F1 score. Overall, the results showed that the XGBoost classification algorithm could estimate the RF class with accuracies as high as 0.49 in the training datasets, 0.34 in the testing datasets, and 0.2 in the independent databases used. We found that the reliability of the XGBoost model depended on the data in the training dataset, meaning that the ML models were database dependent. The feature importance analysis and the SHAP approach showed that the most important features were reserves, reservoir area, and reservoir thickness.
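The three evaluation metrics named in the abstract can be illustrated with a minimal, dependency-free sketch. Several details here are assumptions, since the abstract does not spell them out: the ten RF classes are taken to be equal-width bins over [0, 1], "neighborhood accuracy" is taken to mean that the predicted class lies within one class of the true class, and the function names (`rf_to_class`, `neighborhood_accuracy`, `macro_f1`) are hypothetical. The sketch does not train an XGBoost model; it only scores a set of predicted classes against true ones.

```python
from typing import Sequence

N_CLASSES = 10  # ten RF classes per the abstract; equal-width bins are an assumption


def rf_to_class(rf: float, n_classes: int = N_CLASSES) -> int:
    """Map a recovery factor in [0, 1] to a class index 0..n_classes-1."""
    if not 0.0 <= rf <= 1.0:
        raise ValueError("recovery factor must lie in [0, 1]")
    # RF == 1.0 is folded into the last bin instead of opening an extra one.
    return min(int(rf * n_classes), n_classes - 1)


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of samples whose predicted class equals the true class."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def neighborhood_accuracy(
    y_true: Sequence[int], y_pred: Sequence[int], tolerance: int = 1
) -> float:
    """Fraction of predictions within `tolerance` classes of the truth.

    Assumed definition: the abstract names this metric but does not define it.
    """
    hits = sum(abs(t - p) <= tolerance for t, p in zip(y_true, y_pred))
    return hits / len(y_true)


def macro_f1(
    y_true: Sequence[int], y_pred: Sequence[int], n_classes: int = N_CLASSES
) -> float:
    """Unweighted mean of per-class F1 over classes seen in truth or predictions."""
    scores = []
    for c in range(n_classes):
        tp = sum(t == c and p == c for t, p in zip(y_true, y_pred))
        fp = sum(t != c and p == c for t, p in zip(y_true, y_pred))
        fn = sum(t == c and p != c for t, p in zip(y_true, y_pred))
        if tp + fp + fn == 0:
            continue  # class absent from both truth and predictions
        scores.append(2 * tp / (2 * tp + fp + fn))
    return sum(scores) / len(scores)


# Illustrative values only; these are not data from the study.
example_true = [rf_to_class(rf) for rf in (0.05, 0.15, 0.32, 0.48, 0.95)]
example_pred = [0, 2, 3, 6, 9]
```

With these made-up values, one prediction misses by a single class and another by two, so the neighborhood accuracy is higher than the plain accuracy while the macro-averaged F1 is pulled down by every class that is missed or wrongly predicted.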


research
10/22/2022

Estimating oil and gas recovery factors via machine learning: Database-dependent accuracy and reliability

With recent advances in artificial intelligence, machine learning (ML) a...
research
04/01/2022

Oil reservoir recovery factor assessment using Bayesian networks based on advanced approaches to analogues clustering

The work focuses on the modelling and imputation of oil and gas reservoi...
research
12/25/2019

A Study of the Learnability of Relational Properties (Model Counting Meets Machine Learning)

Relational properties, e.g., the connectivity structure of nodes in a di...
research
03/28/2022

Using Machine Learning to generate an open-access cropland map from satellite images time series in the Indian Himalayan Region

Crop maps are crucial for agricultural monitoring and food management an...
research
04/04/2022

Highly efficient reliability analysis of anisotropic heterogeneous slopes: Machine Learning aided Monte Carlo method

Machine Learning (ML) algorithms are increasingly used as surrogate mode...
research
08/27/2022

Information FOMO: The unhealthy fear of missing out on information. A method for removing misleading data for healthier models

Not all data are equal. Misleading or unnecessary data can critically hi...
research
06/09/2022

HDTorch: Accelerating Hyperdimensional Computing with GP-GPUs for Design Space Exploration

HyperDimensional Computing (HDC) as a machine learning paradigm is highl...
