Machine Learning Aids Imputation of Missing Petrophysical Data in Iraqi Reservoir

This study compares seven imputation techniques for predicting missing core-measured horizontal and vertical permeability and porosity data in two wells drilled in the North Rumaila oil field in southern Iraq.

August 1, 2024

Journal of Petroleum Technology

Fig. 1. The proportion of missing petrophysical properties in Well R1 (clastic reservoir) and Well R2 (carbonate reservoir) is shown in the two histograms on the left. The pattern of the missing data in relation to depth (vertical scale) is displayed on the right. Missing data are displayed in red and measured data in blue. The fraction of data in each combination is shown on the right-side vertical scale. Φ = porosity, k_H = horizontal permeability, k_V = vertical permeability.

The study described in the complete paper compares seven imputation techniques for predicting missing core-measured horizontal and vertical permeability and porosity data in two wells drilled in the North Rumaila oil field of southern Iraq. The results reveal that a data-imputation method consisting of multivariate imputation by chained equations (MICE) combined with classification and regression trees (CART) outperforms the other six methods on both the clastic and the carbonate data sets studied. The novel workflow is suitable for application in both clastic and carbonate reservoir formations.

Introduction

To account for reservoir heterogeneity in 3D reservoir modeling, advanced techniques are required to improve the accuracy and reliability of estimating missing core-analysis data points. The imputation of missing data refers to a range of statistical and machine-learning (ML) methods that supplement core-sample reanalysis or additional core-sample collection to fill data gaps.

This study evaluates seven data-imputation methods involving ML algorithms:

Iterative robust model-based imputation (IRMI)
MICE combined with CART
Random imputation of missing data (RIMD)
Sequential imputation (SEQimpute)
Random forest imputation (RF)
Principal component analysis imputation (PCA)
Multiple imputation of incomplete multivariate data (AMELIA package)

These core-data imputation methods are applied to two wells penetrating the clastic Zubair formation of the North Rumaila oil field.
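
To make the chained-equations idea behind the MICE and CART combination more concrete, the following Python sketch imputes gaps in a small three-column table (porosity, log horizontal permeability, log vertical permeability). It is a simplified illustration and not the implementation used in the paper. The data are synthetic, the function names (fit_tree, predict, chained_impute) are hypothetical, and the tree depth, leaf size, and iteration count are arbitrary assumptions. Full multiple imputation would also generate several imputed data sets with random draws, whereas this sketch returns a single deterministic one based on leaf means.

```python
import random

def sse(values):
    """Sum of squared deviations from the mean (the split criterion)."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values)

def fit_tree(X, y, depth=3, min_leaf=3):
    """Fit a small regression tree; returns nested dicts with mean-valued leaves."""
    if depth == 0 or len(y) < 2 * min_leaf:
        return {"value": sum(y) / len(y)}
    best = None
    for f in range(len(X[0])):
        order = sorted(range(len(y)), key=lambda i: X[i][f])
        for cut in range(min_leaf, len(y) - min_leaf + 1):
            left, right = order[:cut], order[cut:]
            if X[left[-1]][f] == X[right[0]][f]:
                continue  # cannot split between identical predictor values
            cost = sse([y[i] for i in left]) + sse([y[i] for i in right])
            if best is None or cost < best[0]:
                threshold = (X[left[-1]][f] + X[right[0]][f]) / 2
                best = (cost, f, threshold, left, right)
    if best is None:
        return {"value": sum(y) / len(y)}
    _, f, threshold, left, right = best
    return {
        "feature": f,
        "threshold": threshold,
        "left": fit_tree([X[i] for i in left], [y[i] for i in left], depth - 1, min_leaf),
        "right": fit_tree([X[i] for i in right], [y[i] for i in right], depth - 1, min_leaf),
    }

def predict(tree, x):
    """Walk the tree until a leaf is reached and return its mean."""
    while "value" not in tree:
        branch = "left" if x[tree["feature"]] <= tree["threshold"] else "right"
        tree = tree[branch]
    return tree["value"]

def chained_impute(rows, n_iter=5):
    """Impute None entries column by column, cycling n_iter times.

    Assumes every column has at least one observed value.
    """
    n_cols = len(rows[0])
    missing = [[v is None for v in r] for r in rows]
    filled = [list(r) for r in rows]  # copy, so the input is not modified
    # Step 1: start from the observed column means.
    for j in range(n_cols):
        observed = [r[j] for r in rows if r[j] is not None]
        col_mean = sum(observed) / len(observed)
        for r in filled:
            if r[j] is None:
                r[j] = col_mean
    # Step 2: repeatedly regress each column on the others and refresh its gaps.
    for _ in range(n_iter):
        for j in range(n_cols):
            others = [c for c in range(n_cols) if c != j]
            train = [i for i in range(len(rows)) if not missing[i][j]]
            target = [i for i in range(len(rows)) if missing[i][j]]
            if not train or not target:
                continue
            tree = fit_tree([[filled[i][c] for c in others] for i in train],
                            [filled[i][j] for i in train])
            for i in target:
                filled[i][j] = predict(tree, [filled[i][c] for c in others])
    return filled

# Synthetic demonstration data (NOT from the North Rumaila wells).
random.seed(0)
rows = []
for i in range(60):
    phi = random.uniform(0.05, 0.30)
    log_kh = 20.0 * phi - 2.0 + random.gauss(0, 0.3)
    log_kv = log_kh - 0.5 + random.gauss(0, 0.3)
    row = [phi, log_kh, log_kv]
    if i % 4 == 0:
        row[i % 3] = None  # deterministic gaps, five per column
    rows.append(row)

filled = chained_impute(rows)
```

Because each tree is trained only on rows where the target property was measured, every imputed value is a mean of measured values and so stays within the measured range of that property. Permeability is handled in log units here, an assumption of this sketch that reflects the wide range the property typically spans.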
