Reservoir simulation faces challenges in computational efficiency and uncertainty management for large-scale assets. This study presents an integrated framework combining the connection element method (CEM) and data space inversion with variable controls (DSIVC) for efficient history matching and optimized forecasting of reservoir performance. CEM reduces the computational cost of numerical simulation while retaining accuracy. DSIVC enables direct production forecasting after history matching without repeated model inversion. The CEM–DSIVC approach is applied to two reservoir cases. CEM efficiently constructs reservoir models honoring complex geology. DSIVC mathematically integrates production data to reduce uncertainty and parameter space. Without repeated forward simulation, optimized forecasts are obtained under different control strategies. Compared to conventional methods, CEM–DSIVC achieves reliable uncertainty quantification and optimized forecasting with significantly improved efficiency. This provides an effective solution to overcome limitations in simulating and managing uncertainty for large-scale reservoirs. The proposed approach leverages the complementary strengths of CEM and DSIVC, synergistically improving reservoir modeling, management, and decision-making. This integrated data-driven framework demonstrates strong potential as an advanced tool for efficient field development planning and optimization.

Reservoir closed-loop optimization is a critical component in field development, with the reservoir system as the primary object.1 It combines numerical reservoir simulation to accurately reproduce the full lifecycle of field exploitation and aids development planning. Conventional numerical optimization generally uses appropriate optimization methods and optimization algorithms to find the optimal production system, but this process requires repeated numerical simulation calculations. Conventional numerical optimization entails low computational efficiency and prolonged duration, preventing efficient calculation.2,3 Therefore, it is imperative to explore reservoir closed-loop optimization control methods from two perspectives history matching and production optimization through novel pathways.4,5

Reservoir numerical simulation is an important tool to reproduce the full lifecycle of reservoir exploitation, forecast reservoir production, and adjust field management strategies.6 Commonly used numerical methods include finite difference,7 finite volume,8 and finite element methods.9,10 To ensure accuracy, these methods often require extensive grid refinement of the computational domain, resulting in poor convergence and difficulties in history matching.11,12 Grid coarsening can improve efficiency but leads to loss of flow paths and reduced accuracy.13 Boundary element14,15 and meshless methods16,17 have seen some reservoir applications. However, boundary elements are often limited to single-phase flow equations and relatively homogeneous reservoirs. Meshless methods, such as extended finite volume method (EFVM),18 can reduce the difficulty of discretizing reservoir domain and form richer inter-node connections to reduce the grid orientation effect.19 However, these methods still require a large number of nodes to achieve sufficiently high accuracy. Zhao et al.20 introduced the connection element method (CEM) combining generalized finite difference theory and data-driven flownet method. This derives flow parameters satisfying local mass conservation in the meshless framework. CEM calculate pressure on the coarse-node system, but transform the calculation of advection-dominated transport equations to one-dimensional (1D) connection element between two neighboring nodes, which can be regarded as a multiscale meshless method and achieve a good balance of computational accuracy and efficiency. This forms richer connectivity and retains more flow structures.

Although the CEM can achieve accurate and efficient dynamic prediction, uncertainty in geological information still poses predictive challenges.20 History matching against observed data is needed to obtain more accurate geological parameters.21 Among optimization algorithms, gradient-based ones converge faster but require adjoint methods to compute gradients.22 However, due to the limited number of reservoir parameters, analytical gradients are difficult to obtain via adjoint methods and the process is extremely complex and incompatible with reservoir simulators.23 Compared to gradient-based methods, gradient-free algorithms construct approximate gradients for computation and are more portable and easier to couple with commercial simulators.24 However, they require extensive simulation runs and are prohibitively expensive for large models. Currently, more popular are ensemble methods like ensemble Kalman filter (EnKF)25,26 and ensemble smoother with multiple data assimilation (ESMDA).27,28 However, they do not avoid direct inversion of geological parameters, but are more efficient than gradient-free methods. However, the high computational cost remains due to the large number of prior models runs required. In recent years, many researchers have applied machine learning to build proxy models approximating time-consuming simulations to improve efficiency.29 However, such models lack physical meanings and may deviate considerably from reality when production changes. Moreover, the intricate training process also impedes field application.30 

Constructing reservoir models consistent with historical production data through fluid flow numerical simulation is often computationally demanding. Existing history matching approaches still have limitations when applied to large-scale complex reservoirs. Sun et al.31,32 proposed a new class of the data assimilation method called data space inversion (DSI). It constructs a data space spanned by simulation results from a large set of initial reservoir models with differing petrophysical properties. A proxy model is established by parameterizing the high-dimensional data space. The proxy model is calibrated by history matching using mathematical optimization without repetitive numerical simulation runs. Unlike machine learning techniques, the proxy honors the physical relationships and weights between the truth case and prior models. DSI computes posterior production forecasts through stochastic maximum-likelihood principles without inverting reservoir models, thus avoiding prohibitive computational costs. For giant hydrocarbon fields, single-flow simulation runs are extremely time-consuming, so iterative history matching via extensive simulation is infeasible. This makes DSI and other direct forecasting techniques attractive. DSI provides reliable uncertainty quantification and forecasting efficiency as demonstrated in channelized and Gaussian permeability fields under oil–water or gas–oil displacement.31 The previously discussed DSI and forecast-centric techniques were premised on the idealized assumption of a “perfect model,” ignoring any mismatches between simulations and actual production data during calibration. Jiang et al.33 used coupled coarse-scale simulations and PCA (Principal Component Analysis)-parameterized error representations to generate accurate posterior forecasts. However, the original DSI formulation lacks the capability for production optimization after history matching.34,35

In the above DSI framework,31 specifying altered well-operating strategies following history matching necessitates recalibrating the entire prior ensemble of reservoir models via extensive flow simulations, incurring tremendous computational costs.36 To address this limitation, Jiang et al.37 developed data space inversion with variable controls (DSIVC) for prediction with user-defined well controls after history matching. Like prior DSI, DSIVC does not generate posterior models. Unlike existing DSI, DSIVC does not require re-simulation when different matched control sets are specified. This is a key advantage of DSIVC over current data space techniques. However, simulating the prior models is hugely time-consuming for DSIVC, especially for large-scale models where running thousands of simulations is impractical. Therefore, the targeted coupling of the CEM and DSIVC proposed in this work is necessary and significant. The CEM is expected to drastically reduce simulation time and enables a fast large-scale data space inversion and reservoir management. This combined approach leverages the advantages of both methods to achieve efficient and reliable uncertainty quantification and optimized field development planning.

In summary, CEM provides fast numerical simulation, while DSIVC integrates production data for reducing uncertainty without repeated inversion. By combining the strengths of both methods, CEM–DSIVC establishes an advanced workflow for efficient history matching and optimized forecasting of large-scale reservoirs. This integrated approach is expected to achieve reliable uncertainty quantification and optimized production planning with low computational requirements.

Traditional grid-based reservoir simulation methods determine inter-grid connections based on grid topology and calculate transmissibility using average grid properties. However, the grid topology defines but also constrains the connections. CEM uses a new approach to discretize the computational domain (i.e., meshless connection system).

The meshless connection system is a network structure as shown in Fig. 1. From a flow perspective, CEM discretizes the domain into interconnected geometrical entities (i.e., connection elements) represented by line segments. The endpoints of connection elements are defined as nodes, while connection elements are interconnected through the nodes. In Fig. 1, a set of connection elements sharing common nodes (red markers) is extracted from the connection system.

FIG. 1.

Construction of 1D connection elements from the flow perspective.20 

FIG. 1.

Construction of 1D connection elements from the flow perspective.20 

Close modal

In CEM, a connection element has two parameters: transmissibility and connection volume. Taking two-phase oil–water flow as an example, the detailed calculation of transmissibility and connection volume of a connection element is explained.

Assuming rock and fluid are incompressible and neglecting capillary force, the governing equation of two-phase flow can be given as

  • Mass conservation equations
    ( ρ i k k r i μ i p i ) + q i = ( ϕ ρ i S i ) t .
    (1)
where subscript i denotes oil and water phases. k r i is oil or water phase relative permeability. k is the reservoir permeability, mD. μ is fluid viscosity, mPa s. ρ i is fluid density, kg/m3. is Hamiltonian operator. ϕ is the reservoir porosity. p is the reservoir pressure, MPa. t is the time, d. q is the source or sink term, d−1. S i is the oil saturation or water saturation.

Combining oil phase and water phase, over the control domain Ω i of node i yields
V i ( k ( k r o μ o + k r w μ w ) p ) d Ω + V i q d Ω = V i C t ϕ p t d Ω ,
(2)
Δ p ( M 0 ) = 2 p x 2 + 2 p y 2 = m = 3 4 j = 1 n e m j i ( p j p i ) ,
(3)
where the subscript i of e m j i denotes the difference coefficient is calculated by taking node i as the central node.
Assuming the partial derivative of pressure with respect to time is the same everywhere in a node's domain of influence, combining Eqs. (2) and (3) leads to the difference scheme
k i j λ i j t V i m = 3 4 j = 1 n e m j i ( p j t + Δ t p i t + Δ t ) + Q i t + Δ t = C t , i t ϕ i V i ( p i t + Δ t p i t ) Δ t ,
(4)
where k i j is the average permeability between node i and node j (mD), which is also defined as the permeability of connection element ( i , j ), that is, k i j = 2 k i k j k i + k j. V i is the control volume of node i, m3. The index n is the number of nodes connected to node i, and it is also the number of connection elements with node i. p i t + Δ t is the average pressure of the control volume of node i at time t + Δ t, MPa. Q i is the source and sink item in the control volume of node i, which is obtained by integrating into the node control volume, d−1. V i m = 3 4 j = 1 n e m j i ( p j t + Δ t −  p i t + Δ t ) is only the approximation of the diffusion term V i Δ p t + Δ t d Ω. Specifically, λ i j t is the total mobility (mD/mPa s) at time step t in the case of two-phase flow, which is calculated by the upstream weight scheme. For connection element (i, j),
λ i j t = { k r w ( S w , j t ) μ w + k r o ( S w , j t ) μ o , p j t > p i t , k r w ( S w , i t ) μ w + k r o ( S w , i t ) μ o , p i t > p j t .
(5)

1. The calculation of connection transmissibility

Considering the local mass conversation, Eq. (6) holds
{ V i m = 3 4 e m j i = V j m = 3 4 e m i j , i V i = V R .
(6)
Then, the node control volume V i is calculated by using Eq. (6). The transmissibility T i j of the connection element ( i , j ) is defined as follows:
T i j = λ i j k i j V i m = 3 4 e m j i = λ i j k i j V j m = 3 4 e m i j = T j i ,
(7)
where e m , j i is the coefficient estimated by the Laplace operator, 1/m2; the superscript i or j represents the central node of the influence domain, the subscript m corresponds to the partial derivative of the pressure function, and the subscript i or j represents the ordinal numbers of other nodes in the influence domain.

2. The calculation of connection volume

It is widely known that the transmissibility in traditional grid-based reservoir numerical simulation (FVM) is calculated by
T i j = λ i j k i j A i j L i j = λ i j k i j A i j d i j L i j 2 = λ i j k i j V i j L i j 2 ,
(8)
where A i j is the cross-sectional area of two adjacent grids, m2. L i j is the distance between the centers of the two grids, m. V i j is the volume of the connection between grid i and grid j, m3. So, there is a proportional relationship
T i j L i j 2 λ i j k i j V i j .
(9)
Similarly, the connection transmissibility and the connection volume in the CEM should also satisfy the proportional relationship
T i j L i j 2 λ i j k i j V i j .
(10)
For the convenience of symbolic representation, the G i j is denoted as
G i j = T i j λ i j k i j = V i m = 3 4 e m , j + 1 i = V j m = 3 4 e m , i + 1 j .
(11)
The sum of the volumes of all connection elements should be equal to the total volume of the reservoir
i = 1 N 1 i < j N V i j = V R .
(12)
Thus, by combining Eqs. (10)–(12), we can obtain the expression of the connection volume as follows:
V i j = G i j L i j 2 V R i = 1 N 1 i < j N G i j L i j 2 .
(13)

Equation (13) has been proven good initial approximation for the connection volumes of connection elements, which may be updated in history-matching procedure.

Figure 2 shows an example of how the CEM model characterizes a rectangular oil reservoir. Figure 2(a) is a schematic diagram of a five-spot four-production rectangular oil reservoir. Figure 2(b) is a schematic diagram of node distribution, and Fig. 2(c) is a connection diagram of the CEM model. The CEM separates the calculation of the pressure equation and saturation equation into two steps. First, it calculates the pressure diffusion equation based on the nodes in the meshless system and the seepage characteristics parameters between the nodes, so as to obtain the average pressure of the control volumes of each node. Then, according to the average pressure of the control volumes of each node, the upstream flux of each connection element is estimated, so that the Buckley–Leverett saturation convection equation can be efficiently and accurately calculated in parallel on the connection element.20 More details on the solution process and parameter settings are provided in the  Appendix.

FIG. 2.

Schematic diagram of the characterization of a rectangular oil reservoir by the CEM.20 (a) Rectangular reservoir, (b) node distributions, and (c) CEM model.

FIG. 2.

Schematic diagram of the characterization of a rectangular oil reservoir by the CEM.20 (a) Rectangular reservoir, (b) node distributions, and (c) CEM model.

Close modal

1. The data-space inversion with varying controls formulation

The DSIVC program is designed to enable production forecasts for user-specified well control settings in the post-history-match period. In DSIVC, well control is incorporated as a variable within the data space, and we consider the prescribed well settings as additional (specified) data to be matched during the data assimilation process.

DSIVC designates u to be the vector of post-history-match well control variables, which could include rates or bottom-hole pressures at different stages for all of the wells in the forecasting period
u = [ u 1 1 , u 2 1 , , u N stage 1 , , u 1 N w , u 2 N w , , u N stage N w ] T ,
(14)
where Nw is the number of wells and Nstage is the number of control stages during the prediction or forecasting period. We set Nu = Nw × Nstage, which means u R N u × 1.
Let m represent the vector of prior model parameters (in this paper, they are connection conductivities and connection volumes), and g represent the CEM simulation procedure. The forward simulation results are generated as follows:
d i = g ( m i , u hm , u i ) , i = 1 , 2 , , N r ,
(15)
where Nr represents the number of prior realizations that are simulated, di includes the injection and production rates at different time steps for all of the wells, uhm is the well control setting for the history-matching period, and ui is the post-history-match well controls for mi.
In DSIVC, all of the data variables are combined into a column vector dfull to predict future injection and production quantities under user-specified controls after the history-matching period. This gives
d full = [ u i T , d i T ] T = [ u i T , d hm T , d pred T ] T ,
(16)
where d full R N f × 1, N f = N u + N h m + N pred.
In DSIVC, in addition to specifying the observed data we wish to match, we also specify the post-history-match well controls. Here, we combine these two quantities in specified data vector dspec, defined as
d spec = [ u spec T , d obs T ] T ,
(17)
where u spec R N u × 1 is the user-specified set of well controls, and d obs R N h m × 1 is the data variables observed (measured) during the history-matching period.
d obs is equal to d hm, and the relationship between d full and d obs can be represented as
d obs = H d full + ε ,
(18)
where ε R N hm × 1 is the vector of measurement errors of the observed data, which is assumed to be Gaussian random variables with mean 0 and covariance matrix C D, C D R N hm × N hm. In addition, H R N hm × N f is applied to extract the historical production data from d full.
In DSIVC, under Bayes' law, the conditional probability density function (PDF) of d full given d spec = [ u spec T , d obs T ] T is
p ( d full | d spec ) = p ( d full | d spec ) p ( d full ) p ( d spec ) = p ( ε ̂ = d spec H ̂ d full ) α exp ( 1 2 ( H ̂ d full d spec ) T C ̂ D 1 ( H ̂ d full d spec ) ) ,
(19)
where p ( d full ) is the prior PDF of d full, and p ( d full | d spec ) is the conditional PDF of the specified data vector d spec given d full. The covariance matrix C ̂ D is a diagonal matrix with diagonal elements representing the variance of the associated variable in ε ̂. In this framework, if the well controls u spec is specified, the variance associated with u spec would be zero, and this would render C ̂ D not invertible. Su et al. change the diagonal elements C ̂ D corresponding to u spec and set these elements to 1.

2. Reparameterization methods

In the original DSI, Sun et al.31 applied the PCA method to parameterize the data space for avoiding the calculation of C d 1. In this case, let Φ PCA R N f × N l denote the transformed basis matrix, and for any production data d full PCA, it can be expressed as
d full PCA = Φ PCA ξ + d prior ,
(20)
where ξ R N s × 1 can be regarded as the proxy of the production data, whose order is much lower than that of geological parameters.
In practice, some wells have water breakthroughs or are often shut down intermittently using various measures, and the water content is close to 0 or 0. When the principal component analysis method is applied, it may lead to nonphysical values (below 0) in the inversion calculation of the water content. To maximize the ability to apply principal component analysis methods to preserve the physical characteristics of the production data variables, Sun et al. proposed to apply a histogram transformation method to transform the dPCA of a Gaussian distribution (mean dprior, variance Φ Φ T). For the jth row in the dPCA matrix, i.e., the production data of the reservoir model at the specified time point,
d PCA , H = f T 1 f I ( d PCA j ) , j = 1 , , N d ,
(21)
where d PCA , H indicates histogram-transformed data; fI denotes the initial cumulative distribution function of the jth row; and fT denotes the cumulative distribution function of the target of the jth row. Therefore, the converted production data using the histogram can be expressed as
d full = h ( d PCA ) = h ( Φ ξ + d prior ) ,
(22)
where h stands for the histogram transformation process function and ξ are the reduced-space variables. Because the prior distribution of ξ is standard normal, the probability p ( ξ ) exp ( 1 2 ξ T ξ ). The posterior distribution of ξ given dspec can thus be written as
p ( ξ | d spec ) p ( d spec | ξ ) p ( ξ ) exp ( 1 2 ( H ̂ f ( ξ ) d spec ) T C ̂ D 1 ( H ̂ f ( ξ ) d spec ) 1 2 ξ T ξ ) .
(23)

In this paper, simultaneous perturbation stochastic approximation (SPSA) algorithm is adopted for solving this problem. Because the gradient of SPSA deviates slightly from the real gradient, the posterior forecast derived by SPSA can also approximately quantify uncertainty. In addition, the application of SPSA can avoid over fitting phenomenon. This paper will introduce the reason about algorithm selection in Sec. II C.

For complex and uncertain petroleum reservoir dynamics, real-time optimized control based on current oil–water well production is crucial. This study performs history matching with the stochastic CEM–DSIVC approach to obtain a representative model, whose inversion results effectively characterize uncertainties. Combining multi-model optimization, well rates are optimized to pursue optimal performance while reducing sensitivity-induced variability, ensuring reliability.

Before determining J(u), it is essential to carry out DSIVC. We rely on the same collection of prior simulations, which span Ncycle phases of variable controls during the forecast duration. In this closed-loop procedure, this does not undertake any further flow simulations. For every cycle, denoted as k, the well controls are set accordingly.

At every cycle k, our aim in optimization is to either maximize or minimize a particular value J throughout the production duration. This optimization challenge (maximization) can be articulated as
u k * = arg max u spec U J ( u ) ,
(24)
where U N u × 1 defines the allowable range for u, and N u = N w × ( N cycle k + 1 ) for a particular cycle k.
Here, we take J to be the expected net present value (NPV), i.e.,
J ( u ) = 1 N post i = 1 N post NPV i = 1 N post i = 1 N post [ n = 1 N Δ t n ( j = 1 N prod ( P o Q j o P w Q j w ) k = 1 N inj P inj Q k inj ) ( 1 + r ) t n 365 ] i ,
(25)
where N represents the number of simulation time steps, while tn denotes the cumulative time up to the end of time step n in days. The time step size is given by Δ t n in days. The number of production and injection wells is represented by Nprod and Ninj, respectively. Po is the oil price in USD/STB, and the costs for water production and injection are given by Pw and Pinj in USD/STB, respectively. The oil and water production rates of the jth production well at the nth time step are Q j o and Q j w, both in STB/D. Q k inj indicates the water injection rate of the kth injection well at time step n. Finally, r is the annual discount rate.

From the posterior production and injection forecasts, we calculate NPV for each posterior sample with Eq. (25) and then compute the expected NPV, J(u). To achieve this, the authors adopt the SPSA algorithm, whose solution procedure is detailed in the  Appendix.

In summary of the descriptions in Secs. II A–II C, the proposed CEM–DSIVC workflow for efficient history matching and production optimization is summarized as the following steps:

  1. Generate Nr initial reservoir connection models based on geological data and initial well controls uo. Compute initial production dynamics using CEM to generate dfull.

  2. Operate the field for historical period and first k − 1 stages in the forecasting period (only historical period for k = 1). Provide observed data dobs.

  3. Construct DSIVC model as described in Sec. II. Set initial guess of control variables u0 for the following Nstagek+1 stages and SPSA parameters.

  4. At iteration l, we re-simulate initial models with CEM using each control set uspec and d spec = [ u spec T , d obs T ] T. Apply Eq. (17) to compute posterior forecast and objective function J ( u l ).

  5. if J ( u l ) > J ( u l 1 ), set u l = u l 1, and go to step 7. Else, reduce step size and return to step 4.

  6. End optimization when meeting convergence criteria:
    | J ( u l ) J ( u l 1 ) | / J ( u l ) 1.5 × 10 3 .
    (26)

In this paper, RMSE is utilized to evaluate the error between the predicted value and the real data, which is calculated by the following formula:
RMSE = 1 N i = 1 N y i y ̂ i 2 2 ,
(27)
where N represents the number of evaluation samples, and y i and y ̂ i represent the reference and predicted data, respectively.

Through solving the history-matching mathematical model, CEM–DSIVC obtains the transmissibility and connection volume for each connection element. This reduces the DSIVC matching parameter dimensionality. Seen from the workflow, theoretically, CEM in the workflow can reduce both computational cost of forward simulations and the number of degree of freedoms for history matching and optimization. DSIVC is an efficient data assimilation method that can integrate production data to reduce reservoir uncertainty. By combining DSIVC with the fast CEM reservoir simulation, CEM–DSIVC is an advanced novel closed-loop optimization workflow that can perform reservoir management under uncertainty with high computational efficiency. The CEM–DSIVC method demonstrates high accuracy in history matching and uncertainty quantification, while requiring much lower computational cost compared to conventional methods. Therefore, CEM–DSIVC provides an effective and practical solution for closed-loop reservoir optimization and uncertainty management.

A two-dimensional heterogeneous reservoir example was established to verify the method. The permeability distribution of the model is shown in Fig. 3. Based on sequential Gaussian simulation, 200 initial CEM models were generated to reflect the uncertainty of the reservoir. Figure 4 shows the transmissibility and connected pore volume of one of the CEM models. The reservoir contains four water injection wells and nine production wells. All parameters of the reservoir models are the same except for transmissibility and connected pore volume. The total production time of the reservoir is 2500 days. All injection and production wells are controlled by constant bottom-hole pressure. The first 1500 days of observed data are used for history matching, and the last 1000 days of data are used for production forecasting.

FIG. 3.

Log-permeability of reference ECLIPSE reservoir model.

FIG. 3.

Log-permeability of reference ECLIPSE reservoir model.

Close modal
FIG. 4.

Connection transmissibility and connection volume of the connection element reference model. (a) Connection transmissibility and (b) connection volume.

FIG. 4.

Connection transmissibility and connection volume of the connection element reference model. (a) Connection transmissibility and (b) connection volume.

Close modal

As mentioned above, DSIVC is a type of the data assimilation method. To further verify the effectiveness of the CEM–DSIVC method proposed in this paper, the Eclipse simulator and the CEM method are compared in the simulation section, and the mainstream multi-model data assimilation methods ES-MDA and DSIVC are compared in the history matching and optimization methods. Figures 5 and 6 show the history matching and forecasting results of oil production rate and water cut by CEM–DSIVC, CEM–ESMDA, and ECLIPSE–DSIVC methods using the initial reservoir models and the truth case model. It refers to a combination of the CEM method and ESMDA, similar to the CEM–DSI method. ECLIPSE–DSIVC refers to the combination of eclipse and DSIVC methods, replacing the CEM method with ECLIPSE. The black dashed line separates the history stage and the forecast stage. The red dashed line represents the truth case model data, and the red dots represent observed data. The observation error is taken as 10% of the true model simulated values. The blue dashed line represents the average value calculated by the posterior model data. The inverted production forecast values can effectively match the production forecast of the truth case model.

FIG. 5.

History-matching results of oil production rate for wells 1, 5, and 9. (a) CEM–DSIVC, (b) CEM–ESMDA, and (c) ECLIPSE–DSIVC.

FIG. 5.

History-matching results of oil production rate for wells 1, 5, and 9. (a) CEM–DSIVC, (b) CEM–ESMDA, and (c) ECLIPSE–DSIVC.

Close modal
FIG. 6.

History-matching results of water production rate for wells 1, 5, and 9. (a) CEM–DSIVC, (b) CEM–ESMDA, and (c) ECLIPSE–DSIVC.

FIG. 6.

History-matching results of water production rate for wells 1, 5, and 9. (a) CEM–DSIVC, (b) CEM–ESMDA, and (c) ECLIPSE–DSIVC.

Close modal

In this paper, the oil price and water disposal cost are set as r o = 70 USD/STB, r w = 3 USD/STB, and r w i = 2 USD/STB. The annual interest rate b is set to 0.1. For this case study, the operating conditions remain unchanged for the first 1500 days. The last 1000 days are divided into ten steps. The bottom hole pressure (BHP) ranges of the production wells are set to 200–400 Bar, and the bottom hole pressure (BHP) ranges of the injection wells are set to 400–600 Bar for the ten steps. Closed-loop production optimization is performed using CEM–DSIVC, CEM–ESMDA, and ECLIPSE–DSIVC according to the above settings. Figure 7 shows the optimized BHP scheduling of the production and injection wells by CEM–DSIVC. The box plots of cumulative oil production and net present value before and after optimization are shown in Fig. 8, displaying the results of the prior and three optimized methods. It can be clearly seen from the figure that closed-loop optimization significantly increases the cumulative oil production and net present value for all three methods. Compared to the previous models, all three methods significantly reduce uncertainty. The computational accuracy and time consumption comparisons of the three methods are shown in Table I. In this case study, one CEM forward simulation takes about 0.8 s, and one ECLIPSE forward simulation takes about 2.8 s. The computational time of CEM–ESMDA is about 850 s (mainly because 1000 CEM calculations are needed, including one prior model run and four update iterations). For the CEM–DSIVC method, 200 forward simulations of the prior ensemble take about 250 s, and the DSIVC takes about 57 s. The main computational time of ECLIPSE–DSIVC is the prior ensemble runs by ECLIPSE, which takes about 610 s. The results show that the CEM–DSIVC method can accurately predict reservoir production dynamics with relatively low computational cost.

FIG. 7.

Optimized BHP schedule. (a) Production well WBHP and (b) injection well WBHP.

FIG. 7.

Optimized BHP schedule. (a) Production well WBHP and (b) injection well WBHP.

Close modal
FIG. 8.

Comparison of NPV and FOPT before and after optimization. (a) NPV and (b) FOPT.

FIG. 8.

Comparison of NPV and FOPT before and after optimization. (a) NPV and (b) FOPT.

Close modal
TABLE I.

Comparison of the accuracy and computational time of case 1.

CEM–DSIVC CEM–ESMDA ECLIPSE–DSIVC
Average RMSE  0.5321  0.6274  0.5123 
Computational costs  307 s  850 s  610 s 
CEM–DSIVC CEM–ESMDA ECLIPSE–DSIVC
Average RMSE  0.5321  0.6274  0.5123 
Computational costs  307 s  850 s  610 s 

To test the methods, a practical waterflooding development case was selected. The reservoir model is constructed as shown in Fig. 9. There are high uncertainties in this reservoir system. First, the CEM model was obtained based on the geological data of the reservoir. Then, 200 initial data-driven models were generated by Gaussian sequential simulation, one of which CEM models is shown in Figs. 10(a) and 10(b). The total number of production wells in this reservoir is 17, and the total number of injection wells is 11, with a total production time of 4324 days. Here, the first 2010 days were set for history matching, and the following 2314 days were used for production forecast validation. The prior production data of field oil production total (FOPT) and field water cut (FWCT) were calculated as shown in Figs. 11(a) and 11(b). The black dashed line separates the history stage and the forecast stage. The red dashed line represents the truth case model data, and the red dots represent observed data. The observation error was taken as 10% of the true model simulated values. The blue dashed line represents the average value calculated by the posterior model data. It can be seen that there are some deviations between the prior production data and the actual observed production data, but the prior data can still cover the actual production data. The posterior production dynamics by the three methods for history matching can all better match the actual observed production data, and the production forecast can effectively capture the future variation laws of the reservoir production (Figs. 11 and 12).

FIG. 9.

Reservoir grid model construction.

FIG. 9.

Reservoir grid model construction.

Close modal
FIG. 10.

CEM model characteristic parameters. (a) Connection transmissibility and (b) connection volume.

FIG. 10.

CEM model characteristic parameters. (a) Connection transmissibility and (b) connection volume.

Close modal
FIG. 11.

History-matching results of FOPT. (a) CEM–DSIVC, (b) CEM–ESMDA, and (c) ECLIPSE–DSIVC.

FIG. 11.

History-matching results of FOPT. (a) CEM–DSIVC, (b) CEM–ESMDA, and (c) ECLIPSE–DSIVC.

Close modal
FIG. 12.

History-matching results of FWCT. (a) CEM–DSIVC, (b) CEM–ESMDA, and (c) ECLIPSE–DSIVC.

FIG. 12.

History-matching results of FWCT. (a) CEM–DSIVC, (b) CEM–ESMDA, and (c) ECLIPSE–DSIVC.

Close modal

The final production scheme of oil and water wells was used as the pre-optimization plan. Optimization methods were then applied to optimize the well operating regimes over the next 600 days. Figure 13 shows box plots of the field cumulative oil production and NPV before and after optimization. The information reflected in the figure indicates that reservoir uncertainty was reduced through optimization. In addition, the optimized field cumulative oil production increased markedly compared to pre-optimization. At the end of closed-loop optimization, the average cumulative oil production rose from 2.51 × 106 m3 before optimization to 2.84 × 106 m3 after optimization. The average NPV increased from 184 million USD before optimization to 202 million USD after optimization. The computational accuracy and time consumption comparisons of case 2 are shown in Table II. In this case, one CEM forward simulation takes about 9.8 s, but one ECLIPSE forward simulation takes about 752 s. The computational time of CEM–ESMDA is about 4960 s (mainly because 500 CEM calculations are needed, including one prior model run and four update iterations). For the CEM–DSIVC method, 100 forward simulations of the prior ensemble take about 980 s, and the DSIVC takes about 61 s. The main computational time of ECLIPSE–DSIVC is the prior ensemble runs by ECLIPSE, which takes about 75 200 s. This computational cost would be prohibitively expensive for larger-scale models or a greater number of prior model evaluations. This highlights the importance of the work presented in this paper. For large oilfields, the methods described here can substantially improve computational efficiency while still ensuring accuracy.

FIG. 13.

Comparison of results before and after optimization. (a) NPV and (b) FOPT.

FIG. 13.

Comparison of results before and after optimization. (a) NPV and (b) FOPT.

Close modal
TABLE II.

Comparison of the accuracy and computational time of case 2.

CEM–DSIVC CEM–ESMDA ECLIPSE–DSIVC
Average RMSE  0.4191  0.4453  0.4398 
Computational costs  1041 s  4960 s  75289 s 
CEM–DSIVC CEM–ESMDA ECLIPSE–DSIVC
Average RMSE  0.4191  0.4453  0.4398 
Computational costs  1041 s  4960 s  75289 s 

Throughout the whole paper, several key conclusions can be obtained below:

  • This paper integrates the connection element method (CEM) and data space inversion with varying controls (DSIVC) for efficient history matching and production optimization in water reservoirs.

  • CEM significantly reduces the computational cost of forward reservoir simulation. DSIVC avoids direct inversion of geological parameters, improving calculation efficiency. The combined CEM–DSIVC approach drastically decreases the computational burden, enabling closed-loop optimization for large-scale reservoirs.

  • The proposed approach is demonstrated on two reservoir cases. For the first heterogeneous waterflooding model, compared to conventional grid-based simulation, CEM–DSIVC achieves comparable history matching accuracy with much less computational time. For the second case based on a giant offshore oilfield, CEM–DSIVC efficiently optimizes the field development plan while ensuring accuracy. The results validate that CEM–DSIVC can perform rapid uncertainty quantification and optimized forecasting for large-scale assets.

  • Future work can focus on extending CEM–DSIVC to other reservoir settings like heavy oil, compositional simulation, etc., to further improve efficiency and accuracy. Integrating economic models into the framework could also enable techno-economic optimization.

This work was supported by the National Natural Science Foundation of China (Nos. 51922007, 52104017, and 52274030).

The authors have no conflicts to disclose.

Wei Liu: Conceptualization (equal); Data curation (equal); Investigation (equal); Methodology (equal); Writing – original draft (equal). Yunfeng Xu: Data curation (equal); Visualization (equal); Writing – review & editing (equal). Xiang Rao: Funding acquisition (equal); Writing – review & editing (equal). Deng Liu: Visualization (equal); Writing – review & editing (equal). Hui Zhao: Conceptualization (equal); Funding acquisition (equal); Methodology (equal); Project administration (equal).

The data that support the findings of this study are available from the corresponding author upon reasonable request.

1. Fully implicit scheme
CEM with the fully implicit scheme (denoted as CEM-FI) is shown in Eqs. (A1) and (A2), in which the nodal pressure and water saturation are calculated simultaneously. A difference between CEM-FI and the traditional grid-based FVM simulator is expressed as the weighted sum of the pore volumes of connection elements ( i , j ) connected to the node i, instead of the control pore volume V p , i t + Δ t. Compared to FVM, CEM-FI solves pressures and saturation using properties of connection elements instead of those of grids
j = 1 n i λ w t + Δ t λ o t + Δ t + λ w t + Δ t T i j t + Δ t ( p j t + Δ t p i t + Δ t ) + Q i , w t + Δ t = j = 1 n i ζ i j V p , i j t + Δ t ( S w i t + Δ t S w i t ) Δ t ,
(A1)
j = 1 n i λ o t + Δ t λ o t + Δ t + λ w t + Δ t T i j t + Δ t ( p j t + Δ t p i t + Δ t ) + Q i , o t + Δ t = j = 1 n i ζ i j V p , i j t + Δ t ( S o i t + Δ t S o i t ) Δ t .
(A2)

CEM-FI still suffers from the problem that the number of required nodes is too large and the computational cost for actual large-scale reservoirs is high. This makes it difficult to perform the history matching and effectively predict the production dynamics. Therefore, in Subsec. 2 of the  Appendix, a sequential coupled scheme is given to avoid this problem.

2. Sequential coupled scheme
By using Eq. (A3) that is the discrete scheme of two-phase flow equation, the scheme of implicit pressure and explicit saturation (IMPES) is rewritten as
j = 1 n i T i j t ( p j t + Δ t p i t + Δ t ) + Q i t + Δ t = C t , i t + Δ t j = 1 n i ζ i j V p , i j t + Δ t ( p i t + Δ t p i t ) Δ t ,
(A3)
where the mobility λ i j t is the mobility corresponding to the saturation of the upstream node of the connection element at time t. Then, the above linear equations in Eq. (A3) are solved to obtain node pressure at the time t + Δ t.
After the pressure in each node control area is calculated, the transport equation on the 1D connection element (i, j) is as follows:
( k k r w μ w p ) = ( ϕ S w ) t .
(A4)
The total oil–water flux on the connection element in a small-time step can be expressed as follows:
q i j = k V i j L i j ( k r w μ w + k r o μ o ) p .
(A5)
The transport equation on the connection element in Eq. (A4) can be simplified to the pure convective transport equation in Eq. (A6). Assuming rock and fluid are incompressible and neglecting capillary force, the classical Buckley–Leverett equation
S w t + L i j q i j V p , i j f w ( S w ) x = 0 ,
(A6)
where S w is the water saturation, q i j is the upstream flux of the connection element (i, j), V p , i j = V i j ϕ i j is the pore volume, L i j is the length of the connection element (i, j), and f w ( S w ) is water cut
f w ( S w ) = k r w ( S w ) / μ w k r w ( S w ) / μ w + k r o ( S w ) / μ o .
(A7)
The upstream flux of the connection element is approximated by using the pressure difference and the transmissibility of the connection element at the previous time step. Assuming that node i is upstream of the node j, the total oil–water flux in the time step on the connection element ( i , j ) is calculated as
q i j t + Δ t = T i j t ( p i t + Δ t p j t + Δ t ) .
(A8)

The saturation distribution along the 1D connection element can be calculated from Eq. (A8) when the upstream flux and the connection volume are known.

3. SPSA algorithm
The stochastic gradient of the target function O ( ξ ) for kth iteration is given by
g ̂ k ( ξ k ) = O ( ξ k + c k Δ k ) O ( ξ k c k Δ k ) 2 c k Δ k 1 ,
(A9)
where Δ k R N r × 1 is the random column vector that satisfies the + ¯ 1 Bernoulli distribution; c k is a positive coefficient to control the size of perturbation with the fixed value of 0.1. The vector ξ can be updated using steepest descent
ξ k + 1 = ξ k + α k + 1 g ̂ ¯ k ( ξ k ) ,
(A10)
where αk+1 is the iteration step size with the fixed value of 0.5; the average gradient g ̂ ¯ k ( ξ k ) is defined by
g ̂ ¯ k ( ξ k ) = 1 n i = 1 n g ̂ k ( ξ k ) ,
(A11)
where n is the number of perturbations at each iteration with the fixed value of 5. The iteration step is set to be 50 in this paper. In addition, if the target function value does not decrease for more than 15 iteration steps, the iteration procedure will stop.
1.
J.-D.
Jansen
,
S. D.
Douma
,
D. R.
Brouwer
,
P. M. J.
Van den Hof
,
O. H.
Bosgra
, and
A. W.
Heemink
, “
Closed-loop reservoir management
,” paper presented at the
SPE Reservoir Simulation Symposium
,
2009
.
2.
J.
Hou
,
K.
Zhou
,
X.-S.
Zhang
,
X.-D.
Kang
, and
H.
Xie
, “
A review of closed-loop reservoir management
,”
Pet. Sci.
12
,
114
128
(
2015
).
3.
Y.
Chen
,
D. S.
Oliver
, and
D.
Zhang
, “
Efficient ensemble-based closed-loop production optimization
,”
SPE J.
14
,
634
645
(
2009
).
4.
V.
Bukshtynov
,
O.
Volkov
,
L. J.
Durlofsky
, and
K.
Aziz
, “
Comprehensive framework for gradient-based optimization in closed-loop reservoir management
,”
Comput. Geosci.
19
,
877
897
(
2015
).
5.
N.
Wang
,
H.
Chang
,
X.-Z.
Kong
, and
D.
Zhang
, “
Deep learning based closed-loop well control optimization of geothermal reservoir with uncertain permeability
,”
Renewable Energy
211
,
379
394
(
2023
).
6.
D. S.
Oliver
and
Y.
Chen
, “
Recent progress on reservoir history matching: A review
,”
Comput. Geosci.
15
,
185
221
(
2011
).
7.
S. H.
Lee
,
C. L.
Jensen
, and
M. F.
Lough
, “
Efficient finite-difference model for flow in a reservoir with multiple length-scale fractures
,”
SPE J.
5
,
268
275
(
2000
).
8.
F.
Marcondes
and
K.
Sepehrnoori
, “
An element-based finite-volume method approach for heterogeneous and anisotropic compositional reservoir simulation
,”
J. Pet. Sci. Eng.
73
,
99
106
(
2010
).
9.
L.-K.
Fung
,
A. D.
Hiebert
, and
L. X.
Nghiem
, “
Reservoir simulation with a control-volume finite-element method
,”
SPE Reservoir Eng.
7
,
349
357
(
1992
).
10.
J.
Al Kubaisy
,
P.
Salinas
, and
M. D.
Jackson
, “
A hybrid pressure approximation in the control volume finite element method for multiphase flow and transport in heterogeneous porous media
,”
J. Comput. Phys.
475
,
111839
(
2023
).
11.
D.
Ren
,
X.
Wang
,
Z.
Kou
,
S.
Wang
,
H.
Wang
,
X.
Wang
,
Y.
Tang
,
Z.
Jiao
,
D.
Zhou
, and
R.
Zhang
, “
Feasibility evaluation of CO2 EOR and storage in tight oil reservoirs: A demonstration project in the Ordos Basin
,”
Fuel
331
,
125652
(
2023
).
12.
R.
Li
,
A. C.
Reynolds
, and
D. S.
Oliver
, “
History matching of three-phase flow production data
,”
SPE J.
8
,
328
340
(
2003
).
13.
A. S.
Chen
,
B.
Evans
,
S.
Djordjević
, and
D. A.
Savić
, “
Multi-layered coarse grid modelling in 2D urban flood simulations
,”
J. Hydrol.
470–471
,
1
11
(
2012
).
14.
R.
Pecher
and
J. F.
Stanislav
, “
Boundary element techniques in petroleum reservoir simulation
,”
J. Pet. Sci. Eng.
17
,
353
366
(
1997
).
15.
H.
Chu
,
Z.
Chen
,
X.
Liao
, and
W. J.
Lee
, “
Transient behavior modeling of a multi-well horizontal pad in a reservoir with irregular boundary using boundary element method
,”
J. Pet. Sci. Eng.
209
,
109852
(
2022
).
16.
Y.
Xu
,
G.
Sheng
,
H.
Zhao
,
Y.
Hui
,
Y.
Zhou
,
J.
Ma
,
X.
Rao
,
X.
Zhong
, and
J.
Gong
, “
A new approach for gas-water flow simulation in multi-fractured horizontal wells of shale gas reservoirs
,”
J. Pet. Sci. Eng.
199
,
108292
(
2021
).
17.
X.
Rao
,
W.
Zhan
,
H.
Zhao
,
Y.
Xu
,
D.
Liu
,
W.
Dai
,
R.
Gong
, and
F.
Wang
, “
Application of the least-square meshless method to gas-water flow simulation of complex-shape shale gas reservoirs
,”
Eng. Anal. Boundary Elem.
129
,
39
54
(
2021
).
18.
X.
Rao
,
H.
Zhao
, and
Y.
Liu
, “
A novel meshless method based on the virtual construction of node control domains for porous flow problems
,”
Eng. Comput.
(published online
2023
).
19.
X.
Rao
,
Y.
Liu
, and
H.
Zhao
, “
An upwind generalized finite difference method for meshless solution of two-phase porous flow equations
,”
Eng. Anal. Boundary Elem.
137
,
105
118
(
2022
).
20.
H.
Zhao
,
W.
Zhan
,
Z.
Yuhui
,
T.
Zhang
,
H.
Li
, and
X.
Rao
, “
A connection element method: Both a new computational method and a physical data-driven framework—Take subsurface two-phase flow as an example
,”
Eng. Anal. Boundary Elem.
151
,
473
489
(
2023
).
21.
K.
Zhang
,
H.-Q.
Yu
,
X.-P.
Ma
,
J.-D.
Zhang
,
J.
Wang
,
C.-J.
Yao
,
Y.-F.
Yang
,
H.
Sun
, and
J.
Yao
, “
Multi-source information fused generative adversarial network model and data assimilation based history matching for reservoir with complex geologies
,”
Pet. Sci.
19
,
707
719
(
2022
).
22.
O.
Brenner
,
P.
Piroozmand
, and
P.
Jenny
, “
Efficient assimilation of sparse data into RANS-based turbulent flow simulations using a discrete adjoint method
,”
J. Comput. Phys.
471
,
111667
(
2022
).
23.
E.
Eltahan
,
F. O.
Alpak
, and
K.
Sepehrnoori
, “
A quasi-Newton trust-region method for optimization under uncertainty using stochastic simplex approximate gradients
,”
Comput. Geosci.
27
,
627
648
(
2023
).
24.
J. W. O.
Pinto
,
J. A. R.
Tueros
,
B.
Horowitz
,
S. M. B. A.
da Silva
,
R. B.
Willmersdorf
, and
D. F. B.
de Oliveira
, “
Gradient-free strategies to robust well control optimization
,”
Comput. Geosci.
24
,
1959
1978
(
2020
).
25.
S. I.
Aanonsen
,
G.
Noevdal
,
D. S.
Oliver
,
A. C.
Reynolds
, and
B.
Vallès
, “
The ensemble Kalman filter in reservoir engineering—A review
,”
SPE J.
14
,
393
412
(
2009
).
26.
K.
Thulin
,
G.
Naevdal
,
H. J.
Skaug
, and
S. I.
Aanonsen
, “
Quantifying Monte Carlo uncertainty in the ensemble Kalman filter
,”
SPE J.
16
,
172
182
(
2011
).
27.
A. A.
Emerick
and
A. C.
Reynolds
, “
Ensemble smoother with multiple data assimilation
,”
Comput. Geosci.
55
,
3
15
(
2013
).
28.
G.
Evensen
and
K. S.
Eikrem
, “
Conditioning reservoir models on rate data using ensemble smoothers
,”
Comput. Geosci.
22
,
1251
1270
(
2018
).
29.
S.
Wang
,
N.
Sobecki
,
D.
Ding
,
L.
Zhu
, and
Y.-S.
Wu
, “
Accelerating and stabilizing the vapor-liquid equilibrium (VLE) calculation in compositional simulation of unconventional reservoirs using deep learning based flash calculation
,”
Fuel
253
,
209
219
(
2019
).
30.
Z.
Zhong
,
A. Y.
Sun
,
B.
Ren
, and
Y.
Wang
, “
A deep-learning-based approach for reservoir production forecast under uncertainty
,”
SPE J.
26
,
1314
1340
(
2021
).
31.
W.
Sun
,
M.-H.
Hui
, and
L. J.
Durlofsky
, “
Production forecasting and uncertainty quantification for naturally fractured reservoirs using a new data-space inversion procedure
,”
Comput. Geosci.
21
,
1443
1458
(
2017
).
32.
W.
Sun
and
L. J.
Durlofsky
, “
Data-space approaches for uncertainty quantification of CO2 plume location in geological carbon storage
,”
Adv. Water Resour.
123
,
234
255
(
2019
).
33.
S.
Jiang
and
L. J.
Durlofsky
, “
Treatment of model error in subsurface flow history matching using a data-space method
,”
J. Hydrol.
603
,
127063
(
2021
).
34.
S.
Jiang
and
L. J.
Durlofsky
, “
Data-space inversion using a recurrent autoencoder for time-series parameterization
,”
Comput. Geosci.
25
,
411
432
(
2021
).
35.
W.
Fu
,
K.
Zhang
,
X.
Ma
,
P.
Liu
,
L.
Zhang
,
X.
Yan
,
Y.
Yang
,
H.
Sun
, and
J.
Yao
, “
Deep conditional generative adversarial network combined with data-space inversion for estimation of high-dimensional uncertain geological parameters
,”
Water Resour. Res.
59
,
e2022WR032553
, https://doi.org/10.1029/2022WR032553 (
2023
).
36.
M. M.
Lima
,
A. A.
Emerick
, and
C. E. P.
Ortiz
, “
Data-space inversion with ensemble smoother
,”
Comput. Geosci.
24
,
1179
1200
(
2020
).
37.
S.
Jiang
,
W.
Sun
, and
L. J.
Durlofsky
, “
A data-space inversion procedure for well control optimization and closed-loop reservoir management
,”
Comput. Geosci.
24
,
361
379
(
2020
).