A review of the adjoint-state method for computing the gradient of a functional with geophysical applications
Article first published online: 18 SEP 2006
Geophysical Journal International
Volume 167, Issue 2, pages 495–503, November 2006
How to Cite
Plessix, R.-E. (2006), A review of the adjoint-state method for computing the gradient of a functional with geophysical applications. Geophysical Journal International, 167: 495–503. doi: 10.1111/j.1365-246X.2006.02978.x
- Issue published online: 18 SEP 2006
- Article first published online: 18 SEP 2006
- Accepted 2006 February 28. Received 2006 February 28; in original form 2005 July 4
- adjoint state;
Estimating the model parameters from measured data generally consists of minimizing an error functional. A classic technique to solve a minimization problem is to successively determine the minimum of a series of linearized problems. This formulation requires the Fréchet derivatives (the Jacobian matrix), which can be expensive to compute. If the minimization is viewed as a non-linear optimization problem, only the gradient of the error functional is needed. This gradient can be computed without the Fréchet derivatives. In the 1970s, the adjoint-state method was developed to efficiently compute the gradient. It is now a well-known method in the numerical community for computing the gradient of a functional with respect to the model parameters when this functional depends on those model parameters through state variables, which are solutions of the forward problem. However, this method is less well understood in the geophysical community. The goal of this paper is to review the adjoint-state method. The idea is to define some adjoint-state variables that are solutions of a linear system. The adjoint-state variables are independent of the model parameter perturbations and in a way gather the perturbations with respect to the state variables. The adjoint-state method is efficient because only one extra linear system needs to be solved.
Several applications are presented. When applied to the computation of the derivatives of the ray trajectories, the link with the propagator of the perturbed ray equation is established.