Transportation theory (mathematics)






In mathematics and economics, transportation theory or transport theory is a name given to the study of optimal transportation and allocation of resources. The problem was formalized by the French mathematician Gaspard Monge in 1781.[1]


In the 1920s A.N. Tolstoi was one of the first to study the transportation problem mathematically. In 1930, in the collection Transportation Planning Volume I for the National Commissariat of Transportation of the Soviet Union, he published a paper "Methods of Finding the Minimal Kilometrage in Cargo-transportation in space".[2][3]


Major advances were made in the field during World War II by the Soviet mathematician and economist Leonid Kantorovich.[4] Consequently, the problem as it is stated is sometimes known as the Monge–Kantorovich transportation problem.[5] The linear programming formulation of the transportation problem is also known as the Hitchcock–Koopmans transportation problem.[6]




Contents






  • 1 Motivation


    • 1.1 Mines and factories


    • 1.2 Moving books: the importance of the cost function




  • 2 Abstract formulation of the problem


    • 2.1 Monge and Kantorovich formulations


    • 2.2 Duality formula




  • 3 Solution of the problem


    • 3.1 Optimal transportation on the real line


    • 3.2 Separable Hilbert spaces




  • 4 Applications


  • 5 See also


  • 6 References


  • 7 Further reading





Motivation



Mines and factories


Suppose that we have a collection of n mines mining iron ore, and a collection of n factories which use the iron ore that the mines produce. Suppose for the sake of argument that these mines and factories form two disjoint subsets M and F of the Euclidean plane R2. Suppose also that we have a cost function c : R2 × R2 → [0, ∞), so that c(xy) is the cost of transporting one shipment of iron from x to y. For simplicity, we ignore the time taken to do the transporting. We also assume that each mine can supply only one factory (no splitting of shipments) and that each factory requires precisely one shipment to be in operation (factories cannot work at half- or double-capacity). Having made the above assumptions, a transport plan is a bijection T : MF.
In other words, each mine mM supplies precisely one factory T(m) ∈ F and each factory is supplied by precisely one mine.
We wish to find the optimal transport plan, the plan T whose total cost


c(T):=∑m∈Mc(m,T(m)){displaystyle c(T):=sum _{min M}c(m,T(m))}c(T):=sum _{{min M}}c(m,T(m))

is the least of all possible transport plans from M to F. This motivating special case of the transportation problem is an instance of the assignment problem.
More specifically, it is equivalent to finding a minimum weight matching in a bipartite graph.



Moving books: the importance of the cost function


The following simple example illustrates the importance of the cost function in determining the optimal transport plan. Suppose that we have n books of equal width on a shelf (the real line), arranged in a single contiguous block. We wish to rearrange them into another contiguous block, but shifted one book-width to the right. Two obvious candidates for the optimal transport plan present themselves:



  1. move all n books one book-width to the right ("many small moves");

  2. move the left-most book n book-widths to the right and leave all other books fixed ("one big move").


If the cost function is proportional to Euclidean distance (c(xy) = α|x − y|) then these two candidates are both optimal. If, on the other hand, we choose the strictly convex cost function proportional to the square of Euclidean distance (c(xy) = α|x − y|2), then the "many small moves" option becomes the unique minimizer.



Abstract formulation of the problem



Monge and Kantorovich formulations


The transportation problem as it is stated in modern or more technical literature looks somewhat different because of the development of Riemannian geometry and measure theory. The mines-factories example, simple as it is, is a useful reference point when thinking of the abstract case. In this setting, we allow the possibility that we may not wish to keep all mines and factories open for business, and allow mines to supply more than one factory, and factories to accept iron from more than one mine.


Let X and Y be two separable metric spaces such that any probability measure on X (or Y) is a Radon measure (i.e. they are Radon spaces). Let c : X × Y → [0, ∞] be a Borel-measurable function. Given probability measures μ on X and ν on Y, Monge's formulation of the optimal transportation problem is to find a transport map T : XY that realizes the infimum


inf{∫Xc(x,T(x))dμ(x)|T∗)=ν},{displaystyle inf left{left.int _{X}c(x,T(x)),mathrm {d} mu (x);right|;T_{*}(mu )=nu right},}{displaystyle inf left{left.int _{X}c(x,T(x)),mathrm {d} mu (x);right|;T_{*}(mu )=nu right},}

where T(μ) denotes the push forward of μ by T. A map T that attains this infimum (i.e. makes it a minimum instead of an infimum) is called an "optimal transport map".


Monge's formulation of the optimal transportation problem can be ill-posed, because sometimes there is no T satisfying T(μ) = ν: this happens, for example, when μ is a Dirac measure but ν is not.


We can improve on this by adopting Kantorovich's formulation of the optimal transportation problem, which is to find a probability measure γ on X × Y that attains the infimum


inf{∫Yc(x,y)dγ(x,y)|γΓ)},{displaystyle inf left{left.int _{Xtimes Y}c(x,y),mathrm {d} gamma (x,y)right|gamma in Gamma (mu ,nu )right},}inf left{left.int _{{Xtimes Y}}c(x,y),{mathrm  {d}}gamma (x,y)right|gamma in Gamma (mu ,nu )right},

where Γ(μ, ν) denotes the collection of all probability measures on X × Y with marginals μ on X and ν on Y. It can be shown[7] that a minimizer for this problem always exists when the cost function c is lower semi-continuous and Γ(μ, ν) is a tight collection of measures (which is guaranteed for Radon spaces X and Y). (Compare this formulation with the definition of the Wasserstein metric W1 on the space of probability measures.) A gradient descent formulation for the solution of the Monge-Kantorovich problem was given by Sigurd Angenent, Steven Haker, and Allen Tannenbaum.[8]



Duality formula


The minimum of the Kantorovich problem is equal to


sup(∫(x)dμ(x)+∫(y)dν(y)),{displaystyle sup left(int _{X}varphi (x),mathrm {d} mu (x)+int _{Y}psi (y),mathrm {d} nu (y)right),}sup left(int _{{X}}varphi (x),{mathrm  {d}}mu (x)+int _{{Y}}psi (y),{mathrm  {d}}nu (y)right),

where the supremum runs over all pairs of bounded and continuous functions ϕ:X→R{displaystyle phi :Xrightarrow mathbf {R} }{displaystyle phi :Xrightarrow mathbf {R} } and ψ:Y→R{displaystyle psi :Yrightarrow mathbf {R} }{displaystyle psi :Yrightarrow mathbf {R} } such that


φ(x)+ψ(y)≤c(x,y).{displaystyle varphi (x)+psi (y)leq c(x,y).}varphi (x)+psi (y)leq c(x,y).


Solution of the problem



Optimal transportation on the real line


@media all and (max-width:720px){.mw-parser-output .tmulti>.thumbinner{width:100%!important;max-width:none!important}.mw-parser-output .tmulti .tsingle{float:none!important;max-width:none!important;width:100%!important;text-align:center}}


Optimal transportation matrix

Optimal transportation matrix



Continuous optimal transport

Continuous optimal transport



For 1≤p<∞{displaystyle 1leq p<infty }1 leq p < infty, let Pp(R){displaystyle {mathcal {P}}_{p}(mathbf {R} )}{mathcal  {P}}_{p}({mathbf  {R}}) denote the collection of probability measures on R{displaystyle mathbf {R} }mathbf {R} that have finite p{displaystyle p}p-th moment. Let μPp(R){displaystyle mu ,nu in {mathcal {P}}_{p}(mathbf {R} )}mu ,nu in {mathcal  {P}}_{p}({mathbf  {R}}) and let c(x,y)=h(x−y){displaystyle c(x,y)=h(x-y)}{displaystyle c(x,y)=h(x-y)}, where h:R→[0,∞){displaystyle h:mathbf {R} rightarrow [0,infty )}{displaystyle h:mathbf {R} rightarrow [0,infty )} is a convex function.



  1. If μ{displaystyle mu }mu has no atom, i.e., if the cumulative distribution function =R→[0,1]{displaystyle F_{mu }=mathbf {R} rightarrow [0,1]}{displaystyle F_{mu }=mathbf {R} rightarrow [0,1]} of μ{displaystyle mu }mu is a continuous function, then 1∘:R→R{displaystyle F_{nu }^{-1}circ F_{mu }:mathbf {R} to mathbf {R} }F_{{nu }}^{{-1}}circ F_{{mu }}:{mathbf  {R}}to {mathbf  {R}} is an optimal transport map. It is the unique optimal transport map if h{displaystyle h}h is strictly convex.

  2. We have


minγΓ)∫R2c(x,y)dγ(x,y)=∫01c(Fμ1(s),Fν1(s))ds.{displaystyle min _{gamma in Gamma (mu ,nu )}int _{mathbf {R} ^{2}}c(x,y),mathrm {d} gamma (x,y)=int _{0}^{1}cleft(F_{mu }^{-1}(s),F_{nu }^{-1}(s)right),mathrm {d} s.}min _{{gamma in Gamma (mu ,nu )}}int _{{{mathbf  {R}}^{2}}}c(x,y),{mathrm  {d}}gamma (x,y)=int _{0}^{1}cleft(F_{{mu }}^{{-1}}(s),F_{{nu }}^{{-1}}(s)right),{mathrm  {d}}s.

The proof of this solution appears in.[9]



Separable Hilbert spaces


Let X{displaystyle X}X be a separable Hilbert space. Let Pp(X){displaystyle {mathcal {P}}_{p}(X)}{mathcal  {P}}_{p}(X) denote the collection of probability measures on X{displaystyle X}X such that have finite p{displaystyle p}p-th moment; let Ppr(X){displaystyle {mathcal {P}}_{p}^{r}(X)}{mathcal  {P}}_{p}^{r}(X) denote those elements μPp(X){displaystyle mu in {mathcal {P}}_{p}(X)}mu in {mathcal  {P}}_{p}(X) that are Gaussian regular: if g{displaystyle g}g is any strictly positive Gaussian measure on X{displaystyle X}X and g(N)=0{displaystyle g(N)=0}{displaystyle g(N)=0}, then μ(N)=0{displaystyle mu (N)=0}{displaystyle mu (N)=0} also.


Let μPpr(X){displaystyle mu in {mathcal {P}}_{p}^{r}(X)}mu in {mathcal  {P}}_{p}^{r}(X), νPp(X){displaystyle nu in {mathcal {P}}_{p}(X)}nu in {mathcal  {P}}_{p}(X), c(x,y)=|x−y|p/p{displaystyle c(x,y)=|x-y|^{p}/p}c(x,y)=|x-y|^{p}/p for p∈(1,∞),p−1+q−1=1{displaystyle pin (1,infty ),p^{-1}+q^{-1}=1}{displaystyle pin (1,infty ),p^{-1}+q^{-1}=1}. Then the Kantorovich problem has a unique solution κ{displaystyle kappa }kappa , and this solution is induced by an optimal transport map: i.e., there exists a Borel map r∈Lp(X,μ;X){displaystyle rin L^{p}(X,mu ;X)}{displaystyle rin L^{p}(X,mu ;X)} such that


κ=(idX×r)∗)∈Γ).{displaystyle kappa =(mathrm {id} _{X}times r)_{*}(mu )in Gamma (mu ,nu ).}kappa =({mathrm  {id}}_{{X}}times r)_{{*}}(mu )in Gamma (mu ,nu ).

Moreover, if ν{displaystyle nu }nu has bounded support, then


r(x)=x−|∇ϕ(x)|q−2∇ϕ(x){displaystyle r(x)=x-|nabla phi (x)|^{q-2}nabla phi (x)}r(x)=x-|nabla phi (x)|^{{q-2}}nabla phi (x)

for μ{displaystyle mu }mu -almost all x∈X{displaystyle xin X}xin X for some locally Lipschitz, c-concave and maximal Kantorovich potential ϕ{displaystyle phi }phi . (Here ϕ{displaystyle nabla phi }nabla phi denotes the Gâteaux derivative of ϕ{displaystyle phi }phi .)



Applications


The Monge-Kantorovich optimal transport has found applications in wide range in different fields. Among them are:




  • Image registration and warping [10]

  • Reflector design [11]

  • Retrieving information from shadowgraphy and proton radiography [12]


  • Seismic tomography and Reflection seismology [13]



See also







  • Wasserstein metric

  • Transport function

  • Hungarian algorithm

  • Transportation planning



References





  1. ^ G. Monge. Mémoire sur la théorie des déblais et des remblais. Histoire de l’Académie Royale des Sciences de Paris, avec les Mémoires de Mathématique et de Physique pour la même année, pages 666–704, 1781.


  2. ^ Schrijver, Alexander, Combinatorial Optimization, Berlin ; New York : Springer, 2003. .mw-parser-output cite.citation{font-style:inherit}.mw-parser-output q{quotes:"""""""'""'"}.mw-parser-output code.cs1-code{color:inherit;background:inherit;border:inherit;padding:inherit}.mw-parser-output .cs1-lock-free a{background:url("//upload.wikimedia.org/wikipedia/commons/thumb/6/65/Lock-green.svg/9px-Lock-green.svg.png")no-repeat;background-position:right .1em center}.mw-parser-output .cs1-lock-limited a,.mw-parser-output .cs1-lock-registration a{background:url("//upload.wikimedia.org/wikipedia/commons/thumb/d/d6/Lock-gray-alt-2.svg/9px-Lock-gray-alt-2.svg.png")no-repeat;background-position:right .1em center}.mw-parser-output .cs1-lock-subscription a{background:url("//upload.wikimedia.org/wikipedia/commons/thumb/a/aa/Lock-red-alt-2.svg/9px-Lock-red-alt-2.svg.png")no-repeat;background-position:right .1em center}.mw-parser-output .cs1-subscription,.mw-parser-output .cs1-registration{color:#555}.mw-parser-output .cs1-subscription span,.mw-parser-output .cs1-registration span{border-bottom:1px dotted;cursor:help}.mw-parser-output .cs1-hidden-error{display:none;font-size:100%}.mw-parser-output .cs1-visible-error{font-size:100%}.mw-parser-output .cs1-subscription,.mw-parser-output .cs1-registration,.mw-parser-output .cs1-format{font-size:95%}.mw-parser-output .cs1-kern-left,.mw-parser-output .cs1-kern-wl-left{padding-left:0.2em}.mw-parser-output .cs1-kern-right,.mw-parser-output .cs1-kern-wl-right{padding-right:0.2em}
    ISBN 3540443894. Cf. p.362



  3. ^ Ivor Grattan-Guinness, Ivor, Companion encyclopedia of the history and philosophy of the mathematical sciences, Volume 1, JHU Press, 2003. Cf. p.831


  4. ^ L. Kantorovich. On the translocation of masses. C.R. (Doklady) Acad. Sci. URSS (N.S.), 37:199–201, 1942.


  5. ^ Cédric Villani (2003). Topics in Optimal Transportation. American Mathematical Soc. p. 66. ISBN 978-0-8218-3312-4.


  6. ^ Singiresu S. Rao (2009). Engineering Optimization: Theory and Practice (4th ed.). John Wiley & Sons. p. 221. ISBN 978-0-470-18352-6.


  7. ^ L. Ambrosio, N. Gigli & G. Savaré. Gradient Flows in Metric Spaces and in the Space of Probability Measures. Lectures in Mathematics ETH Zürich, Birkhäuser Verlag, Basel. (2005)


  8. ^ Angenent, S.; Haker, S.; Tannenbaum, A. (2003). "Minimizing flows for the Monge–Kantorovich problem". SIAM J. Math. Analysis. 35 (1): 61–97. doi:10.1137/S0036141002410927.


  9. ^ Rachev, Svetlozar T., and Ludger Rüschendorf. Mass Transportation Problems: Volume I: Theory. Vol. 1. Springer, 1998.


  10. ^ Haker, Steven; Zhu, Lei; Tannenbaum, Allen; Angenent, Sigurd (2004-12-01). "Optimal Mass Transport for Registration and Warping". International Journal of Computer Vision. 60 (3): 225–240. doi:10.1023/B:VISI.0000036836.66311.97. ISSN 0920-5691.


  11. ^ Glimm, T.; Oliker, V. (2003-09-01). "Optical Design of Single Reflector Systems and the Monge–Kantorovich Mass Transfer Problem". Journal of Mathematical Sciences. 117 (3): 4096–4108. doi:10.1023/A:1024856201493. ISSN 1072-3374.


  12. ^ Kasim, Muhammad Firmansyah; Ceurvorst, Luke; Ratan, Naren; Sadler, James; Chen, Nicholas; Sävert, Alexander; Trines, Raoul; Bingham, Robert; Burrows, Philip N. (2017-02-16). "Quantitative shadowgraphy and proton radiography for large intensity modulations". Physical Review E. 95 (2): 023306. arXiv:1607.04179. Bibcode:2017PhRvE..95b3306K. doi:10.1103/PhysRevE.95.023306.


  13. ^ Metivier, Ludovic (24 February 2016). "Measuring the misfit between seismograms using an optimal transport distance: application to full waveform inversion". Geophysical Journal International. 205 (1): 345–377. doi:10.1093/gji/ggw014.




Further reading



  • Brualdi, Richard A. (2006). Combinatorial matrix classes. Encyclopedia of Mathematics and Its Applications. 108. Cambridge: Cambridge University Press. ISBN 0-521-86565-4. Zbl 1106.05001.



Popular posts from this blog

Steve Gadd

Подольск

Лира (музыкальный инструмент)