# American Institute of Mathematical Sciences

June  2020, 2(2): 101-121. doi: 10.3934/fods.2020007

## Topological reconstruction of sub-cellular motion with Ensemble Kalman velocimetry

 1 Mathematics Dept., University of Tennessee, Knoxville, TN 37996, USA 2 Center for Biological Physics, Arizona State University, Tempe, AZ 85281, USA

* Corresponding author: vmaroula@utk.edu

Microscopy imaging of plant cells allows the elaborate analysis of sub-cellular motions of organelles. The large video data set can be efficiently analyzed by automated algorithms. We develop a novel, data-oriented algorithm, which can track organelle movements and reconstruct their trajectories on stacks of image data. Our method proceeds with three steps: (ⅰ) identification, (ⅱ) localization, and (ⅲ) linking. This method combines topological data analysis and Ensemble Kalman Filtering, and does not assume a specific motion model. Application of this method on simulated data sets shows an agreement with ground truth. We also successfully test our method on real microscopy data.

Citation: Le Yin, Ioannis Sgouralis, Vasileios Maroulas. Topological reconstruction of sub-cellular motion with Ensemble Kalman velocimetry. Foundations of Data Science, 2020, 2 (2) : 101-121. doi: 10.3934/fods.2020007
##### References:

show all references

##### References:
The motion of organelles, during an experiment starting at $t_1 = 0$ ending at $t_N = T$, is identified at discrete times $t_n$ (dots). For simplicity, space is represented with one dimension, although real datasets are two dimensional. The black dots represent the locations of organelles at different time levels. $\tilde {\mathcal{R}}$ is the set contains the locations of all black dots
Here, $\bar x^j$ is the position of an organelle producing the images shown (gray), $\bar f_{n, +}$ and $\bar f_{n, -}$ illustrate 1-level forward displacement and backward displacement of $\bar x^j$, respectively. For clarity, the image produced by the organelle are shown as multi-peaked and space as 1D
The relations of forward fields and backward fields are indicated here. (a) shows the approach depiction of forward displacement fields, (b) shows the approach depiction of backward displacement fields. For clarity, time marches forward in (a) and backward in (b)
(a) shows $\tilde {\mathcal{R}}$ as black dots and ${\mathcal{R}}$ as gray lines; (b) shows ${\mathcal{R}}$, ${\mathcal{T}}_n$ in Eq. (12) and $P_ {\mathcal{R}}^{-1}( {\mathcal{T}}_n)$ as blue segments; (c) shows ${\mathcal{R}}$, ${\mathcal{T}}_n$, $P_ {\mathcal{R}}^{-1}( {\mathcal{T}}_n)$, $\tilde R$ and reconstructed discrete trajectories. For visualization purpose, space is shown in 1D
Case I: The frame size is 320 by 320 pixels. Trajectories of 20 organelles are in red spanning from time $t = 0\;$s to $t = 99\;$s. Their motion is described by a diffusion process containing both a diffusion and a drift term. The starting distance of any two adjacent organelles at $t = 0\;$s is 10 pixels
Case I: Four histograms of mean error of each frame. Each one compares estimated forward and backward displacement with ground truth along $x$-axis and $y$-axis, respectively
Case I: Linking result of all trajectories in red. The accuracy rate is 100%
Case I: Positions of organelles over time after adding perturbation $U(-\epsilon, \epsilon)$ when $\epsilon = 0, \ 1, \ 1.5, \ 2, \ 2.5, \ 3, \ 3.5, \ 4\;$pixels, respectively. If $\epsilon$ increases, it is more difficult to detect trajectories, especially, when $\epsilon = 4\;$ pixels, there are no clear patterns for all trajectories to be reconstructed
Case II: The left shows the rough detection result, the right show the locations after correction. The red dots in the left penal and blue pentagons in the right penal are the original locations before Bayesian identification. The red pentagons in the right panel are the fitted location after Bayesian identification
Case II: Estimated displacement fields of 17th frame using EnKF. Panel (a) shows the estimated displacement fields for the entire focal plane. Panel (b) shows the enlarged area of $[140,230]\times[260,350]$ in Penal (a)
Case II: Trajectories reconstruction result
Case II: Four specific sets of trajectory reconstructions vs ground truth. Each panel shows reconstructions versus one true trajectory. The upper left is amplified from the area $[290,380]\times[40,130]$ in Fig. 11; the upper right is amplified from the area $[40,190]\times[190,340]$ in Fig. 11; the bottom left is amplified from the area $[80,210]\times[200,330]$ in Fig. 11; the bottom right is amplified from the area $[230,380]\times[200,350]$ in Fig. 11;
Case III: Panel (a) is the first frame of the video. Panel (b) exhibits all estimated trajectories in red. Panel (c) further shows each estimated trajectory in different colors
Case I: Table of detection result
 $\epsilon$ (in pixels) total $> 10\; s$ $=100\%$ $\geq 90\%$ $\geq 50\%$ $\epsilon=1$ 20 20 20 20 20 $\epsilon=1.5$ 20 20 20 20 20 $\epsilon=2$ 20 20 20 20 20 $\epsilon=2.5$ 24 24 14 17 20 $\epsilon=3$ 32 27 9 14 17 $\epsilon=3.5$ 33 30 5 10 15 $\epsilon=4$ 53 40 1 4 14
 $\epsilon$ (in pixels) total $> 10\; s$ $=100\%$ $\geq 90\%$ $\geq 50\%$ $\epsilon=1$ 20 20 20 20 20 $\epsilon=1.5$ 20 20 20 20 20 $\epsilon=2$ 20 20 20 20 20 $\epsilon=2.5$ 24 24 14 17 20 $\epsilon=3$ 32 27 9 14 17 $\epsilon=3.5$ 33 30 5 10 15 $\epsilon=4$ 53 40 1 4 14
 [1] Zhiyan Ding, Qin Li, Jianfeng Lu. Ensemble Kalman Inversion for nonlinear problems: Weights, consistency, and variance bounds. Foundations of Data Science, 2020  doi: 10.3934/fods.2020018 [2] Håkon Hoel, Gaukhar Shaimerdenova, Raúl Tempone. Multilevel Ensemble Kalman Filtering based on a sample average of independent EnKF estimators. Foundations of Data Science, 2020, 2 (4) : 351-390. doi: 10.3934/fods.2020017 [3] Geir Evensen, Javier Amezcua, Marc Bocquet, Alberto Carrassi, Alban Farchi, Alison Fowler, Pieter L. Houtekamer, Christopher K. Jones, Rafael J. de Moraes, Manuel Pulido, Christian Sampson, Femke C. Vossepoel. An international initiative of predicting the SARS-CoV-2 pandemic using ensemble data assimilation. Foundations of Data Science, 2020  doi: 10.3934/fods.2021001 [4] Xin Guo, Lei Shi. Preface of the special issue on analysis in data science: Methods and applications. Mathematical Foundations of Computing, 2020, 3 (4) : i-ii. doi: 10.3934/mfc.2020026 [5] A. M. Elaiw, N. H. AlShamrani, A. Abdel-Aty, H. Dutta. Stability analysis of a general HIV dynamics model with multi-stages of infected cells and two routes of infection. Discrete & Continuous Dynamical Systems - S, 2020  doi: 10.3934/dcdss.2020441 [6] Giulia Cavagnari, Antonio Marigonda. Attainability property for a probabilistic target in wasserstein spaces. Discrete & Continuous Dynamical Systems - A, 2021, 41 (2) : 777-812. doi: 10.3934/dcds.2020300 [7] Álvaro Castañeda, Pablo González, Gonzalo Robledo. Topological Equivalence of nonautonomous difference equations with a family of dichotomies on the half line. Communications on Pure & Applied Analysis, , () : -. doi: 10.3934/cpaa.2020278 [8] Tian Ma, Shouhong Wang. Topological phase transition III: Solar surface eruptions and sunspots. Discrete & Continuous Dynamical Systems - B, 2021, 26 (1) : 501-514. doi: 10.3934/dcdsb.2020350 [9] Jann-Long Chern, Sze-Guang Yang, Zhi-You Chen, Chih-Her Chen. On the family of non-topological solutions for the elliptic system arising from a product Abelian gauge field theory. Discrete & Continuous Dynamical Systems - A, 2020, 40 (6) : 3291-3304. doi: 10.3934/dcds.2020127 [10] Lin Jiang, Song Wang. Robust multi-period and multi-objective portfolio selection. Journal of Industrial & Management Optimization, 2021, 17 (2) : 695-709. doi: 10.3934/jimo.2019130 [11] Bilel Elbetch, Tounsia Benzekri, Daniel Massart, Tewfik Sari. The multi-patch logistic equation. Discrete & Continuous Dynamical Systems - B, 2021  doi: 10.3934/dcdsb.2021025 [12] Hyung-Chun Lee. Efficient computations for linear feedback control problems for target velocity matching of Navier-Stokes flows via POD and LSTM-ROM. Electronic Research Archive, , () : -. doi: 10.3934/era.2020128 [13] Paul E. Anderson, Timothy P. Chartier, Amy N. Langville, Kathryn E. Pedings-Behling. The rankability of weighted data from pairwise comparisons. Foundations of Data Science, 2021  doi: 10.3934/fods.2021002 [14] George W. Patrick. The geometry of convergence in numerical analysis. Journal of Computational Dynamics, 2021, 8 (1) : 33-58. doi: 10.3934/jcd.2021003 [15] Liang Huang, Jiao Chen. The boundedness of multi-linear and multi-parameter pseudo-differential operators. Communications on Pure & Applied Analysis, , () : -. doi: 10.3934/cpaa.2020291 [16] Anna Anop, Robert Denk, Aleksandr Murach. Elliptic problems with rough boundary data in generalized Sobolev spaces. Communications on Pure & Applied Analysis, , () : -. doi: 10.3934/cpaa.2020286 [17] Haruki Umakoshi. A semilinear heat equation with initial data in negative Sobolev spaces. Discrete & Continuous Dynamical Systems - S, 2021, 14 (2) : 745-767. doi: 10.3934/dcdss.2020365 [18] Min Chen, Olivier Goubet, Shenghao Li. Mathematical analysis of bump to bucket problem. Communications on Pure & Applied Analysis, 2020, 19 (12) : 5567-5580. doi: 10.3934/cpaa.2020251 [19] Hirokazu Ninomiya. Entire solutions of the Allen–Cahn–Nagumo equation in a multi-dimensional space. Discrete & Continuous Dynamical Systems - A, 2021, 41 (1) : 395-412. doi: 10.3934/dcds.2020364 [20] Shun Zhang, Jianlin Jiang, Su Zhang, Yibing Lv, Yuzhen Guo. ADMM-type methods for generalized multi-facility Weber problem. Journal of Industrial & Management Optimization, 2020  doi: 10.3934/jimo.2020171

Impact Factor: