
-
Previous Article
Convexification for a 1D hyperbolic coefficient inverse problem with single measurement data
- IPI Home
- This Issue
-
Next Article
A nonconvex truncated regularization and box-constrained model for CT reconstruction
Nonlocal regularized CNN for image segmentation
1. | Department of Mathematics, Hong Kong Baptist University, Hong Kong, China |
2. | Laboratory of Mathematics and Complex Systems (Ministry of Education of China), School of Mathematical Sciences, Beijing Normal University, Beijing, China |
Non-local dependency is a very important prior for many image segmentation tasks. Generally, convolutional operations are building blocks that process one local neighborhood at a time which means the convolutional neural networks(CNNs) usually do not explicitly make use of the non-local prior on image segmentation tasks. Though the pooling and dilated convolution techniques can enlarge the receptive field to use some nonlocal information during the feature extracting step, there is no nonlocal priori for feature classification step in the current CNNs' architectures. In this paper, we present a non-local total variation (TV) regularized softmax activation function method for semantic image segmentation tasks. The proposed method can be integrated into the architecture of CNNs. To handle the difficulty of back-propagation for CNNs due to the non-smoothness of nonlocal TV, we develop a primal-dual hybrid gradient method to realize the back-propagation of nonlocal TV in CNNs. Experimental evaluations of the non-local TV regularized softmax layer on a series of image segmentation datasets showcase its good performance. Many CNNs can benefit from our proposed method on image segmentation tasks.
References:
[1] |
R. Adams and L. Bischof,
Seeded region growing, IEEE Transactions on Pattern Analysis and Machine Intelligence, 16 (1994), 641-647.
doi: 10.1109/34.295913. |
[2] |
M. Z. Alom, M. Hasan, C. Yakopcic, T. M. Taha and V. K. Asari, Recurrent residual convolutional neural network based on u-net (r2u-net) for medical image segmentation, arXiv: 1802.06955. Google Scholar |
[3] |
V. Badrinarayanan, A. Kendall and R. Cipolla, Segnet: A deep convolutional encoder-decoder architecture for image segmentation, arXiv: 1511.00561.
doi: 10.1109/TPAMI.2016.2644615. |
[4] |
L. Barghout and L. Lee, Perceptual information processing system, US Patent App. 10/618,543, (2004). Google Scholar |
[5] |
M. Benning, C. Brune, M. Burger and J. Müller,
Higher-order tv methods–enhancement via bregman iteration, Journal of Scientific Computing, 54 (2013), 269-310.
doi: 10.1007/s10915-012-9650-3. |
[6] |
H. Birkholz,
A unifying approach to isotropic and anisotropic total variation denoising models, Journal of Computational and Applied Mathematics, 235 (2011), 2502-2514.
doi: 10.1016/j.cam.2010.11.003. |
[7] |
J. Canny,
A computational approach to edge detection, IEEE Transactions on Pattern Analysis and Machine Intelligence, 8 (1986), 679-698.
doi: 10.1016/B978-0-08-051581-6.50024-6. |
[8] |
G. Gilboa and S. Osher,
Nonlocal operators with applications to image processing, Multiscale Modeling & Simulation, 7 (2008), 1005-1028.
doi: 10.1137/070698592. |
[9] |
K. He, X. Zhang, S. Ren and J. Sun, Delving deep into rectifiers: Surpassing human-level performance on imagenet classification, in Proceedings of the IEEE International Conference on Computer Vision, IEEE, 2015, 1026–1034.
doi: 10.1109/ICCV.2015.123. |
[10] |
F. Jia, J. Liu and X. Tai, A regularized convolutional neural network for semantic image segmentation, Analysis and Applications, (2020) 1–19. Google Scholar |
[11] |
M. Johnson-Roberson, C. Barto, R. Mehta, S. N. Sridhar, K. Rosaen and R. Vasudevan, Driving in the matrix: Can virtual worlds replace human-generated annotations for real world tasks?, preprint, arXiv: 1610.01983.
doi: 10.1109/ICRA.2017.7989092. |
[12] |
M. Kass, A. Witkin and D. Terzopoulos, Snakes: Active contour models, International Journal of Computer Vision, 1, (1988) 321–331.
doi: 10.1007/BF00133570. |
[13] |
P. Krähenbühl and V. Koltun, Efficient inference in fully connected crfs with gaussian edge potentials., Advances in Neural Information Processing Systems, (2011), 109–117. Google Scholar |
[14] |
A. Krizhevsky, I. Sutskever and G. E. Hinton, Imagenet classification with deep convolutional neural networks, Advances in Neural Information Processing Systems, (2012), 1097–1105.
doi: 10.1145/3065386. |
[15] |
Y. LeCun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard, W. Hubbard and L. D. Jackel,
Backpropagation applied to handwritten zip code recognition, Neural Computation, 1 (1989), 541-551.
doi: 10.1162/neco.1989.1.4.541. |
[16] |
G. Lin, C. Shen, A. V. D. Hengel and I. Reid, Efficient piecewise training of deep structured models for semantic segmentation, in Proceedings of the IEEE Conference on Computer Cision and Pattern Recognition, IEEE, 2016, 3194–3203.
doi: 10.1109/CVPR.2016.348. |
[17] |
J. Long, E. Shelhamer, and T. Darrell, Fully convolutional networks for semantic segmentation, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, IEEE, 2015, 3431–3440.
doi: 10.1109/CVPR.2015.7298965. |
[18] |
M. Lysaker, A. Lundervold and X.-C. Tai, Noise removal using fourth-order partial differential equation with applications to medical magnetic resonance images in space and time, IEEE Transactions on Image Processing, 12, (2003), 1579–1590.
doi: 10.1109/TIP.2003.819229. |
[19] |
D. R. Martin, C. C. Fowlkes and and J. Malik,
Learning to detect natural image boundaries using local brightness, color, and texture cues, IEEE Transactions on Pattern Analysis and Machine Intelligence, 26 (2004), 530-549.
doi: 10.1109/TPAMI.2004.1273918. |
[20] |
K. Mikula, A. Sarti and F. Sgallari, Co-volume level set method in subjective surface based medical image segmentation, in Handbook of Biomedical Image Analysis, Springer, (2005), 583–626.
doi: 10.1007/0-306-48551-6_11. |
[21] |
D. Mumford and J. Shah,
Optimal approximations by piecewise smooth functions and associated variational problems, Communications on Pure and Applied Mathematics, 42 (1989), 577-685.
doi: 10.1002/cpa.3160420503. |
[22] |
H. Noh, S. Hong and B. Han, Learning deconvolution network for semantic segmentation, in Proceedings of the IEEE International Conference on Computer Vision, IEEE, 2015, 1520–1528.
doi: 10.1109/ICCV.2015.178. |
[23] |
O. Oktay, et al., Attention u-net: Learning where to look for the pancreas, preprint, arXiv: 1804.03999. Google Scholar |
[24] |
N. Otsu,
A threshold selection method from gray-level histograms, IEEE Transactions on Systems, Man and Cybernetics, 9 (1979), 62-66.
doi: 10.1109/TSMC.1979.4310076. |
[25] |
O. Ronneberger, P. Fischer and T. Brox, U-net: Convolutional networks for biomedical image segmentation, in International Conference on Medical Image Computing and Computer-Assisted Intervention, Springer, 2015,234–241.
doi: 10.1007/978-3-319-24574-4_28. |
[26] |
L. I. Rudin, S. Osher and E. Fatemi,
Nonlinear total variation based noise removal algorithms, Physica D: Nonlinear Phenomena, 60 (1992), 259-268.
doi: 10.1016/0167-2789(92)90242-F. |
[27] | B. Schölkopf, K. Tsuda and J.-P. Vert, Support Vector Machine Applications in Computational Biology, MIT press, 2004. Google Scholar |
[28] |
L. Shapiro and G. C. Stockman, Computer Vision, Prentice Hall, 2001. Google Scholar |
[29] |
J. Shi and J. Malik, Normalized cuts and image segmentation, IEEE Transactions on Pattern Analysis and Machine Intelligence, 22 (2000), 888-908. Google Scholar |
[30] |
K. Simonyan and A. Zisserman, Very deep convolutional networks for large-scale image recognition, preprint, arXiv: 1409.1556. Google Scholar |
[31] |
M. Unger, T. Mauthner, T. Pock and H. Bischof, Tracking as segmentation of spatial-temporal volumes by anisotropic weighted tv, in International Workshop on Energy Minimization Methods in Computer Vision and Pattern Recognition, Springer 2009,193–206.
doi: 10.1007/978-3-642-03641-5_15. |
[32] |
P. Wang, P. Chen, Y. Yuan, D. Liu, Z. Huang, X. Hou, and G. Cottrell, Understanding convolution for semantic segmentation, in 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), IEEE, 2018, 1451–1460.
doi: 10.1109/WACV.2018.00163. |
[33] |
K. Wei, K. Yin, X.-C. Tai and T. F. Chan, New region force for variational models in image segmentation and high dimensional data clustering, preprint, arXiv: 1704.08218.
doi: 10.4310/AMSA.2018.v3.n1.a8. |
[34] |
K. Yin and X.-C. Tai,
An effective region force for some variational models for learning and clustering, Journal of Scientific Computing, 74 (2018), 175-196.
doi: 10.1007/s10915-017-0429-4. |
[35] |
F. Yu and V. Koltun, Multi-scale context aggregation by dilated convolutions, preprint, arXiv: 1511.07122. Google Scholar |
[36] |
L. Zelnik-Manor and P. Perona, Self-tuning spectral clustering, Advances in Neural Information Processing Systems, (2005), 1601–1608. Google Scholar |
[37] |
X. Zheng, Y. Wang, G. Wang and J. Liu,
Fast and robust segmentation of white blood cell images by self-supervised learning, Micron, 107 (2018), 55-71.
doi: 10.1016/j.micron.2018.01.010. |
show all references
References:
[1] |
R. Adams and L. Bischof,
Seeded region growing, IEEE Transactions on Pattern Analysis and Machine Intelligence, 16 (1994), 641-647.
doi: 10.1109/34.295913. |
[2] |
M. Z. Alom, M. Hasan, C. Yakopcic, T. M. Taha and V. K. Asari, Recurrent residual convolutional neural network based on u-net (r2u-net) for medical image segmentation, arXiv: 1802.06955. Google Scholar |
[3] |
V. Badrinarayanan, A. Kendall and R. Cipolla, Segnet: A deep convolutional encoder-decoder architecture for image segmentation, arXiv: 1511.00561.
doi: 10.1109/TPAMI.2016.2644615. |
[4] |
L. Barghout and L. Lee, Perceptual information processing system, US Patent App. 10/618,543, (2004). Google Scholar |
[5] |
M. Benning, C. Brune, M. Burger and J. Müller,
Higher-order tv methods–enhancement via bregman iteration, Journal of Scientific Computing, 54 (2013), 269-310.
doi: 10.1007/s10915-012-9650-3. |
[6] |
H. Birkholz,
A unifying approach to isotropic and anisotropic total variation denoising models, Journal of Computational and Applied Mathematics, 235 (2011), 2502-2514.
doi: 10.1016/j.cam.2010.11.003. |
[7] |
J. Canny,
A computational approach to edge detection, IEEE Transactions on Pattern Analysis and Machine Intelligence, 8 (1986), 679-698.
doi: 10.1016/B978-0-08-051581-6.50024-6. |
[8] |
G. Gilboa and S. Osher,
Nonlocal operators with applications to image processing, Multiscale Modeling & Simulation, 7 (2008), 1005-1028.
doi: 10.1137/070698592. |
[9] |
K. He, X. Zhang, S. Ren and J. Sun, Delving deep into rectifiers: Surpassing human-level performance on imagenet classification, in Proceedings of the IEEE International Conference on Computer Vision, IEEE, 2015, 1026–1034.
doi: 10.1109/ICCV.2015.123. |
[10] |
F. Jia, J. Liu and X. Tai, A regularized convolutional neural network for semantic image segmentation, Analysis and Applications, (2020) 1–19. Google Scholar |
[11] |
M. Johnson-Roberson, C. Barto, R. Mehta, S. N. Sridhar, K. Rosaen and R. Vasudevan, Driving in the matrix: Can virtual worlds replace human-generated annotations for real world tasks?, preprint, arXiv: 1610.01983.
doi: 10.1109/ICRA.2017.7989092. |
[12] |
M. Kass, A. Witkin and D. Terzopoulos, Snakes: Active contour models, International Journal of Computer Vision, 1, (1988) 321–331.
doi: 10.1007/BF00133570. |
[13] |
P. Krähenbühl and V. Koltun, Efficient inference in fully connected crfs with gaussian edge potentials., Advances in Neural Information Processing Systems, (2011), 109–117. Google Scholar |
[14] |
A. Krizhevsky, I. Sutskever and G. E. Hinton, Imagenet classification with deep convolutional neural networks, Advances in Neural Information Processing Systems, (2012), 1097–1105.
doi: 10.1145/3065386. |
[15] |
Y. LeCun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard, W. Hubbard and L. D. Jackel,
Backpropagation applied to handwritten zip code recognition, Neural Computation, 1 (1989), 541-551.
doi: 10.1162/neco.1989.1.4.541. |
[16] |
G. Lin, C. Shen, A. V. D. Hengel and I. Reid, Efficient piecewise training of deep structured models for semantic segmentation, in Proceedings of the IEEE Conference on Computer Cision and Pattern Recognition, IEEE, 2016, 3194–3203.
doi: 10.1109/CVPR.2016.348. |
[17] |
J. Long, E. Shelhamer, and T. Darrell, Fully convolutional networks for semantic segmentation, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, IEEE, 2015, 3431–3440.
doi: 10.1109/CVPR.2015.7298965. |
[18] |
M. Lysaker, A. Lundervold and X.-C. Tai, Noise removal using fourth-order partial differential equation with applications to medical magnetic resonance images in space and time, IEEE Transactions on Image Processing, 12, (2003), 1579–1590.
doi: 10.1109/TIP.2003.819229. |
[19] |
D. R. Martin, C. C. Fowlkes and and J. Malik,
Learning to detect natural image boundaries using local brightness, color, and texture cues, IEEE Transactions on Pattern Analysis and Machine Intelligence, 26 (2004), 530-549.
doi: 10.1109/TPAMI.2004.1273918. |
[20] |
K. Mikula, A. Sarti and F. Sgallari, Co-volume level set method in subjective surface based medical image segmentation, in Handbook of Biomedical Image Analysis, Springer, (2005), 583–626.
doi: 10.1007/0-306-48551-6_11. |
[21] |
D. Mumford and J. Shah,
Optimal approximations by piecewise smooth functions and associated variational problems, Communications on Pure and Applied Mathematics, 42 (1989), 577-685.
doi: 10.1002/cpa.3160420503. |
[22] |
H. Noh, S. Hong and B. Han, Learning deconvolution network for semantic segmentation, in Proceedings of the IEEE International Conference on Computer Vision, IEEE, 2015, 1520–1528.
doi: 10.1109/ICCV.2015.178. |
[23] |
O. Oktay, et al., Attention u-net: Learning where to look for the pancreas, preprint, arXiv: 1804.03999. Google Scholar |
[24] |
N. Otsu,
A threshold selection method from gray-level histograms, IEEE Transactions on Systems, Man and Cybernetics, 9 (1979), 62-66.
doi: 10.1109/TSMC.1979.4310076. |
[25] |
O. Ronneberger, P. Fischer and T. Brox, U-net: Convolutional networks for biomedical image segmentation, in International Conference on Medical Image Computing and Computer-Assisted Intervention, Springer, 2015,234–241.
doi: 10.1007/978-3-319-24574-4_28. |
[26] |
L. I. Rudin, S. Osher and E. Fatemi,
Nonlinear total variation based noise removal algorithms, Physica D: Nonlinear Phenomena, 60 (1992), 259-268.
doi: 10.1016/0167-2789(92)90242-F. |
[27] | B. Schölkopf, K. Tsuda and J.-P. Vert, Support Vector Machine Applications in Computational Biology, MIT press, 2004. Google Scholar |
[28] |
L. Shapiro and G. C. Stockman, Computer Vision, Prentice Hall, 2001. Google Scholar |
[29] |
J. Shi and J. Malik, Normalized cuts and image segmentation, IEEE Transactions on Pattern Analysis and Machine Intelligence, 22 (2000), 888-908. Google Scholar |
[30] |
K. Simonyan and A. Zisserman, Very deep convolutional networks for large-scale image recognition, preprint, arXiv: 1409.1556. Google Scholar |
[31] |
M. Unger, T. Mauthner, T. Pock and H. Bischof, Tracking as segmentation of spatial-temporal volumes by anisotropic weighted tv, in International Workshop on Energy Minimization Methods in Computer Vision and Pattern Recognition, Springer 2009,193–206.
doi: 10.1007/978-3-642-03641-5_15. |
[32] |
P. Wang, P. Chen, Y. Yuan, D. Liu, Z. Huang, X. Hou, and G. Cottrell, Understanding convolution for semantic segmentation, in 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), IEEE, 2018, 1451–1460.
doi: 10.1109/WACV.2018.00163. |
[33] |
K. Wei, K. Yin, X.-C. Tai and T. F. Chan, New region force for variational models in image segmentation and high dimensional data clustering, preprint, arXiv: 1704.08218.
doi: 10.4310/AMSA.2018.v3.n1.a8. |
[34] |
K. Yin and X.-C. Tai,
An effective region force for some variational models for learning and clustering, Journal of Scientific Computing, 74 (2018), 175-196.
doi: 10.1007/s10915-017-0429-4. |
[35] |
F. Yu and V. Koltun, Multi-scale context aggregation by dilated convolutions, preprint, arXiv: 1511.07122. Google Scholar |
[36] |
L. Zelnik-Manor and P. Perona, Self-tuning spectral clustering, Advances in Neural Information Processing Systems, (2005), 1601–1608. Google Scholar |
[37] |
X. Zheng, Y. Wang, G. Wang and J. Liu,
Fast and robust segmentation of white blood cell images by self-supervised learning, Micron, 107 (2018), 55-71.
doi: 10.1016/j.micron.2018.01.010. |





[1] |
Shi Yan, Jun Liu, Haiyang Huang, Xue-Cheng Tai. A dual EM algorithm for TV regularized Gaussian mixture model in image segmentation. Inverse Problems & Imaging, 2019, 13 (3) : 653-677. doi: 10.3934/ipi.2019030 |
[2] |
Yuan Wang, Zhi-Feng Pang, Yuping Duan, Ke Chen. Image retinex based on the nonconvex TV-type regularization. Inverse Problems & Imaging, , () : -. doi: 10.3934/ipi.2020050 |
[3] |
Alina Toma, Bruno Sixou, Françoise Peyrin. Iterative choice of the optimal regularization parameter in TV image restoration. Inverse Problems & Imaging, 2015, 9 (4) : 1171-1191. doi: 10.3934/ipi.2015.9.1171 |
[4] |
Wei Wan, Haiyang Huang, Jun Liu. Local block operators and TV regularization based image inpainting. Inverse Problems & Imaging, 2018, 12 (6) : 1389-1410. doi: 10.3934/ipi.2018058 |
[5] |
Jianhong (Jackie) Shen, Sung Ha Kang. Quantum TV and applications in image processing. Inverse Problems & Imaging, 2007, 1 (3) : 557-575. doi: 10.3934/ipi.2007.1.557 |
[6] |
Ye Yuan, Yan Ren, Xiaodong Liu, Jing Wang. Approach to image segmentation based on interval neutrosophic set. Numerical Algebra, Control & Optimization, 2020, 10 (1) : 1-11. doi: 10.3934/naco.2019028 |
[7] |
Dominique Zosso, Jing An, James Stevick, Nicholas Takaki, Morgan Weiss, Liane S. Slaughter, Huan H. Cao, Paul S. Weiss, Andrea L. Bertozzi. Image segmentation with dynamic artifacts detection and bias correction. Inverse Problems & Imaging, 2017, 11 (3) : 577-600. doi: 10.3934/ipi.2017027 |
[8] |
Matthew S. Keegan, Berta Sandberg, Tony F. Chan. A multiphase logic framework for multichannel image segmentation. Inverse Problems & Imaging, 2012, 6 (1) : 95-110. doi: 10.3934/ipi.2012.6.95 |
[9] |
Gernot Holler, Karl Kunisch. Learning nonlocal regularization operators. Mathematical Control & Related Fields, 2021 doi: 10.3934/mcrf.2021003 |
[10] |
Manxue You, Shengjie Li. Perturbation of Image and conjugate duality for vector optimization. Journal of Industrial & Management Optimization, 2020 doi: 10.3934/jimo.2020176 |
[11] |
Feishe Chen, Lixin Shen, Yuesheng Xu, Xueying Zeng. The Moreau envelope approach for the L1/TV image denoising model. Inverse Problems & Imaging, 2014, 8 (1) : 53-77. doi: 10.3934/ipi.2014.8.53 |
[12] |
Ruiqiang He, Xiangchu Feng, Xiaolong Zhu, Hua Huang, Bingzhe Wei. RWRM: Residual Wasserstein regularization model for image restoration. Inverse Problems & Imaging, , () : -. doi: 10.3934/ipi.2020069 |
[13] |
Jianping Zhang, Ke Chen, Bo Yu, Derek A. Gould. A local information based variational model for selective image segmentation. Inverse Problems & Imaging, 2014, 8 (1) : 293-320. doi: 10.3934/ipi.2014.8.293 |
[14] |
Lu Tan, Ling Li, Senjian An, Zhenkuan Pan. Nonlinear diffusion based image segmentation using two fast algorithms. Mathematical Foundations of Computing, 2019, 2 (2) : 149-168. doi: 10.3934/mfc.2019011 |
[15] |
Ruiliang Zhang, Xavier Bresson, Tony F. Chan, Xue-Cheng Tai. Four color theorem and convex relaxation for image segmentation with any number of regions. Inverse Problems & Imaging, 2013, 7 (3) : 1099-1113. doi: 10.3934/ipi.2013.7.1099 |
[16] |
Balázs Kósa, Karol Mikula, Markjoe Olunna Uba, Antonia Weberling, Neophytos Christodoulou, Magdalena Zernicka-Goetz. 3D image segmentation supported by a point cloud. Discrete & Continuous Dynamical Systems - S, 2021, 14 (3) : 971-985. doi: 10.3934/dcdss.2020351 |
[17] |
Jie Huang, Xiaoping Yang, Yunmei Chen. A fast algorithm for global minimization of maximum likelihood based on ultrasound image segmentation. Inverse Problems & Imaging, 2011, 5 (3) : 645-657. doi: 10.3934/ipi.2011.5.645 |
[18] |
Liam Burrows, Weihong Guo, Ke Chen, Francesco Torella. Reproducible kernel Hilbert space based global and local image segmentation. Inverse Problems & Imaging, 2021, 15 (1) : 1-25. doi: 10.3934/ipi.2020048 |
[19] |
Tingting Wu, Yufei Yang, Huichao Jing. Two-step methods for image zooming using duality strategies. Numerical Algebra, Control & Optimization, 2014, 4 (3) : 209-225. doi: 10.3934/naco.2014.4.209 |
[20] |
Yun Chen, Jiasheng Huang, Si Li, Yao Lu, Yuesheng Xu. A content-adaptive unstructured grid based integral equation method with the TV regularization for SPECT reconstruction. Inverse Problems & Imaging, 2020, 14 (1) : 27-52. doi: 10.3934/ipi.2019062 |
2019 Impact Factor: 1.373
Tools
Metrics
Other articles
by authors
[Back to Top]