Hostname: page-component-cd9895bd7-jkksz Total loading time: 0 Render date: 2025-01-01T04:45:10.488Z Has data issue: false hasContentIssue false

Depth Estimation for Local Colon Structure in Monocular Capsule Endoscopy Based on Brightness and Camera Motion

Published online by Cambridge University Press:  27 May 2020

Lei Xu
Affiliation:
Advanced Innovation Center for Intelligent Robots and Systems, Beijing Institute of Technology, Beijing, China. E-mail: [email protected]
Jing Li*
Affiliation:
Advanced Innovation Center for Intelligent Robots and Systems, Beijing Institute of Technology, Beijing, China. E-mail: [email protected]
Yang Hao
Affiliation:
School of Mechatronical Engineering, Beijing Institute of Technology, Beijing, China. E-mails: [email protected], [email protected], [email protected]
Peisen Zhang
Affiliation:
School of Mechatronical Engineering, Beijing Institute of Technology, Beijing, China. E-mails: [email protected], [email protected], [email protected]
Gastone Ciuti
Affiliation:
Advanced Innovation Center for Intelligent Robots and Systems, Beijing Institute of Technology, Beijing, China. E-mail: [email protected] The BioRobotics Institute, Scuola Superiore Sant'Anna, Pisa, Italy. E-mails: [email protected], [email protected]
Paolo Dario
Affiliation:
Advanced Innovation Center for Intelligent Robots and Systems, Beijing Institute of Technology, Beijing, China. E-mail: [email protected] The BioRobotics Institute, Scuola Superiore Sant'Anna, Pisa, Italy. E-mails: [email protected], [email protected]
Qiang Huang
Affiliation:
Advanced Innovation Center for Intelligent Robots and Systems, Beijing Institute of Technology, Beijing, China. E-mail: [email protected] School of Mechatronical Engineering, Beijing Institute of Technology, Beijing, China. E-mails: [email protected], [email protected], [email protected]
*
*Corresponding author. E-mail: [email protected]

Summary

We present a 3D reconstruction method using brightness and camera motion estimation for registering local colon structure in colonoscopy. The proposed method is based on reverse projection from 2D fold contours to 3D space, motion estimation from 3D reconstructed points between neighboring frames, and model registration to reconstruct the fold structure. On the synthetic colon, the average percentages of the reconstructed depth error and circumference error are about 14.2% and 15.2%, respectively. The accuracy is enough for the navigation and control in capsule robot. This work demonstrates that the proposed method is superior to the methods using single-frame-based brightness intensity.

Type
Articles
Copyright
Copyright © The Author(s), 2020. Published by Cambridge University Press

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Smith, L., “Screening for colorectal cancer: Surveillance after resection of a colorectal cancer, and the removal of large adenomas,” Endoscopy 17, 98102 (1987).Google Scholar
Harewood, G. C., “Relationship of colonoscopy completion rates and endoscopist features,” Digestive Dis. Sci. 50(1), 4751 (2005).10.1007/s10620-005-1276-yCrossRefGoogle ScholarPubMed
Ciuti, G., Valdastri, P., Menciassi, A. and Dario, P., “Robotic magnetic steering and locomotion of capsule endoscope for diagnostic and surgical endoluminal procedures,” Robotica 28(2), 199207 (2010).10.1017/S0263574709990361CrossRefGoogle Scholar
Furukawa, Y. and Ponce, J., “Accurate, dense, and robust multi-view stereopsis,” IEEE Trans. Pattern Anal. Mach. Intell. 32(8), 13621376 (2010).10.1109/TPAMI.2009.161CrossRefGoogle Scholar
Cremers, D. and Kolev, K., “Multi-view stereo and silhouette consistency via convex functionals over convex domains,” IEEE Trans. Pattern Anal. Mach. Intell. 33(6), 11611174 (2011).10.1109/TPAMI.2010.174CrossRefGoogle Scholar
Kazó, C. and Hajder, L., “Rapid Weak-Perspective Structure from Motion with Missing Data,” Proceedings of the IEEE International Conference on Computer Vision Workshops (2011) pp. 491498.Google Scholar
Chandraker, M., “What Camera Motion Reveals about Shape with Unknown BRDF,” Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition(CVPR) (2014) pp. 21792186.Google Scholar
Hajder, L. and Chetverikov, D., “Weak-perspective structure from motion for strongly contaminated data,” Pattern Recognit. Lett. 27(14), 15811589 (2006).10.1016/j.patrec.2006.03.007CrossRefGoogle Scholar
Ahmed, A. H. and Farag, A. A., “Shape from Shading for Hybrid Surfaces,” Proceedings of the IEEE International Conference on Image Processing (2007) pp. 525528.Google Scholar
Ikeda, O., “Shape-from-Shading Algorithm for Oblique Light Source,” Proceedings of the International Symposium on Advances in Visual Computing (2007) pp. 357366.10.1007/978-3-540-76856-2_35CrossRefGoogle Scholar
Ming, X., Zhao, R. C. and Maria, P., “Solving Self-Shadow Problem of Shape from Shading in Light Source Projected System,” Proceedings of the International Symposium on Intelligent Multimedia, Video and Speech Processing (2004) pp. 334337.10.1109/ISIMP.2004.1434068CrossRefGoogle Scholar
VisentiniScarzanella, M., Stoyanov, D. and G. Yang, Z., “Metric Depth Recovery from Monocular Images Using Shape-from-Shading and Specularities,” Proceedings of the IEEE Conference on Image Processing (ICIP) (2013) pp. 2528.Google Scholar
Ciuti, G., VisentiniScarzanella, M., Dore, A., Menciassi, A., Dario, P. and Yang, G. Z., “Intra-Operative Monocular 3D Reconstruction for Image-Guided Navigation in Active Locomotion Capsule Endoscopy,” Proceedings of the IEEE RAS and EMBS International Conference on Biomedical Robotics and Biomechatronics (2012) pp. 768774.Google Scholar
Armin, M. A., Barnes, N., Alvarez, J., Li, H. D., Grimpen, F. and Salvado, O., “Learning Camera Pose from Optical Colonoscopy Frames Through Deep Convolutional Neural Network (CNN),” In: Computer Assisted and Robotic Endoscopy and Clinical Image-Based Procedures. 4th International Workshop (CARE, 2017) pp. 5059.10.1007/978-3-319-67543-5_5CrossRefGoogle Scholar
Mahmood, F. and Durr, N. J., “Deep learning and conditional random fields-based depth estimation and topographical reconstruction from conventional endoscopy,” Med. Image Anal. 48, 230243 (2018).10.1016/j.media.2018.06.005CrossRefGoogle ScholarPubMed
Rau, A., Edwards, P., Ahmad, O., Riordan, P., Janatka, M., Lovat, L. and Stoyanov, D., “Implicit domain adaptation with conditional generative adversarial networks for depth prediction in endoscopy,” Int. J. Comput. Assist. Radiol. Surg. 14(7), 11671176 (2019).10.1007/s11548-019-01962-wCrossRefGoogle ScholarPubMed
Hong, D., Tavanapong, W., Wong, J., Oh, J. and Groen, P. D., “3D Reconstruction of virtual colon structures from colonoscopy images,” Comput. Med. Imaging Graphics 38(1), 2233 (2014).10.1016/j.compmedimag.2013.10.005CrossRefGoogle ScholarPubMed
Canny, J., “A computational approach to edge detection,” IEEE Trans. Pattern Anal. Mach. Intell. 8(6), 679698 (1986).10.1109/TPAMI.1986.4767851CrossRefGoogle ScholarPubMed
Carsten, R. and Steger, A., “A comprehensive and versatile camera model for cameras with tilt lenses,” Int. J. Comput. Vis. 123(2), 121159 (2017).Google Scholar
Bakstein, H. and Pajdla, T., “Panoramic Mosaicing with a 180 Degree Field of View Lens,” Proceedings of the IEEE Workshop on Omnidirectional Vision (2002) pp. 6068.Google Scholar
Kaufman, A. and Wang, J., “3D surface reconstruction from endoscopic videos,” In: Visualization in Medicine and Life Sciences (Encarnação, J., eds.) (Springer Berlin Heidelberg, 2008) pp. 6174.10.1007/978-3-540-72630-2_4CrossRefGoogle Scholar
Kazhdan, M., olitho, M. and Hoppe, H., “Poisson Surface Reconstruction,” Proceedings of the Symposium on Geometry Processing (2006) pp. 6170.Google Scholar
Bruhn, A., Weickert, J., Feddern, C., Kohlberger, T. and Schnorr, C., “Variational optical flow computation in real-time,” IEEE Trans. Image Process. 14(5), 608615 (2005).10.1109/TIP.2005.846018CrossRefGoogle Scholar
Nagel, H. H. and Enkelmann, W., “An investigation of smoothness constraints for the estimation of displacement vector fields from image sequences,” IEEE Trans. Pattern Anal. Mach. Intell. 8(5), 565593 (1986).10.1109/TPAMI.1986.4767833CrossRefGoogle ScholarPubMed
Brox, T., Bruhn, A., Papenberg, N. and Weickert, J., “High Accuracy Optical Flow Estimation Based on a Theory for Warping,” Proceedings of the European Conference on Computer Vision(ECCV) (2004) pp. 2536.Google Scholar