| 研究生: |
張博一 Po-Yi Chang |
|---|---|
| 論文名稱: |
採用適應性支持權重方法之 光場影像深度估測 Depth Estimation Using Adaptive Support-Weight Approach for Light Field Images |
| 指導教授: |
唐之瑋
Chih-Wei Tang |
| 口試委員: | |
| 學位類別: |
碩士 Master |
| 系所名稱: |
資訊電機學院 - 通訊工程學系 Department of Communication Engineering |
| 論文出版年: | 2018 |
| 畢業學年度: | 106 |
| 語文別: | 中文 |
| 論文頁數: | 44 |
| 中文關鍵詞: | 光場影像 、視差估測 、適應性支持權重方法 、子像素精確度 、十字型視窗 |
| 外文關鍵詞: | light field images, disparity estimation, adaptive support-weight, sub-pixel accuracy, cross window |
| 相關次數: | 點閱:16 下載:0 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
光場相機(Light field camera)以透鏡陣列(microlens array)擷取多視角影像,由於多視角之間的基線較窄,因此在計算視差時需精確至小數位之視差值。適應性支持權重方法為一種局部性的視差估測方案,雖現有文獻曾使用適應性支持權重方法(adaptive support-weight approach)對光場影像進行視差估測,但未考慮估測視差之子像素精確度(sub-pixel accuracy)。
因此,本論文提出基於改進的適應性支持權重方法(adaptive support-weight approach)之光場影像深度估測方案。在計算視差前,先對輸入之各視角的光場影像進行雙立方內插(bicubic interpolation),再以適應性支持權重方法(adaptive support-weight approach)計算視差,以提高視差估測之精確度。於計算適應性支持權重之匹配代價時,本論文採用十字形視窗以降低計算複雜度(computation complexity),並動態調整十字形視窗之交錯位置,以估測影像邊界的視差值,此外,加重影像中變化激烈處之像素點之適應性支持權重,以提高視差估測之準確率。最後,結合所有微透鏡陣列(microlens array)之同一水平位置的各視角之估測視差的值,以提高視差估測之準確性。實驗結果顯示,本論文所提出之方案,相較於現有於極平面影像(epipolar plane image)基於適應性視窗之方案,於new HCI dataset之training子集,平均上可降低5.4%之視差估測之錯誤率(badpixel)。
Light field cameras acquire muti-view images using the microlens array. Due to the narrow baseline between multiple views, sub-pixel accuracy of the estimated disparity is expected. Adaptive support-weight approach is a local based disparity estimation method. Although several adaptive support-weight approach (ASW) based disparity estimation schemes for light field images have been proposed, they did not consider the problem of sub-pixel accuracy.
Therefore, this thesis proposes to improve adaptive support-weight approach based depth estimation for light field images. Before disparity estimation, bicubic interpolation is applied to light field images for sub-pixel accuracy. Then the adaptive support-weight approach estimates disparities, where the cross window is adopted to reduce computation complexity. The intersection position of the vertical and horizontal arms is dynamically adjusted on the image border. Then, we increase the weights of pixels which have higher edge response. Finally, the estimated disparities from multiple view featuring with the same horizontal position are combined to generate the disparity map of the central view. Our experimental results show that the average error rate of the proposed method is lower than that of the EPI based adaptive window matching approach for 5.4%.
[1] A. Gershun, “The Light Field,” J. Math. Phys., Vol. 18, pp. 51–151, Apr. 1939.
[2] E. H. Adelson and J. R. Bergen, “The Plenoptic Function and The Elements of Early Vision,” Comput. Models Visual Process. MIT Press, pp. 3–20, Aug. 1991.
[3] M. Levoy and P. Hanrahan, “Light Field Rendering,” in Proc. 23rd Annu. Conf. Comput. Graph. Interactive Techn. pp. 31–42, Aug. 1996.
[4] S. Richard, G. Steven, G. Radek, and C. Michael F, “The Lumigraph,” in Proc. 23rd Annu. Conf. Comput. Graph. Interactive Techn, pp. 43–54, Aug. 1996.
[5] K. Yoon and I. Kweon, “Adaptive Support-weight Approach for Correspondence Search”, in Proceedings of IEEE Trans. Circuits and Systems for Video Technology, Vol. 28, No.4, pp. 650-656, May 2006.
[6] K. Zhang, J. Lu, and G. Lafruit , “Cross-Based Local Stereo Matching Using Orthogonal Integral Images,” in Proceedings of IEEE Trans. Circuits and Systems for Video Technology, Vol. 19, No. 7, pp. 1073-1079, July 2009.
[7] J. Kowalczuk, E. T. Psota, and L. C. Pérez “Light Field Assisted Stereo Matching using Depth from Focus and Image-Guided Cost-Volume Filtering,” in Proceedings of IEEE International Conference on Image Processing, Vol. 2, pp. 1114-1119, July 2012.
[8] K. T. Ng and Member, “A Multi-Camera Approach to Image-Based Rendering and 3-D/Multiview Display of Ancient Chinese Artifacts,” in Proceedings of IEEE Transactions on Multimedia, Vol. 14, No. 6, pp. 1631-1641, Dec. 2012.
[9] W. S. Russell, “Polynomial Interpretation Schemes for Internal Derivative Distribution on Structured Grids,’’ Appl. Numer. Math, Vol. 17, pp. 129-171, May 1995.
[10] K. Honauer, O. Johannsen, D. Kondermann, and B. Goldluecke, “A Dataset and Evaluation Methodology for Depth Estimation on 4D Light Fields,” Asian Conference on Computer Vision. Lecture Notes in Computer Science, Vol. 10113. pp. 19-34, Springer, Cham, Mar. 2016.
[11] P. H. Lin, J. S. Yeh , F. C. Wu, and Y. Y. Chuang, “Depth Estimation for Lytro Images by Adaptive Window Matching on EPI,” J. Imaging, Vol. 3, Issue 2, 17 , May 2017.
[12] D. Scharstein and R. Szeliski “A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms,” in Proceedings of IEEE Workshop Stereo and Multi-Baseline Vision, Dec. 2001.
[13] Q. Yang, L. Wang, R. Yang, S. Wang, M. Liao1, and D Nist´er, “Real-time Global Stereo Matching Using Hierarchical Belief Propagation,” British Machine Vision Conference, pp.989-998, Jan. 2006.
[14] R. Bellman, “Dynamic Programming,” Dover Books on Computer Science, Reprint Edition, Mar. 2003.
[15] Y. Boykov, O. Veksler, and R. Zabih , “Fast Approximate Energy Minimization via Graph Cuts,” in Proceedings of IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 23 Issue 11, pp. 1222-1239, Nov. 2001.
[16] J. Sun and N.-N. Zheng, “Stereo Matching Using Belief Propagation,” in Proceedings of IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 25, Issue: 7, July 2003.
[17] C.-K. Liang, C.-C. Cheng, and Y. C. Lai, “Hardware-Efficient Belief Propagation,” in Proceedings of IEEE Transactions on Circuits and Systems for Video Technology, pp. 80-87, June 2009.
[18] R. Nagpal and A. Bhowmik “Light Field Denoising and Upsampling Using Variations of the 4D Bilateral Filter,” Taiwan Online Programming Contest, Vol. 38 Issue 6-7, pp. 329-341, June 2012.
[19] C. Perwass and P. Wietzke. “Single Lens 3D-camera with Extended Depth-of-field,” International Society for Optics and Photonics Elect. Imaging, Feb. 2012.
[20] S. Wanner and B. Goldluecke. “Globally Consistent Depth Labeling of 4D Light Fields,” in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp.41-48, June 2012.
[21] R. C. Bolles, H. H. Baker, and D. H. Marimont, “Epipolar-plane Image Analysis: An Approach to Determining Structure from Motion,” International Journal of Computer Vision, Vol. 1, Issue 1, pp. 7–55, Mar. 1987.
[22] S. Wanner and B. Goldluecke, “Reconstructing Reflective and Transparent Surfaces from Epipolar Plane Images, Heidelberg Collaboratory for Image Processing,” Lecture Notes in Computer Science, Vol. 8142, 2017.
[23] M.A. Nuno-Maganda, M.O. Arias-Estrada, “Real-Time FPGA-Based Architecture for Bicubic Interpolation: An Application for Digital Image Scaling,” International Conference on Reconfigurable Computing and FPGAs, Sep. 2005.
[24] http://www.neoimaging.cn
[25] T.-C. Wang, A.-A. Efros, R. Ramamoorthi, and Senior Member, “Depth Estimation with Occlusion Modeling Using Light-Field Cameras,” in Proceedings of IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 2170-2181, Jan. 2016.
[26] M. Strecke, A. Alperovich, and B. Goldluecke, “Accurate Depth and Normal Maps from Occlusion-Aware Focal Stack Symmetry,” in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp. 2529-2537, July 2017.
[27] S. Borman, and R.L. Stevenson, “Spatial Resolution Enhancement of Low-resolution Image Sequences. A Comprehensive Review with Directions for Future Research Lab,” Image and Signal Analysis Tech. Rep., University of Notre Dame, 1998.
[28] G. H. Schut, "Review of Interpolation Methods for Digital Terrain Models," Canadian Surveyor, Vol. 30, No. 5, pp. 389-412, Dec. 1976.
[29] D. Kidner, M. Dorey, “What's The Point? Interpolation and Extrapolation with a Regular Grid DEM,” Division of Mathematics & Computing, 2001.
[30] C. Tomas, R. Manduchi, “Bilateral Filtering for Gray and Color Images,” in Proceedings of Sixth International Conference on Computer Vision, Jan. 1998.
[31] YouTube: Lytro Camera Light Field Technology, 2011 [Video file]. Retrieved from https://www.youtube.com/watch?v=xNJZHFZEkYQ.
[32] H. Lin, C. Chen, S. B. Kang, and J. Yu, “Depth Recovery from Light Field Using Focal Stack Symmetry,” in Proceedings of IEEE Conference on Computer Vision. pp. 3451–3459, Feb. 2015.
[33] E. Adelson and J. Wang. “Single Lens Stereo with a Plenoptic Camera,” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 14, Issue: 2, Feb. 1992.
[34] G. Wu, B. Masia, A. Jarabo, Y. Zhang, L. Wang, Q. Dai, and Senior Member,“Light Field Image Processing: An Overvie,” IEEE Journal of Selected Topics in Signal Processing, Vol. 11, No. 7, pp.926-954, Oct. 2017.