Dewar** Document Image by Displacement Flow Estimation with Fully Convolutional Network

**e, Guo-Wang; Yin, Fei; Zhang, Xu-Yao; Liu, Cheng-Lin

doi:10.1007/978-3-030-57058-3_10

Guo-Wang **e^11,12,
Fei Yin¹²,
Xu-Yao Zhang^11,12 &
…
Cheng-Lin Liu^11,12,13

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 12116))

Included in the following conference series:

International Workshop on Document Analysis Systems

1570 Accesses

Abstract

As camera-based documents are increasingly used, the rectification of distorted document images becomes a need to improve the recognition performance. In this paper, we propose a novel framework for both rectifying distorted document image and removing background finely, by estimating pixel-wise displacements using a fully convolutional network (FCN). The document image is rectified by transformation according to the displacements of pixels. The FCN is trained by regressing displacements of synthesized distorted documents, and to control the smoothness of displacements, we propose a Local Smooth Constraint (LSC) in regularization. Our approach is easy to implement and consumes moderate computing resource. Experiments proved that our approach can dewarp document images effectively under various geometric distortions, and has achieved the state-of-the-art performance in terms of local details and overall effect.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Chapter: EUR 29.95; Price includes VAT (Germany)

eBook: EUR 42.79; Price includes VAT (Germany)

Softcover Book: EUR 53.49; Price includes VAT (Germany)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Amidror, I.: Scattered data interpolation methods for electronic imaging systems: a survey. J. Electron. Imaging 11, 157–76 (2002)
Article Google Scholar
Brown, M.S., Tsoi, Y.C.: Geometric and shading correction for images of printed materials using boundary. IEEE Trans. Image Process. 15(6), 1544–1554 (2006)
Article Google Scholar
Cao, H., Ding, X., Liu, C.: A cylindrical surface model to rectify the bound document image. In: Proceedings Ninth IEEE International Conference on Computer Vision, pp. 228–233. IEEE (2003)
Google Scholar
Chen, L.C., Papandreou, G., Schroff, F., Adam, H.: Rethinking atrous convolution for semantic image segmentation. ar**v preprint ar**v:1706.05587 (2017)
Courteille, F., Crouzil, A., Durou, J.D., Gurdjos, P.: Shape from shading for the digitization of curved documents. Mach. Vis. Appl. 18(5), 301–316 (2007)
Article Google Scholar
Das, S., Ma, K., Shu, Z., Samaras, D., Shilkrot, R.: Dewarpnet: single-image document unwar** with stacked 3D and 2D regression networks. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 131–140 (2019)
Google Scholar
Das, S., Mishra, G., Sudharshana, A., Shilkrot, R.: The common fold: utilizing the four-fold to dewarp printed documents from a single image. In: Proceedings of the 2017 ACM Symposium on Document Engineering, pp. 125–128. ACM (2017)
Google Scholar
Fu, B., Wu, M., Li, R., Li, W., Xu, Z., Yang, C.: A model-based book dewar** method using text line detection. In: Proceedings of 2nd International Workshop on Camera Based Document Analysis and Recognition, Curitiba, Barazil, pp. 63–70 (2007)
Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Google Scholar
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. ar**v preprint ar**v:1412.6980 (2014)
Li, X., Zhang, B., Liao, J., Sander, P.V.: Document rectification and illumination correction using a patch-based cnn. ACM Trans. Graph. 38(6), 1–11 (2019)
Google Scholar
Liang, J., DeMenthon, D., Doermann, D.: Geometric rectification of camera-captured document images. IEEE Trans. Pattern Anal. Mach. Intell. 30(4), 591–605 (2008)
Article Google Scholar
Liu, C., Yuen, J., Torralba, A.: Sift flow: dense correspondence across scenes and its applications. IEEE Trans. Pattern Anal. Mach. Intell. 33(5), 978–994 (2010)
Article Google Scholar
Liu, C., Zhang, Y., Wang, B., Ding, X.: Restoring camera-captured distorted document images. Int. J. Doc. Anal. Recogn. 18(2), 111–124 (2015)
Article Google Scholar
Ma, K., Shu, Z., Bai, X., Wang, J., Samaras, D.: Docunet: document image unwar** via a stacked u-net. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4700–4709 (2018)
Google Scholar
Meng, G., Wang, Y., Qu, S., **ang, S., Pan, C.: Active flattening of curved document images via two structured beams. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3890–3897 (2014)
Google Scholar
Ramanna, V., Bukhari, S.S., Dengel, A.: Document image dewar** using deep learning. In: International Conference on Pattern Recognition Applications and Methods (2019)
Google Scholar
Tian, Y., Narasimhan, S.G.: Rectification and 3D reconstruction of curved document images. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 377–384. IEEE (2011)
Google Scholar
Tsoi, Y.C., Brown, M.S.: Multi-view document rectification using boundary. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8. IEEE (2007)
Google Scholar
Wada, T., Ukida, H., Matsuyama, T.: Shape from shading with interreflections under a proximal light source: distortion-free copying of an unfolded book. Int. J. Comput. Vision 24(2), 125–135 (1997)
Article Google Scholar
Wang, P., et al.: Understanding convolution for semantic segmentation. In: IEEE Winter Conference on Applications of Computer Vision, pp. 1451–1460. IEEE (2018)
Google Scholar
Wang, Z., Simoncelli, E.P., Bovik, A.C.: Multiscale structural similarity for image quality assessment. In: The Thirty-Seventh Asilomar Conference on Signals, Systems and Computers, vol. 2, pp. 1398–1402. IEEE (2003)
Google Scholar
**ng, Y., Li, R., Cheng, L., Wu, Z.: Research on curved Chinese document correction based on deep neural network. In: International Symposium on Computational Intelligence and Design, vol. 2, pp. 342–345. IEEE (2018)
Google Scholar
You, S., Matsushita, Y., Sinha, S., Bou, Y., Ikeuchi, K.: Multiview rectification of folded documents. IEEE Trans. Pattern Anal. Mach. Intell. 40(2), 505–511 (2017)
Article Google Scholar
Zhang, L., Zhang, Y., Tan, C.: An improved physically-based method for geometric restoration of distorted document images. IEEE Trans. Pattern Anal. Mach. Intell. 30(4), 728–734 (2008)
Article Google Scholar

Download references

Acknowledgements

This work has been supported by National Natural Science Foundation of China (NSFC) Grants 61733007, 61573355 and 61721004.

Author information

Authors and Affiliations

School of Artificial Intelligence, University of Chinese Academy of Sciences, Bei**g, 100049, People’s Republic of China
Guo-Wang **e, Xu-Yao Zhang & Cheng-Lin Liu
National Laboratory of Pattern Recognition, Institute of Automation of Chinese Academy of Sciences, 95 Zhongguancun East Road, Bei**g, 100190, People’s Republic of China
Guo-Wang **e, Fei Yin, Xu-Yao Zhang & Cheng-Lin Liu
CAS Center for Excellence of Brain Science and Intelligence Technology, Bei**g, People’s Republic of China
Cheng-Lin Liu

Authors

Guo-Wang **e
View author publications
You can also search for this author in PubMed Google Scholar
Fei Yin
View author publications
You can also search for this author in PubMed Google Scholar
Xu-Yao Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Cheng-Lin Liu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Cheng-Lin Liu .

Editor information

Editors and Affiliations

Huazhong University of Science and Technology, Wuhan, China
**%20Document%20Image%20by%20Displacement%20Flow%20Estimation%20with%20Fully%20Convolutional%20Network&author=Guo-Wang%20** Document Image by Displacement Flow Estimation with Fully Convolutional Network. In: Bai, X., Karatzas, D., Lopresti, D. (eds) Document Analysis Systems. DAS 2020. Lecture Notes in Computer Science(), vol 12116. Springer, Cham. https://doi.org/10.1007/978-3-030-57058-3_10
Download citation
- DOI: https://doi.org/10.1007/978-3-030-57058-3_10
- Published: 14 August 2020
- Publisher Name: Springer, Cham
- Print ISBN: 978-3-030-57057-6
- Online ISBN: 978-3-030-57058-3
- eBook Packages: Computer ScienceComputer Science (R0)

Dewar** Document Image by Displacement Flow Estimation with Fully Convolutional Network

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Dewar** of document images: A semi-CNN based approach

Restoring camera-captured distorted document images

Document Dewar** with Control Points

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Download citation

Publish with us

Societies and partnerships

Subscribe and save

Buy Now

Navigation

Dewar** Document Image by Displacement Flow Estimation with Fully Convolutional Network

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Dewar** of document images: A semi-CNN based approach

Restoring camera-captured distorted document images

Document Dewar** with Control Points

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Download citation

Share this paper

Publish with us

Societies and partnerships

Search

Navigation