Template Assembly for Detailed Urban Reconstruction
Abstract
We propose a new framework to reconstruct building details by automatically assembling 3D templates on coarse textured building models. In a preprocessing step, we generate an initial coarse model to approximate a point cloud computed using Structure from Motion and Multi-View Stereo, and we model a set of 3D templates of facade details. Next, we optimize the initial coarse model to enforce consistency between geometry and appearance (texture images). Then, building details are reconstructed by assembling templates on the textured faces of the coarse model. The 3D templates are automatically selected and positioned by our optimization-based template assembly algorithm, which balances image matching and structural regularity. In the results, we demonstrate on various data sets how our framework can enrich the details of coarse models.
Supporting Information
Filename | Size | Description |
---|---|---|
cgf12554-sup-0001-S1.pdf | 558.4 KB | Supporting Information |
cgf12554-sup-0002-S1.pdf | 1.1 MB | Supporting Information |
cgf12554-sup-0003-S1.mov | 56.9 MB | Supporting Information |