Self-similarity Analysis for Motion Capture Cleaning
Abstract
Motion capture sequences may contain erroneous data, especially when the motion is complex or when performers interact closely and occlusions are frequent. Common practice is to have specialists visually detect the abnormalities and fix them manually. In this paper, we present a method to automatically analyze and fix motion capture sequences using self-similarity analysis. The premise of this work is that human motion data has a high degree of self-similarity: given enough motion data, erroneous motions stand out as distinct when compared to other motions. We utilize motion-words, short sequences of transformations of groups of joints around a given motion frame. We search for the K-nearest-neighbors (KNN) set of each word using dynamic time warping and use it to detect and fix erroneous motions automatically. We demonstrate the effectiveness of our method on various examples, and evaluate it by comparing against alternative methods and manual cleaning.
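The two core operations outlined above, extracting short motion-words and finding each word's K nearest neighbors under dynamic time warping, can be sketched in a few lines. The following is a minimal illustration, not the authors' implementation: the window length, feature layout, per-frame distance, and K are placeholder choices.

```python
# Minimal sketch (assumed setup, not the paper's code): brute-force KNN search
# over "motion-words" using a dynamic-time-warping (DTW) distance.
# A motion-word is taken here to be a short window of frames, each frame a flat
# vector of joint transformations; the sizes below are illustrative only.
import numpy as np

def dtw_distance(a, b):
    """DTW cost between two motion-words a, b of shape (frames, features)."""
    n, m = len(a), len(b)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = np.linalg.norm(a[i - 1] - b[j - 1])  # frame-to-frame distance
            cost[i, j] = d + min(cost[i - 1, j],      # insertion
                                 cost[i, j - 1],      # deletion
                                 cost[i - 1, j - 1])  # match
    return cost[n, m]

def knn_motion_words(query, words, k=5):
    """Indices of the k motion-words closest to `query` under DTW."""
    dists = np.array([dtw_distance(query, w) for w in words])
    return np.argsort(dists)[:k]

# Toy usage: 100 random motion-words of 8 frames x 30 features each.
# A word whose distance to its neighbors is far above typical neighbor
# distances can be flagged as a candidate error and repaired from its neighbors.
rng = np.random.default_rng(0)
words = [rng.normal(size=(8, 30)) for _ in range(100)]
print(knn_motion_words(words[0], words[1:], k=5))
```

A brute-force search like this scales quadratically with the number of motion-words; for large databases one would prune candidates first (e.g. with a coarse pose descriptor) before running DTW on the survivors.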
Supporting Information
Filename | Size | Description
---|---|---
cgf13362-sup-0002-S1.zip | 140.3 MB | Supporting Information