A pose-based feature fusion and classification framework for the early prediction of cerebral palsy in infants

Abstract

The early diagnosis of cerebral palsy is an area which has recently seen significant multi-disciplinary research. Diagnostic tools such as the General Movements Assessment (GMA), have produced some very promising results. However, the prospect of automating these processes may improve accessibility of the assessment and also enhance the understanding of movement development of infants. Previous works have established the viability of using pose-based features extracted from RGB video sequences to undertake classification of infant body movements based upon the GMA. In this paper, we propose a series of new and improved features, and a feature fusion pipeline for this classification task. We also introduce the RVI-38 dataset, a series of videos captured as part of routine clinical care. By utilising this challenging dataset we establish the robustness of several motion features for classification, subsequently informing the design of our proposed feature fusion framework based upon the GMA. We evaluate our proposed framework’s classification performance using both the RVI-38 dataset and the publicly available MINI-RGBD dataset. We also implement several other methods from the literature for direct comparison using these two independent datasets. Our experimental results and feature analysis show that our proposed pose-based method performs well across both datasets. The proposed features afford us the opportunity to include finer detail than previous methods, and further model GMA specific body movements. These new features also allow us to take advantage of additional body-part specific information as a means of improving the overall classification performance, whilst retaining GMA relevant, interpretable, and shareable features.

Publication
IEEE Transactions on Neural Systems and Rehabilitation Engineering
Pengpeng Hu
Pengpeng Hu
Senior Lecturer (Associate Professor)

Pengpeng Hu is currently a Senior Lecturer (Associate Professor) with The University of Manchester. His research interests include biometrics, geometric deep learning, 3D human body reconstruction, point cloud processing, and vision-based measurement. He serves as an Associate Editor for IEEE Transactions on Neural Networks and Learning Systems, IEEE Transactions on Automation Science and Engineering, and Engineering and Mathematics in Medical and Life Sciences, as well as an Academic Editor for PLOS ONE and a member of the editorial board for Scientific Reports. He is also the Programme Chair for the 25th UK Workshop on Computational Intelligence (UKCI 2026) and an Area Chair for the 35th British Machine Vision Conference (BMVC 2024). He is the recipient of the Emerald Literati Award for an outstanding paper in 2019.