Vanderbilt DSI and Women’s Basketball team collaborate using data science to improve performance while reducing risk of injury

Womens Basketball. (Photo by Joe Howell)

The Vanderbilt University Data Science Institute continued its project with the Vanderbilt Women’s Basketball strength and conditioning program to use data to improve athletic performance while reducing the risk of sport-associated injury. This semester, the focus was on making better use of training and competition data through improved analytics efforts, data consolidation, the creation of predictive models, and improved data visualization.

The data science team working on the project this semester included Umang Chaudhry, Grace Ko, Minwoo Sohn, Nikkhil Niranjan, and Sovann Chang. The team also continued to work on the project with Tyler Clarke from Vanderbilt Athletics.

The project utilizes data from various sources, including Catapult GPS technology, Vald: Force Decks dual force plate technology, Vald: NordBord hamstring testing systems, and Fusionetics assessment protocols. Last semester, the team worked on understanding the data collected by the team, getting the APIs to work, and beginning work on creating a dynamically changing and de-identified dashboard.

This semester, the team worked on challenges with the APIs themselves as they incorporated them into the dashboard, pulling the necessary data, and detecting anomalies. Anomaly detection was a key area of focus this semester, with the goal of finding any indications of injury in each athlete’s data and alerting staff if similar signals are found to prevent injury.

The team utilized Variational AutoEncoders (VAE) for anomaly detection, which detects anomalies of each metric of each player. Over the summer, they will decide which anomalies will be considered by each metric and finish up the code. A report of possible signals of injury will also be created.

Additionally, the team worked on pulling more recent data and modifying the daily pull function to make it more seamless. The dashboard took a while to load because it was pulling in data that was months old, but now it pulls in data that is more recent and useful for training purposes. However, the team faced a dilemma with dashboard hosting, as their data is pulled from various sources and APIs, and most of their packages are sourced from Github/CRAN, which are compatible with Shinyapps.io. They experienced some difficulties with pulling in Catapult data into the dashboard because it was on Bitbucket and not Github, which made their dashboard incomplete and not fully comprehensive. Therefore, the dashboard is currently hosted locally.

The next steps for the project include optimizing, enhancing, and deploying the dashboard, implementing anomaly detection across athletes and metrics, and incorporating it into the dashboard. The team also plans to create predictive models for ForceDecks, Nordbord, and Fusionetics metrics using Catapult metrics.

Another semester in the books on this project, and our DS Team continues to make significant progress towards achieving its goals. With the incorporation of anomaly detection and the refinement of the dashboard, the program is well on its way to providing coaches with a comprehensive tool that allows them to analyze athletic health and performance at a single glance to propel their teams forward.