Faculty, Staff and Student Publications

Language

English

Publication Date

1-1-2025

Journal

Computers in Biology and Medicine

DOI

10.1016/j.compbiomed.2024.109460

PMID

39615234

Abstract

Objective: This paper aims to introduce and assess KeyGAN, a generative modeling-based keystroke data synthesizer. The synthesizer is designed to generate realistic synthetic keystroke data capturing the nuances of fine motor control and cognitive processes that govern finger-keyboard kinematics, thereby paving the way to support biomarker development for psychomotor impairment due to neurodegeneration.

Methods: KeyGAN is designed with two primary objectives: (i) to ensure high realism in the synthetic distributions of the keystroke features and (ii) to analyze its ability to replicate the subtleties of natural typing for enhancing biomarker development. The quality of synthetic keystroke data produced by KeyGAN is evaluated against two keystroke-based applications, TypeNet and nQiMechPD, employed as'referee' controls. The performance of KeyGAN is compared with a reference random Gaussian generator, testing its ability to fool the biometric authentication method TypeNet, and its ability to characterize fine motor impairment in Parkinson's Disease using nQiMechPD.

Results: KeyGAN outperformed the reference comparator in fooling the biometric authentication method TypeNet. It also exhibited a superior approximation to real data than the reference comparator when using nQiMechPD, showcasing its adaptability and versatility in mimicking early signs of Parkinson's Disease in natural typing. KeyGAN's synthetic data demonstrated that almost 20% of real PD samples could be replaced in the training set without a decline in classification performance on the real test set. Low Fréchet Distance (< 0.03) and Kullback-Leibler Divergence (< 700) between KeyGAN outputs and real data distributions underline the high performance of KeyGAN.

Conclusion: KeyGAN presents strong potential as a realistic keystroke data synthesizer, displaying impressive capability to reproduce complex typing patterns relevant to biomarkers for neurological disorders, like Parkinson's Disease. The ability of its synthetic data to effectively supplement real data for training algorithms without affecting performance implies significant promise for advancing research in digital biomarkers for neurodegenerative and psychomotor disorders.

Keywords

Humans, Parkinson Disease, Phenotype, Fingers, Algorithms, GAN, Keystroke dynamics, Parkinson’s disease, Synthetic data

Published Open-Access

yes

Share

COinS
 
 

To view the content in your browser, please download Adobe Reader or, alternately,
you may Download the file to your hard drive.

NOTE: The latest versions of Adobe Reader do not support viewing PDF files within Firefox on Mac OS and if you are using a modern (Intel) Mac, there is no official plugin for viewing PDF files within the browser window.