Faculty, Staff and Student Publications
Language
English
Publication Date
1-1-2024
Journal
AMIA 2026 Annual Symposium
PMID
41726486
PMCID
PMC12919532
PubMedCentral® Posted Date
2-14-2026
PubMedCentral® Full Text Version
Post-print
Abstract
Provenance tracking ensures data integrity, security, and accountability in healthcare and biomedical research. As biomedical data grows in complexity, comprehensive tracking mechanisms are needed to maintain reproducibility, transparency, and compliance with regulatory standards, such as GDPR. Traditional log-based and ontology-based approaches capture and standardize data lineage, while cryptographic and blockchain-based methods enhance security and verifiability. However, challenges remain in scalability, security, and usability. To address these, we introduce the Resource-Provenance Visualization Engine (RPVE), an advanced system integrating data lineage tracking and interactive visualization. RPVE employs the Randomized N-gram Hashing Identifier (NHash ID) to establish precise data links within the BRAIN Initiative Cell Atlas Network (BICAN) and features an interactive Sankey visualization engine for seamless data exploration. The system enhances provenance tracking by improving data retrieval efficiency, ensuring reliable verification processes, and maintaining data integrity.
Keywords
Humans, Computer Security, Information Storage and Retrieval, Databases, Factual, Brain, Database Management Systems
Published Open-Access
yes
Recommended Citation
Li, Xiaojin; Huang, Yan; Ng, Lydia; et al., "Relational Database-Based Resource-Provenance Visualization Engine: With an Application to BICAN Data" (2024). Faculty, Staff and Student Publications. 3747.
https://digitalcommons.library.tmc.edu/uthmed_docs/3747