International Journal on Science and Technology

E-ISSN: 2229-7677     Impact Factor: 9.88

A Widely Indexed Open Access Peer Reviewed Multidisciplinary Bi-monthly Scholarly International Journal

Call for Paper Volume 16 Issue 4 October-December 2025 Submit your research before last 3 days of December to publish your research paper in the issue of October-December.

Machine Learning-Enhanced Data Quality Validation for Electronic Health Record Integration

Author(s) Arjun Warrier
Country United States
Abstract The harmonization and interoperability of electronic health records (EHRs) are critical for enabling comprehensive, patient-centred care. Data quality issues, including incompleteness, inconsistency, duplication, and semantic inaccuracies, significantly compromise the clinical utility of EHR systems. Nevertheless, the clinical utility of EHR systems. Conventional rule-based validation methods have limited flexibility and scalability, especially with the increasing prevalence of complex and massive healthcare data sources. To consolidate the EHRs from independent health information systems, this paper presents a machine learning (ML) empowered data quality validation framework for automating quality control, enriching data authenticity, and mitigating integration flaws.
The study presents a layered architecture that utilizes machine learning (ML) algorithms to detect anomalies in the data, infer missing values, and provide automated quality scores. The model leverages supervised and unsupervised algorithms trained on historical EHR integration failures and common data integrity violations. Algorithms such as random forests, support vector machines, and autoencoders are applied to identify structural and semantic errors in semistructured data (e.g., patient demographics, medication lists, and laboratory results). Additionally, NLP models are utilized to review unstructured notes and discharge summaries, ensuring that the context remains consistent.
Keywords Electronic Health Records (EHR), Data Quality Assurance, Machine Learning, Healthcare Interoperability, Data Validation Algorithms, Automated Quality Scoring, AI in Health Informatics, Clinical Data Integration, Semantic Consistency, Data Anomaly Detection.
Field Engineering
Published In Volume 11, Issue 1, January-March 2020
Published On 2020-02-02
DOI https://doi.org/10.71097/IJSAT.v11.i1.8387
Short DOI https://doi.org/g95m8s

Share this