International Journal on Science and Technology
E-ISSN: 2229-7677
•
Impact Factor: 9.88
A Widely Indexed Open Access Peer Reviewed Multidisciplinary Bi-monthly Scholarly International Journal
Plagiarism is checked by the leading plagiarism checker
Call for Paper
Volume 16 Issue 4
October-December 2025
Indexing Partners
Machine Learning-Enhanced Data Quality Validation for Electronic Health Record Integration
| Author(s) | Arjun Warrier |
|---|---|
| Country | United States |
| Abstract | The harmonization and interoperability of electronic health records (EHRs) are critical for enabling comprehensive, patient-centred care. Data quality issues, including incompleteness, inconsistency, duplication, and semantic inaccuracies, significantly compromise the clinical utility of EHR systems. Nevertheless, the clinical utility of EHR systems. Conventional rule-based validation methods have limited flexibility and scalability, especially with the increasing prevalence of complex and massive healthcare data sources. To consolidate the EHRs from independent health information systems, this paper presents a machine learning (ML) empowered data quality validation framework for automating quality control, enriching data authenticity, and mitigating integration flaws. The study presents a layered architecture that utilizes machine learning (ML) algorithms to detect anomalies in the data, infer missing values, and provide automated quality scores. The model leverages supervised and unsupervised algorithms trained on historical EHR integration failures and common data integrity violations. Algorithms such as random forests, support vector machines, and autoencoders are applied to identify structural and semantic errors in semistructured data (e.g., patient demographics, medication lists, and laboratory results). Additionally, NLP models are utilized to review unstructured notes and discharge summaries, ensuring that the context remains consistent. |
| Keywords | Electronic Health Records (EHR), Data Quality Assurance, Machine Learning, Healthcare Interoperability, Data Validation Algorithms, Automated Quality Scoring, AI in Health Informatics, Clinical Data Integration, Semantic Consistency, Data Anomaly Detection. |
| Field | Engineering |
| Published In | Volume 11, Issue 1, January-March 2020 |
| Published On | 2020-02-02 |
| DOI | https://doi.org/10.71097/IJSAT.v11.i1.8387 |
| Short DOI | https://doi.org/g95m8s |
Share this

CrossRef DOI is assigned to each research paper published in our journal.
IJSAT DOI prefix is
10.71097/IJSAT
Downloads
All research papers published on this website are licensed under Creative Commons Attribution-ShareAlike 4.0 International License, and all rights belong to their respective authors/researchers.