Commit cb059b1e authored by Chia Ying Chiu's avatar Chia Ying Chiu
Browse files


parent 8b7b3570
......@@ -9,6 +9,9 @@ Method
Data are from MIMIC-III, Medical Information Mart for Intensive Care, database, compromising health information of each encounter at the critical care units of a large tertiary care hospital (Johnson et al, 2016). For this study, data of diagnostic codes and clinical notes are included.
In order to simulate coder’s work in hospitals, our goal is to construct a model that predicts ICD-10 codes based on the given free-form texts. In our model, we first apply basic preprocessing methods via NLTK [8], and then build a neural network model for learning the features from input texts. The preprocessing procedure includes spell checking, converting into lower cases, stop words removal, tokenization, and removing infrequent
words. The preprocessed data are then split to training and validation set by Scikit-Learn library. (
Supports Markdown
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment