Your privacy, your choice

We use essential cookies to make sure the site can function. We also use optional cookies for advertising, personalisation of content, usage analysis, and social media.

By accepting optional cookies, you consent to the processing of your personal data - including transfers to third parties. Some third parties are outside of the European Economic Area, with varying standards of data protection.

See our privacy policy for more information on the use of your personal data.

for further information and to change your choices.

Skip to main content

Table 1 Algorithm description

From: Natural language processing to identify lupus nephritis phenotype in electronic health records

Algorithm name

Classification model

Description

Baseline algorithm

Rule-based

A patient is confirmed to have lupus nephritis if he/she has proteinuria > 0.5 mg in laboratory test or has ICD 9/10 diagnosis code for lupus nephritis.

Full MetaMap model (binary)

L2-regularized logistic regression

Features are the non-negative mention of MetaMap CUIs. We treated CUIs as binary variables and fitted L2-regularized logistic regression to predict lupus nephritis.

Full MetaMap model (count)

L2-regularized logistic regression

The same as the full MetaMap model (binary) except that MetaMap CUIs are treated as numeric variables representing the count of instances each concept is mentioned in the clinical text.

MetaMap mixed model

L2-regularized logistic regression

There are 13 features in this model including 7 CUI features, 5 RegEx concepts, and 1 feature from structured data.