First name
Jinbo
Last name
Chen

Title

A maximum likelihood approach to electronic health record phenotyping using positive and unlabeled patients.

Year of Publication

2019

Date Published

2019 Nov 13

ISSN Number

1527-974X

Abstract

<p><strong>OBJECTIVE: </strong>Phenotyping patients using electronic health record (EHR) data conventionally requires labeled cases and controls. Assigning labels requires manual medical chart review and therefore is labor intensive. For some phenotypes, identifying gold-standard controls is prohibitive. We developed an accurate EHR phenotyping approach that does not require labeled controls.</p>

<p><strong>MATERIALS AND METHODS: </strong>Our framework relies on a random subset of cases, which can be specified using an anchor variable that has excellent positive predictive value and sensitivity independent of predictors. We proposed a maximum likelihood approach that efficiently leverages data from the specified cases and unlabeled patients to develop logistic regression phenotyping models, and compare model performance with existing algorithms.</p>

<p><strong>RESULTS: </strong>Our method outperformed the existing algorithms on predictive accuracy in Monte Carlo simulation studies, application to identify hypertension patients with hypokalemia requiring oral supplementation using a simulated anchor, and application to identify primary aldosteronism patients using real-world cases and anchor variables. Our method additionally generated consistent estimates of 2 important parameters, phenotype prevalence and the proportion of true cases that are labeled.</p>

<p><strong>DISCUSSION: </strong>Upon identification of an anchor variable that is scalable and transferable to different practices, our approach should facilitate development of scalable, transferable, and practice-specific phenotyping models.</p>

<p><strong>CONCLUSIONS: </strong>Our proposed approach enables accurate semiautomated EHR phenotyping with minimal manual labeling and therefore should greatly facilitate EHR clinical decision support and research.</p>

DOI

10.1093/jamia/ocz170

Alternate Title

J Am Med Inform Assoc

PMID

31722396

Title

Germline genetic variation and treatment response on CCG-1891.

Year of Publication

2012

Number of Pages

695-700

Date Published

2012 May

ISSN Number

1545-5017

Abstract

<p><strong>BACKGROUND: </strong>Recent studies suggest that polymorphisms in genes encoding enzymes involved in drug detoxification and metabolism may influence disease outcome in pediatric acute lymphoblastic leukemia (ALL). We sought to extend current knowledge by using standard and novel statistical methodology to examine polymorphic variants of genes and relapse risk, toxicity, and drug dose delivery in standard risk ALL.</p>

<p><strong>PROCEDURE: </strong>We genotyped and abstracted chemotherapy drug dose data from treatment roadmaps on 557 patients on the Children's Cancer Group ALL study, CCG-1891. Fourteen common polymorphisms in genes involved in folate metabolism and/or phase I and II drug detoxification were evaluated individually and clique-finding methodology was employed for detection of significant gene-gene interactions.</p>

<p><strong>RESULTS: </strong>After controlling for known risk factors, polymorphisms in four genes: GSTP1*B (HR = 1.94, P = 0.047), MTHFR (HR = 1.61, P = 0.034), MTRR (HR = 1.95, P = 0.01), and TS (3R/4R, HR = 3.69, P = 0.007) were found to significantly increase relapse risk. One gene-gene pair, MTRR A/G and GSTM1 null genotype, significantly increased the risk of relapse after correction for multiple comparisons (P = 0.012). Multiple polymorphisms were associated with various toxicities and there was no significant difference in dose of chemotherapy delivered by genotypes.</p>

<p><strong>CONCLUSIONS: </strong>These data suggest that various polymorphisms play a role in relapse risk and toxicity during childhood ALL therapy and that genotype does not play a role in adjustment of drug dose administered. Additionally, gene-gene interactions may increase the risk of relapse in childhood ALL and the clique method may have utility in further exploring these interactions. childhood ALL therapy.</p>

DOI

10.1002/pbc.23192

Alternate Title

Pediatr Blood Cancer

PMID

21618417

WATCH THIS PAGE

Subscription is not available for this page.