Safe Medicine Recommendation via Medical Knowledge Graph Embedding

Most of the existing medicine recommendation systems that are mainly based on electronic medical records (EMRs) are significantly assisting doctors to make better clinical decisions benefiting both patients and caregivers. Even though the growth of EMRs is at a lighting fast speed in the era of big data, content limitations in EMRs restrain the existed recommendation systems to reflect relevant medical facts, such as drug-drug interactions. Many medical knowledge graphs that contain drug-related information, such as DrugBank, may give hope for the recommendation systems. However, the direct use of these knowledge graphs in the systems suffers from robustness caused by the incompleteness of the graphs. To address these challenges, we stand on recent advances in graph embedding learning techniques and propose a novel framework, called Safe Medicine Recommendation (SMR), in this paper. Specifically, SMR first constructs a high-quality heterogeneous graph by bridging EMRs (MIMIC-III) and medical knowledge graphs (ICD-9 ontology and DrugBank). Then, SMR jointly embeds diseases, medicines, patients, and their corresponding relations into a shared lower dimensional space. Finally, SMR uses the embeddings to decompose the medicine recommendation into a link prediction process while considering the patient's diagnoses and adverse drug reactions. To our best knowledge, SMR is the first to learn embeddings of a patient-disease-medicine graph for medicine recommendation in the world. Extensive experiments on real datasets are conducted to evaluate the effectiveness of proposed framework.



page 1

page 2

page 3

page 4


Predicting Rich Drug-Drug Interactions via Biomedical Knowledge Graphs and Text Jointly Embedding

Minimizing adverse reactions caused by drug-drug interactions has always...

MedSim: A Novel Semantic Similarity Measure in Bio-medical Knowledge Graphs

We present MedSim, a novel semantic SIMilarity method based on public we...

Drug Package Recommendation via Interaction-aware Graph Induction

Recent years have witnessed the rapid accumulation of massive electronic...

Conditional Generation Net for Medication Recommendation

Medication recommendation targets to provide a proper set of medicines a...

RecoMed: A Knowledge-Aware Recommender System for Hypertension Medications

Background and Objective High medicine diversity has always been a signi...

PDD Graph: Bridging Electronic Medical Records and Biomedical Knowledge Graphs via Entity Linking

Electronic medical records contain multi-format electronic medical data ...

PREMIER: Personalized REcommendation for Medical prescrIptions from Electronic Records

The broad adoption of Electronic Health Records (EHR) has led to vast am...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.

1 Introduction

Over the last few years, medicine recommendation systems have been developed to assist doctors in making accurate medicine prescriptions. On the one hand, many researchers [Chen et al.2016, Almirall et al.2012]

adopt rule-based protocols that are defined by the clinical guidelines and the experienced doctors. Constructing, curating, and maintaining these protocols are time-consuming and labor-intensive. Rule-based protocols might be effective for a general medicine recommendation for a specific diagnosis, but give little help to tailored recommendations for complicated patients. On the other hand, supervised learning algorithms and variations, such as Multi-Instance Multi-label (MIML) learning

[Zhang et al.2017], have been proposed to recommend medicines for patients. Both input features and ground-truth information that are extracted from massive EMRs are trained to obtain a predictive model that outputs multiple labels of the new testing data as medicine recommendations. It is a fact that therapies and treatments in clinical practices are rapidly updated. Unfortunately, supervised learning methods cannot deal with those medicines that are not included in the training phase. Incomplete training data set will be a detriment to the recommendation system performance.

It is reported in [Panagioti et al.2015, Juurlink et al.2003] that patients with two or more diseases, acute or chronic, often take five or more different medicines simultaneously and have immense health risks. Studies [Edwards and Aronson2000, Leape et al.1995] have shown that 3-5% of all in-hospital misused prescriptions blame to ignorances of adverse drug reactions, which is difficult to prohibit even for the highly trained and experienced clinicians. With the assistance from the conventional medicine recommendation systems, clinicians still need to cautiously rule out those recommendations that have potential adverse effects caused by drug-drug interactions. Most of the existing works have largely ignored the exploit of medical facts in medicines, such as drug-drug interactions, which is crucial in medicine recommendation system. One possible reason might because there is little medical expert knowledge in EMRs. Content limitations in EMRs constrain the systems to barely associate accurate medical facts with the recommended prescriptions, which makes the final recommendation less trustworthy for the complicated patients.

With the increasing emergence of knowledge graphs, many world-leading researchers have successfully extracted information from huge volumes of medical databases to build up giant heterogeneous graphs that reflect medical facts of medicines and diseases. For instance, DrugBank [Law et al.2014] is a rich source of medicine information. It contains extensive entities (drugs, drug targets, chemistry, etc.) and relationships (enzymatic pathways, drug-drug interactions, etc.). ICD-9 ontology [Schriml et al.2011]

represents a knowledge base of human diseases and can be used to classify diagnoses of patients. Harnessing well-built medical knowledge graphs in EMRs-based medicine recommendation system might enable the reinvented system to provide appropriate prescriptions for special patients, as well as alerts of possible side effects and serious drug-drug interactions (DDIs).

Figure 1: Left part is a medical knowledge graph with missing relationships. Right part is prescription records in EMRs, and each edge indicates that a patient takes a medicine. There is no relationship between medicines in EMRs.

As shown in Figure1, linking EMRs and medical knowledge graphs to generate a large and high-quality heterogeneous graph is a promising pathway for medicine recommendations in a wider scope, but never easy. Specifically, the newly designed system confronts with the following challenges: 1. Computational Efficiency. Querying specialized medical entities and relationships based on conventional graph-based algorithms have limitations in portability and scalability. The computational complexity becomes unfeasible when the heterogeneous graph reaches a very large scale. 2. Data Incompleteness. The medical knowledge graphs also follow the long-tail distribution as same as other types of large-scale knowledge bases. Data incompleteness is another serious problem existing among entities and relationships in such a distribution. For example, since the DDIs is not usually identified in the clinical trial phase, there is a lack of significant DDIs in DrugBank which cannot support a comprehensive precaution to the medication. Last but not least, medicine recommendation suffers from 3. Cold Start. As conventional systems normally recommend medicines based on the historical records, the pace of recommendation changes cannot keep up with the frequent updates of new therapies and treatments in medical practices. Little information on adverse reactions to the newly updated medicines in historical EMRs or even in well-built knowledge graphs makes the evidence-based recommendation model hardly support the new medicines as updated recommendations.

Taking all the challenges above into account, we propose a novel medicine recommendation framework based on graph embedding techniques, inspired by the idea of link prediction. We name our framework as Safe Medicine Recommendation (SMR) throughout this paper. The recommendation process mainly includes:

  1. A large heterogeneous graph is constructed from EMRs and medical knowledge graphs, where the nodes are entities (medicines, diseases, patients), and the edges (or links) represent various relations between entities, such as drug-drug interactions.

  2. The different parts of the generated heterogeneous graph (patient-medicine bipartite graph, patient-disease bipartite graph, medicine knowledge graph, disease knowledge graph) are embedded into a shared low-dimension space based on graph-based embedding models. Afterward, a joint learning algorithm is proposed to optimize the integrated graph simultaneously.

  3. Based on the learned embeddings, a new patient, represented by the vectors of his/her diagnoses, is modeled as an entity in the disease-patient graph. Recommending medicines for the patient is translated to predict links from the patient to medicines.

The primary contributions of this work are summarized as follows:

  1. We have developed graph-based embedding models to learn the effective representations of patients, diseases, and medicines in a shared low-dimension space. The representation of medicines enables the proposed framework can even effectively recommend newly emerged medicines for patients, which distinguishes most of the existing works.

  2. To recommend safe medicines for new patients, we propose a novel method for modeling a patient based on the learned graph embeddings and make a safe recommendation by minimizing the potential adverse drug reactions.

  3. We have conducted extensive experiments on large real-world datasets (MIMIC-III, DrugBank, and ICD-9 ontology) to evaluate the effectiveness of our framework. The experimental results have shown that the proposed framework outperforms all the compared methods.

  4. To our best knowledge, we firstly propose a framework to conduct Safe Medicine Recommendation (SMR) and formulate it as a link prediction problem. The implementation generates a high-quality heterogeneous graph in which relationships among patients, diseases, and medicines can be unveiled in a wider scope.

The remainder of this paper is organized as follows: Section 2 details our proposed framework SMR. Section 3 reports the experimental results and Section 4 reviews related work. Section 5 presents the conclusions and future work.

2 The Proposed Framework

In this section, we will first describe the notations and formulate medicine recommendation problem, and then present graph embedding models and how to use learned embeddings to recommend safe medicines for patients.

Figure 2: Overview of our framework. Patient-disease and patient-medicine graphs are all bipartite graphs, while disease and medicine graphs are general graphs. Patient, disease, and medicine are encoded into a low dimensional metric by graph-based methods. Diverse factors can be connected through patients.

2.1 Problem Formulation

Before we focus on the medicine recommendation problem, we first briefly introduce the important notations employed in the remainder of this paper. Table 1 also summarizes them.

Variable Interpretation
, the set of entities and relations
a triple in knowledge graph
both are dimensional vector embeddings
a dimensional relation embedding
a dimensional projection matrix
a patient, medicine, disease
, ,
a dimensional vector of a patient,
a medicine or a disease
and dimensional latent space of
entities and relations
Table 1: Notations.

The medical knowledge graph describes the medical entities collected from the integrated sources, as well as relationships among these entities. For instance, a triple (glucocorticoid, adverse interaction, aspirin) indicates that there is a relationship adverse interaction from glucocorticoid to aspirin in DrugBank. We define the medical knowledge graph as follow.

Definition 1 (Medical Knowledge Graph)

The medical knowledge graph is a set of triples in the form , where is a set of entities, is a set of relations, and .

To capture the co-relationships of patients, diseases, and medicines in EMRs, we define the patient-disease, patient-medicine bipartite graphs as follow.

Definition 2 (Patient-Medicine Bipartite Graph)

The patient-medicine bipartite graph is denoted as , where is a set of patients and is a set of medicines. is the set of edges. If a patient takes a medicine , there will be an edge between them, otherwise none. The weight of the edge between patient and medicine is defined as the total times of patient takes the medicine .

Definition 3 (Patient-Disease Bipartite Graph)

The patient-disease bipartite graph is denoted as , where is a set of patients and is a set of diseases. is the set of edges. If a patient is diagnosed with a disease , there will be an edge between them, otherwise none. The weight is set to 1 when the edge exists.

Figure 2 illustrates a heterogeneous graph by constructing patient-disease, patient-medicine bipartite graphs from MIMIC-III, and linking them to medical knowledge graphs, ICD-9 ontology, and DrugBank. Finally, we formally define the safe medicine recommendation problem as follows.

Problem 1 (Safe Medicine Recommendation)

Given a patient and his/her diagnoses dataset , recommending safe medicines for each is predicting edges from to medicines dataset . The output is a set of medicines with minimum drug-drug interactions.

2.2 Model Description and Optimization

In this section, we propose embedding learning approaches to encode the heterogeneous graph in the latent space and its optimization method.

Medical Knowledge Graph Embedding

A medical knowledge graph is a multi-relational graph, in which entities and relations can be different types. For a triple, , we use bold letters h, r, t to denote the corresponding embedding representations of . Plenty of graph embedding methods has been proposed to encode a multi-relational graphs into a continuous vector space. Translation-based models [Bordes et al.2013, Wang et al.2014, Lin et al.2015] regard the relation in each as a translation from to within the low dimensional space, i.e., , and perform much more effectively and efficiently than conventional models. TransR [Lin et al.2015] is a state-of-the-art translation-based embedding approach. It represents entities and relations in distinct vector space bridged by relation-specific matrices to get better graph representations.

Consider the above reason, we set entities embeddings and relations embeddings . And we set a projection matrix , which projects entities from entity space to relation space. We define the translations between entities and get the energy function as:


where is a bias constant.

Then, the conditional probability of a triple

is defined as follows:


and , can be defined in the analogous manner. We define the likelihood of observing a triple as:


We define an objective function by maximizing the conditional likelihoods of existing triples in :


Based on Eq.(4), the objective functions of medicine and disease knowledge graph , can be defined respectively:


Bipartite Graph Embedding

Different from the medical knowledge graph, the patient-disease, patient-medicine are bipartite graphs. A bipartite graph has only one single type of relations, i.e., it is a homogeneous graph. For a homogeneous graph, LINE [Tang et al.2015] model achieves the state-of-the-art performance of encoding the entities into a continuous vector space while preserving co-relations information of the graph. Hence, we follow LINE and set patients, medicines, and diseases embeddings . We present the process of encoding patient-medicine bipartite graph as follow.

Given a patient-medicine bipartite graph . We first define the conditional probability of that a patient in set takes medicine in set as follow:


where , is the embedding vector of the patient in , and is the embedding vector of medicine in . Eq. (7) defines a conditional distribution over all medicines in . The empirical distribution is defined as , where is the weight of the edge and is the total times that the patient takes medicines. We maximize the following objective function:


where is the distance between two distributions. In this paper, we use KL-divergence to compute . As is different from patients, we use in the objective function to represent the personalization of the patient in the graph. After omitting some constants, we have:


For the patient-disease bipartite graph, we can get the object function in the analogous manner:


Optimization and Training

To learn the medical knowledge graph and bipartite graphs embeddings simultaneously, an intuitive approach is to collectively embed the four graphs (patient-medicine bipartite graph, patient-disease bipartite graph, medicine knowledge graph, disease knowledge graph) by maximizing the sum of the four logarithm likelihood objective functions just as follow:


where stands for the embeddings , of entities and relations in the heterogeneous graph we construct, is a hyper-parameter weighting the regularization factor , which is defined as follows:


where denotes the positive part of . The regularization factor will normalize the embeddings during learning. And we adopt the asynchronous stochastic gradient algorithm (ASGD) [Recht et al.2011] to maximize the transformed objective function.

Optimizing objective functions Eq. (5), Eq. (6), Eq. (9) and Eq. (10) in Eq.(11) are computationally expensive, as calculating them need to sum over the entire set of entities and relations. To address this problem, we use the negative sampling method [Mikolov et al.2013] to transform the objective functions.

For Eq.(5) and Eq.(6), we should transform , , in Eq.(3). Taking as an example, we maximize the following objective function instead of it:


where is the number of negative examples,

is the sigmoid function.

is the invalid triple set, and is a function randomly sampling instances from . When a positive triple is selected, to maximize Eq.(13),

negative triples are constructed by sampling entities from an uniform distribution over

and replacing the head of . The transformed objective of , are maximized in the same manner, but for , the negative relations are sampled from a uniform distribution over to corrupt the positive relation . We iteratively select random mini-batch from the training set to learn embeddings until converge.

For Eq. (9), we also use the negative sampling method to transform it to the following objective function:


where is the sigmoid function, is the number of negative edges. according to the empirical setting of [Mikolov et al.2013], is the total number of times that the medicine is taken by patients. we can simplify Eq.(10) and maximize it in the same way.

Finally, we can efficiently learn the embeddings of different types of parts in the heterogeneous graph.

2.3 Safe Medicine Recommendation Process

In this section, we present how to recommend safe medicines based on the learned embeddings and diagnoses of a given patient. For an existing patient , we use the learned embedding to predict new medicine recommendations. For a new given patient , we first use diseases embeddings of ’s diagnoses to represent , and then recommend safe medicines for , as shown in Figure 2,.

New Patient Model

We aim to present a new patient by his/her diagnoses embeddings. We should consider the time sequence of diseases that a patient is diagnosed, especially for the patient with multiple diseases. Assume a patient in the hospital or on medication is associated with ranked diseases according to their timestamps in an increasing order. Then, the patient embedding can be encoded as follow:


where is the -th embedding of disease .

Medicine Recommendation

Given a query patient with the query disease , i.e., , we first project disease and patient into their latent space, and then select top- safe medicines111In MIMIC-III, patients in ICUs are sicker and usually need more medicines for a diagnosis. We set =3 in this paper.. More precisely, given a query , for each medicine which could be useful for , we compute its ranking score as in Eq. (16), and then select the medicine with the top- highest ranking scores as the recommendation.


where is the representation of patient and is the -th medicines to be considered from medicines based on the the already selected medicine .

3 Experiments and Evaluation

We attempt to demonstrate the effectiveness of our recommendation method in this section, which is referred to as SMR in this paper. In particular, we expect to answer “how well does our method compare with the competing techniques?” in Section 3.2. The results show that our recommendation method significantly outperforms the three baselines. The detailed experimental settings of our evaluations are described in Section 3.1.

3.1 Experimental Settings

Data Sets

Our experiments are performed on the real EMRs datasets, MIMIC-III [Johnson et al.2016], and two medical knowledge graphs, ICD-9 ontology [Schriml et al.2011] and DrugBank [Law et al.2014]. These real datasets are publicly available in different forms.

  • MIMIC-III (Medical Information Mart for Intensive Care III) collected bedside monitor trends, electronic medical notes, laboratory test results, and waveforms from the ICUs (Intensive Care Units) of Beth Israel Deaconess Medical Center between 2001 and 2012. It contains distinct 46,520 patients, 650,987 diagnoses and 1,517,702 prescription records that associated with 6,985 distinct diseases and 4,525 medicines.

  • ICD-9 ontology222 (International classification of diseases-version 9) contains 13,000 international standard codes of diagnoses and the relationships between them.

  • DrugBank is a bioinformatics/cheminformatics resource which consists of medicine related entities. The medical knowledge graph version333 contains 8,054 medicines, 4,038 other related entities (e.g., protein or drug targets) and 21 relationships.

Entities Relations
#Disease 6,985 #Medicine-related 71,460
#Medicine 8,054 #Disease-related 170,579
#Patient 46,520 #Patient-Disease 489,923
#Medicine-related 305,642 #Patient-Medicine 975,647
Table 2: Entities and relations in the heterogeneous graph.
Prediction accuracy DDIs rate
Rule-based 0.3068 32.01%
-Most frequent 0.3473 14.08%
LEAP 0.5582 1.66%
SMR 0.6113 0.17%
Table 3: Experiments on medicine group 1.
Prediction accuracy DDIs rate
Rule-based 0.2736 27.01%
K-Most frequent NA NA
SMR 0.5214 2.01%
Table 4: Experiments on medicine group 2.

Heterogeneous Graph Construction We connect MIMIC-III, ICD-9 ontology, and DrugBank (medicine group 1) by constructing the patient-medicine bipartite graph and the patient-disease bipartite graph.

For the patient-disease bipartite graph, MIMIC-III provides ICD-9 codes for diagnoses, which implicitly the diagnoses of MIMIC-III can be linked to ICD-9 ontology by string matching. For the patient-medicine bipartite graph, the prescriptions in MIMIC-III consist of the drug information, e.g., the names, the duration, and the dosage. However, various names to a single type of medicine in MIMIC-III exist due to some noisy words (20%, 50ml, glass bottle, etc.), which becomes an obstacle to link medicine names to DrugBank when directly applying to the string matching method. We use an entity linking method [Wang et al.2017] instead to address this problem. Table 2 shows the statistic of the heterogeneous graph we construct. The heterogeneous graph will be used to learn low-dimension representations of entities and relations by the SMR framework. Afterward, we categorize the medicines in the heterogeneous graph into two groups: 1). The first group consists of all 4,525 medicines that are recorded in EMRs, and will be used as inputs of the baseline methods. 2). The second group contains 3,529 medicines that haven’t been observed in EMRs, and will be used as test data for cold start recommendation.


We compare our SMR with the following baselines:

  • Rule-based method [Almirall et al.2012] recommends medicines based on mappings from existing medicine categories to diseases in the MEDI database [Wei et al.2013]. For each disease, a drug is assigned to the patient according to the mappings.

  • -Most frequent method is a basic baseline which retrieves the top medicines that most frequently co-occur with each disease as their recommendation. We set in this paper.

  • LEAP method [Zhang et al.2017] uses a Multi-Instance Multi-Label learning framework to train a predictive model taking disease conditions as input features and yielding multiple medicine labels as recommendations.

3.2 Evaluation Methods

To guarantee medicine recommendations generated by SMR work effectively, we evaluate four indice, the prediction accuracy, the ability to avoid adverse drug-drug interactions, the experienced clinical doctor assessments, and the capacity to process cold start problem. In all experiments, the ratio of training to validation to test sets is 0.7:0.1:0.2. The hyper-parameters was adjusted by a validation set.

Prediction Accuracy and DDIs Rate

We utilize Jaccard Coefficient to compare the similarity of the prescriptions generated by SMR and the corresponding prescriptions written by doctors. Given the recommendation medicines set generated by SMR for a patient , is the medicines set prescribed by doctors in the data. The mean of Jaccard coefficient is defined as follows,


where is the number of samples in a test set. Table 4 shows the accuracy of the baselines and SMR on medicine group 1, the rule-based method performs the worst because it is the only one provides a general recommendation for a specific diagnosis and it is not able to endow personalized recommendations, especially for the patients with multiple diseases. The frequency of each medicine-disease pair remains high in ICUs. Hence, recommendations based on frequency, -Most frequent method, also work deficiently. Our method SMR outperforms LEAP by 1.49% because more accurate medical facts are involved in medical knowledge graphs rather than the prescription information in EMRs.

We extract all adverse drug-drug interactions (DDIs) from DrugBank to evaluate whether medicine recommendations embrace unsafe DDIs. Table 4 shows the percentages of different medicine recommendations consisting of adverse DDIs. The result indicates that SMR can recommend most harmless medicines for patients as its drug interaction rate is the lowest. The rule-based method and -Most frequent method select medicines by a greedy strategy only regardless of specific adverse DDIs. For the rarely used medicines and unknown DDIs in EMRs, SMR is more reliable than LEAP. The reason is that SMR can predict each patient-medicine link and compute potential hidden DDIs by the learned embeddings of medical knowledge graphs.

Cold Start

We evaluate the ability of baselines and SMR in addressing cold start medicine recommendations on the medicine group 2. -Most frequent method and LEAP are not applicable (NA) on recommending new medicines in the cold start scenario. Since our SMR process can present new medicines by the learned vector representations of used medicines, the potential patient-medicine links between cold start medicines and patients will be captured correspondingly. In other words, SMR can leverage not only the patient-medicine links in EMRs but also the medical knowledge graphs when recommending cold start medicines.

Diagnosis Methods Medicine Recommendations
Acute respiratry failure
Rule-based Teicoplanin, Metoprolol
-Most frequent Vancomycin, Furosemide, Metoprolol, Insulin
LEAP Vancomycin, Furosemide, Metoprolol Tartrate
SMR Vancomycin, Furosemide, Amlodipine, Norepinephrine, Acetaminophen
Type 2 diabetes
Rheumatoid arthritis
Rule-based Gliclazide, Phenylbutazone, Sulfasalazine, Fenofibrate
-Most frequent Furosemide, Tolbutamide, Phenylbutazone, Metoprolol, Insulin, Acetaminophen
LEAP Metformin, Amethopterin, Amiloride/HCTZ, Fenofibrate
SMR Metformin, Insulin, Acetaminophen, Nifedipine, Fenofibrate
Table 5: Examples of medicine recommendations generated by Baselines and SMR.

Clinical Assessment

We invited three experienced clinical experts to evaluate the effectiveness of the medicine recommendations by scoring on a 6-point scale: 5 corresponding to completely cover all diagnoses without DDIs; 4 to partially (at least 50%) diagnoses include without DDIs; 3 to completely cover all diagnoses with DDIs; 2 to less than 50% diagnoses without DDIs; 1 to partially (at least 50%) diagnoses covered with DDIs; 0 to less than 50% diagnoses with DDIs. The average score of three experts is used as the final clinical assessment score for each recommendation, as shown in Figure 3.

Figure 3: Clinical Assessment.

Case Study

In table 5 we illustrate two events of medicine recommendations on medicine group 1 for patients associated with multiple types of diseases. SMR is qualified to succeed in all these two cases when comparing it against other baselines. For the first patient, SMR recommended a set of medicines with 100% coverage, with Vancomycin for Sepsis, Norepinephrine, Acetaminophen for respiratry failure, Furosemide and Amlodipine for Hypertension. In contrast, other baselines are not capable of make an adequate consideration. The rule-based method adopted Teicoplanin, targeting Sepsis only and not appropriate. The -Most frequent method and LEAP only selected Vancomycin for Sepsis and other medicines for Hypertension. For the second patient, SMR recommends more suitable medicines than LEAP and Rule-based method, i.e., Metformin and Insulin for Type 2 diabetes, Acetaminophen to release Rheumatoid arthritis, Nifedipine for Hypertension, and Fenofibrate for Hyperlipidemia. There is an adverse DDI among the medicines recommended by the -Most frequent method. Tolbutamide and Phenylbutazone can lead to harmful, potentially fatal effects when taken together. This case also indicates SMR can avoid the adverse DDIs when recommending medicines.

4 Related Work

In this section, we discuss related work, including medicine recommendation and medical knowledge graphs.

Medicine Recommendation

As introduced in Section 1, two types of methods, rule-based protocols [Chen et al.2016, Gunlicks-Stoessel et al.2016, Almirall et al.2012], and supervised-learning-based methods [Zhang et al.2017, Zhang et al.2014], are currently utilizing EMRs to recommending medicines. Ideally, medicine recommendation systems aim to tailor treatment to the individual characteristics of each patient [Fernald et al.2011]. Hence, medicine recommendation has also received attention recently in genetics/genomics research fields. There are already existing medicine recommendation systems [Rosen-Zvi et al.2008, Bennett and Hauser2013] by leveraging genetics/genomics information of patients in current practice, such information is not yet widely available in everyday clinical practice, and is insufficient since it only addresses one of many factors affecting response to medication.

Medical Knowledge Graphs

Recent evaluation efforts on knowledge graphs have focused on automatic knowledge base population and completion. Some knowledge graphs have also been constructed from huge volumes of medical databases over the last years, such as [Ernst, Siu, and Weikum2015], Bio2RDF[Dumontier et al.2014], and Chem2Bio2RDF[Chen et al.2010]. Medical knowledge graphs contain an abundance of basic medical facts of medicines and diseases and provide a pathway for medical discovery and applications, such as effective medicine recommendations. Unfortunately, such medical knowledge graphs suffer from serious data incomplete problem, which impedes its application in the field of clinical medicine.

5 Conclusion and Future Work

In this paper, we propose a novel framework SMR to recommend safe medicines for patients, especially for the patients with multiple diseases. SMR first constructs a high-quality heterogeneous graph by bridging EMRs (MIMIC-III) and medical knowledge graphs (ICD-9 ontology and DrugBank). Then, SMR jointly embeds diseases, medicines, patients, and their corresponding relations into a shared lower dimensional space. Finally, SMR uses the embeddings to decompose the medicine recommendation into a linked prediction process while considering the patient’s diagnoses and adverse drug reactions. Extensive experiments on real world datasets are conducted and demonstrate the effectiveness of SMR. In future work, we will improve the linking accuracy by considering more information of patients, such as the clinical outcomes and demographics.


  • [Almirall et al.2012] Almirall, D.; Compton, S. N.; Gunlicks-Stoessel, M.; Duan, N.; and Murphy, S. A. 2012. Designing a pilot sequential multiple assignment randomized trial for developing an adaptive treatment strategy. Statistics in Medicine 31(17):1887–1902.
  • [Bennett and Hauser2013] Bennett, C. C., and Hauser, K. 2013. Artificial intelligence framework for simulating clinical decision-making: A markov decision process approach. Artificial Intelligence in Medicine 57(1):9–19.
  • [Bordes et al.2013] Bordes, A.; Usunier, N.; Garcia-Duran, A.; Weston, J.; and Yakhnenko, O. 2013. Translating embeddings for modeling multi-relational data. In Advances in neural information processing systems, 2787–2795.
  • [Chen et al.2010] Chen, B.; Dong, X.; Jiao, D.; Wang, H.; Zhu, Q.; Ding, Y.; and Wild, D. J. 2010. Chem2bio2rdf: a semantic framework for linking and data mining chemogenomic and systems chemical biology data. BMC Bioinformatics 11(1):255.
  • [Chen et al.2016] Chen, Z.; Marple, K.; Salazar, E.; Gupta, G.; and Tamil, L. 2016. A physician advisory system for chronic heart failure management based on knowledge patterns.

    Theory and Practice of Logic Programming

  • [Dumontier et al.2014] Dumontier, M.; Callahan, A.; Cruz-Toledo, J.; Ansell, P.; Emonet, V.; Belleau, F.; and Droit, A. 2014. Bio2rdf release 3: a larger connected network of linked data for the life sciences. In Proceedings of the 2014 International Conference on Posters & Demonstrations Track-Volume 1272, 401–404. CEUR-WS. org.
  • [Edwards and Aronson2000] Edwards, I. R., and Aronson, J. K. 2000. Adverse drug reactions: definitions, diagnosis, and management. The Lancet 356(9237):1255–1259.
  • [Ernst, Siu, and Weikum2015] Ernst, P.; Siu, A.; and Weikum, G. 2015. Knowlife: a versatile approach for constructing a large knowledge graph for biomedical sciences. BMC Bioinformatics 16(1):157.
  • [Fernald et al.2011] Fernald, G. H.; Capriotti, E.; Daneshjou, R.; Karczewski, K. J.; and Altman, R. B. 2011. Bioinformatics challenges for personalized medicine. Bioinformatics 27(13):1741–1748.
  • [Gunlicks-Stoessel et al.2016] Gunlicks-Stoessel, M.; Mufson, L.; Westervelt, A.; and Almirall, D. 2016. A pilot smart for developing an adaptive treatment strategy for adolescent depression. Journal of Clinical Child & Adolescent Psychology 45(4):480–494.
  • [Johnson et al.2016] Johnson, A. E.; Pollard, T. J.; Shen, L.; Lehman, L.-w. H.; Feng, M.; Ghassemi, M.; Moody, B.; Szolovits, P.; Celi, L. A.; and Mark, R. G. 2016. Mimic-iii, a freely accessible critical care database. Scientific Data 3.
  • [Juurlink et al.2003] Juurlink, D. N.; Mamdani, M.; Kopp, A.; Laupacis, A.; and Redelmeier, D. A. 2003. Drug-drug interactions among elderly patients hospitalized for drug toxicity. Journal of The American Medical Association 289(13):1652–1658.
  • [Law et al.2014] Law, V.; Knox, C.; Djoumbou, Y.; Jewison, T.; Guo, A. C.; Liu, Y.; Maciejewski, A.; Arndt, D.; Wilson, M.; Neveu, V.; et al. 2014. Drugbank 4.0: shedding new light on drug metabolism. Nucleic Acids Research 42(D1):D1091–D1097.
  • [Leape et al.1995] Leape, L. L.; Bates, D. W.; Cullen, D. J.; Cooper, J.; Demonaco, H. J.; Gallivan, T.; Hallisey, R.; Ives, J.; Laird, N.; Laffel, G.; et al. 1995. Systems analysis of adverse drug events. Journal of The American Medical Association 274(1):35–43.
  • [Lin et al.2015] Lin, Y.; Liu, Z.; Sun, M.; Liu, Y.; and Zhu, X. 2015. Learning entity and relation embeddings for knowledge graph completion. In Proceedings of the 29th AAAI Conference on Artificial Intelligence, 2181–2187.
  • [Mikolov et al.2013] Mikolov, T.; Sutskever, I.; Chen, K.; Corrado, G. S.; and Dean, J. 2013. Distributed representations of words and phrases and their compositionality. In Advances in Neural Information Processing Systems, 3111–3119.
  • [Panagioti et al.2015] Panagioti, M.; Stokes, J.; Esmail, A.; Coventry, P.; Cheraghi-Sohi, S.; Alam, R.; and Bower, P. 2015. Multimorbidity and patient safety incidents in primary care: a systematic review and meta-analysis. PloS One 10(8):e0135947.
  • [Recht et al.2011] Recht, B.; Re, C.; Wright, S.; and Niu, F. 2011.

    Hogwild: A lock-free approach to parallelizing stochastic gradient descent.

    In Advances in Neural Information Processing Systems, 693–701.
  • [Rosen-Zvi et al.2008] Rosen-Zvi, M.; Altmann, A.; Prosperi, M.; Aharoni, E.; Neuvirth, H.; Sönnerborg, A.; Schülter, E.; Struck, D.; Peres, Y.; Incardona, F.; et al. 2008. Selecting anti-hiv therapies based on a variety of genomic and clinical factors. Bioinformatics 24(13):i399–i406.
  • [Schriml et al.2011] Schriml, L. M.; Arze, C.; Nadendla, S.; Chang, Y.-W. W.; Mazaitis, M.; Felix, V.; Feng, G.; and Kibbe, W. A. 2011. Disease ontology: a backbone for disease semantic integration. Nucleic Acids Research 40(D1):D940–D946.
  • [Tang et al.2015] Tang, J.; Qu, M.; Wang, M.; Zhang, M.; Yan, J.; and Mei, Q. 2015. Line: Large-scale information network embedding. In Proceedings of the 24th International Conference on World Wide Web, 1067–1077. International World Wide Web Conferences Steering Committee.
  • [Wang et al.2014] Wang, Z.; Zhang, J.; Feng, J.; and Chen, Z. 2014.

    Knowledge graph embedding by translating on hyperplanes.

    In Proceedings of the 28th AAAI Conference on Artificial Intelligence, 1112–1119.
  • [Wang et al.2017] Wang, M.; Zhang, J.; Liu, J.; Hu, W.; Wang, S.; Li, X.; and Liu, W. 2017. Pdd graph: Bridging electronic medical records and biomedical knowledge graphs via entity linking. In International Semantic Web Conference. Springer.
  • [Wei et al.2013] Wei, W.-Q.; Cronin, R. M.; Xu, H.; Lasko, T. A.; Bastarache, L.; and Denny, J. C. 2013. Development and evaluation of an ensemble resource linking medications to their indications. Journal of the American Medical Informatics Association 20(5):954–961.
  • [Zhang et al.2014] Zhang, P.; Wang, F.; Hu, J.; and Sorrentino, R. 2014. Towards personalized medicine: leveraging patient similarity and drug similarity analytics. Proceedings of AMIA Summits on Translational Science 2014:132.
  • [Zhang et al.2017] Zhang, Y.; Chen, R.; Tang, J.; Stewart, W. F.; and Sun, J. 2017. Leap: Learning to prescribe effective and safe treatment combinations for multimorbidity. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 1315–1324. ACM.