Knowledge graph (KG) entity typing aims at inferring possible missing entity type instances in KGs, which is a significant but still under-explored subtask of knowledge graph completion. In this paper, we propose a novel approach for KG entity typing which is trained by jointly utilizing local typing knowledge from existing entity type assertions and global triple knowledge from KGs. Specifically, we present two distinct knowledge-driven effective mechanisms of entity type inference. Accordingly, we build two novel embedding models to realize the mechanisms. Afterward, a joint model combining them is used to infer missing entity type instances, which favors inferences that agree with both entity type instances and triple knowledge in KGs. Experimental results on two real-world datasets (Freebase and YAGO) demonstrate the effectiveness of our proposed mechanisms and models for improving KG entity typing. The source code and data of this paper can be obtained from: https://github.com/Adam1679/ConnectE
The past decade has witnessed a great boom in building web-scale knowledge graphs (KGs), such as Freebase Bollacker et al. (2008), YAGO Suchanek et al. (2007), and Google Knowledge Graph Dong et al. (2014), which usually consist of a huge number of triples in the form of (head entity, relation, tail entity), denoted $(e_h, r, e_t)$. KGs usually suffer from incompleteness and miss important facts, jeopardizing their usefulness in downstream tasks such as question answering Elsahar et al. (2018), semantic parsing Berant et al. (2013), and relation classification Zeng et al. (2014). Hence, the task of knowledge graph completion (KGC, i.e., completing knowledge graph entries) is extremely significant and attracts wide attention.
This paper concentrates on KG entity typing, i.e., inferring missing entity type instances in KGs, which is an important sub-problem of KGC. Entity type instances, each in the form of (entity, entity type), denoted $(e, t)$, are essential entries of KGs and widely used in many NLP tasks such as relation extraction Zhang et al. (2018); Jain et al. (2018), coreference resolution Hajishirzi et al. (2013), and entity linking Gupta et al. (2017). Most previous works on KGC focus on inferring missing entities and relationships Bordes et al. (2013); Wang et al. (2014); Lin et al. (2015); Dettmers et al. (2017); Ding et al. (2018); Nathani et al. (2019), paying less attention to entity type prediction. However, KGs also usually suffer from entity type incompleteness. For instance, 10% of entities in FB15k Bordes et al. (2013) which have the /music/artist type miss the /people/person type Moon et al. (2017). KG entity type incompleteness renders some type-involved algorithms in KG-driven tasks grossly inefficient or even unusable.
To solve the KG entity type incompleteness issue, in this paper we propose a novel embedding methodology to infer missing entity type instances that employs not only local typing knowledge from entity type assertions, as most conventional models do, but also global triple knowledge from KGs. Accordingly, we build two distinct knowledge-driven type inference mechanisms from these two kinds of structural knowledge.
Missing entity types of an entity can be found from other entities that are close to the entity in the embedding space, using local typing knowledge as in Fig.1(Mech.1).
Missing entity types of a (head or tail) entity can be inferred from the types of the other (tail or head) entities through their relationships, using global triple knowledge as in Fig.1 (Mech.2).
The main idea behind Mech.1 is based on the observation that entity embeddings learned by conventional KG embedding methods Ji et al. (2016); Xie et al. (2016) cluster well according to their types in vector space. For instance, in Fig.1 (Mech.1), given the entity Barack Obama, its missing hierarchical type /people/person can be induced from the given hierarchical type of the similar entity Donald Trump. In addition, the key motivation behind Mech.2 is that the relationship should remain unchanged if the entities in a triple fact are replaced with their corresponding hierarchical types. For instance, given the global triple fact (Barack Obama, bornin, Honolulu), under this assumption we can induce a new type triple (/people/person, bornin, /location/location) (for more clarity, we represent it as (/location/location, born_in, /people/person) in Fig.1 (Mech.2)). Formally, $\mathbf{t}_h + \mathbf{r} \approx \mathbf{t}_t$, which can be used to infer missing entity types, e.g., (Barack Obama, type = ?) via $\mathbf{t}_h \approx \mathbf{t}_t - \mathbf{r}$, as Mech.2 does. Fig.1 gives a simple illustration of these two effective mechanisms of entity type inference. Both mechanisms are utilized to build our final composite model.
Specifically, we build two embedding models to realize the two mechanisms respectively. First, considering that entities and entity types are completely distinct objects, we build two distinct embedding spaces for them, i.e., entity space and entity type space. Accordingly, we encode an entity type instance by projecting the entity from entity space to entity type space with a mapping matrix $\mathbf{M}$, hence we have (1): $\mathbf{M}\cdot\mathbf{e} \approx \mathbf{t}$, called E2T. Moreover, we learn the plausibility of a (head type, relationship, tail type) global type triple newly generalized from a (head entity, relationship, tail entity) global triple fact, even though this type triple is not present originally. Following the translating assumption Bordes et al. (2013), we have (2): $\mathbf{t}_h + \mathbf{r}' \approx \mathbf{t}_t$, called TRT. E2T and TRT are the implementation models of the two mechanisms. Fig.2 gives a brief illustration of our models. A ranking-based embedding framework is used to train them. Thereby, entities, entity hierarchical types, and relationships are all embedded into low-dimensional vector spaces, where the composite energy score of both E2T and TRT is computed and utilized to determine the optimal types for (entity, entity type = ?) incomplete assertions. Experimental results on real-world datasets show that our composite model achieves significant and consistent improvement over all baselines in entity type prediction and comparable performance in entity type classification.
Our contributions are as follows:
We propose a novel framework for inferring missing entity type instances in KGs by connecting entity type instances and global triple information and correspondingly present two effective mechanisms.
Under these mechanisms, we propose two novel embedding-based models: one for predicting entity types given entities, and another to encode the interactions among entity types and relationships in KGs. A combination of both models is utilized to conduct entity type inference.
We conduct empirical experiments on two real-world datasets for entity type inference, which demonstrate that our model can successfully take into account global triple information to improve KG entity typing.
Models | Energy function | Sources | Training strategy
---|---|---|---
LM Neelakantan et al. (2015) | – | entity type instances | N/A
PEM Neelakantan et al. (2015) | – | entity type instances | N/A
RESCAL Nickel et al. (2011) | $\mathbf{e}_h^\top \mathbf{M}_r \mathbf{e}_t$ | mixed triple knowledge | syn.
RESCAL-ET Moon et al. (2017) | – | entity type inst. / triple know. | asyn.
HOLE Nickel et al. (2016) | $\mathbf{r}^\top (\mathbf{e}_h \star \mathbf{e}_t)$ | mixed triple knowledge | syn.
HOLE-ET Moon et al. (2017) | – | entity type inst. / triple know. | asyn.
TransE Bordes et al. (2013) | $\|\mathbf{e}_h + \mathbf{r} - \mathbf{e}_t\|$ | mixed triple knowledge | syn.
TransE-ET Moon et al. (2017) | – | entity type inst. / triple know. | asyn.
ETE Moon et al. (2017) | – | entity type inst. / triple know. | asyn.
ConnectE (ours) | $\|\mathbf{M}\cdot\mathbf{e} - \mathbf{t}\|_2$; $\|\mathbf{t}_h + \mathbf{r}' - \mathbf{t}_t\|_2$ | entity type inst. / triple know. | syn.
Entity typing is valuable for many NLP tasks Yaghoobzadeh et al. (2018), such as knowledge base population Zhou et al. (2018), question answering Elsahar et al. (2018), etc. In recent years, researchers have attempted to mine fine-grained entity types Yogatama et al. (2015); Choi et al. (2018); Xu and Barbosa (2018); Yuan and Downey (2018) with external text information, such as web search query logs Pantel et al. (2012), textual surface patterns Yao et al. (2013), context representations Abhishek et al. (2017), and Wikipedia Zhou et al. (2018). Despite their success, these methods rely on additional external sources, which might not be available for some KGs.
To be more universal, Neelakantan et al. (2015) propose two embedding models, the linear model (LM) and the projection embedding model (PEM), which can infer missing entity types using only the KG itself. Although PEM has more expressive power than LM, both ignore global triple knowledge, which could also help encode entity type assertions via shared entity embeddings. To address this issue, Moon et al. (2017) propose a state-of-the-art model (ETE) that combines triple knowledge and entity type instances for entity type prediction, building two entity type embedding methodologies: (1) Synchronous training: treat (entity, entity type) assertions as special triple facts with a unique relationship "rdf:type", e.g., (Barack Obama, "rdf:type", person), and encode all mixed triple facts (original triple data fused with all generated special ones) with conventional entity relation embedding models, such as RESCAL Nickel et al. (2011), HOLE Nickel et al. (2016) and TransE Bordes et al. (2013). (2) Asynchronous training: first learn the entities' embeddings with the conventional entity relation embedding models mentioned above, and then update only the entity types' embeddings while keeping the entities' embeddings fixed; this yields RESCAL-ET, HOLE-ET, TransE-ET and ETE. Although these approaches exploit global triple knowledge for entity type prediction, they still lack expressive ability due to the simplicity of their embeddings. In addition, they unreasonably assume that the embeddings of entities and entity types lie in the same latent space. Since entities and entity types are completely distinct objects, it may not be reasonable to represent them in a common semantic space.
In this paper, we introduce an enhanced KG entity type embedding model with better expressive and reasoning capability, considering both local entity typing information and global triple knowledge in KGs. Note that incorporating more external information Jin et al. (2018); Neelakantan et al. (2015) is not the main focus of this paper; we only consider the internal structural information in KGs, which makes our work more challenging but also more universal and flexible given the limited information. Recently, Lv et al. (2018); Hao et al. (2019) also attempt to embed structural information in KGs. However, their goals and models are very different from ours: they encode "concepts", not hierarchical types, whereas we focus on the latter. Table 1 summarizes the energy functions and other settings of entity type embedding models.
We consider a KG containing entity type instances of the form $(e, t) \in \Delta$, where $\Delta$ is the training set consisting of (entity, entity type) assertions, $e \in \mathcal{E}$ ($\mathcal{E}$ is the set of all entities) is an entity in the KG, and $t \in \mathcal{T}$ ($\mathcal{T}$ is the set of all types) is its type. For example, $e$ could be Barack Obama and $t$ could be /people/person. As a single entity can have multiple types, entities in a KG often miss some of their types. The aim of this work is to infer missing entity type instances in KGs.
Our work concerns energy-based methods, which learn low-dimensional vector representations (embeddings) of atomic symbols (i.e. entities, entity hierarchical types, relationships). In this framework, we learn two submodels: (1) one for predicting entity types given entities, and (2) another one to encode the interactions among entity types and relationships from KGs. The joint action of both models in prediction allows us to use the connection between triple knowledge and entity type instances to perform KG entity typing.
The first model (E2T) of the framework concerns learning a function with local typing knowledge from entity type instances, designed to score the similarity of an entity $e$ and a type $t$. The main ideas behind this model are as follows: (1) Since the learned entity embeddings cluster well when the entities have the same or similar types, it is intuitive that an entity type embedding represents the projected common concept of a cluster of entities, i.e., $\mathbf{M}\cdot\mathbf{e} \approx \mathbf{t}$, where $\mathbf{e} \in \mathbb{R}^{d_e}$ is the embedding of the entity $e$ and $\mathbf{t} \in \mathbb{R}^{d_t}$ is the embedding of the type $t$. Since the entity type embedding captures the common information of its entities, it should have fewer variates, i.e., $d_t < d_e$. (2) Since entities and entity types are totally distinct objects, we build two embedding spaces for them, i.e., entity space and entity type space. (3) Inspired by the previous work TranSparse Ji et al. (2016), which projects entities from entity space to relation space with a mapping matrix, we adapt this idea by replacing relation space with entity type space, and thus define the projection $\mathbf{M}\cdot\mathbf{e}$ with $\mathbf{M} \in \mathbb{R}^{d_t \times d_e}$. Therefore, this model first projects an entity embedding into entity type space and then computes a similarity measure between this projection and an entity type embedding. The scoring function of E2T given $(e, t)$ is:
$E(e, t) = \|\mathbf{M}\cdot\mathbf{e} - \mathbf{t}\|_{2}$  (1)
where $\mathbf{M} \in \mathbb{R}^{d_t \times d_e}$ is a transfer matrix mapping entity embeddings into entity type space. The score is expected to be lower for a golden entity type instance and higher for an incorrect one.
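To make the projection concrete, here is a minimal numpy sketch of the E2T score. The names (score_e2t, M, e, t) and the toy dimensions are illustrative assumptions, not the released implementation.

```python
import numpy as np

d_e, d_t = 100, 50                              # toy dimensions, d_t < d_e
rng = np.random.default_rng(0)
M = rng.uniform(-0.1, 0.1, size=(d_t, d_e))     # transfer matrix: entity -> type space
e = rng.uniform(-0.1, 0.1, size=d_e)            # entity embedding
t = rng.uniform(-0.1, 0.1, size=d_t)            # type embedding

def score_e2t(M, e, t):
    """E2T score (Eq. 1): lower means a more plausible (entity, type) pair."""
    return np.linalg.norm(M @ e - t, ord=2)

print(score_e2t(M, e, t))
```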
Using only entity type instances for training ignores much of the relational knowledge that can be leveraged from triple facts in KGs. To connect this relational data with our model, we propose to learn entity type and relationship embeddings from global triple knowledge in KGs. The key motivations behind this model are: (1) As mentioned above, entities cluster well according to their types. Therefore, we believe that an essential premise for a triple (head entity, relationship, tail entity) to hold is that its corresponding entity types conform to this relationship. Hence, we can build a new entity type triple (head type, relationship, tail type) by replacing both head entity and tail entity with their corresponding types: i.e., for each $(e_h, r, e_t) \in \Omega$, where $\Omega$ is the training set of triples and $r \in \mathcal{R}$ ($\mathcal{R}$ is the set of relationships), we generate $(t_h, r, t_t)$, where $t_h$ and $t_t$ stand for the hierarchical types of the head and tail entity respectively. (2) Since the relationship remains unchanged in the replacement, we build two differentiated embeddings for each relationship in the two embedding spaces: $\mathbf{r}$ in entity space and $\mathbf{r}'$ in entity type space. (3) Given an entity type triple $(t_h, r, t_t)$, under the translation assumption as in Bordes et al. (2013) (we chose TransE in this paper; it is not difficult for other enhanced translation-based methods to model triple knowledge, such as Trans(H, R, D and G) Wang et al. (2017)), we have: $\mathbf{t}_h + \mathbf{r}' \approx \mathbf{t}_t$. Hence, the scoring function is defined as:
$E(t_h, r, t_t) = \|\mathbf{t}_h + \mathbf{r}' - \mathbf{t}_t\|_{2}$  (2)
where $\mathbf{t}_h, \mathbf{t}_t, \mathbf{r}' \in \mathbb{R}^{d_t}$. The model returns a lower score if the two entity types are close under this relationship and a higher one otherwise.
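A matching minimal sketch of the TRT score; again, the function and variable names are illustrative.

```python
import numpy as np

def score_trt(t_h, r_prime, t_t):
    """TRT score (Eq. 2): translation in type space, lower is better."""
    return np.linalg.norm(t_h + r_prime - t_t, ord=2)

# e.g. with d_t = 50:
rng = np.random.default_rng(1)
t_h, r_prime, t_t = (rng.uniform(-0.1, 0.1, size=50) for _ in range(3))
print(score_trt(t_h, r_prime, t_t))
```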
Fig.2 shows an illustration of E2T and TRT.
Our framework can be used for entity type prediction in the following way. First, for each entity $e$ that appears in the testing set, a prediction by E2T is performed with:

$\tilde{t} = \arg\min_{t \in \mathcal{T}} \|\mathbf{M}\cdot\mathbf{e} - \mathbf{t}\|_{2}$  (3)
In addition, a composite score (E2T+TRT) connecting entity type instances and entity type triples with the embedding models, which we call ConnectE (also written ConnectE-(E2T+TRT); for uniformity in the experiments we use ConnectE-(E2T+0) to denote E2T alone), is defined as follows:

$E_{com}(e, t) = \lambda \, \|\mathbf{M}\cdot\mathbf{e} - \mathbf{t}\|_{2} + (1-\lambda)\Big[\frac{1}{N_h}\sum_{t_i \in \mathcal{T}_{head}} \|\mathbf{t}_i + \mathbf{r}' - \mathbf{t}\|_{2} + \frac{1}{N_t}\sum_{t_j \in \mathcal{T}_{tail}} \|\mathbf{t} + \mathbf{r}' - \mathbf{t}_j\|_{2}\Big]$

where $\lambda$ is a hyperparameter for the trade-off, $\mathcal{T}_{tail}$ is the set of all corresponding tail entities' types given that $e$ is a head entity, $\mathcal{T}_{head}$ is the set of all corresponding head entities' types given that $e$ is a tail entity, and $N_t$ and $N_h$ represent the total number of entity types in $\mathcal{T}_{tail}$ and $\mathcal{T}_{head}$ respectively. A prediction is performed with:

$\tilde{t} = \arg\min_{t \in \mathcal{T}} E_{com}(e, t)$  (4)
Hence, our final composite model ConnectE-(E2T+TRT) favors predictions that agree with both entity type instances and global triple information in KGs.
We use a ranking loss algorithm for training ConnectE-(E2T+TRT), in which the parameter set is $\theta = \{\mathbf{E}, \mathbf{T}, \mathbf{R}, \mathbf{R}', \mathbf{M}\}$. $\mathbf{E}$ and $\mathbf{T}$ stand for the collections of all entities' and types' embeddings respectively; $\mathbf{R}$ and $\mathbf{R}'$ denote the collections of the relationships' differentiated embeddings. The ranking objectives are designed to assign lower scores to true facts (including triple facts, entity type instances and type triples) than to any corrupted ones. We build three sub-objective functions, $\mathcal{L}_1$, $\mathcal{L}_2$ and $\mathcal{L}_3$, and implement a dynamic optimization strategy, i.e., fix part of the parameters and update only the rest when minimizing each function. (1) $\mathcal{L}_1$: we choose TransE (see Bordes et al. (2013)) to model triple facts, updating the embeddings of entities ($\mathbf{E}$) and of relationships ($\mathbf{R}$). (2) $\mathcal{L}_2$: we update only the embeddings of entity types ($\mathbf{T}$) and the projection matrix $\mathbf{M}$, not the entities' embeddings already trained in $\mathcal{L}_1$. (3) $\mathcal{L}_3$: we update only the embeddings of relationships in type space ($\mathbf{R}'$) while keeping the entity types' embeddings fixed. The training is performed using AdaGrad. All embeddings in $\theta$ are initialized with a uniform distribution. The procedure, from $\mathcal{L}_1$ to $\mathcal{L}_3$, is iterated for a given number of iterations. Each objective takes the margin-based form, e.g., for entity type instances:

$\mathcal{L}_2 = \sum_{(e,t)\in\Delta}\ \sum_{(e,t')\in\Delta'} \big[\gamma_2 + E(e, t) - E(e, t')\big]_+$

where $\gamma_1$, $\gamma_2$, $\gamma_3$ are margin hyperparameters and the corrupted datasets are built by randomly replacing one element of a true fact. $\Omega$ and $\Delta$ are the training datasets of triple facts and entity type instances in the KG; $\Psi$ is the training data of type triples, built by replacing the entities in $\Omega$ with their corresponding entity types.
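A minimal sketch of the hinge loss and the type-corruption step, assuming energies are computed elsewhere; names are illustrative.

```python
import numpy as np

def hinge_loss(pos, neg, margin):
    """Margin-based ranking loss: [margin + E(true) - E(corrupt)]_+ over a batch."""
    return np.maximum(0.0, margin + pos - neg).sum()

def corrupt_type(t_idx, num_types, rng):
    """Build a negative (entity, type) pair by swapping in a random wrong type."""
    t_neg = int(rng.integers(num_types))
    while t_neg == t_idx:
        t_neg = int(rng.integers(num_types))
    return t_neg
```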
We conduct the experiments on two real-world datasets widely used in the KG embedding literature, FB15k Bordes et al. (2013) and YAGO43k Moon et al. (2017), which are subsets of Freebase Bollacker et al. (2008) and YAGO Suchanek et al. (2007) respectively. They consist of triples, each formed as (left entity, relationship, right entity). We utilize the two entity type datasets built in Moon et al. (2017), called FB15kET and YAGO43kET, each instance formed as (entity, entity type), in which the entity types are mapped to entities from FB15k and YAGO43k respectively.
Moreover, we build new type triple datasets, each entry formed as (head type, relationship, tail type), to train our model. They are built from the triple data and the entity type data. First, for each triple, we replace the head and the tail with their types according to the entity type assertions; the generated datasets are called FB15kTRT(full) and YAGO43kTRT(full). Second, considering the scalability of the proposed approach for full KGs, we further modify the generation method of type triples, which is the major training bottleneck: we discard newly generated type triples with low frequency (i.e., #frequency = 1). After that, the sizes of FB15kTRT(full) and YAGO43kTRT(full) decrease by about 90%; the resulting datasets are called FB15kTRT(disc.) and YAGO43kTRT(disc.) respectively. The statistics of the datasets are shown in Table 2. To save space, we put more data processing details (including cleaning and dataset construction) on our GitHub site.
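A sketch of the type-triple generation just described, including the frequency-1 discard that yields the (disc.) datasets; the function and argument names are illustrative.

```python
from collections import Counter

def build_type_triples(triples, entity_types, min_freq=2):
    """triples: iterable of (head, rel, tail); entity_types: dict entity -> [types].
    Returns (head type, rel, tail type) triples seen at least min_freq times;
    use min_freq=1 to reproduce the (full) variant."""
    counts = Counter()
    for h, r, t in triples:
        for th in entity_types.get(h, ()):
            for tt in entity_types.get(t, ()):
                counts[(th, r, tt)] += 1
    return [ttr for ttr, c in counts.items() if c >= min_freq]
```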
Dataset | #Ent | #Rel | #Train | #Valid | #Test |
---|---|---|---|---|---|
FB15k | 14,951 | 1,345 | 483,142 | 50,000 | 59,071 |
YAGO43k | 42,335 | 37 | 331,687 | 29,599 | 29,593 |
Dataset | #Ent | #Type | #Train | #Valid | #Test |
---|---|---|---|---|---|
FB15kET | 14,951 | 3,851 | 136,618 | 15,749 | 15,780 |
YAGO43kET | 41,723 | 45,182 | 375,853 | 42,739 | 42,750 |
Dataset | #Type | #Rel | #Train | #Valid | #Test
---|---|---|---|---|---|
FB15kTRT(full) | 3,851 | 1,345 | 2,015,338 | – | – |
FB15kTRT(disc.) | 2,060 | 614 | 231,315 | – | – |
YAGO43kTRT(full) | 45,128 | 37 | 1,727,708 | – | – |
YAGO43kTRT(disc.) | 17,910 | 32 | 189,781 | – | – |
This task concentrates on completing a pair (entity, entity type) when the type is missing, and aims to verify the capability of our model to infer missing entity type instances.
Evaluation Protocol. We focus on entity type prediction as determined by Formulas (3) and (4), using ranking criteria for evaluation. For each test pair, we first remove the type and replace it by each of the types in $\mathcal{T}$ in turn. The function values of these candidate pairs are computed by the related models and sorted in ascending order, giving the exact rank of the correct type among the candidates. Finally, we use two metrics for comparison: (1) the mean reciprocal rank (MRR), and (2) the proportion of correct types ranked in the top 1/3/10 (HITS@1/3/10)(%). Since the "Raw" evaluation setting is not as accurate as "Filter" Bordes et al. (2013), we only report experimental results with the latter setting in this paper. The MRR is:

$\mathrm{MRR} = \frac{1}{|S|}\sum_{i=1}^{|S|} \frac{1}{\mathrm{rank}_i}$

where $S$ is the set of test pairs and $\mathrm{rank}_i$ is the rank position of the true entity type for the $i$-th pair.
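A sketch of the filtered ranking evaluation; score_fn and the container names are assumptions about the interface, not the released code.

```python
import numpy as np

def rank_metrics(score_fn, test_pairs, known_types):
    """Filtered MRR and HITS@k. score_fn(e) returns one score per candidate
    type (lower = better); known_types[e] is the set of e's other true types."""
    ranks = []
    for e, t_true in test_pairs:
        order = np.argsort(score_fn(e))           # ascending scores
        filtered = [c for c in order if c == t_true or c not in known_types[e]]
        ranks.append(filtered.index(t_true) + 1)
    ranks = np.asarray(ranks, dtype=float)
    out = {"MRR": float((1.0 / ranks).mean())}
    for k in (1, 3, 10):
        out[f"HITS@{k}"] = float((ranks <= k).mean() * 100)
    return out
```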
DATASET | FB15kET | YAGO43kET
---|---|---
METRICS | MRR | HITS@1 | HITS@3 | HITS@10 | MRR | HITS@1 | HITS@3 | HITS@10
RESCAL Nickel et al. (2011) | 0.19 | 9.71 | 19.58 | 37.58 | 0.08 | 4.24 | 8.31 | 15.31
RES.-ET Moon et al. (2017) | 0.24 | 12.17 | 27.92 | 50.72 | 0.09 | 4.32 | 9.62 | 19.40
HOLE Nickel et al. (2016) | 0.22 | 13.29 | 23.35 | 38.16 | 0.16 | 9.02 | 17.28 | 29.25
HOLE-ET Moon et al. (2017) | 0.42 | 29.40 | 48.04 | 66.73 | 0.18 | 10.28 | 20.13 | 34.90
TransE Bordes et al. (2013) | 0.45 | 31.51 | 51.45 | 73.93 | 0.21 | 12.63 | 23.24 | 38.93
TransE-ET Moon et al. (2017) | 0.46 | 33.56 | 52.96 | 71.16 | 0.18 | 9.19 | 19.41 | 35.58
ETE Moon et al. (2017) | 0.50 | 38.51 | 55.33 | 71.93 | 0.23 | 13.73 | 26.28 | 42.18
ConnectE-(E2T+0) | 0.57 ± .00 | 45.54 ± .28 | 62.31 ± .29 | 78.12 ± .12 | 0.24 ± .01 | 13.54 ± .12 | 26.20 ± .18 | 44.51 ± .09
ConnectE-(E2T+TRT)(disc.) | 0.59 ± .01 | 48.54 ± .71 | 63.66 ± .39 | 78.27 ± .16 | 0.27 ± .01 | 15.10 ± .15 | 29.14 ± .13 | 47.08 ± .09
ConnectE-(E2T+TRT)(full) | 0.59 ± .00 | 49.55 ± .62 | 64.32 ± .37 | 79.92 ± .14 | 0.28 ± .01 | 16.01 ± .12 | 30.85 ± .13 | 47.92 ± .07
Implementation. The results of entity type prediction are shown in Table 3, where the results for the baselines are taken directly from the original literature Moon et al. (2017). We do not choose LM and PEM Neelakantan et al. (2015) as baselines since they do not utilize triple knowledge, so a comparison would not be fair. For training our model, we select the learning rate among {0.1, 0.05, 0.001}, the margins $\gamma_1$, $\gamma_2$, $\gamma_3$, the embedding dimension pairs $(d_e, d_t)$ among {(100, 50), (150, 75), (200, 100), (250, 125)}, and the weight $\lambda$ among {0.5, 0.65, 0.85, 0.95}. We use negative sampling and gradient descent with AdaGrad as our optimization approach to improve convergence. During initialization, each embedding vector of the entities, entity types and relationships is initialized with random values following a uniform distribution $U(-6/\sqrt{d},\, 6/\sqrt{d})$, where $d$ is the corresponding embedding dimension. During the whole training process, we normalize the entity embeddings after each epoch.
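A small sketch of the initialization and per-epoch normalization; the uniform bound follows the common TransE-style recipe, which is an assumption here.

```python
import numpy as np

def init_embeddings(n, d, rng):
    """Uniform init in [-6/sqrt(d), 6/sqrt(d)] for n embeddings of dimension d."""
    bound = 6.0 / np.sqrt(d)
    return rng.uniform(-bound, bound, size=(n, d))

def normalize_entities(E):
    """L2-normalize entity embeddings (applied after each epoch)."""
    return E / np.linalg.norm(E, axis=1, keepdims=True)
```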
We select the parameters based on MRR on the validation set, choosing the optimal configurations accordingly on FB15k/ET/TRT and on YAGO43k/ET/TRT. We run 800 epochs on both datasets with a batch size of 4096.
Experimental Results. We can see from Table 3 that our ConnectE models outperform all baselines for entity type prediction in terms of all metrics on FB15kET and YAGO43kET. This confirms the capability of ConnectE to model local typing and global triple knowledge and to infer missing entity type instances in KGs. The model ConnectE-(E2T+TRT)(full) achieves the highest scores.
Analysis. (1) In E2T, we utilize a mapping matrix M which compresses entity embeddings into type embedding space, considering that an entity type embedding represents the common information of all entities belonging to that type; the type embedding should lie in a shared subspace of the entity embeddings. The experimental results of E2T compared with the baselines demonstrate that this assumption is quite reasonable. (2) In E2T+TRT, we build new type-relation-type data and connect it with entity type instances. This approach provides more direct useful information to (weakly) supervise entity type prediction. For example, given that the head entity Barack Obama belongs to type /people/person and the relationship is born_in, we can make the best guess of the type of the tail entity Honolulu as /location/location. Hence, the addition of type triples gives ConnectE-(E2T+TRT) superior performance over ConnectE-(E2T+0). (3) Concerning the scalability of our approach for big KGs, we utilize FB15kTRT(disc.) and YAGO43kTRT(disc.) for prediction, whose training time is reduced by 90% as the training data size decreased by 90%. Moreover, the results of ConnectE-(E2T+TRT)(disc.) show that it is comparable with the best ConnectE-(E2T+TRT)(full).
This task aims to judge whether each entity type instance in testing data holds or not, which could be viewed as a binary classification problem.
Evaluation Protocol. Since there are no explicit negative entity type instances in existing KGs, we create classification datasets by randomly switching the types of entity type pairs in the validation and testing sets, yielding equal numbers of positive and negative examples. Inspired by the evaluation metric of triple classification in Socher et al. (2013), we calculate the scores of all entity type instances with the model energy function and rank all instances in the testing set by these scores; instances with lower scores are considered true. We use precision/recall curves to show the performance of all models, and we also compare accuracy across models. We first use the validation set to find the best threshold $\delta$: if the model score of an entity type instance is below $\delta$, the instance is classified as positive, otherwise as negative. The final accuracy is based on how many facts are classified correctly.
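A sketch of the threshold selection just described; the names (best_threshold, valid_scores, valid_labels) are illustrative.

```python
import numpy as np

def best_threshold(valid_scores, valid_labels):
    """Scan candidate thresholds on the valid set; an instance with
    score <= delta is classified positive (label True)."""
    best_acc, best_delta = -1.0, None
    for delta in np.unique(valid_scores):
        acc = float(((valid_scores <= delta) == valid_labels).mean())
        if acc > best_acc:
            best_acc, best_delta = acc, float(delta)
    return best_delta, best_acc
```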
Implementation. We utilize the source codes and parameter settings of the baselines provided by Moon et al. (2017) for this task. The optimal parameter settings for our proposed models are selected on FB15kET and YAGO43kET as before. On both datasets, we train on all the training data for 800 epochs with a batch size of 4096. After training, we first draw PR-curves with dynamic thresholds, then select the best threshold based on accuracy on the validation set, which is used to calculate the accuracy on the test set.
Experimental Results. We draw the PR-curves for the type classification task on both datasets in Fig.3. Note that we only report the results of ConnectE-(E2T+TRT)(disc.), not ConnectE-(E2T+TRT)(full), since the former trains much faster and its results are close to the best results of the latter. We can see from Fig.3 that when the recall rate is between 0.88 and 0.97, the ConnectE-(E2T+TRT)(disc.) model achieves the highest precision on FB15kET; in other ranges it also shows comparable performance. The result is consistent on YAGO43kET. Specifically, ConnectE-(E2T+TRT)(disc.) achieves the best F1 score of 94.66% at recall = 94.27% and precision = 95.05% on FB15kET, and it surpasses other models with an F1 score of 92.13% at precision = 93.18% and recall = 91.11% on YAGO43kET. This confirms the capability of our models: they not only infer missing types in KGs but also perform well in KG entity type classification.
Table 4 shows the accuracy results of entity type classification, from which we observe: (1) On FB15kET, ConnectE-(E2T+TRT)(disc.) achieves the best accuracy score (94.49%); compared to the most closely related model, ETE, our model shows a 0.48% absolute improvement. On YAGO43kET, ConnectE-(E2T+TRT)(disc.) outperforms the other models as well, with an improvement over ETE of almost 1.51%. (2) Compared with the improvement on YAGO43kET, the advantage of ConnectE-(E2T+TRT)(disc.) over ConnectE-(E2T+0) on FB15kET seems insignificant, indicating that the type triples in FB15kTRT contribute less to entity type classification than those in YAGO43kTRT. This may be partially caused by the fact that the number of relations in YAGO43k (#Rel=37) is far smaller than in FB15k (#Rel=1,345), which considerably influences the effectiveness of the type-relation-type training set. Due to the rareness of relationships in YAGO43k, each entity usually connects with a large number of other entities through a single relationship, which means that the magnitudes of $N_h$ and $N_t$ in the composite model scoring function are large. After averaging, ConnectE-(E2T+TRT)(disc.) can thus achieve more stable and significant results on YAGO43kET.
Model | FB15kET | YAGO43kET
---|---|---
RESCAL-ET | 90.02% | 82.28%
HOLE-ET | 93.23% | 90.14%
TransE-ET | 93.88% | 90.76%
ETE | 94.01% | 90.82%
ConnectE (E2T+0) | 94.45% | 91.78%
ConnectE (E2T+TRT)(disc.) | 94.49% | 92.33%
Table 5 shows examples of entity type prediction by our model on FB15k/ET/TRT, which illustrate our motivation for Mech. 2: head type and tail type really do maintain the relationship between head entity and tail entity. Given the entity Peter Berg, TRT can find the HITS@1 type prediction /people/person for it via the existing entity type assertion (New York, /location/location) and the relationship (/location/location/people_born_here) between them, i.e., $\mathbf{t}_h \approx \mathbf{t}_t - \mathbf{r}'$.
# | Head type | Rel | Tail type
---|---|---|---
1 | /people/person | /location/location/people_born_here | /location/location
 | head entity: Peter Berg; Gus Van Sant | | tail entity: New York; Louisville
2 | /film/film | /film/film/directed_by | /film/director
 | head entity: Very Bad Things; Rush Hour | | tail entity: Peter Berg; Brett Ratner
3 | – | /people/cause_of_death/people | /people/person
 | head entity: – | | tail entity: Dick Clark; John Hurt
In this paper, we described a framework for leveraging global triple knowledge to improve KG entity typing, training not only on (entity, entity type) assertions but also on newly generated (head type, relationship, tail type) type triples. Specifically, we proposed two novel embedding-based models to encode entity type instances and entity type triples respectively; the combination of both models is utilized to infer missing entity type instances. The empirical experiments demonstrate the effectiveness of our proposed model. Our modeling method is general and should apply to other type-oriented tasks. Next, we plan to use this framework for KG entity type noise detection.
The authors would like to thank all anonymous reviewers for their insightful comments. We also want to thank Zhiyuan Liu (Tsinghua University) and Linmei Hu (BUPT) for their useful suggestions and comments on early drafts. This work was supported by the National Natural Science Foundation of China under Grant No.61922085, 61906159, the Sichuan Science and Technology Program under Grant No.2018JY0607, the Fundamental Research Funds for the Central Universities under Grant No.JBK2003008, Fintech Innovation Center, and Financial Intelligence and Financial Engineering Key Laboratory of Sichuan Province.
Socher, R., et al. (2013). Reasoning with neural tensor networks for knowledge base completion. In Proceedings of NeurIPS, pp. 926–934.
Wang, Z., et al. (2014). Knowledge graph embedding by translating on hyperplanes. In Proceedings of AAAI, pp. 1112–1119.
Yaghoobzadeh, Y., et al. (2018). Journal of Artificial Intelligence Research 61, pp. 835–862.
Zeng, D., et al. (2014). Relation classification via convolutional deep neural network. In Proceedings of COLING, pp. 23–29.