Hongyu Ren, Weihua Hu, Jure Leskovec
Answering complex logical queries on large-scale incomplete knowledge graphs (KGs) is a fundamental yet challenging task. Recently, a promising approach to this problem has been to embed KG entities as well as the query into a vector space such that entities that answer the query are embedded close to the query. However, prior work models queries as single points in the vector space, which is problematic because a complex query represents a potentially large set of answer entities, and it is unclear how such a set can be represented as a single point. Furthermore, prior work can only handle queries that use conjunctions ($\wedge$) and existential quantifiers ($\exists$). Handling queries with logical disjunctions ($\vee$) remains an open problem. Here we propose query2box, an embedding-based framework for reasoning over arbitrary queries with $\wedge$, $\vee$, and $\exists$ operators in massive and incomplete KGs. Our main insight is that queries can be embedded as boxes (i.e., hyper-rectangles), where the set of points inside the box corresponds to the set of answer entities of the query. We show that conjunctions can be naturally represented as intersections of boxes and also prove a negative result that handling disjunctions would require an embedding dimension proportional to the number of KG entities. However, we show that by transforming queries into Disjunctive Normal Form, query2box is capable of handling arbitrary logical queries with $\wedge$, $\vee$, and $\exists$ in a scalable manner. We demonstrate the effectiveness of query2box on three large KGs and show that query2box achieves up to 25% relative improvement over the state of the art.
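The geometric idea in the abstract can be illustrated with a minimal sketch. It assumes axis-aligned boxes stored as lower and upper corners, a plain geometric intersection for conjunctions, and a "member of any box" test for a query already rewritten in Disjunctive Normal Form. The names `Box`, `intersect`, and `answers_dnf` are hypothetical and chosen for illustration; the operators learned by query2box itself may differ from this hand-written geometry.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence, Tuple


@dataclass(frozen=True)
class Box:
    """Axis-aligned hyper-rectangle (hypothetical helper, not the paper's code)."""
    lower: Tuple[float, ...]
    upper: Tuple[float, ...]

    def contains(self, point: Sequence[float]) -> bool:
        # A point (entity embedding) answers the query if it lies inside the box
        # along every dimension.
        return all(lo <= x <= hi for lo, x, hi in zip(self.lower, point, self.upper))


def intersect(a: Box, b: Box) -> Optional[Box]:
    """Conjunction as a plain geometric intersection; None if the boxes are disjoint."""
    lower = tuple(max(x, y) for x, y in zip(a.lower, b.lower))
    upper = tuple(min(x, y) for x, y in zip(a.upper, b.upper))
    if any(lo > hi for lo, hi in zip(lower, upper)):
        return None
    return Box(lower, upper)


def answers_dnf(point: Sequence[float], conjunct_boxes: List[Box]) -> bool:
    """Disjunction over a DNF query: the point answers it if any conjunct's box contains it."""
    return any(box.contains(point) for box in conjunct_boxes)


# Two overlapping boxes in 2-D and one box far away from both.
a = Box((0.0, 0.0), (2.0, 2.0))
b = Box((1.0, 1.0), (3.0, 3.0))
far = Box((5.0, 5.0), (6.0, 6.0))

both = intersect(a, b)          # box for the conjunction "a AND b"
nothing = intersect(a, far)     # disjoint boxes have no common region
```

Keeping one box per conjunct and combining them only at the final step mirrors the role DNF plays in the abstract: the union of several boxes is generally not itself a box, so the disjunction is applied to the per-conjunct results rather than embedded as a single region.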
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Knowledge Graphs | FB15k | MRR 1p | 0.68 | Q2B |
| Knowledge Graphs | FB15k | MRR 2i | 0.551 | Q2B |
| Knowledge Graphs | FB15k | MRR 2p | 0.21 | Q2B |
| Knowledge Graphs | FB15k | MRR 2u | 0.351 | Q2B |
| Knowledge Graphs | FB15k | MRR 3i | 0.665 | Q2B |
| Knowledge Graphs | FB15k | MRR 3p | 0.142 | Q2B |
| Knowledge Graphs | FB15k | MRR ip | 0.261 | Q2B |
| Knowledge Graphs | FB15k | MRR pi | 0.394 | Q2B |
| Knowledge Graphs | FB15k | MRR up | 0.167 | Q2B |
| Knowledge Graphs | NELL-995 | MRR 1p | 0.422 | Q2B |
| Knowledge Graphs | NELL-995 | MRR 2i | 0.333 | Q2B |
| Knowledge Graphs | NELL-995 | MRR 2p | 0.14 | Q2B |
| Knowledge Graphs | NELL-995 | MRR 2u | 0.113 | Q2B |
| Knowledge Graphs | NELL-995 | MRR 3i | 0.445 | Q2B |
| Knowledge Graphs | NELL-995 | MRR 3p | 0.112 | Q2B |
| Knowledge Graphs | NELL-995 | MRR ip | 0.168 | Q2B |
| Knowledge Graphs | NELL-995 | MRR pi | 0.224 | Q2B |
| Knowledge Graphs | NELL-995 | MRR up | 0.1103 | Q2B |
| Knowledge Graphs | FB15k-237 | MRR 1p | 0.406 | Q2B |
| Knowledge Graphs | FB15k-237 | MRR 2i | 0.295 | Q2B |
| Knowledge Graphs | FB15k-237 | MRR 2p | 0.094 | Q2B |
| Knowledge Graphs | FB15k-237 | MRR 2u | 0.113 | Q2B |
| Knowledge Graphs | FB15k-237 | MRR 3i | 0.423 | Q2B |
| Knowledge Graphs | FB15k-237 | MRR 3p | 0.068 | Q2B |
| Knowledge Graphs | FB15k-237 | MRR ip | 0.126 | Q2B |
| Knowledge Graphs | FB15k-237 | MRR pi | 0.212 | Q2B |
| Knowledge Graphs | FB15k-237 | MRR up | 0.076 | Q2B |
| Knowledge Graph Completion | FB15k | MRR 1p | 0.68 | Q2B |
| Knowledge Graph Completion | FB15k | MRR 2i | 0.551 | Q2B |
| Knowledge Graph Completion | FB15k | MRR 2p | 0.21 | Q2B |
| Knowledge Graph Completion | FB15k | MRR 2u | 0.351 | Q2B |
| Knowledge Graph Completion | FB15k | MRR 3i | 0.665 | Q2B |
| Knowledge Graph Completion | FB15k | MRR 3p | 0.142 | Q2B |
| Knowledge Graph Completion | FB15k | MRR ip | 0.261 | Q2B |
| Knowledge Graph Completion | FB15k | MRR pi | 0.394 | Q2B |
| Knowledge Graph Completion | FB15k | MRR up | 0.167 | Q2B |
| Knowledge Graph Completion | NELL-995 | MRR 1p | 0.422 | Q2B |
| Knowledge Graph Completion | NELL-995 | MRR 2i | 0.333 | Q2B |
| Knowledge Graph Completion | NELL-995 | MRR 2p | 0.14 | Q2B |
| Knowledge Graph Completion | NELL-995 | MRR 2u | 0.113 | Q2B |
| Knowledge Graph Completion | NELL-995 | MRR 3i | 0.445 | Q2B |
| Knowledge Graph Completion | NELL-995 | MRR 3p | 0.112 | Q2B |
| Knowledge Graph Completion | NELL-995 | MRR ip | 0.168 | Q2B |
| Knowledge Graph Completion | NELL-995 | MRR pi | 0.224 | Q2B |
| Knowledge Graph Completion | NELL-995 | MRR up | 0.1103 | Q2B |
| Knowledge Graph Completion | FB15k-237 | MRR 1p | 0.406 | Q2B |
| Knowledge Graph Completion | FB15k-237 | MRR 2i | 0.295 | Q2B |
| Knowledge Graph Completion | FB15k-237 | MRR 2p | 0.094 | Q2B |
| Knowledge Graph Completion | FB15k-237 | MRR 2u | 0.113 | Q2B |
| Knowledge Graph Completion | FB15k-237 | MRR 3i | 0.423 | Q2B |
| Knowledge Graph Completion | FB15k-237 | MRR 3p | 0.068 | Q2B |
| Knowledge Graph Completion | FB15k-237 | MRR ip | 0.126 | Q2B |
| Knowledge Graph Completion | FB15k-237 | MRR pi | 0.212 | Q2B |
| Knowledge Graph Completion | FB15k-237 | MRR up | 0.076 | Q2B |
| Large Language Model | FB15k | MRR 1p | 0.68 | Q2B |
| Large Language Model | FB15k | MRR 2i | 0.551 | Q2B |
| Large Language Model | FB15k | MRR 2p | 0.21 | Q2B |
| Large Language Model | FB15k | MRR 2u | 0.351 | Q2B |
| Large Language Model | FB15k | MRR 3i | 0.665 | Q2B |
| Large Language Model | FB15k | MRR 3p | 0.142 | Q2B |
| Large Language Model | FB15k | MRR ip | 0.261 | Q2B |
| Large Language Model | FB15k | MRR pi | 0.394 | Q2B |
| Large Language Model | FB15k | MRR up | 0.167 | Q2B |
| Large Language Model | NELL-995 | MRR 1p | 0.422 | Q2B |
| Large Language Model | NELL-995 | MRR 2i | 0.333 | Q2B |
| Large Language Model | NELL-995 | MRR 2p | 0.14 | Q2B |
| Large Language Model | NELL-995 | MRR 2u | 0.113 | Q2B |
| Large Language Model | NELL-995 | MRR 3i | 0.445 | Q2B |
| Large Language Model | NELL-995 | MRR 3p | 0.112 | Q2B |
| Large Language Model | NELL-995 | MRR ip | 0.168 | Q2B |
| Large Language Model | NELL-995 | MRR pi | 0.224 | Q2B |
| Large Language Model | NELL-995 | MRR up | 0.1103 | Q2B |
| Large Language Model | FB15k-237 | MRR 1p | 0.406 | Q2B |
| Large Language Model | FB15k-237 | MRR 2i | 0.295 | Q2B |
| Large Language Model | FB15k-237 | MRR 2p | 0.094 | Q2B |
| Large Language Model | FB15k-237 | MRR 2u | 0.113 | Q2B |
| Large Language Model | FB15k-237 | MRR 3i | 0.423 | Q2B |
| Large Language Model | FB15k-237 | MRR 3p | 0.068 | Q2B |
| Large Language Model | FB15k-237 | MRR ip | 0.126 | Q2B |
| Large Language Model | FB15k-237 | MRR pi | 0.212 | Q2B |
| Large Language Model | FB15k-237 | MRR up | 0.076 | Q2B |
| Inductive knowledge graph completion | FB15k | MRR 1p | 0.68 | Q2B |
| Inductive knowledge graph completion | FB15k | MRR 2i | 0.551 | Q2B |
| Inductive knowledge graph completion | FB15k | MRR 2p | 0.21 | Q2B |
| Inductive knowledge graph completion | FB15k | MRR 2u | 0.351 | Q2B |
| Inductive knowledge graph completion | FB15k | MRR 3i | 0.665 | Q2B |
| Inductive knowledge graph completion | FB15k | MRR 3p | 0.142 | Q2B |
| Inductive knowledge graph completion | FB15k | MRR ip | 0.261 | Q2B |
| Inductive knowledge graph completion | FB15k | MRR pi | 0.394 | Q2B |
| Inductive knowledge graph completion | FB15k | MRR up | 0.167 | Q2B |
| Inductive knowledge graph completion | NELL-995 | MRR 1p | 0.422 | Q2B |
| Inductive knowledge graph completion | NELL-995 | MRR 2i | 0.333 | Q2B |
| Inductive knowledge graph completion | NELL-995 | MRR 2p | 0.14 | Q2B |
| Inductive knowledge graph completion | NELL-995 | MRR 2u | 0.113 | Q2B |
| Inductive knowledge graph completion | NELL-995 | MRR 3i | 0.445 | Q2B |
| Inductive knowledge graph completion | NELL-995 | MRR 3p | 0.112 | Q2B |
| Inductive knowledge graph completion | NELL-995 | MRR ip | 0.168 | Q2B |
| Inductive knowledge graph completion | NELL-995 | MRR pi | 0.224 | Q2B |
| Inductive knowledge graph completion | NELL-995 | MRR up | 0.1103 | Q2B |
| Inductive knowledge graph completion | FB15k-237 | MRR 1p | 0.406 | Q2B |
| Inductive knowledge graph completion | FB15k-237 | MRR 2i | 0.295 | Q2B |
| Inductive knowledge graph completion | FB15k-237 | MRR 2p | 0.094 | Q2B |
| Inductive knowledge graph completion | FB15k-237 | MRR 2u | 0.113 | Q2B |
| Inductive knowledge graph completion | FB15k-237 | MRR 3i | 0.423 | Q2B |
| Inductive knowledge graph completion | FB15k-237 | MRR 3p | 0.068 | Q2B |
| Inductive knowledge graph completion | FB15k-237 | MRR ip | 0.126 | Q2B |
| Inductive knowledge graph completion | FB15k-237 | MRR pi | 0.212 | Q2B |
| Inductive knowledge graph completion | FB15k-237 | MRR up | 0.076 | Q2B |
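The Value column above reports mean reciprocal rank (MRR) for each query structure. As a reminder of how that metric is defined, the sketch below averages the reciprocal of the 1-based rank at which a correct answer appears for each query. The function name `mean_reciprocal_rank` and the example ranks are made up for illustration; the exact filtering and ranking protocol behind the reported numbers is not specified here.

```python
from typing import Sequence


def mean_reciprocal_rank(ranks: Sequence[int]) -> float:
    """Average of 1/rank over queries, where rank is the 1-based position of a correct answer."""
    if not ranks:
        raise ValueError("ranks must be non-empty")
    if any(r < 1 for r in ranks):
        raise ValueError("ranks are 1-based and must be >= 1")
    return sum(1.0 / r for r in ranks) / len(ranks)


# Illustrative ranks only (not taken from the table): three queries whose
# correct answers were ranked 1st, 2nd, and 4th.
example_mrr = mean_reciprocal_rank([1, 2, 4])
```

An MRR of 1.0 means a correct answer is always ranked first, and values closer to 0 mean correct answers tend to appear far down the ranking.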