https://github.com/neuralgraphdatabases/awesome-logical-query

A collection of resources on the topic of Complex Logical Query Answering
https://github.com/neuralgraphdatabases/awesome-logical-query
List: awesome-logical-query
Last synced: 6 months ago
JSON representation
A collection of resources on the topic of Complex Logical Query Answering
Host: GitHub
URL: https://github.com/neuralgraphdatabases/awesome-logical-query
Owner: neuralgraphdatabases
License: cc0-1.0
Created: 2022-08-09T20:37:28.000Z (almost 3 years ago)
Default Branch: main
Last Pushed: 2023-09-30T03:13:32.000Z (over 1 year ago)
Last Synced: 2024-05-23T01:06:09.422Z (about 1 year ago)
Size: 442 KB
Stars: 156
Watchers: 10
Forks: 13
Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project

ultimate-awesome - awesome-logical-query - A collection of resources on the topic of Complex Logical Query Answering. (Other Lists / Julia Lists)
README

        # Complex Logical Query Answering & Neural Graph Databases #



  

  

  

      



A collection of resources on the topic of Complex Logical Query Answering accompanying the paper [Neural Graph Reasoning: Complex Logical Query Answering Meets Graph Databases](https://arxiv.org/abs/2303.14617). 

Feel free to open PRs and issues to add new papers, datasets, and implementations!

This repo follows the Neural Query Engine taxonomy proposed in the paper (Figure 9).

![hierarchy](asset/tree.png)

## :scroll: Categorization of papers

### Graphs | Modalities

  Triple-based KGs (44)

  1. [GQE](https://proceedings.neurips.cc/paper/2018/file/ef50c335cca9f340bde656363ebd02fd-Paper.pdf), NeurIPS 2018  

  2. [GQE+hashing](https://ieeexplore.ieee.org/abstract/document/8970688/?casa_token=gLHFmr7V2ekAAAAA:wwDbUufdkwHTQo68pansuhJsJ2XQAF0P21_mQSu75KVRUkgqARmXBs_VEmFOkFgz_Lq-FXP8OA), ICDM 2019

  3. [CGA](https://arxiv.org/pdf/1910.00084.pdf), K-CAP 2019

  4. [TractOR](https://arxiv.org/pdf/2002.10029.pdf), UAI 2020  

  5. [Query2Box](https://openreview.net/pdf?id=BJgr4kSFDS), ICLR 2020  

  6. [BetaE](https://proceedings.neurips.cc/paper/2020/file/e43739bba7cdb577e9e3e4e42447f5a5-Paper.pdf), NeurIPS 2020

  7. [EmQL](https://proceedings.neurips.cc/paper/2020/hash/fe74074593f21197b7b7be3c08678616-Abstract.html), NeurIPS 2020

  8. [MPQE](https://grlplus.github.io/papers/26.pdf), ICML 2020 Workshop

  9. [RotatE-Box](https://arxiv.org/abs/2005.00480), AKBC 2021

  10. [BiQE](https://ojs.aaai.org/index.php/AAAI/article/view/16630), AAAI 2021

  11. [Knowledge Sheaves: A Sheaf-Theoretic Framework for Knowledge Graph Embedding](https://arxiv.org/pdf/2110.03789.pdf), arxiv 2021

  12. [CQD](https://openreview.net/forum?id=Mos9F9kDwkz), ICLR 2021

  13. [HypE](https://dl.acm.org/doi/abs/10.1145/3442381.3449974?casa_token=tQx20rBUtMwAAAAA:GRqp4JBze6ybzZzeSdCc9cNGlqN0wkAP0BVBPctAgtUVviVSoNiUQcNxBbEuGVGZxnCzJyLqfMmiYg), WWW 2021

  14. [NewLook](http://tonghanghang.org/pdfs/kdd21_newlook.pdf), KDD 2021

  15. [ConE](https://proceedings.neurips.cc/paper/2021/hash/a0160709701140704575d499c997b6ca-Abstract.html), NeurIPS 2021

  16. [PERM](https://proceedings.neurips.cc/paper/2021/hash/c4d2ce3f3ebb5393a77c33c0cd95dc93-Abstract.html), NeurIPS 2021

  17. [Neural-symbolic Approach for Ontology-mediated Query Answering](https://arxiv.org/pdf/2106.14052.pdf), arxiv 2021

  18. [LogicE](https://arxiv.org/pdf/2103.00418.pdf), arxiv 2021

  19. [MLPMix](https://openreview.net/forum?id=tgcAoUVHRIB), ICLR 2022

  20. [FuzzQE](https://ojs.aaai.org/index.php/AAAI/article/view/20310), AAAI 2022

  21. [GNN-QE](https://arxiv.org/abs/2205.10128), ICML 2022

  22. [SMORE](https://arxiv.org/abs/2110.14890), KDD 2022

  23. [kgTransformer](https://keg.cs.tsinghua.edu.cn/jietang/publications/KDD22-Liu-et-al-KG-Transformer.pdf), KDD 2022

  24. [LinE](https://dl.acm.org/doi/pdf/10.1145/3534678.3539338?casa_token=_jPlNJj2TlYAAAAA:pKAA42_lrZ2JIHc1YZV0fchIlRiIcqCy8oCBL2UU3Gm84MOeTSYLfQn31DKtXBbU2yqzC7LUsYvREBw), KDD 2022

  25. [Query2Particles](https://arxiv.org/abs/2204.12847), NAACL 2022

  26. [TAR](https://arxiv.org/abs/2205.14591), arxiv 2022

  27. [TeMP](https://arxiv.org/abs/2205.00782), arxiv 2022

  28. [FLEX](https://arxiv.org/abs/2205.11039), arxiv 2022

  29. [TFLEX](https://arxiv.org/pdf/2205.14307.pdf), arxiv 2022

  30. [GNNQ](https://scholar.archive.org/work/sswyx3mbc5danaxosfnyr45cw4/access/wayback/https://s3-eu-west-1.amazonaws.com/pstorage-oxford-0703704596/36647646/GNNQ_full.pdf?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIA3OGA3B5WHKHW6LWV/20220814/eu-west-1/s3/aws4_request&X-Amz-Date=20220814T213541Z&X-Amz-Expires=10&X-Amz-SignedHeaders=host&X-Amz-Signature=1bf627735fbce377c2478f514fbac1dfcf49423980d372cc71e2bed732d6ae2c), ISWC 2022

  31. [ENeSy](https://arxiv.org/pdf/2209.08779.pdf), NeurIPS 2022

  32. [NodePiece-QE](https://arxiv.org/abs/2210.08008), NeurIPS 2022

  33. [GammaE](https://arxiv.org/pdf/2210.15578.pdf), EMNLP 2022

  34. [NMP-QEM](https://aclanthology.org/2022.emnlp-main.194.pdf), EMNLP 2022

  35. [SignalE](https://link.springer.com/content/pdf/10.1007/978-3-031-10983-6_20.pdf?pdf=inline%20link), KSEM 2022

  36. [Query2Geom](https://link.springer.com/chapter/10.1007/978-3-031-26438-2_6), AICS 2022

  37. [LMPNN](https://openreview.net/pdf?id=SoyOsp7i_l), ICLR 2023

  38. [QTO](https://arxiv.org/pdf/2212.09567.pdf), arxiv 2023

  39. [Var2Vec](https://ora.ox.ac.uk/objects/uuid:27c9b239-20d0-449f-b425-b5065eb128fe/download_file?file_format=application%2Fpdf&safe_filename=Wang_et_al_2022_Efficient_Embeddings_ofAM.pdf&type_of_work=Conference+item), AAAI 2023

  40. [CQD A](https://arxiv.org/pdf/2301.12313.pdf), arxiv 2023

  41. [SQE](https://arxiv.org/pdf/2302.13114.pdf), arxiv 2023

  42. [NRN](https://arxiv.org/abs/2306.01399) KDD 2023

  43. [FIT](https://arxiv.org/pdf/2304.07063.pdf), arxiv 2023  

  44. [WFRE](https://arxiv.org/pdf/2305.04034.pdf), ACL 2023

   

  Hyper-relational KGs (2)

  1. [StarQE](https://arxiv.org/abs/2106.08166), ICLR 2022

  2. [NQE](https://arxiv.org/pdf/2211.13469.pdf), AAAI 2023

  Hyper-graphs and Multi-modal graphs (0)

  0. None as of March 2023

  

### Graphs | Reasoning Domain

  Discrete (Entities only) (45) 

  1. [GQE](https://proceedings.neurips.cc/paper/2018/file/ef50c335cca9f340bde656363ebd02fd-Paper.pdf), NeurIPS 2018  

  2. [GQE+hashing](https://ieeexplore.ieee.org/abstract/document/8970688/?casa_token=gLHFmr7V2ekAAAAA:wwDbUufdkwHTQo68pansuhJsJ2XQAF0P21_mQSu75KVRUkgqARmXBs_VEmFOkFgz_Lq-FXP8OA), ICDM 2019

  3. [CGA](https://arxiv.org/pdf/1910.00084.pdf), K-CAP 2019

  4. [TractOR](https://arxiv.org/pdf/2002.10029.pdf), UAI 2020  

  5. [Query2Box](https://openreview.net/pdf?id=BJgr4kSFDS), ICLR 2020  

  6. [BetaE](https://proceedings.neurips.cc/paper/2020/file/e43739bba7cdb577e9e3e4e42447f5a5-Paper.pdf), NeurIPS 2020

  7. [EmQL](https://proceedings.neurips.cc/paper/2020/hash/fe74074593f21197b7b7be3c08678616-Abstract.html), NeurIPS 2020

  8. [MPQE](https://grlplus.github.io/papers/26.pdf), ICML 2020 Workshop

  9. [RotatE-Box](https://arxiv.org/abs/2005.00480), AKBC 2021

  10. [BiQE](https://ojs.aaai.org/index.php/AAAI/article/view/16630), AAAI 2021

  11. [Knowledge Sheaves: A Sheaf-Theoretic Framework for Knowledge Graph Embedding](https://arxiv.org/pdf/2110.03789.pdf), arxiv 2021

  12. [CQD](https://openreview.net/forum?id=Mos9F9kDwkz), ICLR 2021

  13. [HypE](https://dl.acm.org/doi/abs/10.1145/3442381.3449974?casa_token=tQx20rBUtMwAAAAA:GRqp4JBze6ybzZzeSdCc9cNGlqN0wkAP0BVBPctAgtUVviVSoNiUQcNxBbEuGVGZxnCzJyLqfMmiYg), WWW 2021

  14. [NewLook](http://tonghanghang.org/pdfs/kdd21_newlook.pdf), KDD 2021

  15. [ConE](https://proceedings.neurips.cc/paper/2021/hash/a0160709701140704575d499c997b6ca-Abstract.html), NeurIPS 2021

  16. [PERM](https://proceedings.neurips.cc/paper/2021/hash/c4d2ce3f3ebb5393a77c33c0cd95dc93-Abstract.html), NeurIPS 2021

  17. [Neural-symbolic Approach for Ontology-mediated Query Answering](https://arxiv.org/pdf/2106.14052.pdf), arxiv 2021

  18. [LogicE](https://arxiv.org/pdf/2103.00418.pdf), arxiv 2021

  19. [MLPMix](https://openreview.net/forum?id=tgcAoUVHRIB), ICLR 2022

  20. [StarQE](https://arxiv.org/abs/2106.08166), ICLR 2022

  21. [FuzzQE](https://ojs.aaai.org/index.php/AAAI/article/view/20310), AAAI 2022

  22. [GNN-QE](https://arxiv.org/abs/2205.10128), ICML 2022

  23. [CBR-SUBG](https://proceedings.mlr.press/v162/das22a.html), ICML 2022

  24. [SMORE](https://arxiv.org/abs/2110.14890), KDD 2022

  25. [kgTransformer](https://keg.cs.tsinghua.edu.cn/jietang/publications/KDD22-Liu-et-al-KG-Transformer.pdf), KDD 2022

  26. [LinE](https://dl.acm.org/doi/pdf/10.1145/3534678.3539338?casa_token=_jPlNJj2TlYAAAAA:pKAA42_lrZ2JIHc1YZV0fchIlRiIcqCy8oCBL2UU3Gm84MOeTSYLfQn31DKtXBbU2yqzC7LUsYvREBw), KDD 2022

  27. [Query2Particles](https://arxiv.org/abs/2204.12847), NAACL 2022

  28. [TAR](https://arxiv.org/abs/2205.14591), arxiv 2022

  29. [TeMP](https://arxiv.org/abs/2205.00782), arxiv 2022

  30. [FLEX](https://arxiv.org/abs/2205.11039), arxiv 2022

  31. [GNNQ](https://scholar.archive.org/work/sswyx3mbc5danaxosfnyr45cw4/access/wayback/https://s3-eu-west-1.amazonaws.com/pstorage-oxford-0703704596/36647646/GNNQ_full.pdf?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIA3OGA3B5WHKHW6LWV/20220814/eu-west-1/s3/aws4_request&X-Amz-Date=20220814T213541Z&X-Amz-Expires=10&X-Amz-SignedHeaders=host&X-Amz-Signature=1bf627735fbce377c2478f514fbac1dfcf49423980d372cc71e2bed732d6ae2c), ISWC 2022

  32. [ENeSy](https://arxiv.org/pdf/2209.08779.pdf), NeurIPS 2022

  33. [NodePiece-QE](https://arxiv.org/abs/2210.08008), NeurIPS 2022

  34. [GammaE](https://arxiv.org/pdf/2210.15578.pdf), EMNLP 2022

  35. [NMP-QEM](https://aclanthology.org/2022.emnlp-main.194.pdf), EMNLP 2022

  36. [SignalE](https://link.springer.com/content/pdf/10.1007/978-3-031-10983-6_20.pdf?pdf=inline%20link), KSEM 2022

  37. [Query2Geom](https://link.springer.com/chapter/10.1007/978-3-031-26438-2_6), AICS 2022

  38. [LMPNN](https://openreview.net/pdf?id=SoyOsp7i_l), ICLR 2023

  39. [QTO](https://arxiv.org/pdf/2212.09567.pdf), arxiv 2023

  40. [Var2Vec](https://ora.ox.ac.uk/objects/uuid:27c9b239-20d0-449f-b425-b5065eb128fe/download_file?file_format=application%2Fpdf&safe_filename=Wang_et_al_2022_Efficient_Embeddings_ofAM.pdf&type_of_work=Conference+item), AAAI 2023

  41. [NQE](https://arxiv.org/pdf/2211.13469.pdf), AAAI 2023

  42. [CQD A](https://arxiv.org/pdf/2301.12313.pdf), arxiv 2023

  43. [SQE](https://arxiv.org/pdf/2302.13114.pdf), TMLR 2023

  44. [FIT](https://arxiv.org/pdf/2304.07063.pdf), arxiv 2023  

  45. [WFRE](https://arxiv.org/pdf/2305.04034.pdf), ACL 2023

  

  Discrete Temporal (Entities + Dates) (1)

  1. [TFLEX: Temporal Feature-Logic Embedding Framework for Complex Reasoning over Temporal Knowledge Graph](https://arxiv.org/pdf/2205.14307.pdf), arxiv 2022

  

 Discrete + Continuous (Entities + string/numerical Literals) (0)

  0. None as of March 2023

  

### Graphs | Background Semantics

  Facts-only (ABOX) (42) 

  1. [GQE](https://proceedings.neurips.cc/paper/2018/file/ef50c335cca9f340bde656363ebd02fd-Paper.pdf), NeurIPS 2018  

  2. [GQE+hashing](https://ieeexplore.ieee.org/abstract/document/8970688/?casa_token=gLHFmr7V2ekAAAAA:wwDbUufdkwHTQo68pansuhJsJ2XQAF0P21_mQSu75KVRUkgqARmXBs_VEmFOkFgz_Lq-FXP8OA), ICDM 2019

  3. [TractOR](https://arxiv.org/pdf/2002.10029.pdf), UAI 2020  

  4. [Query2Box](https://openreview.net/pdf?id=BJgr4kSFDS), ICLR 2020  

  5. [BetaE](https://proceedings.neurips.cc/paper/2020/file/e43739bba7cdb577e9e3e4e42447f5a5-Paper.pdf), NeurIPS 2020

  6. [EmQL](https://proceedings.neurips.cc/paper/2020/hash/fe74074593f21197b7b7be3c08678616-Abstract.html), NeurIPS 2020

  7. [MPQE](https://grlplus.github.io/papers/26.pdf), ICML 2020 Workshop

  8. [RotatE-Box](https://arxiv.org/abs/2005.00480), AKBC 2021

  9.  [BiQE](https://ojs.aaai.org/index.php/AAAI/article/view/16630), AAAI 2021

  10. [Knowledge Sheaves: A Sheaf-Theoretic Framework for Knowledge Graph Embedding](https://arxiv.org/pdf/2110.03789.pdf), arxiv 2021

  11. [CQD](https://openreview.net/forum?id=Mos9F9kDwkz), ICLR 2021

  12. [HypE](https://dl.acm.org/doi/abs/10.1145/3442381.3449974?casa_token=tQx20rBUtMwAAAAA:GRqp4JBze6ybzZzeSdCc9cNGlqN0wkAP0BVBPctAgtUVviVSoNiUQcNxBbEuGVGZxnCzJyLqfMmiYg), WWW 2021

  13. [NewLook](http://tonghanghang.org/pdfs/kdd21_newlook.pdf), KDD 2021

  14. [ConE](https://proceedings.neurips.cc/paper/2021/hash/a0160709701140704575d499c997b6ca-Abstract.html), NeurIPS 2021

  15. [PERM](https://proceedings.neurips.cc/paper/2021/hash/c4d2ce3f3ebb5393a77c33c0cd95dc93-Abstract.html), NeurIPS 2021

  16. [LogicE](https://arxiv.org/pdf/2103.00418.pdf), arxiv 2021

  17. [MLPMix](https://openreview.net/forum?id=tgcAoUVHRIB), ICLR 2022

  18. [FuzzQE](https://ojs.aaai.org/index.php/AAAI/article/view/20310), AAAI 2022

  19. [GNN-QE](https://arxiv.org/abs/2205.10128), ICML 2022

  20. [CBR-SUBG](https://proceedings.mlr.press/v162/das22a.html), ICML 2022

  21. [SMORE](https://arxiv.org/abs/2110.14890), KDD 2022

  22. [kgTransformer](https://keg.cs.tsinghua.edu.cn/jietang/publications/KDD22-Liu-et-al-KG-Transformer.pdf), KDD 2022

  23. [LinE](https://dl.acm.org/doi/pdf/10.1145/3534678.3539338?casa_token=_jPlNJj2TlYAAAAA:pKAA42_lrZ2JIHc1YZV0fchIlRiIcqCy8oCBL2UU3Gm84MOeTSYLfQn31DKtXBbU2yqzC7LUsYvREBw), KDD 2022

  24. [Query2Particles](https://arxiv.org/abs/2204.12847), NAACL 2022

  25. [FLEX](https://arxiv.org/abs/2205.11039), arxiv 2022

  26. [TFLEX](https://arxiv.org/pdf/2205.14307.pdf), arxiv 2022

  27. [GNNQ](https://scholar.archive.org/work/sswyx3mbc5danaxosfnyr45cw4/access/wayback/https://s3-eu-west-1.amazonaws.com/pstorage-oxford-0703704596/36647646/GNNQ_full.pdf?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIA3OGA3B5WHKHW6LWV/20220814/eu-west-1/s3/aws4_request&X-Amz-Date=20220814T213541Z&X-Amz-Expires=10&X-Amz-SignedHeaders=host&X-Amz-Signature=1bf627735fbce377c2478f514fbac1dfcf49423980d372cc71e2bed732d6ae2c), ISWC 2022

  28. [ENesy](https://arxiv.org/pdf/2209.08779.pdf), NeurIPS 2022

  29. [NodePiece-QE](https://arxiv.org/abs/2210.08008), NeurIPS 2022

  30. [GammaE](https://arxiv.org/pdf/2210.15578.pdf), EMNLP 2022

  31. [NMP-QEM](https://aclanthology.org/2022.emnlp-main.194.pdf), EMNLP 2022

  32. [SignalE](https://link.springer.com/content/pdf/10.1007/978-3-031-10983-6_20.pdf?pdf=inline%20link), KSEM 2022

  33. [Query2Geom](https://link.springer.com/chapter/10.1007/978-3-031-26438-2_6), AICS 2022

  34. [LMPNN](https://openreview.net/pdf?id=SoyOsp7i_l), ICLR 2023

  35. [QTO](https://arxiv.org/pdf/2212.09567.pdf), arxiv 2023

  36. [Var2Vec](https://ora.ox.ac.uk/objects/uuid:27c9b239-20d0-449f-b425-b5065eb128fe/download_file?file_format=application%2Fpdf&safe_filename=Wang_et_al_2022_Efficient_Embeddings_ofAM.pdf&type_of_work=Conference+item), AAAI 2023

  37. [NQE](https://arxiv.org/pdf/2211.13469.pdf), AAAI 2023

  38. [CQD A](https://arxiv.org/pdf/2301.12313.pdf), arxiv 2023

  39. [SQE](https://arxiv.org/pdf/2302.13114.pdf), TMLR 2023

  40. [NRN](https://arxiv.org/abs/2306.01399) KDD 2023

  41. [FIT](https://arxiv.org/pdf/2304.07063.pdf), arxiv 2023  

  42. [WFRE](https://arxiv.org/pdf/2305.04034.pdf), ACL 2023

  

  Class Hierarchy (3) 

  1. [CGA](https://arxiv.org/pdf/1910.00084.pdf), K-CAP 2019

  2. [TeMP](https://arxiv.org/abs/2205.00782), arxiv 2022

  3. [TAR](https://arxiv.org/abs/2205.14591), arxiv 2022

  Complex axioms (TBOX) (1) 

  1. [Neural-symbolic Approach for Ontology-mediated Query Answering](https://arxiv.org/pdf/2106.14052.pdf), arxiv 2021

### Modeling | Encoder 

  Shallow Embedding (32) 

  1. [GQE](https://proceedings.neurips.cc/paper/2018/file/ef50c335cca9f340bde656363ebd02fd-Paper.pdf), NeurIPS 2018  

  2. [GQE+hashing](https://ieeexplore.ieee.org/abstract/document/8970688/?casa_token=gLHFmr7V2ekAAAAA:wwDbUufdkwHTQo68pansuhJsJ2XQAF0P21_mQSu75KVRUkgqARmXBs_VEmFOkFgz_Lq-FXP8OA), ICDM 2019 

  3. [CGA](https://arxiv.org/pdf/1910.00084.pdf), K-CAP 2019 

  4. [TractOR](https://arxiv.org/pdf/2002.10029.pdf), UAI 2020 

  5. [Query2Box](https://openreview.net/pdf?id=BJgr4kSFDS), ICLR 2020  

  6. [BetaE](https://proceedings.neurips.cc/paper/2020/file/e43739bba7cdb577e9e3e4e42447f5a5-Paper.pdf), NeurIPS 2020

  7. [EmQL](https://proceedings.neurips.cc/paper/2020/hash/fe74074593f21197b7b7be3c08678616-Abstract.html), NeurIPS 2020

  8. [Knowledge Sheaves: A Sheaf-Theoretic Framework for Knowledge Graph Embedding](https://arxiv.org/pdf/2110.03789.pdf), arxiv 2021

  9. [RotatE-Box](https://arxiv.org/abs/2005.00480), AKBC 2021

  10. [Neural-symbolic Approach for Ontology-mediated Query Answering](https://arxiv.org/pdf/2106.14052.pdf), arxiv 2021

  11. [HypE](https://dl.acm.org/doi/abs/10.1145/3442381.3449974?casa_token=tQx20rBUtMwAAAAA:GRqp4JBze6ybzZzeSdCc9cNGlqN0wkAP0BVBPctAgtUVviVSoNiUQcNxBbEuGVGZxnCzJyLqfMmiYg), WWW 2021

  12. [NewLook](http://tonghanghang.org/pdfs/kdd21_newlook.pdf), KDD 2021

  13. [CQD](https://openreview.net/forum?id=Mos9F9kDwkz), ICLR 2021

  14. [ConE](https://proceedings.neurips.cc/paper/2021/hash/a0160709701140704575d499c997b6ca-Abstract.html), NeurIPS 2021

  15. [PERM](https://proceedings.neurips.cc/paper/2021/hash/c4d2ce3f3ebb5393a77c33c0cd95dc93-Abstract.html), NeurIPS 2021

  16. [LogicE](https://arxiv.org/pdf/2103.00418.pdf), arxiv 2021

  17. [FuzzQE](https://ojs.aaai.org/index.php/AAAI/article/view/20310), AAAI 2022

  18. [SMORE](https://arxiv.org/abs/2110.14890), KDD 2022

  19. [LinE](https://dl.acm.org/doi/pdf/10.1145/3534678.3539338?casa_token=_jPlNJj2TlYAAAAA:pKAA42_lrZ2JIHc1YZV0fchIlRiIcqCy8oCBL2UU3Gm84MOeTSYLfQn31DKtXBbU2yqzC7LUsYvREBw), KDD 2022

  20. [TAR](https://arxiv.org/abs/2205.14591), arxiv 2022

  21. [Query2Particles](https://arxiv.org/abs/2204.12847), NAACL 2022

  22. [FLEX](https://arxiv.org/abs/2205.11039), arxiv 2022

  23. [TFLEX](https://arxiv.org/pdf/2205.14307.pdf), arxiv 2022

  24. [GammaE](https://arxiv.org/pdf/2210.15578.pdf), EMNLP 2022

  25. [NMP-QEM](https://aclanthology.org/2022.emnlp-main.194.pdf), EMNLP 2022

  26. [SignalE](https://link.springer.com/content/pdf/10.1007/978-3-031-10983-6_20.pdf?pdf=inline%20link), KSEM 2022

  27. [Query2Geom](https://link.springer.com/chapter/10.1007/978-3-031-26438-2_6), AICS 2022

  28. [QTO](https://arxiv.org/pdf/2212.09567.pdf), arxiv 2023

  29. [Var2Vec](https://ora.ox.ac.uk/objects/uuid:27c9b239-20d0-449f-b425-b5065eb128fe/download_file?file_format=application%2Fpdf&safe_filename=Wang_et_al_2022_Efficient_Embeddings_ofAM.pdf&type_of_work=Conference+item), AAAI 2023

  30. [CQD A](https://arxiv.org/pdf/2301.12313.pdf), arxiv 2023

  31. [FIT](https://arxiv.org/pdf/2304.07063.pdf), arxiv 2023  

  32. [WFRE](https://arxiv.org/pdf/2305.04034.pdf), ACL 2023

  Transductive Encoder (9) 

  1. [MPQE](https://grlplus.github.io/papers/26.pdf), ICML 2020 Workshop

  2. [BiQE](https://ojs.aaai.org/index.php/AAAI/article/view/16630), AAAI 2021

  3. [kgTransformer](https://keg.cs.tsinghua.edu.cn/jietang/publications/KDD22-Liu-et-al-KG-Transformer.pdf), KDD 2022

  4.  [MLPMix](https://openreview.net/forum?id=tgcAoUVHRIB), ICLR 2022

  5.  [StarQE](https://arxiv.org/abs/2106.08166), ICLR 2022

  6.  [ENeSy](https://arxiv.org/pdf/2209.08779.pdf) NeurIPS 2022

  7.  [LMPNN](https://openreview.net/pdf?id=SoyOsp7i_l) ICLR 2023

  8.  [NQE](https://arxiv.org/pdf/2211.13469.pdf), AAAI 2023

  9.  [SQE](https://arxiv.org/pdf/2302.13114.pdf), TMLR 2023

  Inductive Encoder (4) 

  1. (TeMP) [Type-aware embeddings for multi-hop reasoning over knowledge graphs](https://arxiv.org/abs/2205.00782), arxiv 2022

  2. (GNN-QE) [Neural-Symbolic Models for Logical Queries on Knowledge Graphs](https://arxiv.org/abs/2205.10128), ICML 2022

  3. (GNNQ) [GNNQ: A Neuro-Symbolic Approach for Query Answering over Incomplete Knowledge Graphs](https://scholar.archive.org/work/sswyx3mbc5danaxosfnyr45cw4/access/wayback/https://s3-eu-west-1.amazonaws.com/pstorage-oxford-0703704596/36647646/GNNQ_full.pdf?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIA3OGA3B5WHKHW6LWV/20220814/eu-west-1/s3/aws4_request&X-Amz-Date=20220814T213541Z&X-Amz-Expires=10&X-Amz-SignedHeaders=host&X-Amz-Signature=1bf627735fbce377c2478f514fbac1dfcf49423980d372cc71e2bed732d6ae2c), ISWC 2022

  4. (NodePiece-QE) [Inductive Logical Query Answering in Knowledge Graphs](https://arxiv.org/abs/2210.08008), NeurIPS 2022

### Modeling | Processor

  Any Processor (2) 

  1. [TeMP](https://arxiv.org/abs/2205.00782), arxiv 2022 

  2. [NodePiece-QE](https://arxiv.org/abs/2210.08008), NeurIPS 2022

  End-to-end Neural (15) 

  1. [GQE](https://proceedings.neurips.cc/paper/2018/file/ef50c335cca9f340bde656363ebd02fd-Paper.pdf), NeurIPS 2018  

  2. [GQE+hashing](https://ieeexplore.ieee.org/abstract/document/8970688/?casa_token=gLHFmr7V2ekAAAAA:wwDbUufdkwHTQo68pansuhJsJ2XQAF0P21_mQSu75KVRUkgqARmXBs_VEmFOkFgz_Lq-FXP8OA), ICDM 2019

  3. [CGA](https://arxiv.org/pdf/1910.00084.pdf), K-CAP 2019

  4. [MPQE](https://grlplus.github.io/papers/26.pdf), ICML 2020 Workshop

  5. [BiQE](https://ojs.aaai.org/index.php/AAAI/article/view/16630), AAAI 2021

  6. [MLPMix](https://openreview.net/forum?id=tgcAoUVHRIB), ICLR 2022

  7. [StarQE](https://arxiv.org/abs/2106.08166), ICLR 2022

  8. [kgTransformer](https://keg.cs.tsinghua.edu.cn/jietang/publications/KDD22-Liu-et-al-KG-Transformer.pdf), KDD 2022

  9. [Query2Particles](https://arxiv.org/abs/2204.12847), NAACL 2022

  10. [SMORE](https://arxiv.org/abs/2110.14890), KDD 2022

  11. [GNNQ](https://scholar.archive.org/work/sswyx3mbc5danaxosfnyr45cw4/access/wayback/https://s3-eu-west-1.amazonaws.com/pstorage-oxford-0703704596/36647646/GNNQ_full.pdf?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIA3OGA3B5WHKHW6LWV/20220814/eu-west-1/s3/aws4_request&X-Amz-Date=20220814T213541Z&X-Amz-Expires=10&X-Amz-SignedHeaders=host&X-Amz-Signature=1bf627735fbce377c2478f514fbac1dfcf49423980d372cc71e2bed732d6ae2c), ISWC 2022

  12. [SignalE](https://link.springer.com/content/pdf/10.1007/978-3-031-10983-6_20.pdf?pdf=inline%20link), KSEM 2022

  13. [LMPNN](https://openreview.net/pdf?id=SoyOsp7i_l) ICLR 2023

  14. [SQE](https://arxiv.org/pdf/2302.13114.pdf), TMLR 2023

  15. [WFRE](https://arxiv.org/pdf/2305.04034.pdf), ACL 2023

  Neuro-Symbolic | Geometric (8) 

  1. [Query2Box](https://openreview.net/pdf?id=BJgr4kSFDS), ICLR 2020

  2. [Neural-symbolic Approach for Ontology-mediated Query Answering](https://arxiv.org/pdf/2106.14052.pdf), arxiv 2021

  3. [Knowledge Sheaves: A Sheaf-Theoretic Framework for Knowledge Graph Embedding](https://arxiv.org/pdf/2110.03789.pdf), arxiv 2021

  4. [RotatE-Box](https://arxiv.org/abs/2005.00480), AKBC 2021

  5. [NewLook](http://tonghanghang.org/pdfs/kdd21_newlook.pdf), KDD 2021

  6. [HypE](https://dl.acm.org/doi/abs/10.1145/3442381.3449974?casa_token=tQx20rBUtMwAAAAA:GRqp4JBze6ybzZzeSdCc9cNGlqN0wkAP0BVBPctAgtUVviVSoNiUQcNxBbEuGVGZxnCzJyLqfMmiYg), WWW 2021

  7. [ConE](https://proceedings.neurips.cc/paper/2021/hash/a0160709701140704575d499c997b6ca-Abstract.html), NeurIPS 2021

  8. [Query2Geom](https://link.springer.com/chapter/10.1007/978-3-031-26438-2_6), AICS 2022

   

  Neuro-Symbolic | Probabilistic (5) 

  1. [BetaE](https://proceedings.neurips.cc/paper/2020/file/e43739bba7cdb577e9e3e4e42447f5a5-Paper.pdf), NeurIPS 2020

  2. [PERM](https://proceedings.neurips.cc/paper/2021/hash/c4d2ce3f3ebb5393a77c33c0cd95dc93-Abstract.html), NeurIPS 2021

  3. [LinE](https://dl.acm.org/doi/pdf/10.1145/3534678.3539338?casa_token=_jPlNJj2TlYAAAAA:pKAA42_lrZ2JIHc1YZV0fchIlRiIcqCy8oCBL2UU3Gm84MOeTSYLfQn31DKtXBbU2yqzC7LUsYvREBw), KDD 2022

  4. [GammaE](https://arxiv.org/pdf/2210.15578.pdf), EMNLP 2022

  5. [NMP-QEM](https://aclanthology.org/2022.emnlp-main.194.pdf), EMNLP 2022

  Neuro-Symbolic | Fuzzy Logic (16) 

  1. [EmQL](https://proceedings.neurips.cc/paper/2020/hash/fe74074593f21197b7b7be3c08678616-Abstract.html), NeurIPS 2020

  2. [TractOR](https://arxiv.org/pdf/2002.10029.pdf), UAI 2020   

  3. [CQD](https://openreview.net/forum?id=Mos9F9kDwkz), ICLR 2021

  4. [LogicE](https://arxiv.org/pdf/2103.00418.pdf), arxiv 2021

  5. [FuzzQE](https://ojs.aaai.org/index.php/AAAI/article/view/20310), AAAI 2022

  6. [TAR](https://arxiv.org/abs/2205.14591), arxiv 2022

  7. [FLEX](https://arxiv.org/abs/2205.11039), arxiv 2022

  8. [TFLEX](https://arxiv.org/pdf/2205.14307.pdf), arxiv 2022

  9. [GNN-QE](https://arxiv.org/abs/2205.10128), ICML 2022

  10. [ENeSy](https://arxiv.org/pdf/2209.08779.pdf) NeurIPS 2022

  11. [QTO](https://arxiv.org/pdf/2212.09567.pdf), arxiv 2023

  12. [NQE](https://arxiv.org/pdf/2211.13469.pdf), AAAI 2023

  13. [Var2Vec](https://ora.ox.ac.uk/objects/uuid:27c9b239-20d0-449f-b425-b5065eb128fe/download_file?file_format=application%2Fpdf&safe_filename=Wang_et_al_2022_Efficient_Embeddings_ofAM.pdf&type_of_work=Conference+item), AAAI 2023

  14. [CQD A](https://arxiv.org/pdf/2301.12313.pdf), arxiv 2023

  15. [FIT](https://arxiv.org/pdf/2304.07063.pdf), arxiv 2023

  16. [WFRE](https://arxiv.org/pdf/2305.04034.pdf), ACL 2023

### Modeling | Decoder

  Non-Parametric (all) 

  All existing models up to March 2023

  Parametric (0) 

  1. None as of March 2023

### Queries | Query Operators

Progressive scale of supported operators. That is, all models listed under the "NOT" category also support JOIN and UNION.

  PROJECTION + JOIN (intersection) (10) 

  1. [GQE](https://proceedings.neurips.cc/paper/2018/file/ef50c335cca9f340bde656363ebd02fd-Paper.pdf), NeurIPS 2018  

  2. [GQE+hashing](https://ieeexplore.ieee.org/abstract/document/8970688/?casa_token=gLHFmr7V2ekAAAAA:wwDbUufdkwHTQo68pansuhJsJ2XQAF0P21_mQSu75KVRUkgqARmXBs_VEmFOkFgz_Lq-FXP8OA), ICDM 2019

  3. [CGA](https://arxiv.org/pdf/1910.00084.pdf), K-CAP 2019 

  4. [TractOR](https://arxiv.org/pdf/2002.10029.pdf), UAI 2020

  5. [MPQE](https://grlplus.github.io/papers/26.pdf), ICML 2020 Workshop

  6. [Knowledge Sheaves: A Sheaf-Theoretic Framework for Knowledge Graph Embedding](https://arxiv.org/pdf/2110.03789.pdf), arxiv 2021

  7. [BiQE](https://ojs.aaai.org/index.php/AAAI/article/view/16630), AAAI 2021

  8. [StarQE](https://arxiv.org/abs/2106.08166), ICLR 2022

  9. [SMORE](https://arxiv.org/abs/2110.14890), KDD 2022

  10. [GNNQ](https://scholar.archive.org/work/sswyx3mbc5danaxosfnyr45cw4/access/wayback/https://s3-eu-west-1.amazonaws.com/pstorage-oxford-0703704596/36647646/GNNQ_full.pdf?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIA3OGA3B5WHKHW6LWV/20220814/eu-west-1/s3/aws4_request&X-Amz-Date=20220814T213541Z&X-Amz-Expires=10&X-Amz-SignedHeaders=host&X-Amz-Signature=1bf627735fbce377c2478f514fbac1dfcf49423980d372cc71e2bed732d6ae2c), ISWC 2022

   + UNION (9) 

  1. [Query2Box](https://openreview.net/pdf?id=BJgr4kSFDS), ICLR 2020

  2. [EmQL](https://proceedings.neurips.cc/paper/2020/hash/fe74074593f21197b7b7be3c08678616-Abstract.html), NeurIPS 2020

  3. [HypE](https://dl.acm.org/doi/abs/10.1145/3442381.3449974?casa_token=tQx20rBUtMwAAAAA:GRqp4JBze6ybzZzeSdCc9cNGlqN0wkAP0BVBPctAgtUVviVSoNiUQcNxBbEuGVGZxnCzJyLqfMmiYg), WWW 2021

  4. [NewLook](http://tonghanghang.org/pdfs/kdd21_newlook.pdf), KDD 2021

  5. [PERM](https://proceedings.neurips.cc/paper/2021/hash/c4d2ce3f3ebb5393a77c33c0cd95dc93-Abstract.html), NeurIPS 2021

  6. [CQD](https://openreview.net/forum?id=Mos9F9kDwkz) ICLR’21

  7. [Neural-symbolic Approach for Ontology-mediated Query Answering](https://arxiv.org/pdf/2106.14052.pdf), arxiv 2021

  8. [kgTransformer](https://keg.cs.tsinghua.edu.cn/jietang/publications/KDD22-Liu-et-al-KG-Transformer.pdf), KDD 2022

  9. [Query2Geom](https://link.springer.com/chapter/10.1007/978-3-031-26438-2_6), AICS 2022

   + NOT (negation) (23) 

  1. [BetaE](https://proceedings.neurips.cc/paper/2020/file/e43739bba7cdb577e9e3e4e42447f5a5-Paper.pdf), NeurIPS 2020

  2. [ConE](https://proceedings.neurips.cc/paper/2021/hash/a0160709701140704575d499c997b6ca-Abstract.html), NeurIPS 2021

  3. [LogicE](https://arxiv.org/pdf/2103.00418.pdf), arxiv 2021

  4. [MLPMix](https://openreview.net/forum?id=tgcAoUVHRIB), ICLR 2022

  5. [FuzzQE](https://ojs.aaai.org/index.php/AAAI/article/view/20310), AAAI 2022

  6. [GNN-QE](https://arxiv.org/abs/2205.10128), ICML 2022

  7. [LinE](https://dl.acm.org/doi/pdf/10.1145/3534678.3539338?casa_token=_jPlNJj2TlYAAAAA:pKAA42_lrZ2JIHc1YZV0fchIlRiIcqCy8oCBL2UU3Gm84MOeTSYLfQn31DKtXBbU2yqzC7LUsYvREBw), KDD 2022

  8. [Query2Particles](https://arxiv.org/abs/2204.12847), NAACL 2022

  9. [GammaE](https://arxiv.org/pdf/2210.15578.pdf), EMNLP 2022

  10. [NMP-QEM](https://aclanthology.org/2022.emnlp-main.194.pdf), EMNLP 2022

  11. [TAR](https://arxiv.org/abs/2205.14591), arxiv 2022

  12. [FLEX](https://arxiv.org/abs/2205.11039), arxiv 2022

  13. [TFLEX](https://arxiv.org/pdf/2205.14307.pdf), arxiv 2022

  14. [ENeSy](https://arxiv.org/pdf/2209.08779.pdf), NeurIPS 2022

  15. [SignalE](https://link.springer.com/content/pdf/10.1007/978-3-031-10983-6_20.pdf?pdf=inline%20link), KSEM 2022

  16. [QTO](https://arxiv.org/pdf/2212.09567.pdf), arxiv 2023

  17. [LMPNN](https://openreview.net/pdf?id=SoyOsp7i_l), ICLR 2023

  18. [NQE](https://arxiv.org/pdf/2211.13469.pdf), AAAI 2023

  19. [Var2Vec](https://ora.ox.ac.uk/objects/uuid:27c9b239-20d0-449f-b425-b5065eb128fe/download_file?file_format=application%2Fpdf&safe_filename=Wang_et_al_2022_Efficient_Embeddings_ofAM.pdf&type_of_work=Conference+item), AAAI 2023

  20. [CQD A](https://arxiv.org/pdf/2301.12313.pdf), arxiv 2023

  21. [SQE](https://arxiv.org/pdf/2302.13114.pdf), TMLR 2023

  22. [FIT](https://arxiv.org/pdf/2304.07063.pdf), arxiv 2023  

  23. [WFRE](https://arxiv.org/pdf/2305.04034.pdf), ACL 2023

   Kleene Plus (1) 

  1. [RotatE-Box](https://arxiv.org/abs/2005.00480), AKBC 2021

   FILTER (0) 

  0. None as of March 2023

   AGGREGATIONS (GROUP BY, ORDER BY, etc) (0) 

  0. None as of March 2023

### Queries | Query Patterns

  Tree-structured (47) 

  1. All existing processors as of March 2023

  Arbitrary DAGs (1) 

  1. [FIT](https://arxiv.org/pdf/2304.07063.pdf), arxiv 2023

  Cyclic Queries (1) 

  1. [FIT](https://arxiv.org/pdf/2304.07063.pdf), arxiv 2023

### Queries | Projected Variables

   Zero Projected Vars (ASK queries) (0) 

  0. None as of March 2023

   One Projected Variable (all) 

  1. All processors as of March 2023

   Multiple Projected Variables (1) 

 

  1. (EFOk-CQA) [EFOk-CQA: Towards Knowledge Graph Complex Query Answering beyond Set Operation](https://arxiv.org/abs/2307.13701) arxiv 2023

### Metrics

- Original metrics: ROC AUC and Average Percentile Rank over 1000 negative samples. 

  - Proposed by original [GQE](https://proceedings.neurips.cc/paper/2018/file/ef50c335cca9f340bde656363ebd02fd-Paper.pdf) (NeurIPS 2018), used by [GQE+hashing](https://ieeexplore.ieee.org/abstract/document/8970688/?casa_token=gLHFmr7V2ekAAAAA:wwDbUufdkwHTQo68pansuhJsJ2XQAF0P21_mQSu75KVRUkgqARmXBs_VEmFOkFgz_Lq-FXP8OA), [CGA](https://arxiv.org/pdf/1910.00084.pdf), and [TractOR](https://arxiv.org/pdf/2002.10029.pdf). **Not used after**.

- Generalization: predicting hard answers (MRR / Hits@k).

  - Introduced by [Query2Box](https://openreview.net/pdf?id=BJgr4kSFDS) (ICLR 2020). **Standard metric**.

- Generalization: from ranking to binary classification

  - Proposed in [Approximate knowledge graph query answering: from ranking to binary classification](https://arxiv.org/abs/2102.11389)

- Entailment: faithfulness - ability to recover easy answers (no link prediction) (MRR / Hits@k)

  - Proposed by [EmQL](https://proceedings.neurips.cc/paper/2020/hash/fe74074593f21197b7b7be3c08678616-Abstract.html) (NeurIPS 2020)

- Estimating the cardinality of answer set size (Spearman's rank correlation, MAPE)

  - Used in [GNN-QE](https://arxiv.org/abs/2205.10128), [QTO](https://arxiv.org/pdf/2212.09567.pdf)

- Predicting easy answers before hard answers (ROC-AUC)

  - Used in [NodePiece-QE](https://arxiv.org/abs/2210.08008)

- Multiple variable queries, (multiply / marginal / joint) x (MRR/ HITs@k)

  - Proposed in [EFOk-CQA](https://arxiv.org/abs/2307.13701) arxiv 2023

## 📈 Datasets and Benchmarking

### Inference (datasets)

  Transductive datasets (15) 

  1. (GQE datasets) [GQE](https://proceedings.neurips.cc/paper/2018/file/ef50c335cca9f340bde656363ebd02fd-Paper.pdf), NeurIPS 2018

  2. (Query2Box datasets) [Query2Box](https://openreview.net/pdf?id=BJgr4kSFDS), ICLR 2020  

  3. (BetaE datasets) [BetaE](https://proceedings.neurips.cc/paper/2020/file/e43739bba7cdb577e9e3e4e42447f5a5-Paper.pdf), NeurIPS 2020

  4. (Regex datasets) [Regex Queries](https://arxiv.org/abs/2005.00480), AKBC 2021

  5. (BiQE dataset) [BiQE](https://ojs.aaai.org/index.php/AAAI/article/view/16630), AAAI 2021

  6. (Query2Onto datasets) [Neural-symbolic Approach for Ontology-mediated Query Answering](https://arxiv.org/pdf/2106.14052.pdf), arxiv 2021

  7. (EFO-1 dataset) [Benchmarking the Combinatorial Generalizability of Complex Query Answering on Knowledge Graphs](https://arxiv.org/abs/2109.08925), NeurIPS 2021 (Datasets and Benchmarks)

  8. (SMORE datasets) [SMORE: Knowledge Graph Completion and Multi-hop Reasoning in Massive Knowledge Graphs](https://arxiv.org/abs/2110.14890), KDD 2022

  9. (StarQE dataset) [Query Embedding on Hyper-relational Knowledge Graphs](https://arxiv.org/abs/2106.08166) ICLR 2022,

  10. (TFLEX dataset) [TFLEX: Temporal Feature-Logic Embedding Framework for Complex Reasoning over Temporal Knowledge Graph](https://arxiv.org/pdf/2205.14307.pdf), arxiv 2022

  11. (WD50K-NFOL dataset) [NQE: N-ary Query Embedding for Complex Query Answering over Hyper-relational Knowledge Graphs](https://arxiv.org/pdf/2211.13469.pdf), AAAI 2023

  12. (SQE dataset) [Sequential Query Encoding For Complex Query Answering on Knowledge Graphs](https://arxiv.org/pdf/2302.13114.pdf)

  13. (Real EFO-1) [On Existential First Order Queries Inference on Knowledge Graphs](https://arxiv.org/pdf/2304.07063.pdf), arxiv 2023

  14. (Numerical CQA dataset) [Knowledge Graph Reasoning over Entities and Numerical Values](https://arxiv.org/abs/2306.01399) KDD 2023

  15. (EFOk-CQA) [EFOk-CQA: Towards Knowledge Graph Complex Query Answering beyond Set Operation](https://arxiv.org/abs/2307.13701) arxiv 2023

  Inductive datasets (3) 

  1. (TeMP datasets) [Type-aware embeddings for multi-hop reasoning over knowledge graphs](https://arxiv.org/abs/2205.00782), arxiv 2022

  2. (InductiveQE datasets) [Inductive Logical Query Answering in Knowledge Graphs](https://arxiv.org/abs/2210.08008) NeurIPS 2022

  3. (GNNQ dataset) [GNNQ: A Neuro-Symbolic Approach for Query Answering over Incomplete Knowledge Graphs](https://scholar.archive.org/work/sswyx3mbc5danaxosfnyr45cw4/access/wayback/https://s3-eu-west-1.amazonaws.com/pstorage-oxford-0703704596/36647646/GNNQ_full.pdf?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIA3OGA3B5WHKHW6LWV/20220814/eu-west-1/s3/aws4_request&X-Amz-Date=20220814T213541Z&X-Amz-Expires=10&X-Amz-SignedHeaders=host&X-Amz-Signature=1bf627735fbce377c2478f514fbac1dfcf49423980d372cc71e2bed732d6ae2c) ISWC 2022

### GQE Datasets

Are Bio and Reddit available at all? Introduced in GQE, used in 4 papers overall (GQE, GQE+hashing, CGA, TractOR).

### BetaE Datasets

The main difference with Query2Box datasets: queries in the BetaE datasets have less than 100 answers. Has queries with negation.

Introduced in [Beta Embeddings for Multi-Hop Logical Reasoning in Knowledge Graphs](https://proceedings.neurips.cc/paper/2020/file/e43739bba7cdb577e9e3e4e42447f5a5-Paper.pdf), NeurIPS 2020

   Graphs 

  | Dataset | Entities | Relations | Training Edges | Validation Edges | Test Edges | Total Edges

  |-----|----:|----:|----:|----:|----:|----:|

  | FB15k    | 14,951 | 1,345 | 483,142 | 50,000 | 59,071 | 592,213 |

  | FB15k237 | 14,505 | 237   | 272,115 | 17,526 | 20,438 | 310,079 |

  | NELL995  | 63,361 | 200   | 114,213 | 14,324 | 14,267 | 142,804 | 

   Queries 

  | Queries | Training | Training |  Validation | Validation |  Test | Test |

  |---------|---------:|-----------:|-----:|---------:|-----------:|-----:|

  | Dataset | 1p/2p/3p/2i/3i  | 2in/3in/inp/pin/pni  | 1p | others | 1p | others |

  | FB15k    | 273,710 | 27,371 | 59,097 | 8,000 | 67,016 | 8,000 |

  | FB15k237 | 149,689 | 14,968 | 20,101 | 5,000 | 22,812 | 5,000 |

  | NELL995  | 107,982 | 10,798 | 16,927 | 4,000 | 17,034 | 4,000 | 

   Average Number of Answers 

  | Dataset | 1p | 2p |  3p | 2i |  3i | ip | pi | 2u | up | 2in | 3in | inp | pin | pni |

  |---------|---:|---:|----:|---:|----:|---:|---:|---:|---:|---:|----:|---:|---:|---:|

  | FB15k   | 1.7 | 19.6 | 24.4 | 8.0 | 5.2 | 18.3 | 12.5 | 18.9 | 23.8 | 15.9 | 14.6 | 19.8 | 21.6 | 16.9 |

  | FB15k237 | 1.7 | 17.3 | 24.3 | 6.9 | 4.5 | 17.7 | 10.4 | 19.6 | 24.3 | 16.3 | 13.4 | 19.5 | 21.7 | 18.2 |

  | NELL995 | 1.6 | 14.9 | 17.5 | 5.7 | 6.0 | 17.4 | 11.9 | 14.9 | 19.0 | 12.9 | 11.1 | 12.9 | 16.0 | 13.0 | 

  

### Query2Box Datasets

Introduced in [Query2box: Reasoning over Knowledge Graphs in Vector Space Using Box Embeddings](https://openreview.net/pdf?id=BJgr4kSFDS), ICLR 2020. 

EPFO queries are considered **easier** than BetaE datasets. Doesn't have queries with negations.

   Graphs 

  | Dataset | Entities | Relations | Training Edges | Validation Edges | Test Edges | Total Edges

  |-----|----:|----:|----:|----:|----:|----:|

  | FB15k    | 14,951 | 1,345 | 483,142 | 50,000 | 59,071 | 592,213 |

  | FB15k237 | 14,505 | 237   | 272,115 | 17,526 | 20,438 | 310,079 |

  | NELL995  | 63,361 | 200   | 114,213 | 14,324 | 14,267 | 142,804 | 

   Queries 

  | Queries | Training | Training |  Validation | Validation |  Test | Test |

  |---------|---------:|-----------:|-----:|---------:|-----------:|-----:|

  | Dataset | 1p  | others  | 1p | others | 1p | others |

  | FB15k    | 273,710 | 273,710 | 59,097 | 8,000 | 67,016 | 8,000 |

  | FB15k237 | 149,689 | 149,689 | 20,101 | 5,000 | 22,812 | 5,000 |

  | NELL995  | 107,982 | 107,982 | 16,927 | 4,000 | 17,034 | 4,000 | 

   Average Number of Answers 

  | Dataset | 1p | 2p |  3p | 2i |  3i | ip | pi | 2u | up | 

  |---------|---:|---:|----:|---:|----:|---:|---:|---:|---:|

  | FB15k   | 10.8 | 255.6 | 250.0 | 90.3 | 64.1 | 593.8 | 190.1 | 27.8 | 227.0 | 

  | FB15k237 | 13.3 | 131.4 | 215.3 | 69.0 | 48.9 | 593.8 | 257.7 | 35.6 | 127.7 | 

  | NELL995 | 8.5 | 56.6 | 65.3 | 30.3 | 15.9 | 310.0 | 144.9 | 14.4 | 62.5 | 

  

### CGA Datasets

GQE-like patterns mined on subsets of DBpedia and Wikidata. The datasets are DB18 and WikiGeo19, introduced in [CGA](https://arxiv.org/pdf/1910.00084.pdf), K-CAP 2019.

**As of Sept 2022: not available**.

### Regex Queries

Queries emulating property paths in SPARQL with variable length of relation paths (up to length 5). Queries are EPFO queries, i.e., no negation.

New operators over relations resemble those from SPARQL:

* $r_1 / r_2 / \dots$ - relational path, aka classic projection queries

* $r_1 \lor r_2$ - a union of decomposed patterns $(e, r_1, ?) \lor (e, r_2, ?)$

* Kleene plus $r^{+}$ - one or more occurence of relation $r$, eg, $r_1/r_2^{+}$ corresponds to $r_1 / r_2$, $r_1 / r_2 / r_2$, $r_1 / r_2 / r_2 / r_2 / \dots$ up to some final depth. Those *can* be cyclic patterns. 

Two datasets:

* FB15k-Regex is based on Freebase, queries have less than 50 answers, 21 query types

* Wiki100-Regex is based on query logs from the official Wikidata SPARQL endpoint, 5 query types. 

Introduced in [RotatE-Box](https://arxiv.org/abs/2005.00480), AKBC 2021.

Repo: [GitHub](https://github.com/dair-iitd/kbi-regex) - no actual data dumps are present :(

   Graphs 

  | Dataset | Entities | Relations | Training Edges | Validation Edges | Test Edges | Total Edges

  |-----|----:|----:|----:|----:|----:|----:|

  | FB15k    | 14,951 | 1,345 | 483,142 | 50,000 | 59,071 | 592,213 |

  | Wiki100 | 41,291 | 100   | 389,795 | 21,655 | 21,656 | 433,106 |

   Queries 

  #### FB15k-Regex

  | Query type | Train | Valid | Test |

  |-----------:|------:|------:|-----:|

  |  $(e_1, r_1^+, ?)$    | 24,476 | 4,614 | 8,405 |

  |  $(e_1, r_1/r_2, ?)$  | 25,378 | 4,927  | 8,844 |

  | $(e_1, r_1^+/r_2^+, ?)$  | 26,391  | 4,978  | 9,028 |

  | $(e_1, r_1^+/r_2^+/r_3^+, ?)$ | 25,470  | 4,878 | 8,816 |

  | $(e_1, r_1/r_2^+, ?)$  | 26,335 | 5,007 | 9,062 |

  | $(e_1, r_1^+/r_2, ?)$  | 27,614 | 5,229 | 9,429 |

  | $(e_1, r_1^+/r_2^+/r_3, ?)$ | 27,865 | 5,283 | 9,509 |

  | $(e_1, r_1^+/r_2/r_3^+, ?)$ | 26,366 | 5,058 | 9,159 |

  | $(e_1, r_1/r_2^+/r_3^+, ?)$ | 26,366 | 5,045 | 9,099 |

  | $(e_1, r_1/r_2/r_3^+, ?)$   | 26,703 | 5,155 | 9,313 |

  | $(e_1, r_1/r_2^+/r_3, ?)$   | 28,005 | 5,380 | 9,688 |

  | $(e_1, r_1^+/r_2/r_3, ?)$   | 27,884 | 5,338 | 9,632 |

  | $(e_1, r_1\lor r_2, ?)$ | 30,080 | 5,828 | 9,664 |

  | $(e_1, (r_1\lor r_2)/r_3, ?)$ | 31,559 | 6,606 | 10,974 |

  | $(e_1, r_1/(r_2\lor r_3), ?)$ | 41,886 | 7,755 | 13,611 |

  | $(e_1, r_1^+\lor r_2^+, ?)$   | 23,109 | 4,469 | 8,367  |

  | $(e_1, (r_1\lor r_2)/r_3^+, ?)$ | 27,658 | 5,738 | 9,711 |

  | $(e_1, (r_1^+\lor r_2^+)/r_3, ?)$ | 24,462 | 4,865 | 8,863 |

  | $(e_1, r_1^+/(r_2\lor r_3), ?)$ | 27,676 | 5,340 | 9,267 |

  | $(e_1, r_1/(r_2^+\lor r_3^+), ?)$ | 28,542 | 5,475 | 9,436 |

  | $(e_1, (r_1\lor r_2)^+, ?)$  | 26,260 | 5,523 | 10,360 |

  |  **Total**                   | 580,085 | 112,491 | 200,237 |

  #### Wiki100-Regex

  | Query type | Train | Valid | Test |

  |-----------:|------:|------:|-----:|

  |  $(e_1, r_1^+, ?)$    | 490,562 | 24,878 | 23,443 |

  | $(e_1, r_1^+/r_2^+, ?)$  | 6,945  | 620  | 772 |

  | $(e_1, r_1/r_2^+, ?)$  | 85,253 | 10,013 | 8,377 |

  | $(e_1, r_1\lor r_2, ?)$ | 274,012 | 14,900 | 14,915 |

  | $(e_1, (r_1\lor r_2)^+, ?)$  | 348,274 | 15,720 | 15,311 |

  |  **Total**                   | 1,205,046 | 66,131 | 62,818 |

### DAG Queries

Conjunctive queries (w/o union) not limited to 9 patterns from Query2Box/BetaE datasets. The task is to predict *all* intermediate entities, not just final leaf nodes. Query depth: 2-5; max 3 intersecting branches.

Introduced in [Answering complex queries in knowledge graphs with bidirectional sequence encoders](https://ojs.aaai.org/index.php/AAAI/article/view/16630), AAAI 2021.

New FB15K-237-CQ and WN18RR-CQ datasets have two variations:

* CQ (conjunctive queries) - Training on triples + paths + DAGs, Validation/Test on **DAGs only**;

* Paths - Training on triples + paths, Validation/Test on **paths only**

**Sept 2022: the datasets are not publicly available.**

   Graphs 

  

  | Dataset | FB15K-237-CQ | FB15K-237-CQ | FB15K-237-CQ | WN18RR-CQ | WN18RR-CQ | WN18RR-CQ |

  |-----|----:|----:|----:|----:|----:|----:|

  | Dataset | Train | Validation | Test | Train | Validation | Test |

  | Entities | 14,505 | - | - | 40,943 | - | - |

  | Relations | 237 | 237 | 237 | 11 | 11 | 11 |

  | Triples  | 272,115 | - | - | 86,835 | - | - |

  | Paths | 50,000 | - | - | 10,000 | - | - |

  | DAGs | 48,865 | 2,785 | 2,599 | 9,465 | 112 | 95 |

  | Avg Masks | 1.86 | 5.91 | 6.05 | 1.84 | 5.13 | 4.91 |

  | Avg Query Len (Tokens) | 152 | 460 | 479 | 71 | 198 | 199 |   

   Queries 

  

  No detailed breakdown by query type is available, only the DAGs stats from the main table.

  | Dataset | FB15K-237-CQ | FB15K-237-CQ | FB15K-237-CQ | WN18RR-CQ | WN18RR-CQ | WN18RR-CQ |

  |-----|----:|----:|----:|----:|----:|----:|

  | Dataset | Train | Validation | Test | Train | Validation | Test |

  | Paths | 50,000 | - | - | 10,000 | - | - |

  | DAGs | 48,865 | 2,785 | 2,599 | 9,465 | 112 | 95 |

  | Avg Masks | 1.86 | 5.91 | 6.05 | 1.84 | 5.13 | 4.91 |

  | Avg Query Len (Tokens) | 152 | 460 | 479 | 71 | 198 | 199 | 

###  EFO-1 Queries

Existential First-Order queries  with Single Free Variable, extended from BetaE. The goal is to evaluate the combinatorial generalizability. 

Introduced in [Benchmarking the Combinatorial Generalizability of Complex Query Answering on Knowledge Graphs](https://arxiv.org/abs/2109.08925), NeurIPS 2021 (Datasets and Benchmarks)

   Graphs 

  | Queries | Training | Training |  Validation | Validation |  Test | Test |

  |---------|---------:|-----------:|-----:|---------:|-----------:|-----:|

  | Dataset | 1p/2p/3p/2i/3i  | 2in/3in/inp/pin/pni  | 1p | others | 1p | others |

  | FB15k    | 273,710 | 27,371 | 59,097 | 8,000 | 67,016 | 8,000 |

  | FB15k237 | 149,689 | 14,968 | 20,101 | 5,000 | 22,812 | 5,000 |

  | NELL995  | 107,982 | 10,798 | 16,927 | 4,000 | 17,034 | 4,000 | 

   Queries 

Cannot list all the 301 query types. Details can be found in a summarization excel file [here](https://docs.google.com/spreadsheets/d/18RJL1puHpbGZJKLkffqKEbzRmRFdSnGFQvekApFWc3Q/edit?usp=sharing).

### Real EFO-1 dataset

Rethinking the EFO-1 formulation by introducing leaf nodes, multi edge, and cycle.

For standard FB15k, FB15k-237, and NELL - 9 new query types (10 with reworked `pni` type) including:

-  `l` - queries with existentially quantified variables as leaf nodes (2il, 3il)

-  `m` - queries with multiple relation projection edges from one variable to another (2m, 2nm, 3mp, 3pm, im)

-  `c` - queries with cycles (3c, cm)

All new query have 5000 instances in three KGs. Introduced in [On Existential First Order Queries Inference on Knowledge Graphs](https://arxiv.org/pdf/2304.07063.pdf), arxiv 2023. The dataset can be downloaded from [here](https://drive.google.com/drive/folders/17bPr6_ESqh5D0LgWNgpE4mY8gpg2iC5o?usp=sharing).

#### EPFO queries with Literals

Based on a variation of the FB15k-237 dataset with entity attributes (12,390 entities, 237 relations, 115 attributes, 29,229 (?) triples).

Literals are restricted to numerical values, three additional filter functions (less than, equal, greater then).

The dataset includes standard 9 EPFO query types and adds 8 more variations of those patterns enriched with literals:

- 5 query types where literals are in queries, but the answer is an entity (ai, 2ai, pai, aip, au)

- 3 query types where literals are in queries, and the answer is a mean of relevant literal values (1ap, 2ap, 3ap)

Introduced in [LitCQD: Multi-Hop Reasoning in Incomplete Knowledge Graphs with Numeric Literals](https://arxiv.org/pdf/2304.14742.pdf), arxiv 2023

###  SQE Queries

Existential First-Order queries aimed at evaluating compositional generalization to OOD query patterns (29 in-distribution types, 29 out-of-distribution). In contrast to BetaE datasets, does **not** have restrictions on the number of answers per query, long tails are possible.  

Introduced in [Sequential Query Encoding For Complex Query Answering on Knowledge Graphs](https://openreview.net/pdf?id=ERqGqZzSu5)

   Graphs 

  | Queries | Training | Training |  Validation | Test |

  |---------|---------:|-----------:|-----:|---------:|

  | Dataset | 1p | others  | all | all |

  | FB15k    | 273,710 | 821,130 | 8,000 | 8,000 |

  | FB15k237 | 149,689 | 449,067 | 5,000 | 5,000 |

  | NELL995  | 107,982 | 323,946 | 4,000 | 4,000 | 

   Queries 

58 query types, refer to Appendix A in the paper for the full list of patterns.

### EFOk queries

Introduced by (EFOk-CQA) [EFOk-CQA: Towards Knowledge Graph Complex Query Answering beyond Set Operation](https://arxiv.org/abs/2307.13701) with 741 query types in total.

Featured by:

- existential first-order queries with more than multiple variables.

- combinatorial space with multi-edge and cyclic queries

### Numerical CQA Queries

The Numerical CQA queries both include entities and typed numerical attribute values. 

Introduced by [Knowledge Graph Reasoning over Entities and Numerical Values](https://arxiv.org/abs/2306.01399)

   Graphs 

  

|  Graphs | Data Split |    1p   |    2p   |    2i   |    3i   |    pi   |    ip   |    2u   |    up   |    All    |

|:-------:|:----------:|:-------:|:-------:|:-------:|:-------:|:-------:|:-------:|:-------:|:-------:|:---------:|

|  FB15K  |  Training  | 304,633 | 138,192 | 226,729 | 288,874 | 260,057 | 233,834 | 284,301 | 284,931 | 2,021,551 |

|         | Validation |  8,271  |  15,860 |  23,359 |  28,836 |  25,081 |  22,930 |  29,187 |  29,210 |  182,734  |

|         |   Testing  |  7,969  |  15,431 |  23,346 |  28,865 |  24,810 |  22,232 |  29,212 |  29,274 |  181,139  |

|  DB15K  |  Training  | 124,851 |  99,698 | 140,427 | 190,413 | 171,353 | 163,687 | 190,364 | 194,244 | 1,275,037 |

|         | Validation |  3,529  |  10,388 |  9,792  |  13,817 |  14,594 |  16,651 |  19,512 |  19,792 |  108,075  |

|         |   Testing  |  3,387  |  10,047 |  9,914  |  14,603 |  14,642 |  15,897 |  19,504 |  19,773 |  107,767  |

| YAGO15K |  Training  |  84,014 |  76,238 | 136,282 | 183,850 | 162,712 | 145,994 | 183,963 | 183,459 | 1,156,512 |

|         | Validation |  2,833  |  7,986  |  10,757 |  16,884 |  13,485 |  13,899 |  18,444 |  19,105 |  103,393  |

|         |   Testing  |  2,713  |  7,949  |  10,935 |  17,171 |  13,481 |  13,526 |  18,433 |  18,997 |  103,205  |

### Type-Aware Datasets

In addition to a normal graph of entities (instances) a-la BetaE datasets, the type-aware datasets offer an additional set of classes, classes hierarchy (from a pre-existing ontology), and `instanceOf` links between entities and classes.

Those datasets might include an additional task of predicting types of answer entities (Concept Retrieval).

- LUBM, introduced in [Neuro-Symbolic Ontology-Mediated Query Answering](https://openreview.net/pdf?id=wwVb95CkrFm), OpenReview 2021

- NELL, introduced in [Neuro-Symbolic Ontology-Mediated Query Answering](https://openreview.net/pdf?id=wwVb95CkrFm), OpenReview 2021. The base graph is the same as in the BetaE datasets, but a few ontological axioms were added.

- YAGO 4, introduced in [TAR: Neural Logical Reasoning across TBox and ABox](https://github.com/lilv98/TAR)

- DBpedia, introduced in [TAR: Neural Logical Reasoning across TBox and ABox](https://github.com/lilv98/TAR)

LUBM and NELL employ ontological axioms of the DL-Lite (R) family of Description Logics.

   Graphs 

  **TODO** Figure out Concept Retrieval edges in TAR 

  

  | Dataset | Entities | Relations | Axioms | Base Graph | Materialized Graph | 

  |-----|----:|----:|----:|----:|----:|

  | LUBM | 55,684 | 28 | 68 | 284k | 565k | 

  | NELL | 63,361 | 400 | 307 | 285k | 497k | 

  Axioms breakdown in ontologies for LUBM and NELL

  | Rules | LUBM | NELL | 

  |------:|-----:|-----:|

  | $\mathcal{O}$ (Total) |  68  | 307  |

  | $A \sqsubseteq A'$ (Subclass) | 13 | - |

  | $p \sqsubseteq s$ | 5 | 92 |

  | $p^{-} \sqsubseteq s$ | 28 | 215 |

  | $\exists p \sqsubseteq A$ | 11 | - |

  | $\exists p^{-} \sqsubseteq A$ | 11 | - |

  | Dataset | Entities | Relations | Classes | Training Edges | Validation Edges | Test Edges | Entity-Class Edges | Class Hierarchy Edges | Total Edges |

  |-----|----:|----:|----:|----:|----:|----:|---:|---:|---:|

  | YAGO 4 | 32,465 | 75 | 8,382 | 101,417 | 1,000 | 1,000 | 83,291 | 16,644 | 184,708 |

  | DBpedia | 28,824 | 327 | 981 | 136,821 | 1,000 | 1,000 | 225,436 | 2,582 | 362,257 |

  

   Queries 

  

  | Dataset | Train / Test | 1p | 2p | 3p | 2i | 3i | ip | pi | 2u | up |

  |--------:|-------------:|---:|---:|---:|---:|---:|---:|---:|---:|---:|

  | LUBM    | Plain (**Train**)| 110,000 | 110,000 | 110,000 | 110,000 | 110,000 | - | - | - | - |

  | LUBM    | Generalized (**Train**)| 117,124 | 136,731 | 150,653 | 181,234 | 208,710 | - | - | - | - | 

  | LUBM    | Specialized (**Train**)| 117,780 | 154,851 | 173,678 | 271,532 | 230,085 | - | - | - | - | 

  | LUBM    | Ontological (**Train**)| 116,893 | 166,159 | 333,406 | 212,718 | 491,707 | - | - | - | - |

  | LUBM    | Induction (w/ missing links in queries) (**Val/Test**)| 8,000 | 8,000 | 8,000 | 8,000 | 8,000 | 8,000 | 8,000 | 8,000 | 8,000 | 

  | LUBM    | Deduction (w/o missing link in training) (**Val/Test**)| 1,241 | 4,701 | 6,472 | 3,829 | 4,746 | 7,393 | 7,557 | 4,986 | 7,122 |

  | LUBM    | Induction + Deduction (**Val/Test**)| 8,000 | 8,000 | 8,000 | 8,000 | 8,000 | 8,000 | 8,000 | 7,986 | 8,000 |

  | NELL    | Plain (**Train**)| 107,982 | 107,982 | 107,982 | 107,982 | 107,982 | - | - | - | - |

  | NELL    | Generalized (**Train**)| 174,310 | 408,842 | 864,268 | 398,412 | 930,787 | - | - | - | - | 

  | NELL    | Specialized (**Train**)| 174,310 | 419,664 | 906,609 | 401,954 | 936,537 | - | - | - | - | 

  | NELL    | Ontological (**Train**)| 114,614 | 542,923 | 864,268 | 629,144 | 930,787 | - | - | - | - |

  | NELL    | Induction (w/ missing links in queries) (**Val/Test**)| 15,688 | 3,910 | 3,918 | 3,828 | 3,786 | 3,932 | 3,895 |3,940 | 3,966 | 

  | NELL    | Deduction (w/o missing link in training) (**Val/Test**)| 346 | 4,461 | 4,294 | 4,842 | 5,996 | 7,295 | 5,862 | 5,646 | 6,894 |

  | NELL    | Induction + Deduction (**Val/Test**)| 8,000 | 8,000 | 8,000 | 8,000 | 8,000 | 8,000 | 8,000 | 7,990 | 8,000 |

  | Queries | Training | Training |  Validation | Validation |  Test | Test |

  |---------|---------:|-----------:|-----:|---------:|-----------:|-----:|

  | Dataset | 1p  | others  | 1p | others | 1p | others |

  | YAGO 4 (Concept Retrieval)    | 189,338 | 10,000 | 1,000 | 1,000 | 1,000 | 1,000 |

  | YAGO 4 (Entity Only)    | 101,417 | 10,000 | 1,000 | 1,000 | 1,000 | 1,000 |

  | YAGO 4 (Entity + Instantiations)    | 184,708 | 10,000 | 1,000 | 1,000 | 1,000 | 1,000 |

  | DBpedia (Concept Retrieval)    | 473,924 | 10,000 | 1,000 | 1,000 | 1,000 | 1,000 |

  | DBpedia (Entity Only)    | 136,821 | 10,000 | 1,000 | 1,000 | 1,000 | 1,000 |

  | DBpedia (Entity + Instantiations)    | 362,257 | 10,000 | 1,000 | 1,000 | 1,000 | 1,000 |

### Very Large Datasets

Introduced in [SMORE: Knowledge Graph Completion and Multi-hop Reasoning in Massive Knowledge Graphs](https://arxiv.org/abs/2110.14890), KDD 2022.

Training queries are sampled on-the-fly during training due to the huge size of underlying graphs. 

The underlying graphs are FB400k (400K nodes), WikiKG 2 (2.5M nodes) [(from OGB)](https://ogb.stanford.edu/docs/linkprop/#ogbl-wikikg2), and full Freebase (86M nodes)

TODO: confirm with Hongyu the number of validation / test queries.

   Graphs 

  | Dataset | Entities | Relations | Training Edges | Validation Edges | Test Edges | Total Edges

  |-----|----:|----:|----:|----:|----:|----:|

  | FB400k    | 409,829 | 918 | 1,075,837 | 537,917 | 537,917 | 2,151,671 |

  | WikiKG2 | 2,500,604 | 535   | 16,109,182 | 429,456 | 598,543 | 17,137,181 |

  | Freebase  | 86,054,361 | 14,824  | 304,727,650 | 16,929,318 | 16,929,308 | 338,586,276 | 

   Queries 

  | Queries |  Validation | Validation |  Test | Test |

  |---------|------------:|-----------:|------:|-----:|

  | Dataset |  1p | others | 1p | others |

  | FB400k    | TODO | TODO | TODO | TODO |

  | WikiKG2 | TODO | TODO | TODO | TODO |

  | Freebase  | TODO | TODO | TODO | TODO |

### Hyper-Relational Datasets

The main difference of hyper-relational datasets is that edges are no longer plain triples $(h, r, t)$ but *statements* (in terms of Wikidata or RDF-Star) $\Big(h, r, t, (q\_{ri}, q\_{ei})\_i\Big)$ with key-value (relation:entity) qualifiers $(q\_{r}, q\_{e})$ over the *main* triple. 

For example, in the statment `(Albert Einstein, educated at, ETH Zurich, (degree, Bachelor))`, the main triple is `Albert Einstein, educated at, ETH Zurich` and its unique qualifier is `(degree, Bachelor)`. 

Qualifiers provide an additional context to the edge - the tail node might change with another qualifier, e.g., `(Albert Einstein, educated at, University of Zurich, (degree, Doctorate))`.

Entities and relations in qualifiers are still legit entities and relations which could be present in main triples. Some entities and relations can be found *only* in qualifiers.

The WD50K dataset has only conjunctive queries (projection + intersection), neither union nor negation.

Introduced in [Query Embedding on Hyper-Relational Knowledge Graphs](https://openreview.net/pdf?id=4rLw09TgRw9), ICLR 2022

The WD50K-NFOL dataset introduced in [NQE: N-ary Query Embedding for Complex Query Answering over Hyper-relational Knowledge Graphs](https://arxiv.org/pdf/2211.13469.pdf)adds unions and negations, as well as possibility of variables at qualifier entity positions. **As of Nov 2022, not openly available)

   Graph 

  The original WD50K graph from the [StarE paper](https://aclanthology.org/2020.emnlp-main.596.pdf) by Galkin et al.

  | Dataset | Entities | Relations | Qualifier-only Entities | Qualifier-only Relations | Training Edges | Validation Edges | Test Edges | Total Edges

  |-----|----:|----:|----:|----:|----:|----:|----:|----:|

  | WD50K    | 47,156 | 532 | 5460 | 45 | 166,435 | 23,913 | 46,159 | 236,508 |

  32,167 edges have at least one key-value (relation:entity) qualifier.

 

   Queries 

  | Split | 1p | 2p | 3p | 2i | 3i | ip | pi |

  |-----|----:|----:|----:|----:|----:|----:|----:|

  | train | 24,819 | 313,088 | 5,950,990 | 48,513 | 318,735 | 306,022 | 1,088,539 |

  | validation | 4,100 | 100,706 | 2,968,315 | 15,648 | 169,195 | 169,438 | 569,957 |

  | test  | 7,716 | 202,045 | 6,433,476 | 38,207 | 547,272 | 445,007 | 1,267,452 |

  WD50K-NFOL stats are not yet available

### Inductive Datasets

As of March 2023, there are no existing purely inductive datasets such that the training and validation/test graphs are different (validation and test containing new unseen entities) and predictions should only rely on the graph structure w/o external data.

#### Type-based Inductive

As a bridge between shallow transductive models and inductive inference, [Type-aware Embeddings for Multi-Hop Reasoning over Knowledge Graphs](https://arxiv.org/pdf/2205.00782.pdf) propose to mine entity types as the invariant that remains the same at training and inference.

As a result, the following datasets assume an existing and known in advance class hierarchy (or a graph of classes). Technically, those can be put in the [Type-Aware Datasets](#type-aware-datasets) category. The query datasets only have EPFO queries (no negation).

Inductive splits have been published, see the [GitHub issue](https://github.com/zhiweihu1103/QE-TEMP/issues/2)

   Graphs 

  

  The underlying graphs are FB15k-237-V2 and NELL995-V3 from [Inductive relation prediction by subgraph reasoning](https://arxiv.org/abs/1911.06962) by Teru et al, ICML 2020. The original repo and other datasets are [here](https://github.com/kkteru/grail).

  Training is performed on the Train Graph, but at validation/test time the model is fed with a new Inference Graph with completely new nodes. The Inference Graph has missing edges that have to be predicted at validation or test time.

  | Dataset | Relations | Types | Train Graph | Train Graph | Inference Graph | Inference Graph | Inference Graph | Inference Graph |

  |-----|----:|----:|----:|----:|----:|----:|----:|----:|

  |  |  |  |  Train Entities | Train Edges | Inference Entities | Inference Edges | Validation Edges | Test Edges |

  | FB15k-237-V2 | 203 | 3851 | 3,000 | 4,245 | 2,000 | 4,145 | 469 | 478 |

  | NELL995-V3 | 142 | 267 | 4,647 | 16,393 | 4,921 | 8.048 | 811 | 809 |

  The type hierarchy created for those datasets remains unknown.

   Queries 

  | Queries | Training |  Validation | Validation |  Test | Test |

  |---------|---------:|-----:|---------:|-----------:|-----:|

  | Dataset | 1p/2p/3p/2i/3i  | 1p | others | 1p | others |

  | FB15k-237-V2  | 9,964 | 1,738 | 2,000 | 791 | 1,000 |

  | NELL995-V3 | 12,010 | 2,197 | 2,000 | 1,167 | 1,500 |

 

#### Tree-like Conjunctive Inductive

The dataset proposed in [GNNQ](https://github.com/KRR-Oxford/GNNQ) frames query answering as node classification. The dataset has 9 tree-like conjunctive queries (6 synthetic from WatDiv and 3 from FB15k237), no unions nor negations. For each query, there are **P** KGs with an answer entity satisfying a query and **N** KGs with negative samplies where an answer does not satisfy a query. 

Test splits have graphs with new entities (but the same query shapes).

   Graphs 

    Many - each WatDiv query has 2K positive GRAPHS and 700K negative GRAPHS (each of about 100K triples); each FB15k237 query has about 1K positive GRAPHS and 1K negative GRAPHS (each of about 10K triples)

   Queries 

  Each query in the table has many associated graphs where one node is an answer (positive graph sample) and where nodes are not answers (negative graph samples)

  

  | Query | Relations | Num atoms / tree depth |  Train: pos/neg | Test: pos/neg | 

  |---------|---------:|-----:|---------:|-----------:|

  | WatDiv-Q1 | 158  | 8 / 4 | 2114 / 699699 | 1085 / 349877 |

  | WatDiv-Q2 | 158  | 8 / 3 | 3258 / 698396 | 1769 / 349119 |

  | WatDiv-Q3 | 158  | 8 / 3 | 1520 / 700276 | 798 / 350165 | 

  | WatDiv-Q4 | 158  | 10 / 4 | 2397 / 698986 | 1226 / 349546 | 

  | WatDiv-Q5 | 158  | 10 / 4 | 6338 / 693988 | 2866 / 347570 | 

  | WatDiv-Q6 | 158  | 10 / 4 | 7545 / 692439 | 3744 / 346290 | 

  | FB15k237-Q1 | 237  | 7 / 4 | 1185 / 1180 | 395 / 395 | 

  | FB15k237-Q2 | 237  | 7 / 4 | 650 / 660 | 220 / 220 | 

  | FB15k237-Q3 | 237  | 5 / 4 | 860 / 870 | 290 / 290 | 

### Temporal Datasets

Introduced in [TFLEX: Temporal Feature-Logic Embedding Framework for Complex Reasoning over Temporal Knowledge Graph](https://arxiv.org/abs/2205.14307), KDD 2022.

Based on FOL operators, the dataset focuses on temporal reasoning, which includes `after`, `before` and `between` on any timestamp set.

   Graphs 

  | Dataset | Entities | Relations | Timestamps | Training Edges | Validation Edges | Test Edges | Total Edges

  |-----|----:|----:|----:|----:|----:|----:|----:|

  | ICEWS14 | 7,128 | 230 | 365 | 72,826 | 8,941 | 8,963 | 90,730 |

  | ICEWS05-15 | 10,488 | 251 | 4,017 | 386,962 | 46,275 | 46,092 | 479,329 |

  | GDELT-500  | 500 | 20 | 366 | 2,735,685 | 341,961 | 341,961 | 3,419,607 |

Queries 
 
| Query Name 
|------------|----- 
| Pe2        | 72826 
| Pe3        | 72826 
| Pe_Pt      | 7282 
| e2i        | 72826 
| e3i        | 72826 
| e2i_Pe     | - 
| Pe_e2i     | - 
| Pe_t2i     | - 
| e2i_NPe       | 7282 
| e2i_peN       | 7282 
| Pe_e2i_Pe_NPe | 7282 
| e2i_N         | 7282 
| e3i_N         | 7282 
| e2u           | - 
| Pe_e2u        | - 
| Pt_lPe     | 7282 
| Pt_rPe     | 7282 
| t2i        | 72826 
| t3i        | 72826 
| t2i_Pe     | - 
| Pt_le2i    | 7282 
| Pt_re2i    | 7282 
| t2i_NPt 
| t2i_PtN 
| Pe_t2i_PtPe_NPt | 7282 
| t2i_N 
| t3i_N 
| t2u        | - 
| Pe_t2u     | - 
| Pe_aPt     | 7282 
| Pe_bPt     | 7282 
| Pe_at2i    | 7282 
| Pe_bt2i    | 7282 
| between    | 7282

| ICEWS14-Train | Validation | Test | ICES05-15-Train | Validation | Test  | GDELT-500-Train | Validation | Test  | ----------|------------|------|-----------------|------------|-------|-----------------|------------|-------| | 3482       | 4037 | 368962          | 10000      | 10000 | 2215309         | 10000      | 10000 | | 3492       | 4083 | 368962          | 10000      | 10000 | 2215309         | 10000      | 10000 | | 3385       | 3638 | 36896           | 10000      | 10000 | 221530          | 10000      | 10000 | | 3305       | 3655 | 368962          | 10000      | 10000 | 2215309         | 10000      | 10000 | | 2966       | 3023 | 368962          | 10000      | 10000 | 2215309         | 10000      | 10000 | | 2913       | 2913 | -               | 10000      | 10000 | -               | 10000      | 10000 | | 2913       | 2913 | -               | 10000      | 10000 | -               | 10000      | 10000 | | 2913       | 2913 | -               | 10000      | 10000 | -               | 10000      | 10000 | | 3061       | 3192 | 36896           | 10000      | 10000 | 221530          | 10000      | 10000 | | 2971       | 3031 | 36896           | 10000      | 10000 | 221530          | 10000      | 10000 | | 2968       | 3012 | 36896           | 10000      | 10000 | 221530          | 10000      | 10000 | | 2949       | 2975 | 36896           | 10000      | 10000 | 221530          | 10000      | 10000 | | 2913       | 2914 | 36896           | 10000      | 10000 | 221530          | 10000      | 10000 | | 2913       | 2913 | -               | 10000      | 10000 | -               | 10000      | 10000 | | 2913       | 2913 | -               | 10000      | 10000 | -               | 10000      | 10000 | | 4976       | 5608 | 36896           | 10000      | 10000 | 221530          | 10000      | 10000 | | 3321       | 3621 | 36896           | 10000      | 10000 | 221530          | 10000      | 10000 | | 5112       | 6631 | 368962          | 10000      | 10000 | 2215309         | 10000      | 10000 | | 3094       | 3296 | 368962          | 10000      | 10000 | 2215309         | 10000      | 10000 | | 2913       | 2913 | -               | 10000      | 10000 | -               | 10000      | 10000 | | 3226       | 3466 | 36896           | 10000      | 10000 | 221530          | 10000      | 10000 | | 3236       | 3485 | 36896           | 10000      | 10000 | 221530          | 10000      | 10000 | | 7282          | 4873       | 5464 | 36896           | 10000      | 10000 | 221530          | 10000      | 10000 | | 7282          | 3300       | 3609 | 36896           | 10000      | 10000 | 221530          | 10000      | 10000 | | 3031       | 3127 | 36896           | 10000      | 10000 | 221530          | 10000      | 10000 | | 7282          | 3135       | 3328 | 36896           | 10000      | 10000 | 221530          | 10000      | 10000 | | 7282          | 2924       | 2944 | 36896           | 10000      | 10000 | 221530          | 10000      | 10000 | | 2913       | 2913 | -               | 10000      | 10000 | -               | 10000      | 10000 | | 2913       | 2913 | -               | 10000      | 10000 | -               | 10000      | 10000 | | 4134       | 4733 | 68262           | 10000      | 10000 | 221530          | 10000      | 10000 | | 3970       | 4565 | 36896           | 10000      | 10000 | 221530          | 10000      | 10000 | | 4607       | 5338 | 36896           | 10000      | 10000 | 221530          | 10000      | 10000 | | 4583       | 5386 | 36896           | 10000      | 10000 | 221530          | 10000      | 10000 | | 2913       | 2913 | 36896           | 10000      | 10000 | 221530          | 10000      | 10000 |

### Dataset tools

- [Graph Query Sampler](https://github.com/miselico/graph_query_sampler): Not a method, rather a dataset generator

- [EFO-1-QA-benchmark](https://github.com/HKUST-KnowComp/EFO-1-QA-benchmark): Generating combiantorial tree-formed query types and sampling the data.

- [EFOk-CQA](https://github.com/HKUST-KnowComp/EFOK-CQA): Generating combinatorial existential first order query types with multiple (k) variables and sampling the data.

## :wrench: Implementations

- [KGReasoning](https://github.com/snap-stanford/KGReasoning): GQE, Query2Box, BetaE

- [CQD](https://github.com/pminervini/KGReasoning): GQE, Query2Box, BetaE, CQD

- [EFO-1-QA-benchmark](https://github.com/HKUST-KnowComp/EFO-1-QA-benchmark): Query2Box, BetaE, LogicE, NewLook, ConE, FuzzQE

- [Query2particles](https://github.com/HKUST-KnowComp/query2particles)

- [StarQE](https://github.com/DimitrisAlivas/StarQE): StarQE

- [SMORE](https://github.com/google-research/smore): GQE, Query2Box, BetaE + Very Large Datasets

- [GNN-QE](https://github.com/DeepGraphLearning/GNN-QE): GNN-QE

- [InductiveQE](https://github.com/DeepGraphLearning/InductiveQE): Inductive QE with NodePiece and GNN-QE

- [TAR](https://github.com/lilv98/TAR): TAR

- [QE-TeMP](https://github.com/zhiweihu1103/QE-TEMP): TeMP (based on KGReasoning)

- [GNNQ](https://github.com/KRR-Oxford/GNNQ): GNNQ

- [SE-KGE](https://github.com/gengchenmai/se-kge): GQE, CGA, and geospatial model

- [LARK](https://github.com/Akirato/LLM-KG-Reasoning/): LARK (uses Huggingface LLMs)

- [WFRE](https://github.com/HKUST-KnowComp/WFRE): WFRE

- [FIT](https://github.com/HKUST-KnowComp/FIT): FIT

- [SQE](https://github.com/HKUST-KnowComp/SQE): SQE with Transformer/LSTM/GRU/TCN, Tree-LSTM, Tree-RNN, BetaE, BiQE, ConE, FuzzQE, GQE, HypE, NerualMLP (Mixer), Query2Box, Query2Particles

- [NRN](https://github.com/HKUST-KnowComp/NRN): NRN with GQE, Query2Box, Query2Particles

- [EFOk-CQA](https://github.com/HKUST-KnowComp/EFOK-CQA): EFOk

## All Papers on Complex Logical Query Answering (54)

  Click to expand 

  1. (GQE) [Embedding Logical Queries on Knowledge Graphs](https://proceedings.neurips.cc/paper/2018/file/ef50c335cca9f340bde656363ebd02fd-Paper.pdf) NeurIPS 2018

  2. (GQE + hashing) [Learning to Hash for Efficient Search over Incomplete Knowledge Graphs](https://ieeexplore.ieee.org/abstract/document/8970688/?casa_token=gLHFmr7V2ekAAAAA:wwDbUufdkwHTQo68pansuhJsJ2XQAF0P21_mQSu75KVRUkgqARmXBs_VEmFOkFgz_Lq-FXP8OA) ICDM 2019

  3. (CGA) [Contextual Graph Attention for Answering Logical Queries over Incomplete Knowledge Graphs](https://arxiv.org/pdf/1910.00084.pdf) K-CAP 2019, GQE + self-attention instead of DeepSet

  4. (TractOR) [Symbolic querying of vector spaces: Probabilistic databases meets relational embeddings](https://arxiv.org/pdf/2002.10029.pdf), UAI 2020

  5. (Query2Box) [Query2box: Reasoning over Knowledge Graphs in Vector Space Using Box Embeddings](https://openreview.net/pdf?id=BJgr4kSFDS) ICLR 2020

  6. (BetaE) [Beta Embeddings for Multi-Hop Logical Reasoning in Knowledge Graphs](https://proceedings.neurips.cc/paper/2020/file/e43739bba7cdb577e9e3e4e42447f5a5-Paper.pdf) NeurIPS 2020

  7. (EmQL) [Faithful embeddings for knowledge base queries](https://proceedings.neurips.cc/paper/2020/hash/fe74074593f21197b7b7be3c08678616-Abstract.html) NeurIPS 2020

  8. (MPQE) [Message Passing Query Embedding](https://grlplus.github.io/papers/26.pdf) ICML’20 Workshop

  9. (RotatE-Box)[Regex Queries over Incomplete Knowledge Bases](https://arxiv.org/abs/2005.00480) AKBC’21

  10. (BiQE) [Answering complex queries in knowledge graphs with bidirectional sequence encoders](https://ojs.aaai.org/index.php/AAAI/article/view/16630), AAAI’21 

  11. [Approximate knowledge graph query answering: from ranking to binary classification](https://arxiv.org/abs/2102.11389)

  12. [Knowledge Sheaves: A Sheaf-Theoretic Framework for Knowledge Graph Embedding](https://arxiv.org/pdf/2110.03789.pdf) arxiv, 2021

  13. (ConE) [Cone: Cone embeddings for multi-hop reasoning over knowledge graphs](https://proceedings.neurips.cc/paper/2021/hash/a0160709701140704575d499c997b6ca-Abstract.html) NeurIPS’21

  14. (PERM) [Probabilistic entity representation model for reasoning over knowledge graphs](https://proceedings.neurips.cc/paper/2021/hash/c4d2ce3f3ebb5393a77c33c0cd95dc93-Abstract.html) (improv over BetaE) NeurIPS’21

  15. (CQD) [Complex Query Answering with Neural Link Predictors](https://openreview.net/forum?id=Mos9F9kDwkz) ICLR’21

  16. (HypE) [Self-Supervised Hyperboloid Representations from Logical Queries over Knowledge Graphs](https://arxiv.org/pdf/2012.13023.pdf), WWW 2021

  17. (NewLook) [Neural-Answering Logical Queries on Knowledge Graphs](http://tonghanghang.org/pdfs/kdd21_newlook.pdf) (KDD’21) 

  18. [Benchmarking the Combinatorial Generalizability of Complex Query Answering on Knowledge Graphs](https://arxiv.org/abs/2109.08925), NeurIPS 2021 (Datasets and Benchmarks)

  19. [Neuro-Symbolic Ontology-Mediated Query Answering](https://openreview.net/pdf?id=wwVb95CkrFm) OpenReview 2021

  20. (LogicE) [Logic Embeddings for Complex Query Answering](https://arxiv.org/pdf/2103.00418.pdf) arxiv 2021

  21. (StarQE) [Query Embedding on Hyper-relational Knowledge Graphs](https://arxiv.org/abs/2106.08166) ICLR 2022, 

  22. (MLPMix) [Neural Methods for Logical Reasoning over Knowledge Graphs](https://openreview.net/forum?id=tgcAoUVHRIB) ICLR 2022

  23. (FuzzQE) [Fuzzy Logic Based Logical Query Answering on Knowledge Graphs](https://ojs.aaai.org/index.php/AAAI/article/view/20310), AAAI 2022

  24. (GNN-QE)  [Neural-Symbolic Models for Logical Queries on Knowledge Graphs](https://arxiv.org/abs/2205.10128), ICML 2022

  25. (SMORE) [SMORE: Knowledge Graph Completion and Multi-hop Reasoning in Massive Knowledge Graphs](https://arxiv.org/abs/2110.14890) KDD 2022 

  26. (kgTransformer) [Mask and Reason: Pre-Training Knowledge Graph Transformers for Complex Logical Queries](https://dl.acm.org/doi/pdf/10.1145/3534678.3539472?casa_token=c_fD5BwP0MoAAAAA:eYO12-vcXophXPFSYro_6go4b2sQRkCTcEYRZRcVgDT6SiDpKa0AvGhqS8spvhn8-y_H0ZmCz72E) KDD 2022

  27. (Query2Particles) [Query2Particles: Knowledge Graph Reasoning with Particle Embeddings](https://arxiv.org/abs/2204.12847), Findings NAACL’22

  28. (TAR) [TAR: Neural Logical Reasoning across TBox and ABox](https://arxiv.org/abs/2205.14591) (arxiv, 2022) 

  29. (TeMP) [Type-aware embeddings for multi-hop reasoning over knowledge graphs](https://arxiv.org/abs/2205.00782) (IJCAI-ECAI 2022) 

  30. (FLEX) [FLEX: Feature-Logic Embedding Framework for CompleX Knowledge Graph Reasoning](https://arxiv.org/abs/2205.11039) (arxiv 2022)

  31. (TFLEX) [TFLEX: Temporal Feature-Logic Embedding Framework for Complex Reasoning over Temporal Knowledge Graph](https://arxiv.org/pdf/2205.14307.pdf) (arxiv, 2022)

  32. (LinE) [LinE: Logical Query Reasoning over Hierarchical Knowledge Graphs](https://dl.acm.org/doi/pdf/10.1145/3534678.3539338?casa_token=_jPlNJj2TlYAAAAA:pKAA42_lrZ2JIHc1YZV0fchIlRiIcqCy8oCBL2UU3Gm84MOeTSYLfQn31DKtXBbU2yqzC7LUsYvREBw) KDD 2022

  33. [GNNQ: A Neuro-Symbolic Approach for Query Answering over Incomplete Knowledge Graphs](https://scholar.archive.org/work/sswyx3mbc5danaxosfnyr45cw4/access/wayback/https://s3-eu-west-1.amazonaws.com/pstorage-oxford-0703704596/36647646/GNNQ_full.pdf?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIA3OGA3B5WHKHW6LWV/20220814/eu-west-1/s3/aws4_request&X-Amz-Date=20220814T213541Z&X-Amz-Expires=10&X-Amz-SignedHeaders=host&X-Amz-Signature=1bf627735fbce377c2478f514fbac1dfcf49423980d372cc71e2bed732d6ae2c) ISWC 2022

  34. (ENeSy) [Neural-Symbolic Entangled Framework for Complex Query Answering](https://arxiv.org/pdf/2209.08779.pdf) NeurIPS 2022  

  35. (NodePiece-QE, InductiveQE) [Inductive Logical Query Answering in Knowledge Graphs](https://arxiv.org/abs/2210.08008) NeurIPS 2022  

  36. (RoMA) [Reasoning over Multi-view Knowledge Graphs](https://arxiv.org/pdf/2209.13702.pdf) arxiv 2022, some new datasets, but no code/data published  

  37. (LMPNN) [Logical Message Passing Networks With One-Hop Inference On Atomic Formulas](https://openreview.net/pdf?id=SoyOsp7i_l) ICLR'23  

  38. (GammaE) [GammaE: Gamma Embeddings for Logical Queries on Knowledge Graphs](https://arxiv.org/pdf/2210.15578.pdf) EMNLP 2022  

  39. (NMP-QEM) [Neural-based Mixture Probabilistic Query Embedding for Answering FOL queries on Knowledge Graphs](https://aclanthology.org/2022.emnlp-main.194.pdf), EMNLP 2022  

  40. (NQE) [NQE: N-ary Query Embedding for Complex Query Answering over Hyper-relational Knowledge Graphs](https://arxiv.org/pdf/2211.13469.pdf) AAAI 2023  

  41. (QTO) [Answering Complex Logical Queries on Knowledge Graphs via Query Computation Tree Optimization](https://arxiv.org/pdf/2212.09567.pdf), ICML'23 submission  

  42. (SignalE) [Signal Embeddings for Complex Logical Reasoning in Knowledge Graphs](https://link.springer.com/content/pdf/10.1007/978-3-031-10983-6_20.pdf?pdf=inline%20link), KSEM'22  

  43. (Var2Vec) [Efficient Embeddings of Logical Variables for Query Answering over Incomplete Knowledge Graphs](https://ora.ox.ac.uk/objects/uuid:27c9b239-20d0-449f-b425-b5065eb128fe/download_file?file_format=application%2Fpdf&safe_filename=Wang_et_al_2022_Efficient_Embeddings_ofAM.pdf&type_of_work=Conference+item), AAAI'23

  44. (CQD-A) [Adapting Neural Link Predictors for Complex Query Answering](https://arxiv.org/pdf/2301.12313.pdf)

  45. (Query2Geom) [Analysis of Attention Mechanisms in Box-Embedding Systems](https://link.springer.com/chapter/10.1007/978-3-031-26438-2_6), 2023

  46. (SQE) [Sequential Query Encoding For Complex Query Answering on Knowledge Graphs](https://openreview.net/pdf?id=ERqGqZzSu5), TMLR 2023  

  47. (CylE) [CylE: Cylinder Embeddings for Multi-hop Reasoning over Knowledge Graphs](https://aclanthology.org/2023.eacl-main.127.pdf), EACL 2023  

  48. (RoConE) [Modeling Relational Patterns for Logical Query Answering over Knowledge Graphs](https://arxiv.org/pdf/2303.11858.pdf)  

  49. (FIT) [On Existential First Order Queries Inference on Knowledge Graphs](https://arxiv.org/pdf/2304.07063.pdf), arxiv 2023  

  50. (LitCQD) [LitCQD: Multi-Hop Reasoning in Incomplete Knowledge Graphs with Numeric Literals](https://arxiv.org/pdf/2304.14742.pdf), arxiv 2023  

  51. (LARK) [Complex Logical Reasoning over Knowledge Graphs using Large Language Models](https://arxiv.org/pdf/2305.01157.pdf), arxiv 2023  

  52. (WFRE) [Wasserstein-Fisher-Rao Embedding: Logical Query Embeddings with Local Comparison and Global Transport](https://arxiv.org/pdf/2305.04034.pdf), arxiv 2023

  53. (NRN) [Knowledge Graph Reasoning over Entities and Numerical Values](https://arxiv.org/abs/2306.01399) KDD 2023

  54. (EFOk-CQA) [EFOk-CQA: Towards Knowledge Graph Complex Query Answering beyond Set Operation](https://arxiv.org/abs/2307.13701) arxiv 2023

## Application Papers (7)

  Click to expand 

  1. (SE-KGE) [SE-KGE: A Location-Aware Knowledge Graph Embedding Model for Geographic Question Answering and Spatial Semantic Lifting](https://arxiv.org/pdf/2004.14171.pdf), Transactions in GIS 2020, GQE with scalar (x,y) coordinate prediction / encoding

  2. (LEGO) [Lego: Latent execution-guided reasoning for multi-hop question answering on knowledge graphs](https://proceedings.mlr.press/v139/ren21a.html), ICML 2021

  3. (CBR-SubG) [Knowledge base question answering by case-based reasoning over subgraphs](https://proceedings.mlr.press/v162/das22a.html) ICML 2022, application to Question Answering, entailment only, custom datasets

  4. (LogiRec) [Towards High-Order Complementary Recommendation via Logical Reasoning Network](https://arxiv.org/pdf/2212.04966.pdf) Application: BetaE in RecSys, arxiv 2022

  5. [Context-aware explainable recommendation based on domain knowledge graph](https://www.mdpi.com/1458952), Big Data and Cognitive Computing, 2022

  6. (PLM4CLQA) [Unifying Structure Reasoning and Language Model Pre-training for Complex Reasoning](https://arxiv.org/pdf/2301.08913.pdf), arxiv 2023

  7. [Unifying structure reasoning and language model pre-training for complex reasoning](https://arxiv.org/pdf/2301.08913), arxiv 2023

## Potentially Relevant 

  Click to expand 

  1. [Hybrid Structured and Similarity Queries over Wikidata plus Embeddings with Kypher-V](https://www.isi.edu/~hans/publications/WDWS-ISWC-2022.pdf), ISWC 2022

  2. [Combining RDF Graph Data and Embedding Models for an Augmented Knowledge Graph](https://dl.acm.org/doi/fullHtml/10.1145/3184558.3191527), BigNet 2018 Workshop @ WWW'18

  3. [TrQuery: An Embedding-based Framework for Recommanding SPARQL Queries](https://arxiv.org/pdf/1806.06205.pdf), 2018

  4. [Towards Empty Answers in SPARQL: Approximating Querying with RDF Embedding](https://link.springer.com/content/pdf/10.1007/978-3-030-00671-6.pdf), ISWC 2018

## Citation

If you find this work useful, please cite the original paper:

```bibtex

@article{ren2023ngdb,

    title={Neural Graph Reasoning: Complex Logical Query Answering Meets Graph Databases},

    author={Hongyu Ren and Mikhail Galkin and Michael Cochez and Zhaocheng Zhu and Jure Leskovec},

    year={2023},

    eprint={2303.14617},

    archivePrefix={arXiv},

}

```
ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/neuralgraphdatabases/awesome-logical-query

Awesome Lists containing this project

README