Note: This task is highly related with Entity Linking and Disambiguation as well as Relation Linking and Disambiguation.
- Core techniques of question answering systems over knowledge bases: a survey (Knowledge and Information Systems 2017) [Paper]
- A Survey on Complex Question Answering over Knowledge Base: Recent Advances and Challenges (2020) [Paper]
- Question Answering Summary (not limited to KBQA) [GitHub]
- Introduction to Neural Network based Approaches for Question Answering over Knowledge Graphs (2019) [Paper]
- Awesome KGQA [GitHub]
- What is in the KGQA Benchmark Datasets? Survey on Challenges in Datasets for Question Answering on Knowledge Graphs (Journal on Data Semantics, 2021)[Paper]
- Complex Knowledge Base Question Answering: A Survey [Paper]
- Knowledge Graphs & LLMs: Multi-Hop Question Answering [Neo4j Developer Blog] [Another Similar Discussion]
- Multilingual Question Answering Systems for Knowledge Graphs—A Survey (Semantic Web 2023) [Paper]
General KGQA
- Question Answering Over Knowledge Graphs: Question Understanding Via Template Decomposition (VLDB 2018) [Paper]🌟
- KBQA: Learning Question Answering over QA Corpora and Knowledge Bases
- SPARQA: Skeleton-based Semantic Parsing for Complex Questions over Knowledge Bases [Paper]
- Improving Multi-hop Question Answering over Knowledge Graphs using Knowledge Base Embeddings (ACL 2020) [Paper]
Efficiency issue.
- AskNow: A Framework for Natural Language Query Formalization in SPARQL (ESWC 2016)
- Answering Natural Language Questions by Subgraph Matching over Knowledge Graphs (TKDE 2018) [Paper]
- Natural language question answering over RDF: a graph data driven approach (SIGMOD 2014) 🌟
- Complex Factoid Question Answering with a Free-Text Knowledge Graph (WWW 2020)
- Automated template generation for question answering over knowledge graphs (WWW 2017) [Paper]
- Never-Ending Learning for Open-Domain Question Answering over Knowledge Bases (WWW 2018) [Paper]
- Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering (ICLR 2020)
- An Interpretable Reasoning Network for Multi-Relation Question Answering (COLING 2018)
- Pattern-revising Enhanced Simple Question Answering over Knowledge Bases (COLING 2018)
- TEQUILA: Temporal Question Answering over Knowledge Bases (CIKM 2018)
- FreebaseQA: A New Factoid QA Data Set Matching Trivia-Style Question-Answer Pairs with Freebase (NAACL 2019) [Paper]
- Knowledge Graph Embedding Based Question Answering (WSDM 2019) [Paper] [Code]
- Graph-Based Reasoning over Heterogeneous External Knowledge for Commonsense Question Answering (AAAI 2020) [Paper]
- Open Question Answering Over Curated and Extracted Knowledge Bases (KDD 2014) [Paper]
- RocketQA: An Optimized Training Approach to Dense Passage Retrieval for Open-Domain Question Answering (2021) [Paper]
- Natural language question/answering let users talk with the knowledge graph (CIKM 2017)
- Asking Clarification Questions in Knowledge-Based Question Answering (EMNLP 2019)
- What's Missing: A Knowledge Gap Guided Approach for Multi-hop Question Answering (EMNLP 2019)
- KagNet: Learning to Answer Commonsense Questions with Knowledge-Aware Graph Networks (EMNLP 2019)
- Message Passing for Complex Question Answering over Knowledge Graphs (CIKM 2019) [Paper]
- Keyword Search on RDF Graphs — A Query Graph Assembly Approach (CIKM 2017) [Paper] 🌟
- Semantic Guided and Response Times Bounded Top-k Similarity Search over Knowledge Graphs (ICDE 2020) [Paper] 🌟
- Beyond I.I.D.: Three Levels of Generalization for Question Answering on Knowledge Bases (WWW 2021) [Paper]
- Query Graph Generation for Answering Multi-hop Complex Questions from Knowledge Bases (ACL 2020) [Paper] [Code]
- CBench: Towards Better Evaluation of Question Answering Over Knowledge Graphs [Paper] (VLDB 2021) 🌟
- SAKE: Spatial Question Answering over Knowledge Graph Based on Embedding Techniques (ICDE 2021) 🌟
- BeamQA: Multi-hop Knowledge Graph Question Answering with Sequence-to-Sequence Prediction and Beam Search (SIGIR 2023) [Paper]🌟
- Joint Knowledge Graph Completion and Question Answering (KDD 2022) [Paper]
- Would You Ask it that Way?: Measuring and Improving Question Naturalness for Knowledge Graph Question Answering (SIGIR 2022) [Paper]
- Sequence-to-Sequence Knowledge Graph Completion and Question Answering (ACL 2022) [Paper]
- Knowledge Graph Question Answering with Ambiguous Query (WWW 2023) [Paper]
- Efficient and Accurate Contextual Re-Ranking for Knowledge Graph Question Answering (LREC-COLING 2024) [Paper]
- STaR: Space and Time-aware Statistic Query Answering (CIKM 2024) [Paper]
- DiaKoP: Dialogue-based Knowledge-oriented Programming for Neural-symbolic Knowledge Base Question Answering (CIKM 2024) [Paper]
- FZI-WIM at AVeriTeC Shared Task: Real-World Fact-Checking with Question Answering (EMNLP 2024) [Paper] - Related to
knowledge store
instead of knowledge graph. - Enhancing Complex Question Answering over Knowledge Graphs through Evidence Pattern Retrieval (SIGIR 2024) [Paper]
- NuTrea: neural tree search for context-guided multi-hop KGQA (NeurIPS 2023) [Paper]
- Uniqorn: Unified question answering over RDF knowledge graphs and natural language text (Journal of Web Semantics, 2024) [Paper]
- Reasoning with Trees: Faithful Question Answering over Knowledge Graph (COLING 2025)
- Aligning Complex Knowledge Graph Question Answering as Knowledge-Aware Constrained Code Generation (COLING 2025)
Multiple Hop QA
- Scalable Multi-Hop Relational Reasoning for Knowledge-Aware Question Answering (EMNLP 2020) [Video]
- QA-GNN: Reasoning with Language Models and Knowledge Graphs for Question Answering (Jure's group, NAACL-HLT 2021) [Paper]
- Cognitive Graph for Multi-Hop Reading Comprehension at Scale (ACL 2019) [Paper] [Code]
- Note: Cognitive Graph is not directly equal to Knowledge Graph. You can view CG as a (dynamic, partial, local) KG generated instantly from the query.
- Complex Question Answering on knowledge graphs using machine translation and multi-task learning (EACL 2021) [Paper]
- Improving Multi-hop Question Answering over Knowledge Graphs using Knowledge Base Embeddings (ACL 2020) [Paper] More than 300 stars in Github in Dec 2023!
Multiple-Options QA
- RiddleSense: Reasoning about Riddle Questions Featuring Linguistic Creativity and Commonsense Knowledge (ACL 2021) [Paper]
- QA-GNN: Reasoning with Language Models and Knowledge Graphs for Question Answering (Jure's group, NAACL-HLT 2021) [Paper]
Multiligual QA
- Improving Zero-Shot Cross-lingual Transfer for Multilingual Question Answering over Knowledge Graph (ACL 2021) [Paper]
- A System for Answering Simple Questions in Multiple Languages (ACL 2023, demo) [Paper]
LLM for KGQA, or KG for QA based on LLM 🔥
- It becomes a hot topic in both academics and industries, especially with the developement of KG for RAG.
- Leveraging LLMs in Scholarly Knowledge Graph Question Answering (Arxiv, Nov 2023) [Paper]
- Answering Questions Over Knowledge Graphs Using Logic Programming Along with Language Models (submitted to ICLR 2023 but not revised) [Paper]
- Retrieve-Rewrite-Answer: A KG-to-Text Enhanced LLMs Framework for Knowledge Graph Question Answering (ijckg, Sep 2023) [Paper]
- Language Models as Controlled Natural Language Semantic Parsers for Knowledge Graph Question Answering (from Amazon Science, 2023) [Paper]
- Is GPT fit for KGQA? – Preliminary Results (CEUR Workshop 2023) [Paper]
- Only GPT3 and GPT3.5 are used, so the experiemnts prove that there is limitation.
- Bring Your Own KG: Self-Supervised Program Synthesis for Zero-shot KGQA (Amazon and UMass, Nov 2023) [Paper]
- In BYOKG, exploration leverages an LLM-backed symbolic agent that generates a diverse set of queryprogram exemplars, which are then used to ground a retrieval-augmented reasoning procedure to predict programs for arbitrary questions.
- A Benchmark to Understand the Role of Knowledge Graphs on Large Language Model's Accuracy for Question Answering on Enterprise SQL Databases (Arxiv, Nov 2023) [Paper]
- Knowledge-augmented language model prompting for zero-shot knowledge graph question answering (ACL 2023, KAIST + MBZUAI + Amazon) [Paper]
- Empowering Language Models with Knowledge Graph Reasoning for Open-Domain Question Answering (EMNLP 2022) [Paper]
- KnowledgeNavigator: Leveraging Large Language Models for Enhanced Reasoning over Knowledge Graph (Arxiv, Dec 2023) [Paper]
- The baselines also include StructGPT (A general framework for large language model to reason over structured data, Arxiv 2023) and TOG (Think-on-graph: Deep and responsible reasoning of large language model with knowledge graph, ICLR 2024 poster)
- Pretrained transformers for simple question answering over knowledge graphs (Web–ISWC 2019)
- The first work to utilize LLMs as classifiers for relation prediction.
- An empirical study of pre-trained language models in simple knowledge graph question answering (Arxiv 2023)
- It introduce two LLM-based KGQA frameworks that adopt LLMs to detect mentioned entities and relations. Then, they query the answer in KGs using the extracted entity-relation pairs.
- QA-GNN: QAGNN: Reasoning with language models and knowledge graphs for question answering (ACL 2021)
- It uses LLMs to encode the question and candidate answer pairs, which are adopted to estimate the importance of relative KG entities. The entities are retrieved to form a subgraph, where an answer reasoning is conducted by a GNN.
- A bert-based approach with relationaware attention for knowledge base question answering (IJCNN 2021)
- It use LLMs to calculate the similarities between relations and questions to retrieve related facts.
- Subgraph retrieval enhanced model for multi-hop knowledge base question answering (ACL 2022)
- a LLM-based path retriever to retrieve question-related relations hop-byhop and construct several paths.
- Can ChatGPT Replace Traditional KBQA Models? An In-Depth Analysis of the Question Answering Performance of the GPT LLM Family (ISWC 2023) [Paper]
- ReasoningLM: Enabling Structural Subgraph Reasoning in Pre-trained Language Models for Question Answering over Knowledge Graph (ACL 2023) [Paper]
- Language Models as Controlled Natural Language Semantic Parsers for Knowledge Graph Question Answering (ECAI 2023) [Paper]
- LLM-Based SPARQL Generation with Selected Schema from Large Scale Knowledge Base (CCKS2023 CKBQA competition, F1 score is 75.63% on CKBQA dataset) [Paper]
- Improving Subgraph Extraction Algorithms for One-Shot SPARQL Query Generation with Large Language Models (ISWC 2023) [Paper]
- A Structure and Content Prompt-based Method for Knowledge Graph Question Answering over Scholarly Data (ISWC 2023) [Paper]
- Bring Your Own KG: Self-Supervised Program Synthesis for Zero-Shot KGQA (Arxiv 2023) [Paper]
- Check for updates LLM-Based SPARQL Generation with Selected Schema from Large Scale Knowledge Base (Springer Nature Singapore 2023)
- A Knowledge Graph Question Answering Approach to IoT Forensics (IoTDI 2023) [Paper]
- Don’t Generate, Discriminate: A Proposal for Grounding Language Models to Real-World Environments (ACL 2023) [Paper]
- Pangu consists of a symbolic agent and a neural LM working in a concerted fashion: The agent explores the environment to incrementally construct valid plans, and the LM evaluates the plausibility of the candidate plans to guide the search process.
- The evaluation is conducted on KBQA.
- FlexKBQA: A Flexible LLM-Powered Framework for Few-Shot Knowledge Base Question Answering (AAAI 2024) [Paper]
- Knowledge Graph Prompting for Multi-Document Question Answering (AAAI 2024) [Paper]
- CyberQ: Generating Questions and Answers for Cybersecurity Education Using Knowledge Graph-Augmented LLMs (AAAI 2024) [Paper]
- Grape: Knowledge Graph Enhanced Passage Reader for Open-domain Question Answering (EMNLP 2022) [Paper]
- Chatgpt versus traditional question answering for knowledge graphs: Current status and future directions towards knowledge graph chatbots (Arxiv 2023, with 77+ citations in Aug 2024) [Paper]
- Cross-Data Knowledge Graph Construction for LLM-enabled Educational Question-Answering System: A Case Study at HCMUT (AIQAM 2024) [Paper]
- Retrieval-augmented generation with knowledge graphs for customer service question answering (SIGIR 2024) [Paper]
- Knowledge Graph aided LLM based ESG Question-Answering from News ((KG4S2024, co-located with ESWC2024) [Paper]
- Improving LLM-based KGQA for multi-hop Question Answering with implicit reasoning in few-shot examples (KaLLM 2024) [Paper]
- Towards Faithful Knowledge Graph Explanation Through Deep Alignment in Commonsense Question Answering (EMNLP 2024) [Paper]
- Retrieval and Reasoning on KGs: Integrate Knowledge Graphs into Large Language Models for Complex Question Answering (EMNLP 2024) [Paper]
- Training-free Deep Concept Injection Enables Language Models for Video Question Answering (EMNLP 2024) [Paper]
- A GAIL Fine-Tuned LLM Enhanced Framework for Low-Resource Knowledge Graph Question Answering (CIKM 2024) [Paper]
- Generate-then-Ground in Retrieval-Augmented Generation for Multi-hop Question Answering (ACL 2024) [Paper]
- RGR-KBQA: Generating Logical Forms for Question Answering Using Knowledge-Graph-Enhanced Large Language Model (COLING 2025) [Paper]
- Awakening Augmented Generation: Learning to Awaken Internal Knowledge of Large Language Models for Question Answering (COLING 2025)
SPARQL
- Enhancing SPARQL Generation by Triplet-order-sensitive Pre-training (CIKM 2024) [Paper]
- QALD-9: The 9th challenge on question answering over linked data (QALD-9) (invited paper) (CEUR Workshop 2018)
- Lc-quad 2.0: A large dataset for complex question answering over wikidata and dbpedia (ISWC 2019)
- Knowledge Graph Question Answering Leaderboard: A Community Resource to Prevent a Replication Crisis (LREC 2022) [Paper] [Link]
- MetaQA: Variational reasoning for question answering with knowledge graph (AAAI 2018)
- A large scale multi-hop KGQA dataset with more than 400k questions in the movie domain. It has 1-hop, 2-hop, and 3-hop questions.
- It also provides a KG with 135k triples, 43k entities, and nine relations.
- WebQuestionsSP: The value of semantic parse labeling for knowledge base question answering (ACL 2016)
- a smaller QA dataset with 4,737 questions. The questions in this dataset are 1-hop and 2-hop questions and are answerable through Freebase KG.
- ComplexWebQuestions: The web as a knowledge-base for answering complex questions (ACL 2018)
- GraphQ: On generating characteristic-rich question sets for QA evaluation (EMNLP 2016)
- MKQA: a linguistically diverse benchmark for multilingual open domain question answering (ACL 2021)
- GrailQA: Beyond IID: three levels of generalization for question answering on knowledge bases (WWW 2021)
- KQA Pro: a dataset with explicit compositional programs for complex question answering over knowledge base (ACL 2022)
- Spider4SPARQL: A Complex Benchmark for Evaluating Knowledge Graph Question Answering Systems (Arxiv, Sep 2023) [Github]
- GQA: A new dataset for real-world visual reasoning and compositional question answering (CVPR 2019)
- A discussion of KGQA datasets: Knowledge Graph Question Answering Datasets and Their Generalizability: Are They Enough for Future Research? (SIGIR 2022) [Paper]
- Leaderboard: Knowledge Graph Question Answering Leaderboard: A Community Resource to Prevent a Replication Crisis (LREC 2022) [Paper]
- ToolQA: A Dataset for LLM Question Answering with External Tools (NeurIPS 2023, Datasets and Benchmarks Track) [Paper]
- QuALITY: Question Answering with Long Input Texts, Yes! (NAACL 2022) [Link]
- GeoQuestions1089: The question answering system GeoQA2 and a new benchmark for its evaluation (International Journal of Applied Earth Observation and Geoinformation 2024) [Paper]
- CommonsenseQA in-house (IH) data split: Commonsenseqa: A question answering challenge targeting commonsense knowledge (NAACL 2019)
- OpenBookQA: Can a suit of armor conduct electricity? a new dataset for open book question answering (EMNLP 2018)
- RAC: Retrieval-augmented Conversation Dataset for Open-domain Question Answering in Conversational Settings (EMNLP 2024, industry) [Paper]
- 2WikiMultiHop Dataset: Constructing a multihop QA dataset for comprehensive evaluation of reasoning steps (COLING 2020)
- HotpotQA (HQA): HotpotQA: A dataset for diverse, explainable multi-hop question answering (EMNLP 2018)
- MuSiQue (MQA): MuSiQue: Multihop questions via single-hop question composition (TACL 2022)
- StrategyQA (SQA): Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies (TACL 2021)
- CQW: The web as a knowledge-base for answering complex questions (NAACL 2018)
- WebQSP: The value of semantic parse labeling for knowledge base question answering (ACL 2016)
Related Readings
- Unsupervised Question Decomposition for Question Answering (EMNLP 2020) [Paper] [Code]
- The Web as a Knowledge-base for Answering Complex Questions (NAACL-HLT 2018) [Paper]
- Scalable Join Processing on Very Large RDF Graphs (SIGMOD 2009) [Paper] 🌟
Some thoughts
- Consider Combine embedding and subgraph matching for KGQA?