Towards Efficient Knowledge Graph Generation From Textbooks: A Dual Framework Approach
dc.contributor.advisor | Baraniuk, Richard G | en_US |
dc.creator | Hatchett, Johaun | en_US |
dc.date.accessioned | 2025-05-16T14:40:20Z | en_US |
dc.date.available | 2025-05-16T14:40:20Z | en_US |
dc.date.created | 2025-05 | en_US |
dc.date.issued | 2025-02-05 | en_US |
dc.date.submitted | May 2025 | en_US |
dc.date.updated | 2025-05-16T14:40:21Z | en_US |
dc.description.abstract | The recent proliferation of LLMs necessitates a strategy for addressing these models' deleterious shortcomings: hallucination, and lack of explainability. Knowledge graphs (KGs) have gained attention as a potential solution to these problems, as they can serve as a traceable, factual database for LLMs; however, constructing high-quality KGs efficiently remains a challenge. To address these challenges, this thesis proposes Words2Wisdom, a logic-informed, LLM-based framework for generating quality KGs from textbooks. Words2Wisdom creates expressive KGs by leveraging the structure of propositional logic, and ensures accurate fact representation, demonstrating knowledge validity (precision) greater than 95% when using the GPT-4o model in a few-shot environment. Our results suggest targeted fine-tuning and model specialization can further enhance KG quality. Furthermore, this thesis examines whether LLMs are able to assess the quality of KGs. We introduce Libra, a framework establishing a novel KG evaluation protocol for validating KGs against textbook sources. Preliminary results show high observed agreement between Libra and human experts, suggesting that KG construction and evaluation and can indeed be effectively automated, paving the way for future research on the role of LLMs in hallucination mitigation. | en_US |
dc.format.mimetype | application/pdf | en_US |
dc.identifier.uri | https://hdl.handle.net/1911/118353 | en_US |
dc.language.iso | en | en_US |
dc.subject | knowledge graphs | en_US |
dc.subject | large language models | en_US |
dc.title | Towards Efficient Knowledge Graph Generation From Textbooks: A Dual Framework Approach | en_US |
dc.type | Thesis | en_US |
dc.type.material | Text | en_US |
thesis.degree.department | Applied Physics | en_US |
thesis.degree.discipline | Applied Physics/Electrical Eng | en_US |
thesis.degree.grantor | Rice University | en_US |
thesis.degree.level | Masters | en_US |
thesis.degree.name | Master of Science | en_US |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- HATCHETT-DOCUMENT-2025.pdf
- Size:
- 768.72 KB
- Format:
- Adobe Portable Document Format