LLMs One-shot Learning from Human Demonstration on Inductive Reasoning Tasks
dc.contributor.advisor | Hu, Xia | en_US |
dc.contributor.advisor | Segarra, Santiago | en_US |
dc.creator | Wang, Jeff | en_US |
dc.date.accessioned | 2025-01-16T20:00:32Z | en_US |
dc.date.available | 2025-01-16T20:00:32Z | en_US |
dc.date.created | 2024-12 | en_US |
dc.date.issued | 2024-08-29 | en_US |
dc.date.submitted | December 2024 | en_US |
dc.date.updated | 2025-01-16T20:00:32Z | en_US |
dc.description.abstract | Large Language Models (LLMs) have shown impressive proficiency across a range of natural language tasks. However, recent research has highlighted their limitations in inductive reasoning, a key aspect of human cognitive ability. While chain-of-thought demonstration have improved LLM performance on various tasks, there has been limited exploration into their effectiveness for inductive reasoning tasks specifically. Since inductive reasoning rules can vary widely, providing a tailored demonstration for every question is impractical. This study aims to investigate how LLMs perform with minimal demonstration and whether they can generalize in different regimes. To support this research, we design a programmable dataset with inputs of varying lengths and complexities to test LLMs' generalizability. Our findings suggest that a single human demonstration can enable LLMs to achieve perfect in-distribution generalization performance in some tasks. LLMs sometimes exhibit comparable or even better out-of-distribution generalization performance relative to their in-distribution performance. | en_US |
dc.format.mimetype | application/pdf | en_US |
dc.identifier.uri | https://hdl.handle.net/1911/118180 | en_US |
dc.language.iso | en | en_US |
dc.subject | LLM | en_US |
dc.subject | AI | en_US |
dc.title | LLMs One-shot Learning from Human Demonstration on Inductive Reasoning Tasks | en_US |
dc.type | Thesis | en_US |
dc.type.material | Text | en_US |
thesis.degree.department | Electrical and Computer Engineering | en_US |
thesis.degree.discipline | Artificial Intelligence | en_US |
thesis.degree.grantor | Rice University | en_US |
thesis.degree.level | Masters | en_US |
thesis.degree.name | Master of Science | en_US |
Files
Original bundle
1 - 1 of 1