
Deductive Verification with Natural Programs: Case Studies


Too Long; Didn't Read

We present examples of deductive verification using Natural Program formats, showcasing how ChatGPT identifies ungrounded information and logical errors, and highlighting cases where the model struggles with premise numbers and grounded values.

Authors:

(1) Zhan Ling, UC San Diego (equal contribution);

(2) Yunhao Fang, UC San Diego (equal contribution);

(3) Xuanlin Li, UC San Diego;

(4) Zhiao Huang, UC San Diego;

(5) Mingu Lee, Qualcomm AI Research;

(6) Roland Memisevic, Qualcomm AI Research;

(7) Hao Su, UC San Diego.

Abstract and Introduction

Related work

Motivation and Problem Formulation

Deductively Verifiable Chain-of-Thought Reasoning

Experiments

Limitations

Conclusion, Acknowledgements and References


A Deductive Verification with Vicuna Models

B More Discussion on Improvements of Deductive Verification Accuracy Versus Improvements on Final Answer Correctness

C More Details on Answer Extraction

D Prompts

E More Deductive Verification Examples

E More Deductive Verification Examples

In this section, we present more deductive verification examples using our Natural Program-based approach on single reasoning steps.
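
To make the setup concrete, below is a minimal Python sketch of single-step verification. It is not the authors' released code: the prompt wording is an assumption (the actual verification prompt appears in Tab. 17), and `call_llm` is a hypothetical placeholder for whatever chat-completion function is available.

```python
# Minimal sketch (not the authors' code): ask a language model whether a single
# reasoning step, written against a numbered list of premises in a Natural
# Program-like style, is correct and fully grounded in those premises.

def verify_step(premises, step, call_llm):
    """Return True if the model judges `step` to be grounded and correct."""
    numbered = "\n".join(f"#{i + 1}. {p}" for i, p in enumerate(premises))
    prompt = (
        "Here is some information:\n"
        f"{numbered}\n\n"
        "Based only on the information above, is the following reasoning step "
        "correct and fully grounded? Answer 'yes' or 'no', then explain.\n\n"
        f"Reasoning step: {step}"
    )
    reply = call_llm(prompt)  # any chat-completion call can be plugged in here
    return reply.strip().lower().startswith("yes")
```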


In Tab. 18, we demonstrate that the language model (ChatGPT) successfully identifies not only ungrounded information but also logical errors within the given solutions.


In Tab. 19, we illustrate a case where the language model fails to detect ungrounded premise numbers, mistakenly assuming that these numbers can be derived from grounded ones.


Lastly, in Tab. 20, we illustrate a case where the language model mistakenly treats a grounded number as ungrounded.
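
To make the grounded-versus-ungrounded distinction behind these failure cases concrete, here is a purely illustrative sketch (not part of the paper's method): it flags numbers that appear in a reasoning step but in none of its premises. The function name and example values are hypothetical.

```python
import re

# Illustrative only: a crude string-level check for "ungrounded" numbers.
# The paper's verification is performed by the LLM itself; note also that a
# value correctly derived from grounded numbers via an explicit calculation
# is not truly ungrounded, and a plain string match cannot tell the difference.

def ungrounded_numbers(premises, step):
    grounded = set(re.findall(r"\d+(?:\.\d+)?", " ".join(premises)))
    used = re.findall(r"\d+(?:\.\d+)?", step)
    return sorted({n for n in used if n not in grounded})

# Hypothetical usage: 10 appears in the step but in neither premise.
premises = ["Alice has 3 apples.", "Bob gives her 2 more apples."]
step = "So Alice spends 10 dollars on apples."
print(ungrounded_numbers(premises, step))  # -> ['10']
```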


Table 12: Two-shot prompt for direct reasoning chain verification without Natural Program format.


Table 13: One-shot Natural Program prompt for reasoning chain generation on math word problems.


Table 14: One-shot Natural Program prompt for reasoning chain generation on math word problems with multiple choice.


Table 15: Two-shot Natural Program prompt for reasoning chain generation on the Date dataset.


Table 16: One-shot Natural Program prompt for reasoning chain generation on the Last Letters dataset.


Table 17: One-shot prompt for deductive verification of a single reasoning step, following our Natural Program format and step-by-step reasoning chain decomposition.


Table 18: Successful case: our deductive verification approach successfully discovers ungrounded information and reasoning mistakes.


Table 19: Failure case: our deductive verification process fails to detect ungrounded information in the reasoning step. The number 240 in the reasoning step is ungrounded, but the model states that it can be calculated from grounded numbers.


Table 20: Failure case: our deductive verification process sometimes treats grounded information as if it were ungrounded. The number 120 is provided in the given information, but the model claims that it is ungrounded.


This paper is available on arxiv under CC BY 4.0 DEED license.