paint-brush
Testing the Depths of AI Empathy: Q1 2024 Benchmarksby@anywhichway
267 reads

Testing the Depths of AI Empathy: Q1 2024 Benchmarks

by Simon Y. Blackwell6mMarch 8th, 2024
Read on Terminal Reader
Read this story w/o Javascript
tldt arrow

Too Long; Didn't Read

This article presents benchmark results for assessing the empathetic capabilities of generative AI models using psychological and purpose-built measures. The tests include TAS-20, EQ-60, SQ-R, and IRI. The measure AEQ (Applied Empathy Quotient) was introduced. Most raw LLMs struggle to connect empathetically with users due to their balanced empathetic and systemized thinking capabilities. The closed model Willow demonstrates the highest empathetic capacity, while ChatGPT does not stand out significantly among other LLMs. Claude v3 Opus showed a decline in empathetic ability compared to its previous version. More specialized tests need to be developed.
featured image - Testing the Depths of AI Empathy: Q1 2024 Benchmarks
Simon Y. Blackwell HackerNoon profile picture
Simon Y. Blackwell

Simon Y. Blackwell

@anywhichway

Working in the clouds around Seattle on open source projects. Sailing when it's clear.

0-item
1-item
2-item

STORY’S CREDIBILITY

Original Reporting

Original Reporting

This story contains new, firsthand information uncovered by the writer.

Vested Interest

Vested Interest

This writer has a vested interest be it monetary, business, or otherwise, with 1 or more of the products or companies mentioned within.

Opinion piece / Thought Leadership

Opinion piece / Thought Leadership

The is an opinion piece based on the author’s POV and does not necessarily reflect the views of HackerNoon.

L O A D I N G
. . . comments & more!

About Author

Simon Y. Blackwell HackerNoon profile picture
Simon Y. Blackwell@anywhichway
Working in the clouds around Seattle on open source projects. Sailing when it's clear.

TOPICS

THIS ARTICLE WAS FEATURED IN...

Permanent on Arweave
Read on Terminal Reader
Read this story in a terminal
 Terminal
Read this story w/o Javascript
Read this story w/o Javascript
 Lite